This guide will walk you through the process of creating a Retrieval-Augmented Generation (RAG) service using Compute Nest with Large Language Models (LLM) on Alibaba Cloud's Platform for AI – Elastic Algorithm Service (PAI-EAS), AnalyticDB for PostgreSQL as the vector store, Gradio for the web UI, and Langchain for orchestration.
Ensure you have an Alibaba Cloud account. Sign up here if you still need to do so.
Find the service GenAI-LLM-RAG in Alibaba Cloud->Console->Compute Nest with your Alibaba Cloud credentials. And press the Offical Use.
Set up the necessary parameters of the instance:
Deploy a pre-trained LLM on PAI-EAS:
1. The default username is admin. You could choose another username.
2. You need to create a strong password, for instance.
3. As VPC can be chosen from existing VPC. To create a new VPC, you can activate the slider and put related information.
4. After, press Next: Confirm Order.
Create a web UI with Gradio:
After checking all related information and accepting the Terms of Service by pressing Create Now, the service can be deployed. Need to wait for a while to finish all the steps.
Users can ask questions through the Gradio web UI, and the LLM will process and provide answers.
Users can upload documents converted into vector store and save them in AnalyticDB for PostgreSQL.
Authorized users can access ECS to make changes or updates to the service.
For more detailed information, check the following documentation:
Additional tutorials:
By following this guide, you should be able to set up a functional RAG service on Compute Nest, leveraging the powerful features of PAI-EAS, AnalyticDB, Gradio, and Langchain.
Streamlined Deployment and Integration of Large Language Models with PAI-EAS
Deploy Your Own AI Chat Buddy - The Qwen Chat Model Deployment with Hugging Face Guide
Regional Content Hub - February 1, 2024
Alibaba Cloud Community - January 26, 2024
Alibaba Cloud Community - August 23, 2023
Farruh - September 22, 2023
Farruh - August 11, 2023
Farruh - August 13, 2023
A high-quality personalized recommendation service for your applications.
Learn MoreThis solution provides you with Artificial Intelligence services and allows you to build AI-powered, human-like, conversational, multilingual chatbots over omnichannel to quickly respond to your customers 24/7.
Learn MoreA one-stop service platform for whole-process generative AI engineering and application development, based on Tongyi Qianwen (Qwen) and other popular models
Learn MoreLog into an artificial intelligence for IT operations (AIOps) environment with an intelligent, all-in-one, and out-of-the-box log management solution
Learn MoreMore Posts by Farruh