Data Transmission Service (DTS) lets you deploy a managed RAGFlow knowledge base without setting up or maintaining your own infrastructure. DTS handles the orchestration between your vector database, an OSS bucket for raw data, and the RAGFlow application layer.
Prerequisites
Before you begin, make sure you have:
A supported vector database instance that meets the requirements below
An OSS bucket in the same region as the vector database, with Storage Class set to Standard and Storage Redundancy set to Zone-redundant Storage (Recommended)
Supported vector databases
| Database | Requirements |
|---|---|
| AnalyticDB for PostgreSQL instance | Database engine version: 7.0 Standard Edition. Kernel version: upgraded to 7.2.1.2 or later. Vector search engine optimization: enabled. |
| PolarSearch cluster | A PolarDB for MySQL cluster with the PolarSearch feature enabled. |
| Lindorm instance | Engine Type must include Search Engine and Vector Engine. |
| PolarDB for PostgreSQL cluster | The PGVector (vector search) plugin installed. |
Supported regions
China (Hangzhou), China (Shanghai), China (Beijing), China (Shenzhen), China (Hong Kong), Singapore, and Indonesia (Jakarta).
Usage notes
Once you enable a public endpoint for a RAGFlow knowledge base, you cannot disable it.
A RAGFlow account is valid only for the knowledge base it was registered with.
Billing
Billing follows the pay-as-you-go model. For details, see AI Data Preparation metering methods.
Create a RAGFlow knowledge base
Go to the RAGFlow knowledge base list page.
Log on to the Data Transmission Service (DTS) console.
In the left navigation pane, click Data preparation.
In the upper-left corner, select the target region.
Click the RAGFlow Knowledge Base tab.
Click Create knowledge base.
In the Deployment Scope section, enter an Instance Name.
In the Network and Zone section, select the VPC, Zone and vSwitch, and Secondary Zone and vSwitch.
The secondary zone cannot be the same as the primary zone.
In the RAGFlow knowledge base configuration section, set the Number of knowledge base services.
The default Configuration Plan is Default. With the Default plan, you only need to set the number of services — the Knowledge base service specifications are fixed at 4 vCPU, 16 GB Memory.
In the Vector database configuration section, set Engine to your vector database type and fill in the required fields.
Engine: ADB PostgreSQL
Set Engine to ADB PostgreSQL. Select the target AnalyticDB for PostgreSQL instance as Database. Enter the Database Name, Database schema name, Database account, and Password.
Engine: PolarSearch
Set Engine to PolarSearch. Select the PolarDB for MySQL cluster with PolarSearch enabled as Database. Enter the Database account and Password.
Engine: PolarDB PostgreSQL
Set Engine to PolarDB PostgreSQL. Select the target PolarDB for PostgreSQL cluster as Database. Enter the Database Name, Database schema name, Database account, and Password.
Engine: Lindorm
Set Engine to Lindorm. Select the target Lindorm instance as Database. Enter the Database account and Password.
Vector database is fixed to Import from existing instance. For ADB PostgreSQL and PolarDB PostgreSQL, Database schema name defaults to public and cannot be changed.
In the OSS configuration section, select the OSS Bucket and enter the Path where data will be stored.
Click Buy Now.
Return to the RAGFlow knowledge base list page and wait until the Running Status changes to Running.
Click the refresh icon in the upper-right corner to update the list.
Configure an IP whitelist
Before accessing the knowledge base, add the client's IP address to the whitelist.
In the Actions column of the target knowledge base, click Configure Whitelist.
In the Configure Whitelist panel, add the appropriate IP addresses or CIDR blocks. Separate multiple entries with commas (,). To find your public IP address, run
curl ipinfo.io/iporcurl ifconfig.me.Access method IP whitelist to add Internal network (client and knowledge base in the same VPC) Private IP address or CIDR block of the client Public network (client on a local server) Public IP address or CIDR block of the client Click Set.
Log on to RAGFlow
In the Actions column of the target knowledge base, click Manage.
Alternatively, click Log on to knowledge base in the Actions column and select internal or public network access.
In the Endpoint section, click Log on with Public Endpoint or Log on with Internal Endpoint.
Accessing over the public network requires a public endpoint. See Enable a public endpoint.
On the RAGFlow login page, enter your email address and password, then click Log on.
Manage your knowledge base from the RAGFlow interface.
For RAGFlow-specific operations, see the official RAGFlow documentation.

(Optional) Configure network access for external models
By default, RAGFlow cannot reach external networks. If you want to add a model provider in RAGFlow, configure one of the following options.
Private network: Alibaba Cloud Model Studio
Use PrivateLink to establish a private connection between the VPC and Alibaba Cloud Model Studio. For setup instructions, see Access Alibaba Cloud Model Studio models or application APIs over a private network.
Public network: NAT Gateway
Configure a NAT Gateway for the VPC of the vector database that RAGFlow uses. For setup instructions, see Public NAT Gateway.
Appendix
Enable a public endpoint
You cannot disable a public endpoint after it is enabled.
In the Actions column of the target knowledge base, click Manage.
In the Endpoint section, click Enable Public Endpoint.
In the confirmation dialog, click OK.
Wait until the Running Status in the Basic Information section changes to Running.
Register a RAGFlow account
Go to the RAGFlow login page for the target knowledge base.
Click Register.

Enter your email address, name, and password.

Click Continue. A
confirmation message appears at the top of the page.
A RAGFlow account is only valid for the knowledge base it was registered with.
Configuration parameters
| Section | Parameter | Description | |
|---|---|---|---|
| General | Billing Method | Only Pay-as-you-go is supported. | |
| Region | The region where the knowledge base resides. | ||
| Deployment Scope | Deployment Scope | Fixed to RAGFlow knowledge base. | |
| Instance Name | A display name for the knowledge base. | ||
| Permission Check | SLR Authorization | Make sure the AliyunServiceRoleForADBPG service-linked role exists for AnalyticDB for PostgreSQL. | |
| Network and Zone | Network Type | Fixed to VPC. | |
| VPC | The VPC where the knowledge base resides. | ||
| Zone and vSwitch | The primary zone and vSwitch. | ||
| Deployment Solution | Fixed to Multi-zone deployment. | ||
| Secondary Zone and vSwitch | The secondary zone and vSwitch. Must differ from the primary zone. | ||
| RAGFlow knowledge base configuration | Configuration Plan | Default or Custom. With Default, only the number of services is configurable. | |
| Knowledge base basic service specifications | Fixed to 4 vCPU, 16 GB Memory. | ||
| Number of knowledge base basic services | Default: 4. Affects cost. | ||
| Knowledge base data preparation service specifications | Fixed to 4 vCPU, 16 GB Memory. | ||
| Number of knowledge base data preparation services | Default: 2. Affects cost. | ||
| Vector database configuration | Vector database | Fixed to Import from existing instance. | |
| Engine | The vector database type: ADB PostgreSQL, PolarSearch, PolarDB PostgreSQL, or Lindorm. | ||
| Database | The vector database instance. | ||
| Database Name | Available for ADB PostgreSQL and PolarDB PostgreSQL only. | ||
| Database schema name | Available for ADB PostgreSQL and PolarDB PostgreSQL only. Fixed to public. | ||
| Database account / Password | Credentials for the vector database instance. | ||
| OSS configuration | OSS Bucket | The bucket that receives the data. | |
| Path | The path in the bucket where data is stored. | ||