All Products
Search
Document Center

Data Transmission Service:Create and use a DTS RAGFlow knowledge base

Last Updated:Mar 28, 2026

Data Transmission Service (DTS) lets you deploy a managed RAGFlow knowledge base without setting up or maintaining your own infrastructure. DTS handles the orchestration between your vector database, an OSS bucket for raw data, and the RAGFlow application layer.

Prerequisites

Before you begin, make sure you have:

  • A supported vector database instance that meets the requirements below

  • An OSS bucket in the same region as the vector database, with Storage Class set to Standard and Storage Redundancy set to Zone-redundant Storage (Recommended)

Supported vector databases

DatabaseRequirements
AnalyticDB for PostgreSQL instanceDatabase engine version: 7.0 Standard Edition. Kernel version: upgraded to 7.2.1.2 or later. Vector search engine optimization: enabled.
PolarSearch clusterA PolarDB for MySQL cluster with the PolarSearch feature enabled.
Lindorm instanceEngine Type must include Search Engine and Vector Engine.
PolarDB for PostgreSQL clusterThe PGVector (vector search) plugin installed.

Supported regions

China (Hangzhou), China (Shanghai), China (Beijing), China (Shenzhen), China (Hong Kong), Singapore, and Indonesia (Jakarta).

Usage notes

  • Once you enable a public endpoint for a RAGFlow knowledge base, you cannot disable it.

  • A RAGFlow account is valid only for the knowledge base it was registered with.

Billing

Billing follows the pay-as-you-go model. For details, see AI Data Preparation metering methods.

Create a RAGFlow knowledge base

  1. Go to the RAGFlow knowledge base list page.

    1. Log on to the Data Transmission Service (DTS) console.

    2. In the left navigation pane, click Data preparation.

    3. In the upper-left corner, select the target region.

    4. Click the RAGFlow Knowledge Base tab.

  2. Click Create knowledge base.

  3. In the Deployment Scope section, enter an Instance Name.

  4. In the Network and Zone section, select the VPC, Zone and vSwitch, and Secondary Zone and vSwitch.

    The secondary zone cannot be the same as the primary zone.
  5. In the RAGFlow knowledge base configuration section, set the Number of knowledge base services.

    The default Configuration Plan is Default. With the Default plan, you only need to set the number of services — the Knowledge base service specifications are fixed at 4 vCPU, 16 GB Memory.
  6. In the Vector database configuration section, set Engine to your vector database type and fill in the required fields.

    Engine: ADB PostgreSQL

    Set Engine to ADB PostgreSQL. Select the target AnalyticDB for PostgreSQL instance as Database. Enter the Database Name, Database schema name, Database account, and Password.

    Engine: PolarSearch

    Set Engine to PolarSearch. Select the PolarDB for MySQL cluster with PolarSearch enabled as Database. Enter the Database account and Password.

    Engine: PolarDB PostgreSQL

    Set Engine to PolarDB PostgreSQL. Select the target PolarDB for PostgreSQL cluster as Database. Enter the Database Name, Database schema name, Database account, and Password.

    Engine: Lindorm

    Set Engine to Lindorm. Select the target Lindorm instance as Database. Enter the Database account and Password.

    Vector database is fixed to Import from existing instance. For ADB PostgreSQL and PolarDB PostgreSQL, Database schema name defaults to public and cannot be changed.
  7. In the OSS configuration section, select the OSS Bucket and enter the Path where data will be stored.

  8. Click Buy Now.

  9. Return to the RAGFlow knowledge base list page and wait until the Running Status changes to Running.

    Click the refresh icon in the upper-right corner to update the list.

Configure an IP whitelist

Before accessing the knowledge base, add the client's IP address to the whitelist.

  1. In the Actions column of the target knowledge base, click Configure Whitelist.

  2. In the Configure Whitelist panel, add the appropriate IP addresses or CIDR blocks. Separate multiple entries with commas (,). To find your public IP address, run curl ipinfo.io/ip or curl ifconfig.me.

    Access methodIP whitelist to add
    Internal network (client and knowledge base in the same VPC)Private IP address or CIDR block of the client
    Public network (client on a local server)Public IP address or CIDR block of the client
  3. Click Set.

Log on to RAGFlow

  1. In the Actions column of the target knowledge base, click Manage.

    Alternatively, click Log on to knowledge base in the Actions column and select internal or public network access.
  2. In the Endpoint section, click Log on with Public Endpoint or Log on with Internal Endpoint.

    Accessing over the public network requires a public endpoint. See Enable a public endpoint.
  3. On the RAGFlow login page, enter your email address and password, then click Log on.

  4. Manage your knowledge base from the RAGFlow interface.

    For RAGFlow-specific operations, see the official RAGFlow documentation.

    image

(Optional) Configure network access for external models

By default, RAGFlow cannot reach external networks. If you want to add a model provider in RAGFlow, configure one of the following options.

Private network: Alibaba Cloud Model Studio

Use PrivateLink to establish a private connection between the VPC and Alibaba Cloud Model Studio. For setup instructions, see Access Alibaba Cloud Model Studio models or application APIs over a private network.

Public network: NAT Gateway

Configure a NAT Gateway for the VPC of the vector database that RAGFlow uses. For setup instructions, see Public NAT Gateway.

Appendix

Enable a public endpoint

Important

You cannot disable a public endpoint after it is enabled.

  1. In the Actions column of the target knowledge base, click Manage.

  2. In the Endpoint section, click Enable Public Endpoint.

  3. In the confirmation dialog, click OK.

  4. Wait until the Running Status in the Basic Information section changes to Running.

Register a RAGFlow account

  1. Go to the RAGFlow login page for the target knowledge base.

  2. Click Register.

    screenshot_2025-06-26_14-02-52

  3. Enter your email address, name, and password.

    image

  4. Click Continue. A image confirmation message appears at the top of the page.

A RAGFlow account is only valid for the knowledge base it was registered with.

Configuration parameters

SectionParameterDescription
GeneralBilling MethodOnly Pay-as-you-go is supported.
RegionThe region where the knowledge base resides.
Deployment ScopeDeployment ScopeFixed to RAGFlow knowledge base.
Instance NameA display name for the knowledge base.
Permission CheckSLR AuthorizationMake sure the AliyunServiceRoleForADBPG service-linked role exists for AnalyticDB for PostgreSQL.
Network and ZoneNetwork TypeFixed to VPC.
VPCThe VPC where the knowledge base resides.
Zone and vSwitchThe primary zone and vSwitch.
Deployment SolutionFixed to Multi-zone deployment.
Secondary Zone and vSwitchThe secondary zone and vSwitch. Must differ from the primary zone.
RAGFlow knowledge base configurationConfiguration PlanDefault or Custom. With Default, only the number of services is configurable.
Knowledge base basic service specificationsFixed to 4 vCPU, 16 GB Memory.
Number of knowledge base basic servicesDefault: 4. Affects cost.
Knowledge base data preparation service specificationsFixed to 4 vCPU, 16 GB Memory.
Number of knowledge base data preparation servicesDefault: 2. Affects cost.
Vector database configurationVector databaseFixed to Import from existing instance.
EngineThe vector database type: ADB PostgreSQL, PolarSearch, PolarDB PostgreSQL, or Lindorm.
DatabaseThe vector database instance.
Database NameAvailable for ADB PostgreSQL and PolarDB PostgreSQL only.
Database schema nameAvailable for ADB PostgreSQL and PolarDB PostgreSQL only. Fixed to public.
Database account / PasswordCredentials for the vector database instance.
OSS configurationOSS BucketThe bucket that receives the data.
PathThe path in the bucket where data is stored.