All Products
Search
Document Center

ApsaraDB RDS:Object storage paths

Last Updated:Sep 22, 2023

This topic describes the paths of vector data that is stored in Alibaba Cloud Object Storage Service (OSS) buckets, self-managed Multi-Cloud Object Storage (MinIO) buckets, and Hadoop Distributed File System (HDFS) files.

Paths of vector data in OSS buckets

The path of vector data in an OSS bucket is in the following format:

oss://<access_id>:<secrect_key>@[<Endpoint>]/path_to/file
Note

The following list describes the parameters:

  • access_id and secrect_key specify the AccessKey ID and AccessKey secret of the AccessKey pair that is used to access your Object Storage Service (OSS) bucket. For more information, see Obtain an AccessKey pair.

  • Endpoint specifies the domain name of the OSS bucket. To ensure data accessibility, make sure that your RDS instance and your OSS bucket reside in the same region and are connected by using the internal OSS endpoint. For more information, see OSS endpoints.

  • /path_to/file specifies the directory of the file in the OSS bucket.

Paths of vector data in MinIO buckets

The path of vector data in a MinIO bucket is in the following format. For more information about MinIO buckets, see Multi-Cloud Object Storage.

mio://<id>:<key>@<host>[:<port>]/path_to/file

If you do not specify the port parameter, the HTTPS port is used. To import data at high performance, make sure that your ApsaraDB RDS instance and the self-managed MinIO bucket reside in the same region and can communicate over an internal network.

Paths of vector data in HDFS files

The path of vector data in an HDFS file is in the following format:

HDFS://<user_name>@<host>[:<port>]/path_to/file
  • If you do not specify the user_name parameter, the default value hadoop is used.

  • The host and port parameters specify the endpoint and port of Remote Procedure Call (RPC) in the HDFS configuration file.

  • To import data at high performance, make sure that your ApsaraDB RDS instance and the HDFS cluster reside in the same VPC and communicate over an internal network.