Use the rds_embedding extension to convert text into vectors - ApsaraDB RDS

The rds_embedding extension of ApsaraDB RDS for PostgreSQL allows you to convert text in your ApsaraDB RDS for PostgreSQL instance into vectors. The extension provides custom model configuration and model invocation capabilities to facilitate the conversion and meet specific data processing requirements.

Background information

Embedding is a technique that translates high-dimensional data into a low-dimensional space. In machine learning and natural language processing (NLP), embedding is a common method that is used to represent sparse symbols or objects as continuous vectors.

During embedding, the vectors are obtained based on the model that is referenced. ApsaraDB RDS for PostgreSQL allows you to use the rds_embedding extension to convert text in your RDS instance into vectors based on an external model that is referenced. ApsaraDB RDS for PostgreSQL also allows you to use a vector similarity operator to calculate the similarities between the text in the RDS instance and the specified text in the referenced model. This helps meet your business requirements in various scenarios.

Prerequisites

The major engine version of the RDS instance is supported by the extension. For more information, see Extensions supported by ApsaraDB RDS for PostgreSQL.
The minor engine version of the RDS instance is updated. If the major engine version of the RDS instance meets the requirements but the extension is still not supported, you can update the minor engine version. For more information, see Update the minor engine version.
The API key for the model that is used in this topic is obtained, and the region in which the RDS instance resides supports access to OpenAI. In this topic, the embedding model of OpenAI and the Singapore region are used. For more information, see Embeddings.

The RDS instance is connected over the Internet. By default, you cannot connect to the RDS instance over the Internet. You must create a NAT gateway for the virtual private cloud (VPC) in which the RDS instance resides. This way, you can connect to the RDS instance over the Internet and the RDS instance can access external models. For more information about NAT gateways, see Use the SNAT feature of an Internet NAT gateway to access the Internet.

Create a NAT gateway

Create an Internet NAT gateway.

Log on to the NAT Gateway console.
On the Internet NAT Gateway page, click Create NAT Gateway.
Optional. In the Create Service-Linked Role section of the Internet NAT Gateway page, click Create Service-Linked Role to create a service-linked role. If this is the first time you create an Internet NAT gateway, this step is required. After the service-linked role is created, you can create NAT gateways.

On the Internet NAT Gateway page, configure the parameters and click Buy Now.

Note The following table describes only key parameters. For more information about all parameters, see Use the SNAT feature of an Internet NAT gateway to access the Internet.

Parameter	Description
Region	Select the region in which you want to create the Internet NAT gateway. The region must be the same as the region of your RDS instance.
VPC	Select the VPC to which the Internet NAT gateway belongs. The VPC must be the same as the VPC of your RDS instance. You can go to the Database Connection page of the ApsaraDB RDS console to view the VPC of your RDS instance.
Associate vSwitch	Select the vSwitch to which the Internet NAT gateway belongs. The vSwitch must be the same as the vSwitch of your RDS instance. You can go to the Database Connection page of the ApsaraDB RDS console to view the vSwitch of your RDS instance.
Access Mode	In this example, Configure Later is selected.

On the Confirm page, confirm the configuration, read and select Terms of Service, and then click Confirm.
You can find the new Internet NAT gateway on the Internet NAT Gateway page.

Associate an EIP with the Internet NAT gateway.
1. On the Internet NAT Gateway page, find the new Internet NAT gateway and click its ID to go to the Basic Information tab.
2. On the Associated EIP tab, click Bind Elastic IP Address.
3. In the Associate EIP dialog box, select Purchase and Associate EIP.
4. Click OK.
  After you associate an EIP with the Internet NAT gateway, the EIP is displayed on the Associated EIP tab.

Create an SNAT entry.

On the Internet NAT Gateway page, find the new Internet NAT gateway and click its ID to go to the Basic Information tab.
On the SNAT management tab, click Create SNAT Entry.

On the Create SNAT Entry page, configure the parameters and click Confirm.

Parameter	Description
SNAT Entry	Specify whether to create an SNAT entry for a VPC, a vSwitch, an ECS instance, or a custom CIDR block. In this example, Specify vSwitch is selected. This option specifies that only RDS instances that are attached to a specified vSwitch can access the Internet by using a specified public IP address.
Select VSwitch	Select the vSwitch of your RDS instance from the drop-down list.
Select Public IP Address	Select one or more public IP addresses to access the Internet. In this example, Use One IP Address is selected, and an EIP is selected from the drop-down list.

After the SNAT entry is created, you can view the SNAT entry in the Used in SNAT Entry section. Configured SNAT entry

Enable or disable the extension

Note

You must use a privileged account to execute the statements in this section.

Enable the extension.
Before you enable the rds_embedding extension, you must enable the vector extension. The vector extension supports the required vector data types and basic vector data operations, such as calculations of the distance and similarities between vectors. The rds_embedding extension only translates high-dimensional text into vectors.
```
CREATE EXTENSION vector;
CREATE EXTENSION rds_embedding;
```

Disable the extension.

DROP EXTENSION rds_embedding;
DROP EXTENSION vector;

Example

Create a test table named test.

CREATE TABLE test(info text, vec vector(1536) NOT NULL);

Add a model.
```
SELECT rds_embedding.add_model('text-embedding-ada-002','https://api.openai.com/v1/embeddings','Authorization: Bearer sk-****P','{"input":{"texts":["%s"]},"model":"text-embedding-v1"}','->"data"->0->>"embedding"');
```
Note
- The model that is used in this topic is an OpenAI embedding model. The API key for the model that is used in this topic is obtained, and the region in which the RDS instance resides supports access to OpenAI. In this topic, the embedding model of OpenAI and the Singapore region are used. For more information, see Embeddings.
- For more information about the function that is used in this step, see rds_embedding.add_model().

Insert text and the required vector data.

INSERT INTO test SELECT '风急天高猿啸哀', rds_embedding.get_embedding_by_model('text-embedding-ada-002', 'sk-****P', '风急天高猿啸哀')::real[];
INSERT INTO test SELECT '渚清沙白鸟飞回', rds_embedding.get_embedding_by_model('text-embedding-ada-002', 'sk-****P', '渚清沙白鸟飞回')::real[];
INSERT INTO test SELECT '无边落木萧萧下', rds_embedding.get_embedding_by_model('text-embedding-ada-002', 'sk-****P', '无边落木萧萧下')::real[];
INSERT INTO test SELECT '不尽长江滚滚来', rds_embedding.get_embedding_by_model('text-embedding-ada-002', 'sk-****P', '不尽长江滚滚来')::real[];

Note

For more information about the function that is used in this step, see rds_embedding.get_embedding_by_model().

Calculate the similarities between the text 不尽长江滚滚来 and the vectors of each piece of text in the test table.

SELECT info, vec <=> rds_embedding.get_embedding_by_model('text-embedding-ada-002', 'sk-****P', '不尽长江滚滚来')::real[]::vector AS distance FROM test ORDER BY vec <=> rds_embedding.get_embedding_by_model('text-embedding-ada-002', 'sk-****P', '不尽长江滚滚来')::real[]::vector;

Sample output:

      info      |      distance
----------------+--------------------
 不尽长江滚滚来 |                  0
 无边落木萧萧下 | 0.6855717919553399
 风急天高猿啸哀 | 0.7423166439170339
 渚清沙白鸟飞回 | 0.7926204045363088
(4 rows)

References

Obtain embedding vectors from text by using HTTP requests

Run the curl command to send a POST request to the URL of the embedding model to obtain the embedding vectors of the required text.

curl https://api.openai.com/v1/embeddings \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $OPENAI_API_KEY" \
  -d '{
    "input": "Your text string goes here",
    "model": "text-embedding-ada-002"
  }'

The following table describes the parameters that are configured for the POST request.

Parameter	Example	Description
location	https://api.openai.com/v1/embeddings	The model URL. In this example, the value is the URL of the embedding model over HTTP and described in OpenAPI official documentation.
-H	Authorization: Bearer sk-9****6 Content-Type: application/json	Authorization: the authorization content in the POST request. Format: `Authorization: Bearer <OpenAI API key>`. Note For more information about how to obtain the API key, visit the OpenAI official website. Content-Type: the request type. The value is fixed as `application/json`.
-d	`"input": "Your text string goes here"` `"model": "text-embedding-ada-002"`	The body of the POST request. `input`: the text content `model`: the name of the model that is invoked For more information about the name of the embedding model, see OpenAI official documentation.

Functions supported by the rds_embedding extension

You can run the psql command \dx+ rds_embedding to query all functions that are supported by the extension.

             Objects in extension "rds_embedding"
                      Object description
---------------------------------------------------------------
 function rds_embedding.add_model(text,text,text,text,text)
 function rds_embedding.del_model(text)
 function rds_embedding.get_embedding_by_model(text,text,text)
 function rds_embedding.get_response_by_model(text,text,text)
 function rds_embedding.show_models()
 function rds_embedding.update_model(text,text,text,text,text)
 schema rds_embedding
 table rds_embedding.models
(8 rows)

Function description:

rds_embedding.add_model(): adds a model to the rds_embedding.models table.

If you want to call this function, you must configure the following parameters.

Parameter	Type	Example	Description
mname	text	text-embedding-ada-002	The model name.
murl	text	https://api.openai.com/v1/embeddings	The model URL. In this example, the value is the URL of the embedding model that is obtained by using the HTTP request and described in the OpenAPI official documentation.
mauth_header_template	text	Authorization: Bearer sk-9****6	The authorization content of the POST request. Format: `Authorization: Bearer <OpenAI API key>`. Note For more information about how to obtain the API key, see the OpenAI official website.
mbody_template	text	`{ "input":{ "texts":["%s"] }, "model":"text-embedding-ada-002" }`	The body of the POST request. `input.texts`: the text content. This content is represented by `%s`. You can replace %s with the actual text. `model`: the name of the model that is invoked. For more information about the name of the embedding model, see OpenAI official documentation.
membedding_path	text	`->"data"->0->>"embedding"`	The path to the embedding vectors in the response. The example expression is used to extract embedding vectors from the JSON-formatted response. Important Before you use this expression, make sure that the JSON-formatted response contains the path structure that complies with the expression. Otherwise, the extraction may fail or an error may occur. For more information about the JSON-formatted response, see OpenAI official documentation.

rds_embedding.del_model(): removes a model from the rds_embedding.models table.
If you want to call this function, you must configure the following parameters.
Parameter
Type
Example
Description
mname
text
text-embedding-ada-002
The model name.

rds_embedding.get_embedding_by_model(): obtains the vectors of specified text.

If you want to call this function, you must configure the following parameters.

Parameter	Type	Example	Description
mname	text	text-embedding-ada-002	The model name.
api-key	text	sk-9****6	The API key. For more information about how to obtain the API key, see the OpenAI official website.
texts	text	风急天高猿啸哀	The text from which you want to obtain vectors.

rds_embedding.show_models: displays the model in the rds_embedding.models table.
rds_embedding.update_model: updates the model in the rds_embedding.models table.
If you call this function, the parameter settings are the same as those of rds_embedding.add_model.

Note

rds_embedding.get_response_by_model is unavailable now.