AnalyticDB:CreateModelService

Last Updated: Nov 05, 2025

Creates a model service.

Operation description

Before you call this operation, make sure that you fully understand the billing methods and pricing of AnalyticDB for PostgreSQL.

Debugging

You can call this operation directly in OpenAPI Explorer, which saves you from calculating signatures. After the call succeeds, OpenAPI Explorer can automatically generate SDK code samples.

Authorization information

The following table shows the authorization information for this API operation. You can use this information in the Action element of a policy to grant a RAM user or RAM role the permissions to call this operation. The table columns are described as follows:

  • Operation: the value that you can use in the Action element to specify the operation on a resource.
  • Access level: the access level of each operation. The levels are read, write, and list.
  • Resource type: the type of the resource on which you can authorize the RAM user or the RAM role to perform the operation. Take note of the following items:
    • Required resource types are marked with an asterisk (*).
    • If the permissions cannot be granted at the resource level, All Resources is used in the Resource type column of the operation.
  • Condition key: the condition key that is defined by the cloud service.
  • Associated operation: other operations that the RAM user or the RAM role must also have permissions to perform in order to complete this operation.
| Operation | Access level | Resource type | Condition key | Associated operation |
| --- | --- | --- | --- | --- |
| gpdb:CreateModelService | create | *DBInstance acs:gpdb::{#accountId}:dbinstance/{#DBInstanceId} | none | none |
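
The authorization row above can be turned into a policy document. The following is a minimal sketch, not an official sample: the function name is hypothetical, the account and instance IDs are placeholders, and the Version value and statement layout are assumed to follow the general RAM policy structure. The resource string is copied from the template in the table, including its empty region segment.

```python
import json


def build_create_model_service_policy(account_id: str, db_instance_id: str) -> dict:
    """Return a policy document that allows gpdb:CreateModelService on one instance.

    Hypothetical helper for illustration; verify the policy layout against
    the RAM documentation before using it.
    """
    # Resource string follows the template from the authorization table:
    # acs:gpdb::{#accountId}:dbinstance/{#DBInstanceId}
    resource = f"acs:gpdb::{account_id}:dbinstance/{db_instance_id}"
    return {
        "Version": "1",
        "Statement": [
            {
                "Effect": "Allow",
                "Action": "gpdb:CreateModelService",
                "Resource": resource,
            }
        ],
    }


if __name__ == "__main__":
    # Placeholder account ID and instance ID, for illustration only.
    policy = build_create_model_service_policy("1234567890123456", "gp-xxxxxxxxx")
    print(json.dumps(policy, indent=2))
```

Scoping the Resource element to a single instance keeps the grant narrow. Whether a wildcard is accepted in place of the instance ID is not stated on this page.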

Request parameters

| Parameter | Type | Required | Description | Example |
| --- | --- | --- | --- | --- |
| DBInstanceId | string | Yes | The cluster ID. Note: You can call the DescribeDBInstances operation to query the IDs of all AnalyticDB for PostgreSQL instances in the specified region. | gp-xxxxxxxxx |
| ModelName | string | Yes | The name of the model. | Qwen3-Embedding-8B |
| Description | string | No | The description. | test |
| SecurityIPList | string | No | The IP address whitelist. 127.0.0.1 indicates that access from any external IP address is prohibited. You can call the ModifySecurityIps operation to modify the IP address whitelist after the instance is created. | 127.0.0.1 |
| AiNodes | array | Yes | A list of AI nodes for model deployment. | |
| (array element) | string | Yes | The name of the AI node. | ai-xxxxxx |
| ModelParams | object | No | Model parameters. This parameter is not yet supported. | |
| ResourceGroupId | string | No | The ID of the resource group to which the instance belongs. For more information about how to get the ID of a resource group, see View the basic information of a resource group. | rg-bp67acfmxazb4p**** |
| ClientToken | string | No | The client token that is used to ensure the idempotence of the request. For more information, see How do I ensure the idempotence? | 0c593ea1-3bea-11e9-b96b-88********** |
| Replicas | integer | No | The number of model service replicas. | 1 |
| InferenceEngine | string | No | The inference engine. Only vllm is supported. | vllm |
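
To illustrate how the request parameters fit together, the following sketch assembles them into a dictionary and checks the constraints stated in the table: the three required parameters and the vllm-only inference engine. The function name is hypothetical, and the sketch does not sign or send the request. Generating a UUID for ClientToken is an assumption about a reasonable token format, not a documented requirement. ModelParams is left out because it is not yet supported.

```python
import uuid
from typing import Any, Dict, List, Optional

# Per the parameter table, only vllm is supported.
SUPPORTED_INFERENCE_ENGINES = {"vllm"}


def build_create_model_service_params(
    db_instance_id: str,
    model_name: str,
    ai_nodes: List[str],
    description: Optional[str] = None,
    security_ip_list: Optional[str] = None,
    resource_group_id: Optional[str] = None,
    client_token: Optional[str] = None,
    replicas: Optional[int] = None,
    inference_engine: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble CreateModelService parameters (hypothetical helper)."""
    # Required parameters: DBInstanceId, ModelName, AiNodes.
    if not db_instance_id:
        raise ValueError("DBInstanceId is required")
    if not model_name:
        raise ValueError("ModelName is required")
    if not ai_nodes:
        raise ValueError("AiNodes must contain at least one AI node name")
    if inference_engine is not None and inference_engine not in SUPPORTED_INFERENCE_ENGINES:
        raise ValueError(f"unsupported InferenceEngine: {inference_engine!r}")

    params: Dict[str, Any] = {
        "DBInstanceId": db_instance_id,
        "ModelName": model_name,
        "AiNodes": list(ai_nodes),
        # Assumption: a random UUID is an acceptable idempotence token.
        "ClientToken": client_token or str(uuid.uuid4()),
    }

    # Add optional parameters only when the caller supplied them.
    optional = {
        "Description": description,
        "SecurityIPList": security_ip_list,
        "ResourceGroupId": resource_group_id,
        "Replicas": replicas,
        "InferenceEngine": inference_engine,
    }
    params.update({key: value for key, value in optional.items() if value is not None})
    return params


if __name__ == "__main__":
    print(
        build_create_model_service_params(
            "gp-xxxxxxxxx",
            "Qwen3-Embedding-8B",
            ["ai-xxxxxx"],
            replicas=1,
            inference_engine="vllm",
        )
    )
```

When retrying a failed call, reuse the same dictionary so that ClientToken stays unchanged. Building a new dictionary generates a new token, and the retry would then be treated as a separate request.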

Response parameters

| Parameter | Type | Description | Example |
| --- | --- | --- | --- |
| | object | | |
| ModelServiceId | string | The model service ID. | ms-xxxxxxxxx |
| RequestId | string | The unique ID of the request. | ABB39CC3-4488-4857-905D-2E4A051D0521 |

Examples

Sample success responses

JSON format

{
  "ModelServiceId": "ms-xxxxxxxxx",
  "RequestId": "ABB39CC3-4488-4857-905D-2E4A051D0521"
}
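
A caller typically needs the ModelServiceId from this response for later operations and the RequestId for troubleshooting. The following sketch parses a response body shaped like the sample above. The class and function names are hypothetical, and raising KeyError on a missing field is a design choice of this sketch rather than documented behavior.

```python
import json
from typing import NamedTuple


class CreateModelServiceResult(NamedTuple):
    """The two fields documented in the response parameters."""

    model_service_id: str
    request_id: str


def parse_create_model_service_response(body: str) -> CreateModelServiceResult:
    """Parse a JSON response body and extract the documented fields."""
    data = json.loads(body)
    # Fail early if the body is not shaped like a success response.
    missing = [key for key in ("ModelServiceId", "RequestId") if key not in data]
    if missing:
        raise KeyError(f"response is missing fields: {', '.join(missing)}")
    return CreateModelServiceResult(data["ModelServiceId"], data["RequestId"])


if __name__ == "__main__":
    sample = """
    {
      "ModelServiceId": "ms-xxxxxxxxx",
      "RequestId": "ABB39CC3-4488-4857-905D-2E4A051D0521"
    }
    """
    result = parse_create_model_service_response(sample)
    print(result.model_service_id, result.request_id)
```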

Error codes

For a list of error codes, see Service error codes.