Vector Retrieval Service for Milvus (Milvus) allows you to create Milvus instances for building high-performance, elastically scalable vector databases. It includes Attu, a built-in visual management tool for efficient operations and monitoring. This topic describes how to quickly create a Milvus instance.
Prerequisites
You have an Alibaba Cloud account. If you do not have an account, you can register one. For more information, see the Alibaba Cloud account registration process.
When you make your first purchase, you must grant Milvus permission to access the required cloud resources. For more information, see Alibaba Cloud account role authorization.
If you use a Resource Access Management (RAM) user, you must grant the required permissions to that user. For more information, see RAM user authorization.
Procedure
Go to the Milvus page.
Log on to the Vector Retrieval Service for Milvus console.
In the navigation pane on the left, click Instances.
On the Instances page, click Create Instance.
On the buy page, configure the following parameters.
Parameter
Example
Description
Billing Method
Subscription
Subscription and pay-as-you-go billing methods are supported.
Duration
1 Month
The default subscription duration is 1 month. For supported durations, see the options on the page.
Regions And Zones
China (Hangzhou)
Zone J
The physical location and zone where the instance resides.
ImportantThe region and zone cannot be changed after the instance is created. Choose carefully.
VPC
vpc_Hangzhou/vpc-bp1f4epmkvncimpgs****
A virtual private cloud (VPC) is an isolated network environment that you define on Alibaba Cloud. You have full control over your VPC.
Select an existing VPC. To create a new VPC, click VPC console. For more information, see Create and manage a VPC.
vSwitch
vsw_i/vsw-bp1e2f5fhaplp0g6p****
A vSwitch is a basic network module of a VPC that connects different cloud resources.
Select an existing vSwitch. To create a new vSwitch, click VPC console. For more information, see Create and manage vSwitches.
Service-linked Role
AliyunServiceRoleForCloudMilvus
Your Alibaba Cloud account has been granted the AliyunServiceRoleForCloudMilvus role and the associated AliyunServiceRolePolicyForMilvus policy.
Milvus uses this role to access your resources in other cloud products.
Milvus Version
2.4
The community version number of Milvus.
Specifications
Standard Edition
The specifications of the Milvus instance.
Basic Edition: Suitable for test environments. The resource specifications are fixed.
Standard Edition: Suitable for production environments. You can customize the resource specifications as needed.
ImportantThe Basic Edition is for trial and functional testing only. It cannot be upgraded to the Standard Edition and does not guarantee a SLA.
HA
Enable
High availability is enabled by default for the Standard Edition but is not supported for the Basic Edition. You can enable high availability when you create the instance or after it is created. If High-availability Mode is enabled, the number of replicas for each component must be greater than 1.
NoteIn a production environment, enabling high availability is highly recommended to ensure system stability and receive SLA support.
Service Node
Metadata Service: 4 vCPU 16 GiB 2
Proxy: 2 vCPU 8 GiB 2
Index Node: 4 vCPU 16 GiB 2
Data Node: 2 vCPU 8 GiB 2
Service nodes are responsible for processing client requests and managing the cluster status. They distribute query requests to appropriate compute nodes, collect the results, and return them to the user. They also maintain the cluster's metadata to ensure that requests are correctly routed to the corresponding compute nodes. Configure the following information based on the Milvus instance specifications:
Basic Edition: If you select this edition, you must also configure the Standalone parameter. This parameter defines the node configuration for the Starter Edition. You can use the default value of 4 vCPU 16 GiB.
Standard Edition: If you select this edition, you must also configure the following parameters.
Metadata Service: A service component that stores the status information of the instance. The IndexCoord, QueryCoord, and DataCoord services are deployed on this node.
Proxy: Specify the specifications and number of replicas for the Proxy nodes.
Index Node: Specify the specifications and number of replicas for the IndexNode nodes.
Data Node: Specify the specifications and number of replicas for the DataNode nodes.
Compute Node
Query Node Performance-optimized 4 vCPU 16 GiB 2
Compute nodes are responsible for storing vector data and performing operations such as search, insert, and delete based on tasks assigned by service nodes. For more information about compute nodes, see Compute node specifications and performance comparison.
Automatic Backup
Enabled
ImportantUsing the backup feature incurs storage fees. For more information, see Billing items.
The automatic backup feature is enabled by default. This feature ensures the data security of your instance and guarantees the service SLA. If data is accidentally lost, you can use this feature to recover it.
NoteTo disable this feature, go to the Backup Snapshot tab after the instance is created. For more information, see Backup and restoration.
Storage Costs
Not applicable
You do not need to select a storage capacity in advance. Storage costs are billed based on your actual hourly usage (free during public preview). The usage is displayed on the instance details page.
Resource Group
Custom
Select an existing resource group. To create a new resource group, click Create Resource Group. Resource groups allow you to group your cloud resources based on dimensions such as purpose, permissions, and ownership. For more information, see What is Resource Group?.
Tag
Custom
You can attach tags when you create an instance or add them after the instance is created. Tags help you identify and manage your instance resources. For more information about tags, see What is Tag?.
Password
Custom
Set the password for the root account of the Milvus instance to log on to the database.
NoteIf you forget the password, see How do I reset the instance password?.
Click Buy Now and complete the payment.
The instance is created when its status changes to Running.