Vector Retrieval Service for Milvus: Quickly create a Milvus instance

Last Updated: Sep 22, 2025

Vector Retrieval Service for Milvus (Milvus) allows you to create Milvus instances for building high-performance, elastically scalable vector databases. It includes Attu, a built-in visual management tool for efficient operations and monitoring. This topic describes how to quickly create a Milvus instance.

Prerequisites

  • You have an Alibaba Cloud account. If you do not have an account, you can register one. For more information, see the Alibaba Cloud account registration process.

  • When you make your first purchase, you must grant Milvus permission to access the required cloud resources. For more information, see Alibaba Cloud account role authorization.

  • If you use a Resource Access Management (RAM) user, you must grant the required permissions to that user. For more information, see RAM user authorization.

Procedure

  1. Go to the Instances page.

    1. Log on to the Vector Retrieval Service for Milvus console.

    2. In the navigation pane on the left, click Instances.

  2. On the Instances page, click Create Instance.

  3. On the buy page, configure the following parameters.

    Parameter

    Example

    Description

    Billing Method

    Subscription

    Subscription and pay-as-you-go billing methods are supported.

    Duration

    1 Month

    The default subscription duration is 1 month. For supported durations, see the options on the page.

    Regions And Zones

    China (Hangzhou)

    Zone J

    The physical location and zone where the instance resides.

    Important

    The region and zone cannot be changed after the instance is created. Choose carefully.

    VPC

    vpc_Hangzhou/vpc-bp1f4epmkvncimpgs****

    A virtual private cloud (VPC) is an isolated network environment that you define on Alibaba Cloud. You have full control over your VPC.

    Select an existing VPC. To create a new VPC, click VPC console. For more information, see Create and manage a VPC.

    vSwitch

    vsw_i/vsw-bp1e2f5fhaplp0g6p****

    A vSwitch is a basic network module of a VPC that connects different cloud resources.

    Select an existing vSwitch. To create a new vSwitch, click VPC console. For more information, see Create and manage vSwitches.

    Service-linked Role

    AliyunServiceRoleForCloudMilvus

    Your Alibaba Cloud account has been granted the AliyunServiceRoleForCloudMilvus role and the associated AliyunServiceRolePolicyForMilvus policy.

    Milvus uses this role to access your resources in other cloud products.

    Milvus Version

    2.4

    The community version number of Milvus.

    Specifications

    Standard Edition

    The specifications of the Milvus instance.

    • Basic Edition: Suitable for test environments. The resource specifications are fixed.

    • Standard Edition: Suitable for production environments. You can customize the resource specifications as needed.

    Important

    The Basic Edition is for trial and functional testing only. It cannot be upgraded to the Standard Edition and does not guarantee an SLA.

    HA

    Enable

    High availability (HA) is enabled by default for the Standard Edition and is not supported for the Basic Edition. You can enable HA when you create the instance or after it is created. If HA is enabled, each component must have more than one replica.

    Note

    In a production environment, enabling high availability is highly recommended to ensure system stability and receive SLA support.

    Service Node

    • Metadata Service: 4 vCPU, 16 GiB, 2 replicas

    • Proxy: 2 vCPU, 8 GiB, 2 replicas

    • Index Node: 4 vCPU, 16 GiB, 2 replicas

    • Data Node: 2 vCPU, 8 GiB, 2 replicas

    Service nodes are responsible for processing client requests and managing the cluster status. They distribute query requests to appropriate compute nodes, collect the results, and return them to the user. They also maintain the cluster's metadata to ensure that requests are correctly routed to the corresponding compute nodes. Configure the following information based on the Milvus instance specifications:

    • Basic Edition: If you select this edition, you must also configure the Standalone parameter. This parameter defines the node configuration for the Basic Edition. You can use the default value of 4 vCPU 16 GiB.

    • Standard Edition: If you select this edition, you must also configure the following parameters.

      • Metadata Service: A service component that stores the status information of the instance. The IndexCoord, QueryCoord, and DataCoord services are deployed on this node.

      • Proxy: Set the specifications and the number of replicas for the Proxy nodes.

      • Index Node: Set the specifications and the number of replicas for the IndexNode component.

      • Data Node: Set the specifications and the number of replicas for the DataNode component.

    Compute Node

    Query Node: Performance-optimized, 4 vCPU, 16 GiB, 2 replicas

    Compute nodes are responsible for storing vector data and performing operations such as search, insert, and delete based on tasks assigned by service nodes. For more information about compute nodes, see Compute node specifications and performance comparison.

    Automatic Backup

    Enabled

    Important

    Using the backup feature incurs storage fees. For more information, see Billing items.

    The automatic backup feature is enabled by default. This feature ensures the data security of your instance and guarantees the service SLA. If data is accidentally lost, you can use this feature to recover it.

    Note

    To disable this feature, go to the Backup Snapshot tab after the instance is created. For more information, see Backup and restoration.

    Storage Costs

    Not applicable

    You do not need to select a storage capacity in advance. Storage costs are billed based on your actual hourly usage (free during public preview). The usage is displayed on the instance details page.

    Resource Group

    Custom

    Select an existing resource group. To create a new resource group, click Create Resource Group. Resource groups allow you to group your cloud resources based on dimensions such as purpose, permissions, and ownership. For more information, see What is Resource Group?.

    Tag

    Custom

    You can attach tags when you create an instance or add them after the instance is created. Tags help you identify and manage your instance resources. For more information about tags, see What is Tag?.

    Password

    Custom

    Set the password for the root account of the Milvus instance. You use this account to log on to the database.

    Note

    If you forget the password, see How do I reset the instance password?.
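The HA rule in the table above (each component needs more than one replica when HA is enabled) can be checked against a planned configuration before you fill in the buy page. The following is a minimal sketch, not part of the console or of any Alibaba Cloud SDK. The `NodeSpec` class and the `ha_violations` function are hypothetical names introduced for illustration, and treating the Query Node as a component covered by the rule is an assumption. The sample values are the example values from the table.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class NodeSpec:
    """Planned size of one component (hypothetical helper, not a console object)."""
    vcpu: int
    memory_gib: int
    replicas: int


def ha_violations(components: dict[str, NodeSpec], ha_enabled: bool) -> list[str]:
    """Return the components that break the rule 'more than one replica when HA is enabled'."""
    if not ha_enabled:
        # Without HA, the rule does not apply, so nothing is reported.
        return []
    return [name for name, spec in components.items() if spec.replicas <= 1]


# Example values copied from the parameter table.
planned = {
    "Metadata Service": NodeSpec(vcpu=4, memory_gib=16, replicas=2),
    "Proxy": NodeSpec(vcpu=2, memory_gib=8, replicas=2),
    "Index Node": NodeSpec(vcpu=4, memory_gib=16, replicas=2),
    "Data Node": NodeSpec(vcpu=2, memory_gib=8, replicas=2),
    # Assumption: the Query Node is also subject to the HA replica rule.
    "Query Node": NodeSpec(vcpu=4, memory_gib=16, replicas=2),
}

print(ha_violations(planned, ha_enabled=True))
```

An empty list means that the planned configuration satisfies the rule. Any component name in the list needs its replica count raised before HA can be enabled.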

  4. Click Buy Now and complete the payment.

    The instance is created when its status changes to Running.
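After the instance is Running, a client can connect with the root account and the password set on the buy page. The following is a minimal sketch that assumes the open source pymilvus client and its `MilvusClient(uri=..., token="user:password")` form. The endpoint string is a placeholder that you replace with the address shown for your instance, port 19530 is the open source Milvus default and is an assumption here, and `build_connection_args` and `connect` are hypothetical helper names.

```python
def build_connection_args(endpoint: str, password: str,
                          port: int = 19530, user: str = "root") -> dict:
    """Build keyword arguments for pymilvus.MilvusClient.

    Assumptions: the endpoint is reachable over HTTP on the given port,
    and 19530 (the open source Milvus default) is the port in use.
    """
    return {
        "uri": f"http://{endpoint}:{port}",
        "token": f"{user}:{password}",  # pymilvus accepts "user:password" as the token
    }


def connect(endpoint: str, password: str):
    """Open a client connection; requires `pip install pymilvus`."""
    # Imported lazily so the helper above can be used without pymilvus installed.
    from pymilvus import MilvusClient

    return MilvusClient(**build_connection_args(endpoint, password))


# Usage (placeholder values; run from a host that can reach the instance):
# client = connect("<your-instance-endpoint>", "<your-root-password>")
# print(client.list_collections())
```

The connection arguments are built separately from the connection itself so that the address and credentials can be checked without network access.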