Create a StarRocks instance to obtain a managed, high-performance environment. You do not need to build or maintain infrastructure, so you can run large-scale data analytics and queries with less operational effort.
Procedure
Go to the EMR Serverless StarRocks instance list page.
Log on to the E-MapReduce console.
In the navigation pane on the left, choose .
In the top menu bar, select the required region.
On the Instance List page, click Create Instance.
On the E-MapReduce Serverless StarRocks page, configure the instance.
Note: The configuration parameters vary based on the selected instance type. BE is for the compute-storage integrated edition, and CN is for the compute-storage separated edition.
Configuration item
Description
Product Type
The supported types are as follows:
Subscription: A prepaid billing method. You reserve resources in advance and receive larger discounts to maximize your savings.
Pay-as-you-go: A postpaid billing method. You are billed for resources based on actual usage. Bills are generated each billing cycle and the fees are deducted from your account.
Region
The physical location of the instance.
Important: After an instance is created, its region cannot be changed. Select the region with care.
Network And Zone
Select the virtual private cloud, zone, and corresponding vSwitch information.
Virtual private cloud (VPC): A VPC is an isolated network environment that you define in Alibaba Cloud. You have full control over your VPC.
Select an existing VPC, or click Create VPC to go to the VPC console and create one. For more information, see Create and manage a VPC.
Note: When you create a VPC, the IPv4 CIDR block must be selected from one of the following three private network ranges defined in RFC 1918:
10.0.0.0/8 (10.0.0.0 to 10.255.255.255)
172.16.0.0/12 (172.16.0.0 to 172.31.255.255)
192.168.0.0/16 (192.168.0.0 to 192.168.255.255)
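The range rule in this note can be checked before you open the VPC console. The following is a minimal sketch that uses Python's standard `ipaddress` module. The function name `is_rfc1918_block` is illustrative, and the sketch checks only the RFC 1918 range rule, not any additional limits the VPC console may apply, such as allowed mask lengths.

```python
import ipaddress

# The three private ranges listed in the note above (RFC 1918).
RFC1918_RANGES = [
    ipaddress.ip_network("10.0.0.0/8"),
    ipaddress.ip_network("172.16.0.0/12"),
    ipaddress.ip_network("192.168.0.0/16"),
]


def is_rfc1918_block(cidr: str) -> bool:
    """Return True if the IPv4 CIDR block lies entirely inside one RFC 1918 range."""
    try:
        network = ipaddress.ip_network(cidr, strict=True)
    except ValueError:
        # Not a valid CIDR block (bad syntax or host bits set).
        return False
    if network.version != 4:
        return False
    return any(network.subnet_of(private) for private in RFC1918_RANGES)
```

For example, `is_rfc1918_block("172.16.0.0/16")` is accepted, while `is_rfc1918_block("172.32.0.0/16")` is rejected because it falls just outside the 172.16.0.0/12 range.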
If your Serverless StarRocks instance needs to access the internet, for example, to import data or query foreign tables, make sure its VPC has internet access. You can deploy an Internet NAT gateway in the VPC and enable the SNAT feature. This allows the Serverless StarRocks instance to access internet resources through the gateway. For more information, see Use the SNAT feature of an Internet NAT gateway to access the internet.
Zone: The zone where the instance is located.
vSwitch: A vSwitch is a basic network module of a VPC used to connect different cloud resources.
Select an existing vSwitch, or click Create VSwitch to go to the VPC console and create one. For more information, see Create and manage a vSwitch.
Instance Type
Select an instance type based on your scenario:
Compute-storage Integrated Edition: Suitable for scenarios that require extremely high query performance, such as OLAP multidimensional analysis, high-concurrency queries, and real-time data analytics. This instance type stores data on cloud disks or local disks to ensure high data read and write efficiency.
If you select this edition, configure FE Specifications and BE Specifications.
Compute-storage Separated Edition: Suitable for business scenarios that are sensitive to storage costs but have slightly lower requirements for query efficiency, such as OLAP multidimensional analysis, real-time data analytics, and data warehouse scenarios. This instance type uses a compute-storage separated architecture.
If you select this edition, configure FE Specifications and CN Specifications.
Multi-zone Disaster Recovery
This parameter applies only to the compute-storage separated edition.
This feature is disabled by default. If you enable it, the system supports real-time backup and disaster recovery across zones. This significantly improves data reliability and system high availability.
Important: Enabling this feature incurs additional resource and cost overhead:
The system requires extra FE nodes to coordinate and manage cross-zone backup jobs.
If you enable multi-zone backup, the underlying storage uses zone-redundant storage, which incurs additional data storage fees.
The system supports two configuration patterns: two zones or three zones. You can choose the appropriate configuration based on your business needs.
If you have high requirements for data security and disaster recovery, enable this feature. Otherwise, the default disabled state is sufficient for general needs.
For more information, see Disaster recovery management and Data storage (multi-zone) fees.
Instance Series
Supports Starter Edition and Standard Edition. For more information, see Instance series description.
Note: The Starter Edition is only available in the China (Beijing), China (Shanghai), China (Shenzhen), and China (Hangzhou) regions.
Kernel Version
The community version number of StarRocks.
The latest minor version used is also displayed below, for example: 3.3.13-1.1.1-1.7.13.
FE Specifications
Specification type: The specification type of FE nodes varies with the Instance series of StarRocks.
Starter Edition: Supports Standard.
Standard Edition: Supports Standard and Memory-enhanced specifications.
Compute CU: Select the number of compute units (CUs).
Select the appropriate CU specification as needed. For more information about CU fees, see Billing items.
Data Disk: Only PL1 ESSD is supported. The data disk size ranges from 100 GB to 65,000 GB, in increments of 100 GB.
For more information about cloud disks, see ESSDs.
High Availability: Enabled by default. The Standard Edition supports high availability (HA). When you enable HA, the number of StarRocks FE nodes increases from 1 to 3 to reduce the risk of failures.
Important: Enable high availability for production environments.
Number Of Nodes: The number of FE nodes. The value can be an odd number from 1 to 11.
Load Balancing: The following methods are supported.
Built-in PrivateZone: Uses PrivateZone domain name resolution to automatically balance traffic. There are no extra fees. This is suitable for lightweight, cost-sensitive, or non-production environments, and for services with low requirements for load balancing performance.
Server Load Balancer (SLB): Uses the SLB service to achieve high-performance load balancing. This is recommended for production environments, especially for services with high performance and reliability requirements.
The feature to remove the FE leader from query traffic is available only after SLB is enabled.
You must activate the SLB service, which incurs extra fees. For more information, see CLB Billing overview.
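The numeric limits listed for FE Specifications can be captured in a small pre-check. The sketch below is illustrative only: the function and parameter names are hypothetical, and the rule that high availability expects at least three FE nodes is an assumption inferred from the High Availability description, not a documented validation rule.

```python
from typing import List


def validate_fe_settings(data_disk_gb: int, node_count: int,
                         high_availability: bool) -> List[str]:
    """Return a list of problems; an empty list means the values fit the stated ranges."""
    problems = []
    # Data disk: 100 GB to 65,000 GB in steps of 100.
    if not (100 <= data_disk_gb <= 65000) or data_disk_gb % 100 != 0:
        problems.append("data disk must be 100-65000 GB in multiples of 100")
    # Number of nodes: an odd number from 1 to 11.
    if not (1 <= node_count <= 11) or node_count % 2 == 0:
        problems.append("FE node count must be an odd number from 1 to 11")
    # Assumption: with HA enabled, at least 3 FE nodes are expected.
    if high_availability and node_count < 3:
        problems.append("high availability expects at least 3 FE nodes")
    return problems
```

Returning a list of messages rather than raising on the first failure lets a caller report every out-of-range value at once.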
BE Specifications
This parameter applies only to the compute-storage integrated edition.
Specification type: The specification type of BE nodes varies with the Instance series of StarRocks.
Starter Edition: Supports Standard.
Standard Edition: Supports the following specifications.
Standard Specifications: The default specifications. One CU is equal to 1 CPU core and 4 GiB of memory. Enterprise SSDs (ESSDs) are used for data storage.
Memory-optimized Specifications: One CU is equal to 1 CPU core and 8 GiB of memory. This option is suitable for scenarios in which a large amount of memory resources are required, such as scenarios where complex queries are made or scenarios where high concurrency is required. ESSDs are used for data storage.
Network-enhanced Specifications: One CU is equal to 1 CPU core and 4 GiB of memory. The network bandwidth is two or more times that of the standard specifications. This option is suitable for analysis of external tables that contain a large amount of data. ESSDs are used for data storage.
High-performance Storage: If you select this option, you must select the desired specifications based on your business requirements. This option is suitable for scenarios in which high storage I/O performance is required. Local SSDs are used for data storage.
High-specification Storage: If you select this option, you must select the desired specifications based on your business requirements. Local HDDs are used for data storage. This option is suitable for scenarios in which a large volume of data needs to be stored and cost-effective storage is required, but high storage I/O performance is not required.
Compute CU: Select the number of compute units (CUs).
Select the appropriate CU specification as needed. For more information about CU fees, see Billing items.
Data Disk: Supports PL0 ESSD, PL1 ESSD (recommended), PL2 ESSD, and PL3 ESSD. For more information, see ESSDs.
The cache disk size ranges from 100 to 65,000 GB. The number of cache disks is 1 by default. The value can range from 1 to 8, with a step size of 1.
Note: You can enter the required storage capacity, and the system will automatically provide a default recommended configuration. If the cloud disk capacity you choose exceeds the recommended threshold, the system will display a prompt to help you make appropriate adjustments to ensure optimal performance.
Number Of Nodes: The number of BE nodes. The value can range from 3 to 50.
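To get a rough feel for how the BE disk and node settings combine, the following sketch validates the ranges stated above and multiplies them into a total raw capacity. It assumes that the disk count applies to each node, which the text does not state explicitly. The function name is hypothetical, and the result ignores replication and file system overhead, so usable capacity will be lower.

```python
def be_raw_capacity_gb(disk_size_gb: int, disks_per_node: int = 1,
                       node_count: int = 3) -> int:
    """Validate BE disk and node settings and return the total raw disk capacity in GB."""
    if not 100 <= disk_size_gb <= 65000:
        raise ValueError("disk size must be between 100 and 65,000 GB")
    if not 1 <= disks_per_node <= 8:
        raise ValueError("number of disks per node must be between 1 and 8")
    if not 3 <= node_count <= 50:
        raise ValueError("number of BE nodes must be between 3 and 50")
    # Raw capacity only: replication and file system overhead are not modeled here.
    return disk_size_gb * disks_per_node * node_count
```

For example, four nodes with two 500 GB disks each give `be_raw_capacity_gb(500, 2, 4)`, which is 4,000 GB of raw disk.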
CN Specifications
This parameter applies only to the compute-storage separated edition.
Specification type: The specification type of CN nodes varies with the Instance series of StarRocks.
Starter Edition: Supports Standard.
Standard Edition: Supports the following specifications.
Standard Specifications: The default specifications. One CU is equal to 1 CPU core and 4 GiB of memory. Enterprise SSDs (ESSDs) are used for data storage.
Memory-optimized Specifications: One CU is equal to 1 CPU core and 8 GiB of memory. This option is suitable for scenarios in which a large amount of memory resources are required, such as scenarios where complex queries are made or scenarios where high concurrency is required. ESSDs are used for data storage.
Network-enhanced Specifications: One CU is equal to 1 CPU core and 4 GiB of memory. The network bandwidth is two or more times that of the standard specifications. This option is suitable for analysis of external tables that contain a large amount of data. ESSDs are used for data storage.
High-performance Storage: If you select this option, you must select the desired specifications based on your business requirements. This option is suitable for scenarios in which high storage I/O performance is required. Local SSDs are used for data storage.
High-specification Storage: If you select this option, you must select the desired specifications based on your business requirements. Local HDDs are used for data storage. This option is suitable for scenarios in which a large volume of data needs to be stored and cost-effective storage is required, but high storage I/O performance is not required.
Compute CU: Select the number of compute units (CUs).
Select the appropriate CU specification as needed. For more information about CU fees, see Billing items.
Data Disk: Supports PL0 ESSD, PL1 ESSD (recommended), PL2 ESSD, PL3 ESSD, Elastic Ephemeral Disk (Standard Edition), and Elastic Ephemeral Disk (Premium Edition). For more information about the fees for cloud disks and elastic ephemeral disks, see Billing items.
For more information, see ESSDs and Elastic ephemeral disks.
Note: You can enter the required storage capacity, and the system will automatically provide a default recommended configuration. If the cloud disk capacity you choose exceeds the recommended threshold, the system will display a prompt to help you make appropriate adjustments to ensure optimal performance.
Only the Standard, Memory-enhanced, and Compute-enhanced specifications support elastic ephemeral disks. Elastic ephemeral disks have limitations on regions and zones.
Number Of Nodes: The number of CN nodes. The value can range from 1 to 100.
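The CU definitions given for the Standard, Memory-optimized, and Network-enhanced specification types translate directly into CPU and memory totals. The sketch below shows that arithmetic. The dictionary keys and the function name are hypothetical identifiers chosen for illustration, and the storage-oriented types are left out because the text gives no CU ratio for them.

```python
# GiB of memory per CU, as stated above; each listed type provides 1 CPU core per CU.
MEMORY_GIB_PER_CU = {
    "standard": 4,
    "memory-optimized": 8,
    "network-enhanced": 4,
}


def cu_resources(spec_type: str, compute_cu: int) -> tuple:
    """Return (cpu_cores, memory_gib) for a CU count of the given specification type."""
    if spec_type not in MEMORY_GIB_PER_CU:
        raise ValueError(f"no CU ratio documented for {spec_type!r}")
    if compute_cu <= 0:
        raise ValueError("compute_cu must be positive")
    return compute_cu, compute_cu * MEMORY_GIB_PER_CU[spec_type]
```

With these ratios, 16 CUs of the Memory-optimized type correspond to 16 CPU cores and 128 GiB of memory, twice the memory of the same CU count on the Standard type.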
Data Storage
This parameter applies only to the compute-storage separated edition.
Data storage fees are charged on an hourly basis based on the actual usage in GB. For billing details, see Billing items.
Instance Name
The name of the instance. The name must be 1 to 64 characters in length and can contain only Chinese characters, letters, digits, hyphens (-), and underscores (_).
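The naming rule can be expressed as a regular expression. This is a sketch rather than the console's actual validation: it assumes that "Chinese characters" means the basic CJK Unified Ideographs block (U+4E00 to U+9FFF), and the function name is illustrative.

```python
import re

# Assumption: "Chinese characters" is approximated by the basic CJK Unified
# Ideographs block (U+4E00 to U+9FFF); the console may accept a different set.
_INSTANCE_NAME = re.compile(r"[\u4e00-\u9fffA-Za-z0-9_-]{1,64}")


def is_valid_instance_name(name: str) -> bool:
    """Return True if the whole name is 1 to 64 allowed characters."""
    return _INSTANCE_NAME.fullmatch(name) is not None
```

`fullmatch` is used so that the entire name must consist of allowed characters. A name such as `starrocks-prod_01` passes, while a name that contains a space does not.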
Administrator User
The administrator user for managing StarRocks. The default value is admin and cannot be changed.
Logon Password and Confirm Password
The password for the built-in administrator user of the StarRocks instance. Record this password. You will need it to manage and use the StarRocks instance. If you forget the password, you can reset it. For more information, see How do I reset the password of an instance?
The password must be 8 to 30 characters in length and contain at least one uppercase letter, one lowercase letter, one digit, and one special character:
@#$%^*_+-.
Execution RAM Role
This role grants the StarRocks instance the permission to access data in Object Storage Service (OSS).
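The password rules given under Logon Password and Confirm Password above can be checked locally before you submit the form. The following sketch makes two assumptions: the special character set is `@#$%^*_+-`, with the trailing period in the text treated as sentence punctuation, and characters outside the four required classes are not rejected, because the text does not say whether they are allowed. The names are illustrative.

```python
import string

# Assumption: the trailing period in the documented list is sentence punctuation,
# so it is not included in the special character set.
SPECIAL_CHARACTERS = "@#$%^*_+-"


def password_problems(password: str) -> list:
    """Return the rules the password violates; an empty list means it passes."""
    checks = [
        (8 <= len(password) <= 30, "length must be 8 to 30 characters"),
        (any(c in string.ascii_uppercase for c in password), "needs an uppercase letter"),
        (any(c in string.ascii_lowercase for c in password), "needs a lowercase letter"),
        (any(c in string.digits for c in password), "needs a digit"),
        (any(c in SPECIAL_CHARACTERS for c in password), "needs a special character"),
    ]
    return [message for ok, message in checks if not ok]
```

If the period turns out to be an accepted special character, only the `SPECIAL_CHARACTERS` constant needs to change.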
(Optional) Advanced Settings
Data Disk Encryption: You can enable this feature only when you create a Compute-storage Integrated Edition instance. If you enable this feature, both dynamic data in transit and static data on the data disk are encrypted.
Note: To use this feature, you must activate Key Management Service (KMS) and create a key. For more information, see Activate KMS and Create a key.
This feature is supported only for cloud disks, not for elastic ephemeral disks or local disks.
Enable Manager Agent: The Manager Agent is an agent service that connects StarRocks Manager to the instance. It is deployed on the FE nodes by default. If your FE nodes are under heavy load or you have higher requirements for StarRocks Manager stability, you can enable independent Manager Agent resources. For more information, see Manage Manager Agent.
Important: Manager Agent resources are billed based on CUs. For billing details, see Billing items.
Resource Group: You can use the default resource group, select an existing one, or click Create Resource Group to go to the Resource Management console and create one. For more information, see Create a resource group.
Note: A resource group is a mechanism for managing resources by group under an Alibaba Cloud account. Resource groups help you manage the complexity of resource grouping and authorization within a single Alibaba Cloud account. For more information about resource groups, see What is Resource Group?
Tag: You can attach tags when you create an instance, or add them after the instance is created. This helps you identify and manage your instance resources.
In the Instance Overview section on the right, select the Terms of Service checkbox, and then click Create Instance. Complete the payment as prompted.
If you purchase a subscription instance, you can select a Subscription Duration. If you have a coupon, you can choose to use it.
After you complete the payment, return to the instance list page to view the created instance. The instance is successfully created when its Status changes to Running.
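If you track instance creation from a script instead of refreshing the instance list page, the wait for the Running status can be sketched as a generic polling loop. The `get_status` callable is a hypothetical function that you supply yourself, for example a wrapper around whatever tooling you use to read the instance status. It is not an Alibaba Cloud API, and the default timeout and interval are arbitrary.

```python
import time
from typing import Callable


def wait_until_running(get_status: Callable[[], str],
                       timeout_s: float = 1800.0,
                       interval_s: float = 15.0,
                       sleep: Callable[[float], None] = time.sleep) -> bool:
    """Poll get_status() until it returns "Running" or the timeout elapses."""
    deadline = time.monotonic() + timeout_s
    while True:
        if get_status() == "Running":
            return True
        if time.monotonic() >= deadline:
            return False
        # Wait before the next check to avoid polling in a tight loop.
        sleep(interval_s)
```

The `sleep` parameter exists only so that the loop can be exercised in tests without real delays.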
Instance series description
You cannot switch between instance series after an instance is created. Select the instance series carefully when you create an instance.
Item | Starter Edition | Standard Edition |
Scenarios | For experiencing StarRocks features and performing functional testing only. This edition is not recommended for production scenarios. | Suitable for all StarRocks scenarios, including testing and production environments. |
Resource limits | The limits are as follows: | No resource limits. This edition follows the standard sales specifications. |
Stability | High availability is not supported. No Service-Level Agreement (SLA) is provided, and stability is not guaranteed. | If you enable high availability, an SLA is provided to guarantee stability. |
Instance management operations | | Supports all operations. |
References
If you are unsure which specifications to choose when creating a StarRocks instance, see Instance specification planning and suggestions.
To quickly create an instance, see the following documents:
You can connect to an instance in multiple ways, such as using Manager SQL Editor, Quick BI, or DMS. For more information, see Connect to an instance.
You can use the built-in StarRocks Manager. For more information, see Connect to a StarRocks instance using EMR StarRocks Manager.