Use this topic to create an Apache Doris cluster on E-MapReduce (EMR) on ECS. The cluster creation wizard guides you through software, hardware, and basic configuration in a single workflow.
After a cluster is created, you cannot change any configuration except the cluster name. Review all settings carefully before you confirm.
Prerequisites
Before you begin, ensure that you have:
An Alibaba Cloud account with sufficient balance to cover cluster costs
The required permissions to create EMR clusters and access related services (VPC, ECS, OSS)
An existing virtual private cloud (VPC) and vSwitch in your target region, or permission to create them
Create a Doris cluster
Step 1: Go to the cluster creation page
Log on to the EMR console. In the left-side navigation pane, click EMR on ECS.
(Optional) In the top navigation bar, select the region where you want to create the cluster, and select a resource group.
You cannot change the region of a cluster after it is created. By default, all resource groups in your account are displayed.
Click Create Cluster.
Step 2: Configure software parameters
| Parameter | Description | Example value |
|---|---|---|
| Region | The region where the cluster is created. Cannot be changed after creation. | cn-hangzhou |
| Business Scenario | The workload type. Select Data Analytics for Doris clusters. | Data Analytics |
| Product Version | The EMR version. The latest version is selected by default. | EMR-5.x.x |
| High Service Availability | When turned on, three master nodes are deployed to ensure high availability of the FE service. Off by default. | On (production), Off (test) |
| Optional Services (Select One At Least) | Additional services to deploy. The relevant processes start automatically when the cluster starts. | — |
| Advanced Settings > Custom Software Configuration | Use a JSON file to specify custom parameters for services such as Hadoop, Spark, and Hive. Off by default. | — |
Step 3: Configure hardware parameters
| Parameter | Description | Example value |
|---|---|---|
| Billing Method | The billing method for the cluster. Subscription is selected by default. For short-term tests or dynamic workloads, use Pay-as-you-go. For stable production workloads, use Subscription. | Pay-as-you-go |
| Zone | A physical area with independent power and network within a region. Clusters in zones within the same region can communicate over the internal network. The default zone works for most scenarios. | cn-hangzhou-h |
| VPC | The VPC where the cluster is deployed. An existing VPC is selected by default. To use a new VPC, create one in the VPC console first. | vpc-xxx |
| vSwitch | The vSwitch in the selected zone. If no vSwitch is available, create one in the VPC console. | vsw-xxx |
| Default Security Group | The security group for the cluster. An existing security group is selected by default. To create a new one, click create a new security group. | sg-xxx |
| Node Group | The node groups in the cluster. Select instance types based on your workload requirements. For details, see Overview of instance families. | ecs.g7.4xlarge |
Do not use an advanced security group created in the ECS console.
Step 4: Configure basic parameters
| Parameter | Description | Example value |
|---|---|---|
| Cluster Name | The name of the cluster. Must be 1–64 characters and can contain only letters, digits, hyphens (-), and underscores (_). | my-doris-cluster |
| Identity Credentials | The method to access cluster nodes. Key Pair (default): uses an SSH key pair. Password: uses a password you set for the master node. | Key Pair |
| Advanced Settings > ECS Application Role | (Optional) An ECS application role that allows EMR to request temporary AccessKey pairs when accessing other Alibaba Cloud services, such as OSS. Eliminates the need to enter AccessKey credentials manually. You can grant the access permissions of the application role on specific Alibaba Cloud services based on your business requirements. | — |
| Advanced Settings > Bootstrap Actions | (Optional) Custom scripts that run before the cluster starts Hadoop. For details, see Manage bootstrap actions. | — |
| Advanced Settings > Tags | (Optional) Tags to attach to the cluster for resource management. | env:production |
| Advanced Settings > Resource Group | (Optional) The resource group to which the cluster belongs. | — |
| Advanced Settings > Data Disk Encryption | (Optional) Encrypts data disks. Can only be enabled at cluster creation time. | — |
Password requirements (when using password-based authentication): 8–30 characters, must include uppercase letters, lowercase letters, digits, and at least one of the following special characters: ! @ # $ % ^ & *
For SSH key pair usage, see SSH key pair overview.
Step 5: Confirm and create
In the Confirm step, read the terms of service and select the check box.
Click Confirm.
Verify the cluster
The cluster is ready when its status changes to Running in the cluster list.