E-MapReduce: Create a ClickHouse cluster

Last Updated: Nov 29, 2025

This topic describes how to create a ClickHouse cluster and configure its settings.

Background

For information about instance types, memory, and disk settings, see Usage Recommendations.

Prerequisites

You have created a virtual private cloud (VPC) and a vSwitch in the destination region. For more information, see Create and manage a VPC and Create and manage a vSwitch.

Procedure

  1. Go to the cluster creation page.

    1. Log on to EMR on ECS.

    2. Optional: In the top menu bar, select a region and a resource group.

      • Region: The cluster is created in the selected region. You cannot change the region after the cluster is created.

      • Resource Group: By default, all resources in your account are displayed.

    3. Click Create Cluster at the top of the page.

  2. Configure the cluster.

    Configure the software, hardware, and basic settings of the cluster in turn, and then confirm the order.

    Important

    After a cluster is created, you can change only its name. No other configuration can be modified, so review all settings carefully before you create the cluster.

    1. Configure software.

      Configuration Item

      Description

      Region

      The cluster is created in the selected region. You cannot change the region after the cluster is created.

      Business Scenario

      Select the Data Analysis scenario.

      Product Version

      The latest software version is selected by default.

      High Service Availability

      Disabled by default.

      Optional Services (Select One At Least)

      Select ClickHouse.

      Important
      • For clusters that run EMR 5.11.0 or later or EMR 3.45.0 or later, if you select the ClickHouse service, the ZooKeeper service is selected by default.

      • For clusters that run EMR 5.8.0 to EMR 5.10.1 (inclusive), if you select only the ClickHouse service, the cluster uses the built-in ClickHouse Keeper instead of a separate ZooKeeper service. The performance of the built-in ClickHouse Keeper differs from that of ZooKeeper, so select the ZooKeeper service as well.

      • For clusters that run EMR 3.42.0 to EMR 3.44.1 (inclusive):

        • If you enable High Service Availability, the ZooKeeper service is selected by default when you select the ClickHouse service.

        • If you do not enable High Service Availability, the ZooKeeper service is not selected automatically when you select the ClickHouse service. Without ZooKeeper, ClickHouse cannot perform DDL operations, so select the ZooKeeper service manually.

      Advanced Settings

      Custom Software Configuration: You can specify a JSON file to configure the basic software of the cluster. For more information, see Configure custom software. This feature is disabled by default.
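
The version rules in the Important note for Optional Services can be sketched as a small lookup. This is an illustrative helper only, not part of EMR: the function name `zookeeper_selection` and the returned labels are hypothetical, and the sketch assumes that the 3.x and 5.x release lines are evaluated separately and that versions outside the ranges named above are simply not covered.

```python
# Hypothetical helper that mirrors the version rules described above.
AUTO = "selected by default"
MANUAL = "select ZooKeeper manually"
NOT_COVERED = "not covered by this topic"


def zookeeper_selection(version, high_availability=False):
    """Return how ZooKeeper is handled when ClickHouse is selected.

    version is a dotted string such as "5.10.1" (without the "EMR" prefix).
    """
    v = tuple(int(part) for part in version.split("."))
    if v[0] == 5:
        if v >= (5, 11, 0):
            return AUTO
        if (5, 8, 0) <= v <= (5, 10, 1):
            # Otherwise the built-in ClickHouse Keeper is used.
            return MANUAL
    elif v[0] == 3:
        if v >= (3, 45, 0):
            return AUTO
        if (3, 42, 0) <= v <= (3, 44, 1):
            # Without High Service Availability, ZooKeeper is not
            # selected automatically and DDL operations cannot run.
            return AUTO if high_availability else MANUAL
    return NOT_COVERED


if __name__ == "__main__":
    for ver, ha in [("5.11.0", False), ("5.9.0", False), ("3.43.0", True), ("3.43.0", False)]:
        print(ver, ha, "->", zookeeper_selection(ver, ha))
```

A check like this is only a reading aid for the rules above; the console remains the source of truth for which services are selected.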

    2. Configure hardware.

      Configuration Item

      Description

      Billing Method

      The default billing method is subscription. The following billing methods are supported:

      • Pay-as-you-go: A billing method where you pay after you use resources. You are billed for the actual usage duration on an hourly basis. This method is suitable for short-term testing or flexible dynamic tasks.

      • Subscription: A billing method where you pay before you use resources.

        Note

        Use the Pay-as-you-go billing method for testing. After the tests pass, create a new Subscription cluster for production use.

      Zone

      Zones are different physical areas within the same region. Service interconnection is supported between zones. You can use the default zone.

      VPC

      If no VPC is available, click Create VPC to create one.

      vSwitch

      Select a vSwitch that belongs to the selected VPC and resides in the selected zone. If no vSwitch is available in the zone, create one.

      Default Security Group

      An existing security group is selected by default. For more information about security groups, see Security group overview.

      You can also click Create Security Group to create a new one.

      Important

      Do not use advanced security groups created on ECS.

      Node Group

      You can select instance types as needed. For more information, see Instance families.

      • System Disk: Select an enterprise SSD (ESSD), a standard SSD, or an ultra disk as needed.

      • System Disk Size: Adjust the disk capacity as needed. The default value is 80 GB. The value can range from 80 GB to 5,000 GB.

      • Data Disk: Select an enterprise SSD (ESSD), a standard SSD, or an ultra disk as needed.

      • Data Disk Size: Adjust the disk capacity as needed. The default value is 80 GB. The value can range from 40 GB to 32,768 GB.

      • Instances:

        • If High Service Availability is disabled: The default is one master node and one core node.

        • If High Service Availability is enabled: The default is three master nodes and three core nodes.

      • Assign Public Network IP: Specifies whether to assign an elastic IP address (EIP) to the cluster. This feature is disabled by default.

        Note

        To access the cluster over the Internet after it is created, apply for a public IP address in the ECS console. For more information about how to apply for an EIP, see the "Apply for an EIP" section in Elastic IP Address.
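
The disk size limits and default node counts listed for Node Group can be checked before you fill in the form. The following is a minimal sketch, assuming the ranges stated above; the names `DISK_RANGES_GB`, `check_disk_size`, and `default_node_counts` are hypothetical and are not part of any EMR API.

```python
# Size ranges in GB, taken from the Node Group description above.
DISK_RANGES_GB = {
    "system": (80, 5000),
    "data": (40, 32768),
}


def check_disk_size(kind, size_gb):
    """Return True if size_gb is within the documented range for the disk kind."""
    if kind not in DISK_RANGES_GB:
        raise ValueError(f"unknown disk kind: {kind!r}")
    low, high = DISK_RANGES_GB[kind]
    return low <= size_gb <= high


def default_node_counts(high_availability):
    """Return the default number of master and core nodes."""
    count = 3 if high_availability else 1
    return {"master": count, "core": count}


if __name__ == "__main__":
    print(check_disk_size("system", 120))
    print(check_disk_size("data", 20))
    print(default_node_counts(high_availability=True))
```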

    3. Configure basic settings.

      Configuration Item

      Description

      Cluster Name

      The name of the cluster. The name must be 1 to 64 characters in length and can contain only Chinese characters, letters, digits, hyphens (-), and underscores (_).

      Identity Credentials

      • Key Pair (Default): Use an SSH key pair to log on to the Linux instance. For more information about how to use key pairs, see SSH key pairs.

      • Password: Set the logon password for the master node. Use the password to log on to the Linux instance.

        The password must be 8 to 30 characters in length and contain uppercase letters, lowercase letters, digits, and special characters.

        Special characters include exclamation points (!), at signs (@), number signs (#), dollar signs ($), percent signs (%), carets (^), ampersands (&), and asterisks (*).

      Application Configurations

      Configure replicas and shards for ClickHouse.

      Advanced Settings

      • ECS Application Role: When your program runs on EMR compute nodes, you can access other Alibaba Cloud services, such as OSS, without providing an AccessKey. EMR automatically requests a temporary AccessKey to authorize the access. The ECS Application Role controls the permissions of this temporary AccessKey.

      • Bootstrap Actions: Optional. You can run custom scripts before Hadoop starts in the cluster. For more information, see Run a bootstrap action script.

      • Resource Group: Optional. For more information, see Use resource groups.
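
The Cluster Name and Password rules above can be checked locally before you submit the form. This is a minimal sketch with hypothetical function names. It assumes that "Chinese characters" means the basic CJK Unified Ideographs range (U+4E00 to U+9FFF) and that a password must contain at least one character from each of the four classes; the console may apply additional checks.

```python
import re
import string

# Assumption: Chinese characters are approximated by U+4E00 to U+9FFF.
NAME_PATTERN = re.compile(r"[\u4e00-\u9fffA-Za-z0-9_-]{1,64}")
SPECIAL_CHARACTERS = set("!@#$%^&*")


def is_valid_cluster_name(name):
    """1 to 64 characters: Chinese characters, letters, digits, '-', '_'."""
    return NAME_PATTERN.fullmatch(name) is not None


def is_valid_password(password):
    """8 to 30 characters with uppercase, lowercase, digit, and special character."""
    if not 8 <= len(password) <= 30:
        return False
    return (
        any(c in string.ascii_uppercase for c in password)
        and any(c in string.ascii_lowercase for c in password)
        and any(c in string.digits for c in password)
        and any(c in SPECIAL_CHARACTERS for c in password)
    )


if __name__ == "__main__":
    print(is_valid_cluster_name("ck-test_01"))
    print(is_valid_password("Abcdef1!"))
```

The sketch only verifies length and the presence of each character class; it does not reject characters outside those classes.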

  3. After you confirm that all the information is correct, select the Terms of Service check box and click Confirm.

    Important
    • Pay-as-you-go clusters: The cluster is created immediately. After creation is complete, the cluster enters the Running state.

    • Subscription clusters: An order is generated. The cluster is created after you complete the payment.