All Products
Search
Document Center

E-MapReduce:Create a Doris cluster

Last Updated:Aug 28, 2023

This topic describes how to create a Doris cluster.

Procedure

  1. Go to the cluster creation page.

    1. Log on to the E-MapReduce (EMR) console. In the left-side navigation pane, click EMR on ECS.

    2. Optional. In the top navigation bar, select the region where you want to create a cluster and select a resource group based on your business requirements.

      • You cannot change the region of a cluster after the cluster is created.

      • By default, all resource groups in your account are displayed.

    3. On the EMR on ECS page, click Create Cluster.

  2. Configure the cluster.

    To create a cluster, you must configure software parameters, hardware parameters, and basic parameters as guided by the wizard.

    Important

    After a cluster is created, you cannot modify its parameters except for the cluster name. Make sure that all parameters are correctly configured when you create a cluster.

    1. Configure software parameters.

      Parameter

      Description

      Region

      The region where you want to create the cluster. You cannot change the region of a cluster after the cluster is created.

      Business Scenario

      Select Data Analytics.

      Product Version

      The version of EMR. By default, the latest version is selected.

      High Service Availability

      By default, this switch is turned off. If you turn on this switch, three master nodes are deployed in the cluster to ensure high availability of the FE service.

      Optional Services (Select One At Least)

      The other services that you can select based on your business requirements. By default, the relevant processes for the services you specify are started.

      Advanced Settings

      Custom Software Configuration: specifies custom software settings. You can use a JSON file to specify custom parameters for basic services required for a cluster, such as Hadoop, Spark, and Hive. By default, this switch is turned off.

    2. Configure hardware parameters.

      Parameter

      Description

      Billing Method

      The billing method of the cluster. Subscription is selected by default. EMR supports the following billing methods:

      • Pay-as-you-go: a billing method that allows you to pay for an instance after you use the instance. The system charges you for a cluster based on the hours the cluster is actually used. Bills are generated on an hourly basis at the top of every hour. We recommend that you use pay-as-you-go clusters for short-term test jobs or dynamically scheduled jobs.

      • Subscription: a billing method that allows you to use an instance only after you pay for the instance.

        Note

        We recommend that you create a pay-as-you-go cluster for a test run. If the cluster passes the test, you can create a subscription cluster for production.

      Zone

      The zone where you want to create a cluster. A zone in a region is a physical area with independent power supplies and network facilities. Clusters in zones within the same region can communicate with each other over an internal network. In most cases, you can use the zone that is selected by default.

      VPC

      The virtual private cloud (VPC) where you want to deploy the cluster. An existing VPC is selected by default.

      If you want to use a new VPC, go to the VPC console to create one. For more information, see Create and manage a VPC.

      vSwitch

      The vSwitch of the cluster. Select a vSwitch in the specific zone based on your business requirements. If no vSwitch is available in the zone, go to the VPC console to create one. For more information, see Create and manage a vSwitch.

      Default Security Group

      The security group of the cluster. An existing security group is selected by default. For more information about security groups, see Overview.

      You can also click create a new security group to create a security group in the Elastic Compute Service (ECS) console. For more information, see Create a security group.

      Important

      Do not use an advanced security group that is created in the ECS console.

      Node Group

      The node groups of the cluster. You can select instance types based on your business requirements. For more information, see Overview of instance families.

    3. Configure basic parameters.

      Parameter

      Description

      Cluster Name

      The name of the cluster. The name must be 1 to 64 characters in length and can contain only letters, digits, hyphens (-), and underscores (_).

      Identity Credentials

      Key Pair (default): Use an SSH key pair to access the Linux instance.

      For information about how to use a key pair, see SSH key pair overview.

      Password: Use the password that you set for the master node to access the Linux instance.

      The password must be 8 to 30 characters in length and must contain uppercase letters, lowercase letters, digits, and special characters.

      The following special characters are supported:

      ! @ # $ % ^ & *

      Advanced Settings

      • ECS Application Role: You can assign an ECS application role to a cluster. Then, EMR applies for a temporary AccessKey pair when applications running on the compute nodes of the cluster access other Alibaba Cloud services, such as OSS. This way, you do not need to manually enter an AccessKey pair. You can grant the access permissions of the application role on specific Alibaba Cloud services based on your business requirements.

      • Bootstrap Actions: Optional. You can configure bootstrap actions to run custom scripts before a cluster starts Hadoop. For more information, see Manage bootstrap actions.

      • Tags: Optional. You can add a tag when you create a cluster or add a tag on the Basic Information tab after a cluster is created. For more information, see Manage and use tags.

      • Resource Group: Optional. For more information, see Use resource groups.

      • Data Disk Encryption: Optional. You can turn on this switch only when you create a cluster. For more information, see Enable data disk encryption.

  3. In the Confirm step, read the terms of service, and select the check box.

  4. Click Confirm.

    The cluster is successfully created if the status of the cluster changes to Running.