All Products
Search
Document Center

DataWorks:Cluster management

Last Updated:Jul 04, 2025

To develop and manage E-MapReduce (EMR) or CDH (Cloudera's Distribution Including Apache Hadoop, hereinafter referred to as CDH) jobs in DataWorks, you must attach the corresponding EMR or CDH cluster as a computing resource in DataWorks through cluster management. After attachment is complete, you can use this computing resource in DataWorks for data synchronization, development, and other operations.

Manage clusters

Attach an EMR cluster

Attach a CDH/CDP cluster

  • Supported cluster versions: DataWorks supports CDH5.16.2, CDH6.1.1, CDH6.2.1, CDH6.3.2, and CDP7.1.7 versions that you can directly select. The component versions that come with these cluster versions (the versions of each component in the cluster connection information) are fixed. If these cluster versions do not meet your business requirements, you can select Custom Version.

  • Configuration and attachment:

    • Old version of Data Development: Configure CDH computing resources in Management Hub > Cluster Management. For more information, see Old version of Data Development: Attach CDH computing resources.

    • New version of Data Development: Configure CDH computing resources in Management Hub > Computing Resources. For more information, see New version of Data Development: Attach CDH computing resources.