DataWorks provides official images to support various task types in Data Development. Each image has a pre-configured runtime environment for specific nodes. You can use these official images directly or as a base to create your own custom images. This topic describes the official images available in DataWorks.
Image overview
In Data Development, if you do not specify a runtime environment image for a node, the system uses a default standard image. The default image provides a basic runtime environment that may not meet the requirements of specific tasks. Official images provide pre-configured, standardized environments for various task types. You can use them directly or use them as a base for custom images with additional configurations to support a wider range of use cases.
Available images
For supported versions and regions, refer to the DataWorks console. Images may have multiple versions. The following table lists the capabilities of the latest image versions only.
DataWorks provides the following images:
Image name | Description | Task type |
dataworks_pyodps_py311_task_pod | The official image for DataWorks PyODPS nodes. This image uses Python 3.11. | |
dataworks_pairec_task_pod | The official image for DataWorks PAI-Rec, used to run algorithms generated by PAI-Rec. The specific versions of the feature_store SDK and pyfg are specified in the console. | |
dataworks_pyodps_task_pod | The official image for DataWorks PyODPS nodes. This image uses Python 3.7. | |
dataworks_emr_base_task_pod | A base image for EMR clusters. It supports EMR Serverless Spark, EMR on ECS DataLake, and EMR on ECS Custom cluster types. | |
dataworks_shell_jdk17_task_pod | The official image for DataWorks Shell nodes. This image uses JDK 17. | |
dataworks_shell_task_pod | The official image for DataWorks Shell nodes. This image uses JDK 7. If you need a custom runtime environment that supports Subprocess parameter passing, build a custom image based on this image. | |
dataworks_python_task_pod | The official image for DataWorks Python nodes. System information: py3.11-ubuntu22.04. | |
dataworks_cdh_custom_task_pod | A base image for DataWorks CDH clusters. This image cannot be used directly. You must first install a custom image by following the instructions in | |
dataworks_controller_task_pod | The official image for DataWorks assignment nodes. If you need a custom runtime environment and also need assignment nodes or assignment parameters to pass parameters to downstream nodes, build a custom image based on this image. | |
dataworks-mcp | Used for task development with DataWorks Agent for third-party clients. System information: py3.11-ubuntu22.04. | |
dataworks-notebook | Used for Notebook development tasks. System information: py3.11-ubuntu22.04. | |
dataworks_notebook_task_pod | The official image for DataWorks Notebook nodes. System information: py3.11-ubuntu22.04. The Python environment is consistent with the | |
dataworks-maxcompute | Used to build a MaxCompute custom image in a personal development environment. System information: py3.11-ubuntu20.04. | |
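When several images serve the same node type, it can help to keep the choice explicit in your own tooling. The following is a minimal sketch that encodes part of the table above as a Python dictionary and looks up images by node kind and runtime. The node labels (`"pyodps"`, `"shell"`, and so on), the runtime strings, and the `find_images` helper are hypothetical names chosen for illustration. They are not part of any DataWorks API.

```python
# Illustrative catalog transcribed from the table above. The "node" and
# "runtime" labels are hypothetical; DataWorks does not expose this structure.
OFFICIAL_IMAGES = {
    "dataworks_pyodps_py311_task_pod": {"node": "pyodps", "runtime": "python3.11"},
    "dataworks_pyodps_task_pod": {"node": "pyodps", "runtime": "python3.7"},
    "dataworks_shell_jdk17_task_pod": {"node": "shell", "runtime": "jdk17"},
    "dataworks_shell_task_pod": {"node": "shell", "runtime": "jdk7"},
    "dataworks_python_task_pod": {"node": "python", "runtime": "python3.11"},
    "dataworks_notebook_task_pod": {"node": "notebook", "runtime": "python3.11"},
}


def find_images(node, runtime=None):
    """Return the image names for a node kind, optionally filtered by runtime."""
    return sorted(
        name
        for name, info in OFFICIAL_IMAGES.items()
        if info["node"] == node and (runtime is None or info["runtime"] == runtime)
    )


if __name__ == "__main__":
    print(find_images("pyodps"))
    print(find_images("shell", "jdk17"))
```

A lookup like this only mirrors the documentation. The console remains the source of truth for the versions and regions that are actually available.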
Using images
In Data Development, you can use official images or custom images that are bound to your workspace.
Use an image in the new DataStudio: Configure the Run Configuration and Scheduling Configuration in the Properties and Scheduling Configuration panels on the right side of the node development page.
Use an image in the old DataStudio: On the node development page, configure the Resource Group and Image for the node's Trial Run and Post-deployment Run. You can do this in the dialog box that appears after you click Run with Parameters, or on the Scheduling Configuration page on the right side.
Use an image in a Personal Development Environment: When you create an instance for a personal development environment, you can select one of the official images in the Image Configuration section.
Note the following when you configure a resource group and an image:
Scheduling Resource Group: Select a serverless resource group.
Image: Select an Official Image or a Published Custom Image.
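The two constraints above can be checked before a node is submitted. The sketch below is one way to express them as a pre-flight check in Python. The `RunConfig` class, its field names, and the string values such as `"serverless"` and `"published_custom"` are assumptions made for this example and do not correspond to DataWorks API fields.

```python
from dataclasses import dataclass

# Hypothetical labels for the two image kinds the text allows.
ALLOWED_IMAGE_KINDS = {"official", "published_custom"}


@dataclass
class RunConfig:
    """Hypothetical description of a node's runtime settings."""
    resource_group_type: str  # for example "serverless"
    image_name: str
    image_kind: str  # "official", "published_custom", or another label


def validate(config):
    """Return a list of problems; an empty list means the config passes."""
    problems = []
    if config.resource_group_type != "serverless":
        problems.append("resource group must be a serverless resource group")
    if config.image_kind not in ALLOWED_IMAGE_KINDS:
        problems.append("image must be an official image or a published custom image")
    return problems


if __name__ == "__main__":
    ok = RunConfig("serverless", "dataworks_python_task_pod", "official")
    print(validate(ok))
```

Returning a list of problems rather than raising on the first one lets a caller report every misconfiguration in a single pass.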