All Products
Search
Document Center

DataWorks:Upgrade from legacy DataStudio to the new Data Studio

Last Updated:Jun 17, 2026

DataWorks Data Studio is undergoing a comprehensive architectural upgrade to keep pace with the evolution of cloud-native technologies, meet the demands of increasingly complex data processing scenarios, and address the current architecture's challenges in scalability, maintainability, and user experience. This upgrade aims to build a future-ready, high-performance, and highly available data intelligence development platform that provides users with a more efficient, intelligent, and unified experience for data development and governance.

Important

If you encounter any issues during the upgrade, you can get technical support from the DataWorks Data Studio Upgrade Support Group.

1. Background

The original technical architecture of DataWorks was built sixteen years ago. While it met the core needs of the batch processing era, its limitations have become more apparent as technology stacks and business scenarios have evolved. This upgrade is driven by three main factors:

Technical requirements for architectural evolution

  • Lakehouse architecture support: The existing architecture requires an upgrade to natively support a smooth evolution from traditional data warehouses to a lakehouse architecture, unifying metadata management and data processing.

  • Unified offline and real-time processing: To meet real-time business demands, the new architecture must provide a unified development paradigm and metadata view, seamlessly integrating offline batch processing and real-time stream computing.

  • Native integration of AI and large language models (LLMs): The new architecture must natively integrate AI capabilities, such as machine learning, deep learning, and natural language processing, into the entire data development lifecycle.

Existing architecture challenges

  • Limitations of a monolithic, batch-centric design: The legacy architecture is based on an early monolithic design focused on batch processing. It has inherent limitations in supporting cloud-native technologies and stream computing, which restricts the platform's elasticity and performance.

  • Scalability and maintainability bottlenecks: Tightly coupled modules result in long iteration cycles for new features and high maintenance costs, making it difficult to meet the demands of large-scale concurrent access and custom business extensions.

Evolving user demands

  • Support for large-scale and complex workflows: To handle the processing needs of tens of thousands of active users and exabyte-scale data, the new architecture must deliver higher stability, throughput, and resource isolation.

  • Enhanced development efficiency and intelligence: Users expect a development experience as smooth as a local IDE. The new architecture introduces intelligent code assistance, performance diagnostics, and one-click deployment to significantly lower the technical barrier for data development.

2. Scope of upgrade

Module

Changes

Data Studio

An upgrade to the user interface (UI) and user experience (UX) only.

Important

Other product modules, such as Operation Center and Deployment Center, are not related to this upgrade and are not included in the upgrade scope.

3. Impact of the upgrade

Impact on existing business

Item

Impact description

Deployed production tasks

Zero impact. All online task scheduling and execution will continue to run stably without any interruption.

Core data development features

Fully retained and enhanced. Node types, function support, core editing and execution capabilities are all fully retained and have been optimized for performance and experience in the new version.

User asset migration

Asset type

Migration plan

Existing nodes (such as MaxCompute SQL and Shell)

One-click migration to the new UI is supported.

User-defined functions (UDFs)

Resource files (such as .jar, .py , and .txt)

Components

Ad hoc queries

4. Feature migration details

The new Data Studio reorganizes and optimizes the features of the legacy Data Studio. The following sections describe, one by one, the new location and feature changes for each feature module of the legacy Data Studio in the new IDE:

4.1 Data development module

4.1.1. Solutions

  • Legacy: A standalone "Solutions" menu for organizing and managing related workflows.

    image.png

  • New: Upgraded to "Focus Mode". You can enter Focus Mode from any file directory, providing a more immersive and flexible development experience.

    image.png

4.1.2. Workflows

  • Legacy: A fixed directory hierarchy with workflow dashboard support.

    image.png

  • New: You can create directories as needed and flexibly tag them as content from the legacy workflow directories.

    image.png

    image.png

  • New: The workflow dashboard has been upgraded to a directory dashboard. Any directory now supports the "View" capability.

    image.png

  • New: If you need workflow orchestration capabilities, use the "scheduled workflow" feature in the new Data Studio.

    image.png

4.1.3. Node development

  • Legacy: Only available node types were displayed. You could not see other node types supported by DataWorks or write development code for other node types before associating a compute engine.

    image.png

  • New: Integrated into the project directory in Data Studio. You can see all node types and write code before associating a compute engine. The node creation interaction is being continuously optimized for a better node creation and coding experience.

    image.png

4.1.3.1 Run / Run with parameters
  • Legacy: "Run" and "Run with Parameters" were two separate buttons. When you clicked Run with Parameters, you selected the resource group and custom parameters.

    image.png

  • New:

    • The legacy "Run" and "Run with Parameters" buttons are consolidated into a single "Run" button in the new version.

    • The legacy "Engine Instance" is split into data source and compute resource depending on the node type:

      • Data source: Used for metadata suggestions in the intelligent code editor.

      • Compute resource: Used to determine the compute resource for code debugging.

    • The legacy "Resource Group" in the dialog is moved to "Resource Group" in the run configuration.

    • The legacy "Custom Parameters" in the "Run with Parameters" dialog is moved to "Parameters" in the run configuration.

    • The "Debug Configuration" panel is always visible on the right side of the node. The compute resource, resource group, and parameters required for each code run are retrieved from the run configuration.

    image.png

4.1.3.2 Smoke testing / View smoke testing records
  • Legacy: Smoke testing could only be initiated after a successful submission.

    image.png

    image.png

  • New:

    • Standard mode: Smoke testing can be initiated only after a successful deployment to the development environment (equivalent to a successful submission in the legacy version).

    • Basic mode: Smoke testing can be initiated only after a successful deployment to the production environment (equivalent to a successful submission in the legacy version).

    image.png

4.1.3.3 Code review / View code review records
  • Legacy: The relevant buttons were directly visible on the node toolbar.

    image.png

  • New: During the deployment process, you can initiate a code review in the production checker step. In the left-side directory tree, click to view the code review list.

    image.png

    image.png

4.1.3.4 Submit / Submit and allow others to edit
  • Legacy:

    image.png

    image.png

  • New: The legacy "Submit" action is changed to "Deploy" in the new version. The new version allows authorized users to deploy directly. In standard mode, tasks can also be packaged and deployed through Deployment Center.

    image.png

    image.png

    image.png

    image.png

4.1.4. Table development

  • Legacy: Tables were created in Data Studio.

    image.png

  • New: Upgraded to a full-featured "Data Catalog" management experience. You can create tables in Data Catalog with form-based visual creation, code-based creation, or AI-assisted creation.

    image.png

4.1.5. Resource development

  • Legacy: Resources were created in Data Studio.

    image.png

  • New: Consolidated into the "Resource Management" module.

    image.png

4.1.6. Function development

  • Legacy: Functions were created in Data Studio.

    image.png

  • New: Consolidated into the "Resource Management" module.

    image.png


4.2 Component management module

  • Legacy:

    image.png

  • New:

    image.png


4.3 Manual task module

  • Legacy:

    image.png

  • New:

    image.png


4.4 Manual workflow module

  • Legacy:

    image.png

  • New:

    image.png

  • For more complete manual workflow capabilities, use: Project Directory > Workflow > Trigger-based Workflow.

    image.png

    image.png


4.5 Ad hoc query module

  • Legacy: Ad hoc query files of all users in the current workspace.

    image.png

  • New:

    • Ad hoc query files of the current user across all workspaces in the current region.

    • Used only for code debugging. When you confirm that a query needs to be deployed as a production task, you can submit it to the project directory, configure the schedule settings, and then deploy it.

    image.png


4.6 Table management module

  • Legacy:

    image.png

  • New: Upgraded to a full-featured "Data Catalog" management experience. You can create tables in Data Catalog with form-based visual creation, code-based creation, or AI-assisted creation.

    image.png


4.7 Public table module

  • Legacy:

    image.png

  • New: Upgraded to a full-featured "Data Catalog" management experience.

    image.png


4.8 Function list module

  • Legacy:

    image.png

  • New: You can now ask questions about function usage directly in Copilot.

    In the future product roadmap, a Function Management module will be added to manage and create engine-native functions and user-defined functions.

    image.png


4.9 Pre-check module

  • Legacy:

    image.png

  • New:

    image.png


4.10 Run history module

  • Legacy:

    image.png

  • New:

    image.png


4.11 Smoke testing records module

  • Legacy:

    image.png

  • New:

    image.png


4.12 Compute resource module

  • Legacy:

    image.png

  • New:

    image.png


4.13 Settings module

  • Legacy:

    image.png

  • New: Theme switching.

    image.png

  • New: Other settings.

    image.png

  • New: More settings to explore.

    image.png


4.14 Recycle bin module

  • Legacy:

    image.png

  • New:

    image.png

5. Upgrade guide

5.1 Upgrade permissions

  • Only users with permissions equivalent to Workspace Administrator can view and perform the upgrade operation.

5.2 Upgrade procedure

5.2.1 Access the upgrade entry

  • Located in the top navigation bar of the DataStudio (data studio) main interface.

  • A blue "Upgrade to New Version" button next to the workspace selector.

  • Only users with the required permissions can see this button.image.png

5.2.2 Confirm upgrade notes

After you click the "Upgrade to New Version" button, the system displays the upgrade introduction page.

image.png

Downgrade is not supported after the upgrade. Confirm the following before you proceed. If you need assistance during the upgrade, join the DataWorks Data Studio Upgrade Support Group.

Important
  1. If you have used the legacy DataStudio OpenAPI or need to use Migration Assistant, contact the on-duty engineer in the DataWorks Data Studio Upgrade Support Group before upgrading.

  2. Cross-workspace task deployment between the new and legacy Data Studio is not supported between workspaces on the current version and workspaces that have been upgraded to the new version.

  3. During the upgrade, neither the new Data Studio nor the legacy DataStudio in the current workspace can create new content or modify existing content (including UI operations and OpenAPI operations).

  4. The upgrade takes some time. Perform the upgrade during an off-peak period for task development.

  5. After the upgrade, the workspace name and ID remain unchanged.

  6. After the upgrade, the legacy DataStudio becomes read-only. Code changes made in the new Data Studio are not synced back to the legacy DataStudio.

  7. After the upgrade, in the personal directory of the new Data Studio, only the ad hoc query files migrated from the legacy DataStudio for which you are the owner are visible.

5.2.3 Perform the upgrade

  • When the current upgrade status is "Upgrade not started for the current workspace", click the "Start Upgrade" button to start the upgrade process.

  • After you click the "Start Upgrade" button, the system displays the estimated upgrade time and the number of objects to be migrated for the current workspace.

  • Click the "Confirm" button to officially start the upgrade.

    image.png

  • You can view the progress and status in real time during the upgrade.

    image.png

  • During the upgrade, click "Upgrade Details" to view the upgrade details in real time.

    image.png

  • During the upgrade, click "Refresh" to view the latest upgrade status.

    image.png

  • After the upgrade is complete, click "Go to Data Studio" to start using the new Data Studio.

    image.png

6. Upgrade assurance

6.1 Dedicated upgrade services

  • Online help: Detailed operational guidance is available during the upgrade process.

  • Technical consultation: If you encounter any issues, contact the on-duty engineer in the DataWorks Data Studio Upgrade Support Group. On-site support can be provided if needed.

  • Emergency response: A rapid response mechanism is available if any issues occur during the upgrade.

  • Product training: Post-upgrade feature training can be provided if needed.

6.2 Rollback mechanism

If you encounter any issues during the upgrade, contact the on-duty engineer in the DataWorks Data Studio Upgrade Support Group to request a rollback to the pre-upgrade state.