DataWorks allows you to migrate tasks from open source scheduling engines such as Oozie and Azkaban to DataWorks. This topic describes the requirements for the exported files.
Background information
Export a task from Oozie
- Requirements
The package must contain XML-formatted definition files and configuration items of a flow task. The package is exported in the ZIP format.
- Structure
Oozie task descriptions are saved in a HDFS directory. For example, each subdirectory under the apps directory in the Examples package at the Apache Oozie official website is a flow task of Oozie. Each subdirectory contains XML-formatted definition files and configuration items of a flow task.
Export a task from Azkaban
You can download a specific flow task in the Azkaban console.
Export a task of another open source engine
DataWorks provides a standard template for you to export tasks of open source engines except Oozie and Azkaba. Before you run an export task, you must download the standard template and modify the content based on the file structure in the template. You can go to the Open Source engine export page to download the standard template and view the file structure.
- Go to the DataStudio page.
- Log on to the DataWorks console.
- In the left-side navigation pane, click Workspaces.
- In the top navigation bar, select the region where your workspace resides, find the workspace, and then click Data Analytics in the Actions column.
- Click the
icon in the upper-left corner. Then, choose .
- In the left-side navigation pane, choose Open Source engine export scheme selection page. to go to the
- Click the Standard Template tab.
- On the Standard Template tab, click standard format Template to download the template.
- Modify the content in the template to generate a package to be exported.