All Products
Search
Document Center

DataWorks:Data source addition and management

Last Updated:Dec 30, 2024

Data sources of DataWorks are used to connect different data storage services. Before you configure a synchronization task, you must configure information about the data sources that you want to use in DataWorks. This way, when you configure a synchronization task, you can select the names of data sources to determine the database from which you want to read data and the database to which you want to write data. This topic describes the items that you must understand when you add data sources to DataWorks.

Items

Before you add a data source, you need to understand the items that are described in the following table.

Item

Description

Supported data source types and addition of data sources

Before you configure a synchronization task, you must add the data sources that you want to use on the Data Sources page in Management Center of the DataWorks console. Data Integration supports dozens of data source types. For information about the data source types that are supported and the addition of data sources, see Supported data source types, readers, and writers.

Note

The database from which data is read and the database to which data is written are determined based on data sources. Before you configure a synchronization task, you must add the desired databases or data warehouses to DataWorks as data sources. This way, when you configure a synchronization task, you can select the data sources to determine the database from which you want to read data and the database to which you want to write data.

Special authentication for data sources

Data Integration allows you to use third-party identity authentication mechanisms to control access to specific data sources. Before you use an authentication mechanism to perform identity authentication, you must upload the required authentication files on the Authentication File Management page of the DataWorks console. Then, you must enable third-party authentication when you add a data source. This way, only trusted applications and services can access the data source. For more information about third-party identity authentication mechanisms, see Manage third-party authentication files.

Note
  • Only Kerberos authentication is supported. For more information about how to configure Kerberos authentication, see Configure Kerberos authentication.

  • Different types of data sources support different authentication mechanisms. For more information, see the topics for adding different types of data sources.

Support for accessing specific data sources in RAM authorization mode

Data Integration allows you to use the RAM authorization mode to access specific data sources. This mode ensures higher security. You can select the RAM authorization mode and a custom RAM role when you add a data source. This way, you can use the RAM role to access the data source. For more information, see Use the RAM role-based authorization mode to add a data source.

Isolation of the same data source in the development environment and production environment

You can add the same data source separately for the development environment and production environment. This way, the data source in the development environment and that in the production environment are isolated from each other. For more information, see Isolate a data source in the development and production environments.

Note

Only workspaces in standard mode support isolation for data sources.

Access to data sources by adding host or DNS configurations

If the IP address of your data source is frequently changed or you cannot access your data source by using IP addresses, you can add host configurations for the exclusive resource group for Data Integration. For more information about how to add host configurations for an exclusive resource group for Data Integration, see the Associate the exclusive resource group for Data Integration with a VPC section in the "Create and use an exclusive resource group for Data Integration" topic.

Permission management on data sources

After the required data sources are added, you can share the data sources by managing permissions on the data sources.

  • DataWorks allows you to share a data source in a workspace with another workspace or a member in another workspace by managing permissions on the data source, and go to the workspace to view the shared data source. You can configure permission settings, such as Editable, on a data source for a workspace or member. For more information, see Manage permissions on data sources.

  • If you want to control data read and write operations on a table in a data source, you can create a request processing policy for the synchronization task that uses the data source in Approval Center. Approval Center allows the user to whom the Workspace Administrator role is assigned to use the combination of a source and a destination to specify the synchronization task on which the operation requests must be processed. For more information, see Request processing policies for Data Integration tasks.

References