DataWorks is a powerful Platform as a Service (PaaS) service offered by Alibaba Cloud. It provides stable, efficient, and scalable data synchronization capabilities. DataWorks is designed to transmit and synchronize data in a fast and stable manner between heterogeneous data sources in complex network environments.

Data source types supported by DataWorks

DataWorks supports data synchronization for the following types of data sources:

  • Text storage, such as File Transfer Protocol (FTP) and SSH File Transfer Protocol (SFTP) servers, Object Storage Service (OSS), and multimedia files
  • Relational databases, such as Relational Database Service (RDS), PolarDB-X 1.0, MySQL, and PostgreSQL
  • NoSQL databases, such as Memcache, Redis, MongoDB, and HBase
  • Big data services, such as MaxCompute, AnalyticDB for MySQL, and Hadoop Distributed File System (HDFS)

Prerequisites

  • Test data is prepared. For more information, see General workflow to use ApsaraDB RDS for MySQL.
  • An AnalyticDB for MySQL cluster is created, a whitelist is configured, and the public endpoint of the AnalyticDB for MySQL cluster is available. For more information, see Use AnalyticDB for MySQL.
    Note DataWorks and AnalyticDB for MySQL cannot be connected over virtual private clouds (VPCs). You must apply for a public endpoint for an AnalyticDB for MySQL cluster.
  • A table is created in the destination AnalyticDB for MySQL database to store the data synchronized from ApsaraDB RDS for MySQL. For more information, see CREATE TABLE.
    Note We recommend that you import data in real time when you use DataWorks to synchronize data to AnalyticDB for MySQL. In this case, data import is efficient and simple. This method requires you to create real-time tables (fact tables) in the destination AnalyticDB for MySQL database in advance.
  • A workspace is created in the DataWorks console.

Procedure

  1. Add an ApsaraDB RDS for MySQL data source
  2. Add an AnalyticDB for MySQL V3.0 data source
  3. Configure the source and destination connections for a synchronization task