DataWorks Data Integration supports reading from a Hadoop Distributed File System (HDFS) data source and writing to AnalyticDB for MySQL. This topic describes how to import data from HDFS into AnalyticDB for MySQL using a DataWorks batch synchronization task.
Prerequisites
Before you begin, ensure that you have:
The source HDFS data source and the destination AnalyticDB for MySQL data source, both configured in Data Source Management in DataWorks. See Data source list.
For the full list of data sources supported by batch synchronization and their configurations, see Supported data sources and synchronization solutions. For instructions on managing data sources, see Data Source Management.
A resource group of a suitable specification that is attached to your workspace. See Use a Serverless resource group for Data Integration or Use exclusive resource groups for Data Integration.
A network connection between the resource group and the data source. See Configure network connections.
Import data from HDFS to AnalyticDB for MySQL
Configure the source reader to read data from your HDFS cluster. See HDFS Reader.
Configure the destination writer to write data to AnalyticDB for MySQL. See AnalyticDB for MySQL 3.0 Writer.
Configure the batch synchronization task using the codeless UI. See Configure a batch synchronization task using the codeless UI.
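The reader, the writer, and the link between them can also be described as one job document. The following Python sketch assembles such a document as a plain dictionary and prints it as JSON. It is illustrative only: the layout (steps, stepType, parameter, order.hops) and the parameter names (defaultFS, path, fileType, fieldDelimiter, datasource, table, writeMode) are assumptions that should be checked against HDFS Reader and AnalyticDB for MySQL 3.0 Writer before use. The helper build_hdfs_to_adb_job, the NameNode address, the file path, the data source name, and the table name are hypothetical placeholders.

```python
import json


def build_hdfs_to_adb_job(default_fs, path, columns, adb_datasource, table):
    """Assemble a job description as a plain dict.

    columns is a list of (name, type) pairs, in the order the fields
    appear in the HDFS file. All key names below are assumptions; verify
    them against the reader and writer documentation.
    """
    reader = {
        "stepType": "hdfs",  # assumed reader type identifier
        "name": "Reader",
        "category": "reader",
        "parameter": {
            "defaultFS": default_fs,   # NameNode address of the HDFS cluster
            "path": path,              # file or directory to read
            "fileType": "text",        # assumed: delimited text files
            "fieldDelimiter": ",",
            "encoding": "UTF-8",
            # Text files have no column names, so fields are addressed by index.
            "column": [
                {"index": i, "type": col_type}
                for i, (_, col_type) in enumerate(columns)
            ],
        },
    }
    writer = {
        "stepType": "analyticdb_for_mysql",  # assumed writer type identifier
        "name": "Writer",
        "category": "writer",
        "parameter": {
            "datasource": adb_datasource,  # data source name registered in DataWorks
            "table": table,
            "column": [name for name, _ in columns],
            "writeMode": "insert",
        },
    }
    return {
        "type": "job",
        "steps": [reader, writer],
        # Connect the reader output to the writer input.
        "order": {"hops": [{"from": "Reader", "to": "Writer"}]},
    }


# Hypothetical placeholder values, for illustration only.
job = build_hdfs_to_adb_job(
    default_fs="hdfs://namenode.example.internal:8020",
    path="/user/hive/warehouse/orders/*",
    columns=[("order_id", "long"), ("customer", "string"), ("amount", "double")],
    adb_datasource="my_adb_mysql_source",
    table="orders",
)
print(json.dumps(job, indent=2))
```

Deriving the reader's index-based columns and the writer's named columns from one list keeps the two sides in the same order, which matters because fields are matched by position in this sketch.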