Use the Data Integration feature of DataWorks to synchronize incremental and full data from Tablestore to MaxCompute.
How it works
DataWorks Data Integration can be used to synchronize large amounts of offline data. Data Integration facilitates data transmission between diverse structured and semi-structured data sources. It provides readers and writers for the supported data sources and defines a data transmission channel between the sources and destinations based on simplified data types.
When you synchronize Tablestore data to MaxCompute, you must configure a Tablestore-related Reader plugin and the MaxCompute-related Writer plugin for the offline synchronization task. The following items describe the usage notes of the plugins.
Tablestore-related Reader plugins
The Tablestore-related Reader plugin that is required varies based on the data synchronization mode that you use. The following table describes the mappings between data synchronization modes and Tablestore-related Reader plugins.
Synchronization mode
Plugins used
Plugin description
Full export
Tablestore Reader
The plugin is used to read data from Tablestore tables. You can specify the range of data that you want to extract to perform incremental extraction. For more information, see Tablestore data source.
Incremental synchronization
OTSStream Reader
The plugin is used to export data in Tablestore tables in incremental mode. For more information, see Tablestore Stream data source.
MaxCompute-related Writer plugin
DataWorks uses the MaxCompute-related Writer plugin to write data to MaxCompute, regardless of whether the full export or incremental synchronization mode is used. For more information, see MaxCompute data source.
Synchronization modes
You can export full data from Tablestore to MaxCompute at a time. For more information, see Export full data to MaxCompute.