If your business workloads demand high concurrency, scalability, and availability, or require complex searches and big data analysis, and your current database architecture is inadequate or too costly to upgrade, use DataWorks Data Integration to migrate data from your existing databases to Tablestore. DataWorks Data Integration can also migrate Tablestore data across different instances and accounts, or to OSS and MaxCompute for backup and analysis.
Use cases
DataWorks Data Integration is a stable, efficient, and scalable data synchronization platform. It supports data migration and data synchronization between various heterogeneous data sources, such as MySQL, Oracle, MaxCompute, and Tablestore.
You can use DataWorks Data Integration for various Tablestore data migration scenarios. These include migrating data from databases to Tablestore, synchronizing Tablestore data across instances or accounts, and migrating Tablestore data to OSS or MaxCompute.
Database to Tablestore
DataWorks Data Integration lets you migrate data from various heterogeneous data sources to Tablestore.
For more information, see Supported data source types, Reader plug-ins, and Writer plug-ins.
Across instances or accounts
You can copy data from Tablestore data tables or time series tables by configuring the appropriate Reader and Writer plug-ins in DataWorks. The following table describes the relevant plug-ins.
Plug-in | Description |
OTSReader | Reads data from Tablestore tables. You can also specify a data range for incremental data extraction. |
OTSStreamReader | Incrementally exports data from Tablestore tables. |
OTSWriter | Writes data to Tablestore. |
To OSS or MaxCompute
You can migrate data from Tablestore to OSS or MaxCompute.
MaxCompute is a fast and fully managed data warehouse service that processes data at the terabyte or petabyte scale. You can use MaxCompute to back up data from Tablestore or migrate data to MaxCompute for processing.
OSS is a highly secure, cost-effective, and reliable cloud storage service that can store massive amounts of data. You can use OSS to back up data from Tablestore or synchronize data to OSS and then download the data as files to a local computer.
Migration solutions
Use DataWorks Data Integration to migrate data between Tablestore and other data sources.
Data import solutions allow you to synchronize data from sources like MySQL, Oracle, Kafka, HBase, MaxCompute, and PolarDB-X 2.0 to Tablestore. You can also synchronize data between Tablestore data tables or time series tables.
Data export solutions allow you to synchronize data from Tablestore to MaxCompute and OSS.
Import data
Migration solution | Description |
Synchronize MySQL data to Tablestore | You can migrate data from MySQL databases to Tablestore data tables only. This process uses the MySQL Reader and Tablestore Writer scripts. Configure the data sources as follows:
|
Synchronize Oracle data to Tablestore | You can migrate data from Oracle databases to Tablestore data tables only. This process uses the Oracle Reader and Tablestore Writer scripts. Configure the data sources as follows:
|
Synchronize Kafka data to Tablestore | You can migrate data from Kafka to Tablestore data tables or time series tables. Important
This process uses the Kafka Reader and Tablestore Writer scripts. Configure the data sources as follows:
|
Synchronize HBase data to Tablestore | You can migrate data from HBase databases to Tablestore data tables only. This process uses the HBase Reader and Tablestore Writer scripts. Configure the data sources as follows:
|
Synchronize MaxCompute data to Tablestore | You can migrate data from MaxCompute to Tablestore data tables only. This process uses the MaxCompute Reader and Tablestore Writer scripts. Configure the data sources as follows:
|
Synchronize PolarDB-X 2.0 data to Tablestore | You can migrate data from PolarDB-X 2.0 to Tablestore data tables only. This process uses the PolarDB-X 2.0 Reader and Tablestore Writer scripts. Configure the data sources as follows:
|
Synchronize data between Tablestore data tables | You can migrate data between Tablestore data tables. This process uses the Tablestore Reader and Writer scripts. For details about data source configuration, see Tablestore data source. When configuring the scripts, refer to the instructions for reading and writing wide table data. |
Synchronize data between Tablestore time series tables | You can migrate data between Tablestore time series tables. This process uses the Tablestore Reader and Writer scripts. For details about data source configuration, see Tablestore data source. When configuring the scripts, refer to the instructions for reading and writing time series data. |
Export data
Migration solution | Description |
Synchronize Tablestore data to MaxCompute | You can use MaxCompute to back up Tablestore data or migrate the data to MaxCompute. This process uses the Tablestore Reader and MaxCompute Writer scripts. Configure the data sources as follows:
|
Synchronize Tablestore data to OSS | You can download files synchronized to OSS at any time or store them in OSS as backups. This process uses the Tablestore Reader and OSS Writer scripts. Configure the data sources as follows:
|
Prerequisites
After selecting a migration solution, complete the following prerequisites:
Ensure network connectivity between DataWorks and both the source and destination.
For the source, confirm its version, prepare the required account, and configure the necessary permissions and service-specific settings. For more information, see the source documentation.
For the destination, activate the service and create the required resources. For more information, see the destination documentation.
Usage notes
If you encounter any issues, submit a ticket.
Ensure that DataWorks Data Integration supports data migration for your specific product version.
The data types in the source and destination data sources must match. Otherwise, the migration will result in dirty data.
After you select a migration solution, carefully review the limits and usage notes for your source and destination data sources.
Before you migrate Kafka data, choose the Table Store data model that best fits your business scenario.
Configuration process
Based on your migration solution, follow this process to migrate data using DataWorks Data Integration.
The following table describes the steps in the process.
No. | Step | Description |
1 | Create the required data sources based on your migration solution.
| |
2 | DataWorks Data Integration provides the codeless UI and step-by-step guidance. You can enter settings in a visual interface and follow prompts to configure a batch synchronization task. The codeless UI is easy to learn but lacks some advanced features. | |
3 | Verify the migration results | View the data in the destination data source based on your migration solution.
|
Configuration example
Data import
Use DataWorks Data Integration to synchronize data from databases like MySQL, Oracle, and MaxCompute to Tablestore data tables, or to synchronize Tablestore data across accounts or instances. Examples include synchronizing data from one data table to another.
This topic shows how to use the codeless UI to synchronize data from MaxCompute to a Tablestore data table.
Prerequisites
Step 1: Add a Tablestore data source and a MaxCompute data source
Step 2: Configure batch synchronization
Step 3: View the synchronization results
Data export
You can use DataWorks Data Integration to export data from Tablestore to MaxCompute or OSS.
Sync data from Tablestore to OSS
Billing
Using a migration tool to access Tablestore incurs charges for data reads and writes. Once the data is written, Tablestore also charges storage fees based on the data volume. For more information about billing, see billing overview.
The DataWorks billing model consists of software fees and resource fees. For more information, see billing introduction.
Other solutions
You can download Tablestore data to a local file.
You can also use migration tools such as DataX and Tunnel Service to import data.
Migration tool | Description |
DataX abstracts data synchronization from various sources by using a Reader plugin to read from the source and a Writer plugin to write to the destination. | |
Tunnel Service is an integrated service for consuming full and incremental data that is built on the Tablestore data API. By creating a data channel for a data table, you can easily process historical and new data from the table. This service is ideal for migrating and synchronizing data from a Tablestore data table. For more information, see Synchronize data from one data table to another. | |
Data Transmission Service (DTS) is a real-time data streaming service provided by Alibaba Cloud. It supports data interaction between data sources such as relational databases (RDBMS), NoSQL databases, and online analytical processing (OLAP) systems. DTS integrates data synchronization, migration, subscription, integration, and processing to help you build a secure, scalable, and highly available data architecture. For more information, see Synchronize PolarDB-X 2.0 data to Tablestore and Migrate PolarDB-X 2.0 data to Tablestore. |
Field type mappings
This appendix lists the field type mappings between common services and Tablestore. Use these mappings to configure field mappings.
MaxCompute and Tablestore
MaxCompute type | Tablestore type |
STRING | STRING |
BIGINT | INTEGER |
DOUBLE | DOUBLE |
BOOLEAN | BOOLEAN |
BINARY | BINARY |
MySQL and Tablestore
MySQL type | Tablestore type |
STRING | STRING |
INT, INTEGER | INTEGER |
DOUBLE, FLOAT, DECIMAL | DOUBLE |
BOOL, BOOLEAN | BOOLEAN |
BINARY | BINARY |
Kafka and Tablestore Field Type Mapping
Kafka type | Tablestore type |
STRING | STRING |
INT8, INT16, INT32, INT64 | INTEGER |
FLOAT32, FLOAT64 | DOUBLE |
BOOLEAN | BOOLEAN |
BYTES | BINARY |


icon to the right of Workspace Directories, and select