Data integration supports offline integration, real-time integration, and whole database migration. This topic describes the data source types supported for offline integration, real-time integration, and whole database migration.
Scenarios for different integration types
Integration type | Scenarios |
Offline integration | Suitable for scenarios such as migrating data to the cloud or moving cloud data to on-premises systems. For example, you can migrate data from an on-premises MySQL database to an Alibaba Cloud ApsaraDB RDS instance. |
Whole-database migration | Suitable for synchronizing data from a data center or a self-managed database on an ECS instance to an offline data warehouse, such as Hive, or a big data computing service. For example, you can migrate data from a self-managed MySQL database on an ECS instance to MaxCompute. |
Real-time integration | Suitable for integrating data changes from an entire database or all tables in a source to a destination in real time. This keeps the source and destination data sources synchronized. |
Data sources supported for offline integration
Data source | Read | Write |
Big Data Storage Data Source | ||
MaxCompute | Supported | Supported |
Hive | Supported | Supported |
Hologres | Supported | Supported |
Impala | Supported | Supported |
TDH Inceptor | Supported | Supported |
Kudu | Supported | Supported |
StarRocks | Supported | Supported |
Hudi | Supported | Supported |
Doris | Supported | Supported |
GreenPlum | Supported | Supported |
TDengine | Supported | Not supported |
ArgoDB | Supported | Supported |
Paimon | Not supported | Not supported |
SelectDB | Supported | Supported |
Databricks | Supported | Supported |
Amazon Redshift | Supported | Supported |
DolphinDB | Supported | Supported |
Snowflake | Supported | Supported |
Data Lake Formation | Supported | Supported |
File Data Source | ||
HDFS | Supported | Supported |
FTP | Supported | Supported |
OSS | Supported | Supported |
Amazon S3 | Supported | Supported |
Message Queue Data Source | ||
Log Service | Supported | Not supported |
Kafka | Supported | Supported |
DataHub | Supported | Supported |
Relational Data Source | ||
PolarDB | Supported | Supported |
PolarDB-X (formerly DRDS) | Supported | Supported |
MySQL | Supported | Supported |
SAP HANA | Supported | Supported |
Microsoft SQL Server | Supported | Supported |
PostgreSQL | Supported | Support |
AnalyticDB for MySQL 2.0 | Supported | Not supported |
AnalyticDB for MySQL 3.0 | Supported | Supported |
AnalyticDB for PostgreSQL | Supported | Supported |
OceanBase | Supported | Supported |
Oracle | Supported | Supported |
Vertica | Supported | Supported |
IBM DB2 | Supported | Supported |
Teradata | Supported | Supported |
ClickHouse | Supported | Supported |
DM | Supported | Supported |
GBase 8a | Supported | Supported |
KingbaseES | Supported | Supported |
TiDB | Supported | Supported |
GoldenDB | Supported | Supported |
OpenGauss | Supported | Supported |
GaussDB (DWS) | Supported | Supported |
Amazon RDS for PostgreSQL | Supported | Supported |
Amazon RDS for MySQL | Supported | Supported |
Amazon RDS for SQL Server | Supported | Supported |
Amazon RDS for Oracle | Supported | Supported |
Amazon RDS for DB2 | Supported | Supported |
TDSQL for MySQL | Supported | Supported |
PolarDB-X2.0 | Supported | Supported |
GBase 8c | Supported | Supported |
NoSQL Data Source | ||
HBase0.9.4 | Not supported | Not supported |
HBase1.1x | Supported | Supported |
HBase2.0 | Supported | Supported |
Elasticsearch | Supported | Supported |
MongoDB | Supported | Supported |
Tablestore | Supported | Supported |
Aliyun HBase | Not supported | Not supported |
Redis | Supported | Not supported |
Lindorm (compute engine) | Supported | Supported |
Easysearch | Supported | Supported |
OpenSearch | Supported | Supported |
Semi-structured storage data source | ||
API | Supported | Supported |
SAP Table | Supported | Not supported |
Salesforce | Supported | Not supported |
Lark Bitable | Supported | Supported |
Data sources supported for whole-database migration
Data source type | Data source | References |
Source data source | MySQL, Oracle, Microsoft SQL Server, OceanBase, IBM DB2, MaxCompute, FTP, TDengine, Hive, PostgreSQL, DM, Amazon Redshift, Amazon RDS for PostgreSQL, Amazon RDS for MySQL, Amazon RDS for SQL Server, Amazon RDS for Oracle, Amazon RDS for DB2, TDSQL for MySQL, DolphinDB, PolarDB-X2.0, GBase 8c . | |
Destination data source | ArgoDB, Hive, TDH Inceptor, MaxCompute, AnalyticDB for PostgreSQL, StarRocks, SelectDB, Doris, GaussDB (DWS), Lindorm (compute engine), Databricks, Data Lake Formation. |
Data sources supported for real-time integration
Real-time compute engine | Data source | References | |
Apache Flink (open source Flink) | Flink on YARN |
| |
Flink on K8s |
| ||
Alibaba Cloud Realtime Compute for Flink (VVP) |
| ||