Dataphin Data Integration - sync data between sources - Dataphin

Data Integration is Dataphin's data sync platform. It supports preprocessing and high-speed synchronization between heterogeneous data sources.

Background information

Big data adoption across industries drives demand for data integration: bulk sync task configuration, heterogeneous source connectivity, lightweight data preprocessing, and sync optimization (fault tolerance, rate limiting, concurrency).

Features

Note

If you purchased Dataphin after April 2020, the data synchronization feature has been upgraded to Data Integration.

Data Integration provides the following capabilities:

Batch efficiency: Use full database migration to generate bulk sync tasks, or one-click target table creation to sync data to MaxCompute without manual table setup. For more information, see Configure Integration Tasks Using Full Database Migration.
Data preprocessing: Use flow and transform components for scrubbing, transformation, desensitization, calculation, merging, distribution, and filtering. For more information, see Create and configure a cold migration pipeline.
Multiple developer modes: Supports both Dev-Prod and Basic modes to match your development workflow.
Logical table sync: Sync logical tables from Dataphin to a destination database.
Custom components: Build custom sync components for specific scenarios. RDBMS components connect via JDBC; non-RDBMS components require a JAR package upload.

Data Integration lets you build offline pipelines by dragging and assembling components. Full database migration supports MySQL, SQL Server, and Oracle as sources with MaxCompute as the destination.

Data Integration

Quick access (recommended)

On the Dataphin home page, click Import Data in the product usage path.

Standard access

On the Dataphin home page, choose Data Studio > Data Integration from the top menu bar.