The data catalog feature of DataWorks supports data synchronization from a single MaxCompute table to Hologres to help you efficiently analyze and query big data in real time. This topic describes how to synchronize data from MaxCompute to Hologres. After data synchronization, you can fully utilize the high-performance query capabilities of Hologres.
Background information
When you use the data catalog feature to synchronize data from a MaxCompute internal table to a Hologres internal table, the MaxCompute table is first mapped to a Hologres foreign table, and the data is then synchronized from the foreign table to a Hologres internal table. The foreign table is created by executing the IMPORT FOREIGN SCHEMA statement.
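The two-stage flow described above can be illustrated with a minimal sketch that only renders the SQL text for each stage. It is not the statement that DataWorks generates. The function name, the example project and table names, the public schema, and the _holo suffix are hypothetical. The sketch also assumes that the Hologres internal table already exists with columns that match the source table, and that the built-in odps_server foreign server is used.

```python
import re

# Accept only plain SQL identifiers so that the rendered statements stay safe.
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")


def _check(name: str) -> str:
    """Return the name unchanged, or raise ValueError if it is not a plain identifier."""
    if not _IDENTIFIER.match(name):
        raise ValueError(f"unsafe identifier: {name!r}")
    return name


def build_sync_statements(mc_project: str, mc_table: str, holo_schema: str = "public") -> list:
    """Render the two stages: create the foreign table, then copy its rows.

    All names are illustrative assumptions, not values taken from DataWorks.
    """
    project = _check(mc_project)
    table = _check(mc_table)
    schema = _check(holo_schema)
    internal_table = f"{table}_holo"  # hypothetical name for the internal table

    # Stage 1: map the MaxCompute table to a Hologres foreign table.
    import_stmt = (
        f"IMPORT FOREIGN SCHEMA {project} LIMIT TO ({table}) "
        f"FROM SERVER odps_server INTO {schema} "
        f"OPTIONS (if_table_exist 'update');"
    )
    # Stage 2: copy rows from the foreign table to the internal table.
    copy_stmt = f"INSERT INTO {schema}.{internal_table} SELECT * FROM {schema}.{table};"
    return [import_stmt, copy_stmt]


if __name__ == "__main__":
    for statement in build_sync_statements("my_project", "orders"):
        print(statement)
```

The statements are returned as text so that they can be reviewed before they are run in a Hologres SQL client. The synchronization node performs the equivalent work for you, so this sketch is only a way to reason about what happens at each stage.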
Prerequisites
A MaxCompute project and a Hologres instance are created. For information about how to create a MaxCompute project and a Hologres instance, see Create a MaxCompute project and Purchase a Hologres instance.
The MaxCompute project and the Hologres instance are associated with the workspace as computing resources, and both computing resources have passed the network connectivity test. For more information, see Associate a computing resource with a workspace (Participate in Public Preview of Data Studio turned on).
Limits
You can create a foreign table and read data from the foreign table only if a source MaxCompute table exists.
Entry point for the feature
To open the configuration tab on which you configure data synchronization from MaxCompute to Hologres, perform the following steps:
Go to the Workspaces page in the DataWorks console. In the top navigation bar, select the desired region. Find the desired workspace and go to Data Studio from the Actions column.
In the left-side navigation pane of the Data Studio page, click the icon that opens the DATA CATALOG pane.
In the MaxCompute directory, find the internal table from which you want to synchronize data in the Table folder, right-click the table name, and then select Data Synchronization to Hologres.
In the popover that appears, select the path in which you want to create the node, and click OK.
In the popover that appears, enter a name for the node and press Enter to go to the configuration tab of the node.
Configure the node
On the configuration tab of the node, configure the parameters. For more information, see Node for synchronizing data to Hologres.
The system automatically configures parameters related to the source MaxCompute table based on the internal table that you select. You do not need to manually configure parameters in the Settings for Source Table (MaxCompute) section.