To develop and manage Hologres tasks in DataWorks, you must first associate your Hologres instance as a Hologres computing resource. After the resource is associated, you can use it in DataWorks to connect to the Hologres instance. This lets you perform operations such as data synchronization, data development, and data analysis.
Prerequisites
Purchase a Hologres instance and create a database.
ImportantWhen you purchase a Hologres instance, ensure that it is in the same region as the DataWorks workspace. Otherwise, the instance cannot be associated.
Before you associate the instance, confirm that the Running Status of the Hologres instance is Normal.
A DataWorks workspace is created. The RAM user who performs this operation must be added to the workspace and assigned the Workspace Administrator role.
A resource group is associated to the workspace and network connectivity is established.
If you use a Serverless resource group, ensure that the Hologres computing resource can connect to the Serverless resource group.
If you use a legacy exclusive resource group, ensure that the Hologres computing resource can connect to the exclusive resource group for integration, exclusive resource group for scheduling, and exclusive service resource group, depending on the scenario.
Limits
Feature limits: If you enable SSL authentication when you create a computing resource, the resource cannot be used for data development or periodic scheduling tasks.
Region limits: China (Hangzhou), China (Shanghai), China (Beijing), China (Shenzhen), Japan (Tokyo), Malaysia (Kuala Lumpur), and Indonesia (Jakarta).
Permission limits:
Operator
Required permissions
Alibaba Cloud account
No extra permissions are required.
Alibaba Cloud RAM user/RAM role
DataWorks management permissions: Only workspace members with the O&M or Workspace Administrator role, or members with the
AliyunDataWorksFullAccesspermission can create computing resources. For more information, see Grant the Workspace Administrator permission to a user.Hologres service permissions:
Grant the
AliyunHologresFullAccessaccess policy. Otherwise, the Hologres computing resource cannot be configured.Add the account or role information to User Management. Otherwise, the destination database cannot be configured.
DataStudio: Associate a Hologres computing resource
Associate a Hologres computing resource to a workspace that Use Data Studio (New Version).
Go to the Computing Resource page
Log on to the DataWorks console. Switch to the destination region. In the navigation pane on the left, click . Select the target workspace from the drop-down list and click Go To Management Center.
In the navigation pane on the left, click Computing Resource to go to the Computing Resource page.
Associate a Hologres computing resource
On the Computing Resource page, configure and associate the Hologres computing resource.
Select a computing resource type to associate.
Click Associate Computing Resource to go to the Associate Computing Resource page.
On the Associate Computing Resource page, set the computing resource type to Hologres to go to the Associate Hologres Computing Resource configuration page.
Configure the Hologres computing resource.
On the Associate a Hologres Hologres Computing Resource configuration page, configure the parameters as described in the following table.
Parameter
Description
Hologres Instance
Select the Hologres instance that you created. You can also click Create in the drop-down list to purchase a Hologres instance.
Hologres Compute Group
The current Hologres instance supports compute groups. You can also click Create in the drop-down list to create a Hologres compute group.
Database Name
Select the database that you want to use. You can also click Create in the drop-down list to create a database.
Default Access Identity
Development environment: Only the Executor identity is supported.
Production environment: The Alibaba Cloud Account, Alibaba Cloud RAM User, Alibaba Cloud RAM Role, or Task Owner identity is supported.
NoteIf you are logged on with an Alibaba Cloud account: You can select all identities.
If you are logged on with a RAM user or RAM role:
If you have the AdministratorAccess access policy: You can select all identities.
If you do not have the AdministratorAccess access policy: You can select only your own identity.
Authentication Method
No Authentication: No other operations are required.
SSL Authentication: If you select this option, you must enable SSL Encryption on the Data Security page of the Hologres instance.
NoteIf SSL authentication is enabled for the Hologres instance, the instance cannot be used for data development or periodic scheduling tasks.
SSL Encryption
This parameter is required if you set Authentication Option to SSL Authentication. Only the require encryption mode is supported.
Computing Resource Instance Name
Enter a custom name for the computing resource instance. When a task runs, you can select the computing resource for the task based on the computing resource name.
Test the connectivity.
In the connection configuration section, select the resource group that DataWorks uses to run Hologres tasks and click Test Network Connectivity to ensure that the resource group can access your Hologres instance. For more information, see Overview of network connection solutions.
Click OK to complete the configuration of the Hologres computing resource.
NoteAfter the resource is associated, the system automatically creates a Hologres data source with the same name in the Data Sources list of the current workspace.
Legacy Data Studio: Associate a Hologres computing resource
You can Associate a Hologres computing resource to a workspace that has not Use Data Studio (New Version).
Go to the Computing Resource page
Go to the DataStudio page.
Log on to the DataWorks console. In the top navigation bar, select the desired region. In the left-side navigation pane, choose . On the page that appears, select the desired workspace from the drop-down list and click Go to Data Development.
In the navigation pane on the left, click the
icon to go to the Computing Resource page.
Associate a Hologres computing resource
On the Computing Resource page, configure and Associate a Hologres computing resource.
Select a computing resource type to associate.
Click Create Computing Resource to go to the Create Computing Resource page.
On the Create Computing Resource page, set the computing resource type to Hologres to go to the Create Computing Resource configuration page.
Configure the Hologres computing resource.
On the Create Computing Resource configuration page, configure the parameters as described in the following table.
Parameter
Description
Data Source Name
Enter a custom name for the computing resource.
Authentication Method
Only Alibaba Cloud Account And Alibaba Cloud RAM Role is supported.
Alibaba Cloud Account
You can select Current Alibaba Cloud Account or Other Alibaba Cloud Account.
NoteIf you select Other Alibaba Cloud Account, enter the information about the other account as prompted on the page.
Region
The region where your workspace resides.
Hologres Instance
Select the Hologres instance that you created.
Database Name
Select the database that you want to use.
Default Access Identity
Development environment: Only the Executor identity is supported.
Production environment: The Alibaba Cloud Account, Alibaba Cloud RAM User, Alibaba Cloud RAM Role, or Task Owner identity is supported.
NoteIf you are logged on with an Alibaba Cloud account: You can select all identities.
If you are logged on with a RAM user or RAM role:
If you have the AdministratorAccess access policy: You can select all identities.
If you do not have the AdministratorAccess access policy: You can select only your own identity.
Authentication Method
No Authentication: No other operations are required.
SSL Authentication: If you select this option, you must enable SSL Encryption on the Data Security page of the Hologres instance.
NoteIf SSL authentication is enabled for the Hologres instance, the instance cannot be used for data development or periodic scheduling tasks.
SSL Encryption
This parameter is required if you set Authentication Method to SSL Authentication. Only the require encryption mode is supported.
Test the connectivity.
In the connection configuration section, select the resource group that DataWorks uses to run Hologres tasks and click Test Network Connectivity to ensure that the resource group can access your Hologres instance. For more information, see Overview of network connection solutions.
Click Create and Associate Computing Resource with DataStudio to complete the configuration of the Hologres computing resource.
After the resource is associated, the system automatically creates a Hologres data source with the same name in the Data Sources list of the current workspace.
What to do next
For a smoother development experience, we recommend that you first read Instructions for using DataWorks on Hologres. This document helps you understand the development process, fees, environment preparation, and access control for using Hologres in DataWorks.
New Data Studio: After you configure the Hologres computing resource, you can use a batch synchronization node for data synchronization and Hologres-related nodes for data development in Data Studio.
Legacy DataStudio: After you configure the Hologres computing resource, you can use a node for data synchronization and legacy Hologres-related nodes for data development.
FAQ
Error message: FAILED: Build connection error! Connection to xxx.hologres.aliyuncs.com:80 refused. Check that the hostname and port are correct and that the postmaster is accepting TCP/IP connections.
Solution: Go to the Hologres console to check whether the instance is running correctly. If the instance is shut down, restart the Hologres instance.
Error message: FAILED: An I/O error occurred while sending to the backend.
Solution: On the Data Security page of the Hologres instance, check whether the SSL Encryption switch is turned on. If it is turned on, turn it off and wait 5 to 10 seconds before you run the task again.