When you activate DataWorks, the system provides you with a shared resource group for scheduling. This topic provides an overview of the shared resource group for scheduling.

Limits

  • The shared resource group is deployed in the shared cluster of DataWorks. All DataWorks tenants share the resources in the cluster. During peak hours, tenants compete for these resources. As a result, some nodes may not be run as expected due to insufficient resources.
  • The maximum number of resources that can be scheduled from the shared resource group is fixed. Nodes in all workspaces of a user share the resources in the shared resource group. If multiple nodes are run at the same time, the workspaces compete for these resources. As a result, some nodes may not be run as expected due to insufficient resources.
  • If you want to ensure exclusive and sufficient resources for nodes, we recommend that you purchase exclusive resource groups for Data Integration and exclusive resource groups for scheduling.

Scenarios

The shared resource group is automatically created when you activate DataWorks. You can use the shared resource group to perform operations, such as analyzing data and testing nodes.

We recommend that you use the shared resource group only when the number of nodes to be run is small and the requirement for data output timeliness is low.

Billing methods

You are charged based on items such as Elastic Compute Service (ECS) instances in the shared resource group and the data synchronization threads that are used. The shared resource group supports the pay-as-you-go billing method. For more information, see the billing topics for the shared resource group.

Network connection solutions

A DataWorks resource group is a group of Alibaba Cloud ECS instances. To run nodes, such as Data Integration nodes or data analytics nodes, make sure that resource groups and data sources are connected. In addition, make sure that special security settings such as whitelists do not affect the connections between resource groups and data sources.

  • Network connectivity
    • Shared resource group for scheduling
      The shared resource group for scheduling can be connected to data sources of Alibaba Cloud. The following data sources are not supported:
      • Data sources that are deployed on the Internet and configured with whitelists to limit access from unknown IP addresses
      • Data sources that are deployed in virtual private clouds (VPCs) of Alibaba Cloud
      Note: For nodes that must access the Internet, we recommend that you use exclusive resource groups.
  • Whitelist settings

    The shared resource group for scheduling provides the security sandbox feature for nodes. This feature can be used to limit access to the resource groups from unknown IP addresses. If you want to access the resource groups, you can add the IP address that you use to the whitelist of the security sandbox. For more information, see Configure security settings.
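
The connectivity rules in this section can be summarized as a small decision helper. The following Python sketch is for illustration only and does not call any DataWorks API: the DataSource class, its fields, and both functions are hypothetical names introduced here. The sketch assumes that a data source is reachable from the shared resource group unless it falls into one of the two unsupported cases listed above, and it follows the note by recommending an exclusive resource group whenever Internet access is required.

```python
from dataclasses import dataclass


@dataclass
class DataSource:
    """Hypothetical description of a data source (not a DataWorks API type)."""
    name: str
    in_vpc: bool = False                # deployed in an Alibaba Cloud VPC
    on_internet: bool = False           # reachable only over the Internet
    ip_whitelist_enabled: bool = False  # rejects requests from unknown IP addresses


def shared_group_can_connect(source: DataSource) -> bool:
    """Return False for the two unsupported cases listed in this topic."""
    if source.in_vpc:
        return False
    if source.on_internet and source.ip_whitelist_enabled:
        return False
    # Assumption: every other data source is treated as reachable.
    return True


def recommend_resource_group(source: DataSource) -> str:
    """Suggest a resource group type based on the rules and the note above."""
    if not shared_group_can_connect(source) or source.on_internet:
        return "exclusive resource group"
    return "shared resource group for scheduling"
```

For example, recommend_resource_group(DataSource("orders_db", in_vpc=True)) returns "exclusive resource group", because data sources in VPCs are not supported by the shared resource group for scheduling.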