When you activate DataWorks, the system automatically creates shared resource groups for scheduling. If the shared resource groups cannot meet your business requirements, you can purchase resource plans for shared resource groups or purchase exclusive resource groups. This topic provides an overview of shared resource groups and resource plans for shared resource groups.

Limits

  • Shared resource groups are in the shared cluster of DataWorks. All DataWorks users share the resources in the cluster. During peak hours, users may preempt resources. As a result, some nodes may not be run as expected due to insufficient resources.
  • The maximum number of resources that can be scheduled from shared resource groups is fixed. The nodes in all the workspaces of a user share the resources. If multiple nodes are run at the same time, the workspaces may preempt the resources. As a result, some nodes may not be run as expected due to insufficient resources.
  • If you want to ensure exclusive and sufficient resources for your nodes, we recommend that you purchase exclusive resource groups for Data Integration and exclusive resource groups for scheduling,or purchase resource plans for shared resource groups for scheduling.

Scenarios

Shared resource groups are automatically created when you activate DataWorks. You can use shared resource groups to perform some operations, such as analyzing data and testing nodes.

We recommend that you use shared resource groups only when the number of nodes to run is small and the requirement for data output timeliness is low.

Billing methods

You are charged based on items such as the instances in shared resource groups and the data synchronization processes that are used. Shared resource groups support the pay-as-you-go billing method. You can also purchase subscription resource plans for shared resource groups, such as subscription resource plans for shared resource groups for scheduling. For more information, see the following topics:

Network connection solutions

A DataWorks resource group is a group of Alibaba Cloud Elastic Compute Service (ECS) instances. To run a node, such as a Data Integration node or a data analytics node, you must make sure that resource groups and data sources are connected. In addition, you must make sure that special security settings, such as whitelists, do not affect the connections between resource groups and data sources.

  • Network connectivity
    • Shared resource groups for scheduling
      Shared resource groups for scheduling can be connected to data sources within Alibaba Cloud. The following data sources are not supported:
      • Data sources that are deployed on the Internet and configured with whitelists to block access from unknown IP addresses.
      • Data sources that are deployed in the virtual private clouds (VPCs) of Alibaba Cloud.
      Note For nodes that must access the Internet, we recommend that you use exclusive resource groups.
  • Whitelist settings

    Shared resource groups and resource plans for shared resource groups provide the security sandbox feature for nodes. This feature can be used to block access to the resource groups from unknown IP addresses. If you want to access the resource groups, you can add the IP address that you use to the whitelist of the security sandbox. For more information, see Configure security settings.