DataWorks allows you to bind an exclusive resource group for scheduling to a Virtual Private Cloud (VPC) so that the resource group can access data stores in the VPC.

Prerequisites

Before you use an exclusive resource group for scheduling to access a data store, make sure that the exclusive resource group is properly connected to the data store.

Add an exclusive resource group for scheduling

Notice
  • We recommend that you purchase an exclusive resource group for scheduling in the same zone as the data store to be accessed.
  • The exclusive resource group for scheduling must be bound to the same VPC and VSwitch as the data store to be accessed.
  • If you have purchased an exclusive resource group for scheduling in a different zone from the data store to be accessed, bind the resource group to the same VPC as the data store. Then, configure a routing rule for the resource group to access the data store.
  • An exclusive resource group for scheduling may still fail to access a data store after you bind it to the same VPC and VSwitch as the data store or configure a routing rule for it. This is because the resource group is not in the whitelist configured for the data store. In this case, add relevant information, such as the Elastic Network Interface (ENI) IP address of the VPC, to the whitelist. For more information, see Add an exclusive resource group for scheduling to the whitelist of the data store to be accessed.
  1. Log on to the DataWorks console.
  2. In the left-side navigation pane, click Resource Groups.
  3. Purchase an exclusive resource group for scheduling.
    If you have not purchased any exclusive resource groups for scheduling in the target region, follow these steps to purchase one:
    1. On the Resource Groups page, click the Exclusive Resource Groups tab and click Add Exclusive Resource Group.
    2. In the Add Exclusive Resource Group pane that appears, click Purchase next to Order Number. The purchase page appears.
      Purchase
    3. Set Region, Type, Exclusive scheduling resources, Units, and Duration.
      Note
      • In this example, set Type to Exclusive scheduling resources.
      • Exclusive resource groups cannot be shared among regions. For example, the exclusive resource groups in the China (Shanghai) region can only be used by workspaces in the China (Shanghai) region.
    4. Click Buy Now.
  4. Add the exclusive resource group for scheduling.
    1. Return to the Add Exclusive Resource Group pane and set the parameters as required.
      Parameter Description
      Resource Group Type The type of the exclusive resource group. In this example, select Exclusive Resource Groups.

      The valid values are Exclusive Resource Groups and Exclusive Resource Groups for Data Integration. The two types of resource groups are applicable to general node scheduling and data synchronization, respectively.

      Resource Group Name The name of the exclusive resource group. The name must be unique within all resource groups of a tenant.
      Note A tenant indicates an Alibaba Cloud account. Multiple Resource Access Management (RAM) users may exist under a tenant.
      Resource Group Description The description of the exclusive resource group.
      Order Number The order number of the exclusive resource group.
      Zone The zone where the exclusive resource group resides. If the exclusive resource group needs to access data stores in a VPC, select the zone where the data stores reside.

      If you have purchased an exclusive resource group for scheduling in a different zone from the data store to be accessed, bind the resource group to the same VPC as the data store. Then, configure a routing rule for the resource group to access the data store.

    2. Click Create.
      Note The exclusive resource group is initialized within 20 minutes. Wait until its status changes to Running.
  5. Bind the exclusive resource group to a VPC.
    Exclusive resource groups are deployed in a VPC managed by DataWorks. To allow exclusive resource groups to access data stores in your own VPC, you must bind the exclusive resource groups to your VPC.
    1. On the Exclusive Resource Groups tab, find the target exclusive resource group and click Add VPC Binding in the Actions column.
      Before binding the exclusive resource group to your VPC, authorize DataWorks to access your cloud resources in the RAM console.
    2. After the authorization is completed, click Add Binding.
    3. In the Add VPC Binding pane that appears, set the parameters as required.
      Note The exclusive resource group and the data stores to be accessed must be in the same zone. Select the VSwitch to which the data stores are bound.
      Parameter Description
      Resource Group Name The name of the exclusive resource group to be bound to a VPC.
      VPC The name of the target VPC to which the exclusive resource group will be bound.
      VSwitch The name of the VSwitch to which the data store to be accessed is bound.
      Security Groups The security group to be bound to the target VPC.
    4. Click Create.
    If the exclusive resource group and the data store to be accessed are bound to the same VPC but reside in different zones, follow these steps to configure a routing rule:
    1. On the Exclusive Resource Groups tab, find the exclusive resource group and click Custom routing in the Actions column.
    2. In the Custom routing dialog box that appears, click New route.
    3. In the New route dialog box that appears, set the parameters as required.New route
      Parameter Description
      Purpose type The type of the route destination. Valid values: VPC and IDC.
      Destination VPC The region and name of the destination VPC.
      Note This parameter appears only when Purpose type is set to VPC.
      Connection mode The connection mode between the exclusive resource group and the destination VPC. Valid values: Switch, Fixed IP and Specify the CIDR block.
      • If you set Connection mode to Switch, you must also set Destination Switch instance.
        Note This parameter appears only when Purpose type is set to VPC.
      • If you set Connection mode to Fixed IP, you must also set Fixed IP.
      • If you set Connection mode to Specify the CIDR block, you must also set Destination CIDR block.
      Routing Diagram The routing diagram, which cannot be changed.
    4. Click Generate route.

Add an exclusive resource group for scheduling to the whitelist of the data store to be accessed

An exclusive resource group for scheduling may still fail to access a data store though it resides in the same zone and is bound to the same VPC and VSwitch as the data store. This is because the resource group is not in the whitelist configured for the data store. In this case, add the following information to the whitelist:
  • VSwitch CIDR block or ENI IP address of the VPC to which the exclusive resource group is bound. To obtain the VSwitch CIDR block or the ENI IP address of the VPC, follow these steps:

    Log on to the DataWorks console and click Resource Groups in the left-side navigation pane. On the Resource Groups page that appears, find the exclusive resource group on the Exclusive Resource Groups tab and click Add VPC Binding in the Actions column. On the page that appears, view the values in the Switch CIDR block and Eni ip columns. The values are the VSwitch CIDR block and the ENI IP address of the VPC, respectively.

  • EIP and CIDR block of the exclusive resource group. To add these items to the whitelist, follow these steps:
    1. Log on to the DataWorks console.
    2. In the left-side navigation pane, click Resource Groups.
    3. On the Exclusive Resource Groups tab, find the target resource group and click View Information in the Actions column.
    4. In the dialog box that appears, click the value of the EIPAddress parameter. The value is automatically copied. Add the value to the whitelist of the data store. Use the same method to add the value of CIDR Blocks to the whitelist.

Manage an exclusive resource group for scheduling

  • Change the workspace to which an exclusive resource group for scheduling is bound.
    You must bind an exclusive resource group for scheduling before you can select the resource group in the workspace. You can bind an exclusive resource group for scheduling to multiple workspaces.
    1. Log on to the DataWorks console and click Resource Groups in the left-side navigation pane. On the Resource Groups page that appears, click the Exclusive Resource Groups tab, find the target resource group, and then click Change Workspace in the Actions column.
    2. In the Change Workspace dialog box that appears, select the workspaces to which the resource group will be bound.
    3. Click OK.
  • Change the exclusive resource group used to test a node on the DataStudio page.
    1. Log on to the DataWorks console and click Workspaces in the left-side navigation pane. On the Workspaces page that appears, find the target workspace and click Data Analytics in the Actions column. On the DataStudio page that appears, double-click the target node. The configuration tab of the node appears.
    2. Click the Run with Arguments icon in the toolbar.
    3. In the Arguments dialog box that appears, set Resource Group to the exclusive resource group for testing the node.
    4. Click OK.
  • Change the exclusive resource group used to schedule a node.
    You can change the exclusive resource group used to schedule a node on the Properties tab of the node or in Operation Center.
    • To change the exclusive resource group on the Properties tab, follow these steps:
      1. In a workspace, click the Icon icon in the upper-left corner, choose All Products > DataStudio, and then switch to the target workspace.
      2. On the Data Analytics tab of the DataStudio page, double-click the target workflow. On the workflow editing tab that appears, double-click the target node to go to the node editing tab.
      3. In the right-side navigation pane, click the Properties tab. In the Resource Group section, select the resource group used for node scheduling. Resource Group
    • To change the exclusive resource group in Operation Center, follow these steps:
      1. In a workspace, click the Icon icon in the upper-left corner and choose All Products > Operation Center. On the Operation Center page that appears, switch to the target workspace.
      2. In the left-side navigation pane, choose Cycle Task Maintenance > Cycle Task.
      3. On the page that appears, click the rightwards arrow in the middle of the page to show the node list. Find the target node, click More in the Actions column, and then select Change Resource Group. The Change Resource Group dialog box appears.Change the exclusive resource group
        Notice You are not allowed to change the resource group for zero load nodes or machine learning nodes.
        If you want to change the exclusive resource group for multiple nodes at a time, select the nodes on the Cycle Task page and click Change Resource Group at the bottom of the page.Change the exclusive resource group for multiple nodes at a time
      4. In the Change Resource Group dialog box that appears, select an exclusive resource group for scheduling and click OK.