DataWorks: Activate DataWorks

Last Updated: Jun 26, 2024

The first time you activate DataWorks in a region, you must purchase a specific DataWorks edition and a pay-as-you-go resource group of the new version. After you activate DataWorks, the other pay-as-you-go resources and features provided by DataWorks, such as intelligent monitoring, data quality monitoring, and APIs, are enabled by default. You are charged for the resources and features based on your actual usage. This topic describes how to activate DataWorks.

Precautions

  • Region: A DataWorks edition and resource group are purchased for a specific region. If you want to use the service capabilities that are provided by the edition and the resource group in multiple regions, you must purchase the edition and the resource group in each of those regions.

  • Related engines: The first time you activate DataWorks in a region, the system automatically activates MaxCompute (pay-as-you-go) and creates the AliyunServiceRoleForDataWorksEngine and AliyunServiceRoleForDataWorksOnEmr service-linked roles in the region, so that you can quickly try out the core scenarios of the big data platform. If you do not use MaxCompute, you are not charged for it.

Prerequisites

Before you activate DataWorks, prepare the accounts that your business requires. The following accounts are required:

  • Alibaba Cloud account: An Alibaba Cloud account has the permissions to perform all operations related to DataWorks services. For more information about how to prepare an Alibaba Cloud account, see Prepare an Alibaba Cloud account.

  • RAM user: RAM users can collaborate on data development in a workspace. You must use your Alibaba Cloud account to create a RAM user and attach the AliyunBSSOrderAccess and AliyunDataWorksFullAccess policies to the RAM user. For more information about how to prepare a RAM user and grant permissions to a RAM user, see Prepare a RAM user.
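
The policy attachment described above can also be scripted. The following is a minimal sketch, assuming that the Alibaba Cloud CLI (`aliyun`) is installed and configured with the credentials of your Alibaba Cloud account, that the RAM `AttachPolicyToUser` operation is used, and that both policies are system policies. The user name `dataworks-dev` is a hypothetical placeholder. The sketch only builds and prints the commands; it does not call any API.

```python
import shlex

# Policy names are taken from this topic.
REQUIRED_POLICIES = ["AliyunBSSOrderAccess", "AliyunDataWorksFullAccess"]


def build_attach_commands(user_name: str) -> list:
    """Build one CLI argument list per required policy.

    Assumption: the policies are attached with `aliyun ram AttachPolicyToUser`
    and are of the System policy type.
    """
    return [
        [
            "aliyun", "ram", "AttachPolicyToUser",
            "--PolicyType", "System",
            "--PolicyName", policy,
            "--UserName", user_name,
        ]
        for policy in REQUIRED_POLICIES
    ]


if __name__ == "__main__":
    # "dataworks-dev" is a placeholder RAM user name.
    for cmd in build_attach_commands("dataworks-dev"):
        print(shlex.join(cmd))
```

Review the printed commands before you run them. The console procedure in Prepare a RAM user remains the documented way to grant these permissions.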

Activate DataWorks

When you activate DataWorks, you must select the region where the required services and resources are deployed, a DataWorks edition, a subscription duration, and the virtual private cloud (VPC) with which the resources are associated. This section describes the common procedure for activating DataWorks.

  1. Go to the buy page of DataWorks.

    Go to the product page of DataWorks and click Buy Now or Advanced Edition to go to the buy page of DataWorks.

    Note

    If you have purchased a specific edition of DataWorks in the current region, you can log on to the DataWorks console and click Purchased Resources and Services or Resource Groups in the left-side navigation pane to purchase required services and resources.

  2. Configure the parameters for the services and resources that you want to purchase.

    Configure the parameters as prompted. The following table describes the parameters.

    Parameter

    Description

    Region

    Select the region in which you want to purchase an edition and a resource group of DataWorks. You can use DataWorks services in a region only after you purchase an edition and a resource group in the region.

    Note

    If you want to use the service capabilities that are provided by the edition and the resource group in multiple regions, you must purchase the edition and the resource group in each of those regions.

    DataWorks

    Select a DataWorks edition and configure the Subscription Duration parameter. DataWorks provides the following editions:

    • Basic Edition: This edition is free of charge and is suitable for students and big data beginners.

    • Standard Edition, Professional Edition, and Enterprise Edition: These are advanced editions that provide more service capabilities than Basic Edition to meet the business requirements of different users. These editions use the subscription billing method. For more information, see Billing of DataWorks editions.

    Note

    If you purchase an advanced edition in a region and do not select Auto-renewal, the advanced edition is downgraded to Basic Edition in the region after the advanced edition expires. For more information, see Feature downgrades of DataWorks advanced editions upon expiration.

    DataWorks Resource Group

    Select a resource group and configure network information.

    • Resource type: The first time you activate DataWorks in a region, the system automatically purchases a pay-as-you-go resource group of the new version. You cannot cancel the purchase operation. You can specify a resource group name based on your business requirements.

      Note
      • After you purchase a pay-as-you-go resource group of the new version, you are charged based on the amount of resources that are consumed by different operations in DataWorks, such as task development, scheduling of auto triggered tasks, data synchronization, data quality monitoring, and calling of DataService Studio APIs. For more information, see Overview of the pay-as-you-go billing method in DataWorks.

      • Resource groups of the new version support the subscription and pay-as-you-go billing methods. The features and costs of a resource group of the new version vary based on the billing method that the resource group uses. If you have purchased a specific edition of DataWorks in the current region, you can select a billing method based on your business requirements when you purchase a resource group of the new version.

    • Network configurations: Specify the VPC and vSwitch that are associated with the resource group. The VPC and vSwitch are used when you connect the resource group to a data source and perform data development operations. If you activate DataWorks for the first time in the current region, the system automatically creates a default VPC and a default vSwitch and associates the resource group with them.

      Important
      • Do not change the default configurations of the VPC and vSwitch that are created by DataWorks. If you change the configurations of the VPC and vSwitch, related tasks may fail.

      • If the default VPC and vSwitch cannot meet your business requirements, you can create a VPC and a vSwitch in the VPC console. For more information about VPC, see What is VPC?

    • Service-linked role: You must create the following service-linked role as prompted before you can purchase a resource group of the new version.

      • Role name: AliyunServiceRoleForDataWorks

      • Policy: AliyunServiceRolePolicyForDataWorks

  3. After you confirm that the configurations are correct, read the terms of service and select the check box for Terms of Service.

  4. Confirm the order.

    1. Click Confirm Order and Pay. In the Verify Resources dialog box, view the details of the order.

    2. After the resources are verified, click Next: View Price List to confirm the price of the order.

      Note

      The price list includes the fees for the DataWorks edition, the resource group of the new version, MaxCompute, and other DataWorks pay-as-you-go resources and features.

    3. In the Price List dialog box, confirm the price and click Next Step: Create Order.

    4. On the Purchase page, click Purchase.
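
The expiration note for advanced editions in step 2 can be read as a small decision rule. The following Python sketch is illustrative only: the function name and constants are hypothetical, it is not a DataWorks API, and it assumes that auto-renewal, when selected, succeeds.

```python
# Edition names are taken from this topic; the rest is a hypothetical model.
BASIC = "Basic Edition"
ADVANCED = {"Standard Edition", "Professional Edition", "Enterprise Edition"}


def edition_after_expiry(edition: str, auto_renew: bool) -> str:
    """Return the edition in effect in a region after the subscription period ends."""
    if edition != BASIC and edition not in ADVANCED:
        raise ValueError(f"unknown edition: {edition}")
    if edition in ADVANCED and not auto_renew:
        # Without Auto-renewal, an expired advanced edition is downgraded.
        return BASIC
    # Basic Edition is free of charge; an advanced edition is kept
    # under the assumption that auto-renewal succeeds.
    return edition


if __name__ == "__main__":
    print(edition_after_expiry("Standard Edition", auto_renew=False))
```

The rule applies per region, because editions are purchased per region. For the features affected by a downgrade, see Feature downgrades of DataWorks advanced editions upon expiration.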

What to do next

  • Experience the use cases

    The first time you activate DataWorks in a region, the system automatically generates a default DataWorks workspace. You can experience use cases in the default workspace. For more information, see Built-in logic of a default workspace.

  • Develop tasks

    Before you develop tasks in DataWorks, we recommend that you create a custom workspace, select a compute engine type based on your business requirements, and add a data source or register a cluster of the compute engine type to DataWorks.

    1. A workspace is the basic unit for task development and member permission management in DataWorks. All data development operations must be performed in a specific workspace. For more information about how to create a workspace, see Create a workspace.

    2. Task development in DataWorks requires a compute engine. You can add a data source of a specific compute engine type or register a cluster of a specific compute engine type to your workspace. For more information, see Add and manage data sources.

References

  • A resource group of the new version is a general-purpose resource group that can be used to run data synchronization, data computing, data scheduling, and DataService Studio tasks. For more information, see Create and use a resource group of the new version.

  • DataWorks also provides additional features and resources, such as intelligent data modeling and advanced analysis. You can enable these features based on your business requirements. For more information, see Billing overview.

  • For more information about DataWorks, see What is DataWorks?