All Products
Search
Document Center

Create OSS schemas by using the wizard

Last Updated: Oct 11, 2019

This topic describes how to synchronize ApsaraDB for RDS (RDS) data or data in a user-created ECS-based database to Object Storage Service (OSS) by using the wizard. That is, to create an OSS data warehouse (schema).

Procedure

  1. Log on to the Data Lake Analytics console.

  2. In the upper-left corner of the page, select the region where your Data Lake Analytics (DLA) service is located.

  3. In the left-side navigation pane, click Schema.

  4. On the Schema page, click Create Schema.

  5. On the Popular tab of the New Schema page, click Create By Wizard in the One Click Data Warehouse section.

    Create by Wizard

  6. Authorize DLA to access OSS and RDS. After authorization, click Next.

  7. Configure the RDS data source and OSS warehouse for storing data as prompted.

    You can synchronize RDS data or data in a user-created ECS-based database to OSS based on the actual data storage mode. This example shows how to synchronize RDS data to OSS.

    Set up the source data for interconnection

    Category Parameter Description
    Cloud RDS Type The type of the source data, which is RDS.

    Click the radio button for an RDS instance to add the instance to Source Data.

    Database Name The explanatory name of the RDS instance.
    Instance ID The ID of the RDS instance. The system automatically pulls RDS instances in the same region as DLA.

    You can search RDS instances by using fuzzy match.

    Self-built database ECS ID The ID of the ECS instance where your user-created database is located.
    VPC ID The ID of the VPC where your ECS instance is located.
    Engine The type of the user-created ECS-based database.
    Source Data Server The RDS data source of the one-click DW task, which is selected from the RDS instance list on the left.
    Port The connection port of the RDS instance, which is always port 3306.
    User Name The username for the RDS instance.
    Password The password for the username.
    Schema Name The database name of the RDS instance.

    After configuring the source data, click Test Connection to test connectivity.

    Position opening configuration Schema Name The schema name, which is the RDS database name mapped in DLA.
    Location The detailed address for storing RDS data in OSS when you create the data warehouse.

    The system automatically pulls the OSS bucket in the same region as DLA. Click Select Location, where you can select buckets and objects based on your business needs.

    To use the one-click DW function, DLA must have the permission to delete OSS data so that it can perform the Extract, Transform and Load (ETL) operations. For more information about authorization, see the topic Authorize DLA to delete OSS files.

    Sync Time The time for synchronizing RDS data to OSS.

    The default data synchronization time is 02:00 a.m. Alternatively, you can change the data synchronization time to off-peak hours based on business conditions, to minimize impact on the business during synchronization.

    Advanced Options The user-defined items, such as filtering fields. For more information, see Advanced options.
  8. After setting the preceding parameters, click Create to create the OSS data warehouse.

    Note: After the data warehouse is created, DLA automatically synchronizes RDS data to OSS at the preset synchronization time. In addition, a same table structure in the RDS instance is created in the OSS bucket, and a corresponding OSS table is created in DLA.