Dataphin: Step 7: Data backfill for operations

Last Updated: Jan 21, 2025

This topic describes how to backfill data for the logical dimension tables, logical fact tables, logical aggregate tables, and pipeline tasks used in this tutorial.

Background information

Data backfill is required for the three integration tasks (product table, customer table, and order table) and for the detail and aggregate tables dim_customer, dim_products, fct_order_buy_di, and dws_all. The procedure is the same for every integration task and the same for every detail or aggregate table, so this topic walks through one integration task and the dim_products logical table as examples.
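This topic does not prescribe an order for the seven backfills. As a rough planning aid, the sketch below lists them so that upstream tasks come before the tables that presumably read from them. The dependency edges in `ASSUMED_UPSTREAM` are assumptions inferred from the table names, not something stated here, so check them against your own project lineage before relying on the result.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Maps each task to the set of tasks assumed to run before it.
# ASSUMPTION: these edges are guessed from the names used in this tutorial.
ASSUMED_UPSTREAM = {
    "customer table integration": set(),
    "product table integration": set(),
    "order table integration": set(),
    "dim_customer": {"customer table integration"},
    "dim_products": {"product table integration"},
    "fct_order_buy_di": {"order table integration"},
    "dws_all": {"dim_customer", "dim_products", "fct_order_buy_di"},
}


def backfill_order(upstream):
    """Return the tasks in an order where every task follows its upstream tasks."""
    return list(TopologicalSorter(upstream).static_order())


order = backfill_order(ASSUMED_UPSTREAM)
for position, task in enumerate(order, start=1):
    print(position, task)
```

Under these assumed edges, the three integration tasks always precede the detail tables, and dws_all comes last.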

Data backfill for integration tasks

  1. Go to the Dataphin home page and click Development in the top menu bar.

  2. Refer to the instructions in the figure below to select Product Table Integration for data backfill.

    [Figure: selecting Product Table Integration for data backfill]

  3. In the Data Backfill - Current Task dialog box, set the parameters as follows:

    | Parameter | Description |
    | --- | --- |
    | Data Backfill Instance Name | Use the default configuration. |
    | Runtime | Choose Run Immediately. |
    | Data Timestamp | Opt for By Interval and retain the default configuration for the interval. |
    | Concurrent Running Groups | The default is 1 group. |
    | Data Backfill Order | Select Ascending Data Timestamp. |
    | Skip Execution For Corresponding Instances | Choose Normal Operation. |
    | Dry-run For Corresponding Instances | Choose Normal Operation. |

  4. Click OK.

  5. Monitor the runtime results of the data backfill instance.

    1. In the left-side navigation pane, select Data Backfill Instances.

    2. Runtime information will appear under Submitted Instances.
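To make the Data Timestamp, Concurrent Running Groups, and Data Backfill Order settings more concrete, here is a minimal sketch of how a date interval could be expanded into daily data timestamps and divided into groups. The function `plan_backfill` and its contiguous-split rule are hypothetical illustrations. This topic does not describe how Dataphin actually distributes instances across groups, so treat the output as a mental model only.

```python
from datetime import date, timedelta


def plan_backfill(start, end, groups=1, ascending=True):
    """Expand the inclusive interval [start, end] into daily data timestamps
    and split them into at most `groups` contiguous batches.

    ASSUMPTION: the contiguous split is illustrative, not Dataphin's documented rule.
    """
    if end < start:
        raise ValueError("end must not be earlier than start")
    if groups < 1:
        raise ValueError("groups must be at least 1")

    days = [start + timedelta(days=i) for i in range((end - start).days + 1)]
    if not ascending:
        days.reverse()  # descending data timestamp order

    # Batch sizes differ by at most one; empty batches are dropped.
    base, extra = divmod(len(days), groups)
    plan, pos = [], 0
    for g in range(groups):
        size = base + (1 if g < extra else 0)
        if size:
            plan.append(days[pos:pos + size])
        pos += size
    return plan


# Settings from this tutorial: one group, ascending order.
default_plan = plan_backfill(date(2025, 1, 1), date(2025, 1, 5))
# Same interval with two groups, for comparison.
two_groups = plan_backfill(date(2025, 1, 1), date(2025, 1, 5), groups=2)

for batch in two_groups:
    print([d.isoformat() for d in batch])
```

With the tutorial's default of 1 group, the whole interval forms a single batch that is processed from the earliest data timestamp to the latest.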

Data backfill for logical table tasks

  1. On the Dataphin home page, click Development in the top menu bar.

  2. Refer to the instructions in the figure below to select the dim_products task for data backfill.

    [Figure: selecting the dim_products task for data backfill]

  3. In the Data Backfill - Current Task dialog box, set the parameters as follows:

    | Parameter | Description |
    | --- | --- |
    | Data Backfill Instance Name | Use the default configuration. |
    | Select Field | Choose Entire Table to backfill all fields of the current logical table. |
    | Runtime | Choose Run Immediately. |
    | Data Timestamp | Opt for By Interval and retain the default configuration for the interval. |
    | Concurrent Running Groups | The default is 1 group. |
    | Data Backfill Order | Choose Ascending Data Timestamp. |
    | Skip Execution For Corresponding Instances | Choose Normal Operation. |
    | Dry-run For Corresponding Instances | Choose Normal Operation. |

  4. Click OK.

  5. Monitor the runtime results of the data backfill instance.

    1. In the left-side navigation pane, select Data Backfill Instances.

    2. Runtime information will appear under Submitted Instances.