The Dataphin tag platform offers audience group service capabilities, allowing for the batch export of audience groups via Data Integration to support downstream audience group analysis applications. This guide will instruct you on creating a new audience group offline service.
Limitations
To ensure high availability, the system writes data into a temporary table ({target table}_dpfx_b) when running audience group offline services. Upon completion, the original target table is renamed ({target table}_dpfx_tmp), and the temporary table takes its place. The original target table ({target table}_dpfx_tmp) is then deleted. There will be a brief period when data is unavailable between the renaming of the original target table and the renaming of the temporary table to the target table.
Prerequisites
An application has been created and associated with the offline audience group service. For specific operations, see Create Application.
Before you can create an offline audience group service, you must first establish the corresponding entity. For detailed instructions, see Create entity.
Create audience group offline service
On the Dataphin home page, select Tag > Tag Application from the top menu bar.
In the left-side navigation pane, select Service Management > Audience Group Offline Service.
On the Audience Group Offline Service page, click Create Offline Service.
On the Create Offline Service configuration page, configure the necessary parameters.
Parameter
Description
Basic Information
Task Name
Enter a name for the task. The naming convention is as follows:
Allows Chinese, English, numbers, and underscores (_), within 64 characters.
Application Selection
Select the application associated with the project.
Owner
Select the owner of the audience group offline service. You can search by keyword.
Schedule Type
Supports recurring schedule and manual schedule task types.
Manual Schedule: One-time integration. After the task is published, you can manually execute it on the task list page.
Recurring Schedule: Schedule execution according to the configured recurring schedule.
Description
Enter a brief description of the task, within 1000 characters.
Field Mapping
Audience Group Selection
Select the audience group you need to configure.
Is Tag Sync Needed
The default is No, only syncing the entity ID of the audience group. If you select Yes, you can select the tags to sync in Input Tag.
Target Data Source
Select the target data source corresponding to the audience group service. The target data source can be a MySQL, Oracle, AnalyticDB for PostgreSQL, Greenplum, openGauss data source, or project created in Dataphin.
NoteThe target data source of the project only supports projects joined by the current account (General and Tag project) and the project tenant account has write-through permission.
If there is no required data source, you can click +create Data Source to create one. For specific operations, see Data source management.
Schema
When the target data source type is openGauss, AnalyticDB for PostgreSQL, or Greenplum, you can select the Schema of the data source.
Target Table
Select the target table in the target data source. The target table for openGauss, AnalyticDB for PostgreSQL, or Greenplum data source types is under the Schema.
When cross-project safe mode is enabled, cross-project table creation is not supported. For more information, see Security settings.
If you do not have write table data permission for the production environment of the current target table, you can click Request Permission to apply for permission. For more information, see Request, renew, and return table permissions.
If there is no corresponding target table, after selecting the input tag, you can click One-click Table Creation to create the required target table.
In the table creation statement generated by the system, confirm whether the table name, field type, precision, etc., meet the requirements before clicking Create.
Both the table name and table remarks are automatically generated by the system and can be modified as needed.
The table creation statement generated by the system refers to the type of the input tag and performs preliminary transformation, which can be modified as needed.
When the target data source is a project, the system will default to generating a partitioned table. It is recommended not to adjust.
When the target data source type is AnalyticDB for PostgreSQL, selecting a partitioned table is not supported.
After selecting the input tag, you can configure the code value and code name of the exported tag. You can export the tag value or code name separately, or export both, but at least one must be exported.
When one-click table creation is performed, if the tag has a configured lookup table, you can choose to export the code name. The exported code name is
{tag code}_codename. After one-click table creation, the system will automatically map the fields.
Date Partition
Select the partition field of the target table.
If the selected target table is a partitioned table, the system will default to the first partition field of the table.
If the selected target table is a non-partitioned table, there is no need to select a date partition.
Partition Field Format
Enter the date format or select an existing date format. You can choose yyyyMMdd, yyyy-MM-dd, yyyy/MM/dd, yyyy.MM.dd.
NoteOnly when the compute engine is MaxCompute, the partition field format can be selected as yyyymmdd, yyyy-mm-dd, yyyy/mm/dd, or yyyy.mm.dd.
Loading Policy
The loading policy only supports Overwrite Policy. Under the overwrite data policy, when there is a primary key/constraint conflict, the original data will be deleted first, and then the entire new row of data will be inserted.
Mapping Field
When tag sync is not needed, select the fields to be mapped.
Same name mapping: Click Same Name Mapping to associate tags with the same name and their mapping fields.
Purge: Click
icon to purge the mapped relationships.Outputs: Select the output content of the tag with a configured lookup table. You can select tag value and code name. By default, all tag values are selected. You can select tag value and code name at the bottom of the output tag list for full selection (entire page).
Maintenance Configuration
Recurrence
The cycle for scheduling tasks within a specific time range. Supports daily scheduling. Audience group offline tasks will run according to the configured schedule time.
Click Publish to complete the creation of the audience group offline service task.
Manage audience group offline services
The audience group offline service page displays information such as task name, owner, application name, audience group name, entity, entity ID, task status, execution status, target table, schedule type, and available operations.
Hover your mouse over the target table to view its full name and the name of the data source. If the data source type is openGauss, you can also view the target table Schema.
(Optional) Filter tasks by selecting Only Mine, entering the task name, or clicking Filter to refine by Application name, Entity-Entity ID, Target source type, Task status, Execution status, or Schedule type.
In the Actions column of the audience group offline service task list, you can perform various operations.
Action item
Description
Edit
When the task status is not publishing or unpublishing, you can click
icon to edit on the Edit Offline Service page and republish. Modifying the task name, schedule type, target data source type, and loading policy is not supported.Details
Click
icon to view detailed information about the current tag offline service on the View Offline Service page. When the task status is editing, publish failed, published, or offline, you can click Edit at the bottom of the page to edit the current audience group offline service.View instance
Click
icon to view the execution instance generated by the current audience group offline service.Unpublish
For audience group offline services with task status as published or unpublish failed, you can click
unpublish icon to unpublish the current audience group offline service.Data backfill
For recurring schedule audience group offline services with task status as published, you can click
data backfill icon to perform data backfill. The default data timestamp for backfill is yesterday's (T-1) data.Run
For tasks with a schedule type of manual schedule, you can click
run icon to manually run in the Run dialog box by selecting the data timestamp.Delete
For audience group offline services with a task status of offline or publish failed, click
delete icon to delete the current audience group offline service.