The data opening feature of DataWorks provides tables and views in various dimensions for you to collect metadata. This topic provides a list of such tables and views, and describes structures of the tables and views.

Core metrics in the rpt_v_meta_ind_table_core table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
table_uuid string The unique ID of the table.
owner_yun_acct string The Alibaba Cloud account used by the table owner.
dim_life_cycle bigint The lifecycle. Unit: days.
  • 0: indicates that no lifecycle is configured.
  • Other values: indicate specific lifecycles.
is_partition_table boolean Specifies whether the table is a partitioned table.
  • true: The table is a partitioned table.
  • false: The table is a non-partitioned table.
entity_type bigint The entity type.
  • 0: table
  • 1: view
categories string The detailed information about the categories.
last_access_time bigint The most recent time when the table was accessed. The metric value is a 10-digit UNIX timestamp.
size bigint The size of the table, which indicates the logical storage space occupied by data in the table. Unit: bytes. The volume of data stored in a view is NULL.
column_count bigint The number of fields in the table. Partition key columns are included.
partition_count bigint The number of partitions in the table. This metric is set to NULL for a non-partitioned table.
detail_view_count bigint The number of times table details are viewed on the page.
favorite_count bigint The number of times the table is added to favorites.

Additional metrics in the rpt_v_meta_ind_table_extra table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
table_uuid string The unique ID of the table.
read_count bigint The number of times data is read by using SQL statements. The data includes that of non-scheduled tasks.
read_count_30d bigint The number of times data is read within 30 days by using SQL statements. The data includes that of non-scheduled tasks.
write_count bigint The number of times data is written by using SQL statements. The data includes that of non-scheduled tasks.
join_count bigint The number of times the table is joined.
direct_upstream_count bigint The number of parent tables in the lineage.
direct_downstream_count bigint The number of child tables in the lineage.
output_task_count bigint The number of tasks that generate the current table.

Metrics in the raw_v_meta_database table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
env_type bigint The environment type.
  • 0: development environment
  • 1: production environment
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
database_comment string The description of the database or MaxCompute project.
owner_name string The name of the owner.
created_time_ts bigint The creation time, which is a 13-digit timestamp.
last_modified_time_ts bigint The most recent modification time, which is a 13-digit timestamp.
location string The storage path of the database.
extras string The additional attributes of the database, which are JSON strings.
If the table preview and table visibility range attributes are configured for a MaxCompute project, you can use the allowDataPreview and projectVisibility keys to obtain the values of the attributes.
  • allowDataPreview: specifies whether tables in a MaxCompute project can be previewed.
    • true: Tables in a MaxCompute project can be previewed.
    • Other values or NULL: Tables in a MaxCompute project cannot be previewed.
  • projectVisibility: specifies the visible range of tables in a MaxCompute project.
    • 0: hidden. Tables are visible only for table owners, project administrators, and project owners.
    • 1: visible for tenants.
    • 2: visible for project members.
biz_date string The business date.

Metrics in the raw_v_meta_table table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id string The ID of the DataWorks workspace.
table_uuid string The unique ID of the table.
table_name string The name of the table.
table_type string The type of the table.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
partition_keys string The partition keys in the table. Multi-level partitions are separated by commas (,). This metric is set to an empty string for a non-partitioned table.
table_comment string The description of the table.
table_biz_comment string The description of business in the table.
visibility_scope bigint The visible range of the table.
  • 0: hidden. Tables are visible only for table owners, project administrators, and project owners.
  • 1: visible for tenants.
  • 2: visible for project members.
owner_name string The name of the owner.
created_time_ts bigint The creation time, which is a 13-digit timestamp.
last_modified_time_ts bigint The most recent time when data was modified. The metric value is a 13-digit timestamp.
last_meta_modified_time_ts bigint The most recent time when table metadata was modified. The metric value is a 13-digit timestamp.
location string The storage path of the table.
life_cycle bigint The lifecycle of the day. Unit: days.
data_size bigint The logical storage volume of the table. Unit: bytes. If the table is a partitioned table, this metric is set to NULL. You must collect statistics on the storage volume based on the partition list.
biz_date string The business date.

Metrics in the raw_v_meta_view table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id string The ID of the DataWorks workspace.
table_uuid string The unique ID of the table.
table_name string The name of the table.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_comment string The description of the table.
table_biz_comment string The description of business in the table.
visibility_scope bigint The visible range of the table.
  • 0: hidden. Tables are visible only for table owners, project administrators, and project owners.
  • 1: visible for tenants.
  • 2: visible for project members.
owner_name string The name of the owner.
created_time_ts bigint The creation time, which is a 13-digit timestamp.
last_ddl_time_ts bigint The most recent time when data was modified by using DDL statements. The metric value is a 13-digit timestamp.
view_text string The SQL statement that is used to create a view.
biz_date string The business date.

Metrics in the raw_v_meta_column table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
column_name string The name of the field.
column_comment string The description of the field.
column_biz_comment string The business description of the field.
column_type string The field type.
column_sequence bigint The field sequence, which starts from 1.
is_partition_key boolean Specifies whether the key is a partition key.
is_primary_key boolean Specifies whether the key is a primary key.
biz_date string The business date.

Metrics in the raw_v_meta_partition table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
partition_name string The name of the partition.
size bigint The logical size of the partition. Unit: bytes.
record_number bigint The number of records in the partition.
created_time_ts bigint The creation time, which is a 13-digit timestamp.
last_modified_time_ts bigint The most recent modification time, which is a 13-digit timestamp.
biz_date string The business date.

Metrics in the raw_v_meta_table_lineage table

Note The lineage feature cannot achieve 100% data integrity and accuracy due to the complexity of SQL and user code. We recommend that you do not use this feature for the business that has integrity and accuracy requirements.
Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
src_type string The type of the data source that serves as the source.
src_data_source_id string The ID of the data source that serves as the source.
src_database string The source database.
src_table string The source table.
dest_type string The type of the data source that serves as the destination.
dest_data_source_id string The ID of the data source that serves as the destination.
dest_database string The destination database.
dest_table string The destination table.
schedule_task_id string The ID of the scheduled task.
schedule_instance_id string The instance ID of the scheduled task.
schedule_task_owner string The owner of the scheduled task.
job_start_time_ts bigint The start time of the task, which is a 13-digit timestamp.
job_end_time_ts bigint The end time of the task, which is a 13-digit timestamp.
execute_time bigint The time that is required to run the task. Unit: seconds.
input_record_number bigint The number of input records in the source table.
biz_date string The business date.

Metrics in the raw_v_meta_table_output table

Only MaxCompute output tables are generated on the Data Map platform. The tables are of the types supported by the large lineage.
Note The output information is calculated based on lineage.
Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace in which scheduled tasks are run.
type string The type of the data source.
data_source_id string The ID of the data source.
database string The database.
table string The name of the table.
schedule_task_id string The ID of the scheduled task.
schedule_instance_id string The instance ID of the scheduled task.
schedule_task_owner string The owner of the scheduled task.
job_start_time_ts bigint The start time of the task, which is a 13-digit timestamp.
job_end_time_ts bigint The end time of the task, which is a 13-digit timestamp.
execute_time bigint The time that is required to run the task. Unit: seconds.
biz_date string The business date.

Metrics in the raw_v_meta_table_usage table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace in which scheduled tasks are run.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
schedule_task_id string The ID of the scheduled task.
schedule_task_owner string The owner of the scheduled task. If the existing task is not scheduled on DataWorks, this metric is set to NULL.
job_id string The task ID, which may not be the instance ID of the task scheduled on DataWorks. You can use this metric to collect the number of times data is read from the table and the number of times data is written to the table.
op_type string The operation type, which can be READ, WRITE, or UNKNOWN.
extras string The additional information, which is a JSON string.

If a MaxCompute task is run to perform operations on a table, you can use task_name to obtain the name of the MaxCompute task. If the ID of a task scheduled on DataWorks is not empty, you can use schedule_task_name to obtain the name of the scheduled task. Example: { "task_name": "console_query_task_16056294000000", "schedule_task_name": "SQL test task" }.

biz_date string The business date.

Metrics in the raw_v_meta_column_usage table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace in which scheduled tasks are run.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
column_name string The name of the field.
schedule_task_id string The ID of the scheduled task.
schedule_task_owner string The owner of the scheduled task. If the existing task is not scheduled on DataWorks, this metric is set to NULL.
inst_id string The task ID, which may not be the instance ID of the task scheduled on DataWorks.
op_type string The operation type, which can be SELECT, JOIN, GROUP BY, or WHERE.
extras string The additional information, which is a JSON string.

If a MaxCompute task is run to perform operations on a table, you can use task_name to obtain the name of the MaxCompute task. If the ID of a task scheduled on DataWorks is not empty, you can use schedule_task_name to obtain the name of the scheduled task. Example: { "task_name": "console_query_task_16056294000000", "schedule_task_name": "SQL test task" }.

biz_date string The business date.

Metrics in the raw_v_meta_biz_table_wiki table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace in which scheduled tasks are run.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
version string The Wiki version.
operator string The final operator, which may be an owner of the table.
content string The content of Wiki, which is written by using the Markdown syntax.
update_time_ts bigint The update time, which is a 13-digit timestamp.
biz_date string The business date.

Metrics in the raw_v_meta_table_join_map table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
column_name string The name of the field.
join_database_name string The name of the associated database or MaxCompute project.
join_table_name string The name of the associated table.
join_column_name string The name of the associated field.
join_type string The type of the JOIN operation, which can be left, right, or inner.
schedule_task_id string The ID of the scheduled task.
schedule_task_owner string The owner of the scheduled task.
job_id string The ID of the task at the engine layer.
extras string The additional information, which is a JSON string. If a MaxCompute task is run to perform operations on a table, you can use task_name to obtain the name of the MaxCompute task.
biz_date string The business date.

Metrics in the raw_v_meta_table_detail_log table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
operator string The user who views table details.
view_time_ts bigint The time when table details are viewed. The metric value is a 13-digit timestamp.
biz_date string The business date.

Metrics in the raw_v_schedule_node table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
project_id bigint The ID of the DataWorks workspace.
node_id bigint The ID of the node.
node_name string The name of the node.
node_type bigint The type of the scheduled node.
  • 0: auto triggered task
  • 1: manually triggered task
  • 2: paused task
  • 3: dry-run task
prg_type bigint The node type.
  • 10: MaxCompute SQL task
  • 23: data synchronization task
flow_id bigint The ID of the workflow.
project_env string The environment type.
  • PROD: production environment
  • DEV: development environment
create_time bigint The creation time, which is a 13-digit timestamp.
create_user string The creator.
modify_time bigint The most recent modification time, which is a 13-digit timestamp.
modify_user string The user who modifies data.
prg_name string The node type name.
para_value string The execution parameter.
file_id bigint The ID of the file.
file_version bigint The file version.
owner string The node owner.
resgroup_id bigint The ID of the resource group.
baseline_id bigint The ID of the baseline.
cycle_type bigint The scheduling cycle.
  • 0: daily, weekly, or monthly
  • Other values: hourly or minutely
repeatable bigint The rerun identifier.
  • 0: Only failed tasks can be rerun.
  • 1: All tasks can be rerun.
  • 2: No tasks can be rerun.
connection string The connection string of the data source.
dqc_type bigint The DQC type.
  • 0: associated DQC type
  • 1: unassociated DQC type
dqc_description string The DQC rule string.
task_rerun_time bigint The number of times the task can be rerun.
task_rerun_interval bigint The rerun interval. Unit: milliseconds.
cron_express string The cron expression of the scheduling frequency of the node.
priority bigint The task priority. Valid values: 1, 3, 5, 7, and 8. A larger value indicates a higher priority.
start_effect_date bigint The time when the node takes effect. The metric value is a 13-digit timestamp.
end_effect_date bigint The time when the node loses effect. The metric value is a 13-digit timestamp.
biz_date string The business date.

Metrics in the raw_v_schedule_task table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
project_id bigint The ID of the DataWorks workspace.
node_id bigint The ID of the node.
node_name string The name of the node.
task_id bigint The name of the task.
dag_id bigint The DAG ID of the workflow.
task_type bigint The type of the scheduled task.
  • 0: auto triggered task
  • 1: manually triggered task
  • 2: paused task
  • 3 or 5: dry-run task
dag_type bigint The DAG type.
  • 0: for auto triggered tasks
  • 1: for manually triggered tasks
  • 3: for retroactive tasks
prg_type bigint The node type.
  • 10: MaxCompute SQL task
  • 23: data synchronization task
flow_id bigint The ID of the workflow.
create_time bigint The creation time, which is a 13-digit timestamp.
modify_time bigint The most recent modification time, which is a 13-digit timestamp.
cycle_time bigint The scheduling time, which is a 13-digit timestamp.
in_group_id bigint The serial number of the task.
prg_name string The node type name.
para_value string The execution parameter.
file_id bigint The ID of the file.
file_version bigint The file version.
owner string The node owner.
resgroup_id bigint The ID of the resource group.
baseline_id bigint The ID of the baseline.
cycle_type bigint The scheduling cycle.
  • 0: daily, weekly, or monthly
  • Other values: hourly or minutely
repeatable bigint The rerun identifier.
  • 0: Only failed tasks can be rerun.
  • 1: All tasks can be rerun.
  • 2: No tasks can be rerun.
connection string The connection string of the data source.
dqc_type bigint The DQC type.
  • 0: associated DQC type
  • 1: unassociated DQC type
dqc_description string The DQC rule string.
task_rerun_time bigint The number of times the task can be rerun.
task_rerun_interval bigint The rerun interval. Unit: milliseconds.
begin_waittime_time bigint The time when the task starts to wait for scheduling. The metric value is a 13-digit timestamp.
finish_time bigint The time when the running is complete. The metric value is a 13-digit timestamp.
begin_waitres_time bigint The time when the task starts to wait for resource allocation. The metric value is a 13-digit timestamp.
begin_run_time bigint The time when the task starts to run. The metric value is a 13-digit timestamp.
rerun_times bigint The number of times the task is rerun.
priority bigint The task priority. Valid values: 1, 3, 5, 7, and 8. A larger value indicates a higher priority.
task_key string The unique identifier of the task.
error_msg string The cause of the running error.
status bigint The status of the task.
  • NOT_RUN(1, "Partial ancestor instances are successfully run.")
  • WAIT_TIME(2, "The task waits for the scheduling time specified by dueTime or cycleTime to arrive.")
  • WAIT_RESOURCE(3, "The task is delivered to the execution engine alisa and is waiting for scheduling in a queue.")
  • RUNNING(4, "The task is being run.")
  • CHECKING(7, "The task is run by using alisa, and data is delivered to the DQC for verification.")
  • CHECKING_CONDITION(8, "The task is run by using alisa, and branch conditions are being checked.")
  • FAILURE(5, "The task fails.")
  • SUCCESS(6, "The task is successful.")
biz_date string The business date.

Metrics in the raw_v_schedule_node_relation table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
child_node_id bigint The ID of the descendant node.
parent_node_id bigint The ID of the ancestor node.
step_type bigint The dependency type.
  • 0: common
  • 3: cross-cycle
child_flow_id bigint The ID of the workflow.
project_env string The environment type.
  • PROD: production environment
  • DEV: development environment
create_time bigint The creation time, which is a 13-digit timestamp.
create_user string The creator.
modify_time bigint The most recent modification time, which is a 13-digit timestamp.
modify_user string The user who modifies data.
biz_date string The business date.

Metrics in the raw_v_schedule_di_resgroup table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
project_id bigint The ID of the DataWorks workspace.
node_id bigint The ID of the node.
project_env string The environment of the workspace.
res_group_identifier string The ID of the resource group for data integration.
src_type string The type of the data source that serves as the source.
dst_type string The type of the data source that serves as the destination.
src_datasource string The data source that serves as the source.
dst_datasource string The data source that serves as the destination.
config_concurrent bigint The number of concurrent tasks.
biz_date string The business date.

Metrics in the raw_v_tenant_res_group table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
res_group_id bigint The ID of the resource group.
res_group_identifier string The identifier of the resource group.
res_group_type bigint The type of the resource group.
  • 1: scheduling resource group
  • 2: MaxCompute resource group
  • 4: resource group for data integration
res_group_mode bigint The type of the resource group.
  • 1: subscription
  • 2: pay-as-you-go
  • 3: Developer version (available only for MaxCompute)
status bigint The status of the resource group.
  • 0: The resource group is normal.
  • 1: The resource group is frozen.
  • 2: The resource group is deleted.
  • 3: The resource group is being created.
  • 4: The resource group fails to be created.
  • 5: The resource group is being updated.
  • 6: The resource group fails to be updated.
  • 7: The resource group is being deleted.
  • 8: The resource group fails to be deleted.
biz_ext_key string The extension field of the resource group. The value single indicates an exclusive resource group.
biz_date string The business date.

Metrics in the raw_v_tenant_user table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
yun_account string The Alibaba Cloud account.
account_name string The name of the account.
nick string The display name of the account.
full_yun_account string The Alibaba Cloud account that contains the account provider information.
biz_date string The business date.

Metrics in the raw_v_tenant_workspace table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
project_id bigint The ID of the workspace.
project_name string The name of the workspace.
project_identifier string The identifier of the workspace.
project_desc string The description of the workspace.
project_owner string The owner of the workspace.
status bigint The status of the workspace.
  • 0: The workspace is normal.
  • 1: The workspace is deleted.
  • 2: The workspace is being initialized.
  • 3: The workspace fails to be initialized.
  • 4: The workspace is manually disabled.
  • 5: The workspace is being deleted.
  • 6: The workspace fails to be deleted.
  • 7: The workspace is frozen due to overdue payments.
biz_date string The business date.

Metrics in the raw_v_tenant_workspace_user table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
base_id string The base ID of the user.
status bigint The status of the user.
  • 0: The user is normal.
  • 1: The user is disabled.
  • 2: The user is deleted.
gmt_create_ts bigint The creation time, which is a 13-digit timestamp.
gmt_modified_ts bigint The modification time, which is a 13-digit timestamp.
biz_date string The business date.