The data opening feature of DataWorks provides tables and views in various dimensions that you can use to collect metadata. This topic lists these tables and views and describes their structures.

Core metrics in the rpt_v_meta_ind_table_core table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
table_uuid string The unique ID of the table.
owner_yun_acct string The Alibaba Cloud account of the table owner.
dim_life_cycle bigint The time to live (TTL). Unit: days.
  • 0: indicates that no TTL is configured.
  • Other values: indicate a specific period of time.
is_partition_table boolean Specifies whether the table is a partitioned table.
  • true: The table is a partitioned table.
  • false: The table is a non-partitioned table.
entity_type bigint The entity type.
  • 0: table
  • 1: view
categories string The detailed information about the categories.
last_access_time bigint The last time when the table was accessed. The metric value is a 10-digit UNIX timestamp.
size bigint The size of the table, which indicates the logical storage space that is occupied by data in the table. Unit: byte. This metric is set to NULL for a view.
column_count bigint The number of fields in the table. Partition key columns are included.
partition_count bigint The number of partitions in the table. This metric is set to NULL for a non-partitioned table.
detail_view_count bigint The number of times that table details are viewed on the page.
favorite_count bigint The number of times that the table is added to favorites.
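
The time columns in these tables come in two widths: last_access_time is a 10-digit UNIX timestamp, and the *_ts columns are 13-digit timestamps. The following is a minimal sketch of a helper that normalizes both widths to a UTC datetime. The function name to_datetime is hypothetical, and reading the 13-digit values as milliseconds is an assumption based on their width.

```python
from datetime import datetime, timezone
from typing import Optional


def to_datetime(ts: Optional[int]) -> Optional[datetime]:
    """Convert a 10-digit or 13-digit timestamp to a UTC datetime.

    Assumption: 10-digit values are seconds and 13-digit values are
    milliseconds since the UNIX epoch.
    """
    if ts is None:
        return None
    # Any value with 12 or more digits is treated as milliseconds.
    seconds = ts / 1000 if ts >= 10**11 else ts
    return datetime.fromtimestamp(seconds, tz=timezone.utc)


if __name__ == "__main__":
    print(to_datetime(1605629400))     # 10-digit value, such as last_access_time
    print(to_datetime(1605629400000))  # 13-digit value, such as created_time_ts
```

Both calls resolve to the same instant, which makes values from different tables directly comparable.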

Additional metrics in the rpt_v_meta_ind_table_extra table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
table_uuid string The unique ID of the table.
read_count bigint The number of times that data is read by using SQL statements. The data includes that of non-scheduled nodes.
read_count_30d bigint The number of times that data is read within 30 days by using SQL statements. The data includes that of non-scheduled nodes.
write_count bigint The number of times that data is written by using SQL statements. The data includes that of non-scheduled nodes.
join_count bigint The number of times that the table is joined.
direct_upstream_count bigint The number of parent tables in the lineage.
direct_downstream_count bigint The number of child tables in the lineage.
output_task_count bigint The number of nodes that generate the data in the table.

Metrics in the raw_v_meta_database table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
env_type bigint The environment type.
  • 0: development environment
  • 1: production environment
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
database_comment string The description of the database or MaxCompute project.
owner_name string The name of the owner.
created_time_ts bigint The creation time. The metric value is a 13-digit timestamp.
last_modified_time_ts bigint The last modification time. The metric value is a 13-digit timestamp.
location string The storage path of the database.
extras string The additional information about the database, which is a JSON string.
If the table preview and table visibility range attributes are configured for a MaxCompute project, you can use the allowDataPreview and projectVisibility keys to obtain the values of the attributes.
  • allowDataPreview: specifies whether tables in a MaxCompute project can be previewed.
    • true: Tables in a MaxCompute project can be previewed.
    • Other values or NULL: Tables in a MaxCompute project cannot be previewed.
  • projectVisibility: specifies the visibility range of tables in a MaxCompute project.
    • 0: hidden. Tables are visible only for table owners, project administrators, and project owners.
    • 1: visible for tenants.
    • 2: visible for project members.
biz_date string The data timestamp.
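
The extras column of raw_v_meta_database can be decoded along the following lines. This is a sketch only: the function name parse_database_extras and the output keys are hypothetical, and whether the values are stored as JSON booleans and numbers or as strings is an assumption, so the code accepts both.

```python
import json
from typing import Any, Dict, Optional

# Labels taken from the projectVisibility description above.
VISIBILITY = {
    0: "hidden",
    1: "visible for tenants",
    2: "visible for project members",
}


def parse_database_extras(extras: Optional[str]) -> Dict[str, Any]:
    """Extract the preview and visibility attributes from an extras JSON string."""
    data = json.loads(extras) if extras else {}
    preview = data.get("allowDataPreview")
    # Only true enables preview; other values or NULL disable it.
    allow_preview = preview is True or str(preview).lower() == "true"
    raw_visibility = data.get("projectVisibility")
    visibility = VISIBILITY.get(int(raw_visibility)) if raw_visibility is not None else None
    return {"allow_preview": allow_preview, "visibility": visibility}


if __name__ == "__main__":
    print(parse_database_extras('{"allowDataPreview": true, "projectVisibility": 1}'))
```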

Metrics in the raw_v_meta_table table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id string The ID of the DataWorks workspace.
table_uuid string The unique ID of the table.
table_name string The name of the table.
table_type string The type of the table.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
partition_keys string The partition keys in the table. Multi-level partitions are separated by commas (,). This metric is set to an empty string for a non-partitioned table.
table_comment string The description of the table.
table_biz_comment string The business description of the table.
visibility_scope bigint The visibility range of the table.
  • 0: hidden. Tables are visible only for table owners, project administrators, and project owners.
  • 1: visible for tenants.
  • 2: visible for project members.
owner_name string The name of the owner.
created_time_ts bigint The creation time. The metric value is a 13-digit timestamp.
last_modified_time_ts bigint The last time when data was modified. The metric value is a 13-digit timestamp.
last_meta_modified_time_ts bigint The last time when table metadata was modified. The metric value is a 13-digit timestamp.
location string The storage path of the table.
life_cycle bigint The TTL of the table. Unit: days.
data_size bigint The logical storage volume of the table. Unit: byte. If the table is a partitioned table, this metric is set to NULL. You must collect statistics on the storage volume based on the partition list.
biz_date string The data timestamp.

Metrics in the raw_v_meta_view table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id string The ID of the DataWorks workspace.
table_uuid string The unique ID of the table.
table_name string The name of the table.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_comment string The description of the table.
table_biz_comment string The business description of the table.
visibility_scope bigint The visibility range of the table.
  • 0: hidden. Tables are visible only for table owners, project administrators, and project owners.
  • 1: visible for tenants.
  • 2: visible for project members.
owner_name string The name of the owner.
created_time_ts bigint The creation time. The metric value is a 13-digit timestamp.
last_ddl_time_ts bigint The last time when the view was modified by using data definition language (DDL) statements. The metric value is a 13-digit timestamp.
view_text string The SQL statement that is used to create a view.
biz_date string The data timestamp.

Metrics in the raw_v_meta_column table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
column_name string The name of the field.
column_comment string The description of the field.
column_biz_comment string The business description of the field.
column_type string The data type of the field.
column_sequence bigint The sequence number of the field, which starts from 1.
is_partition_key boolean Specifies whether the field is a partition key.
is_primary_key boolean Specifies whether the field is a primary key.
biz_date string The data timestamp.

Metrics in the raw_v_meta_partition table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
partition_name string The name of the partition.
size bigint The logical size of the partition. Unit: byte.
record_number bigint The number of records in the partition.
created_time_ts bigint The creation time. The metric value is a 13-digit timestamp.
last_modified_time_ts bigint The last modification time. The metric value is a 13-digit timestamp.
biz_date string The data timestamp.
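
Because data_size in raw_v_meta_table is NULL for partitioned tables, the storage volume of such a table has to be summed from its partitions. The following sketch shows one way to do this over rows that are already loaded as dictionaries keyed by column name. The function name and the row representation are hypothetical, and the biz_date value format used in the example is an assumption.

```python
from collections import defaultdict
from typing import Dict, Iterable, Mapping, Tuple

TableKey = Tuple[str, str, str]


def table_sizes_from_partitions(
    partition_rows: Iterable[Mapping], biz_date: str
) -> Dict[TableKey, int]:
    """Sum partition sizes (in bytes) per table for one biz_date snapshot."""
    totals: Dict[TableKey, int] = defaultdict(int)
    for row in partition_rows:
        if row["biz_date"] != biz_date:
            continue  # Use a single snapshot so that partitions are not double counted.
        key = (row["catalog_name"], row["database_name"], row["table_name"])
        totals[key] += row["size"] or 0  # Treat a NULL size as 0.
    return dict(totals)


if __name__ == "__main__":
    rows = [
        {"catalog_name": "odps", "database_name": "p", "table_name": "t",
         "partition_name": "ds=1", "size": 10, "biz_date": "20240101"},
        {"catalog_name": "odps", "database_name": "p", "table_name": "t",
         "partition_name": "ds=2", "size": 32, "biz_date": "20240101"},
    ]
    print(table_sizes_from_partitions(rows, "20240101"))
```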

Metrics in the raw_v_meta_table_lineage table

Note Lineage data cannot be guaranteed to be 100% complete and accurate because of the complexity of SQL statements and code. We recommend that you do not rely on this feature for business that has strict integrity and accuracy requirements.
Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
src_type string The type of the data source.
src_data_source_id string The ID of the data source.
src_database string The source database.
src_table string The source table.
dest_type string The type of the data destination.
dest_data_source_id string The ID of the data destination.
dest_database string The destination database.
dest_table string The destination table.
schedule_task_id string The ID of the scheduled node.
schedule_instance_id string The instance ID of the scheduled node.
schedule_task_owner string The owner of the scheduled node.
job_start_time_ts bigint The start time of the node, which is a 13-digit timestamp.
job_end_time_ts bigint The end time of the node, which is a 13-digit timestamp.
execute_time bigint The time that is required to run the node. Unit: seconds.
input_record_number bigint The number of records that were read from the source table.
biz_date string The data timestamp.
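
Lineage rows can be aggregated into per-table counts of direct parent and child tables, similar in spirit to the direct_upstream_count and direct_downstream_count metrics. The sketch below is only an illustration over rows loaded as dictionaries. The function name is hypothetical, it is not necessarily how DataWorks computes those metrics, and skipping rows whose source and destination are the same table is an assumption.

```python
from collections import defaultdict
from typing import Dict, Iterable, Mapping, Set, Tuple

Table = Tuple[str, str]


def direct_lineage_counts(lineage_rows: Iterable[Mapping]) -> Dict[Table, Tuple[int, int]]:
    """Return {(database, table): (distinct parent tables, distinct child tables)}."""
    upstream: Dict[Table, Set[Table]] = defaultdict(set)
    downstream: Dict[Table, Set[Table]] = defaultdict(set)
    for row in lineage_rows:
        src = (row["src_database"], row["src_table"])
        dest = (row["dest_database"], row["dest_table"])
        if src == dest:
            continue  # Assumption: a table is not counted as its own parent.
        upstream[dest].add(src)
        downstream[src].add(dest)
    tables = set(upstream) | set(downstream)
    return {t: (len(upstream.get(t, ())), len(downstream.get(t, ()))) for t in tables}


if __name__ == "__main__":
    rows = [
        {"src_database": "a", "src_table": "t1", "dest_database": "b", "dest_table": "t2"},
        {"src_database": "b", "src_table": "t2", "dest_database": "b", "dest_table": "t3"},
    ]
    print(direct_lineage_counts(rows))
```

Sets are used so that repeated runs of the same scheduled node, which produce repeated lineage rows, do not inflate the counts.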

Metrics in the raw_v_meta_table_output table

Data Map displays the tables whose data is generated by ODPS nodes. The metadata includes the tables whose data is generated by ODPS nodes and data integration nodes.
Note The output information is generated based on lineage.
Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace in which scheduled nodes are run.
type string The type of the data source.
data_source_id string The ID of the data source.
database string The database.
table string The name of the table.
schedule_task_id string The ID of the scheduled node.
schedule_instance_id string The instance ID of the scheduled node.
schedule_task_owner string The owner of the scheduled node.
job_start_time_ts bigint The start time of the node, which is a 13-digit timestamp.
job_end_time_ts bigint The end time of the node, which is a 13-digit timestamp.
execute_time bigint The time that is required to run the node. Unit: seconds.
biz_date string The data timestamp.

Metrics in the raw_v_meta_table_usage table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace in which scheduled nodes are run.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
schedule_task_id string The ID of the scheduled node.
schedule_task_owner string The owner of the scheduled node. If the node is not scheduled in DataWorks, this metric is set to NULL.
job_id string The node ID, which may not be the instance ID of the node that is scheduled in DataWorks. You can use this metric to count the number of times that data is read from the table and the number of times that data is written to the table.
op_type string The operation type, which can be READ, WRITE, or UNKNOWN.
extras string The additional information, which is a JSON string.
If a MaxCompute node is run to perform operations on a table, you can use the task_name key to obtain the name of the MaxCompute node. If the ID of a node that is scheduled in DataWorks is not empty, you can use the schedule_task_name key to obtain the name of the scheduled node. Example: { "task_name": "console_query_task_16056294000000", "schedule_task_name": "Test SQL node" }.
biz_date string The data timestamp.
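
The job_id column can be used to count how many times data is read from or written to a table. A minimal sketch, assuming the rows of raw_v_meta_table_usage are loaded as dictionaries and that each distinct job_id represents one read or write; the function name is hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Mapping, Set, Tuple

UsageKey = Tuple[str, str, str]


def read_write_counts(usage_rows: Iterable[Mapping]) -> Dict[UsageKey, int]:
    """Return {(database_name, table_name, op_type): number of distinct job_id values}."""
    jobs: Dict[UsageKey, Set[str]] = defaultdict(set)
    for row in usage_rows:
        key = (row["database_name"], row["table_name"], row["op_type"])
        jobs[key].add(row["job_id"])  # A set keeps each job only once.
    return {key: len(ids) for key, ids in jobs.items()}


if __name__ == "__main__":
    rows = [
        {"database_name": "p", "table_name": "t", "op_type": "READ", "job_id": "j1"},
        {"database_name": "p", "table_name": "t", "op_type": "WRITE", "job_id": "j2"},
    ]
    print(read_write_counts(rows))
```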

Metrics in the raw_v_meta_column_usage table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace in which scheduled nodes are run.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
column_name string The name of the field.
schedule_task_id string The ID of the scheduled node.
schedule_task_owner string The owner of the scheduled node. If the node is not scheduled in DataWorks, this metric is set to NULL.
inst_id string The node ID, which may not be the instance ID of the node that is scheduled in DataWorks.
op_type string The operation type, which can be SELECT, JOIN, GROUP BY, or WHERE.
extras string The additional information, which is a JSON string.
If a MaxCompute node is run to perform operations on a table, you can use the task_name key to obtain the name of the MaxCompute node. If the ID of a node that is scheduled in DataWorks is not empty, you can use the schedule_task_name key to obtain the name of the scheduled node. Example: { "task_name": "console_query_task_16056294000000", "schedule_task_name": "Test SQL node" }.
biz_date string The data timestamp.
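
The node names can be read from the extras JSON string of a usage row as sketched below. The function name node_names is hypothetical, and either key may be absent, in which case None is returned for that name.

```python
import json
from typing import Optional, Tuple


def node_names(extras: Optional[str]) -> Tuple[Optional[str], Optional[str]]:
    """Return (task_name, schedule_task_name) from an extras JSON string."""
    data = json.loads(extras) if extras else {}
    return data.get("task_name"), data.get("schedule_task_name")


if __name__ == "__main__":
    example = '{ "task_name": "console_query_task_16056294000000", "schedule_task_name": "Test SQL node" }'
    print(node_names(example))
```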

Metrics in the raw_v_meta_biz_table_wiki table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace in which scheduled nodes are run.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
version string The version number of Wiki.
operator string The user who last edited the wiki, who may be an owner of the table.
content string The content of Wiki, which is written by using the Markdown syntax.
update_time_ts bigint The modification time. The metric value is a 13-digit timestamp.
biz_date string The data timestamp.

Metrics in the raw_v_meta_table_join_map table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
column_name string The name of the field.
join_database_name string The name of the associated database or MaxCompute project.
join_table_name string The name of the associated table.
join_column_name string The name of the associated field.
join_type string The type of the JOIN operation, which can be left, right, or inner.
schedule_task_id string The ID of the scheduled node.
schedule_task_owner string The owner of the scheduled node.
job_id string The ID of the node at the engine layer.
extras string The additional information, which is a JSON string. If a MaxCompute node is run to perform operations on a table, you can use the task_name key to obtain the name of the MaxCompute node.
biz_date string The data timestamp.
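
One possible use of this table is to find the column pairs that are joined most often. The sketch below counts rows per column pair and treats A-B and B-A as the same pair. The function name is hypothetical, and treating each row as one join occurrence is an assumption. It also assumes that all name columns are non-NULL strings.

```python
from collections import Counter
from typing import Iterable, Mapping


def join_pair_counts(join_rows: Iterable[Mapping]) -> Counter:
    """Count rows per unordered pair of (database, table, column)."""
    counts: Counter = Counter()
    for row in join_rows:
        left = (row["database_name"], row["table_name"], row["column_name"])
        right = (row["join_database_name"], row["join_table_name"], row["join_column_name"])
        # Sort the two sides so that the join direction does not matter.
        counts[tuple(sorted((left, right)))] += 1
    return counts


if __name__ == "__main__":
    rows = [
        {"database_name": "p", "table_name": "orders", "column_name": "user_id",
         "join_database_name": "p", "join_table_name": "users", "join_column_name": "id"},
    ]
    print(join_pair_counts(rows).most_common(1))
```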

Metrics in the raw_v_meta_table_detail_log table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
catalog_name string The catalog to which the table belongs. This metric is set to odps for MaxCompute projects.
database_name string The name of the database or MaxCompute project.
table_name string The name of the table.
operator string The user who views table details.
view_time_ts bigint The time when table details are viewed. The metric value is a 13-digit timestamp.
biz_date string The data timestamp.

Metrics in the raw_v_meta_category table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
category_id bigint The ID of the category.
category_name string The name of the category.
category_pid bigint The ID of the parent category. This metric is set to 0 or NULL for a level 1 category.
depth bigint The level of the category. This metric is set to 1 for a level 1 category.
sort_field double The field based on which the categories are sorted.
creator_account string The account that creates the category.
created_time_ts bigint The creation time. The metric value is a 13-digit timestamp.
last_modified_time_ts bigint The last modification time. The metric value is a 13-digit timestamp.
biz_date string The data timestamp.
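
The category_pid column links each category to its parent, so full category paths can be rebuilt by walking up to a level 1 category. A minimal sketch over rows loaded as dictionaries; the function name and the "/" separator are hypothetical choices.

```python
from typing import Dict, Iterable, Mapping


def category_paths(category_rows: Iterable[Mapping]) -> Dict[int, str]:
    """Return {category_id: "level1/level2/..."} built from category_pid links."""
    by_id = {row["category_id"]: row for row in category_rows}
    paths: Dict[int, str] = {}
    for category_id in by_id:
        names, current, seen = [], category_id, set()
        # A parent of 0 or NULL is not a known ID, which ends the walk at level 1.
        # The seen set guards against accidental cycles in the data.
        while current in by_id and current not in seen:
            seen.add(current)
            names.append(by_id[current]["category_name"])
            current = by_id[current]["category_pid"]
        paths[category_id] = "/".join(reversed(names))
    return paths


if __name__ == "__main__":
    rows = [
        {"category_id": 1, "category_name": "finance", "category_pid": 0},
        {"category_id": 2, "category_name": "billing", "category_pid": 1},
    ]
    print(category_paths(rows))
```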

Metrics in the raw_v_schedule_node table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
project_id bigint The ID of the DataWorks workspace.
node_id bigint The ID of the node.
node_name string The name of the node.
node_type bigint The scheduling type of the node.
  • 0: auto triggered node
  • 1: manually triggered node
  • 2: paused node
  • 3: dry-run node
prg_type bigint The type of the node.
  • 10: ODPS SQL node
  • 23: data integration node
flow_id bigint The ID of the workflow.
project_env string The environment type.
  • PROD: production environment
  • DEV: development environment
create_time bigint The creation time. The metric value is a 13-digit timestamp.
create_user string The creator.
modify_time bigint The last modification time. The metric value is a 13-digit timestamp.
modify_user string The user who modifies the node.
prg_name string The name of the node type.
para_value string The execution parameter.
file_id bigint The ID of the file.
file_version bigint The file version.
owner string The owner of the node.
resgroup_id bigint The ID of the resource group.
baseline_id bigint The ID of the baseline.
cycle_type bigint The recurrence.
  • 0: daily, weekly, or monthly
  • Other values: hourly or minutely
repeatable bigint The rerun identifier.
  • 0: Only failed nodes can be rerun.
  • 1: All nodes can be rerun.
  • 2: No nodes can be rerun.
connection string The connection string of the data source.
dqc_type bigint Specifies whether the node uses the Data Quality service.
  • 0: The node uses the Data Quality service.
  • 1: The node does not use the Data Quality service.
dqc_description string The Data Quality rule.
task_rerun_time bigint The number of times that the task can be rerun.
task_rerun_interval bigint The rerun interval. Unit: milliseconds.
cron_express string The CRON expression that specifies the scheduling frequency of the node.
priority bigint The priority of the task. Valid values: 1, 3, 5, 7, and 8. A greater value indicates a higher priority.
start_effect_date bigint The time when the node takes effect. The metric value is a 13-digit timestamp.
end_effect_date bigint The time when the node stops taking effect. The metric value is a 13-digit timestamp.
biz_date string The data timestamp.

Metrics in the raw_v_schedule_task table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
project_id bigint The ID of the DataWorks workspace.
node_id bigint The ID of the node.
node_name string The name of the node.
task_id bigint The ID of the task.
dag_id bigint The directed acyclic graph (DAG) ID of the workflow.
task_type bigint The scheduling type of the task.
  • 0: auto triggered task
  • 1: manually triggered task
  • 2: paused task
  • 3 or 5: dry-run task
dag_type bigint The DAG type.
  • 0: for auto triggered nodes
  • 1: for manually triggered nodes
  • 3: for retroactive data generation
prg_type bigint The type of the node.
  • 10: ODPS SQL node
  • 23: data integration node
flow_id bigint The ID of the workflow.
create_time bigint The creation time. The metric value is a 13-digit timestamp.
modify_time bigint The last modification time. The metric value is a 13-digit timestamp.
cycle_time bigint The scheduling time, which is a 13-digit timestamp.
in_group_id bigint The serial number of the task.
prg_name string The name of the node type.
para_value string The execution parameter.
file_id bigint The ID of the file.
file_version bigint The file version.
owner string The owner of the node.
resgroup_id bigint The ID of the resource group.
baseline_id bigint The ID of the baseline.
cycle_type bigint The recurrence.
  • 0: daily, weekly, or monthly
  • Other values: hourly or minutely
repeatable bigint The rerun identifier.
  • 0: Only failed nodes can be rerun.
  • 1: All nodes can be rerun.
  • 2: No nodes can be rerun.
connection string The connection string of the data source.
dqc_type bigint Specifies whether the node uses the Data Quality service.
  • 0: The node uses the Data Quality service.
  • 1: The node does not use the Data Quality service.
dqc_description string The Data Quality rule.
task_rerun_time bigint The number of times that the task can be rerun.
task_rerun_interval bigint The rerun interval. Unit: milliseconds.
begin_waittime_time bigint The time when the node starts to wait for scheduling. The metric value is a 13-digit timestamp.
finish_time bigint The time when the node finishes running. The metric value is a 13-digit timestamp.
begin_waitres_time bigint The time when the node starts to wait for resource allocation. The metric value is a 13-digit timestamp.
begin_run_time bigint The time when the node starts to run. The metric value is a 13-digit timestamp.
rerun_times bigint The number of times that the task is rerun.
priority bigint The priority of the task. Valid values: 1, 3, 5, 7, and 8. A greater value indicates a higher priority.
task_key string The unique identifier of the task.
error_msg string The reason why the task failed.
status bigint The status of the task.
  • 1 (NOT_RUN): Not all ancestor instances are successful.
  • 2 (WAIT_TIME): The task is waiting for the scheduling time that is specified by dueTime or cycleTime to arrive.
  • 3 (WAIT_RESOURCE): The task is delivered to the execution engine Alisa and is waiting for scheduling in a queue.
  • 4 (RUNNING): The task is being run.
  • 5 (FAILURE): The task failed.
  • 6 (SUCCESS): The task is successful.
  • 7 (CHECKING): The task is run by using Alisa, and data is delivered to Data Quality for verification.
  • 8 (CHECKING_CONDITION): The task is run by using Alisa, and branch conditions are being checked.
biz_date string The data timestamp.
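
The status code and the time columns of a raw_v_schedule_task row can be combined into a short summary, as in the following sketch. The function name and the output keys are hypothetical, and the durations assume that the 13-digit timestamps are in milliseconds.

```python
from typing import Any, Dict, Mapping, Optional

# Status codes and names from the status description above.
TASK_STATUS = {
    1: "NOT_RUN", 2: "WAIT_TIME", 3: "WAIT_RESOURCE", 4: "RUNNING",
    5: "FAILURE", 6: "SUCCESS", 7: "CHECKING", 8: "CHECKING_CONDITION",
}


def task_timing(row: Mapping) -> Dict[str, Any]:
    """Summarize the status, resource wait time, and run time of a task row."""

    def seconds_between(start: str, end: str) -> Optional[float]:
        if row.get(start) is None or row.get(end) is None:
            return None  # The task has not reached this phase yet.
        return (row[end] - row[start]) / 1000  # Assumes millisecond timestamps.

    return {
        "status": TASK_STATUS.get(row["status"], "UNKNOWN"),
        "wait_resource_seconds": seconds_between("begin_waitres_time", "begin_run_time"),
        "run_seconds": seconds_between("begin_run_time", "finish_time"),
    }


if __name__ == "__main__":
    print(task_timing({"status": 4, "begin_waitres_time": 1000,
                       "begin_run_time": 3000, "finish_time": None}))
```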

Metrics in the raw_v_schedule_node_relation table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
child_node_id bigint The ID of the descendant node.
parent_node_id bigint The ID of the ancestor node.
step_type bigint The dependency type.
  • 0: common
  • 3: cross-cycle
child_flow_id bigint The ID of the workflow.
project_env string The environment type.
  • PROD: production environment
  • DEV: development environment
create_time bigint The creation time. The metric value is a 13-digit timestamp.
create_user string The creator.
modify_time bigint The last modification time. The metric value is a 13-digit timestamp.
modify_user string The user who modifies the node.
biz_date string The data timestamp.
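
Each row of this table is one edge from a parent node to a child node, so the full set of nodes affected by a given node can be collected with a breadth-first traversal. A minimal sketch over rows loaded as dictionaries; the function name is hypothetical, and in practice the rows would likely be filtered to one project_env and one biz_date first.

```python
from collections import defaultdict, deque
from typing import Dict, Iterable, List, Mapping, Set


def descendants(relation_rows: Iterable[Mapping], node_id: int) -> Set[int]:
    """Return the IDs of all nodes that directly or indirectly depend on node_id."""
    children: Dict[int, List[int]] = defaultdict(list)
    for row in relation_rows:
        children[row["parent_node_id"]].append(row["child_node_id"])
    seen: Set[int] = set()
    queue = deque([node_id])
    while queue:
        for child in children[queue.popleft()]:
            if child not in seen:
                seen.add(child)
                queue.append(child)
    # A node that depends on itself, for example across cycles, is not its own descendant.
    seen.discard(node_id)
    return seen


if __name__ == "__main__":
    rows = [
        {"parent_node_id": 1, "child_node_id": 2},
        {"parent_node_id": 2, "child_node_id": 3},
    ]
    print(descendants(rows, 1))
```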

Metrics in the raw_v_schedule_di_resgroup table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
project_id bigint The ID of the DataWorks workspace.
node_id bigint The ID of the node.
project_env string The environment of the workspace.
res_group_identifier string The ID of the resource group for Data Integration.
src_type string The type of the data source.
dst_type string The type of the data destination.
src_datasource string The data source.
dst_datasource string The data destination.
config_concurrent bigint The number of concurrent nodes.
biz_date string The data timestamp.

Metrics in the raw_v_tenant_res_group table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
res_group_id bigint The ID of the resource group.
res_group_identifier string The identifier of the resource group.
res_group_type bigint The type of the resource group.
  • 1: resource group for scheduling
  • 2: MaxCompute quota group
  • 4: resource group for Data Integration
res_group_mode bigint The billing method of the resource group.
  • 1: subscription
  • 2: pay-as-you-go
  • 3: Developer Edition (available only for MaxCompute)
status bigint The status of the resource group.
  • 0: The resource group is normal.
  • 1: The resource group is frozen.
  • 2: The resource group is deleted.
  • 3: The resource group is being created.
  • 4: The resource group fails to be created.
  • 5: The resource group is being updated.
  • 6: The resource group fails to be updated.
  • 7: The resource group is being deleted.
  • 8: The resource group fails to be deleted.
biz_ext_key string The extension field of the resource group. A value of single indicates an exclusive resource group.
biz_date string The data timestamp.

Metrics in the raw_v_tenant_user table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
yun_account string The Alibaba Cloud account.
account_name string The name of the account.
nick string The display name of the account.
full_yun_account string The Alibaba Cloud account that contains the account provider information.
biz_date string The data timestamp.

Metrics in the raw_v_tenant_workspace table

Metric name Data type Description
tenant_id bigint The ID of the tenant.
project_id bigint The ID of the workspace.
project_name string The name of the workspace.
project_identifier string The identifier of the workspace.
project_desc string The description of the workspace.
project_owner string The owner of the workspace.
status bigint The status of the workspace.
  • 0: The workspace is normal.
  • 1: The workspace is deleted.
  • 2: The workspace is being initialized.
  • 3: The workspace fails to be initialized.
  • 4: The workspace is manually disabled.
  • 5: The workspace is being deleted.
  • 6: The workspace fails to be deleted.
  • 7: The workspace is frozen due to overdue payments.
biz_date string The data timestamp.

Metrics in the raw_v_tenant_workspace_user table

Metric name Data type Description
tenant_id bigint The ID of the DataWorks tenant.
project_id bigint The ID of the DataWorks workspace.
base_id string The base ID of the user.
status bigint The status of the user.
  • 0: The user is normal.
  • 1: The user is disabled.
  • 2: The user is deleted.
gmt_create_ts bigint The creation time. The metric value is a 13-digit timestamp.
gmt_modified_ts bigint The last modification time. The metric value is a 13-digit timestamp.
biz_date string The data timestamp.