All Products
Search
Document Center

Platform For AI:Dataset

Last Updated:Sep 28, 2025

Parameter

Type

Description

Example

object

Dataset properties.

DatasetId

string

The dataset ID.

d-c0h44g3****j8o4348

UserId

string

The user ID.

2485765****023475

OwnerId

string

The ID of the Alibaba Cloud account.

1631044****3440

Name

string

The name of the dataset.

myName

Labels

array

The list of labels.

Label

A label.

{"Key": "env", "Value": "test"}

DataSourceType

string

The type of the data source.

NAS

Uri

string

URI examples:

  • If the data source is of the OSS type: oss://bucket.endpoint/object

  • If the data source is of the General-purpose NAS type: nas://<nasfisid>.region/subpath/to/dir/

  • If the data source is of the CPFS 1.0 type: nas://<cpfs-fsid>.region/subpath/to/dir/

  • If the data source is of the CPFS 2.0 type: nas://<cpfs-fsid>.region/<protocolserviceid>/

Note

CPFS 1.0 and CPFS 2.0 are distinguished by the fsid format. The fsid of CPFS 1.0 is in the `cpfs-<8-character ASCII string>` format. The fsid of CPFS 2.0 is in the `cpfs-<16-character ASCII string>` format.

nas://09f****f2.cn-hangzhou/

GmtCreateTime

string

The time when the dataset was created.

2021-01-21T17:12:35.232Z

GmtModifiedTime

string

The time when the dataset was last updated.

2021-01-21T17:12:35.232Z

DataType

string

The data type of the dataset. Valid values:

  • COMMON (default): common data.

  • PIC: images.

  • TEXT: text.

  • VIDEO: videos.

  • AUDIO: audio.

COMMON

Property

string

The property of the dataset. Valid values:

  • FILE: a file.

  • DIRECTORY: a directory.

DIRECTORY

SourceType

string

The source type.

USER

SourceId

string

The source ID.

jdnhf***fnrimv

Description

string

The description of the dataset.

This is a description of a dataset.

WorkspaceId

string

The ID of the workspace to which the dataset belongs.

478**

Options

string

Extended fields (options), in JSON string format. When a Data Lake Compute (DLC) job uses the dataset, you can configure the mountPath field to specify the default mount path.

{ "mountPath": "/mnt/data/" }

Accessibility

string

The visibility of the dataset in the workspace. Valid values:

  • PRIVATE (default): The dataset is visible only to its owner and administrators in the workspace.

  • PUBLIC: The dataset is visible to all users in the workspace.

PRIVATE

ProviderType

string

The type of the data source provider. Valid values:

  • Ecs (default)

  • Lingjun

Ecs

SourceDatasetId

string

The ID of the source dataset for the annotated dataset.

d-bvfasdfxxxxj8o411

SourceDatasetVersion

string

The version of the source dataset for the annotated dataset.

v2

TagTemplateType

string

The annotation template for the iTAG annotated dataset.

text-classification

LatestVersion DatasetVersion

The information about the latest version of the dataset.

MountAccess

string

The mount access permissions.

RO RW

MountAccessReadWriteRoleIdList

array

The list of workspace role IDs with read and write permissions on the dataset. Role IDs that start with `PAI` are built-in. Role IDs that start with `role-` are custom. An asterisk (*) indicates that all roles have read and write permissions.

string

A workspace role ID.

PAI.AlgoOperator

ImportInfo

string

The storage import configuration of the dataset. Data sources such as OSS, NAS, and CPFS are supported.

OSS

{ "region": "${region}",// Region ID "bucket": "${bucket}",// Bucket name "path": "${path}" // File path }

NAS

CPFS

Content

Intelligent Computing CPFS

Content

{ "region": "cn-wulanchabu", "fileSystemId": "bmcpfs-xxxxxxxxxxx", "path": "/mnt", "mountTarget": "cpfs-xxxxxxxxxxxx-vpc-gacs9f.cn-wulanchabu.cpfs.aliyuncs.com", "isVpcMount": true }

Edition

string

The edition of the dataset. Valid values:

BASIC: The basic edition. File metadata management is not supported. ADVANCED: The advanced edition. This edition is supported only for datasets of the OSS type. Each version supports metadata management for up to 1,000,000 files.

BASIC