Platform For AI: Dataset

Last Updated: Apr 23, 2025
Parameter Type Description Example
object

The dataset property.

DatasetId string

The dataset ID.

d-c0h44g3****j8o4348
UserId string

The user ID.

2485765****023475
OwnerId string

The ID of the Alibaba Cloud account.

1631044****3440
Name string

The dataset name.

myName
Labels array

The labels.

Labels Label

The label.

{"Key": "env", "Value": "test"}
DataSourceType string

The data source type.

Valid values:

  • NAS
  • OSS
NAS
Uri string

The URI of the data source. URI examples:

  • Object Storage Service (OSS) data source: oss://bucket.endpoint/object
  • File Storage NAS (NAS) data source: nas://<nas-fsid>.region/subpath/to/dir/
  • Cloud Parallel File Storage (CPFS) 1.0 data source: nas://<cpfs-fsid>.region/subpath/to/dir/
  • CPFS 2.0 data source: nas://<cpfs-fsid>.region/<protocolserviceid>/
Note: You can distinguish CPFS 1.0 file systems from CPFS 2.0 file systems by the format of the file system ID. A CPFS 1.0 file system ID is in the cpfs-<8 ASCII characters> format. A CPFS 2.0 file system ID is in the cpfs-<16 ASCII characters> format.
nas://09f****f2.cn-hangzhou/
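The URI formats above can be split into their parts with plain string operations. The following is a minimal sketch, not part of any PAI SDK: the function names `parse_dataset_uri` and `classify_file_system` are hypothetical, and the CPFS check assumes the note refers to 8 or 16 characters after the `cpfs-` prefix.

```python
def classify_file_system(fs_id: str) -> str:
    """Guess the file system kind from its ID.

    Assumption: a CPFS 1.0 ID has 8 characters after "cpfs-" and a
    CPFS 2.0 ID has 16. Any ID without the prefix is treated as NAS.
    """
    prefix = "cpfs-"
    if not fs_id.startswith(prefix):
        return "NAS"
    suffix_length = len(fs_id) - len(prefix)
    if suffix_length == 8:
        return "CPFS 1.0"
    if suffix_length == 16:
        return "CPFS 2.0"
    return "CPFS (unknown version)"


def parse_dataset_uri(uri: str) -> dict:
    """Split an oss:// or nas:// dataset URI into its documented parts."""
    scheme, separator, rest = uri.partition("://")
    if not separator or scheme not in ("oss", "nas"):
        raise ValueError(f"unsupported dataset URI: {uri!r}")
    # Everything before the first "/" is "<name>.<endpoint or region>".
    authority, _, path = rest.partition("/")
    first, _, remainder = authority.partition(".")
    parts = {"scheme": scheme, "path": "/" + path}
    if scheme == "oss":
        parts["bucket"] = first
        parts["endpoint"] = remainder
    else:
        parts["file_system_id"] = first
        parts["region"] = remainder
        parts["kind"] = classify_file_system(first)
    return parts


# The example value from the Uri field.
print(parse_dataset_uri("nas://09f****f2.cn-hangzhou/"))
```

The sketch does not validate that the bucket, endpoint, or region actually exists; it only separates the components named in the URI formats.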
GmtCreateTime string

The time when the dataset was created.

2021-01-21T17:12:35.232Z
GmtModifiedTime string

The time when the dataset was modified.

2021-01-21T17:12:35.232Z
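The GmtCreateTime and GmtModifiedTime examples use an ISO 8601 timestamp with a trailing Z, which denotes UTC. A minimal sketch for turning such a string into a timezone-aware value follows. The helper name `parse_gmt_time` is hypothetical, and the sketch assumes every timestamp carries fractional seconds and the Z suffix, as the examples do.

```python
from datetime import datetime, timezone


def parse_gmt_time(value: str) -> datetime:
    """Parse a timestamp such as 2021-01-21T17:12:35.232Z as UTC.

    Assumption: the value always has fractional seconds and ends in "Z".
    """
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)


created = parse_gmt_time("2021-01-21T17:12:35.232Z")
print(created.isoformat())
```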
DataType string

The data type. Valid values:

  • COMMON (default)
  • PIC
  • TEXT
  • VIDEO
  • AUDIO
COMMON
Property string

The dataset property. Valid values:

  • FILE
  • DIRECTORY
DIRECTORY
SourceType string

The source type. Valid values:

  • PAI_PUBLIC_DATASET
  • ITAG
  • USER
USER
SourceId string

The source ID.

jdnhf***fnrimv
Description string

The dataset description.

WorkspaceId string

The ID of the workspace to which the dataset belongs.

478**
Options string

An extended field for optional settings. The value is a JSON string. When the dataset is used in Deep Learning Containers (DLC), the mountPath field specifies the default mount path of the dataset.

{ "mountPath": "/mnt/data/" }
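Because Options is a JSON string rather than a nested object, it has to be serialized before it is sent and parsed after it is received. The sketch below illustrates this with the standard `json` module. The helper names `build_options` and `read_mount_path` are hypothetical, and only the `mountPath` key comes from the description above.

```python
import json
from typing import Optional


def build_options(mount_path: str) -> str:
    """Serialize the mountPath option into the JSON string form of Options."""
    return json.dumps({"mountPath": mount_path})


def read_mount_path(options: str, default: Optional[str] = None) -> Optional[str]:
    """Return mountPath from an Options string, or the default if it is absent."""
    if not options:
        return default
    return json.loads(options).get("mountPath", default)


options = build_options("/mnt/data/")
print(options)
print(read_mount_path(options))
```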
Accessibility string

The accessibility of the dataset in the workspace. Valid values:

  • PRIVATE (default): The dataset is accessible only to you and the administrator of the workspace.
  • PUBLIC: The dataset is accessible to all members in the workspace.
PRIVATE
ProviderType string

The provider type of the dataset. Valid values:

  • Ecs (default)
  • Lingjun
Ecs
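Several fields described so far accept only a fixed set of values, and three of them have a documented default. The following sketch checks a dictionary of fields against those sets before it is used. The names `VALID_VALUES`, `DEFAULTS`, and `validate_dataset_fields` are hypothetical, and the sketch assumes values are compared case-sensitively, which this page does not state.

```python
# Valid values as listed in the field descriptions above.
VALID_VALUES = {
    "DataSourceType": {"NAS", "OSS"},
    "DataType": {"COMMON", "PIC", "TEXT", "VIDEO", "AUDIO"},
    "Property": {"FILE", "DIRECTORY"},
    "SourceType": {"PAI_PUBLIC_DATASET", "ITAG", "USER"},
    "Accessibility": {"PRIVATE", "PUBLIC"},
    "ProviderType": {"Ecs", "Lingjun"},
}

# Defaults marked "(default)" in the field descriptions.
DEFAULTS = {
    "DataType": "COMMON",
    "Accessibility": "PRIVATE",
    "ProviderType": "Ecs",
}


def validate_dataset_fields(fields: dict) -> dict:
    """Return a copy of fields with defaults applied.

    Raises ValueError if an enumerated field holds an unlisted value.
    Assumption: comparison is case-sensitive.
    """
    checked = {**DEFAULTS, **fields}
    for name, allowed in VALID_VALUES.items():
        if name in checked and checked[name] not in allowed:
            raise ValueError(
                f"{name}={checked[name]!r} is not one of {sorted(allowed)}"
            )
    return checked


print(validate_dataset_fields({"DataSourceType": "OSS", "Property": "DIRECTORY"}))
```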
SourceDatasetId string

The ID of the source dataset for the labeled dataset.

d-bvfasdfxxxxj8o411
SourceDatasetVersion string

The version of the source dataset for the labeled dataset.

v2
TagTemplateType string

The labeling template of the iTAG labeled dataset.

text-classification
LatestVersion DatasetVersion

The latest dataset version.

MountAccess string

The mount access permission on the dataset.

RO RW
MountAccessReadWriteRoleIdList array

The IDs of the roles that have read and write permissions on the dataset in the workspace. IDs that start with PAI are the IDs of basic roles, and IDs that start with role- are the IDs of custom roles. If the list contains "*", all roles have read and write permissions.

MountAccessReadWriteRoleIdList string

The ID of the workspace role.

PAI.AlgoOperator
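The rules for MountAccessReadWriteRoleIdList (prefix-based role kinds and the "*" wildcard) can be expressed in a few lines. This is a sketch under the rules stated above only. The function names `classify_role_id` and `can_read_write` are hypothetical, and the role ID `role-example` is made up for illustration.

```python
def classify_role_id(role_id: str) -> str:
    """Classify a role ID by the prefixes described for the list."""
    if role_id == "*":
        return "all roles"
    if role_id.startswith("PAI"):
        return "basic"
    if role_id.startswith("role-"):
        return "custom"
    return "unknown"


def can_read_write(role_id: str, read_write_role_ids: list) -> bool:
    """Return True if the role is listed or the list contains the "*" wildcard."""
    return "*" in read_write_role_ids or role_id in read_write_role_ids


print(classify_role_id("PAI.AlgoOperator"))
# "role-example" is a hypothetical custom role ID.
print(can_read_write("role-example", ["PAI.AlgoOperator"]))
print(can_read_write("role-example", ["*"]))
```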
ImportInfo string

The dataset import information. The fields vary by storage type, such as OSS, NAS, and CPFS.

OSS

{
  "region": "${region}",  // The region ID.
  "bucket": "${bucket}",  // The bucket name.
  "path": "${path}"       // The file path.
}

NAS

CPFS

CPFS for Lingjun

{
  "region": "cn-wulanchabu",
  "fileSystemId": "bmcpfs-xxxxxxxxxxx",
  "path": "/mnt",
  "mountTarget": "cpfs-xxxxxxxxxxxx-vpc-gacs9f.cn-wulanchabu.cpfs.aliyuncs.com",
  "isVpcMount": true
}
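ImportInfo is typed as a string, and the examples above are JSON, so a client would presumably serialize the structure before sending it. The sketch below builds the OSS form and parses the CPFS for Lingjun example. It assumes the string must be valid JSON, so the explanatory // comments shown in the OSS example are left out. The helper name `build_oss_import_info` and the region, bucket, and path values passed to it are hypothetical.

```python
import json


def build_oss_import_info(region: str, bucket: str, path: str) -> str:
    """Serialize the three OSS fields shown in the OSS example."""
    return json.dumps({"region": region, "bucket": bucket, "path": path})


# Hypothetical values, for illustration only.
print(build_oss_import_info("cn-hangzhou", "my-bucket", "/datasets/train/"))

# The CPFS for Lingjun example from this page, parsed back into a dict.
lingjun_example = (
    '{ "region": "cn-wulanchabu", "fileSystemId": "bmcpfs-xxxxxxxxxxx", '
    '"path": "/mnt", '
    '"mountTarget": "cpfs-xxxxxxxxxxxx-vpc-gacs9f.cn-wulanchabu.cpfs.aliyuncs.com", '
    '"isVpcMount": true }'
)
info = json.loads(lingjun_example)
print(info["fileSystemId"], info["isVpcMount"])
```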