|
Parameter |
Type |
Description |
Example |
|---|---|---|---|
|
object |
Additional parameter settings. |
||
| BusinessUserId |
string |
The ID of the user associated with the job. |
16**** |
| Caller |
string |
The caller. |
SilkFlow |
| Tags |
object |
Custom tags. |
|
|
string |
The tag. |
key:value |
|
| PipelineId |
string |
The workflow ID. |
pid-12**** |
| EnableTideResource |
boolean |
Specifies whether to allow the job to use tide resources. Valid values:
|
true |
| EnableErrorMonitoringInAIMaster |
boolean |
Specifies whether to enable fault tolerance monitoring for the job. Valid values:
|
false |
| ErrorMonitoringArgs |
string |
The configuration parameters for fault tolerance monitoring after it is enabled. For example, you can specify whether to enable log hang detection. |
--enable-log-hang-detection true |
| EnableRDMA |
boolean |
Specifies whether to allow the job to use RDMA. Valid values:
|
true |
| EnableOssAppend |
boolean |
Specifies whether to allow append writes to OSS. Valid values:
|
true |
| OversoldType |
string |
The usage policy for oversold resources for the job. |
AcceptQuotaOverSold |
| AdvancedSettings |
object |
Additional advanced parameter settings. |
|
| Driver |
string |
The NVIDIA driver configuration. |
535.54.03 |
| EnableSanityCheck |
boolean |
Specifies whether to enable computing power health checks for the job. Valid values:
|
true |
| SanityCheckArgs |
string |
The configuration parameters for computing power health checks. |
--sanity-check-timing=AfterJobFaultTolerant --sanity-check-timeout-ops=MarkJobFail |
| JobReservedMinutes |
integer |
The duration in minutes to retain the job after it finishes. |
30 |
| JobReservedPolicy |
string |
The policy for retaining the job after it finishes. |
Always |
| EnableCPUAffinity |
boolean |
The CPU affinity settings. This parameter is valid only when you use subscription general computing resources. |
true |
| DisableEcsStockCheck |
boolean |
Specifies whether to skip the inventory check. Valid values:
|
false |
| AllocateAllRDMADevices |
boolean |
Specifies whether to mount all RDMA network interface controllers (NICs). |
|
| EnableDSWDev |
boolean |
||
| DataJuicerConfig | DataJuicerConfig |
The configuration for the DataJuicer task. |
|
| ModelConfig | ModelConfig |
The configuration for the output model. This parameter is valid only in federated training scenarios. |
|
| AllowUnschedulableNodes |
boolean |