Retrieves a list of self-healing rules.
Operation description
Try it now
Test
RAM authorization
|
Action |
Access level |
Resource type |
Condition key |
Dependent action |
|
cs:ListAutoRepairPolicies |
none |
*Cluster
|
None | None |
Request syntax
GET /clusters/{cluster_id}/auto_repair_policies HTTP/1.1
Path Parameters
|
Parameter |
Type |
Required |
Description |
Example |
| cluster_id |
string |
No |
The ID of the cluster. |
ce0da5a1d627e4e9e9f96cae8ad07**** |
Request parameters
|
Parameter |
Type |
Required |
Description |
Example |
No parameters required.
Response elements
|
Element |
Type |
Description |
Example |
|
object |
The response schema. |
||
| items |
array<object> |
A list of auto-repair policies. |
|
|
array<object> |
An auto-repair policy. |
||
| name |
string |
The name of the auto-repair policy. |
test |
| id |
string |
The ID of the auto-repair policy. |
r-xxxxx |
| resource_type |
string |
The resource type that the policy applies to. |
nodepool |
| resource_sub_type |
string |
The resource subtype that the policy applies to. |
ess |
| resource_ids |
array |
A list of resource IDs that the policy applies to. |
|
|
string |
A resource ID. |
npxxxxx |
|
| rules |
array<object> |
A list of sub-rules for the auto-repair policy. |
|
|
array<object> |
Details of a sub-rule. |
||
| incidents |
array<object> |
A list of incidents that trigger this rule. |
|
|
object |
Details of an incident. |
||
| name |
string |
The name of the incident. |
Node.FaultNeedReboot.HOST |
| type |
string |
The diagnosis type. |
system |
| repair_procedure |
array<object> |
The repair procedure to run. |
|
|
array<object> |
Details of a step in the repair procedure. |
||
| name |
string |
The name of the procedure. |
QuarantineGPU |
| intervention |
object |
Configuration for manual intervention. |
|
| enable |
boolean |
Specifies whether to enable manual approval. |
true |
| type |
string |
The type of manual approval. |
label |
| inquiring_label |
object |
Configuration for the approval-request label. At this stage, ACK applies this label to the node and waits for you to authorize the procedure. |
|
| key |
string |
The label's |
k8s.aliyun.com/incident |
| value |
string |
The label's |
inquiring |
| approved_label |
object |
The configuration for the approval-granted label. To approve this repair step, apply this label to the node. After the step completes, ACK automatically removes both the request and approval labels. If you fail to apply this label in time, ACK will not proceed with the repair, and the node may remain impaired. |
|
| key |
string |
The label's |
k8s.aliyun.com/incident |
| value |
string |
The label's |
approved |
| config |
object |
The configuration parameters for the procedure. |
|
|
any |
Additional configuration parameters for the procedure. |
{ "drain_timeout": "30s" } |
Examples
Success response
JSON format
{
"items": [
{
"name": "test",
"id": "r-xxxxx",
"resource_type": "nodepool",
"resource_sub_type": "ess",
"resource_ids": [
"npxxxxx"
],
"rules": [
{
"incidents": [
{
"name": "Node.FaultNeedReboot.HOST",
"type": "system\n"
}
],
"repair_procedure": [
{
"name": "QuarantineGPU",
"intervention": {
"enable": true,
"type": "label",
"inquiring_label": {
"key": "k8s.aliyun.com/incident",
"value": "inquiring"
},
"approved_label": {
"key": "k8s.aliyun.com/incident",
"value": "approved"
}
},
"config": {
"key": "{\n\"drain_timeout\": \"30s\" \n}"
}
}
]
}
]
}
]
}
Error codes
See Error Codes for a complete list.
Release notes
See Release Notes for a complete list.