Resets and restores a cluster.

Description

You can call the operation to reset and restore a cluster only when the cluster is in the Exception state. You can call the ListClusters operation to query the ID and status of a cluster.

We recommend that you export all job data before you restore a cluster. When you reset and restore a cluster, take note of the following impacts:

  • The system disks of all nodes are changed. By default, new system disks are configured based on the settings that you specified when the cluster was created.
  • The data on the system disks and data disks of all cluster nodes is lost. The data includes user information, job information, scheduler queue information, and configuration data of auto-scaling queues. However, the data on Apsara File Storage NAS file systems is retained.
  • The self-managed queues in the cluster are deleted. All nodes are retained and migrated to the default queue of the cluster.

Debugging

OpenAPI Explorer automatically calculates the signature value. For your convenience, we recommend that you call this operation in OpenAPI Explorer. OpenAPI Explorer dynamically generates the sample code of the operation for different SDKs.

Request parameters

Parameter Type Required Example Description
Action String Yes RecoverCluster

The operation that you want to perform. Set the value to RecoverCluster.

ClusterId String Yes ehpc-hz-FYUr32****

The ID of the cluster. The cluster must be in the Exception state.

You can call the ListClusters operation to query the cluster ID and status.

OsTag String No CentOS_7.2_64

The image tag of the operating system.

You can call the ListImages and ListCustomImages operations to query the image tags supported by Elastic High Performance Computing (E-HPC).

AccountType String No nis

The service type of the domain account. Valid values:

  • nis
  • ldap

Default value: nis

SchedulerType String No pbs

The type of the scheduler. Valid values:

  • pbs
  • slurm
  • opengridscheduler
  • deadline

Default value: pbs

ImageOwnerAlias String No system

The type of the image. Valid values:

  • system: public image
  • self: custom image
  • others: shared image

Default value: system

ImageId String No m-bp18133n0335yq****

The ID of the image.

You can call the ListImages and ListCustomImages operations to query the images that are supported by E-HPC.

ClientVersion String No 1.0.76

The version of the E-HPC client. The default value is the latest version of the client.

You can call the ListCurrentClientVersion operation to query the current version of the E-HPC client.

Response parameters

Parameter Type Example Description
RequestId String 18FB21E3-F423-4B84-BB63-D8887A29****

The ID of the request.

TaskId String 18FB21E3-F423-4B84-BB63-D8887A29****

The ID of the task.

Examples

Sample requests

https://ehpc.cn-hangzhou.aliyuncs.com/?Action=RecoverCluster
&ClusterId=ehpc-hz-FYUr32****
&<Common request parameters>

Sample success responses

XML format

<RecoverClusterResponse>
      <TaskId>18FB21E3-F423-4B84-BB63-D8887A29****</TaskId>
      <RequestId>18FB21E3-F423-4B84-BB63-D8887A29****</RequestId>
</RecoverClusterResponse>

JSON format

{
	"TaskId": "18FB21E3-F423-4B84-BB63-D8887A29****",
	"RequestId": "18FB21E3-F423-4B84-BB63-D8887A29****"
}

Error codes

HttpCode Error code Error message Description
503 ServiceUnavailable The request has failed due to a temporary failure of the server The error message returned because the request has failed. The service is temporarily unavailable.

For a list of error codes, visit the API Error Center.