archive vaults offer low-cost storage for infrequently accessed data, enabling long-term retention and on-demand file retrieval. This topic describes how to use Cloud Backup to archive a data source.
Prerequisites
A data source has been added. For more information, see Add a data source.
Limitations
When you archive NAS files, the maximum length of a single file name is 248 bytes. If a file name exceeds this limit, the file cannot be archived or retrieved, and the job is marked as partially complete.
Archiving methods
Cloud Backup provides two methods for archiving a data source:
-
Use this method if you enabled Enable Data Source Analysis when adding the data source. This method allows you to analyze your data source before you archive it.
-
Use this method if you did not enable Enable Data Source Analysis when adding the data source. This method allows you to archive your data source without prior analysis.
Visual Analysis and Archive
Log on to the Cloud Backup console.
-
In the left-side navigation pane, choose Archive.
-
In the upper-left corner of the top navigation bar, select a region.
-
On the Analyze and Archive tab, click Archive for the target data source.
-
In the dialog box, click Visual Analysis and Archive.
-
In the Create Archive Plan panel, complete the following steps.
-
Configure one or more archive filters and then click Next.
You can set filter conditions such as directory name, file extension, modification time, access time, and file size to select the files to archive. To add multiple filters, click Add Archive Filter.
-
Configure the basic parameters and then click OK.
Parameter type
Parameter
Description
Basic Settings
Archive Plan Name
(Optional) The name of the Archive Plan. If you leave this parameter empty, the system assigns a random name.
Archive Start Time
The start time of the archive job. The time is accurate to the second.
Archive Run Interval
The interval at which the archive job runs. You can set the interval in days or weeks.
When file with the same path exists in archive vault
-
Skip This File: Skips the file and does not archive it.
-
overwrite files in archive vault: Archives the file again and overwrites the existing archived file.
-
save as another version: Archives the file again as a new version.
Bandwidth Throttling
Bandwidth throttling allows you to control the network traffic for archive jobs during peak business hours to avoid affecting your regular services. To use bandwidth throttling, specify a Time Range (Hour), enter a Max Bandwidth (MB), and then click Add.
Advanced Settings
Task Slice Size
A backup client group runs an archive job in parallel across multiple clients. This setting slices the job based on the number of files. Each slice, containing at least the specified number of files, is assigned to a client for archiving.
Minimum Number of Files Per Tasks
Valid values: 0 to 1,000,000. Default value: 0. If you leave this parameter empty, the default value is used.
Maximum Number of Files Per Tasks
Valid values: 10,000 to 1,000,000. Default value: 100,000. If you leave this parameter empty, the default value is used.
Minimum Amount of Data Per Task
Valid values: 0 to 102,400. Default value: 0. If you leave this parameter empty, the default value is used.
Maximum Amount of Data Per Task
Valid values: 256 to 102,400. Default value: 4,096. If you leave this parameter empty, the default value is used.
Configure Archive Vault
Configure Archive Vault
You can select an existing archive vault. If no archive vault is available, click Create Archive Vault and enter a name to create one. The name of the archive vault cannot exceed 64 bytes in length.
NoteAn archive vault is a cloud repository where Cloud Backup stores archived data. Multiple clients can archive data to the same vault. An archive vault is specific to a region. You can only select or create an archive vault in the current region.
Archive Vault Name
(Optional) The name of the archive vault. If you leave this parameter empty, the system assigns a random name.
Resource Group
This parameter is required only if you select Create Archive Vault for Configure Archive Vault. It specifies the Resource Group to which the archive vault belongs.
Resource groups are a mechanism for managing resources by group under a single Alibaba Cloud account. They help you simplify resource management and authorization for complex scenarios. For more information, see Create a resource group.
Storage Class
This parameter is required only if you select Create Archive Vault for Configure Archive Vault. It specifies the storage class of the archive vault. Valid values:
-
Archive
-
Cold Archive
For more information about storage classes, see Storage classes.
Archive Retention Policy
The data retention policy. Valid values:
-
Specify retention period: Retains archived data for a specific period. This period is determined by the Archive Retention Period.
-
Permanent: Retains archived data forever.
Archive Retention Period
This parameter is required only if you set Archive Retention Policy to Specify retention period. It specifies how long to retain the archived data. You can set the period in days, weeks, months, or years.
-
After an Archive Plan is created, you can find it in the list on the Archive Plan tab. On this tab, you can view details of the plan, such as the data source, plan name/ID, backup vault name/ID, archive file path, execution schedule, and plan status. You can manage the plan by clicking Edit or More.
-
Directly Archive
Log on to the Cloud Backup console.
-
In the left-side navigation pane, choose Archive.
-
In the upper-left corner of the top navigation bar, select a region.
-
On the Analyze and Archive tab, click Archive for the target data source.
-
In the dialog box, click Directly Archive.
-
In the Create Archive Plan panel, configure the following key parameters and click OK.
Parameter type
Parameter
Description
Configure Archive Vault
Configure Archive Vault
You can select an existing archive vault. If no archive vault is available, click Create Archive Vault and enter a name to create one. The name of the archive vault cannot exceed 64 bytes in length.
NoteAn archive vault is a cloud repository where Cloud Backup stores archived data. Multiple clients can archive data to the same vault. An archive vault is specific to a region. You can only select or create an archive vault in the current region.
Archive Vault Name
(Optional) The name of the archive vault. If you leave this parameter empty, the system assigns a random name.
Resource Group
This parameter is required only if you select Create Archive Vault for Configure Archive Vault. It specifies the Resource Group to which the archive vault belongs.
Resource groups are a mechanism for managing resources by group under a single Alibaba Cloud account. They help you simplify resource management and authorization for complex scenarios. For more information, see Create a resource group.
Storage Class
This parameter is required only if you select Create Archive Vault for Configure Archive Vault. It specifies the storage class of the archive vault. Valid values:
-
Archive
-
Cold Archive
For more information about storage classes, see Storage classes
Archive Retention Policy
The data retention policy. Valid values:
-
Specify retention period: Retains archived data for a specific period. This period is determined by the Archive Retention Period.
-
Permanent: Retains archived data forever.
Archive Retention Period
This parameter is required only if you set Archive Retention Policy to Specify retention period. It specifies how long to retain the archived data. You can set the period in days, weeks, months, or years.
Archive Plan
Archive Plan Name
(Optional) The name of the Archive Plan. If you leave this parameter empty, the system assigns a random name.
Archive File Path
The path to the source files to archive. Example: /myshare.
Archive File Rules
Rules for including or excluding files. The following rules are supported:
-
Include All Files: Archives all files in the specified path.
-
Include Files: Archives only the specified files in the path.
-
Exclude Files: Archives all files in the path except for the specified files.
The file list is a subpath of the archive path. The matching rules are as follows:
-
If the subpath starts with a forward slash (
/), the filter matches the full path under the archive path.For example, if the archive file path is /ifs/dataset and you specify /subdir/data, files and directories in the /ifs/dataset/subdir/data path are archived.
For example, if the archive file path is /ifs/dataset and you specify /abc*, files and directories that start with "abc" in the /ifs/dataset path are archived.
-
If the subpath does not start with a forward slash (
/), it is treated as a relative path. The filter matches any file or directory at any level under the archive path that matches the rule.For example, if the archive file path is /ifs/dataset and you specify abc*, any file or directory whose name starts with "abc" within the /ifs/dataset path is archived.
For example, if the archive file path is /ifs/dataset and you specify abc, any file or directory named "abc" within the /ifs/dataset path is archived.
-
Archive Start Time
The start time of the archive job. The time is accurate to the second.
Archive Run Interval
The interval at which the archive job runs. You can set the interval in days or weeks.
When file with the same path exists in archive vault
-
Skip This File: Skips the file and does not archive it.
-
overwrite files in archive vault: Archives the file again and overwrites the existing archived file.
-
save as another version: Archives the file again as a new version.
Enable Bandwidth Throttling
Bandwidth throttling allows you to control the network traffic for archive jobs during peak business hours to avoid affecting your regular services. To use bandwidth throttling, specify a Time Range (Hour), enter a Max Bandwidth (MB), and then click Add.
Advanced Settings
Backup Sub-task Slice Size
A backup client group can contain multiple clients that run an archive job in parallel. This setting divides the job into slices based on the number of files. Each slice, containing at least the specified number of files, is assigned to a client for archiving.
Minimum Number of Files Per Tasks
Valid values: 0 to 1,000,000. Default value: 0. If you leave this parameter empty, the default value is used.
Maximum Number of Files Per Tasks
Valid values: 10,000 to 1,000,000. Default value: 100,000. If you leave this parameter empty, the default value is used.
Minimum Amount of Data Per Task
Valid values: 0 to 102,400. Default value: 0. If you leave this parameter empty, the default value is used.
Maximum Amount of Data Per Task
Valid values: 256 to 102,400. Default value: 4,096. If you leave this parameter empty, the default value is used.
After the Archive Plan is created, it appears in the list on the Archive Plan tab.
-
Related operations
After you create an Archive Plan, you can perform the following operations:
|
Actions |
Description |
|
Run Now |
Runs the archive job immediately. |
|
Modify Plan |
Modifies the Archive Plan settings. |
|
Delete Plan |
Deletes the Archive Plan. Use this action when you no longer need to archive data. |