All Products
Search
Document Center

Cloud Backup:Add a data source

Last Updated:Sep 21, 2023

You must add a data source the first time you archive data. The following data sources are supported: on-premises NAS file systems, Isilon (PowerScale) NAS file systems, Hadoop Distributed File System (HDFS) file systems, and S3-Compatible Storage buckets. This topic describes how to use Cloud Backup to add a data source for archiving.

Prerequisites

Cloud Backup is authorized, and an Cloud Backup client is installed. For more information, see Prepare for data archiving.

Procedure

  1. Log on to the Cloud Backup console.

  2. In the left-side navigation pane, click Archive.

  3. In the top navigation bar, select a region.

  4. On the Analyze and Archive tab, click Create Data Source.

  5. In the Create Data Source panel, configure the parameters and click Next.

    • Set the Source Type parameter to Local NAS.

      1. Configure the parameters described in the following table.

        Parameter

        Description

        Source Type

        The type of the data source. Select Local NAS.

        NAS Type

        The type of the NAS file system. Valid values:

        • Isilon (PowerScale): Isilon (PowerScale) NAS file systems.

        • Other NAS: other NAS file systems.

        NAS Instance Name

        The name of the NAS file system.

        NAS Management Username

        This parameter is required only if you set the NAS Type parameter to Isilon (PowerScale). This parameter specifies the username of the account that is used to manage the NAS file system.

        NAS Management Password

        This parameter is required only if you set the NAS Type parameter to Isilon (PowerScale). This parameter specifies the password of the account.

        NAS Network Address

        The network address of the NAS file system.

        NAS Management Port

        This parameter is required only if you set the NAS Type parameter to Isilon (PowerScale). This parameter specifies the number of the port that is used to manage the NAS file system.

        NAS Share Path

        The name of the NAS directory from which you want to archive data.

        The name can contain letters, digits, and the following special characters: , - _ = / . : \.

        • If you set the NAS Type parameter to Isilon (PowerScale), the directory is relative to the /ifs directory. For example, if you enter /myshare, Cloud Backup archives data from the /ifs/myshare directory.

        • If you set the NAS Type parameter to Other NAS, the directory is relative to the / root directory. For example, if you enter /myshare, Cloud Backup archives data from the /myshare directory.

        Protocol Type

        The protocol type of the NAS file system. Valid values:

        • NFS: Network File System (NFS) protocol

        • SMB: Server Message Block (SMB) protocol

        Important

        If you want to archive data from an Apsara File Storage NAS file system, configure the vers parameter in the Advanced Settings section.

      2. Optional. Click Advanced Settings and then click +Set Mount Parameters.

        The following table describes the mount parameters that you can configure.

        Parameter

        Description

        vers

        The protocol version of the file system.

        • vers=3: uses NFSv3 to mount the file system.

        • vers=4: uses NFSv4 to mount the file system.

        • vers=4.0: uses NFSv4.0 to mount the file system.

        nolock

        Specifies whether to enable file locking.

        proto

        The protocol that you want to use to mount the file system.

        rsize

        The size of each data block that a client can read from the file system.

        Recommended value: 1048576. Unit: bytes.

        wsize

        The size of each data block that a client can write to the file system.

        Recommended value: 1048576. Unit: bytes.

        hard

        Specifies that applications no longer access a file system when the file system is unavailable and access the file system again when the file system becomes available. We recommend that you enable this parameter.

        timeo

        The period of time for which the NFS client waits before the client retries to send a request. Unit: deciseconds (tenths of a second).

        Recommended value: 600 (60 seconds).

        retrans

        The number of retries after the NFS client fails to send a request.

        Recommended value: 2.

    • Set the Source Type parameter to HDFS.

      Configure the parameters described in the following table.

      Parameter

      Description

      Source Type

      The type of the data source. Select HDFS.

      Data Source Name

      The name of the HDFS data source. You can specify a name based on your business requirements. Example: back-end-hdfs.

      NameNode Network Address

      The network address of an HDFS primary server.

      NameNode serves as a primary server that is used to manage the namespaces of HDFS file systems and control access from clients to files in the file systems.

      NameNode Port

      The port number of the HDFS primary server.

      Secondary NameNode Network Address

      The network address of a secondary HDFS node.

      A secondary HDFS node helps the primary server perform management tasks.

      Secondary NameNode Port

      The port number of the secondary HDFS node.

      HDFS Username

      The name of an HDFS user.

      Note

      Make sure that the specified HDFS user has sufficient permissions. Otherwise, you may be unable to read files when you archive data, write files when you retrieve data, or restore the information of user groups. We recommend that you set this parameter to hadoop or hdfs.

    • Set the Source Type parameter to S3-Compatible Storage.

      Configure the parameters described in the following table.

      Parameter

      Description

      Source Type

      The type of the data source. Select S3-Compatible Storage.

      Data Source Name

      The name of the S3 data source. You can specify a name based on your business requirements. Example: awss3.

      Buckets

      The name of the S3-Compatible Storage bucket.

      Access Points

      The access point of the bucket that can be used to perform operations on S3 objects. For more information, see AWS S3.

      Access Key ID

      The security credential for accessing S3-Compatible Storage as an Identity and Access Management (IAM) user.

      For more information, see AWS S3.

      Secret Access Key

      The security credential for accessing S3-Compatible Storage as an IAM user. For more information, see AWS S3.

  6. Associate a client group and click Next.

    You can add multiple clients to a client group to concurrently run an archive job. You can also select an existing client group. In this example, set the Client Group From parameter to Create Backup Client Group. Then, enter a name for Client Group Name and select the clients that you want to add to the client group.

  7. Configure a data analysis plan and click OK.

    In the Configure Analysis Plan step, configure the parameters. The following table describes the parameters.

    Parameter

    Description

    Enable Data Source Analysis

    Specifies whether to enable the data analysis feature. If you turn on Enable Data Source Analysis, Cloud Backup analyzes data after the data source is added. You can use Cloud Backup to scan, analyze, or search for a data source only if the data analysis feature is enabled for the data source.

    Note

    If you turn off Enable Data Source Analysis, Cloud Backup directly archives data from the data source.

    Meta Index Start Time

    The time when Cloud Backup starts to perform an index operation on metadata.

    Meta Index Interval

    The interval at which Cloud Backup performs index operations on metadata. Unit: days or weeks.

    After you add a data source, the data source is displayed on the Analyze and Archive tab.

Related operations

You can click More next to an existing data source and select the required operation. The following table describes the available operations.

Operation

Description

Configure Analysis Plan

Configures a data analysis plan for the data source.

Run Meta Indexing

Creates an index for the data source. This way, you can efficiently analyze and search for data.

View Data Source

Views the details of the data source. The details include the type, NAS network address, NAS share path, and backup client group.

Edit Data Source

Modifies the parameters of the data source.

Unregister Data Source

Unregisters a data source. If you no longer require an archive plan, you can perform this operation.

Edit Backup Clients Group

Changes the name of a client or a client group.

What to do next

Analyze a data source