All Products
Search
Document Center

Container Service for Kubernetes:Configure datasets and source code repositories for a training job

Last Updated:Nov 29, 2023

This topic describes how to configure datasets and source code repositories for a training job.

Prerequisites

Configure a dataset

  1. Log on to AI Developer Console. For more information, see Step 2: Log on to AI Developer Console.

  2. In the left-side navigation pane of AI Developer Console, click Data Config.

  3. On the Data Config page, click New Data Configuration.

  4. On the New Data Configuration page, set Name, Namespace, and Persistent Volume Claim for the dataset and specify local directory based on your requirements.

    • For more information about PVCs, see Create a PVC.

    • When the ACK cluster runs the job, the ACK cluster mounts the local directory to the container in which the job runs. This enables the job to access the data and model stored in the local directory.

  5. Click Submit.

  6. After you complete the configuration, you can view the detailed information about the dataset on the Data tab of the Data Config page.

Configure a source code repository

  1. In the left-side navigation pane of the AI Developer Console, click Data Config.

  2. On the Data Config page, click New Git configuration.

  3. In the New Code Configuration dialog box, set Name, Git Repository, and Default Branch for the source code repository and specify local directory based on your requirements. When the ACK cluster runs the job, the ACK cluster mounts the local directory to the container in which the job runs. This enables the job to access the source code stored in the local directory.

  4. Click Submit.

  5. After you complete the configuration, you can view the detailed information about the dataset on the Data tab of the Data Config page.