Container Service for Kubernetes: Prepare environments

Last Updated: Mar 11, 2025

Introduction

This topic gives an overview of how to deploy and configure the key components of the AI ecosystem in Alibaba Cloud Container Service for Kubernetes (ACK). It covers the tools and services that support AI/ML workflows: infrastructure setup, model management, and storage integration. Complete these preparations so that your ACK cluster can run AI tasks efficiently and at scale.

Deploy the cloud-native AI suite

The cloud-native AI suite integrates AI and big data components such as deep learning frameworks, data storage solutions, and model management tools into one platform, so that you can manage and optimize AI task environments in a single place. For details about how to deploy the suite in an ACK cluster, including component installation, configuration, and usage, see Deploy the cloud-native AI suite.

Install the Arena client

Arena is a command-line tool developed by Alibaba Cloud that simplifies deploying and running machine learning tasks on Kubernetes. For details about how to install it, including environment configuration, component deployment, and the submission of training and inference jobs, see Configure the Arena client.
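
As a rough illustration of what submitting a training job looks like once the client is installed, the following sketch assembles an arena submit tfjob command line in Python and prints it instead of running it. The job name, image address, and training command are hypothetical placeholders, and the exact flags you need depend on your Arena version and job type, so treat this as an assumption-laden sketch rather than a reference.

```python
import shlex


def build_arena_submit(name, image, command, gpus=1):
    """Return the argument list for a standalone TensorFlow training job.

    All values passed in are placeholders for illustration only.
    """
    return [
        "arena", "submit", "tfjob",
        f"--name={name}",    # job name shown by `arena list`
        f"--gpus={gpus}",    # GPUs requested per worker
        f"--image={image}",  # container image that holds the training code
        command,             # the training command is one trailing argument
    ]


# Hypothetical job name, image, and command.
argv = build_arena_submit(
    name="demo-train",
    image="registry.example.com/demo/train:latest",
    command="python train.py",
)

# Print the shell-quoted command instead of executing it.
print(shlex.join(argv))
```

To actually submit the job, the list could be passed to subprocess.run(argv, check=True) on a machine where the Arena client and a valid kubeconfig are available.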

Configure MLflow Model Registry

MLflow is an open-source platform for model management and experiment tracking that supports deploying and managing machine learning models on Kubernetes. It covers the full model lifecycle, which makes models easier to trace and reuse. For details about how to set up an MLflow Model Registry in an ACK cluster, including component installation, configuration, and the process of uploading and managing models, see Configure MLflow Model Registry.
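
Once a registry is running, registering a model from a finished run might look like the sketch below. It assumes the mlflow Python package is installed on the client and that the tracking server is reachable at a URI you supply; the run ID, artifact path, and model name are hypothetical. The import is done lazily so that the URI helper can be used without MLflow installed.

```python
def model_uri(run_id, artifact_path="model"):
    """Build a runs:/ URI that points at a model logged by an MLflow run."""
    return f"runs:/{run_id}/{artifact_path}"


def register_run_model(tracking_uri, run_id, registered_name, artifact_path="model"):
    """Register a logged model under a name in the Model Registry.

    Requires the `mlflow` package and a reachable tracking server.
    """
    import mlflow  # imported here so model_uri() works without MLflow

    mlflow.set_tracking_uri(tracking_uri)
    return mlflow.register_model(model_uri(run_id, artifact_path), registered_name)


# Hypothetical run ID, for illustration only.
print(model_uri("0123456789abcdef"))
```

A call such as register_run_model("http://<tracking-server-address>", "<run-id>", "demo-model") would then create a new version of demo-model in the registry; the server address and run ID are yours to fill in.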

Configure external storage

External storage gives AI training and inference tasks persistent, reliably accessible data that outlives individual pods. For guidance on configuring external storage in an ACK cluster, using Alibaba Cloud services such as Object Storage Service (OSS) and File Storage NAS or third-party storage systems, see Configure external storage.
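
In Kubernetes, workloads typically consume external storage through a PersistentVolumeClaim (PVC). The sketch below builds a generic PVC manifest as a Python dictionary and prints it as JSON, which kubectl apply -f accepts alongside YAML. The claim name, storage class name, and size are hypothetical; the storage class that actually maps to OSS or NAS in your cluster depends on how storage is configured there.

```python
import json


def build_pvc(name, storage_class, size, namespace="default"):
    """Return a PersistentVolumeClaim manifest as a plain dictionary."""
    return {
        "apiVersion": "v1",
        "kind": "PersistentVolumeClaim",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            # ReadWriteMany lets several training pods mount the same data.
            "accessModes": ["ReadWriteMany"],
            "storageClassName": storage_class,
            "resources": {"requests": {"storage": size}},
        },
    }


# Hypothetical claim name, storage class, and capacity.
pvc = build_pvc("training-data", "example-nas-sc", "20Gi")
print(json.dumps(pvc, indent=2))
```

Saving the output to a file and applying it with kubectl creates the claim, which training or inference jobs can then mount by name.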

Summary

Together, the cloud-native AI suite, the Arena client, MLflow Model Registry, and external storage let you deploy and manage AI tasks on Kubernetes efficiently, which improves both development and operational productivity.