All Products
Search
Document Center

Platform For AI:LLM training and deployment

Last Updated:Mar 31, 2026

Deploy, fine-tune, evaluate, and process data for LLMs on PAI. Choose a workflow based on your goal: one-click deployment through Model Gallery, custom training with DSW and DLC, or large-scale distributed training on Lingjun.

Deploy models

PAI provides multiple deployment paths. Use Model Gallery for one-click deployment of popular open-source models, or use Elastic Algorithm Service (EAS) for custom deployment with advanced configurations such as accelerated inference engines and auto-scaling.

Model Gallery: one-click deployment

Model Gallery supports deploying models with built-in inference optimization. Each tutorial covers the full workflow from deployment to API invocation. Select a model series:

EAS: custom deployment

  • Quickly deploy LLMs in EAS - Deploy open-source LLMs via EAS with standard or accelerated inference modes. Supports WebUI and API access.

Lingjun: distributed serving at scale

  • Fully managed Qwen on Lingjun - End-to-end workflow for distributed training, three-stage instruction tuning, offline inference, and online deployment of Qwen models (7B to 72B) on serverless GPU clusters.

Fine-tune and train models

Adapt pre-trained LLMs to your domain or task. Model Gallery provides a no-code fine-tuning interface. For more control, use DSW notebooks for parameter-efficient fine-tuning (PEFT) or DLC for distributed full-parameter training.

Model Gallery: no-code fine-tuning

The Model Gallery tutorials listed in the Deploy models section also cover fine-tuning for each model. Select a model series above to get started.

DSW and DLC: custom training

  • Fine-tune a Llama3-8B model - Use PEFT techniques in a DSW notebook for cost-effective domain adaptation while preserving the base model's capabilities.

Advanced training techniques

  • Continued pre-training for LLMs - Adapt models to specific domains using unlabeled text data. Unlike fine-tuning (supervised), continued pre-training uses unsupervised learning to extend a model's domain knowledge.

  • Data augmentation and model distillation for LLMs - Transfer knowledge from large teacher models to smaller student models. Combines data augmentation, instruction refinement, and distillation to create efficient models that preserve performance.

Evaluate models

Compare the performance of foundation models, fine-tuned versions, and quantized versions to determine which meets your requirements. PAI supports automated evaluation using custom or public datasets such as MMLU and C-Eval.

Process training data

Machine Learning Designer provides algorithms for processing text, video, and image data to improve training data quality. Built-in templates are available and can be extended through secondary development.

Text data

Image and video data

Build LLM applications

Apply fine-tuned LLMs to production use cases. These tutorials demonstrate end-to-end workflows from data preparation through deployment.