All Products
Search
Document Center

Platform For AI:NIM models

Last Updated:Apr 08, 2026

Alibaba Cloud Platform for AI (PAI) is an authorized NVIDIA NIM partner in China.

NVIDIA NIM provides pre-built containers for deploying AI models for inference on the cloud, in data centers, and on workstations. NIM models are optimized with NIM optimization tools to deliver inference performance improvements over their original open-source counterparts.

PAI offers several NIM models in Model Gallery. To find them, select NIM from the Model Source filter. Two deployment methods are available:

Supported NIM models

PAI Model Gallery supports the following NIM models:

Model name

Model Gallery link

Instance types

qwen2.5-7b-instruct-NIM

qwen2.5-7b-instruct-NIM

ecs.gn7e series

ecs.gn8is series

MolMIM

MolMIM

General-purpose GPU instance

Earth-2 FourCastNet

FourCastNet

General-purpose GPU instance

NVIDIA Retrieval QA Mistral 7B Embedding v2

NV-EmbedQA-Mistral-7b-v2

ecs.gn7e series

Eye Contact

eyecontact

General-purpose GPU instance

NV-CLIP

nvclip

ecs.gn7e series

ecs.gn7i series

AlphaFold2-Multimer

AlphaFold2-Multimer

General-purpose GPU instance

Snowflake Arctic Embed Large Embedding

Arctic-embed-l

ecs.gn7e series

ecs.gn7i series

NVIDIA Retrieval QA Mistral 4B Reranking v3

NV-RerankQA-Mistral-4B-v3

ecs.gn7e series

ecs.gn7i series

NVIDIA Retrieval QA E5 Embedding v5

NV-EmbedQA-E5-v5

ecs.gn7e series

ecs.gn7i series

Parakeet CTC Riva 1.1b

Parakeet-CTC-1.1b-asr

General-purpose GPU instance

FastPitch HifiGAN Riva

FastPitchHifi-GAN

General-purpose GPU instance

VISTA-3D

Vista-3D

General-purpose GPU instance

AlphaFold2

AlphaFold2

General-purpose GPU instance

ProteinMPNN

ProteinMPNN

General-purpose GPU instance

megatron-1b-nmt

megatron-1b-nmt

General-purpose GPU instance

Deploy from PAI Model Gallery

  1. Go to the PAI Model Gallery.

  2. On the left, set Model Source to NIM to filter the available NIM models.

    image

  3. Select a NIM model to open its details page, and then click Deploy in the upper-right corner. Deploying NIM models in PAI requires membership in the NVIDIA AI Enterprise or NVIDIA Developer Program.

    image

  4. Configure deployment settings such as compute resources, and then click Deploy to create the model service. For service invocation details, see the model description.

    image.png

Deploy on-premises

Download the image and model files to deploy a NIM model on-premises. This requires membership in the NVIDIA AI Enterprise or NVIDIA Developer Program.

  1. Set up your environment. See NVIDIA's Getting Started documentation.

  2. On the model details page, click Resource Download. Review and accept the NVIDIA license agreement to obtain the image and model download URLs.

  3. Pull the container image. Replace ${IMAGE_URL} with the image download URL.

    docker pull ${IMAGE_URL}
  4. Download the model files by using ossutil.

  5. Start the container. In this example, model files are saved to /local/model/. Replace ${MOUNT_PATH} with the mount path inside the container and ${IMAGE_URL} with the image download URL.

    docker run --rm \
    --runtime=nvidia \
    --gpus all \
    -u $(id -u) \
    -v /local/model/:${MOUNT_PATH} ${IMAGE_URL}

Get started with PAI

New Alibaba Cloud users can follow these steps to access PAI Model Gallery:

  1. Go to the Alibaba Cloud homepage. Click Sign in in the upper-right corner to log in or register.

    image.png

  2. After logging in and completing real-name verification, go to Platform for AI (PAI).

    First-time PAI users must complete real-name verification and grant the required authorizations. Keep the default settings and confirm activation. After activation completes, go to the default workspace to deploy models.

    image.png