All Products
Search
Document Center

Platform For AI:Guide to Using NIM models

Last Updated:May 14, 2025

Alibaba Cloud Platform for AI (PAI) is an authorized NIM partner of NVIDIA in China.

NIM is a set of easy-to-use prebuilt container tools introduced by NVIDIA, aimed at helping enterprise customers securely and reliably deploy high-performance AI models in the cloud, data centers, and workstations. NIM models are performance-optimized models processed by the NIM optimization tool, offering significantly improved inference performance compared to original open source models.

Model Gallery of PAI provides multiple NIM models. To obtain NIM models, select NIM for Model Source in the left-side filter pane of Model Gallery. You can deploy NIM models by using one of the following methods:

Supported NIM models

The following table describes the NIM models that can be deployed in Model Gallery of PAI.

Model name

Model page in Model Gallery

Supported model type for NIM model inference optimization

qwen2.5-7b-instruct-NIM

qwen2.5-7b-instruct-NIM

ecs.gn7e

ecs.gn8is

MolMIM

MolMIM

General-purpose GPU

Earth-2 FourCastNet

FourCastNet

General-purpose GPU

NVIDIA Retrieval QA Mistral 7B Embedding v2

NV-EmbedQA-Mistral-7b-v2

ecs.gn7e edition

Eye Contact

eyecontact

General-purpose GPU

NV-CLIP

nvclip

  • ecs.gn7e edition

  • ecs.gn7i edition

AlphaFold2-Multimer

AlphaFold2-Multimer

General-purpose GPU

Snowflake Arctic Embed Large Embedding

Arctic-embed-l

  • ecs.gn7e edition

  • ecs.gn7i edition

NVIDIA Retrieval QA Mistral 4B Reranking v3

NV-RerankQA-Mistral-4B-v3

  • ecs.gn7e edition

  • ecs.gn7i edition

NVIDIA Retrieval QA E5 Embedding v5

NV-EmbedQA-E5-v5

  • ecs.gn7e edition

  • ecs.gn7i edition

Parakeet CTC Riva 1.1b

Parakeet-CTC-1.1b-asr

General-purpose GPU

FastPitch HifiGAN Riva

FastPitchHifi-GAN

General-purpose GPU

VISTA-3D

Vista-3D

General-purpose GPU

AlphaFold2

AlphaFold2

General-purpose GPU

ProteinMPNN

ProteinMPNN

General-purpose GPU

megatron-1b-nmt

megatron-1b-nmt

General-purpose GPU

Deploy a model in Model Gallery of PAI

  1. Go to Model Gallery.

  2. In the left-side filter pane, select NIM for Model Source to search for NIM models.

    image.png

  3. Find the model that you want to manage and click the model card. In the upper-right corner of the Overview tab, click Deploy.

    Note

    Before you deploy an NIM model in PAI, make sure that you are an NVIDIA Al Enterprise user or an NVIDIA Developer Program user.

    image.png

  4. In the Deploy panel, configure the resources and click Deploy. For information about how to call the deployed model service, see the introduction on the Overview tab of the model details page.

    image.png

Deploy a model on your on-premises machine

You can download the image and model files of an NIM model and deploy the model on your on-premises machine. Before you deploy an NIM model in PAI, make sure that you are an NVIDIA Al Enterprise user or an NVIDIA Developer Program user.

  1. Configure the environment. For more information, see Getting Started.

  2. On the Overview tab of the model details page, click Resource Download. In the dialog box that appears, select the download statements and click OK to obtain the image URL and model URL.

  3. Run the following command to pull the image: (Replace ${Image URL} with the actual image URL.)

    docker pull ${Image URL}
  4. Download the model file using the Alibaba Cloud tool ossutil.

  5. Run the following commands to start the container: (In this example, the /local/model/ directory is used. Replace ${Model URL} with the actual model URL and replace ${Image URL} with the actual image URL.)

    docker run --rm \
    --runtime=nvidia \
    --gpus all \
    -u $(id -u) \
    -v /local/model/:${Model URL} ${Image URL}

Appendix: Activation process for using PAI

For users new to Alibaba Cloud, the following steps outline the initial setup process for using Model Gallery of PAI:

  1. Visit Alibaba Cloud, click Sign Up in the upper-right corner, and then enter your account and password, or register a new user.

    image.png

  2. After logon and real-name verification, visit Platform for AI (PAI).

    If you use PAI for the first time, you must complete real-name verification and authorization. Retain the default settings for all options and click Activate. After a few seconds, PAI is activated. Then, you can access the default workspace for deployment and usage.

    image.png