This topic describes the features of vGPU-accelerated instance families of Elastic Compute Service (ECS) and lists the instance specifications of each instance family.

sgn7i-vws, vGPU-accelerated instance family with shared CPUs

Features:
  • This instance family uses the third-generation SHENLONG architecture to provide predictable and consistent ultra-high performance. This instance family utilizes fast path acceleration on chips to improve storage performance, network performance, and computing stability by an order of magnitude. This way, data storage and model loading can be performed more quickly.
  • Instances of the sgn7i-vws instance family share CPU and network resources to maximize the utilization of underlying resources. Each instance has exclusive access to its memory and GPU memory to ensure data isolation and high performance.
    Note If you want to use exclusive CPU resources, select the vgn7i-vws instance family.
  • This instance family comes with a NVIDIA GRID vWS license and provides certified graphics acceleration capabilities for Computer Aided Design (CAD) software to meet the requirements of professional graphic design. Instances of this instance family can serve as lightweight GPU-accelerated compute-optimized instances to reduce the costs of small-scale AI inference tasks.
  • Compute:
    • Uses NVIDIA A10 GPUs that have the following features:
      • Innovative NVIDIA Ampere architecture
      • Support for acceleration features (such as vGPU, RTX, and TensorRT) to provide diversified business support
    • Uses 2.9 GHz Intel® Xeon® Scalable (Ice Lake) processors that deliver an all-core turbo frequency of 3.5 GHz.
  • Storage:
    • Is an instance family in which all instances are I/O optimized.
    • Supports only enhanced SSDs (ESSDs).
      Note For more information about the performance of cloud disks, see EBS performance.
  • Network:
    • Supports IPv6.
    • Provides high network performance based on large computing capacity.
  • Supported scenarios:
    • Concurrent AI inference tasks that require high-performance CPUs, memory, and GPUs, such as image recognition, speech recognition, and behavior identification
    • Compute-intensive graphics processing tasks that require high-performance 3D graphics virtualization capabilities, such as remote graphic design and cloud gaming
    • 3D modeling in fields that require the use of Ice Lake processors, such as animation and film production, cloud gaming, and mechanical design
Instance types
Instance typevCPUsMemory (GiB)GPUGPU memoryNetwork baseline/burst bandwidth (Gbit/s)Packet forwarding rate (pps)Network interface controller (NIC) queuesElastic network interfaces (ENIs)
ecs.sgn7i-vws-m2.xlarge415.5NVIDIA A10 × 1/1224GB × 1/121.5/5500,00042
ecs.sgn7i-vws-m4.2xlarge831NVIDIA A10 × 1/624GB × 1/62.5/101,000,00044
ecs.sgn7i-vws-m8.4xlarge1662NVIDIA A10 × 1/324GB × 1/35/202,000,00084
ecs.sgn7i-vws-m2s.xlarge48NVIDIA A10 × 1/1224GB × 1/121.5/5500,00042
ecs.sgn7i-vws-m4s.2xlarge816NVIDIA A10 × 1/624GB × 1/62.5/101,000,00044
ecs.sgn7i-vws-m8s.4xlarge1632NVIDIA A10 × 1/324GB × 1/35/202,000,00084
Note

vgn7i-vws, vGPU-accelerated instance family

Features:
  • This instance family uses the third-generation SHENLONG architecture to provide predictable and consistent ultra-high performance. This instance family utilizes fast path acceleration on chips to improve storage performance, network performance, and computing stability by an order of magnitude. This way, data storage and model loading can be performed more quickly.
  • This instance family comes with a NVIDIA GRID vWS license and provides certified graphics acceleration capabilities for CAD software to meet the requirements of professional graphic design. Instances of this instance family can serve as lightweight GPU-accelerated compute-optimized instances to reduce the costs of small-scale AI inference tasks.
  • Compute:
    • Uses NVIDIA A10 GPUs that have the following features:
      • Innovative NVIDIA Ampere architecture
      • Support for acceleration features (such as vGPU, RTX, and TensorRT) to provide diversified business support
    • Uses 2.9 GHz Intel® Xeon® Scalable (Ice Lake) processors that deliver an all-core turbo frequency of 3.5 GHz.
  • Storage:
    • Is an instance family in which all instances are I/O optimized.
    • Supports only ESSDs.
      Note For more information about the performance of cloud disks, see EBS performance.
  • Network:
    • Supports IPv6.
    • Provides high network performance based on large computing capacity.
  • Supported scenarios:
    • Concurrent AI inference tasks that require high-performance CPUs, memory, and GPUs, such as image recognition, speech recognition, and behavior identification
    • Compute-intensive graphics processing tasks that require high-performance 3D graphics virtualization capabilities, such as remote graphic design and cloud gaming
    • 3D modeling in fields that require the use of Ice Lake processors, such as animation and film production, cloud gaming, and mechanical design
Instance types
Instance typevCPUsMemory (GiB)GPUGPU memoryNetwork bandwidth (Gbit/s)Packet forwarding rate (pps)NIC queuesENIs
ecs.vgn7i-vws-m4.xlarge430NVIDIA A10 × 1/624GB × 1/631,000,00044
ecs.vgn7i-vws-m8.2xlarge1062NVIDIA A10 × 1/324GB × 1/352,000,00086
ecs.vgn7i-vws-m12.3xlarge1493NVIDIA A10 × 1/224GB × 1/283,000,00086
ecs.vgn7i-vws-m24.7xlarge30186NVIDIA A10 × 124GB × 1166,000,000128
Note

vgn6i and vgn6i-vws, vGPU-accelerated instance families

vgn6i-vws:
  • In light of the NVIDIA GRID driver upgrade, Alibaba Cloud upgrades the vgn6i instance family to vgn6i-vws instance family. The vgn6i-vws instance family uses the latest NVIDIA GRID driver and provides a NVIDIA GRID vWS license. Submit a ticket to apply for free images that have the NVIDIA GRID driver pre-installed.
  • To use other public images or custom images that do not contain a NVIDIA GRID driver, submit a ticket to apply for the GRID driver file and install the NVIDIA GRID driver separately. Alibaba Cloud does not charge additional fees for the license of the GRID driver.
vgn6i:
  • If you want your vgn6i instance to support graphics features such as Open Graphics Library (OpenGL), you must purchase a GRID license from NVIDIA. After the instance is created, manually install a GRID driver and activate the license.
  • Compute:
    • Uses NVIDIA T4 GPUs.
    • Uses vGPUs.
      • Supports the 1/4 and 1/2 computing capacity of NVIDIA Tesla T4 GPUs.
      • Supports 4 GB and 8 GB of GPU memory.
    • Offers a CPU-to-memory ratio of 1:5.
    • Uses 2.5 GHz Intel® Xeon® Platinum 8163 (Skylake) processors.
  • Storage:
    • Is an instance family in which all instances are I/O optimized.
    • Supports only standard SSDs and ultra disks.
  • Network:
    • Supports IPv6.
    • Provides high network performance based on large computing capacity.
  • Supported scenarios:
    • Real-time rendering for cloud gaming
    • Real-time rendering for AR and VR applications
    • AI (deep learning and machine learning) inference for elastic Internet service deployment
    • Educational environment of deep learning
    • Modeling experiment environment of deep learning
Instance types
Instance typevCPUsMemory (GiB)GPUGPU memoryNetwork bandwidth (Gbit/s)Packet forwarding rate (pps)NIC queues (primary ENI/secondary ENI)ENIsPrivate IP addresses per ENI
ecs.vgn6i-m4.xlarge423NVIDIA T4 × 1/416GB × 1/42500,0004/2310
ecs.vgn6i-m8.2xlarge1046NVIDIA T4 × 1/216GB × 1/24800,0008/2410
ecs.vgn6i-m4-vws.xlarge423NVIDIA T4 × 1/416GB × 1/42500,0004/2310
ecs.vgn6i-m8-vws.2xlarge1046NVIDIA T4 × 1/216GB × 1/24800,0008/2410
ecs.vgn6i-m16-vws.5xlarge2092NVIDIA T4 × 116GB × 17.51,200,0006410
Note

vgn5i, vGPU-accelerated instance family

Features:
  • If you want your vgn5i instance to support graphics features such as OpenGL, you must purchase a GRID license from NVIDIA. After the instance is created, manually install a GRID driver and activate the license.
  • Compute:
    • Uses NVIDIA P4 GPUs.
    • Uses vGPUs.
      • Supports the 1/8, 1/4, 1/2, and 1/1 computing capacity of NVIDIA Tesla P4 GPUs.
      • Supports 1 GB, 2 GB, 4 GB, and 8 GB of GPU memory.
    • Offers a CPU-to-memory ratio of 1:3.
    • Uses 2.5 GHz Intel® Xeon® E5-2682 v4 (Broadwell) processors.
  • Storage:
    • Is an instance family in which all instances are I/O optimized.
    • Supports standard SSDs and ultra disks.
  • Network:
    • Supports IPv6.
    • Provides high network performance based on large computing capacity.
  • Supported scenarios:
    • Real-time rendering for cloud gaming
    • Real-time rendering for AR and VR applications
    • AI (deep learning and machine learning) inference for elastic Internet service deployment
    • Educational environment of deep learning
    • Modeling experiment environment of deep learning
Instance types
Instance typevCPUsMemory (GiB)GPUGPU memoryNetwork bandwidth (Gbit/s)Packet forwarding rate (pps)NIC queuesENIsPrivate IP addresses per ENI
ecs.vgn5i-m1.large26NVIDIA P4 × 1/88GB × 1/81300,000226
ecs.vgn5i-m2.xlarge412NVIDIA P4 × 1/48GB × 1/42500,0002310
ecs.vgn5i-m4.2xlarge824NVIDIA P4 × 1/28GB × 1/23800,0002410
ecs.vgn5i-m8.4xlarge1648NVIDIA P4 × 18GB × 151,000,0004520
Note