Data security in transit
PAI supports VPC direct connections to deployed model services through the Python SDK or custom call logic. VPC network isolation protects data in transit.
VPC security is covered in the Security system overview.
Data security at rest
PAI provides a dataset management module that supports importing public datasets, Alibaba Cloud service data, or OSS folder scans to create indexed datasets for labeling and training.
PAI integrates with Alibaba Cloud storage services such as OSS and NAS to store datasets and models with high availability and reliability.
|
Storage medium |
Documentation |
|
OSS |
|
|
NAS |
|
|
Cloud disk |
|
|
MaxCompute |
Data backup and recovery
PAI uses images to back up and recover environments and datasets. Save an instance as an image to capture its environment and configurations, then redeploy to prevent data loss. For datasets on other storage media, use the backup and recovery features of the corresponding service.
|
Storage medium |
Documentation |
|
OSS |
|
|
NAS |
|
|
MaxCompute |
PAI also provides EasyCkpt, a checkpoint framework for PyTorch foundation model training. EasyCkpt provides lossless saving and recovery of training progress with near-zero overhead, and supports Megatron and DeepSpeed.