All Products
Search
Document Center

Platform For AI:Data security

Last Updated:Apr 02, 2024

Data transmission security

To enhance data transmission security for a model service that you deployed in the Elastic Algorithm Service (EAS) module of Platform for AI (PAI), you can access the service through Virtual Private Cloud (VPC) direct connections by using the official SDK for Python or implementing custom logic. VPCs are isolated from each other, which ensures private high-speed access. You can also use HTTPS to secure data transmission in EAS. You need to only replace the http prefix in the service endpoint with https.

For information about VPC security, see Security system overview.

Data storage security

PAI provides a dataset management module that allows you to create datasets by using public datasets or data that is stored in Alibaba Cloud storage services. The module also allows you to scan Object Storage Service (OSS) objects to generate index datasets that you can use for intelligent labeling and model training.

To ensure the storage security of data files such as datasets and models, we recommend that you use the following Alibaba Cloud storage services:

Data storage service

Related topic

OSS

Security and compliance

Apsara File Storage NAS

Functions and features

Cloud disks

Disks

MaxCompute

FAQ about security configurations

Data backup and restoration

PAI allows you to use images to back up and restore datasets and development environments. For example, you can create an image to save the data and development environment of an instance. This way, you can deploy the image based on your business requirements without data loss. When you use Alibaba Cloud storage services to store data or datasets, backup and restoration is guaranteed by the corresponding service.

Data storage service

Related topic

OSS

Data backup and recovery

Apsara File Storage NAS

Back up and restore files

MaxCompute

Backup and restoration

When you use PyTorch to train a foundation model, you can use EasyCkpt to save and resume the model training progress to minimize data loss and resource waste. EasyCkpt is a high-performance checkpoint framework provided by PAI that is compatible with popular model training frameworks such as Megatron and DeepSpeed. For more information, see Use EasyCkpt to save and resume foundation model trainings.