All Products
Search
Document Center

Artificial Intelligence Recommendation:Activate and initialize the service

Last Updated:May 09, 2026

When you use PAI-Rec to build a recommendation system for the first time, purchase a PAI-Rec instance and configure the initial environment.

  • Selection guide

    Instance selection

    When you first use PAI-Rec, purchase a standard instance with the recommendation solution customization feature. After you become familiar with the service, you can purchase the operations tool feature.

    • Recommendation solution customization lets you customize feature engineering, recall strategies, and fine-ranking strategies to configure your recommendation system with greater flexibility and efficiency.

    • The operations tool improves operational efficiency and provides more control over recommendation results.

    Cloud product resource selection

    Building a PAI-Rec recommendation solution requires multiple cloud products. The specific cloud product resources required vary based on your business needs.

    Dependent cloud products (click to view details)

    image

    Cloud product

    Function

    Required cloud resources

    Modeling

    OSS

    Stores model checkpoints, saved model files, and model configuration files.

    Create an OSS bucket

    Note

    Do not enable the Versioning feature.

    MaxCompute

    Used for data cleansing, feature engineering, and preparing training samples.

    • Create a project

    • To use PAI-DLC to train models, activate Data Transmission Service. You need Data Transmission Service to read training data from MaxCompute tables. It is also used to initialize item features from MaxCompute tables when you initialize a model in PAI-EAS.

    Platform for AI

    PAI serves as the entry point for the PAI-Rec developer platform. It includes features such as PAI-FeatureStore association, model training, model exporting, and model evaluation.

    Note

    PAI and DataWorks workspaces are interconnected at the underlying layer. When you create a PAI workspace, a workspace with the same name is automatically generated in DataWorks.

    You can also manually create a DataWorks workspace.

    DataWorks

    Used for data cleansing, feature engineering, model training and evaluation, model updates, and data synchronization with online stores. It also schedules all offline data production, model training, and evaluation tasks.

    Engine

    ID and Database of the Hologres Instance

    A real-time feature storage engine.

    It can be used with FeatureDB. For example, use Hologres to store vector recall data, user exposure data, and u2i2i trigger data. Use FeatureDB to store offline and real-time features of users and items.

    Use PAI-FeatureStore

    A real-time feature storage engine.

    ApsaraDB for Redis Instance ID

    Stores fallback data. Can be replaced by FeatureDB in PAI-FeatureStore.

    PAI-EAS Resource Group

    Deploys the recommendation system engine to orchestrate processes such as recall, filtering, coarse-ranking, fine-ranking, and reranking. It also deploys the user-side vector inference service for vector recall and the model scoring service for coarse-ranking and fine-ranking.

    Monitoring and others

    Log Service

    Users can use SLS to manage request logs.

    DataHub Project

    Used for real-time log ingestion to continuously update user behavior for model training.

    We recommend that you prioritize using DataHub.

    Instance ID and Resource Group of Message Queue for Apache Kafka

    Flink VVP Streaming Service

    Processes real-time data and collects real-time feature statistics. The results can be written to a FeatureDB database.

    Suggestions for solutions

    Suggestions based on recommendation system complexity (click to view details)

    Note

    The complexity of a recommendation system's recall, filtering, model, and reranking processes is closely related to business requirements. We divide system development into the following stages: initial, intermediate, performance improvement, and operational intervention.

    Phases

    Description

    Recall model suggestions

    Ranking and reranking suggestions

    Initial stage

    Use Customized Recommendation Solution to build the entire recommendation pipeline. For more information, see PAI-Rec: Customizing recommendation algorithms.

    Use collaborative filtering (etrec), the Swing algorithm tool, and group-based hot item retrieval.

    Use FeatureDB to store user exposure filter data, recall data, and feature data.

    Use feature configuration (note the use of real-time sequence features) and sorting configuration to set up a single-objective multi-tower model. This model offers fast inference, good performance, and conserves PAI-EAS resources.

    Use a diversity reranking configuration.

    Intermediate stage

    Add vector recall and a multi-objective ranking model.

    Add vector recall. The item index does not need to be updated because the index is stored inside the processor. For more information, see the Faiss index section of the TorchEasyRec Processor documentation.

    For multiple prediction targets such as clicks, purchases, and likes, use the DBMTL multi-objective ranking model.

    Business needs to quickly perceive item changes

    Implement cold start for items.

    Provide real-time item feature feedback to the ranking model.

    Use the item cold-start algorithm. For more information, see Recommendation cold-start solution.

    Create a new recommendation solution customization. In the feature configuration, set up real-time statistics. Then, in PAI-FeatureStore, create a new feature view and a new model feature. Export the new training samples and train a new model.

    Operational intervention

    Set exposure ratios for different users and item categories.

    Ensure a minimum number of exposures for new items.

    Other suggestions (click to view details)

    • PAI-EAS: Configure scheduled scale-out for peak hours and automatic scale-in to reduce resources during off-peak hours. Consider combining subscription resources with elastic scaling resources.

Prerequisites

This topic uses an offline modeling scenario as an example. This scenario requires the following cloud product resources. For more information about other cloud product resources, see Cloud product resource selection.

Purchase a PAI-Rec instance and configure cloud products

  1. On the instance purchase page, set Region, Recommendation Solution Customization, Operations Tool, and Subscription Duration. Click Buy Now, confirm the order, and complete the payment.

  2. In the PAI-Rec management console, switch to the target region. In the navigation pane on the left, choose System Configuration > Cloud Service Configuration.

  3. On the Data Modeling tab, click Edit, select the cloud product resources you created, and then click Exit.

    The settings on the Engine and Monitoring and others tabs are similar. First, configure the required cloud resources, then associate them in the PAI-Rec console.

Why do I need to use an Alibaba Cloud account to access Cloud Product Configuration?

  • You must use a root account to access System Configuration > Cloud Service Configuration because a RAM user does not have sufficient permissions. For an explanation of account types, see Quick start: Create and authorize a RAM user. The configuration process involves actions that require root-level permissions, including: activating multiple services (such as PAI, DataWorks, MaxCompute, and OSS), creating projects or workspaces, and assigning the PAI-Rec service-linked role (aliyunserviceroleforpairec). Failure to correctly assign this role will cause subsequent operations to fail.