All Products
Search
Document Center

OpenLake:Prerequisites

Last Updated:Jan 19, 2026

Alibaba Cloud OpenLake is an integrated solution for big data, search, and AI built on an open, controllable data lakehouse. It uses Data Lake Formation (DLF) to manage structured, semi-structured, and unstructured data. DLF provides secure access to lakehouse tables and files and offers I/O acceleration. The solution supports multi-engine integration and peer-to-peer collaborative computing. It uses DataWorks for unified development and ensures large-scale task scheduling. You can activate all cloud products in the OpenLake solution with a single click. After you complete the purchase, an experience environment for the OpenLake solution is initialized. Resource instances are created, and best practice cases are imported into the environment. This helps you explore the solution's capabilities.

Solution description

The Alibaba Cloud OpenLake solution is built on an open, controllable data lakehouse. It provides integrated services for big data, search, and AI. It uses a public data lake based on Object Storage Service (OSS) and combines it with the Data Lake Formation (DLF) data management platform. This supports the management of structured, semi-structured, and unstructured data. DLF ensures secure access to data tables and files and provides Create, Read, Update, and Delete (CRUD) and I/O acceleration capabilities. The solution supports multi-engine integration for big data, search, and AI. This enables peer-to-peer collaborative computing among engines. Using the DataWorks integrated development environment (IDE) or a Notebook, you can perform unified SQL or Python development across multiple engines. It also provides visual scheduling for multitasking and guarantees large-scale concurrent execution. You can easily build OpenLake data lake tables and perform data operations across different compute engines. By building multi-modal indexes, you can expose data for search and retrieval-augmented generation (RAG) capabilities. In the same development environment, you can combine AI feature engineering, model training, and online prediction to improve data processing and analysis efficiency.

To help you quickly use the full capabilities of the OpenLake solution, you can activate the required products and initialize an experience environment with a single click.

Prerequisites

  • Only an Alibaba Cloud account or a Resource Access Management (RAM) user with AdministratorAccess permissions can activate the OpenLake solution. For more information, see Product and console access control details: RAM Policy.

  • A free trial quota is only available to users who have completed enterprise identity verification. Users who have not completed enterprise identity verification can still activate the OpenLake solution with one click through the Free Trial entry point. However, the associated OpenLake product services are billed on a pay-as-you-go basis by default.

Product list

The OpenLake free trial activates the products in the following list for you:

Category

Product

Development platform

DataWorks (DataWorks billing, DataWorks Basic Edition, DataWorks general-purpose resource group), Platform for AI (PAI)

Storage service

Data Lake Formation (DLF), Object Storage Service (OSS)

Computing resources

MaxCompute, Hologres, EMR Serverless Spark, EMR Serverless StarRocks, Realtime Compute for Apache Flink, OpenSearch - Vector Search Edition

Note

The usernames and passwords for some of the products activated with a single click are as follows:

  • OpenSearch account credentials: Username: admin, Password: admin123.

  • StarRocks account credentials: Username: admin, Password: Admin@01.