Data Lake on Alibaba Cloud

Store, manage, and analyze data of all sizes and types in real time

Alibaba Cloud Data Lake

Alibaba Cloud Data Lake allows you to store, manage, and analyze massive volumes of structured, semi-structured, and unstructured data, including streaming data, so that you can break down data silos and gain business insights. You can store data of all sizes and formats as-is, configure lifecycle rules that move data between hot and cold storage as business requirements change, and integrate your data lake with different computing and AI engines (such as MaxCompute, Hologres, Databricks, and Platform for AI) for unified batch and stream processing.
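The hot and cold tiering described above can be pictured as a rule that maps an object's age to a storage class. The sketch below is a minimal illustration in plain Python and is not an Alibaba Cloud API: the `TierRule` type, the `pick_storage_class` helper, and the 30-day and 180-day thresholds are all hypothetical. Only the class names (Standard, IA, Archive) are borrowed from the storage classes that Object Storage Service offers.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class TierRule:
    """Hypothetical rule: objects at least this old go to this class."""
    min_age_days: int
    storage_class: str


# Assumed thresholds for illustration only; real rules depend on access patterns.
RULES = [
    TierRule(min_age_days=180, storage_class="Archive"),
    TierRule(min_age_days=30, storage_class="IA"),
    TierRule(min_age_days=0, storage_class="Standard"),
]


def pick_storage_class(last_modified: date, today: date, rules=RULES) -> str:
    """Return the storage class of the oldest-threshold rule the object satisfies."""
    age_days = (today - last_modified).days
    # Check the strictest (oldest) threshold first.
    for rule in sorted(rules, key=lambda r: r.min_age_days, reverse=True):
        if age_days >= rule.min_age_days:
            return rule.storage_class
    # Fallback for objects with a modification date in the future.
    return "Standard"
```

In a real deployment, rules like these would be configured on the storage service itself rather than evaluated in application code; the function only shows the decision the rules encode.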

  • Limit-Free Storage

    Unified data storage with unlimited scalability integrates on-premises and cloud data at low cost and with no O&M.

  • Centralized Management

    Centralized metadata services with high compatibility for data storage formats enable unified data governance and support for different compute engines.

  • Cloud-Native

    Containerized compute enables hyper-scale elasticity, and streaming data into the data lake enables near-real-time analysis.

Power up with Alibaba Cloud Data Lake

  • Fully-Managed Data Lake

    Reduce O&M workload and simplify data development and management on a fully managed serverless architecture that supports multiple data analysis scenarios.

  • Unified Batch and Stream Processing

    Unify real-time (stream) and offline (batch) data processing in one programming model, removing the operational complexity of maintaining separate code for each while preserving accuracy, consistency, and high performance.

  • One-Stop Data Management

    Handle data integration, development, cataloging, security, and service management for data of all types and sizes in a data lake with 99.9999999999% (twelve 9's) durability, backed by an industry-leading SLA.

  • Elastic Data Processing Resources

    Dynamically create compute nodes and scale them in or out to match processing requirements, for example for Flink on Alibaba Cloud E-MapReduce, which decouples compute and storage resources and supports multiple open-source compute engines.
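To make the "one programming model" idea from the list above concrete, here is a minimal sketch in plain Python. It is not Flink code, and the names `count_by_key` and `event_stream` as well as the sample events are made up for illustration. The same transformation consumes either a bounded list (batch) or a generator that yields events one at a time (stream), which is the property that unified engines provide at scale.

```python
from collections import Counter
from typing import Iterable, Iterator, Tuple

Event = Tuple[str, int]


def count_by_key(events: Iterable[Event]) -> Iterator[Event]:
    """Yield the running total for an event's key after each event."""
    totals: Counter = Counter()
    for key, value in events:
        totals[key] += value
        yield key, totals[key]


def event_stream() -> Iterator[Event]:
    """Hypothetical stream source: events arrive one at a time."""
    yield ("clicks", 1)
    yield ("views", 3)
    yield ("clicks", 2)


# Batch: the whole input is available up front.
batch = [("clicks", 1), ("views", 3), ("clicks", 2)]
batch_totals = dict(count_by_key(batch))  # dict keeps the last total per key

# Stream: the same function, fed incrementally.
stream_totals = dict(count_by_key(event_stream()))
```

Because both paths run the same function, the batch and stream results agree by construction, so there is no second code path to keep consistent.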

Learn more about Data Lake on Alibaba Cloud

Contact Sales

How It Works

Alibaba Cloud Data Lake is based on a cohesively layered architecture. Each layer features a range of products and tools designed to integrate, store, and process data.

  • Data Lake Storage Layer: Ingests raw data of all types into scalable, secure storage, with storage classes for hot and cold data to match different access requirements.

  • Data Lake Formation Layer: Builds a cloud-native data lake that supports batch and stream data processing by centralizing the metadata from different data sources throughout the data lifecycle, with enterprise-level permission control.

  • Data Lake Computing Layer: Connects to a variety of computing engines and AI platforms to execute user queries, perform advanced data analysis, and derive business insights from managed data.

  • Data Development and Governance Layer: Provides efficient, secure, and reliable data development and governance services based on data computing engines.
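One way to picture how these layers cooperate is a shared catalog that records where each table lives and in what format, so that any compute engine can resolve a table name to its storage location. The sketch below is a toy model in plain Python with hypothetical names (`Catalog`, `TableEntry`, `plan_scan`, and the `oss://example-bucket/...` paths); it does not reflect the actual Data Lake Formation API.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class TableEntry:
    """Metadata kept by the catalog (formation) layer."""
    location: str     # where the data sits in the storage layer
    data_format: str  # how the files are encoded


class Catalog:
    """Toy central metadata store shared by all engines."""

    def __init__(self) -> None:
        self._tables: Dict[str, TableEntry] = {}

    def register(self, name: str, location: str, data_format: str) -> None:
        self._tables[name] = TableEntry(location, data_format)

    def resolve(self, name: str) -> TableEntry:
        try:
            return self._tables[name]
        except KeyError:
            raise LookupError(f"table not registered: {name}") from None


def plan_scan(engine: str, table: str, catalog: Catalog) -> str:
    """Computing layer: any engine looks up the same metadata before reading."""
    entry = catalog.resolve(table)
    return f"{engine}: scan {entry.data_format} at {entry.location}"


catalog = Catalog()
# Example paths are placeholders, not real buckets.
catalog.register("orders", "oss://example-bucket/warehouse/orders/", "parquet")
catalog.register("clickstream", "oss://example-bucket/raw/clickstream/", "json")
```

Calling `plan_scan` with two different engine names and the same table returns plans that point at the same location, which is the point of keeping metadata in one place.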


Security and Compliance

We are committed to providing stable, reliable, secure, and compliant cloud computing infrastructure services across major jurisdictions worldwide.
  • ISO 27001
  • SOC2 Type II Report
  • C5
  • MLPS 2.0
  • MTCS

Related Resources


Data Lake Is Becoming the Innovation Standard for Enterprise Data Applications

This short article discusses the rise and future of data lakes as a standard practice.


An Overview of Alibaba Cloud's Comprehensive Cloud-Native Data Lake System

This article introduces the establishment of a cloud-native data lake system based on Alibaba Cloud products.


Construction, Analysis, and Development Governance of a Cloud-Native Data Lake

This article introduces the best practices and cases for building, analyzing, developing, and governing cloud-native data lakes.


Cloud-Native Data Lake: Driving Agile Enterprise Innovation

This video introduces the challenges and advantages of data lakes and how Alibaba Cloud can help build your data lake to power enterprise innovation.


Build a Unified, Secure, and Intelligent Data Lake Governance System

This video shows how to use Alibaba Cloud solutions and services to build a unified, secure, and intelligent data lake governance system.


Data Lake Storage: The Power Source of Innovation

This video introduces how the Alibaba Cloud Data Lake solution supports innovation in enterprises.

Start with Alibaba Cloud Solutions

Learn and experience the power of Alibaba Cloud.
