All Products
Search
Document Center

MaxCompute:Data Lakehouse Solution 1.0 (Deprecated)

Last Updated:Nov 07, 2025

MaxCompute provides a data lakehouse solution that breaks down the barriers between data lakes and data warehouses. This solution combines the flexibility and rich ecosystem of data lakes with the enterprise-grade deployment capabilities of data warehouses, which lets you build an integrated data management platform.

Applicability

Data Lakehouse Solution 1.0 is available only in the following regions: China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Hong Kong), Singapore, and Germany (Frankfurt).

Implementation methods

The MaxCompute data lakehouse solution integrates data warehouses and data lakes. The following implementation methods are supported:

  • Build a data lakehouse using MaxCompute, Data Lake Formation, and Object Storage Service: The metadata (schema) of the data lake is stored in Data Lake Formation (DLF). MaxCompute uses the metadata management capabilities of DLF to improve data processing for semi-structured formats in Object Storage Service (OSS). Supported formats include Delta Lake, Hudi, AVRO, CSV, JSON, PARQUET, and ORC.

  • Build a data lakehouse using MaxCompute and Hadoop: This method supports deployments in on-premises data centers, on cloud-based virtual machines, or through Alibaba Cloud E-MapReduce. After the VPCs for the MaxCompute and Hadoop platforms are connected, MaxCompute can directly access the Hive metadata service. The metadata is then mapped to an external project in MaxCompute.

References