MaxCompute: Conduct Petabyte-Scale Data Warehousing

MaxCompute

Conduct large-scale data warehousing with MaxCompute

Buy Now Console Migrations

Data type editions support selection in new projects >

Overview

MaxCompute (previously known as ODPS) is a general purpose, fully managed, multi-tenancy data processing platform for large-scale data warehousing. MaxCompute supports various data importing solutions and distributed computing models, enabling users to effectively query massive datasets, reduce production costs, and ensure data security.

{"moduleinfo":{"benefits":"Benefits","outputBuyBtn4Partner":true,"os_count":[{"count_phone":0,"count":0}],"products_count":[{"count_phone":0,"count":0}],"benefits_count":[{"count_phone":4,"count":4}],"news_count":[{"count_phone":0,"count":0}],"floor":"floor1","outputNews4Partner":"false","regions_count":[{"count_phone":0,"count":0}]},"regions":[],"os":[],"products":[],"benefits":[{"icon":"https://img.alicdn.com/tfs/TB1bCuxXrH1gK0jSZFwXXc7aXXa-108-108.png","title":"Large-scale computing and storage","content":"Supports EB-level data storage and computing."},{"icon":"https://img.alicdn.com/tfs/TB13TuxXrY1gK0jSZTEXXXDQVXa-108-108.png","title":"Multiple computational models","content":"Supports SQL, MapReduce, and Graph computational models and Message Passing Interface (MPI) iterative algorithms."},{"icon":"https://img.alicdn.com/tfs/TB1Lc9yXAL0gK0jSZFxXXXWHVXa-108-108.png","title":"Reliable data security measures","content":"Provides stable offline analysis services for more than seven years, and enables multi-level sandbox protection and monitoring."},{"icon":"https://img.alicdn.com/tfs/TB1P1OwXAT2gK0jSZFkXXcIQFXa-108-108.png","title":"Cost-effective","content":"Provides more efficient computing and storage services than an enterprise private cloud, and reduces the purchase cost by 20% to 30%."}],"news":[],"$root":{"moduleinfo":{"benefits":"Benefits","outputBuyBtn4Partner":true,"os_count":[{"count_phone":0,"count":0}],"products_count":[{"count_phone":0,"count":0}],"benefits_count":[{"count_phone":4,"count":4}],"news_count":[{"count_phone":0,"count":0}],"floor":"floor1","outputNews4Partner":"false","regions_count":[{"count_phone":0,"count":0}]},"regions":[],"os":[],"products":[],"benefits":[{"icon":"https://img.alicdn.com/tfs/TB1bCuxXrH1gK0jSZFwXXc7aXXa-108-108.png","title":"Large-scale computing and storage","content":"Supports EB-level data storage and computing."},{"icon":"https://img.alicdn.com/tfs/TB13TuxXrY1gK0jSZTEXXXDQVXa-108-108.png","title":"Multiple computational models","content":"Supports SQL, MapReduce, and Graph computational models and Message Passing Interface (MPI) iterative algorithms."},{"icon":"https://img.alicdn.com/tfs/TB1Lc9yXAL0gK0jSZFxXXXWHVXa-108-108.png","title":"Reliable data security measures","content":"Provides stable offline analysis services for more than seven years, and enables multi-level sandbox protection and monitoring."},{"icon":"https://img.alicdn.com/tfs/TB1P1OwXAT2gK0jSZFkXXcIQFXa-108-108.png","title":"Cost-effective","content":"Provides more efficient computing and storage services than an enterprise private cloud, and reduces the purchase cost by 20% to 30%."}],"news":[]},"$moduleId":"3093554200"}

Benefits

: Large-scale computing and storage
Supports EB-level data storage and computing.

: Multiple computational models
Supports SQL, MapReduce, and Graph computational models and Message Passing Interface (MPI) iterative algorithms.

: Reliable data security measures
Provides stable offline analysis services for more than seven years, and enables multi-level sandbox protection and monitoring.

: Cost-effective
Provides more efficient computing and storage services than an enterprise private cloud, and reduces the purchase cost by 20% to 30%.

{"moduleinfo":{"title":"Features","li_count":[{"count_phone":4,"count":4}]},"li":[{"spm":"a","img":"https://img.alicdn.com/tfs/TB1EYCxXEH1gK0jSZSyXXXtlpXa-108-108.png","more":[{"p":"Supports multiple data tunnels, history data tunnels, and incremental data tunnels.","tce_rule_count":"1","title":""},{"p":"MaxCompute uses tunnels to transmit data. Tunnels are scalable, and import and export PB-level data on a daily basis. You can import all data or history data through multiple tunnels. The tunnel service supports Java SDKs. You can use commands on the MaxCompute client to exchange files and data with the cloud.","tce_rule_count":"1","title":"Multiple data tunnels and history data tunnels"},{"p":"MaxCompute provides the DataHub service to upload real-time data. This service features low latency and is easy to use. It is very suitable for importing incremental data. DataHub supports multiple data transmission plug-ins, such as Logstash, Flume, Fluentd, and Sqoop. You can also use Log Service to easily ship logs to MaxCompute, and use big data development kits to perform log analysis and mining.","tce_rule_count":"1","title":"Real-time incremental data tunnels"}],"h1":"Data channel","imgAction":"https://img.alicdn.com/tfs/TB1nbCJXxn1gK0jSZKPXXXvUXXa-108-108.png"},{"spm":"a","img":"https://img.alicdn.com/tfs/TB11gGxXxD1gK0jSZFKXXcJrVXa-108-108.png","more":[{"p":"MaxCompute stores all data in tables to hide the file system. The compressed column storage greatly reduces your costs using a high compression ratio. MaxCompute provides a compression ratio of 5.","tce_rule_count":"1","title":""}],"h1":"Data storage in a two-dimensional table","imgAction":"https://img.alicdn.com/tfs/TB1cAWIXET1gK0jSZFrXXcNCXXa-108-108.png"},{"spm":"A","img":"https://img.alicdn.com/tfs/TB133GxXxD1gK0jSZFKXXcJrVXa-108-108.png","more":[{"p":"Supports multiple computational models, such as SQL, MapReduce, and Graph.","tce_rule_count":"1","title":""},{"p":"MaxCompute SQL follows standard SQL syntax and Hive syntax. This combined syntax is similar to Hibernate Query Language (HQL), so SQL or HQL programmers can use MaxCompute SQL easily. MaxCompute provides a more efficient computing framework than a common MapReduce model, to run the SQL computational model. However, MaxCompute SQL does not support transactions, indexes, update, and delete.","tce_rule_count":"1","title":"SQL"},{"p":"MaxCompute provides the Java MapReduce programming model. MaxCompute does not have any file API. You have to read data from and write data to tables in the system. Therefore, the MapReduce model in MaxCompute is different from the MapReduce model in an open-source software community. A modified model may be less flexible. For example, you cannot customize sorting and hashing algorithms. However, the development process is simplified. More importantly, MaxCompute provides the Extended MapReduce (MR²) model. In this model, multiple Reduce operations can follow a Map operation.","tce_rule_count":"1","title":"MapReduce"},{"p":"In some complex iterative computation scenarios, such as K-Means and PageRank, MapReduce takes a long time to complete the tasks. Therefore, MaxCompute uses the Graph model to efficiently run these tasks.","tce_rule_count":"1","title":"Graph"}],"h1":"Computational models","imgAction":"https://img.alicdn.com/tfs/TB1xieFXpY7gK0jSZKzXXaikpXa-108-108.png"},{"spm":"a","img":"https://img.alicdn.com/tfs/TB1aHaxXEz1gK0jSZLeXXb9kVXa-108-108.png","more":[{"p":"MaxCompute is a multi-tenant computing platform. By default, tenants are isolated and do not share data. However, they can use MaxCompute to assign the permissions on certain data to other members in the same project group.","tce_rule_count":"1","title":""}],"h1":"Secure","imgAction":"https://img.alicdn.com/tfs/TB1FbCIXAY2gK0jSZFgXXc5OFXa-108-108.png"}],"$root":{"moduleinfo":{"title":"Features","li_count":[{"count_phone":4,"count":4}]},"li":[{"spm":"a","img":"https://img.alicdn.com/tfs/TB1EYCxXEH1gK0jSZSyXXXtlpXa-108-108.png","more":[{"p":"Supports multiple data tunnels, history data tunnels, and incremental data tunnels.","tce_rule_count":"1","title":""},{"p":"MaxCompute uses tunnels to transmit data. Tunnels are scalable, and import and export PB-level data on a daily basis. You can import all data or history data through multiple tunnels. The tunnel service supports Java SDKs. You can use commands on the MaxCompute client to exchange files and data with the cloud.","tce_rule_count":"1","title":"Multiple data tunnels and history data tunnels"},{"p":"MaxCompute provides the DataHub service to upload real-time data. This service features low latency and is easy to use. It is very suitable for importing incremental data. DataHub supports multiple data transmission plug-ins, such as Logstash, Flume, Fluentd, and Sqoop. You can also use Log Service to easily ship logs to MaxCompute, and use big data development kits to perform log analysis and mining.","tce_rule_count":"1","title":"Real-time incremental data tunnels"}],"h1":"Data channel","imgAction":"https://img.alicdn.com/tfs/TB1nbCJXxn1gK0jSZKPXXXvUXXa-108-108.png"},{"spm":"a","img":"https://img.alicdn.com/tfs/TB11gGxXxD1gK0jSZFKXXcJrVXa-108-108.png","more":[{"p":"MaxCompute stores all data in tables to hide the file system. The compressed column storage greatly reduces your costs using a high compression ratio. MaxCompute provides a compression ratio of 5.","tce_rule_count":"1","title":""}],"h1":"Data storage in a two-dimensional table","imgAction":"https://img.alicdn.com/tfs/TB1cAWIXET1gK0jSZFrXXcNCXXa-108-108.png"},{"spm":"A","img":"https://img.alicdn.com/tfs/TB133GxXxD1gK0jSZFKXXcJrVXa-108-108.png","more":[{"p":"Supports multiple computational models, such as SQL, MapReduce, and Graph.","tce_rule_count":"1","title":""},{"p":"MaxCompute SQL follows standard SQL syntax and Hive syntax. This combined syntax is similar to Hibernate Query Language (HQL), so SQL or HQL programmers can use MaxCompute SQL easily. MaxCompute provides a more efficient computing framework than a common MapReduce model, to run the SQL computational model. However, MaxCompute SQL does not support transactions, indexes, update, and delete.","tce_rule_count":"1","title":"SQL"},{"p":"MaxCompute provides the Java MapReduce programming model. MaxCompute does not have any file API. You have to read data from and write data to tables in the system. Therefore, the MapReduce model in MaxCompute is different from the MapReduce model in an open-source software community. A modified model may be less flexible. For example, you cannot customize sorting and hashing algorithms. However, the development process is simplified. More importantly, MaxCompute provides the Extended MapReduce (MR²) model. In this model, multiple Reduce operations can follow a Map operation.","tce_rule_count":"1","title":"MapReduce"},{"p":"In some complex iterative computation scenarios, such as K-Means and PageRank, MapReduce takes a long time to complete the tasks. Therefore, MaxCompute uses the Graph model to efficiently run these tasks.","tce_rule_count":"1","title":"Graph"}],"h1":"Computational models","imgAction":"https://img.alicdn.com/tfs/TB1xieFXpY7gK0jSZKzXXaikpXa-108-108.png"},{"spm":"a","img":"https://img.alicdn.com/tfs/TB1aHaxXEz1gK0jSZLeXXb9kVXa-108-108.png","more":[{"p":"MaxCompute is a multi-tenant computing platform. By default, tenants are isolated and do not share data. However, they can use MaxCompute to assign the permissions on certain data to other members in the same project group.","tce_rule_count":"1","title":""}],"h1":"Secure","imgAction":"https://img.alicdn.com/tfs/TB1FbCIXAY2gK0jSZFgXXc5OFXa-108-108.png"}]},"$moduleId":"5055897200"}

Features

Data channel

Supports multiple data tunnels, history data tunnels, and incremental data tunnels.

Multiple data tunnels and history data tunnels

MaxCompute uses tunnels to transmit data. Tunnels are scalable, and import and export PB-level data on a daily basis. You can import all data or history data through multiple tunnels. The tunnel service supports Java SDKs. You can use commands on the MaxCompute client to exchange files and data with the cloud.

Real-time incremental data tunnels

MaxCompute provides the DataHub service to upload real-time data. This service features low latency and is easy to use. It is very suitable for importing incremental data. DataHub supports multiple data transmission plug-ins, such as Logstash, Flume, Fluentd, and Sqoop. You can also use Log Service to easily ship logs to MaxCompute, and use big data development kits to perform log analysis and mining.
Data storage in a two-dimensional table

MaxCompute stores all data in tables to hide the file system. The compressed column storage greatly reduces your costs using a high compression ratio. MaxCompute provides a compression ratio of 5.
Computational models

Supports multiple computational models, such as SQL, MapReduce, and Graph.

SQL

MaxCompute SQL follows standard SQL syntax and Hive syntax. This combined syntax is similar to Hibernate Query Language (HQL), so SQL or HQL programmers can use MaxCompute SQL easily. MaxCompute provides a more efficient computing framework than a common MapReduce model, to run the SQL computational model. However, MaxCompute SQL does not support transactions, indexes, update, and delete.

MapReduce

MaxCompute provides the Java MapReduce programming model. MaxCompute does not have any file API. You have to read data from and write data to tables in the system. Therefore, the MapReduce model in MaxCompute is different from the MapReduce model in an open-source software community. A modified model may be less flexible. For example, you cannot customize sorting and hashing algorithms. However, the development process is simplified. More importantly, MaxCompute provides the Extended MapReduce (MR²) model. In this model, multiple Reduce operations can follow a Map operation.

Graph

In some complex iterative computation scenarios, such as K-Means and PageRank, MapReduce takes a long time to complete the tasks. Therefore, MaxCompute uses the Graph model to efficiently run these tasks.
Secure

MaxCompute is a multi-tenant computing platform. By default, tenants are isolated and do not share data. However, they can use MaxCompute to assign the permissions on certain data to other members in the same project group.

Customer Scenarios

Cost-effective Investment and Maintenance
Data Warehouses
Big Data Analysis for Logs
Redefined Management
Big data-based Precision Marketing
Analysis of Large Amounts of Marketing Data

Cost-effective Investment and Maintenance

East Environment Energy

Cost-effective and quick data uploads to the cloud

MaxCompute helps migrate all related services to the cloud within three months. East Environment Energy does not have to build a big data platform. Instead, all related services are migrated to Alibaba Cloud's MaxCompute. This shortens the data processing time by more than two thirds. MaxCompute also secures green energy usage data in the cloud.

Customer benefits

Focused on core businesses

MaxCompute helps migrate all related services to the cloud within three months. A large amount of resources in the cloud can serve businesses.
Cost-effective investment and maintenance

MaxCompute greatly reduces the need to invest in manpower, materials, research and development.
Secure and reliable

Comprehensive services and stable performance secure your data in the cloud.

Related Products & Services

Data Warehouses

Xiaohongchun

Data warehouses

Data warehouses are vital to cloud computing and big data applications, and are continuously developing. Xiaohongchun uses MaxCompute to build data warehouses.

Challenges

Upload data to the cloud

Step 1: Synchronize data to MaxCompute using DataX and tunnels.
Scrub data

Step 2: Integrate Alibaba Cloud services to synchronize and scrub data in DataWorks.
Display data

Quick BI generates reports based on the data from extract, transform, load (ETL), and online analytical processing (OLAP) in DataWorks.

Related Products & Services

Big Data Analysis for Logs

Moji Weather

Efficient development and cost-effective storage and computing

Moji Weather's log analysis business has migrated to MaxCompute. Development efficiency is enhanced by five times, and storage and computing costs are reduced by 70%. MaxCompute processes and analyzes 2TB of logs each day to benefit personalized operating strategies.

Customer benefits

Enhanced business efficiency

MaxCompute analyzes all logs using SQL, to enhance business efficiency by more than five times.
Improved storage efficiency

MaxCompute reduces the storage and computing costs by 70% and improves the performance and stability.
Easy application of big data

MaxCompute provides multiple open-source plugins for easy migration to the cloud.

Related Products & Services

Redefined Management

Nailist

Efficient use of large amounts of data for the refined management of millions of users

Nailist mainly deals with the e-commerce business and currently serves millions of users. Therefore, Nailist needs to extract the best value from its large amounts of user data in order to improve user experience.

Customer benefits

Improved business insight

Nailist can manage millions of users with MaxCompute.
Data-based business

MaxCompute provides analysis and monitoring of business data to enhance business efficiency.
Quick response to business demands

MaxCompute is scalable to meet the demand of increasing business data analysis.

Related Products & Services

Big data-based Precision Marketing

Huihe Marketing

Big data-based precision marketing

Huihe Marketing has built a core precision marketing platform based on big data by using MaxCompute. On this platform, MaxCompute stores all logs, and DataWorks performs offline scheduling and analysis.

Benefits

Cost-effective analysis of large amounts of data

Performs statistics and analysis of large amounts of logs to ensure cost-effective development.
Real-time data searching and analysis

The system responds to customer search requests in milliseconds and returns the number of users of a specified tag.
User-friendly machine learning platform

The revenue of the precision marketing provider depends on the quality of algorithm models.

Integrations

Analysis of Large Amounts of Marketing Data

PING++

Analysis of large amounts of marketing data

Currently, Ping++ deals with millions of transactions each day. Therefore, Ping++ needs to use a secure, reliable, and stable big data platform. This platform analyzes large amounts of transaction data to enhance business efficiency and to increase user loyalty.

Challenges

Innovative

The comprehensive big data platform provides a variety of features, such as storage, computing, BI, and machine learning.
Quick and cost-effective

The Internet-based enterprise can provide Ping++ in a cost-effective method.
Secure, stable, and reliable

The system provides strict data privacy and protection. Users can only analyze their own data.

Integrations

{"moduleinfo":{"content":"Get Started for Free","url":"https://account.alibabacloud.com/register/intl_register.htm?","forReseller":{"noOutPut":true}},"category":[{"title":"Documentation","description":"MaxCompute documentation","target":"_blank","output4Partner":"true","link":"https://www.alibabacloud.com/help/product/27797.htm"},{"title":"Resources","description":"API & SDKs for developers","output4Partner":"true","target":"_blank","link":"https://www.alibabacloud.com/support/developer-resources"},{"title":"Products","description":"See more Alibaba Cloud products","link":"https://www.alibabacloud.com/product","target":"_blank","output4Partner":"true"}],"$root":{"moduleinfo":{"content":"Get Started for Free","url":"https://account.alibabacloud.com/register/intl_register.htm?","forReseller":{"noOutPut":true}},"category":[{"title":"Documentation","description":"MaxCompute documentation","target":"_blank","output4Partner":"true","link":"https://www.alibabacloud.com/help/product/27797.htm"},{"title":"Resources","description":"API & SDKs for developers","output4Partner":"true","target":"_blank","link":"https://www.alibabacloud.com/support/developer-resources"},{"title":"Products","description":"See more Alibaba Cloud products","link":"https://www.alibabacloud.com/product","target":"_blank","output4Partner":"true"}]},"$moduleId":"9095656410"}

Get Started with $500 Coupon

MaxCompute

Benefits

Features

Data channel

Data storage in a two-dimensional table

Computational models

Secure

Customer Scenarios

Customer benefits

Related Products & Services

Challenges

Related Products & Services

Customer benefits

Related Products & Services

Customer benefits

Related Products & Services

Benefits

Integrations

Challenges

Integrations

Upgraded Support For You

1 on 1 Presale Consultation

24/7 Technical Support

6 Free Tickets per Quarter

Faster Response

Sales Support

Technical Support

Connect & Report Abuse