This topic describes the latest updates to MaxCompute documentation in 2022. These updates allow you to understand the new features, new syntax, and permission updates in MaxCompute and help you improve project development efficiency.

Updates in December 2022

Release dateFeatureCategoryDescriptionReferences
2022-12-15MASK_HASH function New descriptionThe MASK_HASH function returns a hash value that is calculated by using a string expression specified by expr. If the values calculated by a string expression are the same, a consistent hash value is returned. MASK_HASH
2022-12-14List of flag parameters New descriptionThis topic provides information about flag parameters, including the parameter name, parameter description, and scope of impact. Flag parameters
2022-12-14WINDOW keyword New descriptionYou can use the WINDOW keyword to specify a custom window and specify a name for a window function. WINDOW keyword
2022-12-13List of time zones New descriptionYou can use the FROM_UTC_TIMESTAMP function to convert a UTC timestamp into a timestamp in a specified time zone. Time zones
2022-12-11Error codes New descriptionMaxCompute provides an error code mechanism to help you quickly locate the error type, severity, and cause of an error and resolve the related issues. This topic describes the error message format and a list of error codes of MaxCompute. Error code overview
2022-12-09Migration of the compressed data from Object Storage Service (OSS) to MaxCompute New descriptionThis topic describes how to use the external table feature of MaxCompute to migrate the file data that is compressed in the Snappy format from OSS to MaxCompute. The data in the file that is compressed in the Snappy format is separated by commas (,). Migrate compressed data from OSS to MaxCompute
2022-12-02Merging of small files New descriptionA distributed file system stores data by block. A file whose size is less than the block size 64 MB is called a small file. In most cases, small files are generated in a distributed file system. For example, small files are generated to save the computing results of SQL statements, the computing results of a distributed engine, and data that is collected by using Tunnel commands. You can merge small files to optimize system performance. This topic describes how to merge small files in MaxCompute. Merge small files

Updates in November 2022

Release dateFeatureCategoryDescriptionReferences
2022-11-15ANALYZE statement New descriptionThe ANALYZE statement is used to obtain the metadata of a table by collecting statistics about data in the table. This statement helps optimize query performance. ANALYZE

Updates in October 2022

Release dateFeatureCategoryDescriptionReferences
2022-10-13MaxCompute schema New descriptionMaxCompute provides the schema feature. You can use schemas to classify tables, resources, and functions in projects. This topic describes the concepts and permissions that are related to schemas. This topic also describes how to use schemas. Schema-related operations

Updates in September 2022

Release dateFeatureCategoryDescriptionReferences
2022-09-28Built-in functions (in alphabetical order) New descriptionBuilt-in functions provided by MaxCompute under the Built-in functions (in alphabetical order) node are sorted in alphabetical order. This helps you easily find and use the built-in functions. Overview
2022-09-28All statements (in alphabetical order) New descriptionAll SQL statements supported by MaxCompute under the All statements (in alphabetical order) node are sorted in alphabetical order. This helps you easily find and use the SQL statements. ADD ACCOUNTPROVIDER

Updates in August 2022

Release dateFeatureCategoryDescriptionReferences
2022-08-27Aggregate functions Updated descriptionThe following aggregate functions are added: BITWISE_AND_AGG, MIN_BY, and MAX_BY. Aggregate functions
2022-08-19New function for materialized views Updated descriptionA function is added to MaxCompute. This function allows you to query the status of materialized views. You can use this function to check whether the data of the current materialized view is the same as the data of the original table. You can also use this function to check whether the data of a partition in the current materialized view is the same as the data of the mapped partition in the original table. If the data is the same, True is returned. If the data is not the same, False is returned. Materialized view operations
2022-08-04Use of computing resourcesNew descriptionMaxCompute allows you to specify quotas at the job level. This helps you use quotas in a flexible manner. Use of computing resources

Updates in July 2022

Release dateFeatureCategoryDescriptionReferences
2022-07-14String functions Updated descriptionThe REGEXP_EXTRACT_ALL function is added. String functions
2022-07-07UNLOAD statements Updated descriptionWhen you use UNLOAD statements to export data from MaxCompute to Object Storage Service (OSS), you can configure prefixes and extensions for the exported data files. UNLOAD
2022-07-07Table split size settings Updated descriptionMaxCompute allows you to configure a split size for tables to control the parallelism of jobs. If resources are sufficient but jobs run at a low speed or if jobs take a large amount of time to wait for resource allocation and resources are insufficient, you can adjust the split size to improve the computing efficiency. SELECT syntax
2022-07-07Aggregate functions Updated descriptionThe following aggregate functions are added: BITWISE_OR_AGG, MAP_AGG, MULTIMAP_AGG, MAP_UNION, MAP_UNION_SUM, and HISTOGRAM. Aggregate functions

Updates in June 2022

Release dateFeatureCategoryDescriptionReferences
2022-06-01String functions Updated descriptionThe KEYVALUE_TUPLE function is added. String functions

Updates in April 2022

Release dateFeatureCategoryDescriptionReferences
2022-04-14Support for the EXCEPT and REPLACE clauses Updated descriptionWhen a SELECT statement is used to query data, the EXCEPT or REPLACE clause can be added to the SELECT statement to exclude some columns or modify data in some columns. SELECT syntax
2022-04-14Optimization of the content of OSS external tables Updated descriptionThe structure of the document content that is related to OSS external tables is optimized to present useful information. Set up a Spark on MaxCompute development environment
2022-04-07Python user-defined function (UDF) examples New exampleExamples on how to use Python UDFs to process data of complex data types and reference tables or resources are added.

Updates in March 2022

Release dateFeatureCategoryDescriptionReferences
2022-03-24Best practices for transposing rows to columns or columns to rowsNew descriptionThe best practices for transposing rows to columns or columns to rows are added.

Examples are provided to demonstrate how to use SQL statements to transpose rows to columns or columns to rows.

Transpose rows to columns or columns to rows
2022-03-18Data in open source formatsUpdated descriptionThe following properties are added for data in the Parquet format:
  • parquet.file.cache.size
  • parquet.io.buffer.size
Create an OSS external table
2022-03-18Complex type functionsUpdated descriptionExamples on using the GET_JSON_OBJECT and JSON_TUPLE functions are added. Complex type functions
2022-03-18String functionsUpdated descriptionDescriptions of the TRIM, LTRIM, and RTRIM functions are updated. String functions
2022-03-18DISTRIBUTED MAPJOINNew descriptionA topic is added to describe DISTRIBUTED MAPJOIN.

DISTRIBUTED MAPJOIN is an optimized version of MAPJOIN. DISTRIBUTED MAPJOIN is suitable for scenarios in which small tables need to be joined with large tables.

DISTRIBUTED MAPJOIN
2022-03-15SHOW commandsNew descriptionA topic is added to describe all SHOW commands that are supported in MaxCompute.

SHOW commands
2022-03-09FAQUpdated description
  • Issues that are related to sandboxes and third-party packages are added.
  • A common issue about running SQL jobs in Java business code is added.
  • MaxCompute FAQ is provided.
2022-03-01Permission managementUpdated descriptionThe document content that is related to permissions is modified. Permission overview
2022-03-01Compatibility with Apache Spark 3.1Updated descriptionMaxCompute is compatible with Apache Spark 3.1 in addition to the following Apache Spark versions: 1.6, 2.3, and 2.4. Set up a Spark on MaxCompute development environment

Updates in February 2022

Release dateFeatureCategoryDescriptionReferences
2022-02-28Management of external projects based on SQL statements Updated descriptionThe document content that is related to external project management based on SQL statements is modified. Use SQL statements to manage an external project
2022-02-23Addition or deletion of columns and column order change New descriptionThe syntax and examples for adding or deleting columns are added. The descriptions that are related to the column order change are added. Partition and column operations
2022-02-14Common errors for built-in functions New descriptionA topic is added to describe common errors for built-in functions. Common errors for built-in functions
2022-02-11Graph operations Updated descriptionDescriptions about how to write a Graph job by using the single source shortest path (SSSP) algorithm are optimized. Write a Graph job

Updates in January 2022

Release dateFeatureCategoryDescriptionReferences
2022-01-30FAQ Updated descriptionFAQ is categorized based on the issue type. This helps you quickly find the solution to a specific issue. FAQ
2022-01-30External projects Updated descriptionThe data lakehouse solution can be implemented in MaxCompute by using external projects. Project
2022-01-24Best practice for permission management New practiceThe best practice for policy-based permission management for users who are assigned built-in roles is added. Policy-based permission management for users assigned built-in roles
2022-01-21Typical use cases of built-in functions New descriptionDescriptions related to the implementation of the GROUP_CONCAT function and the precision issue of the ROUND function are added.
2022-01-05Change of complex type functions Updated descriptionThe mapDupKeyPolicy parameter is added to the following functions: MAP_CONCAT, MAP_FROM_ARRAYS, MAP_FROM_ENTRIES, MAP, and STR_TO_MAP. The odps.sql.map.key.dedup.policy parameter can be configured at the session level to take the same effect as the mapDupKeyPolicy parameter. Complex type functions