This topic describes the latest updates to MaxCompute documentation in 2022. These updates allow you to understand the new features, new syntax, and permission updates in MaxCompute and help you improve project development efficiency.
Updates in December 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-12-15 | MASK_HASH function | New description | The MASK_HASH function returns a hash value that is calculated by using a string expression specified by expr. If the values calculated by a string expression are the same, a consistent hash value is returned. | MASK_HASH |
2022-12-14 | List of flag parameters | New description | This topic provides information about flag parameters, including the parameter name, parameter description, and scope of impact. | Flag parameters |
2022-12-14 | WINDOW keyword | New description | You can use the WINDOW keyword to specify a custom window and specify a name for a window function. | WINDOW keyword |
2022-12-13 | List of time zones | New description | You can use the FROM_UTC_TIMESTAMP function to convert a UTC timestamp into a timestamp in a specified time zone. | Time zones |
2022-12-11 | Error codes | New description | MaxCompute provides an error code mechanism to help you quickly locate the error type, severity, and cause of an error and resolve the related issues. This topic describes the error message format and a list of error codes of MaxCompute. | Error code overview |
2022-12-09 | Migration of the compressed data from Object Storage Service (OSS) to MaxCompute | New description | This topic describes how to use the external table feature of MaxCompute to migrate the file data that is compressed in the Snappy format from OSS to MaxCompute. The data in the file that is compressed in the Snappy format is separated by commas (,). | Migrate compressed data from OSS to MaxCompute |
2022-12-02 | Merging of small files | New description | A distributed file system stores data by block. A file whose size is less than the block size 64 MB is called a small file. In most cases, small files are generated in a distributed file system. For example, small files are generated to save the computing results of SQL statements, the computing results of a distributed engine, and data that is collected by using Tunnel commands. You can merge small files to optimize system performance. This topic describes how to merge small files in MaxCompute. | Merge small files |
Updates in November 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-11-15 | ANALYZE statement | New description | The ANALYZE statement is used to obtain the metadata of a table by collecting statistics about data in the table. This statement helps optimize query performance. | ANALYZE |
Updates in October 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-10-13 | MaxCompute schema | New description | MaxCompute provides the schema feature. You can use schemas to classify tables, resources, and functions in projects. This topic describes the concepts and permissions that are related to schemas. This topic also describes how to use schemas. | Schema-related operations |
Updates in September 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-09-28 | Built-in functions (in alphabetical order) | New description | Built-in functions provided by MaxCompute under the Built-in functions (in alphabetical order) node are sorted in alphabetical order. This helps you easily find and use the built-in functions. | Overview |
2022-09-28 | All statements (in alphabetical order) | New description | All SQL statements supported by MaxCompute under the All statements (in alphabetical order) node are sorted in alphabetical order. This helps you easily find and use the SQL statements. | ADD ACCOUNTPROVIDER |
Updates in August 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-08-27 | Aggregate functions | Updated description | The following aggregate functions are added: BITWISE_AND_AGG , MIN_BY , and MAX_BY . | Aggregate functions |
2022-08-19 | New function for materialized views | Updated description | A function is added to MaxCompute. This function allows you to query the status of materialized views. You can use this function to check whether the data of the current materialized view is the same as the data of the original table. You can also use this function to check whether the data of a partition in the current materialized view is the same as the data of the mapped partition in the original table. If the data is the same, True is returned. If the data is not the same, False is returned. | Materialized view operations |
2022-08-04 | Use of computing resources | New description | MaxCompute allows you to specify quotas at the job level. This helps you use quotas in a flexible manner. | Use of computing resources |
Updates in July 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-07-14 | String functions | Updated description | The REGEXP_EXTRACT_ALL function is added. | String functions |
2022-07-07 | UNLOAD statements | Updated description | When you use UNLOAD statements to export data from MaxCompute to Object Storage Service (OSS), you can configure prefixes and extensions for the exported data files. | UNLOAD |
2022-07-07 | Table split size settings | Updated description | MaxCompute allows you to configure a split size for tables to control the parallelism of jobs. If resources are sufficient but jobs run at a low speed or if jobs take a large amount of time to wait for resource allocation and resources are insufficient, you can adjust the split size to improve the computing efficiency. | SELECT syntax |
2022-07-07 | Aggregate functions | Updated description | The following aggregate functions are added: BITWISE_OR_AGG , MAP_AGG , MULTIMAP_AGG , MAP_UNION , MAP_UNION_SUM , and HISTOGRAM . | Aggregate functions |
Updates in June 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-06-01 | String functions | Updated description | The KEYVALUE_TUPLE function is added. | String functions |
Updates in April 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-04-14 | Support for the EXCEPT and REPLACE clauses | Updated description | When a SELECT statement is used to query data, the EXCEPT or REPLACE clause can be added to the SELECT statement to exclude some columns or modify data in some columns. | SELECT syntax |
2022-04-14 | Optimization of the content of OSS external tables | Updated description | The structure of the document content that is related to OSS external tables is optimized to present useful information. | Set up a Spark on MaxCompute development environment |
2022-04-07 | Python user-defined function (UDF) examples | New example | Examples on how to use Python UDFs to process data of complex data types and reference tables or resources are added. |
Updates in March 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-03-24 | Best practices for transposing rows to columns or columns to rows | New description | The best practices for transposing rows to columns or columns to rows are added. Examples are provided to demonstrate how to use SQL statements to transpose rows to columns or columns to rows. | Transpose rows to columns or columns to rows |
2022-03-18 | Data in open source formats | Updated description | The following properties are added for data in the Parquet format:
| Create an OSS external table |
2022-03-18 | Complex type functions | Updated description | Examples on using the GET_JSON_OBJECT and JSON_TUPLE functions are added. | Complex type functions |
2022-03-18 | String functions | Updated description | Descriptions of the TRIM, LTRIM, and RTRIM functions are updated. | String functions |
2022-03-18 | DISTRIBUTED MAPJOIN | New description | A topic is added to describe DISTRIBUTED MAPJOIN. DISTRIBUTED MAPJOIN is an optimized version of MAPJOIN. DISTRIBUTED MAPJOIN is suitable for scenarios in which small tables need to be joined with large tables. | DISTRIBUTED MAPJOIN |
2022-03-15 | SHOW commands | New description | A topic is added to describe all SHOW commands that are supported in MaxCompute.
| SHOW commands |
2022-03-09 | FAQ | Updated description |
| |
2022-03-01 | Permission management | Updated description | The document content that is related to permissions is modified. | Permission overview |
2022-03-01 | Compatibility with Apache Spark 3.1 | Updated description | MaxCompute is compatible with Apache Spark 3.1 in addition to the following Apache Spark versions: 1.6, 2.3, and 2.4. | Set up a Spark on MaxCompute development environment |
Updates in February 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-02-28 | Management of external projects based on SQL statements | Updated description | The document content that is related to external project management based on SQL statements is modified. | Use SQL statements to manage an external project |
2022-02-23 | Addition or deletion of columns and column order change | New description | The syntax and examples for adding or deleting columns are added. The descriptions that are related to the column order change are added. | Partition and column operations |
2022-02-14 | Common errors for built-in functions | New description | A topic is added to describe common errors for built-in functions. | Common errors for built-in functions |
2022-02-11 | Graph operations | Updated description | Descriptions about how to write a Graph job by using the single source shortest path (SSSP) algorithm are optimized. | Write a Graph job |
Updates in January 2022
Release date | Feature | Category | Description | References |
---|---|---|---|---|
2022-01-30 | FAQ | Updated description | FAQ is categorized based on the issue type. This helps you quickly find the solution to a specific issue. | FAQ |
2022-01-30 | External projects | Updated description | The data lakehouse solution can be implemented in MaxCompute by using external projects. | Project |
2022-01-24 | Best practice for permission management | New practice | The best practice for policy-based permission management for users who are assigned built-in roles is added. | Policy-based permission management for users assigned built-in roles |
2022-01-21 | Typical use cases of built-in functions | New description | Descriptions related to the implementation of the GROUP_CONCAT function and the precision issue of the ROUND function are added. | |
2022-01-05 | Change of complex type functions | Updated description | The mapDupKeyPolicy parameter is added to the following functions: MAP_CONCAT, MAP_FROM_ARRAYS, MAP_FROM_ENTRIES, MAP, and STR_TO_MAP. The odps.sql.map.key.dedup.policy parameter can be configured at the session level to take the same effect as the mapDupKeyPolicy parameter. | Complex type functions |