This topic describes the latest updates to MaxCompute documentation in year 2020. These updates allow you to understand the new features, syntax, and permissions in MaxCompute and help you improve project development efficiency.

Updates in December 2020

Release date Feature Category Description Documentation
2020-12-21 STACK and GET_USER_ID functions New description The STACK function is added to separate specified data into a specified number of columns. The GET_USER_ID function is added to obtain the ID of the current account. Other functions
2020-12-21 DataWorks-based permission management for roles on a project New description DataWorks is used to manage permissions of a role on a project. Use DataWorks to manage permissions of a role on a project
2020-12-21 Information Schema service provided by default Updated description By default, a new MaxCompute project provides the Information Schema service. You do not need to manually install the Information Schema permission package. Overview of Information Schema
2020-12-15 Version updates of Mars New description Version updates of Mars are provided. Version updates
2020-12-15 Version updates of SDK for Java New description Version updates of SDK for Java are provided. Version updates
2020-12-15 Version updates of the JDBC driver New description Version updates of the JDBC driver are provided. Version updates
2020-12-11 TASK view added to the metadata view list New description The TASK view is added to store real-time snapshots of running jobs. Metadata views
2020-12-10 Instructions on how to add project members and configure roles New description If you create a project by using your Alibaba Cloud account and require RAM users for data development, you can add the RAM users as project members and configure roles for these users. Add project members and configure roles
2020-12-09 AdminEvent events New description The CreateProject, UpdateProject, and DeleteProject events are added to record project-related logs. Audit logs
2020-12-08 Adjustment and optimization of the content of common commands Updated description The content of common commands is adjusted, and sample commands are optimized. Common SQL statements
2020-12-08 Adjustment and optimization of the content of INSERT statements Updated description The content of INSERT statements is adjusted, and statement examples are optimized.

Updates in November 2020

Release date Feature Category Description Documentation
2020-11-30 Information of public datasets Updated description Information of public datasets is added to the International site (alibabacloud.com) for testing and trial use. Overview
2020-11-30 Query editor New feature The MaxCompute V2.0 console provides a query editor for you to execute SQL statements and analyze data. Query editor
2020-11-27 Configuration upgrade and downgrade Updated description Configuration upgrade and downgrade are performed in the MaxCompute V2.0 console. Upgrade or downgrade configurations
2020-11-26 Fields of audit logs Updated description Fields of audit logs are updated. Audit logs
2020-11-20 ORDER BY without LIMIT New description MaxCompute supports the execution of ORDER BY without LIMIT. SELECT syntax
2020-11-17 Instructions on how to reference third-party packages in Python user-defined functions (UDFs) New description Instructions on how to reference third-party packages in Python UDFs are added. These packages include NumPy packages, third-party packages that need to be compiled, and third-party packages that are dependent on dynamic-link libraries (DLLs). Reference third-party packages in Python UDFs
2020-11-04 Optional parameters for enabling MaxCompute Query Acceleration (MCQA) on the MaxCompute JDBC driver Updated description Optional parameters are added for enabling MCQA on the MaxCompute JDBC driver. Usage notes

Updates in October 2020

Release date Feature Category Description Documentation
2020-10-30 MAPJOIN hint supported by SEMI JOIN Updated description SEMI JOIN supports the MAPJOIN hint, which improves the performance of LEFT SEMI or ANTI JOIN and resolves data skew issues. SEMI JOIN
2020-10-30 SORT BY without DISTRIBUTE BY Updated description MaxCompute supports the execution of SORT BY without DISTRIBUTE BY. This resolves data reordering issues and improves the filtering performance during the execution of SQL statements. SELECT syntax
2020-10-30 ZORDER BY clause supported by INSERT New feature The INSERT statement supports the ZORDER BY clause, which can co-locate rows with similar data together. This improves the filtering performance during queries and reduces storage costs. Insert or update data into a table or static partitions (INSERT INTO and INSERT OVERWRITE)
2020-10-30 Deletion of multiple partitions at the same time by using conditional filtering New feature If you want to delete one or more partitions that meet a specific condition at the same time, you can use a conditional expression to delete the partitions that match the condition at the same time. Partition and column operations
2020-10-30 GBK encoding supported for OSS external tables in the CSV or TSV format Updated description The odps.text.option.encoding property supports GBK encoding. Access OSS data by using the built-in extractor
2020-10-30 DATETIME data type supported by time functions YEAR, QUARTER, MONTH, DAY, HOUR, MINUTE, and SECOND Updated description The time functions YEAR, QUARTER, MONTH, DAY, HOUR, MINUTE, and SECOND support the DATETIME data type. Date functions
2020-10-30 WIDTH_BUCKET function New feature The WIDTH_BUCKET function is added. This function returns the ID of the bucket into which the value of a specific field falls. Mathematical functions
2020-10-12 Commercial use of MCQA Updated description The MCQA feature is available for commercial use and charged. Overview
2020-10-10 Modification of the clustering attribute of tables New description The description of how to modify the clustering attribute of tables is added. Table operations

Updates in September 2020

Release date Feature Category Description Documentation
2020-09-24 PyODPS module New description The description of PyODPS jobs is added. PyODPS
2020-09-17 Description of creating a RAM user New description The description of creating a RAM user is added. Create RAM users
2020-09-08 Description of the LOAD command New description The description of the LOAD command is added. LOAD
2020-09-03 Description of Tunnel Upload Updated description The description of the Tunnel Upload command is updated. Tunnel commands
2020-09-01 Best practice to migrate data from BigQuery to MaxCompute New practice A best practice is added to describe how to migrate data from BigQuery to MaxCompute. Migrate data from BigQuery to MaxCompute
2020-09-01 Best practice to migrate data from Amazon Redshift to MaxCompute New practice A best practice is added to describe how to migrate data from Amazon Redshift to MaxCompute. Migrate data from Amazon Redshift to MaxCompute

Updates in August 2020

Release date Feature Category Description Documentation
2020-08-07 Best practice to optimize costs New practice A best practice is added to describe how to optimize the computing, storage, data upload, and data download costs. Overview
2020-08-06 Best practice to assign the Super_Administrator role to a RAM user for a MaxCompute project New practice A best practice is added to describe how to assign the Super_Administrator role to a RAM user for a MaxCompute project. It also describes how to manage members and permissions by using the Super_Administrator role. Set a RAM user as the super administrator for a MaxCompute project
2020-08-05 Best practices to segment Chinese texts by using Jieba in a PyODPS node New practice The following best practices are added: 1. Segment Chinese texts by using Jieba, an open source segmentation tool, and write the segmented words and phrases to a new table in a PyODPS node in DataWorks. 2. Use Jieba to segment Chinese texts based on a custom dictionary referenced by a closure function. Use a PyODPS node to segment Chinese text based on Jieba
2020-08-05 Best practice to grant access to a specific UDF only to a specified user New practice A best practice is added to describe how to set resources (tables or UDFs) to be accessible only to specified users. Grant access to a specific UDF to a specified user
2020-08-05 Best practice to migrate data from Oracle to MaxCompute New practice A best practice is added to describe how to use the data integration feature of DataWorks to migrate data from Oracle to MaxCompute. Best practice to migrate data from Oracle to MaxCompute
2020-08-05 Best practice to migrate data from a self-managed MySQL database on an Elastic Compute Service (ECS) instance to MaxCompute New practice A best practice is added to describe how to use exclusive resource groups for data integration to migrate data from a self-managed MySQL database on an ECS instance to MaxCompute. Migrate data from a user-created MySQL database on an ECS instance to MaxCompute
2020-08-05 odps.text.option.use.quote property supported by SERDEPROPERTIES New description The description of the odps.text.option.use.quote property is added. This property specifies whether to recognize a double quotation mark (") as the column delimiter in a CSV file. Access OSS data by using the built-in extractor

Updates in July 2020

Release date Feature Category Description Documentation
2020-07-29 Best practice to migrate data from MaxCompute to Tablestore New practice A best practice is added to describe how to migrate data from MaxCompute to Tablestore. Migrate data from MaxCompute to Tablestore
2020-07-29 Best practice to migrate data from MaxCompute to Object Storage Service (OSS) New practice A best practice is added to describe how to use the data synchronization feature of DataWorks to migrate data from MaxCompute to OSS. Migrate data from MaxCompute to OSS
2020-07-24 Data encryption New feature The description of the data encryption feature is added. MaxCompute uses Key Management Service (KMS) to encrypt data for storage. This way, MaxCompute can provide static data protection to meet corporate governance and security compliance requirements. Data encryption
2020-07-23 Aggregate functions New description The description of aggregate functions APPROX_DISTINCT, ANY_VALUE, ARG_MAX, and ARG_MIN is added. Aggregate functions
2020-07-23 New data types supported by Python UDFs New description New data types are supported by Python UDFs.
2020-07-23 SQL functions New feature The description of SQL functions that allow you to reference SQL UDFs in SQL scripts is added. SQL functions
2020-07-23 Code-embedded UDFs New feature The description of code-embedded UDFs is added. Code-embedded UDFs allow you to embed Java or Python code into SQL scripts.
2020-07-20 Audit logs New feature The description of the features, scenarios, scope, and fields of audit logs is added. Audit logs
2020-07-15 Best practice to use Tunnel to upload log data to MaxCompute New practice A best practice is added to describe how to use Tunnel to upload log data to MaxCompute. Use Tunnel to upload log data to MaxCompute
2020-07-15 Best practice to use DataHub to migrate log data to MaxCompute New practice A best practice is added to describe how to use DataHub to migrate log data to MaxCompute. Use DataHub to migrate log data to MaxCompute
2020-07-15 Best practice to use the data integration feature of DataWorks to migrate log data to MaxCompute New practice A best practice is added to describe how to use the data integration feature of DataWorks to synchronize data collected by LogHub to MaxCompute. Use DataWorks Data Integration to migrate log data to MaxCompute
2020-07-08 CLONE TABLE New feature The description of the CLONE TABLE statement is added. MaxCompute supports the CLONE TABLE statement that allows you to clone data from one table to another. This statement facilitates data migration and replication. CLONE TABLE

Updates in June 2020

Release date Feature Category Description Documentation
2020-06-03 Tunnel OVERWRITE command New description The description of the Tunnel OVERWRITE command is added. Tunnel commands
2020-06-01 Optimization of access to instances in a VPC from Spark on MaxCompute New description and example The following content is added:
  • Limits on VPC whitelists and regions
  • Examples of merged JSON text when Spark on MaxCompute is used to access different instances
Access instances in a VPC from Spark on MaxCompute
2020-06-01 Policy-based access control and download control New example Examples on how to use policy-based access control and permission revocation are added. Policy-based access control and download control

Updates in January 2020

Release date Feature Category Description Documentation
2020-01-14 Improved SQL compatibility New description The execution rules of the GET_IDCARD_AGE, CONCAT_WS, and LIKE functions are modified.
2020-01-09 Parameter description New description The parameters in examples are described. Project data protection