The Big Data & AI Product Monthly Newsletter for May 2026 covers technology updates, product and feature releases, market updates, and customer application practices to help you quickly understand the latest developments in Alibaba Cloud's Big Data & AI offerings.

By adding LLM on Modul Studio and deeply integrating them into PAI product modules, we provide users with more out-of-the-box model choices to meet the growing demand for model services.
MaxCompute SQL adds 6 new scenario-based AI Functions including data classification, extraction, sentiment analysis, and translation.
MaxCompute SQL now supports array expansion via the UNNEST operator, simplifying nested data processing. It also supports coordination with JOIN syntax via ON conditions for associative filtering, suitable for array split statistics, struct field extraction, and associative match filtering scenarios.
MaxCompute SQL script mode now supports temporary tables. Within the script execution lifecycle, you can create temporary tables via CREATE TEMPORARY TABLE to cache intermediate results for multiple reuses within the same script, reducing repeated query overhead.
MaxQA supports automatic elastic upper limit CU (Autoscale). Interactive Quota types can be set with a step size of 25 CU, and the configurable automatic elastic upper limit CU value range is [0, Level-1 Quota AutoscaleLimitCU].
MaxCompute supports storing Apache Iceberg tables on Alibaba Cloud Object Storage Service (OSS), and manages metadata, permissions, and data lifecycle through MaxCompute for efficient querying, writing, and management. Iceberg tables are compatible with open-source engines like Spark, supporting multi-engine sharing of the same data, suitable for Lakehouse architecture.
With the Blob type, raw files, metadata, and annotation information of multimodal data can be uniformly stored in the same MaxCompute table, queried and maintained via SQL, and processed in batch through MaxFrame and SQL UDF.
MaxFrame AI Function further integrates with Alibaba Cloud Bailian Platform. Without the need to self-encapsulate UDFs or maintain DashScope Keys, you can directly call Model Studio's multimodal models within MaxFrame DataFrame expressions, supporting declarative invocation, automatic batch distributed inference, row-level fault tolerance, and precise reruns.
The MaxFrame multimodal operator module officially extends its capability boundary to audio, adding the Series.mf.audio accessor, covering the full pipeline of speech processing.
For industry scenarios such as intelligent driving, smart cabin, and visual perception, MaxFrame officially releases the MaxFrame Intelligent Driving Video Processing Skill. It covers core pipelines including video frame extraction, keyframe labeling, image labeling/vectorization, and image table embedding appending.
Output dtypes can now be inferred via function type hints, eliminating the need to manually write dtypes; apply / apply_chunk can now capture raw failure data when errors occur, facilitating issue troubleshooting.
Added with_network_options, supporting VPC network link configuration for private network environment access.
New MaxFrame DataFrame ↔ MaxCompute table column-level lineage tracking: covering aggregation / join / projection / selection / setitem / source / sink scenarios, viewable in DataWorks Data Lineage for data governance and impact analysis.
Hologres introduces an AI Assistant powered by large models and intelligent Agent systems, providing comprehensive capabilities including Q&A support for Hologres, automated operations, performance optimization guidance, development assistance, and data analysis.
Provides higher OpenAPI free call quotas for Standard and Professional Edition customers, avoiding "stop on over-limit" and ensuring continuous API access with elastic scalability.
The AI Assistant service integrates with IM platforms such as DingTalk and Feishu, enabling natural language interactions for task diagnosis, alert analysis, and operations directly from the IM client — transforming from passive alert response to proactive intelligent operations.
During the code review process in data development, reviewers can leverage AI capabilities to generate summarized review suggestions for review tickets, significantly improving review efficiency.
DataWorks Data Agent adds three new panels: context timing waterfall chart visualizes the call chain, deliverables details supports right-side preview for code, documents, and images, and session environment centrally displays workspace information, comprehensively upgrading interaction experience and readability.
For Python/MaxFrame code logic that involves MaxCompute tables, datasets, or OSS paths as inputs or outputs, AI can intelligently parse the data lineage and register it to the lineage chain with one click, effectively improving development efficiency.
DataWorks Data Quality now adds multiple quality rule templates covering validity, consistency, accuracy, and timeliness. Users can perform quality validation for scenarios including JSON field format validation, intra-table field value comparisons, aggregated value fluctuation comparisons by specified dimensions and metric fields, and time-series data delay detection — enabling users to quickly implement metric-oriented data quality governance.
Addresses user requirements for expanded data source support and enhanced data synchronization capabilities within Data Integration tasks, enabling seamless data movement across more heterogeneous data sources.
The search scenario compute-storage separation feature DFS now supports version 8.17, and single availability zone instances can be scaled to multi availability zones, improving search performance and high availability.
Milvus introduces vector lake capabilities based on External Collection, supporting direct querying of vector data stored in DLF data lakes without repeated import into Milvus. This effectively reduces storage costs and achieves unified integration of data lake and vector retrieval capabilities.
Milvus Manager is a visual management tool provided by Alibaba Cloud Vector Retrieval Service Milvus. Users can complete instance data object management, data operations, vector retrieval debugging, user and permission management, and runtime status viewing through the Web console. The tool also supports integration with Alibaba Cloud products such as DLF and RAM, further improving development, debugging, and operations efficiency.
Vector Retrieval Service Milvus now supports upgrading single availability zone instances to same-city disaster recovery instances, helping users enhance business high availability and disaster recovery capabilities, meeting scenarios with higher stability and continuity requirements.
One SQL Query to Generate an Ad: How Hologres Unifies Creative Production and Campaign Analytics
19 posts | 0 followers
FollowAlibaba Cloud Big Data and AI - May 15, 2026
Alibaba Cloud Big Data and AI - April 13, 2026
CloudSecurity - May 18, 2026
CloudSecurity - February 11, 2026
CloudSecurity - April 15, 2026
CloudSecurity - March 16, 2026
19 posts | 0 followers
Follow
ECS(Elastic Compute Service)
Elastic and secure virtual cloud servers to cater all your cloud hosting needs.
Learn More
Big Data Consulting for Data Technology Solution
Alibaba Cloud provides big data consulting services to help enterprises leverage advanced data technology.
Learn More
MaxCompute
Conduct large-scale data warehousing with MaxCompute
Learn More
Big Data Consulting Services for Retail Solution
Alibaba Cloud experts provide retailers with a lightweight and customized big data consulting service to help you assess your big data maturity and plan your big data journey.
Learn MoreMore Posts by Alibaba Cloud Big Data and AI