On May 20, Alibaba Cloud officially released the AI-native global intelligent O&M platform STAROps.
The platform uses large model and agent technology as the core engine and Alibaba Cloud observable product system as the data base to deeply integrate cross-domain observable data with large language model reasoning capability. Users only need to define operation and maintenance objectives in natural language, and operation and maintenance agents can independently complete the full closed loop of dynamic planning, safe execution and result verification.
STAROps is designed around four capability dimensions: Sense global perception, Target goal orientation, Autonomy autonomous operationand maintenance, and Resilience business continuity. It provides three core functions:
At the technical architecture level, the competitiveness of STAROps is reflected in four dimensions.
Unified Observability Data
Unifies logs, metrics, traces, events, topology, and changes with PB-scale daily ingestion, EB-scale storage, low-latency analysis, multi-AZ deployment, and 99.95% reliability.
Operational Digital Twin
Builds a unified graph model (UModel) from entities, relationships, observability data, and operational knowledge, helping agents understand systems, trace blast radiuses, and reason about root causes in a shared context.
AI Analytics Operators
Supports anomaly detection, log clustering, trace analysis, performance profiling, and change analysis, reducing the cost of processing massive raw data while improving diagnostic efficiency and result stability.
Continuous Improvement Flywheel
Builds a realistic evaluation loop with simulation, fault injection, diagnostic assessment, and feedback, creating a measurable, roll-back-ready system for continuous agent improvement.
The essence of cloud computing lies in orchestrating computing resources as a service in an efficient way, and what STAROps is doing is extending this principle to operations and maintenance. Manpower-intensive O&M tasks are intelligently performed by using agents to schedule large-scale O&M operations. The digital employee mechanism of STAROps provides enterprises with this progressive path: it not only supports embedding AI in existing processes to improve efficiency, but also supports building a new agent native O&M mode.
In terms of access form, STAROps provides a variety of access solutions such as OpenAPI and MCP integration, page embedding, and mainstream IM access. Enterprises can release value in existing workflows at the lowest migration cost. The built-in manual approval mechanism of the platform ensures that key decision nodes are still under manual control, striking a balance between the efficiency of agent independent execution and security compliance.
Along with the product release, Alibaba Cloud synchronizes the open source UModel unified data model project with the RCA-100 evaluation benchmark set, and jointly launched the "Enterprise Common Semantic Standard Industry Initiative" with more than 10 industry partners and academic institutions such as the Institute of Information and Communications Technology, Xiaopeng Automobile, and the Software Institute of the Chinese Academy of Sciences.
Currently, STAROps has been officially launched on the Alibaba Cloud official website. As AI reshapes every aspect of software development, O&M, as the last line of defense to ensure business resilience, is ushering in a paradigm transition from tool assistance to agent autonomy. Alibaba Cloud uses STAROps as a starting point to push Agentic Ops from concept to production-level implementation.
The Second Half of the Enterprise Agent Era: How to Make Agents Smarter the More They Are Used?
740 posts | 60 followers
FollowAlibaba Cloud Native Community - May 26, 2026
Alibaba Cloud Native Community - November 6, 2025
Alibaba Cloud Native Community - December 6, 2022
Alibaba Cloud Community - May 26, 2026
Alibaba Developer - March 8, 2021
Alibaba Cloud Community - March 2, 2022
740 posts | 60 followers
Follow
CloudMonitor
Automate performance monitoring of all your web resources and applications in real-time
Learn More
Managed Service for Prometheus
Multi-source metrics are aggregated to monitor the status of your business and services in real time.
Learn More
Simple Log Service
An all-in-one service for log-type data
Learn More
IoT Solution
A cloud solution for smart technology providers to quickly build stable, cost-efficient, and reliable ubiquitous platforms
Learn MoreMore Posts by Alibaba Cloud Native Community