Log Management for AIOps

Log into an artificial intelligence for IT operations (AIOps) environment with an intelligent, all-in-one, and out-of-the-box log management solution

Overview

Artificial Intelligence for IT operations (AIOps) automates and enhances your IT system. Gartner predicts that large enterprises' exclusive use of AIOps and digital experience monitoring tools to monitor applications and infrastructure will rise from 5% in 2018 to 30% in 2023. Alibaba Cloud Log Service (SLS) is an all-in-one intelligent log management platform that paves your way to AIOps. Log Service can enable you to collect, store, query, analyze, visualize your logging data on a single SaaS platform with intelligent alert and AI patrol, saving significant time on system troubleshooting.

Solution Highlights

  • Fully-Managed Log Management

    Enjoy Alibaba Cloud’s complete, out-of-the-box, fully-managed, and O&M-free log management SaaS platform with 99.9% SLA, including data collection, storage, transformation, query, analysis, visualization, and alerting

  • AI Patrol

    Prevent errors proactively with the machine learning algorithms that recognize data patterns automatically, perform real-time data modeling and prediction, and self-correct based on data markings of user feedback

  • Flexible Alert Management

    Optimize your alert and reduce alert noise with the Log Service noise reduction functionality in a notification method of emails, webhooks, SMS messages, and voice calls

  • Multi-Source Data Storage

    Collect logs, metrics, and trace from 40+ data sources including cloud-based products, hosts, web terminals, and mobile and IoT devices in real-time for centralized storage

  • Aggregated Data Analytics

    Support SQL syntax and multiple aggregate functions to help you achieve quick and customized data analytics for various business needs

  • Full-Link Data Traceability

    Locate errors in your IT management system down to the code level with easy access to full-link trace data of user actions for swift troubleshooting and fault recovery

Learn More about Log Managment for AIOps

Contact Sales

How It Works

Your Challenges

Large volumes of logs stored in containers that disappear by time cause trouble for data analysis. The static threshold-based alarm system lowers accuracy. In addition, long service call chains slow down root cause analysis.

Our Solution

  • This solution adopts SLS Sidecar and DaemonSet to collect system logs from containers dynamically and store them in the Log Service to prevent data loss. Fault detection and data analysis are performed in real-time with elastic expansion for data writing, PB-level data throughput per day, and billion-level query returns in seconds. This solution uses AI-based patrolling that features dynamic thresholds combined with intelligent noise reduction to converge the alarm number and improve alarm accuracy. The fast access to trace-type data associated with logs and metrics data combined with automatic data visualization accelerates and simplifies full-link traceability, so you can locate code-level problems in your IT management system and troubleshoot with ease.

Log Service

An all-in-one service that supports the collection, consumption, shipping, search, and analysis of logs

Learn More

Your Challenges

It is difficult to monitor IT management systems and locate failure in a multi-cloud environment. Device-related failures in data centers cannot be traced to the application layer, and different device specifications increase the complexity and cost of monitoring.

Our Solution

  • This solution enables Log Service Logtail for devices in multi-cloud environments to send host indicators and application logs to the cloud for real-time monitoring. Logtail data are used to associate the foundation layer and application layer of application systems automatically to locate application failures for troubleshooting. The AI-based patrolling capability helps you monitor multi-dimensional system abnormalities dynamically and simplifies rule configuration with sophisticated machine learning algorithms that learn from monitoring data and user feedback.

Log Service

An all-in-one service that supports the collection, consumption, shipping, search, and analysis of logs

Learn More

Global Accelerator

A network acceleration service for your Internet-facing application globally with guaranteed bandwidth and high reliability

Learn More

Virtual Private Cloud

A virtual private cloud service that provides an isolated cloud network to operate resources in a secure environment

Learn More

Your Challenges

Cross-service log migration also takes time and leads to delayed troubleshooting. Alarm services without noise reduction and alarm event management functions tend to cause alarm storms and ineffective alarm turn-off.

Our Solution

  • This solution leverages SLS Prometheus and Grafana to receive alarms from external monitoring systems through Webhook for fast access to alarms data without data migration. The Log Service platform provides flexible noise reduction functions, such as alarm silencing, suppression, and deduplication, as well as multi-dimensional management functions based on real-life scenarios to adjust user groups, shift arrangements, and holiday plans. The alarms will be delivered in real-time, which will be analyzed and classified automatically and distributed to corresponding personnel in rule-based methods for accurate and swift troubleshooting.

Log Service

An all-in-one service that supports the collection, consumption, shipping, search, and analysis of logs

Learn More

Learn More about Log Managment for AIOps

Contact Sales

Security and Compliance

We are committed to providing stable, reliable, secure, and compliant cloud computing infrastructure services across major jurisdictions around the world.
Learn More
  • CSA STAR
  • ISO 27001
  • SOC2 Type II Report
  • C5
  • MLPS 2.0
  • MTCS

Start with Alibaba Cloud Solutions

Learn and experience the power of Alibaba Cloud.

Contact Sales