This topic shows you how to use EMR AI Assistant with Serverless StarRocks and provides conversation examples. You can adapt the examples to your specific business scenarios.
Background
You can use EMR AI Assistant, also known as EMR Agent, to quickly query resource information, initiate operations, diagnose component anomalies, and obtain technical support, improving your O&M efficiency and user experience. It is an intelligent tool from Alibaba Cloud E-MapReduce (EMR) designed to help you manage and maintain EMR clusters more efficiently.
As of January 9, 2024, in addition to supporting EMR on ECS, EMR AI Assistant now supports Serverless StarRocks. EMR AI Assistant is currently in public preview, and its features are continuously updated. We invite you to try it and provide feedback.
Supported regions
China (Hangzhou), China (Shanghai), China (Qingdao), China (Beijing), China (Ulanqab), and China (Shenzhen).
Feature overview
EMR AI Assistant includes several modules, such as AI chat, anomaly diagnosis, and optimization suggestions.
|
Module |
Description |
|
AI chat |
When you are unfamiliar with operational procedures or encounter issues while using or maintaining E-MapReduce, you can ask EMR AI Assistant for help. EMR AI Assistant provides instant AI-powered Q&A services, giving you quick access to technical support, solutions, or documentation. |
|
Anomaly diagnosis |
If you encounter abnormal component statuses or failed O&M operations, you can describe the issue in detail. EMR AI Assistant uses intelligent diagnostic tools to quickly troubleshoot the problem, providing a detailed diagnostic report and solution to improve your O&M efficiency. |
|
Optimization suggestions |
For Serverless StarRocks instances, EMR AI Assistant automatically scans query history and system metrics to accurately identify the root causes of slow SQL queries and provide actionable optimization suggestions. It also dynamically recommends optimal configuration parameter values based on workload characteristics, reducing the effort of manual parameter tuning. |
Quick start
-
Open EMR AI Assistant
Log in to the EMR console. In the sidebar, click the
icon to open the chat interface. -
Start a conversation
In the input box, ask your question or state your requirement in natural language. For example:
-
"How do I upgrade the instance version?"
-
"How do I upgrade the specifications of a compute group?"
-
"How do I migrate from a self-managed StarRocks to Alibaba Cloud StarRocks?"
The system automatically recognizes your intent and returns a corresponding result.
-
-
The content produced by this service is generated by an artificial intelligence model. We cannot fully guarantee the security, reliability, availability, or stability of this model, nor can we ensure that the generated content is compliant, complete, and accurate. The generated content does not represent the views of Alibaba Cloud. While we continuously improve the quality of our services, we make no commitment to their availability or reliability and are not responsible for the results of your use of this service. Please evaluate the generated content with caution and do not rely on it excessively. You are solely responsible for any loss or damage to yourself, other users, or Alibaba Cloud resulting from actions based on this content.
-
You are solely responsible for all your actions when using this service. Please ensure that any content you publish, upload, link to, or provide through the service is legal and compliant, does not disrupt public order, does not infringe on the legal rights of others, and does not involve fabricating or spreading false information.
-
The diagnostic features of this service collect relevant data from the target instance, including its basic information, system table contents, monitoring metrics, and critical error information from events and logs.
Feature details
AI chat
-
Function
Use natural language to quickly get information about StarRocks product features, architecture, version differences, best practices, and cloud deployment solutions.
-
Typical questions
-
What issues are fixed in the latest version of StarRocks?
-
What are the instances under my account?
-
What does the
pipeline_exec_thread_pool_thread_numparameter do?
For example, after a user enters the question
What are the advantages of the storage-compute separation architecture of Serverless StarRocks?, the AI assistant retrieves relevant documents and returns a structured answer covering four core advantages: 1. Inexpensive and seamlessly scalable storage (supports object storage such as Alibaba Cloud OSS, S3, and HDFS, offering low storage costs and nearly unlimited capacity expansion); 2. Elastically scalable computing power (CN nodes are stateless and can be dynamically added or removed in seconds without data redistribution or cross-node shuffles); 3. High-performance cache mechanism (Hot data is cached on local disks. When a cache hit occurs, performance is comparable to storage-compute integrated architectures. The cache lifecycle is managed automatically); 4. Enhanced resource isolation and stability (Compute and storage resources are completely decoupled). -
Anomaly diagnosis
-
Function
-
Check the health status of an instance using natural language. The assistant analyzes issues like CPU spikes, memory leaks, full disks, query timeouts, bad tablets, and metadata lock conflicts, then provides repair steps.
-
Retrieve the running status of SQL queries in an instance by using natural language. It provides in-depth insights by analyzing top SQL queries, aggregating different SQL types, and tracking daily/weekly trends in SQL execution. It also supports detailed diagnosis and analysis of individual SQL queries.
-
-
Typical questions
-
The CPU usage spiked around 10:00. What was the cause?
-
What are the top 10 slow SQL queries for the instance today, and how do they compare to yesterday?
After EMR AI Assistant determines the cluster type and performs a Top SQL query, it returns a Top SQL Query Summary (including cluster name, time range, and sort order) and a Top 10 SQL Summary Table. The table columns include QueryID, SQL fingerprint (first 80 characters), CPU (ms), Mem (MB), Scan (MB), and Status. The table is sorted by CPU Time in descending order, allowing you to quickly see which SQL queries consume the most resources.
-
Optimization suggestions
-
Function
-
Intelligently parses slow query logs to highlight hotspots and data skew in SQL execution plans, and generates optimization suggestions for indexes, partitions, and materialized views.
-
Intelligently analyzes the current running status and load of an instance and provides parameter tuning suggestions.
-
-
Typical questions
-
QueryID: 6cb2***-****-****-**-******8716 is currently slow. How can I optimize it?
-
Disk usage has been consistently high for the past week. How can I resolve this?
EMR AI Assistant performs a Query Profile analysis on the StarRocks query (Query ID: 056b***4359) and identifies a serious performance issue. Query Performance Overview: Total execution time is 4.571 seconds, and the CPU time is 3 minutes and 20 seconds (98.31%). It scans 592 million rows and returns 591 million rows (almost a full table scan), reads 3.14 GB from disk, and produces a result size of 25.51 GB. Key Performance Bottleneck Analysis: 1. OLAP_SCAN operator (3.163 seconds) - Scans nearly 600 million rows without an effective predicate filter. The root cause is a missing WHERE clause or a filter condition that cannot be pushed down. 2. PROJECT operator (3.416 seconds) - Processes a projection for 591 million rows. The root cause is likely a SELECT statement containing a large number of columns or complex expressions.
-
Feedback and support
When you use EMR AI Assistant, rate its responses using the
or
icons. Your feedback helps EMR AI Assistant learn and improve to provide you with more accurate and efficient service.