All Products
Search
Document Center

Managed Service for OpenTelemetry:Analyze traces

Last Updated:Mar 11, 2026

The Trace Analysis page consolidates charts, filters, trace lists, application topology, and end-to-end aggregation for a single application. Use these tools to identify latency bottlenecks, locate errors, and map service dependencies.

Prerequisites

Before you begin, make sure that you have:

  • A Managed Service for OpenTelemetry instance with trace data reporting enabled

  • An application that appears on the Applications page

Open the Trace Analysis page

  1. Log on to the Managed Service for OpenTelemetry console. In the left-side navigation pane, click Applications.

  2. On the Applications page, select a region in the top navigation bar and click the name of the application.

  3. On the Applications page, select a region in the top navigation bar and click the name of the application.

  4. In the left-side navigation pane, click Trace Analysis.

    Trace Analysis page

Filter traces

Narrow down the trace list by time range, duration, errors, span names, servers, or tags.

  1. In the upper-right corner of the Trace Analysis page, select a predefined or custom time range.

  2. Click the search box in the upper part of the page. In the drop-down list, set one or more of the following filter conditions and click Search:

    FilterHow to use
    Time consumedEnter values in the Min and Max fields to filter traces by duration range
    ExceptionSelect this option to show only traces that contain errors
    Span nameSpecify one or more span names to narrow results
    Machine/IPSpecify one or more server IP addresses to narrow results
    Label (Tag)Specify one or more tags to narrow results

    You can combine multiple span names, IP addresses, and tags in a single search.

Enable full real-time analysis

By default, the Trace Analysis page displays sampled data summaries. After you specify a sampling ratio for span data, you can enable full real-time analysis to query the complete sampled dataset in real time rather than a statistical summary.

Important

If no sampling ratio is specified or the sampling ratio is set to 100%, enabling full real-time analysis displays all collected span data within your Alibaba Cloud account.

  1. On the Trace Analysis page, click Enable full real-time analysis in the upper-right corner.

  2. Select a time range.

    After you select a time range, the page displays the complete sampled data for that period and updates in real time.

Interpret the analysis charts

The Time change trend section contains three charts that help you spot patterns in span volume, duration, and errors over time.

ChartWhat it shows
Span count trendTotal spans and exceptional spans over time
Duration trendSpan duration over time
Duration distributionHow span durations are distributed across buckets

To interact with the charts:

  • Hover over a data point to view detailed statistics.

  • Select a time range within a chart to zoom in on that period.

  • Click a legend entry to hide or show that data series.

Time change trend section

Use the span name and Machine/IP lists

The Span name and Machine/IP lists show all spans and servers within your Alibaba Cloud account. Use these lists to quickly locate and filter by a specific span or server.

Span name list

In either list:

  • Enter a keyword in the search box and click the search icon to filter entries.

  • Click Span quantity or Response time in the upper-right corner to sort entries in descending order.

  • Click a span name or IP address to apply it as a filter condition on the Trace Analysis page.

Group traces by field

Group traces by a specific attribute to compare behavior across span names, servers, or tags.

  1. On the Trace Analysis page, select a field from the Group By drop-down list.

  2. Click All Span or API call:

    OptionScope
    All SpanGroups traces based on the selected field across all spans
    API callGroups traces based on the selected field for the first span (the entry point) only

    Click a field value to view the matching traces.

View trace details

The Traces tab lists each trace with its ID, start time, span name, duration, and server IP address.

Traces tab

On the Traces tab:

  • Click a trace ID to open the waterfall chart for that trace.

  • Hover over a span name to view its duration, start time, tags, and log events.

View the application topology

The Application Topology tab maps dependencies between the current application and its upstream or downstream services. Use it to understand how requests flow between services and where latency concentrates.

Note

For performance reasons, the topology aggregates request data from up to 5,000 traces.

Application topology

Topology metrics

Each dependency line between two applications displays a label such as 90%/3x, which combines the request percentage and call multiplier. You can also view the duration ratio between applications.

MetricDefinitionExample
Request percentageThe ratio of requests from Application A that involve a call to Application B, relative to the total requests received by A. Some requests are sent to Application B because Application A may filter all the requests based on an IF statement.A receives 100 requests. 90 of those call B. Request percentage = 90%.
Call multiplierThe ratio of spans generated when A calls B, relative to the total spans received by A.A receives 100 spans. Calls from A to B generate 300 spans. Call multiplier = 3x.
Combined labelFormat: {request percentage}/{call multiplier}.90%/3x means 90% of requests in A call B, and each such request calls B three times on average.

View end-to-end aggregation

The End-to-End Aggregation tab aggregates traces by span name and application name, providing a breakdown of request counts, durations, and error rates across the entire call chain. Use it to find which spans consume the most time or produce the most errors.

Note

For performance reasons, end-to-end aggregation covers up to 5,000 traces.

End-to-End Aggregation tab

Aggregation metrics

MetricDescriptionExample
Request Count / Request PercentageThe ratio of the current span's request count to the total request count.With 100 total requests and a 10% request percentage, 10 requests reach the current span.
Span Amount / Request MultiplierHow many times each request calls the current span on average. Calculated as span count divided by request count.1.5x means each request calls this span 1.5 times on average.
Average Self Elapsed Time / PercentageThe exclusive duration of a span, excluding time spent in child spans. For asynchronous calls, child span durations are not deducted.Span A takes 10 ms total. Its child Span B takes 8 ms. The self elapsed time for Span A is 2 ms.
Exception Count / Exception PercentageThe ratio of exceptional requests to total requests. The exception count may differ from the number of exceptional requests when the request multiplier exceeds 1, because a single exceptional request can contain multiple exceptions.3% means 3 out of every 100 requests are exceptional.

On the End-to-End Aggregation tab:

  • Hover over a span name (shown in blue) to view related traces. Click a trace ID to open its waterfall chart.

  • Click an application name to go to the application overview.

View the waterfall chart of a trace

The waterfall chart displays the span name, timeline, application name, start time, server IP address, and status of each span in a trace. Use this view to trace the execution path of a request and pinpoint where latency or errors occur.

Note

The IP Address column may display IP addresses or server names, depending on your settings on the Application Settings page. For more information, see Manage applications and tags.

Waterfall chart

In the Span Name column, hover over a span name to view its duration, start time, tags, and log events.

Span detail overlay showing tags and log events