All Products
Search
Document Center

Apsara Video SDK:What is ApsaraVideo MediaBox SDK

Last Updated:Nov 04, 2025

ApsaraVideo MediaBox SDK (hereinafter referred to as ApsaraVideo MediaBox SDK) is an all-in-one SDK that integrates core capabilities such as live stream ingest, video playback, short video production, retouching, and audio effects. It supports multi-terminal access on mobile devices, desktops, and the web, and comes with open-source UI components and product-level demos. You can integrate the required SDKs or use a low-code approach to quickly build applications based on your business needs.

Service architecture

Built on ApsaraVideo Live, ApsaraVideo VOD, and Intelligent Media Services, MediaBox SDK covers core processes in live streaming and VOD businesses, from shooting, editing, uploading, and streaming to playback. It also integrates client-side enhancement, retouching, and other video AI capabilities, providing product-level demos and low-code integration through AUI Kits.

image

Benefits

  • Integrated SDK solution

    • ApsaraVideo MediaBox SDK integrates the core capabilities of ApsaraVideo Live and ApsaraVideo VOD, and reuses core components to reduce the SDK package size.

    • You can integrate multiple SDKs simultaneously to improve integration efficiency and reduce R&D costs.

  • Individual SDK solution

    You can combine and package SDKs to meet your specific requirements.

  • Product-level demos

    Product-level demos offer complete functionality for live streaming and Video on Demand (VOD), enabling rapid integration.

  • Diverse services

    A comprehensive metric system ensures service quality. Audio and video AI technologies improve utilization and distribution efficiency.

  • Stable features

    ApsaraVideo MediaBox SDK has been tested through hundreds of millions of requests and provides stable and reliable audio and video services. You can use it to generate smooth and clear audio and video content.

Demo experience

The ApsaraVideo MediaBox SDK Demo provides a complete, production-ready interactive UI and business source code to help developers quickly implement business requirements and save development time and costs. For demo downloads and detailed instructions, see Demo Experience.

Scenarios

Audio and video playback

Provides highly stable, high-performance, cross-platform audio and video decoding capabilities, suitable for audio and video consumption scenarios such as live streaming, video on demand, and audio listening.

2023-08-03_175845.png

Scenario

Description

Short videos

Use the Player SDK with network optimization, preloading, and local caching features to achieve instant playback and provide a smooth playback experience.

Long videos

Use the Player SDK's adaptive bitrate streaming, speed adjustment without pitch change, and client-side image enhancement features. Combined with cloud services, it provides various features for long video viewing such as progress bar thumbnails, non-invasive live comments, and ASS captions, while ensuring video security through multiple security policies.

Online education

The Player SDK's cross-platform coverage fully meets business requirements, supporting mobile (Android, iOS), desktop (Windows, macOS, Linux), and cross-platform solutions (Web, Flutter). It also supports standard encryption, private encryption, and DRM encryption to effectively implement copyright protection.

Live streaming and Live to VOD

The Player SDK supports common protocols for both live streaming and VOD, seamlessly integrates with live services, and supports live time-shifting, simulated live streaming, and live recording playback, creating a complete audio and video business loop.

Live streaming ingest and co-streaming interaction

Improves resistance to network fluctuations and transmission speed, providing stable and ultra-fast live streaming experiences, commonly used in education, entertainment, gaming, and co-streaming scenarios.

2023-08-03_175928.png

Scenario

Description

Education live streaming

Provides Interactive Messages SDK for real-time text interaction, enhancing teacher-student interaction. The Live Stream Push SDK allows teachers to answer students' questions anytime, anywhere. Cloud recording and transcoding features enable students to review courses and reinforce knowledge points at any time, enhancing learning effectiveness.

Entertainment live streaming

Provides real-time chat, likes, and reward features to enhance interaction between streamers and viewers. It also strictly monitors content security (such as pornography and terrorism) and effectively reduces review costs through intelligent live stream review.

Video game live streaming

Provides screen recording technology to merge current game screens with camera capture, and initiates streaming through the Live Stream Push SDK, which supports screen recording. It also provides Interactive Messages SDK for chat, likes, rewards, and other interactive behaviors, and offers live recording playback for replaying exciting game content.

Co-streaming interaction

Provides stable, high-quality, ultra-low latency interactive real-time audio and video services. Supports real-time co-streaming between streamers, between streamers and viewers, cross-room streamer interaction, and other multi-screen real-time video interactions. The interactive video streams can be pushed to the live center for large-scale distribution to viewers who are not participating in co-streaming.

Short video production

Quickly launch essential features for short video capture, production, and playback, suitable for entertainment, social networking, education, news, e-commerce, and other scenarios that require short video shooting, production, and creative applications.

2023-08-03_175958.png

Scenario

Description

News and information short videos

Provides mobile short video recording for real-time news capture and sharing the latest information. Integrates editing, uploading, transcoding, storage, resource management, and cloud editing to enable quick news production and publishing.

Social entertainment short videos

Enables quick shooting on mobile devices, combined with retouching filters, face stickers, dynamic stickers, subtitles, music, editing, and splicing features to create personalized videos for quick uploading and real-time sharing.

E-commerce product short videos

Provides quick shooting and short film production capabilities to comprehensively and three-dimensionally showcase product features. Combined with animated images, subtitles, music, and other effects, along with object recognition, intelligent recommendations, and shop-while-watching technologies, it effectively improves purchase conversion.

Features

Integrated SDKs

ApsaraVideo MediaBox SDK currently provides integrated SDKs for four scenarios: Basic Live Streaming SDK, Interactive Live Streaming SDK, Short Video Scenario SDK, and Standard Integrated SDK.

Integrated SDK

Basic Live SDK

Interactive Live SDK

Short Video SDK

Standard Integrated SDK

SDK name

AliVCSDK_BasicLive

AliVCSDK_InteractiveLive

AliVCSDK_UGC

AliVCSDK_Standard

Included SDKs

  • Player

  • Live stream ingest

  • Basic retouching

  • Ultra-low latency live streaming

  • Player

  • Live stream ingest

  • Basic retouching

  • Ultra-low latency live streaming

  • RTC co-streaming

  • Player

  • Short video

  • Basic retouching

  • Player

  • Live stream ingest

  • Basic retouching

  • Ultra-low latency live streaming

  • RTC co-streaming

  • Short video

Intended application scenarios

Suitable for online education, music teaching, and other live streaming scenarios.

Building on basic live streaming scenarios, it provides stable and smooth point-to-point and multi-person real-time audio and video call services, supporting multi-person video or audio-only interaction.

Suitable for end-to-end short video creation, paired with powerful video playback features to provide smooth and high-definition playback experiences, meeting the needs of mainstream audio and video scenarios.

Suitable for various live streaming scenarios such as lifestyle streaming, online education, video social networking, and gaming entertainment. Provides facial retouching, face shaping, makeup, filters, stickers, and other editing capabilities. Through retouching effects, it supports more creative applications in live streaming and video production.

Individual module SDKs

Individual module SDKs mainly include the following: Player SDK, Live Stream Push SDK, Short Video SDK, and Retouching SDK.

Feature

Player SDK

Live Stream Push SDK

ARTC SDK

Short Video SDK

Retouching SDK

Live streaming by streamer

Support

  • Streamer-viewer co-streaming

  • Streamer challenge

Support

(Interactive version)

Support

Audio and video calls

Support

Video recording

Support

Video editing

Support

Video upload

Support

Live streaming

Support

VOD

Support

Retouching

Support

Gesture recognition

Supported

Auto chroma key

Support

You can select an SDK or a combination of SDKs based on your actual requirements. For more information about the combination methods for each scenario and SDK selection, see SDK Selection and Download. We recommend that you use the standard integrated SDK of MediaBox Audio and Video SDK to effectively resolve dependency library conflicts caused by integrating multiple SDKs separately.

License management

Player Mobile SDK, Retouching SDK, and other SDKs are now officially charged, while other SDKs are free but require license authorization. You can purchase the official version through the ApsaraVideo MediaBox SDK purchase page. More favorable package deals are also available. For more information, see Player SDK billing. For license authorization instructions after purchase, see Integrate License.