ApsaraVideo MediaBox SDK (hereinafter referred to as ApsaraVideo MediaBox SDK) is an all-in-one SDK that integrates core capabilities such as live stream ingest, video playback, short video production, retouching, and audio effects. It supports multi-terminal access on mobile devices, desktops, and the web, and comes with open-source UI components and product-level demos. You can integrate the required SDKs or use a low-code approach to quickly build applications based on your business needs.
Service architecture
Built on ApsaraVideo Live, ApsaraVideo VOD, and Intelligent Media Services, MediaBox SDK covers core processes in live streaming and VOD businesses, from shooting, editing, uploading, and streaming to playback. It also integrates client-side enhancement, retouching, and other video AI capabilities, providing product-level demos and low-code integration through AUI Kits.
Benefits
Integrated SDK solution
ApsaraVideo MediaBox SDK integrates the core capabilities of ApsaraVideo Live and ApsaraVideo VOD, and reuses core components to reduce the SDK package size.
You can integrate multiple SDKs simultaneously to improve integration efficiency and reduce R&D costs.
Individual SDK solution
You can combine and package SDKs to meet your specific requirements.
Product-level demos
Product-level demos offer complete functionality for live streaming and Video on Demand (VOD), enabling rapid integration.
Diverse services
A comprehensive metric system ensures service quality. Audio and video AI technologies improve utilization and distribution efficiency.
Stable features
ApsaraVideo MediaBox SDK has been tested through hundreds of millions of requests and provides stable and reliable audio and video services. You can use it to generate smooth and clear audio and video content.
Demo experience
The ApsaraVideo MediaBox SDK Demo provides a complete, production-ready interactive UI and business source code to help developers quickly implement business requirements and save development time and costs. For demo downloads and detailed instructions, see Demo Experience.
Scenarios
Audio and video playback
Provides highly stable, high-performance, cross-platform audio and video decoding capabilities, suitable for audio and video consumption scenarios such as live streaming, video on demand, and audio listening.

Scenario | Description |
Short videos | Use the Player SDK with network optimization, preloading, and local caching features to achieve instant playback and provide a smooth playback experience. |
Long videos | Use the Player SDK's adaptive bitrate streaming, speed adjustment without pitch change, and client-side image enhancement features. Combined with cloud services, it provides various features for long video viewing such as progress bar thumbnails, non-invasive live comments, and ASS captions, while ensuring video security through multiple security policies. |
Online education | The Player SDK's cross-platform coverage fully meets business requirements, supporting mobile (Android, iOS), desktop (Windows, macOS, Linux), and cross-platform solutions (Web, Flutter). It also supports standard encryption, private encryption, and DRM encryption to effectively implement copyright protection. |
Live streaming and Live to VOD | The Player SDK supports common protocols for both live streaming and VOD, seamlessly integrates with live services, and supports live time-shifting, simulated live streaming, and live recording playback, creating a complete audio and video business loop. |
Live streaming ingest and co-streaming interaction
Improves resistance to network fluctuations and transmission speed, providing stable and ultra-fast live streaming experiences, commonly used in education, entertainment, gaming, and co-streaming scenarios.

Scenario | Description |
Education live streaming | Provides Interactive Messages SDK for real-time text interaction, enhancing teacher-student interaction. The Live Stream Push SDK allows teachers to answer students' questions anytime, anywhere. Cloud recording and transcoding features enable students to review courses and reinforce knowledge points at any time, enhancing learning effectiveness. |
Entertainment live streaming | Provides real-time chat, likes, and reward features to enhance interaction between streamers and viewers. It also strictly monitors content security (such as pornography and terrorism) and effectively reduces review costs through intelligent live stream review. |
Video game live streaming | Provides screen recording technology to merge current game screens with camera capture, and initiates streaming through the Live Stream Push SDK, which supports screen recording. It also provides Interactive Messages SDK for chat, likes, rewards, and other interactive behaviors, and offers live recording playback for replaying exciting game content. |
Co-streaming interaction | Provides stable, high-quality, ultra-low latency interactive real-time audio and video services. Supports real-time co-streaming between streamers, between streamers and viewers, cross-room streamer interaction, and other multi-screen real-time video interactions. The interactive video streams can be pushed to the live center for large-scale distribution to viewers who are not participating in co-streaming. |
Short video production
Quickly launch essential features for short video capture, production, and playback, suitable for entertainment, social networking, education, news, e-commerce, and other scenarios that require short video shooting, production, and creative applications.

Scenario | Description |
News and information short videos | Provides mobile short video recording for real-time news capture and sharing the latest information. Integrates editing, uploading, transcoding, storage, resource management, and cloud editing to enable quick news production and publishing. |
Social entertainment short videos | Enables quick shooting on mobile devices, combined with retouching filters, face stickers, dynamic stickers, subtitles, music, editing, and splicing features to create personalized videos for quick uploading and real-time sharing. |
E-commerce product short videos | Provides quick shooting and short film production capabilities to comprehensively and three-dimensionally showcase product features. Combined with animated images, subtitles, music, and other effects, along with object recognition, intelligent recommendations, and shop-while-watching technologies, it effectively improves purchase conversion. |
Features
Integrated SDKs
ApsaraVideo MediaBox SDK currently provides integrated SDKs for four scenarios: Basic Live Streaming SDK, Interactive Live Streaming SDK, Short Video Scenario SDK, and Standard Integrated SDK.
Integrated SDK | Basic Live SDK | Interactive Live SDK | Short Video SDK | Standard Integrated SDK |
SDK name | AliVCSDK_BasicLive | AliVCSDK_InteractiveLive | AliVCSDK_UGC | AliVCSDK_Standard |
Included SDKs |
|
|
|
|
Intended application scenarios | Suitable for online education, music teaching, and other live streaming scenarios. | Building on basic live streaming scenarios, it provides stable and smooth point-to-point and multi-person real-time audio and video call services, supporting multi-person video or audio-only interaction. | Suitable for end-to-end short video creation, paired with powerful video playback features to provide smooth and high-definition playback experiences, meeting the needs of mainstream audio and video scenarios. | Suitable for various live streaming scenarios such as lifestyle streaming, online education, video social networking, and gaming entertainment. Provides facial retouching, face shaping, makeup, filters, stickers, and other editing capabilities. Through retouching effects, it supports more creative applications in live streaming and video production. |
Individual module SDKs
Individual module SDKs mainly include the following: Player SDK, Live Stream Push SDK, Short Video SDK, and Retouching SDK.
Feature | Player SDK | Live Stream Push SDK | ARTC SDK | Short Video SDK | Retouching SDK |
Live streaming by streamer | Support | ||||
| Support (Interactive version) | Support | |||
Audio and video calls | Support | ||||
Video recording | Support | ||||
Video editing | Support | ||||
Video upload | Support | ||||
Live streaming | Support | ||||
VOD | Support | ||||
Retouching | Support | ||||
Gesture recognition | Supported | ||||
Auto chroma key | Support |
You can select an SDK or a combination of SDKs based on your actual requirements. For more information about the combination methods for each scenario and SDK selection, see SDK Selection and Download. We recommend that you use the standard integrated SDK of MediaBox Audio and Video SDK to effectively resolve dependency library conflicts caused by integrating multiple SDKs separately.
License management
Player Mobile SDK, Retouching SDK, and other SDKs are now officially charged, while other SDKs are free but require license authorization. You can purchase the official version through the ApsaraVideo MediaBox SDK purchase page. More favorable package deals are also available. For more information, see Player SDK billing. For license authorization instructions after purchase, see Integrate License.