ApsaraVideo Media Processing (MPS) allows you to convert an audio or video file to one or more files to adapt to different network bandwidths, terminal processing capabilities, and user needs. MPS performs multimodal analysis on the content, text, speeches, and scenes of media files and offers various features, including automated review, content recognition, and smart editing.
- Adapt to different terminal devices: You can convert media formats to support the playback on multiple types of terminal devices, such as PCs, TVs, and mobile devices.
- Adapt to different network conditions: You can produce video files of varied definitions to adapt the bitrates to different network bandwidths. Transcoded video files can be in the standard definition, high definition, or ultra high definition.
- Add watermarks: You can add enterprise logos, TV station logos, or user nicknames as watermarks to a video to highlight the brand and copyright information and increase product recognition.
- Take snapshots: You can take snapshots of a video at specific time points. You can use a snapshot as the video thumbnail or multiple snapshots to generate a sprite.
- Edit videos: You can edit, crop, and merge original videos to generate new ones.
- Perform video enhancement: You can remove the blurs or mosaics from poor-quality videos to generate restored versions in higher definitions.
- Reduce storage and traffic costs: You can adjust the video bitrate, increase the compression rate, and reduce the file size without compromising the video quality. This reduces video stuttering during playback and saves the storage and traffic costs.
- Generate media fingerprints: You can extract fingerprint features such as images and audio from a video to generate a media fingerprint. The media fingerprint can be used to find and remove duplicate videos, trace the source of video clips, filter videos that infringe copyright, and identify user-generated content (UGC).
- Perform automated reviews: MPS intelligently detects pornographic content, terrorism content, ads, and undesirable content in the speeches, text, and scenes of a video. This feature can help you reduce manpower costs for manual review and lower the risks that may be brought by non-compliant content.
- Improve the conversion rate (CVR) of videos: MPS learns from image aesthetics and a large amount of user behavior data, selects the optimal keyframe from a video, and then generates images, animated stickers, or short videos as the video thumbnail.
|Audio extraction||Extract only audio from a video by disabling the video image part.|
|Video extraction||Extract only video from a video file by disabling the audio part.|
|Container format conversion||Convert the container format of a video to another without changing the encoding format. You can convert the container format of an audio or video file to MP4, M3U8, or FLV.|
|Video to animated sticker||Capture a highlight clip from a video and covert the clip to an animated sticker in GIF or WEBP format.|
|Codec||The video codec format.
|Bitrate||The bitrate of the transcoded video.
|Fps||The frame rate of the transcoded video.
|Width* Height||The resolution of the transcoded video.
|Scale||Auto scaling. You can enable proportional scaling by height or width.|
|Gop||The group of pictures (GOP) size. The GOP size indicates the maximum interval of keyframes
or the maximum number of frames in a frame group.
|Profile||The codec profile. This parameter is only valid when the codec format is H.264. You can set this parameter to Baseline, Main, or High.|
|PixFmt||The pixel format for video color encoding.
|Rotate||The rotation angle of the video, in the clockwise direction.
|ScanMode||The scan mode. Valid values: interlaced and progressive.|
|Rate Control Modes||The bitrate control method. The following bitrate control methods are supported: VBR, CBR, and CRF.|
|Crop||Specifies whether to crop the video image. MPS can automatically detect and remove the black borders of a video image. You can also set cropping-related parameters as needed.|
|Pad||Specifies whether to add black borders to the video image. You can add black borders to a video image.|
|Codec||The audio codec format.
|Samplerate||The sampling rate.
|Bitrate||The audio bitrate.
|Channels||The number of sound channels.
|HLS MasterPlayList||This feature allows you to package one or more video streams at different bitrates, subtitles in different languages, and audio tracks into a Master Playlist file.|
|Conditional transcoding||If the bitrate or resolution specified in the transcoding template is higher than
that of the input video, you can select one of the following options:
|Workflow||MPS automatically executes the workflow in the cloud after an audio or video file is uploaded.|
MPS provides a series of preset templates so that transcoded videos can adapt to a range of network bandwidths.
- Intelligent preset templates
Intelligent preset templates automatically adjust transcoding parameters based on the input video file so that the output video files can meet requirements. Whether an intelligent preset template is applicable to an input video file depends on the resolution, bitrate, and other properties of the input video file. Therefore, you must submit an analysis task to obtain a list of intelligent preset templates that are available to the input video file. MPS tries to balance the need to reduce the bitrate of the media file and the need to reduce quality loss in the transcoding process. If you use an intelligent preset template, quality is prioritized.
- Static preset templates
Analysis tasks are not required before you use this type of template. You can use static preset templates to transcode a video file, transcode an audio file to the MP3 format, or convert the container format of a media file. Media files generated by using this type of template can adapt to common playback devices and network bandwidths. Static preset templates control the output bitrate first.
- Preset Narrowband HDTM templates
Analysis tasks are not required before you use this type of template. You can use this type of template to generate videos in the FLV, MP4, or M3U8 format. Preset Narrowband HDTM templates, including preset Narrowband HDTM 1.0 templates, are exclusively provided by MPS. Compared with other transcoding templates, a preset Narrowband HDTM template can generate an output video at a lower bitrate without compromising the video quality. This way, MPS helps you save more costs.
A custom template contains a set of transcoding parameters, such as the audio, video, and container parameters. You can set the transcoding parameters based on your needs to create a regular template. Alternatively, you can submit a ticket to configure the created template to a Narrowband HDTM 1.0 .
|Video editing||This feature allows you to crop a video of the specified duration from the specified time point to generate a video clip.|
|Video merging||This feature allows you to merge up to 20 videos into one.|
|Blurring||This feature allows you to blur the specified area of a video.|
|Video head and tail||
|Dynamic watermarks||You can specify the time period during which watermarks are displayed.|
|Sprite and WebVTT-based thumbnail||A sprite is generated by taking a series of snapshots. This feature allows you to obtain information about multiple snapshots in one request. This way, the number of requests is reduced, and the client performance is improved.|
|Smart thumbnail||MPS learns from image aesthetics, recognizes the content of a video, selects the optimal keyframe from the video, and then generates an image as the video thumbnail.|
|Narrowband HDTM 1.0||Narrowband HDTM 1.0 is a media processing feature that is developed based on the transcoding technologies of Alibaba Cloud. Narrowband HDTM 1.0 intelligently analyzes the scenes, actions, content, and textures in a video. This helps reduce the bitrate of output videos and bandwidth costs without compromising the video quality.|
For long videos of more than 30 minutes, MPS can speed up the transcoding process by concurrently transcoding video clips. The transcoding speed can be increased by 5 times.
|Media fingerprint||A media fingerprint can be used to uniquely identify a video. The media fingerprint is usually a binary string. A media fingerprint is unique. Different videos have different media fingerprints. A media fingerprint is also stable. The media fingerprint of an audio or video file remains the same if the file is converted to another format, cropped, merged, compressed, or rotated.|
|Automated review||This feature is implemented based on a large amount of labeled data and deep learning algorithms. MPS analyzes the content, thumbnail, title, and comments of videos and accurately recognizes prohibited content in terms of speech, text, and visual elements. Prohibited content includes pornographic, terrorism, and politically sensitive content, ads, and content in video blacklists. This feature can be used in multiple scenarios, such as short video platforms, live streaming platforms, and media auditing.|
|Media information||You can obtain the encoding and content information of audio and video files that are stored in OSS.|
|Custom duration of M3U8 output media segments||
|External subtitle||You can import an external subtitle file and specify the encoding format of the subtitle file for a transcoding job.|