All Products
Search
Document Center

Intelligent Media Management:OSS data processing guide

Last Updated:Jun 22, 2026

Alibaba Cloud Object Storage Service (OSS) is integrated with Intelligent Media Management (IMM) to provide data analytics and processing capabilities such as media processing and document processing. You can access these IMM capabilities directly through OSS data processing features.

Prerequisites

  • You can use IMM-related features only in regions that support IMM. For more information, see endpoints.

    Note

    The new data processing APIs and SDKs support regions that are listed in the IMM service endpoints. However, this feature is in grayscale release in the console and is available only to specific users in Germany (Frankfurt) and China (Qingdao).

  • You have activated the IMM service and authorized it.

Billing

Using IMM features incurs charges. For more information, see Billing overview.

Procedure

Step 1: Create an IMM project

  1. Log on to the IMM console.

  2. In the navigation pane on the left, click Try New Version. (If the button at the bottom of the page is labeled Switch to Old Version, you can skip this step.)

  3. In the navigation pane on the left, click Project List.

  4. Click Create Project.

Step 2: Attach a Bucket

Use OpenAPI Explorer to attach a bucket. For more information about the API operation, see Attach an OSS bucket.

  1. Log on to OpenAPI Explorer. In the upper-left corner of the page, select the IMM API version and a region.

Select version 2020-09-30 and a region, such as China (Shanghai).

Important

To reduce cross-region network latency and costs, the OSS bucket that you attach must be in the same region as your project. Ensure that the region you select for the API call matches the region where your OSS data is stored.

  1. In the parameter settings, enter the Project Name and the OSS bucket to attach, and then click Initiate Call.

  2. View the call result on the right side of the page.

If the status code 200 is returned, the bucket is attached.

Step 3: Configure permissions

To configure permissions, see permissions.

Note

Permission configuration is required only for Resource Access Management (RAM) users and RAM roles. Skip this step if you are using your Alibaba Cloud account.

OSS data processing capabilities

Note

This feature does not support anonymous access. To use a signed URL for access, see sign (generate signed URL).

Media processing

For more information about the parameters, see Audio and video processing.

Operation

Parameter

Description

Video transcoding

video/convert

Convert video files in OSS to your desired format.

Converting a video to an animated image

video/animation

Convert video files in OSS to animated formats such as GIF or WebP.

Video sprite generation

video/sprite

Capture frames from video files in OSS and combine them into a sprite image in your desired format.

Multi-frame video capture

video/snapshots

Capture frames from video files in OSS and convert them to your desired image format.

Video concatenation

video/concat

Concatenate multiple video files in OSS into a single video and convert it to your desired format.

Audio transcoding

audio/convert

Convert audio files in OSS to your desired format.

Audio concatenation

audio/concat

Concatenate multiple audio files in OSS into a single audio file and convert it to your desired format.

Audio information extraction

audio/info

Extract media format and stream information from audio files in OSS.

Video information extraction

video/info

Extract media format and stream information from video files in OSS.

Document processing

For more information about the parameters, see Document processing and Intelligent document processing.

Operation

Parameter

Description

WebOffice online preview

doc/preview

Preview documents stored in OSS.

WebOffice online editing

doc/edit

Collaboratively edit documents stored in OSS.

Document snapshot

doc/convert

Back up documents stored in OSS.

Document format conversion

doc/convert

Convert documents in OSS to different formats.

Intelligent document translation

doc/translate

Translate text into multiple languages such as Chinese and English.

Intelligent document polishing

doc/polish

Polish document content.

Intelligent document summarization

doc/summarize

Automatically generate a concise summary of document content.

Intelligent document continuation

doc/continue

Generate coherent and logical follow-up content based on given starting text, topic, style, and other conditions.

Intelligent document enrichment

doc/enrich

Enhance the language expression and style of existing documents.

Intelligent document tone rephrasing

doc/rephrase

Adjust document tone and refine expression.

Image intelligence

For more information about the parameters, see Image intelligence processing parameters.

Operations

Parameter

Description

Face detection

image/faces

Detect face locations in images and analyze facial attributes.

Human body detection

image/bodies

Detect human body locations in images.

Vehicle detection

image/cars

Detect and analyze vehicles and license plates in images.

QR code recognition

image/codes

Recognize QR codes in images.

Image label detection

image/labels

Identify scene, object, and event labels in images.

Image quality scoring

image/score

Score image aesthetics based on factors such as color and saturation.

Blind watermarking

image/blindwatermark

Add a text-based blind watermark to images.

Blind watermarking

image/deblindwatermark

Extract text-based blind watermarks from images.