All Products
Search
Document Center

Intelligent Media Management:User guide for using x-oss-process

Last Updated:Apr 10, 2025

Intelligent Media Management (IMM) capabilities are integrated into Object Storage Service (OSS), allowing you to utilize various data processing and analysis features of IMM within OSS. This topic describes how to use the x-oss-process capability of OSS to access IMM features.

Prerequisites

  • IMM is available in the region where the OSS bucket is located. For more information about regions in which IMM is available, see Endpoints.

    Note

    You can access features of the new version in all supported regions by using the API and SDKs. Access to features of the new version from the console is in phased testing and available only in Germany (Frankfurt) and China (Qingdao).

  • IMM is activated. For more information, see Activate IMM.

Billing

You are charged for using IMM. For more information, see Billing overview.

Procedure

Step 1: Create an IMM project

  1. Log on to the IMM Console.

  2. In the lower part of the left-side navigation pane, click Try New Version. If Switch to Old Version is displayed in the lower part of the navigation pane, skip this step.

  3. In the left-side navigation pane, click Project List.

  4. Click Create Project. For more information, see Create a project.

Step 2: Bind an OSS bucket

You can call the AttachOSSBucket operation to bind the project to an OSS bucket. In this example, the operation is called in OpenAPI Explorer to bind the project to an OSS bucket.

  1. Log on to OpenAPI Explorer. In the upper-left corner of the page, select the IMM API version and region.

image..png

Important

You can bind the project only to an OSS bucket in the same region.

  1. On the Parameters tab, enter the name of the project in the ProjectName field and the name of the bucket in the OSSBucket field, and click Initiate Call.

    image..png

  2. View the response of the call on the Response tab on the right side of the page.

image..png

Step 3: Grant permissions

For more information about permissions, see Permissions.

Note

Permission configurations are required only for a RAM user or RAM role. If you are using an Alibaba Cloud account, skip this step.

Step 4: Apply for a quota

  1. Log on to the OSS console.

  2. In the left-side navigation pane, click Self-service Tools.

  3. Click More Tools.

  4. From the drop-down list, select the OSS bucket.

    image

  5. Click Apply for Quota next to Use GET Method to Use Data Processing Capabilities of New IMM Version and Use POST Method to Use Data Processing Capabilities of New IMM Version.

    image

  6. Wait for approval.

Features supported by the x-oss-process capability

Note

Anonymous access is not supported. For more information about how to generate signed URLs for access, see sign.

Media processing

For more information, see Media processing.

Feature

Parameter

Description

Video transcoding

video/convert

Convert a video object in OSS to the required format.

Video-to-animated-image conversion

video/animation

Convert a video object in OSS to animated image formats, such as GIF and WebP.

Generate CSS sprites from video snapshots

video/sprite

Create a sprite in the specified format from frames of a video object in OSS.

Frame capture

video/snapshots

Capture frames from a video object in OSS and convert them to the specified format.

Video merging

video/concat

Merge multiple video objects in OSS into a single video in the specified format.

Audio transcoding

audio/convert

Convert an audio object in OSS to the specified format.

Audio merging

audio/concat

Merge multiple audio objects in OSS into a single audio in the specified format.

Extract audio information

audio/info

Extract the media format information and media stream information of an audio object in OSS.

Extract video information

video/info

Extract the media format information and media stream information of a video object in OSS.

Document processing

For more information, see Document processing and Intelligent document processing.

Feature

Parameter

Description

WebOffice online preview

doc/preview

Preview a document in OSS online.

WebOffice Online Editing

doc/edit

Collaboratively edit a document in OSS online.

Document snapshot

doc/convert

Capture a snapshot of a document in OSS online.

Document conversion

doc/convert

Convert the format of a document in OSS online.

Intelligent document translation

doc/translate

Translate text into multiple languages, such as Chinese and English.

Intelligent document polishing

doc/polish

Polish a document.

Intelligent document summarization

doc/summarize

Generate a brief summary of a document.

Intelligent document continuation

doc/continue

Generate coherent and logically consistent follow-up content based on conditions such as the existing text, subject, and style.

Intelligent document enrichment

doc/enrich

Optimize the comprehensive expression and style of existing documents.

Intelligent document tone rephrasing

doc/rephrase

Adjust the tone of a document and improve the expressions.

Image intelligence

For more information, see Image intelligence.

Feature

Parameter

Description

Face detection

image/faces

Detect multiple faces and analyze face information in an image.

Human body detection

image/bodies

Detect the positions of human bodies in an image.

Vehicle detection

image/cars

Detect the vehicle information and license plate information in an image.

QR code recognition

image/codes

Detect the QR code information in an image.

Image label detection

image/labels

Detect image content, such as scenes, objects, and events.

Image quality assessment

image/score

Assess the visual quality of an image based on aesthetics, such as color and saturation of the image.

Blind watermarking

image/blindwatermark

Add blind watermarks to an image.

Blind watermarking

image/deblindwatermark

Parse blind watermarks in an image.