All Products
Search
Document Center

Intelligent Media Services:Script-based automated video production user guide

Last Updated:Feb 27, 2026

Script-to-Video assembles bulk videos from your media assets based on a predefined script. It suits e-commerce and marketing workflows that require consistent, high-volume video output from existing images, clips, and voiceover scripts.

Note

Script-to-Video assembles videos from a predefined script. It does not auto-match voiceover and visuals. To automatically match voiceover or text scripts to video scenes, use Image-Text Matching.

Billing

For pricing details, see Billing of Script-to-Video.

Before you begin

Before creating a Script-to-Video task, confirm the following:

  • Activate Intelligent Media Services and access the Intelligent Media Services console.

  • Prepare your media assets (video clips and images). You can upload them to your asset library in advance, or upload them directly when adding materials during task creation.

  • Obtain valid API credentials (AccessKey ID and AccessKey Secret) if using the API.

How it works

Script-to-Video provides two generation modes: Global Scripts and Segmented Scripts.

AspectGlobal ScriptsSegmented Scripts
How it worksRandomly combines multiple complete voiceover scripts with video assets to generate many similarly styled videos.Splits a voiceover script into segments and matches each segment to a specific group of assets.
Best forVideos with a coherent story and consistent tone across all outputs.Videos where each segment has specific content requirements and needs precise timing and synchronization.
Matching methodHolistic matching ensures a harmonious feel from start to finish.Segment-based matching aligns each part of the voiceover with its corresponding visual content.
Voiceover inputConfigured on the Broadcast tab. Multiple scripts allowed; the system randomly selects one per output video.Configured per media group on the Script Node Configurations tab. Multiple voiceover scripts per group allowed; the system randomly selects one during generation.

Choose Global Scripts to produce many stylistically uniform videos from a set of scripts and assets. Choose Segmented Scripts when each section of your video needs to pair with specific visuals.

Create a task in the console

  1. Log on to the Intelligent Media Services console.

  2. In the upper-left corner, select a region.

  3. Navigate to Intelligent Production > Intelligent Batch Video Production.

  4. On the Script-to-Video tab, click Create Task.

  5. Select a generation mode (Global Scripts or Segmented Scripts) and click OK.

  6. Configure the following tabs and sections.

  7. Click Initiate Intelligent Task to submit the task.

Script node configurations (required)

On the Script Node Configurations tab, add script nodes, set descriptions, and associate media assets for each node.

  • A script node defines one segment of your video structure: its description and the media assets assigned to it.

  • In Segmented Scripts mode, each media group includes its own voiceover settings: Duration Mode, Broadcast Text (multiple entries allowed; the system randomly selects one during generation), Volume, and Split Mode.

Background music (optional)

On the Background Music tab, add background music. If you leave this empty, the system uses its default music.

Sticker (optional)

On the Sticker (Logo, Watermark) tab, add image assets as stickers or watermarks. Add multiple assets, and the system randomly selects one for each output video.

Title (optional)

On the Title tab, add title text. AI text generation from keywords is supported. Add multiple titles, and the system randomly selects one for each output video.

Broadcast (optional, Global Scripts mode only)

On the Broadcast tab, add the voiceover script for the video narration. AI text generation from keywords is supported. Add multiple scripts, and the system randomly selects one for each output video.

Note

The Broadcast tab is available only in Global Scripts mode. In Segmented Scripts mode, configure voiceover scripts per media group on the Script Node Configurations tab.

Synthesis configurations (required)

In the Synthesis Configurations section, configure the following output settings:

  • Output Videos: Number of output videos (1 to 100).

  • Generate Timeline Only: When enabled, generates a timeline for preview without synthesizing the video.

  • Output Size (Width x Height): Resolution of the output video (128 to 4,096 pixels per dimension; default: 1080 x 1920).

  • Video Naming Conventions: File naming pattern for output videos (example: xxxx_{index}.mp4).

  • Maximum Duration of Single Segment (Global Scripts only): Maximum duration in seconds for each segment.

  • Fixed Duration of Single Video (Global Scripts only): Fixed total duration in seconds. If specified, do not set broadcast text.

  • Storage Address: Storage location for output videos. To add a storage address, go to Storage Management.

Create a task using the API

Script-to-Video and Image-Text Matching both use the SubmitBatchMediaProducingJob operation to submit tasks. For parameter differences between the two, see Parameter differences.

API operationDescription
SubmitBatchMediaProducingJobSubmits a Script-to-Video task. For parameter details, see Script-to-Video parameters.
ListBatchMediaProducingJobsQueries batch video production tasks that meet specified conditions.
GetBatchMediaProducingJobRetrieves details for a batch video production task.
SubmitTextGenerateJobSubmits a task to generate voiceover scripts and titles using AI.

Advanced configuration

Fine-tune the visual quality of output videos with these settings:

  • Subtitle styles

  • Entrance and exit animations

  • Transitions

  • Special effects

  • Voiceover effects

  • Matching policies

API: See Editing logic and advanced configurations for parameter details.

Console: Configure these settings on the Advanced Settings of Editing Policy tab on the right side of the page.

Advanced Settings of Editing Policy

What's next