Intelligent Media Services: Script-Based Automated Video Production Guide

Last Updated: Jun 17, 2026

Script-based automated video production lets you bulk-generate high-quality videos from predefined script nodes and associated media assets. It is ideal for e-commerce, local lifestyle marketing, and other scenarios where you have a clear video structure and ready-to-use assets.

Billing Information

For detailed pricing information about script-based automated video production, see Script-Based Video Production Billing.

Notes

Script-based automated video production assembles media assets according to a predefined script but does not intelligently match voiceovers with visuals. To match voiceover or script copy with video footage, use Intelligent Text-to-Visual Matching Video Production.

Feature Overview

Note
  • Script-based automated video production and intelligent text-to-visual matching video production share the same OpenAPI for submitting tasks. For details on how to distinguish between them using parameters, see Parameter Differences.

  • Script-based automated video production offers two generation modes: Global Voiceover and Grouped Voiceover, each tailored to different video production needs.

Production Mode: Global Voiceover Mode

Description: Randomly pairs complete voiceover scripts with video assets to quickly generate stylistically consistent videos. Emphasizes the overall feel of the video.

Scenarios:

  • Videos that emphasize coherence and consistency and tell a complete story.

  • Batches in which all generated videos must maintain a unified style.

Matching Method: Overall matching ensures visual and auditory harmony from start to finish.

Production Mode: Grouped Voiceover Mode

Description: Splits a voiceover script into segments, each aligned to a specific script node for precise control. Ideal for videos that require fine-grained content matching.

Scenarios:

  • Videos in which each part has specific requirements.

  • Videos that must accurately convey the content of each segment.

Matching Method: Segment-by-segment matching ensures each clip aligns precisely with its corresponding voiceover.
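The difference between the two modes can be illustrated with a small simulation. This is a minimal sketch of the selection behavior described above, not the service's actual matching algorithm. The function names (plan_global, plan_grouped), the sample file names, and the dictionary layout of a script node are all hypothetical and chosen only for illustration.

```python
import random

# Hypothetical script nodes: each has candidate clips and, for grouped mode,
# candidate voiceover segments. None of these names come from the service API.
NODES = [
    {"name": "intro", "media": ["intro_a.mp4", "intro_b.mp4"],
     "voiceovers": ["Welcome to our store.", "Hello and welcome."]},
    {"name": "product", "media": ["product_a.mp4"],
     "voiceovers": ["Here is today's featured item."]},
]
GLOBAL_VOICEOVERS = ["Full script one.", "Full script two."]


def plan_global(nodes, voiceovers, rng):
    """Global mode: one complete voiceover is paired with the whole video."""
    return {
        "voiceover": rng.choice(voiceovers),
        "clips": [rng.choice(node["media"]) for node in nodes],
    }


def plan_grouped(nodes, rng):
    """Grouped mode: every node contributes its own clip and voiceover segment."""
    return [
        {"node": node["name"],
         "clip": rng.choice(node["media"]),
         "voiceover": rng.choice(node["voiceovers"])}
        for node in nodes
    ]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs produce the same plan
    print(plan_global(NODES, GLOBAL_VOICEOVERS, rng))
    print(plan_grouped(NODES, rng))
```

In the global plan the voiceover is chosen once per video, independently of the clips. In the grouped plan each voiceover segment stays attached to the node whose clip it accompanies.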

Create a Script-Based Automated Video Production Task

Create a Task Using the Console

  1. Log on to the Intelligent Media Services console.

  2. In the upper-left corner, select a region as needed.

  3. Navigate to Intelligent Production > Intelligent Batch One-Click Video Production.

  4. On the Script-Based Automated Video Production tab, click Create Script-Based Automated Video Production to start.

  5. Configure script nodes, background music, stickers, titles, voiceovers, and synthesis settings. See the table below for details.

  6. Click Start Intelligent Task to submit the task.

Production Mode: Global Voiceover Mode

Instructions:

  • Script nodes (required): In the script node configuration section, add script nodes, set node descriptions, and associate media assets.

  • Background music (optional): In the background music section, add background music. If none is provided, official music is used by default.

  • Stickers (optional): In the sticker section, add image assets to use as stickers or watermarks for the entire video. If you add multiple assets, one is randomly selected per video.

  • Titles (optional): In the title section, add title text. You can generate text using AIGC based on keywords. If you add multiple texts, one is randomly selected per video.

  • Voiceover (optional): In the voiceover section, add voiceover text. You can generate text using AIGC based on keywords. If you add multiple texts, one is randomly selected per video.

  • Synthesis configuration (required): In the synthesis configuration section, specify the expected number of videos, file naming convention, storage path, and other settings needed to start the script-based automated video production task.

Production Mode: Grouped Voiceover Mode

Instructions:

  • Script nodes (required): In the script node configuration section, add script nodes, set node descriptions, and associate media assets. For each media asset group, you can set multiple voiceover scripts; one is randomly selected per video.

  • Background music (optional): In the background music section, add background music. If none is provided, official music is used by default.

  • Stickers (optional): In the sticker section, add image assets to use as stickers or watermarks for the entire video. If you add multiple assets, one is randomly selected per video.

  • Titles (optional): In the title section, add title text. You can generate text using AIGC based on keywords. If you add multiple texts, one is randomly selected per video.

  • Synthesis configuration (required): In the synthesis configuration section, specify the expected number of videos, file naming convention, storage path, and other settings needed to start the script-based automated video production task.
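The required and optional sections listed above can be captured as a simple checklist before a task is submitted. The following is a minimal sketch under stated assumptions: the section keys (script_nodes, synthesis, and so on), the node layout, and the check_task helper are hypothetical labels for the console sections, not API field names.

```python
# Hypothetical labels for the console sections; these are not API field names.
REQUIRED_SECTIONS = ("script_nodes", "synthesis")
OPTIONAL_SECTIONS = {
    "global": {"background_music", "stickers", "titles", "voiceovers"},
    # Grouped mode has no separate voiceover section: voiceovers are set per node.
    "grouped": {"background_music", "stickers", "titles"},
}


def check_task(mode, task):
    """Return a list of problems in a draft task; an empty list means none found."""
    if mode not in OPTIONAL_SECTIONS:
        raise ValueError(f"unknown mode: {mode!r}")
    problems = []
    # Required sections must be present and non-empty.
    for section in REQUIRED_SECTIONS:
        if not task.get(section):
            problems.append(f"missing required section: {section}")
    # Flag sections that the chosen mode does not use.
    allowed = set(REQUIRED_SECTIONS) | OPTIONAL_SECTIONS[mode]
    for section in sorted(set(task) - allowed):
        problems.append(f"section not used in {mode} mode: {section}")
    # Every script node needs at least one associated media asset.
    for index, node in enumerate(task.get("script_nodes") or []):
        if not node.get("media"):
            problems.append(f"script node {index} has no media assets")
    return problems


if __name__ == "__main__":
    draft = {
        "script_nodes": [
            {"description": "intro", "media": ["a.mp4"]},
            {"description": "outro", "media": []},
        ],
        "voiceovers": ["Sample voiceover text."],
    }
    for problem in check_task("grouped", draft):
        print(problem)
```

The check treats a top-level voiceover section as out of place in grouped mode because, per the instructions above, grouped mode attaches voiceover scripts to each media asset group instead.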

Create a Task Using the API

Advanced Configuration Options

Note

You can further customize synthesized videos by adjusting caption styles, entrance and exit animations, transitions, effects, voiceover effects, and matching strategies.

Parameter Settings via API

If you create tasks using the API, see Batch One-Click Video Production Editing Logic and Advanced Configuration for parameter details.

Parameter Settings via Console

If you create tasks using the console, configure parameters on the Advanced Editing Strategy Settings tab on the right side of the page during task creation. On this tab, set the following parameters:

  • Media Volume: ranges from 0 to 10, where 1 represents the original volume.

  • Title Configuration: uses the TitleConfig JSON structure.

  • Voiceover Volume: adjusts voiceover loudness.

  • Voiceover Captions: configured through SpeechConfig.AsrConfig.

  • Voiceover Tone: accepts one or more tones separated by commas, such as zhimiao_emo,zhimi_emo.
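The parameters above could be assembled into a JSON structure along the following lines. This is a hedged sketch, not the documented request format: only TitleConfig and SpeechConfig.AsrConfig are named in this guide, while the MediaConfig, Volume, and Voice keys and the build_editing_config helper are assumptions made for illustration. Check the API reference for the actual field names before use.

```python
import json


def build_editing_config(media_volume=1.0, speech_volume=1.0, voices=(),
                         title_config=None, asr_config=None):
    """Assemble an editing configuration dict from the console-style parameters."""
    # Media volume is documented as 0 to 10, where 1 is the original volume.
    if not 0 <= media_volume <= 10:
        raise ValueError("media volume must be between 0 and 10")

    # "Volume" and "Voice" are assumed key names; this guide does not name them.
    speech = {"Volume": speech_volume}
    if voices:
        # Multiple tones are passed as one comma-separated string.
        speech["Voice"] = ",".join(voices)
    if asr_config is not None:
        speech["AsrConfig"] = asr_config  # caption settings

    # "MediaConfig" is likewise an assumed key name.
    config = {"MediaConfig": {"Volume": media_volume}, "SpeechConfig": speech}
    if title_config is not None:
        config["TitleConfig"] = title_config
    return config


if __name__ == "__main__":
    config = build_editing_config(
        media_volume=0.5,
        voices=["zhimiao_emo", "zhimi_emo"],
    )
    print(json.dumps(config, indent=2))
```

Validating the media volume range locally surfaces an out-of-range value before a task is submitted. The voiceover volume is passed through unchecked because this guide does not state its range.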