Alibaba Cloud Model Studio: Lip-sync replacement for videos - VideoRetalk

Last Updated: Jun 24, 2026

VideoRetalk synchronizes lip movements with audio, generating a new video from an input video and an audio file.

Important

This document applies only to the China (Beijing) region. To use the models, you must obtain an API key from the China (Beijing) region.

Model overview

Example

Input example

  • Character video: [media not shown]

  • Voice audio: [media not shown]

Output example: [media not shown]

Billing and rate limits

| Model | Unit price | RPS limit for task submission | Number of concurrent tasks |
|---|---|---|---|
| videoretalk | $0.011469/second (pay-as-you-go, by generated video duration) | 1 | 1 (one task runs at a time; others are queued) |
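As a rough illustration of the pricing above, the following sketch estimates the charge for a generated video from its duration. The function name `estimate_cost_usd` is hypothetical, and the six-decimal rounding is an assumption for display only; this page does not state how billed duration or amounts are rounded.

```python
from decimal import Decimal, ROUND_HALF_UP

# Unit price taken from the billing table: USD per second of generated video.
PRICE_PER_SECOND_USD = Decimal("0.011469")


def estimate_cost_usd(duration_seconds):
    """Estimate the pay-as-you-go charge for a generated video.

    Assumption: the charge is simply unit price x duration. Actual
    billing may round the duration differently.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    cost = PRICE_PER_SECOND_USD * Decimal(str(duration_seconds))
    # Round to six decimal places for display (an assumption, not a billing rule).
    return cost.quantize(Decimal("0.000001"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    # A 30-second generated video: 0.011469 * 30
    print(estimate_cost_usd(30))
```

Because only one task runs at a time and submissions are limited to 1 per second, a batch of videos is processed sequentially, so total wall-clock time grows with the number of queued tasks.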

Calling the model

  • VideoRetalk is available only through API calls (pay-as-you-go). It cannot be tested in the Model Studio console.

  • Call VideoRetalk with a clear, front-facing character video and an audio file to generate lip-synced output. For more information, see VideoRetalk video generation.
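The call described above might be prepared along the following lines. This is a minimal sketch, not the documented API. The endpoint URL, the `X-DashScope-Async` header, and the `video_url` / `audio_url` field names are assumptions that should be confirmed against the VideoRetalk video generation reference. The helper name `build_videoretalk_request` is hypothetical. The sketch only builds the request and does not send it.

```python
import json
import urllib.request

# Assumption: asynchronous task-submission endpoint for the China (Beijing)
# region. Verify this URL in the VideoRetalk video generation reference.
ENDPOINT = (
    "https://dashscope.aliyuncs.com/api/v1/services/aigc/"
    "image2video/video-synthesis/"
)


def build_videoretalk_request(api_key, video_url, audio_url):
    """Build (but do not send) a task-submission request for VideoRetalk.

    api_key   -- an API key obtained from the China (Beijing) region
    video_url -- URL of a clear, front-facing character video
    audio_url -- URL of the voice audio to lip-sync to
    """
    body = {
        "model": "videoretalk",
        # Assumption: input field names; check the API reference.
        "input": {"video_url": video_url, "audio_url": audio_url},
    }
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            # Assumption: header that requests asynchronous processing.
            "X-DashScope-Async": "enable",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_videoretalk_request(
        "sk-your-key",
        "https://example.com/character.mp4",
        "https://example.com/voice.wav",
    )
    print(req.get_method(), req.full_url)
```

To submit the task, the request object could be passed to `urllib.request.urlopen`. Given the limit of 1 submission per second, a client sending several tasks should space its submissions at least one second apart.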