Alibaba Cloud Model Studio: Lip-sync replacement for videos - VideoRetalk

Last Updated: Mar 15, 2026

VideoRetalk synchronizes lip movements with audio, generating a new video from an input video and an audio file.

Important

This document applies to China (Beijing). To use the model, you must use an API key from the China (Beijing) region.

Model overview

Example

Input example: a character video and a voice audio file (media not included here).

Output example: the generated lip-synced video (media not included here).

Billing and rate limits

  • Model: videoretalk

  • Unit price: $0.011469/second (pay-as-you-go, billed by generated video duration)

  • RPS limit for task submission: 1

  • Number of concurrent tasks: 1 (one task runs at a time; others are queued)
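
Because billing is per second of generated video, a rough cost estimate is the unit price listed above multiplied by the output duration. The sketch below shows that arithmetic. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes no rounding of partial seconds, which this page does not specify.

```python
from decimal import Decimal

# Unit price for the videoretalk model, taken from the billing information above.
UNIT_PRICE_USD_PER_SECOND = Decimal("0.011469")


def estimate_cost_usd(generated_seconds):
    """Estimate the charge for a generated video of the given duration.

    Assumption: the charge is exactly price x duration, with no rounding
    of partial seconds (the billing page does not state a rounding rule).
    """
    seconds = Decimal(str(generated_seconds))
    if seconds < 0:
        raise ValueError("duration must be non-negative")
    return UNIT_PRICE_USD_PER_SECOND * seconds


if __name__ == "__main__":
    # A 60-second generated video costs roughly 0.69 USD.
    print(estimate_cost_usd(60))
```

Using `Decimal` rather than `float` keeps the six-decimal unit price exact when totals are summed across many tasks.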

To increase the RPS limit, email modelstudio@service.aliyun.com with your Alibaba Cloud account ID, the model name, and the required RPS.
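
While the default limit of 1 RPS applies, a client can avoid rejected submissions by spacing its task-submission calls at least one second apart. The following is a minimal client-side sketch of that idea. The class name `SubmissionThrottle` is hypothetical and is not part of any Alibaba Cloud SDK.

```python
import time


class SubmissionThrottle:
    """Spaces calls so that at most `rps` submissions start per second."""

    def __init__(self, rps=1.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = 1.0 / rps  # minimum gap between submissions, in seconds
        self.clock = clock             # injectable for testing
        self.sleep = sleep             # injectable for testing
        self.last_start = None         # start time of the previous submission

    def wait(self):
        """Block until the next submission is allowed, then record its start time."""
        now = self.clock()
        if self.last_start is not None:
            remaining = self.min_interval - (now - self.last_start)
            if remaining > 0:
                self.sleep(remaining)
                now = self.clock()
        self.last_start = now
```

Call `throttle.wait()` immediately before each task submission. This only limits the submission rate; tasks beyond the concurrency limit are still queued on the service side.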

Calling the model

  • VideoRetalk is available only through API calls (pay-as-you-go); it cannot be tested in the Model Studio console.

  • Call VideoRetalk with a clear, front-facing character video and an audio file to generate lip-synced output. For more information, see VideoRetalk video generation.
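
As a rough illustration of the two points above, the sketch below builds (but does not send) an HTTP request that submits a character video and an audio file to the `videoretalk` model. The endpoint URL, the `X-DashScope-Async` header, and the `video_url` / `audio_url` field names are assumptions based on the usual Model Studio asynchronous-task pattern and are not stated on this page; confirm them against VideoRetalk video generation before use. The function name `build_submit_request` is hypothetical.

```python
import json
import urllib.request

# ASSUMPTION: endpoint for asynchronous task submission in the China (Beijing) region.
# Verify against the VideoRetalk video generation API reference.
SUBMIT_URL = (
    "https://dashscope.aliyuncs.com/api/v1/services/aigc/"
    "image2video/video-synthesis/"
)


def build_submit_request(api_key, video_url, audio_url):
    """Build a POST request that submits one lip-sync task.

    ASSUMPTION: the payload field names and the async header below follow
    the common Model Studio task format; they are not confirmed by this page.
    """
    payload = {
        "model": "videoretalk",
        "input": {
            "video_url": video_url,  # clear, front-facing character video
            "audio_url": audio_url,  # voice audio to lip-sync to
        },
    }
    headers = {
        "Authorization": f"Bearer {api_key}",  # API key from the China (Beijing) region
        "Content-Type": "application/json",
        "X-DashScope-Async": "enable",  # assumed: request asynchronous processing
    }
    return urllib.request.Request(
        SUBMIT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )
```

To send the request, pass the returned object to `urllib.request.urlopen` and parse the JSON response. Keeping request construction separate from sending makes the payload easy to inspect before any billable call is made.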