Question 1

Will the lip sync be perfect?

Accepted Answer

Not always. Audio-driven mode (when you upload an MP3 or WAV file) gives phoneme-accurate sync and is the most reliable path. TTS mode is still real lip-sync (script → speech → mouth motion), but its quality varies more with language and script length. Short English lines fare best in either mode.

Question 2

Audio upload vs. TTS — which should I use?

Accepted Answer

Upload audio when you have a real recording (talent VO, a podcast clip, your own voice memo). That path uses dedicated audio-driven avatar models for the tightest sync. Use TTS when you only have a script: the chosen voice is synthesized and lip-synced automatically.

Question 3

Can I upload photos of celebrities or public figures?

Accepted Answer

Not without consent. Do not use the likeness of a celebrity or public figure without their permission; policy and law both apply. The system prompt also refuses requests that follow harassment or impersonation patterns.

Question 4

Does it work in non-English languages?

Accepted Answer

Yes, for major languages, though quality varies by model. English is strongest today; Spanish, French, and Japanese are reasonable; other languages may need short test runs first. For non-English content, uploading your own recording gives the best results.

Question 5

How long can the clips be?

Accepted Answer

Clip length is bounded by the underlying avatar model, typically to a few seconds. Jobs run asynchronously: submit the job, then poll its status until it finishes.
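
The submit-then-poll pattern mentioned above can be sketched roughly as follows. This is a minimal illustration, not the service's actual client: `fetch_status`, the `"status"` field, and the `"succeeded"` / `"failed"` values are all hypothetical placeholders for whatever the real API returns.

```python
import time
from typing import Any, Callable, Dict

# Hypothetical status strings; the real service may use different names.
DONE = "succeeded"
FAILED = "failed"


def wait_for_job(
    fetch_status: Callable[[str], Dict[str, Any]],
    job_id: str,
    interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll fetch_status(job_id) until the job finishes or the timeout passes."""
    deadline = time.monotonic() + timeout_s
    while True:
        job = fetch_status(job_id)  # assumed to return a dict with a "status" key
        status = job.get("status")
        if status == DONE:
            return job
        if status == FAILED:
            raise RuntimeError(
                f"job {job_id} failed: {job.get('error', 'unknown error')}"
            )
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job_id} still {status!r} after {timeout_s}s")
        sleep(interval_s)  # wait before asking again
```

Passing `fetch_status` in as a callable keeps the loop independent of any particular HTTP client, and the `sleep` parameter lets you swap in a no-op when testing.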

Question 6

Which models power it?

Accepted Answer

It runs on dedicated audio-driven and TTS-driven avatar models: `heygen-avatar-4` (default, both modes), `multitalk-avatar-tts` (text + ElevenLabs voice), `kling-avatar-v2-pro` (premium audio-driven), and `hunyuan-avatar` (high-fidelity audio-driven). Choose based on whether you have audio and how much fidelity you need.
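
One way to read that selection rule is as a small lookup. The sketch below uses the model names from the answer, but the function itself, the `fidelity` tiers (`"standard"`, `"premium"`, `"high"`), and the `elevenlabs_voice` flag are assumptions made for illustration, not part of the product's API.

```python
def pick_avatar_model(
    has_audio: bool,
    fidelity: str = "standard",
    elevenlabs_voice: bool = False,
) -> str:
    """Return a model name for a job; the tier names here are hypothetical."""
    if fidelity not in ("standard", "premium", "high"):
        raise ValueError(f"unknown fidelity tier: {fidelity!r}")

    if has_audio:
        # Audio-driven path: the listed premium and high-fidelity models apply here.
        if fidelity == "premium":
            return "kling-avatar-v2-pro"
        if fidelity == "high":
            return "hunyuan-avatar"
        return "heygen-avatar-4"  # default, handles both modes

    # Script-only path: speech is synthesized first, then lip-synced.
    if elevenlabs_voice:
        return "multitalk-avatar-tts"
    return "heygen-avatar-4"
```

For example, `pick_avatar_model(has_audio=True, fidelity="premium")` returns `"kling-avatar-v2-pro"`, while a script-only job with no special voice falls back to the default `"heygen-avatar-4"`.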

Question 7

Is this safe for commercial use?

Accepted Answer

Yes, for content with proper rights and disclosure. Talent agreements should explicitly cover AI-generated lipsync, and ad platforms may require AI labeling.

AI Lipsync Generator

Mouth motion from text beats random wobble

How to brief lipsync that actually syncs

Use cases that benefit from lipsync over generic animation

Short ad hooks

Product launch lines

Multilingual variants

Mascot greetings

Best for

Why "best-effort sync" is the honest framing

Pro tips for cleaner lipsync output

Lipsync FAQ