Kling v2 Avatar Pro
◈375 / clip
Kling v2 Avatar Standard
◈175 / clip
WAN 2.2 Speech-to-Video
◈100 / clip
Kling v2 Avatar Pro — Takes a reference image and audio dialogue clip, then generates a realistic talking-avatar video. Preserves identity, lip syncs accurately, adds natural head movement, eye motion, expressions, and cinematic lighting.

Inputs

Upload
URL

Choose image

Uploading…
image.jpg✕ Remove
Preview
Upload
URL

Choose audio

Uploading…
audio.mp3✕ Remove
💡 Tips: Use a front-facing portrait. Keep audio under 60s. Avoid heavy background music in the dialogue clip.
KAMOD estimate ◈375 · Kling v2 Avatar Pro
Results
Generate a talking avatar to see results here.
Kling v3.0 Pro
◈56 / sec
Kling v3.0 Standard
◈35 / sec
Kling v3.0 Motion Control — Precisely define camera movement (pan, tilt, orbit, dolly, zoom) and subject/object dynamics. Provide a reference image + a reference motion video, and the model generates a new video following those motion patterns.

Inputs

Upload
URL

Choose image

Uploading…
image.jpg✕ Remove
Preview
Upload
URL

Choose video

Uploading…
video.mp4✕ Remove
💡 Describe camera movement: pan, tilt, orbit, dolly, zoom. Or describe subject behaviour.
Results
Submit a motion control job to see results.
HeyGen Photo Avatar — Upload a clear front-facing portrait and HeyGen builds a custom photo avatar you can re-use in talking-head videos. Powered by HeyGen v3 Avatars API.

Inputs

Upload
URL

Choose portrait

Uploading…
portrait.jpg✕ Remove
Preview
💡 Use a clean, well-lit portrait with the face centered. Square or vertical works best. Creation may take 1–3 minutes.
Results
Create a HeyGen photo avatar to see results here.

Render Talking Video

Turn a HeyGen avatar into a talking-head video. Avatar id auto-fills after creating one above, or paste an existing one.

HeyGen Voice
Upload Audio
My Voices
Filter: · or enter ID manually
≈ 0 sec · ◈0 tokens