Method 1: Image + Text
Carat AI automatically converts text to speech and completes the lip sync in one step.
Carat AI automatically generates the voice (TTS) and creates a video with lip sync applied to the image.
Method 2: Image + Audio file
Use this when you already have a recorded audio file (MP3, WAV, etc.).Cost-saving tipLip sync is a feature that uses a lot of Usage (credits). To save costs, you can first generate a “video of talking lip movements (without sound)”, then use the Add audio to video feature to add narration audio separately.