Launching VEED’s Lipsync API: World's Most Powerful Lip-Syncing Tech
by
Sabba Keynejad

Launching VEED’s Lipsync API: World's Most Powerful Lip-Syncing Tech

AI
Video Software
Audio Editing

We’re excited to introduce the VEED Lipsync API: the world’s most powerful lip-syncing technology, now available for developers and teams. The API takes any video and remaps the speaker’s lip movements to match a new audio track. Submit a video URL and an audio URL, and it returns a synchronized MP4. You can access it today on fal.ai.

Key takeaways:

  • The VEED lip sync API is a video-to-video API that replaces lip movements to match new audio
  • It accepts MP4, MOV, WebM, M4V, and GIF video files plus MP3, OGG, WAV, M4A, and AAC audio files
  • Two inputs are required: a video URL and an audio URL
  • The API returns an MP4 with synchronized lip movements as a JSON response
  • Pricing is $0.40 per minute of processed video, accessed through fal.ai

What does our Lipsync API do?

The VEED lip sync API analyzes facial movements in a source video and remaps lip positions to match a replacement audio track. It handles natural mouth shapes, speech timing, and facial dynamics automatically. No manual animation or frame-by-frame configuration is required.

We built this for teams that produce video at scale: dubbing pipelines, localization workflows, AI avatar systems, and content rephrasing. The API processes videos at approximately 2 to 2.5 minutes of generation time per minute of video input.

Here's a sample AI video showing our tech in action:

API documentation

The VEED lip sync API is hosted on fal.ai under the veed/lipsync endpoint. Below is everything a developer needs to evaluate and integrate the API.

Authentication

The API uses fal.ai's authentication system. Set your FAL_KEY as an environment variable in your runtime:

export FAL_KEY="your-fal-api-key"

Get your API key by signing up at fal.ai.

Endpoint

fal-ai endpoint: veed/lipsync

Full model path: https://fal.ai/models/veed/lipsync

Input parameters

Two parameters are required:

  • video_url (required): URL of the source video. Accepted formats: MP4, MOV, WebM, M4V, GIF.
  • audio_url (required): URL of the replacement audio. Accepted formats: MP3, OGG, WAV, M4A, AAC.

The API accepts any video aspect ratio. If your audio and video durations are out of sync, fal.ai's lipsync mode defaults to cut_off (trims to the shorter of the two). Other sync modes include loop, bounce, silence, and remap.

Sample request (JavaScript)

Install the fal client:

npm install @fal-ai/client

Submit a request:

import { fal } from "@fal-ai/client";const result = await fal.queue.submit("veed/lipsync", {  input: {    video_url: "https://your-video-url.mp4",    audio_url: "https://your-audio-url.mp3"  }});

Response format

The API returns a JSON object. The output video is accessible at response.video.url:

{  "video": {    "url": "https://v3.fal.media/files/output.mp4",    "file_name": "output.mp4",    "file_size": 1593113,    "content_type": "application/octet-stream"  }}

Pricing

The VEED lip sync API costs $0.40 per minute of processed video. A 5-minute video costs $2.00. Pricing is usage-based with no seat fees or monthly minimums. For enterprise volume discounts, contact VEED's sales team.

Key use cases

Get started with the VEED lip sync API

The lip sync API is available now on fal.ai. Sign up for a fal.ai account, set your FAL_KEY, and you can run your first request in minutes. Additional provider integrations are coming soon.

For enterprise use, contact our sales team to request a demo and discuss volume pricing. You can also explore VEED's full AI video creation platform for browser-based editing tools alongside the API.

Generate realistic lipsync from any audio.

Faq

What is a lip sync API?

A lip sync API is a programmatic tool that analyzes a speaker's facial movements in a video and remaps the lip positions to match a new audio track. You submit a video URL and an audio URL, and the API returns an output video with synchronized mouth movements. The VEED lip sync API handles this automatically, with no manual animation required.

What is the VEED lip sync API pricing per minute?

The VEED lip sync API costs $0.40 per minute of processed video. This is a usage-based rate with no monthly minimums. A 10-minute video costs $4.00 to process. Enterprise pricing is available for high-volume use.

Where can I find the VEED lip sync API documentation?

Full API documentation, including the input schema, output format, and code examples, is available on fal.ai/models/veed/lipsync. This page also includes a live playground where you can test the API with your own files before writing any code.

Does the lip sync API work in real time?

The VEED lip sync API is an asynchronous processing API. It does not return results in real time. Current processing speed is approximately 2 to 2.5 minutes of generation time per 1 minute of video. Real-time lip sync for live video is a different capability not covered by this API.

What video formats does the API accept?

The VEED lip sync API accepts MP4, MOV, WebM, M4V, and GIF video files. For audio, it accepts MP3, OGG, WAV, M4A, and AAC. The API works with any video aspect ratio.

Can I use the lip sync API with AI-generated audio?

Yes. The API accepts any audio input, which makes it compatible with text-to-speech workflows. Generate speech with a TTS model, then pass the audio URL directly into the lip sync API to produce a video where the speaker's lips match the generated voice.

When it comes to  amazing videos, all you need is VEED

Create your first video
No credit card required