The most scalable image-to-video API: VEED Fabric 1.0

Create lifelike video from any image and audio with the world’s most powerful image-to-video API.

What’s unique about VEED Fabric 1.0?

Make anything talk
Fabric 1.0 animates any photo, illustration, or render, replacing limiting preset avatars.
Smooth audio-driven motion
Speech drives precise lip sync along with expressive head, hand, and body movement, producing realistic animation for engaging storytelling.
Longer, more useful video outputs
Supports video outputs up to 7× longer than typical I2V tools (30 seconds at launch).
Optimised pipeline
Diffusion Transformer core with efficient inference for lower compute cost and scalable generation.

How VEED Fabric 1.0 works

It’s a developer-first, unbundled API that drops straight into your stack, so you can focus on great UX.

Choose an image

Upload any character or product shot. Fabric accepts photos, illustrations, 3D renders and more.

Provide audio or text

Drop in a voice recording or type a script. If you choose text, VEED’s AI voice generator can synthesize natural narration.

Generate your talking video

Fabric matches the voice to your character and animates the mouth and face. Within seconds, you have a video file.

Publish anywhere

Export in square, vertical or landscape formats and share to TikTok, Instagram, YouTube, ads or presentations.
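
The steps above can be sketched as a single request payload. This is a minimal illustration only: the endpoint URL, the field names (image_url, audio_url, text, resolution) and the build_request helper are all hypothetical placeholders, not the documented Fabric 1.0 API. The sketch only builds and validates the payload; it does not send anything.

```python
import json
from typing import Optional

# Hypothetical endpoint: a placeholder, not a documented VEED URL.
API_URL = "https://api.example.com/fabric-1.0/generate"

# Resolutions listed on this page.
RESOLUTIONS = ("480p", "720p")


def build_request(
    image_url: str,
    audio_url: Optional[str] = None,
    text: Optional[str] = None,
    resolution: str = "480p",
) -> dict:
    """Build a request body for one talking video.

    All field names are assumptions made for illustration.
    """
    # Step 2 on the page: provide audio OR text, so require exactly one.
    if (audio_url is None) == (text is None):
        raise ValueError("provide exactly one of audio_url or text")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {RESOLUTIONS}")

    payload = {"image_url": image_url, "resolution": resolution}
    if audio_url is not None:
        payload["audio_url"] = audio_url
    else:
        payload["text"] = text
    return payload


if __name__ == "__main__":
    body = build_request(
        "https://example.com/character.png",
        text="Welcome to our product tour.",
        resolution="720p",
    )
    # A real integration would POST this body to the actual endpoint.
    print(json.dumps(body, indent=2))
```

Keeping payload construction separate from the network call makes the audio-or-text rule easy to check before any credits are spent.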

Why developers choose VEED Fabric 1.0

Plug in the VEED Fabric 1.0 API and turn any image into high-quality, expressive video with smooth audio-driven motion.

Pricing that’s made for growth

Fabric uses VEED’s existing credit system. Credits are deducted based on the resolution and duration of your video.
480p: $0.08/second. Suitable for social clips and quick demos.
Fast 480p: $0.10/second. Same model, 2.5x faster.
720p: $0.15/second. Higher resolution for ads and tutorials.
Fast 720p: $0.20/second. Same model, 2.5x faster.
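
To budget a batch of videos, the per-second prices above can be turned into a small estimator. This is a rough sketch, not VEED's billing logic: the estimate_cost helper is hypothetical, and it assumes cost is simply price times duration, rounded to the cent, with the 30-second launch limit applied. How credits map to dollars and how partial seconds are billed are not stated on this page.

```python
from decimal import Decimal

# Per-second prices in USD, copied from the pricing list above.
PRICE_PER_SECOND = {
    "480p": Decimal("0.08"),
    "Fast 480p": Decimal("0.10"),
    "720p": Decimal("0.15"),
    "Fast 720p": Decimal("0.20"),
}

# Maximum output length at launch, as stated on this page.
MAX_DURATION_SECONDS = 30


def estimate_cost(tier: str, duration_seconds: float) -> Decimal:
    """Estimate the USD cost of one video (assumption: price x duration)."""
    if tier not in PRICE_PER_SECOND:
        raise ValueError(f"unknown tier: {tier!r}")
    if not 0 < duration_seconds <= MAX_DURATION_SECONDS:
        raise ValueError("duration must be between 0 and 30 seconds")
    # Decimal avoids binary floating-point drift in money arithmetic.
    cost = PRICE_PER_SECOND[tier] * Decimal(str(duration_seconds))
    return cost.quantize(Decimal("0.01"))


if __name__ == "__main__":
    for tier in PRICE_PER_SECOND:
        print(f"{tier}: ${estimate_cost(tier, 30)} for a 30-second video")
```

Under these assumptions, a full 30-second clip costs $2.40 at 480p and $6.00 at Fast 720p, so the fast tiers add 25% to 33% for the 2.5x speed-up.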