Veo 3.1 Lite does not include native audio generation. If you need synchronized audio, use Google Veo 3.1 or Google Veo 3.1 Fast instead.
Google’s most cost-efficient video model
Google Veo 3.1 Lite, released March 31, 2026, completes the three-tier Veo 3.1 lineup, sitting below Veo 3.1 Fast in cost while matching it in inference speed. It uses the same Transformer backbone with spatio-temporal patches as the broader Veo 3.1 family, but is optimized for cost-efficient deployment in developer pipelines, high-volume generation, and use cases where output quantity matters alongside quality. Duration is selectable at 4, 6, or 8 seconds per clip, giving fine-grained control over credit consumption.
Capabilities
Cost-efficient generation
Less than 50% of the credit cost of Veo 3.1 Fast — the most affordable way to access the Veo 3.1 architecture on ImagineArt.
Flexible duration control
Select exactly 4, 6, or 8 seconds per clip — granular duration control for precise credit management and pipeline optimization.
1080p output
Generates at 1080p resolution, maintaining visual quality suitable for most digital delivery formats.
Veo 3.1 backbone
Shares the same Transformer backbone with spatio-temporal patches as Veo 3.1 Fast and Veo 3.1 — consistent scene coherence and prompt adherence.
Fast inference
Same generation speed as Veo 3.1 Fast — rapid turnaround without paying for the higher-cost tier.
Text and image input
Supports text-to-video and image-to-video workflows for creative flexibility.
Veo 3.1 family comparison
| Model | Audio | Duration | Resolution | Relative cost | Best for |
|---|---|---|---|---|---|
| Veo 3.1 Lite | No | 4, 6, or 8s | 1080p | Lowest | High-volume, cost-sensitive |
| Veo 3.1 Fast | Yes | 8s | Up to 4K | Medium | Balanced speed + quality |
| Veo 3.1 | Yes | Up to 60s | Up to 4K | Highest | Broadcast quality, long-form |
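The comparison above can be read as a simple decision rule: pick the cheapest tier that satisfies the clip's requirements. The sketch below encodes the table rows in Python. The `Tier` class, the `pick_tier` function, and the numeric `cost_rank` are illustrative names introduced here, not part of any ImagineArt or Google API, and it assumes the "8s" entry for Veo 3.1 Fast means exactly 8 seconds.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Tier:
    name: str
    audio: bool
    fixed_durations: Optional[Tuple[int, ...]]  # None means "any length up to max_duration_s"
    max_duration_s: int
    supports_4k: bool
    cost_rank: int  # 1 = lowest relative cost; only used for ordering


# Rows copied from the comparison table above.
TIERS = [
    Tier("Veo 3.1 Lite", audio=False, fixed_durations=(4, 6, 8),
         max_duration_s=8, supports_4k=False, cost_rank=1),
    Tier("Veo 3.1 Fast", audio=True, fixed_durations=(8,),
         max_duration_s=8, supports_4k=True, cost_rank=2),
    Tier("Veo 3.1", audio=True, fixed_durations=None,
         max_duration_s=60, supports_4k=True, cost_rank=3),
]


def pick_tier(duration_s: int, need_audio: bool = False, need_4k: bool = False) -> str:
    """Return the name of the cheapest tier that meets the requirements."""
    for tier in sorted(TIERS, key=lambda t: t.cost_rank):
        if need_audio and not tier.audio:
            continue
        if need_4k and not tier.supports_4k:
            continue
        if tier.fixed_durations is not None:
            if duration_s not in tier.fixed_durations:
                continue
        elif not 0 < duration_s <= tier.max_duration_s:
            continue
        return tier.name
    raise ValueError("No Veo 3.1 tier in the table matches these requirements")
```

For example, `pick_tier(6)` selects Veo 3.1 Lite, while `pick_tier(8, need_audio=True)` moves up to Veo 3.1 Fast because Lite has no native audio.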
Specifications
| Feature | Details |
|---|---|
| Developer | Google DeepMind |
| Released | March 31, 2026 |
| Resolution | 1080p |
| Duration | 4, 6, or 8 seconds (selectable) |
| Aspect ratios | 16:9, 9:16 |
| Audio | No |
| Architecture | Transformer backbone, spatio-temporal patches |
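When generation is driven from a script, it can help to check settings against these specifications before submitting a job. The following is a minimal sketch, assuming a hypothetical settings dictionary with `duration`, `aspect_ratio`, and `audio` keys. The key names and the `validate_lite_settings` function are illustrative only and do not reflect a documented request schema.

```python
from typing import List

# Values taken from the specifications table above.
ALLOWED_DURATIONS = {4, 6, 8}
ALLOWED_ASPECT_RATIOS = {"16:9", "9:16"}


def validate_lite_settings(settings: dict) -> List[str]:
    """Return a list of problems; an empty list means the settings fit the spec."""
    problems = []

    duration = settings.get("duration")
    if duration not in ALLOWED_DURATIONS:
        problems.append(f"duration must be 4, 6, or 8 seconds, got {duration!r}")

    ratio = settings.get("aspect_ratio")
    if ratio not in ALLOWED_ASPECT_RATIOS:
        problems.append(f"aspect_ratio must be 16:9 or 9:16, got {ratio!r}")

    # Veo 3.1 Lite has no native audio generation.
    if settings.get("audio"):
        problems.append("audio is not available; use Veo 3.1 or Veo 3.1 Fast")

    return problems
```

Returning a list of messages rather than raising on the first failure lets a batch pipeline report every issue with a job at once.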
How to use
Write your prompt
Describe the scene, subject, camera movement, and mood. Veo 3.1 Lite responds well to cinematographic language.
Prompting tips
- Be specific and concise — Veo 3.1 Lite performs best with clear, direct prompts. Focus on the key visual elements: subject, action, setting, and lighting.
- Match duration to content complexity — Use 4 seconds for simple product shots or single-action clips; 8 seconds for scenes with more movement or transitions.
- Use cinematographic language — “Wide establishing shot,” “tight close-up,” “slow pan right” all guide the visual framing effectively.
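The tips above suggest a repeatable prompt shape: subject and action, then setting, then framing and lighting. For high-volume pipelines, that shape could be assembled by a small helper such as the sketch below. `build_prompt` and its parameter names are hypothetical conveniences, not a required prompt format.

```python
def build_prompt(subject_action: str, setting: str, shot: str = "",
                 lighting: str = "", extras: str = "") -> str:
    """Compose a short prompt: one scene sentence, then one framing sentence."""
    def clean(text: str) -> str:
        return text.strip().rstrip(".")

    # First sentence: who does what, and where.
    scene = f"{clean(subject_action)} {clean(setting)}."

    # Second sentence: shot type, lighting, and any extra visual details.
    details = [clean(d) for d in (shot, lighting, extras) if d.strip()]
    if not details:
        return scene

    framing = ", ".join(details)
    framing = framing[0].upper() + framing[1:]
    return f"{scene} {framing}."


prompt = build_prompt(
    "A hummingbird hovers near a red flower",
    "in a sunlit garden",
    shot="close-up",
    lighting="natural light",
    extras="green bokeh background",
)
print(prompt)
```

Keeping each element as a separate argument makes it easy to vary one element, such as the shot type, across a batch while holding the rest constant.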
Example prompts
A hummingbird hovers near a red flower in a sunlit garden. Close-up, natural light, green bokeh background. 6 seconds, 16:9.
An architect reviews blueprints spread across a large table. Overhead shot, warm studio lighting, slight camera drift. 8 seconds.

