GOOGLE
Veo 3.1
Cinematic video model от Google с нативным аудио, I2V, V2V и поддержкой 4K
Strengths
What it's the best tool for
- 1080p with synced ambient audio
- Extend up to 148 seconds
- Ingredients to Video
- Frame-level insert/remove
- Reference-image support
Limitations
When to reach for something else
- Max 8s per pass before extend
- Price scales with duration
- Strict celebrity/face filters
- Google content filters
Sample output
How Veo 3.1 responds
Prompt
Culinary Reels: chef slicing vegetables in a studio kitchen, light music, knife and oil sounds, 9:16, 8s.
https://netroom.ai/media/demo/veo-31-chef.mp4
Where teams use it
Four scenarios where it pays for itself
01
Cooking Reels
Audio + 1080p
02
Longer stories
Extend to 148s
03
Ingredients to Video
Build scenes modularly
04
Image spots
Frame-accurate edits
About model
More about Veo 3.1
Veo 3.1 Online — Google's AI Video Model
Veo 3.1 shipped on October 14, 2025: native audio, lip-sync, 1080p. Clip lengths 4/6/8s with extend up to 148s. Hosted on NetRoom.
Capabilities
1080p, synced ambient audio, up to 3 reference images, first/last-frame guidance. Ingredients to Video builds scenes from people, objects and backgrounds.
Formats
16:9 and 9:16, extend for longer stories, insert/remove for frame-level edits.