# Gemini Omni
Gemini Omni is a multimodal video generation and editing model from [[Google DeepMind]], part of the broader [[Gemini]] family. It brings the conversational, iterative editing style of modern image generation to video — Google's own framing is "like [[Nano Banana]], but for video."
## What it does
- **Conversational video editing** — edit by natural-language prompt; each edit builds on the last while scene coherence holds
- **Multimodal input** — takes images, text, video, and audio as references and combines them into one scene
- **World knowledge plus physics** — pairs an intuitive grasp of physics (gravity, kinetic energy, fluid dynamics) with Gemini's knowledge of history, science, and culture
- **Object and motion control** — character and object swapping via reference images, motion transfer, sketch-to-video, style reimagining
- **Output** — video with synchronised visuals, text rendering, and motion
## Prompting
Google's prompt guide frames a good prompt around six elements: shot framing and motion, style, lighting, location, action, and camera technique (e.g. "oner", "dolly zoom", "push in"). Reference images carry consistency across iterative edits.
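One way to keep all six elements in view is to assemble the prompt from named parts. The sketch below is a plain string builder, not part of any Google tooling: the `VideoPrompt` class, its field names, and the sample wording are all hypothetical, and the resulting text would simply be pasted into whichever surface you use.

```python
from dataclasses import dataclass, fields


@dataclass
class VideoPrompt:
    """Hypothetical container for the six prompt elements (names are illustrative)."""

    shot: str      # shot framing and motion
    style: str     # visual style
    lighting: str  # lighting
    location: str  # location
    action: str    # action
    camera: str    # camera technique, e.g. "oner", "dolly zoom", "push in"

    def render(self) -> str:
        """Join the non-empty elements into one prompt string."""
        parts = [getattr(self, f.name).strip().rstrip(".") for f in fields(self)]
        parts = [p for p in parts if p]
        return ". ".join(parts) + "." if parts else ""


prompt = VideoPrompt(
    shot="Wide shot, slowly tracking left",
    style="35mm film look",
    lighting="golden-hour backlight",
    location="a rain-slicked market street",
    action="a vendor folds an umbrella",
    camera="push in",
)
print(prompt.render())
```

Keeping the elements as separate fields makes it easy to change one of them (say, the camera technique) between iterative edits while leaving the rest of the prompt untouched.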
## Availability
Gemini Omni is available through the Gemini app, Google Flow, and YouTube Shorts. A Google AI subscription is required; features vary by tier and region, and early access has been tightly rate-limited.
## How it's received
Reception is mixed. The conversational editing and physics grounding impress, but practitioners report the hard part is still unsolved: deep spatial understanding. Testers describe geometry that morphs when it leaves and re-enters frame, and objects that disappear or merge under physical stress (a falling Jenga tower was a cited failure). Several found ByteDance's Seedance and OpenAI's [[Sora]] stronger on raw sample quality, and access limits drew as much comment as the model itself.
## References
- Model page: https://deepmind.google/models/gemini-omni/
- Prompt guide: https://deepmind.google/models/gemini-omni/prompt-guide/
- Hacker News discussion: https://news.ycombinator.com/item?id=48196609
## Related
- [[Gemini]]
- [[Google DeepMind]]
- [[Gemini 3]]
- [[Nano Banana]]
- [[Gemini 3.1 Flash Live]]
- [[Gemini 3.1 Flash TTS]]
- [[Gemini Mobile App]]
- [[Sora]]