Gemini Omni Flash

FreemiumAI Video Generation

Google's any-to-any video model with conversational editing and native audio.

Gemini Omni Flash is Google DeepMind's any-to-any multimodal video model: text, images, audio or video in — short video with native audio out (720p, 24fps, 3-10s via API). Its standout is conversational, iterative video editing — character swaps, relighting, angle changes — plus SynthID watermarking. Consumer rollout began at Google I/O 2026 and it's now available to Google AI Plus/Pro/Ultra subscribers in the Gemini app and Flow, free on YouTube Shorts, with an open public-preview API (gemini-omni-flash-preview) at ~$0.10 per second of output.

Key features

  • Any-to-any generation (text/image/audio/video in → video + audio out)
  • Conversational, iterative video editing
  • Native audio generation
  • SynthID watermarking
  • Gemini app, Flow, YouTube Shorts + open-preview API

Pros & cons

Pros

  • Editing-by-conversation is unique among video models
  • Broad availability across Google surfaces
  • Cheap per-second API pricing

Cons

  • Short clips (3-10s via API)
  • 720p API output for now
  • API tier still labeled public preview

Gemini Omni Flash pricing

Gemini Omni Flash uses a freemium pricing model, with paid plans from Included in Google AI plans; free on YouTube Shorts; API ~$0.10/sec of video (public preview). A free tier lets you test it before committing. Browse all freemium AI tools we track.

Who should use Gemini Omni Flash?

Gemini Omni Flash is best suited for fast short-form video generation and natural-language video editing inside the Google ecosystem. It earns its place for editing-by-conversation is unique among video models. The main trade-off: short clips (3-10s via API).

Comparing options? See our best ai video generation tools guide, or browse every ai video generation tool tracked on Benchquill.

Key concepts behind ai video generation tools: Text-to-video · Diffusion model · Multimodal.

Gemini Omni Flash FAQ

Is Gemini Omni Flash free?

Gemini Omni Flash is freemium. Pricing starts at Included in Google AI plans; free on YouTube Shorts; API ~$0.10/sec of video (public preview).

What is Gemini Omni Flash best for?

Gemini Omni Flash is best for fast short-form video generation and natural-language video editing inside the Google ecosystem. Its standout strength: editing-by-conversation is unique among video models.

What are the best Gemini Omni Flash alternatives?

The closest alternatives are Google Veo 3.1 and HeyGen. Google Veo 3.1 is google's flagship text-to-video model with native audio and 4K output, while HeyGen is photorealistic AI avatars and video translation for creators and marketers.

AI models behind Gemini Omni Flash

Related models on our leaderboard: Gemini 3.1 Pro, Gemini 3.5 Flash, Gemini 3 Flash, Gemini 2.5 Flash-Lite.

Gemini Omni Flash alternatives

Other top ai video generation tools worth comparing.

All 12 alternatives →

Google Veo 3.1

AI Video Generation
Paid

Google's flagship text-to-video model with native audio and 4K output.

100
Best for: Filmmakers, motion designers, and marketers who want the highest all-around realism with native audio.

HeyGen

AI Video Generation
Freemium

Photorealistic AI avatars and video translation for creators and marketers.

100
Best for: Marketers, sales teams, and creators wanting the most natural-looking avatars and fast localization.

Kling AI

AI Video Generation
Freemium

Kling 3.0 — multi-shot AI video with native audio at low per-second cost.

100
Best for: Budget-conscious creators wanting premium cinematic, multi-shot video at the lowest per-second cost.

Runway

AI Video Generation
Freemium

Runway Gen-4.5 — pro AI video with camera control and character consistency.

100
Best for: Professional creative teams and filmmakers who need precise control over camera, motion, and character consistency.