Grok Imagine Video 1.5

FreemiumAI Video Generation

xAI's image-to-video model with native one-pass audio.

Grok Imagine Video 1.5 is xAI's image-to-video generator (general availability June 2026) that produces 720p clips with native, one-pass audio — synchronized dialogue, sound effects, ambience and music generated together with the video, with no separate audio step. It improves physics and temporal coherence over v1.0 and is fast (the Fast variant renders a 6-second 720p clip in ~25s), and ranked #1 on the Image-to-Video Arena at release. It's available in the Grok apps, at grok.com/imagine, and via the Imagine API.

Key features

  • Image-to-video generation
  • Native one-pass synchronized audio (dialogue, SFX, music)
  • 720p clips with improved physics and temporal coherence
  • Fast: ~6s 720p clip in ~25s (Fast variant)
  • Imagine API + Grok web/iOS/Android apps

Pros & cons

Pros

  • Native audio generated with the video (no post-production)
  • Among the cheapest premium AI video (~$4.80/min via API)
  • Fast generation; ranked #1 on the Image-to-Video Arena at launch

Cons

  • 720p output (not yet 4K)
  • Short clip lengths
  • Best results come from image inputs rather than pure text

Grok Imagine Video 1.5 pricing

Grok Imagine Video 1.5 uses a freemium pricing model, with paid plans from Imagine API $0.080/sec (~$4.80/min); also available in Grok / SuperGrok plans (verify). A free tier lets you test it before committing.

Who should use Grok Imagine Video 1.5?

Grok Imagine Video 1.5 is best suited for Fast, low-cost image-to-video with built-in synchronized audio for marketing and social content. It earns its place for native audio generated with the video (no post-production) — though it's worth weighing the trade-off that 720p output (not yet 4K).

Comparing options? See our best ai video generation tools guide, or browse every ai video generation tool tracked on Benchquill.

Grok Imagine Video 1.5 FAQ

Is Grok Imagine Video 1.5 free?

Grok Imagine Video 1.5 is freemium. Pricing starts at Imagine API $0.080/sec (~$4.80/min); also available in Grok / SuperGrok plans (verify).

What is Grok Imagine Video 1.5 best for?

Grok Imagine Video 1.5 is best for Fast, low-cost image-to-video with built-in synchronized audio for marketing and social content.

AI models behind Grok Imagine Video 1.5

Related models on our leaderboard: Grok 4.3, Grok 4.1 Fast.

Grok Imagine Video 1.5 alternatives

Other top ai video generation tools worth comparing.

All 11 alternatives →

Google Veo 3.1

AI Video Generation
Paid

Google's flagship text-to-video model with native audio and 4K output.

100
Best for: Filmmakers, motion designers, and marketers who want the highest all-around realism with native audio. View →

HeyGen

AI Video Generation
Freemium

Photorealistic AI avatars and video translation for creators and marketers.

100
Best for: Marketers, sales teams, and creators wanting the most natural-looking avatars and fast localization. View →

Kling AI

AI Video Generation
Freemium

Kling 3.0 — multi-shot AI video with native audio at low per-second cost.

100
Best for: Budget-conscious creators wanting premium cinematic, multi-shot video at the lowest per-second cost. View →

Runway

AI Video Generation
Freemium

Runway Gen-4.5 — pro AI video with camera control and character consistency.

100
Best for: Professional creative teams and filmmakers who need precise control over camera, motion, and character consistency. View →