Published Jun 20, 2026
Grok Imagine

Grok Imagine

AI Video Generators·grok.com ↗
Starting at
$30/mo
Free tier
No
Best for
SuperGrok subscribers who want video generation bundled with their AI assistant, and API developers building cost-efficient image-to-video pipelines
Model
Freemium
LIVEfreemium

Overview

Grok Imagine is xAI's image and video generation tool, accessible at grok.com/imagine and inside the Grok chat interface on web, iOS, and Android. Video 1.5 launched June 16, 2026, making this one of the most recently launched AI video models on the market.

The tool takes a different position than standalone video platforms like RunwayML or Kling AI. You are not subscribing to a video tool - you are subscribing to Grok, xAI's AI assistant, and video generation comes bundled with that. That distinction matters for how you evaluate the pricing.

What It Does

Grok Imagine handles both image generation and video generation from a single interface:

Image generation: The Aurora model generates images from text prompts. You can also use the Quality mode (grok-imagine-image-quality) for higher-resolution output at 1024x1024 or 2048x2048.

Video generation (Video 1.5): Animates a still image into a clip with synchronized audio - sound effects, ambient layers, and lip-synced dialogue generated in the same pass, no separate audio step. Output is 720p at up to 15 seconds per clip. The model ranks first on the Image-to-Video Arena leaderboard as of its June 2026 launch, with a 52 Elo jump over its predecessor.

Speed: A 6-second 720p video generates in approximately 25 seconds, down from 40-plus seconds with Video 1.0. For high-volume social content, this speed advantage is real.

Spicy Mode: Grok Imagine includes an opt-in mode that removes standard content guardrails, unlike most competing platforms.

Pricing

Access routes through two paid tiers and a limited free option:

PlanPriceGrok Imagine accessNotes Free (grok.com, logged in)$0/moImage generation requires a free account; logged-out visitors see a sign-in modal and cannot generateFree account sign-up at grok.com SuperGrok Lite$10/moImage + video (480p, 6-sec clips)Launched March 2026; daily caps apply SuperGrok$30/moFull Grok Imagine (image + Video 1.5)Rolling-window daily limits apply SuperGrok Heavy$300/moMaximum rate limitsFor power users and API-heavy workflows X Premium+$40/moSame Grok access as SuperGrokIncludes X platform perks

API pricing (verified 2026-06-20 via docs.x.ai/developers/pricing):

  • Image: $0.02/image (standard) / $0.05/image (1024x1024 quality) / $0.07/image (2048x2048)
  • Video: $0.08/sec at 480p ($4.80/min) or $0.14/sec at 720p ($8.40/min)

Quota reality check: The SuperGrok $30/month plan officially targets around 100 image generations per day and approximately 15-20 videos per day in 720p. In practice, xAI uses a rolling-window fair use throttle rather than a fixed daily reset. Users report hitting limits after 10-15 videos in 720p during peak hours, and failed generations (including moderation blocks) count against your quota. xAI quietly tightened these limits in May 2026 without a public announcement, changing subscription page copy from "near-unlimited" to "highest usage limits."

If you only want video generation and not Grok chat, $30/month for 15-20 usable videos per day puts your cost per video at roughly $1-3, which is more expensive than paying per-video on Kling AI or Luma Labs on their API plans.

Features

  • Image-to-video pipeline: Upload or generate an image, describe the motion, get a 6-15 second 720p clip with synchronized audio in one pass
  • Native audio: Scene-matched sound effects, ambient audio, and dialogue generated alongside video - no separate audio sourcing needed
  • Aurora image model: Text-to-image generation with Quality mode option for higher resolution
  • Projects: Organize your generations into folders (new in June 2026 update)
  • Parallel agents: Run multiple prompts simultaneously within a session
  • Spicy Mode: Optional NSFW content mode, unlike most competing platforms
  • API access: Available via xAI API at documented per-image and per-second rates
  • Cross-platform: Web, iOS, Android, and API

Pros

Native audio is a genuine differentiator. Most video tools require a separate step to add sound. Grok Imagine 1.5 generates dialogue, ambience, and sound effects in the same pass. When it works, it saves meaningful production time.

Generation speed. 25 seconds for a 6-second 720p clip is fast relative to competitors. For social content creators running high volume, this adds up.

API cost. $4.80/minute at 480p and $8.40/minute at 720p undercuts most standalone platforms. For developers building on top of video generation, this is a real advantage.

Image-to-Video Arena #1 ranking (as of launch week, June 2026). Independent leaderboard validation that the quality improvement over 1.0 is real.

Bundled value if you use Grok. If you already use Grok for chat, DeepSearch, or coding help, video generation comes in the same subscription. You are not paying twice.

Conversational prompting. Because Grok Imagine is embedded in a chat interface, you can iterate on prompts conversationally - refine the image, describe the motion, adjust in plain language - without switching tools.

Cons

Quota throttling is unpredictable. xAI changed limits in May 2026 without announcement. At $30/month with rolling-window throttles, you may hit your ceiling mid-session with no clear reset time. Failed generations and moderation blocks both count against your quota.

Image generation requires an account. You need a free xAI/X account to generate anything on grok.com. A logged-out visitor hits a sign-in modal and cannot generate. For unlimited or high-volume access, you need at minimum SuperGrok Lite ($10/month).

720p ceiling. Video 1.5 tops out at 720p. Kling AI 3.0 outputs native 4K at 60fps. Runway Gen-4.5 leads on cinematic physics. If output resolution matters for your workflow, Grok Imagine is not the right choice.

Image-to-video only (for Video 1.5). Text-to-video in the traditional sense (no source image needed) is available but the primary strength is animating an existing image. Pure text-to-video workflows may get inconsistent results.

Content moderation over-blocking. Following January 2026 deepfake controversies, xAI tightened filters significantly. Artistic or ambiguous prompts get flagged more often than on competing platforms, and false positives still consume quota.

Not a standalone tool. Grok Imagine is built into the Grok ecosystem. If you want video generation without the Grok subscription, you are paying for features you may not use. Dedicated tools like Luma Labs or Kling AI give you more control over what you are buying.

Early-stage quality consistency. Video 1.5 is a meaningful improvement, but it is days old at time of writing. Long-term quality consistency across varied prompts is not yet established.

Who It Is For

Good fit if:

  • You already subscribe to SuperGrok for Grok chat and want video generation without a second subscription
  • You build on the xAI API and need a cost-efficient image-to-video option
  • You create high-volume short-form social content and value generation speed over 4K resolution
  • You want native audio in video output without adding an audio production step

Not a good fit if:

  • You need 1080p or 4K output (use Kling AI or Runway instead)
  • You want predictable, high-volume daily quotas (Kling AI's credit system is more transparent)
  • You only want video generation and not the rest of the Grok subscription bundle
  • You have straightforward text-to-video workflows with no source image (Luma Labs Ray3 or Kling AI handle this better)

How It Compares

Grok Imagine Video 1.5 occupies a specific lane: fast, audio-native, image-to-video at 720p, priced aggressively on the API. It is not trying to match Runway Gen-4.5's physics simulation or Kling 3.0's 4K fidelity. On the Image-to-Video Arena leaderboard, it ranked first at launch above Seedance 2.0 and Google Veo - a meaningful independent data point, though it has not held that position long enough to assess stability.

The closest parallel to how it is priced and bundled is Sora, which shut down in April 2026. Grok Imagine effectively fills the "fast, consumer-friendly video generator from a large AI lab" position that Sora vacated, with the added integration of an AI assistant.

For a broader look at the current video generation field, see the Best AI Video Generation Tools 2026 roundup.

FAQ

What is Grok Imagine?
Grok Imagine is xAI's image and video generation tool, available at grok.com/imagine and within the Grok chat interface. It generates images from text prompts using the Aurora model and converts images to video with synchronized audio using the Video 1.5 model (released June 16, 2026).

How much does Grok Imagine cost?
Grok Imagine is included in the SuperGrok subscription at $30/month (or $300/year). A lighter tier, SuperGrok Lite, costs $10/month and includes video generation at 480p. Image generation on grok.com requires at minimum a free xAI/X account; a logged-out visitor cannot generate. API pricing is $0.08/sec ($4.80/min) at 480p or $0.14/sec ($8.40/min) at 720p for video, or $0.02-$0.07 per image depending on resolution.

Is Grok Imagine free?
Image generation on grok.com requires a free xAI/X account. If you visit without signing in, you hit a sign-in modal and cannot generate. Once you have a free account, access to image generation is limited; for high-volume or video generation you need at minimum a SuperGrok Lite plan ($10/month).

What resolution does Grok Imagine Video 1.5 produce?
Video 1.5 outputs at 720p. It does not currently support 1080p or 4K. Clips run up to 15 seconds per generation.

Does Grok Imagine generate audio automatically?
Yes. One of the core features of Video 1.5 is native audio generation in the same pass as video - scene-matched sound effects, ambient audio, and lip-synced dialogue. No separate audio tool is needed.

How does Grok Imagine compare to Kling AI or Runway?
Grok Imagine is faster (25 seconds per 6-second clip), less expensive via API ($4.80/min at 480p or $8.40/min at 720p), and includes native audio. Kling AI 3.0 outputs native 4K at 60fps and offers more granular creative control. Runway Gen-4.5 leads on physics simulation. Choose Grok Imagine for speed and cost on social content; choose Kling or Runway for high-fidelity cinematic work.

What is Grok Imagine's Spicy Mode?
Spicy Mode is an opt-in setting that disables standard content guardrails, allowing generation of adult or NSFW content. It is available on paid tiers and is one of Grok Imagine's distinguishing features compared to competing platforms.

Grok Imagine Interface Overview
Grok Imagine — Interface Overview
+What works
  • Native audio is a genuine differentiator - dialogue, ambience, and sound effects generated in the same pass as video
  • Generation speed: 25 seconds for a 6-second 720p clip, fast relative to competitors
  • API cost: $4.80/min at 480p and $8.40/min at 720p undercuts most standalone platforms
  • Image-to-Video Arena #1 ranking at launch (June 2026) - independent leaderboard validation
  • Bundled value: video generation included in SuperGrok subscription at no additional cost
What doesn't
  • Quota throttling is unpredictable - xAI changed limits in May 2026 without announcement; failed generations count against quota
  • Image generation requires an account - logged-out visitors hit a sign-in modal and cannot generate
  • 720p ceiling - Kling AI 3.0 outputs native 4K at 60fps; Runway Gen-4.5 leads on cinematic physics
  • Image-to-video only for Video 1.5 - pure text-to-video workflows may get inconsistent results
  • Content moderation over-blocking after January 2026 tightening; false positives still consume quota
  • Not a standalone tool - paying for Grok subscription features you may not use if you only want video
Best for

SuperGrok subscribers who want video generation bundled with their AI assistant, and API developers building cost-efficient image-to-video pipelines

Skip if

Quota throttling is unpredictable - xAI changed limits in May 2026 without announcement; failed generations count against quota. Image generation requires an account - logged-out visitors hit a sign-in modal and cannot generate.

Pricing

As of Jun 2026
Freemium
$30/mo
Grok Imagine Pricing Plans
Grok Imagine — Pricing Plans