xai/grok-imagine-image

SOTA image model from xAI

Input

*string
Shift + Return to add a new line

Text prompt for image generation or editing

file

Add file

Input image for editing. When provided, the model edits this image based on the prompt. Supports jpg, jpeg, png, webp.

string

Aspect ratio of the generated image. Ignored when editing an image.

Default: "1:1"

Output

Output
Generated in5.8 seconds
Example 1

README

Grok Imagine Image

Grok Imagine Image is xAI's state-of-the-art image generation and editing model, delivering ultra-fast inference (~5 seconds) with high visual quality. It supports both text-to-image generation and image editing via natural language prompts.

☀️ Why it stands out

  • Ultra-fast generation Typically completes in under 6 seconds, one of the fastest image models available.
  • Image editing Provide an input image and describe the edits in natural language — style transfer, object modification, background replacement, and more.
  • 14 aspect ratio options From standard (1:1, 16:9) to ultra-wide (20:9) and phone-native (9:19.5), plus auto detection.
  • High visual quality State-of-the-art output quality with photorealistic rendering, detailed textures, and accurate text generation.
  • Simple API Just 3 parameters — prompt, optional image, and aspect ratio. Easy to integrate and iterate.

⚙️ How to use

  • Input: text prompt, optionally with an input image for editing
  • Output: generated or edited image
  • Aspect ratios: 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, 2:3, 2:1, 1:2, 19.5:9, 9:19.5, 20:9, 9:20, auto
  • Generation modes:
    • Text-to-image — prompt only
    • Image editing — prompt + input image (aspect ratio is ignored when editing)

🔥 Pricing

RouteAnyReplicate
Per image$0.003$0.003

💡 Best Use Cases

  • Rapid prototyping — Generate visual concepts in seconds for quick iteration.
  • Social media content — Create eye-catching images with phone-native aspect ratios (9:16, 9:19.5).
  • Image editing — Transform existing photos with natural language descriptions.
  • Marketing & branding — Generate banners, ads, and product visuals across all standard formats.
  • Batch generation — Fast inference and low cost make it ideal for high-volume workflows.

📝 Notes

  • Prompts exceeding 2500 characters will be truncated.
  • Supported input image formats: jpg, jpeg, png, webp.
  • Aspect ratio is ignored when editing an input image.

🌐 Where Grok Imagine Image Fits In

Compare with other image models:

  • Nano Banana 2 (Google) – More parameters (resolution, search grounding, multi-image fusion) but slower (~25s). Grok Imagine Image is ~4x faster with a simpler API.
  • Nano Banana Pro (Google) – Highest quality with 4K output and safety controls, but significantly slower (~25-48s) and more expensive.
  • Grok Imagine Video (xAI) – Same family, but generates video instead of images.

Related Models