xai/grok-imagine-video

Generate videos using xAI's Grok Imagine Video model

Input

*string
Shift + Return to add a new line

Text prompt for video generation

file

Add file

Input image to generate video from (image-to-video). Supports jpg, jpeg, png, webp.

string

Duration of the video in seconds

Default: "6"

string

Aspect ratio of the video. For text-to-video, defaults to 16:9. For image-to-video, defaults to the input image's native aspect ratio. Ignored when editing a video.

Default: "auto"

string

Resolution of the video. Ignored when editing a video.

Default: "720p"

string

Style of the video generation. Fun produces playful results, Normal is standard, Spicy produces more expressive or artistic outcomes.

Default: "normal"

Output

Generated in31.8 seconds

README

Grok Imagine Video

Grok Imagine Video is xAI's video generation model, supporting text-to-video, image-to-video, and video editing. It generates clips from 1 to 15 seconds with fast turnaround (~30 seconds for 5s clips).

☀️ Why it stands out

  • Flexible duration Generate videos from 1 to 15 seconds — more granular control than most competitors that only offer fixed durations.
  • Video editing Provide an existing video and edit it with natural language prompts. Supports mp4, mov, and webm up to 8.7 seconds.
  • Image-to-video Animate any still image into a video while preserving composition and style.
  • Fast generation ~30 seconds for a 5-second clip at 720p.
  • Multiple aspect ratios 8 options including 16:9, 9:16, 4:3, 1:1, and auto detection for image-to-video.

⚙️ How to use

  • Input: text prompt, optionally with an image or video
  • Output: MP4 video
  • Duration: 1-15 seconds
  • Resolution: 480p or 720p
  • Aspect ratios: auto, 16:9, 9:16, 4:3, 3:4, 1:1, 3:2, 2:3
  • Generation modes:
    • Text-to-video — prompt only
    • Image-to-video — prompt + input image
    • Video editing — prompt + input video (duration, resolution, aspect ratio ignored)

🔥 Pricing

ConfigRouteAnyReplicate
5s 480p$0.05$0.05
5s 720p$0.075$0.075
10s 720p$0.15$0.15
15s 720p$0.225$0.225

💡 Best Use Cases

  • Short-form social content — Generate TikTok, Reels, and Shorts in 9:16 format.
  • Product animations — Animate product photos into engaging video clips.
  • Video editing — Transform existing clips with style changes, effects, or scene modifications.
  • Storyboarding — Quickly visualize scenes from text descriptions.
  • Looping backgrounds — Generate short ambient loops for presentations or websites.

📝 Notes

  • Input videos for editing must be max 8.7 seconds. Supported formats: mp4, mov, webm.
  • When editing a video, duration, resolution, and aspect ratio parameters are ignored.
  • For image-to-video, aspect ratio defaults to the input image's native ratio when set to "auto".

🌐 Where Grok Imagine Video Fits In

FeatureGrok Imagine VideoVeo 3.1 (Google)
Max Duration15 seconds8 seconds
Max Resolution720p1080p
AudioNoContext-aware
Video EditingYesNo
Reference ImagesNoUp to 3 (R2V)
Speed (5s clip)~30 sec~100 sec
Best ForFast clips, video editingCinematic quality, audio

Need higher resolution, audio, and reference image support? Try Veo 3.1.

Related Models