
grok-imagine-video-1.5
Grok Imagine Video 1.5 Preview is xAI's image-to-video model that turns a single reference image into a high-quality clip with natively synchronized audio: dialogue, sound effects, ambience and music in one pass, with no separate audio tools. In blind tests it topped the Image-to-Video Arena (720p, 1473 Elo), edging out Seedance 2.0, with lifelike motion, refined facial expressions, stronger physics and consistent characters. Provide a reference image, an optional prompt (up to 4096 characters), an aspect ratio, a resolution (480p or 720p), and a duration of 1-15 seconds. Pricing is per second by resolution plus a small per-image fee: $0.098/s at 480p, $0.166/s at 720p, plus $0.015 per input image.
Topped the I2V Arena (720p, 1473 Elo), ahead of Seedance 2.0 — lifelike motion and consistent characters
Synchronized dialogue, SFX, ambience and music in a single workflow — no separate audio step
Precisely follows detailed prompts for motion, camera work and character behavior
Quick generation with stable performance for SaaS, automation and content tools
Native audio: the model automatically produces synchronized audio (dialogue / sound effects / ambience / music) as it generates the video.
Grok Imagine Video 1.5 is a Video Generation API provided by xAI. Through the API Models platform, you can access this model via a unified API at prices significantly lower than official rates. Current pricing: $0.098 per second at 480p, $0.166 per second at 720p, plus $0.015 per input image. For example, an 8-second clip with one input image costs $0.799 at 480p and $1.343 at 720p.
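The 8-second example prices follow directly from the per-second and per-image rates. A minimal sketch of that arithmetic, assuming the rates listed on this page are current and that durations are whole seconds; the function name `estimate_cost` and the rate table are illustrative and not part of any API:

```python
from decimal import Decimal

# Rates as listed on this page, in USD (assumed current; they may change).
PER_SECOND = {"480p": Decimal("0.098"), "720p": Decimal("0.166")}
PER_IMAGE = Decimal("0.015")


def estimate_cost(resolution: str, seconds: int, images: int = 1) -> Decimal:
    """Estimate one Preview clip: seconds * per-second rate + images * per-image fee."""
    if resolution not in PER_SECOND:
        raise ValueError(f"resolution must be one of {sorted(PER_SECOND)}")
    if not 1 <= seconds <= 15:
        raise ValueError("duration must be between 1 and 15 seconds")
    if images < 0:
        raise ValueError("images must be non-negative")
    return PER_SECOND[resolution] * seconds + PER_IMAGE * images


print(estimate_cost("480p", 8))  # 0.799
print(estimate_cost("720p", 8))  # 1.343
```

`Decimal` is used instead of `float` so the totals match the listed prices exactly rather than carrying binary rounding noise.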
Related model: Grok Imagine Video 1.5 (Beta), which offers a flat per-clip price, cheaper 720p and predictable cost. Pick it for budgeting.
The two channels are complementary, not the same endpoint. Beta is about flat, predictable pricing (and cheaper 720p); Preview is about native audio, top-ranked quality and flexible length. Pick by your priority:
| Dimension | Beta (flat price) | Preview (per-second) · this page |
|---|---|---|
| Billing model | Flat price per clip (by duration) | Per second + per input image |
| Price example (720p, 8s) | $0.56 / clip (fixed) | ~$1.34 / clip (varies) |
| Cost predictability | Known before you generate | Varies with duration & images |
| Resolution | 480p / 720p — same price | 480p / 720p — different price |
| Native synced audio | No | Yes (dialogue / SFX / music) |
| Duration | 5 / 8 / 10 / 12 / 15s | Any 1-15s |
| Image-to-Video Arena | — | #1 (720p, 1473 Elo) |
| Upstream channel | RunningHub stable workflow | xAI Grok Preview (KIE) |
| Best for | Budgeting, batch, cost control | Top quality, audio, flexible length |
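The trade-offs in the table can be reduced to a rough rule of thumb. The sketch below is one hypothetical way to encode it: the function `pick_model` is illustrative, the Preview model ID is assumed from this page's header, and the rule deliberately prefers Beta's flat pricing whenever the request fits its limits.

```python
# Fixed duration tiers (seconds) that the Beta channel offers, per the table.
BETA_DURATIONS = {5, 8, 10, 12, 15}

PREVIEW = "grok-imagine-video-1.5"    # assumed ID for this page's Preview channel
BETA = "grok-imagine-video-1.5-beta"  # Beta channel ID


def pick_model(duration_s: int, needs_audio: bool) -> str:
    """Return a model ID that fits the request, preferring Beta's flat price."""
    if not 1 <= duration_s <= 15:
        raise ValueError("duration must be between 1 and 15 seconds")
    if needs_audio:
        return PREVIEW  # only Preview produces native synchronized audio
    if duration_s not in BETA_DURATIONS:
        return PREVIEW  # Beta supports its fixed duration tiers only
    return BETA


print(pick_model(8, needs_audio=False))  # grok-imagine-video-1.5-beta
print(pick_model(7, needs_audio=False))  # grok-imagine-video-1.5
```

This rule ignores output quality; if top-ranked quality matters more than cost, choose Preview regardless of what the function returns.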
Quickly generate brand promotion videos for ad campaigns and social media marketing.
Create compelling short-form video content for platforms like TikTok, Instagram, and YouTube.
Generate product feature demonstrations and tutorials to improve user conversion.
Produce course explanations, knowledge explainers, and training videos at low cost.
Grok Imagine Video 1.5 is available through API Models at $0.098 per second for 480p and $0.166 per second for 720p, plus $0.015 per input image. An 8-second clip with one input image costs $0.799 at 480p and $1.343 at 720p. This is up to 95% cheaper than official pricing.
Sign up at API Models, get your API key, and call our unified API endpoint. We provide detailed API documentation with code examples in cURL, Python, and Node.js.
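Before calling the endpoint, it can help to check inputs against the limits stated on this page. The sketch below only builds and validates a request body; it makes no network call. The field names (`model`, `image_url`, `prompt`, `aspect_ratio`, `resolution`, `duration`) and the helper `build_request` are assumptions for illustration, so consult the API Models documentation for the actual schema and endpoint.

```python
MAX_PROMPT_CHARS = 4096          # prompt limit stated on this page
RESOLUTIONS = ("480p", "720p")   # resolutions stated on this page


def build_request(image_url: str, aspect_ratio: str, prompt: str = "",
                  resolution: str = "720p", duration: int = 8) -> dict:
    """Validate inputs and return a request body (hypothetical field names)."""
    if not image_url:
        raise ValueError("a reference image is required")
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {RESOLUTIONS}")
    if not 1 <= duration <= 15:
        raise ValueError("duration must be between 1 and 15 seconds")
    return {
        "model": "grok-imagine-video-1.5",  # assumed model ID from this page
        "image_url": image_url,
        "prompt": prompt,
        "aspect_ratio": aspect_ratio,       # accepted values are not listed here
        "resolution": resolution,
        "duration": duration,
    }


body = build_request("https://example.com/reference.png", "16:9",
                     prompt="Slow push-in on the subject")
print(body["resolution"], body["duration"])
```

The aspect ratio is passed through unchecked because this page does not list the accepted values.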
API Models offers the same Grok Imagine Video 1.5 model at 60-95% lower cost through our aggregation platform. We provide a unified API interface, so you do not need separate accounts for each provider: one API key gives access to all models.
If predictable cost matters more to you, use Grok Imagine Video 1.5 (Beta) (grok-imagine-video-1.5-beta): it has flat per-clip pricing by duration (5 / 8 / 10 / 12 / 15s), with 480p and 720p costing the same, and its 720p is cheaper than on this Preview channel. The trade-off is no native audio and fixed duration tiers. For top quality and native synchronized audio, stay on this Preview page.
On API Models, Grok Imagine Video 1.5 runs alongside 60+ models on one API key and one balance, so choosing is about fit, not lock-in. It supports Image to Video, Native Audio, Strong Prompt Adherence, 480p / 720p, 1-15s, and you can weigh it on price and capability against other Video Generation models, then switch by changing a single model-name string — no new account or integration. Browse every Video Generation option with live pricing at apimodels.app/models.
Grok Imagine Video 1.5 supports: Image to Video, Native Audio, Strong Prompt Adherence, 480p / 720p, 1-15s. See the API Models docs for full parameters and call examples.
API Models exposes Grok Imagine Video 1.5 through a single unified API and one key, with no separate provider accounts and no need to handle each provider's regional network access yourself.
We support Stripe (Visa, Mastercard, and other international cards) and Alipay. Credits are available instantly after payment.