ltx-2.3
LTX-2.3 is Lightricks' open-source text-to-video foundation model, released in March 2026. It is a diffusion transformer (DiT) that generates high-fidelity video and synchronized audio from one model in a single pass, with no separate dubbing step. Its 4× larger gated-attention text connector improves fidelity on complex prompts: multi-subject scenes, spatial relationships and style instructions are reproduced more accurately. A remastered VAE delivers crisper detail, realistic textures and cleaner edges, and an upgraded vocoder produces clearer, better-synced audio.
LTX-2.3 outputs native 1080p in both portrait (9:16) and landscape (16:9) without cropping, at a selectable 24 or 48 fps. Clips run 5 to 20 seconds (5 to 15 seconds for text-to-video), with picture and sound delivered together.
Through API Models you call it via one unified endpoint: send a prompt for text-to-video, or add one reference image for image-to-video. Billing is per second by resolution (480p $0.02/s, 720p $0.04/s, 1080p $0.045/s), which is below the official $0.06/s at every tier: about 67% lower at 480p, about 33% lower at 720p, and 25% lower at 1080p. Pricing is pay-as-you-go, and only successful requests are charged.
4× larger gated-attention text connector — multi-subject scenes, spatial relations and style instructions reproduced accurately
Picture and sound generated together in one pass (upgraded vocoder) — ship without any post-dubbing
9:16 and 16:9 at true 1080p (no cropping), selectable 24 / 48 fps
Sharper fine detail, realistic textures and cleaner edges than LTX-2
Prompt only → text-to-video; add one image → image-to-video (auto-routed). 5-15s (T2V) / 5-20s (I2V)
Per second: 480p $0.02/s · 720p $0.04/s · 1080p $0.045/s — below the official $0.06/s at every tier (up to ~67% lower)
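The auto-routing and duration limits listed above can be sketched as a small client-side check. This is a minimal illustration, not the platform's actual routing code: the function names are hypothetical, and it assumes a prompt is always required, including for image-to-video.

```python
from typing import Optional

# Duration limits in seconds, taken from the feature list:
# 5-15s for text-to-video, 5-20s for image-to-video.
DURATION_LIMITS = {
    "text-to-video": (5, 15),
    "image-to-video": (5, 20),
}


def select_mode(prompt: str, image_url: Optional[str] = None) -> str:
    """Pick the mode implied by the inputs (hypothetical helper).

    Prompt only means text-to-video; adding one image means image-to-video.
    """
    if not prompt.strip():
        # Assumption: a prompt is required in both modes.
        raise ValueError("a prompt is required")
    return "image-to-video" if image_url else "text-to-video"


def check_duration(mode: str, seconds: int) -> None:
    """Raise ValueError if the clip length is outside the mode's range."""
    low, high = DURATION_LIMITS[mode]
    if not low <= seconds <= high:
        raise ValueError(f"{mode} clips must be {low}-{high}s, got {seconds}s")
```

Validating the duration before sending a request avoids a round trip for a clip length the chosen mode cannot produce, such as a 20-second text-to-video clip.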
LTX-2.3 Video is a video generation API provided by API Models. Through the API Models platform, you can access this model via a unified API at prices below the official rate. Current pricing: 480p $0.02 per second, 720p $0.04 per second, 1080p $0.045 per second. For example, a 5-second 720p clip costs $0.20, and a roughly 16-second 1080p clip costs $0.72.
Quickly generate brand promotion videos for ad campaigns and social media marketing.
Create compelling short-form video content for platforms like TikTok, Instagram, and YouTube.
Generate product feature demonstrations and tutorials to improve user conversion.
Produce course explanations, knowledge explainers, and training videos at low cost.
LTX-2.3 Video is available through API Models at 480p $0.02 per second, 720p $0.04 per second, and 1080p $0.045 per second. For example, a 5-second 720p clip costs $0.20 and a roughly 16-second 1080p clip costs $0.72. Compared with the official $0.06 per second, that is about 25% to 67% cheaper, depending on resolution.
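The per-clip figures follow directly from the per-second rates. A minimal sketch of the arithmetic, using the rates quoted above (the function name is illustrative only):

```python
from decimal import Decimal

# Per-second rates in USD, as quoted on this page.
RATE_PER_SECOND = {
    "480p": Decimal("0.02"),
    "720p": Decimal("0.04"),
    "1080p": Decimal("0.045"),
}


def clip_cost(resolution: str, seconds: int) -> Decimal:
    """Cost in USD of one successful clip at the given resolution."""
    return RATE_PER_SECOND[resolution] * seconds


print(clip_cost("720p", 5))    # 0.20
print(clip_cost("1080p", 16))  # 0.720
```

`Decimal` is used instead of `float` so that currency amounts stay exact when many clips are summed.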
Sign up at API Models, get your API key, and call our unified API endpoint. We provide detailed API documentation with code examples in cURL, Python, and Node.js.
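As a rough sketch of what such a call might look like in Python, the snippet below builds a request without sending it. Everything specific here is an assumption for illustration: the endpoint URL, the JSON field names (`model`, `prompt`, `image_url`, `resolution`, `duration`, `fps`), the `ltx-2.3` model identifier and the bearer-token header are placeholders, not the documented API. Consult the API Models documentation for the real parameters.

```python
import json
import urllib.request
from typing import Optional

# Hypothetical endpoint; replace with the URL from the official docs.
ENDPOINT = "https://api.example.com/v1/video/generations"


def build_request(api_key: str, prompt: str, resolution: str = "720p",
                  duration: int = 5, fps: int = 24,
                  image_url: Optional[str] = None) -> urllib.request.Request:
    """Build (but do not send) a generation request with assumed field names."""
    payload = {
        "model": "ltx-2.3",        # assumed model identifier
        "prompt": prompt,
        "resolution": resolution,  # "480p", "720p" or "1080p"
        "duration": duration,      # seconds
        "fps": fps,                # 24 or 48
    }
    if image_url:
        # Adding one reference image switches the call to image-to-video.
        payload["image_url"] = image_url
    return urllib.request.Request(
        ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would send it. That step is left out because the endpoint above is a placeholder.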
API Models offers the same LTX-2.3 Video model through our aggregation platform at roughly 25% to 67% below the official $0.06 per second rate, depending on resolution. We provide a unified API interface, so you do not need separate accounts for each provider: one API key gives access to all models.
LTX-2.3 is Lightricks' open-source text-to-video foundation model, released March 2026. It's a diffusion-transformer (DiT) that generates video and synchronized audio from one model in a single pass — no separate dubbing. Versus the previous generation it adds a 4× larger gated-attention text connector that markedly improves complex-prompt fidelity (multi-subject scenes, spatial relationships, style instructions), a remastered VAE for sharper detail and cleaner edges, and an upgraded vocoder for clearer, better-synced audio.
(1) High complex-prompt fidelity via the 4× text connector; (2) native synchronized audio — picture and sound generated together, ready to ship without post-dubbing; (3) native 1080p in both portrait (9:16) and landscape (16:9), no cropping; (4) selectable 24 / 48 fps; (5) full 5–20s clip with audio in one pass; (6) open source with LoRA fine-tuning. On API Models it is also a unified text-to-video + image-to-video endpoint (prompt only → T2V, add an image → I2V).
Use it where you need picture and sound in one shot: vertical short-form video (TikTok / Reels / Shorts / 小红书), social ads and product clips, trailers/intros with built-in music and SFX, talking/voiceover content, game and anime-style shorts, and high-volume content pipelines. Native portrait 1080p + synchronized audio make it especially strong for mobile short-form, and open-source + LoRA suits teams that need custom styles or characters.
Yes. The official LTX-2.3 rate is $0.06 per second at 1080p (about 16 seconds for $1). On API Models it is billed per second by resolution — 480p $0.02/s, 720p $0.04/s, 1080p $0.045/s — cheaper than the official $0.06/s at every tier (about 67% lower at 480p, and ~25% lower even at 1080p). You also get one key for every model, China-reachable access, no verification, only-successful-requests billing, and $0.3 free on signup.
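The percentages above can be reproduced from the quoted rates. A short sketch, assuming the single official rate of $0.06 per second is the baseline for every tier, as this page does:

```python
OFFICIAL_RATE = 0.06  # USD per second, official rate quoted on this page

# API Models per-second rates by resolution.
RATES = {"480p": 0.02, "720p": 0.04, "1080p": 0.045}


def savings_percent(rate: float, official: float = OFFICIAL_RATE) -> float:
    """Percentage by which `rate` undercuts the official rate."""
    return (official - rate) / official * 100


for resolution, rate in RATES.items():
    print(f"{resolution}: {savings_percent(rate):.0f}% lower")

# Seconds of video that one dollar buys at the official rate.
print(f"{1 / OFFICIAL_RATE:.1f} s per $1")
```

At the official rate one dollar buys about 16.7 seconds, which matches the "about 16 seconds for $1" figure. The 720p tier works out to about 33% lower.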
LTX-2.3 Video supports: Text & Image to Video, Native Synchronized Audio, 1080p Portrait & Landscape, 24 / 48 fps, 5-20s, Open Source (Lightricks). See the API Models docs for full parameters and call examples.
Yes. API Models exposes LTX-2.3 Video through a single unified API and one key — no separate provider accounts, and no need to handle each provider's regional network access yourself.
We support Stripe (Visa, Mastercard, and other international cards) and Alipay. Credits are available instantly after payment.