LTX-2.3 Video

ltx-2.3

LTX-2.3 is Lightricks' open-source text-to-video foundation model, released in March 2026. It is a diffusion transformer (DiT) that generates high-fidelity video and synchronized audio from one model in a single pass, with no separate dubbing step. Its 4× larger gated-attention text connector improves complex-prompt fidelity: multi-subject scenes, spatial relationships, and style instructions are reproduced more accurately. A remastered VAE delivers crisper detail, realistic textures, and cleaner edges, while an upgraded vocoder produces clearer, better-synced audio. LTX-2.3 outputs native 1080p in both portrait (9:16) and landscape (16:9) without cropping, at a selectable 24 or 48 fps, and renders 5-20 second clips (5-15s for text-to-video, 5-20s for image-to-video) with picture and sound delivered together.

Through API Models you call it via one unified endpoint: send a prompt for text-to-video, or add one reference image for image-to-video. Billing is per second by resolution (480p $0.02/s, 720p $0.04/s, 1080p $0.045/s), which is below the official $0.06/s at every tier: about 67% lower at 480p, 33% at 720p, and 25% at 1080p. It is pay-as-you-go, and only successful requests are charged.

Text & Image to Video · Native Synchronized Audio · 1080p Portrait & Landscape · 24 / 48 fps · 5-20s · Open Source (Lightricks)

480p · per second: $0.020/s

720p · per second: $0.040/s

1080p · per second: $0.045/s

720p · 5s: $0.200 total

1080p · ~16s: $0.720 total

Complex-Prompt Fidelity

4× larger gated-attention text connector — multi-subject scenes, spatial relations and style instructions reproduced accurately

Native Synchronized Audio

Picture and sound generated together in one pass (upgraded vocoder) — ship without any post-dubbing

Native 1080p, Portrait & Landscape

9:16 and 16:9 at true 1080p (no cropping), selectable 24 / 48 fps

Remastered VAE

Sharper fine detail, realistic textures and cleaner edges than LTX-2

One Model, Two Modes

Prompt only → text-to-video; add one image → image-to-video (auto-routed). 5-15s (T2V) / 5-20s (I2V)

Cheaper Than Official

Per second: 480p $0.02/s · 720p $0.04/s · 1080p $0.045/s — below the official $0.06/s at every tier (up to ~67% lower)

TL;DR LTX-2.3 Video is a video generation model available through API Models' unified API (model name `ltx-2.3`). Pricing is per second: 480p $0.02/s, 720p $0.04/s, 1080p $0.045/s. For example, a 5-second 720p clip costs $0.20 and a roughly 16-second 1080p clip costs $0.72, which is 25-67% below the official $0.06/s rate depending on resolution. One API key covers all image, video, LLM, and audio models, with $1 free on signup.

About LTX-2.3 Video

LTX-2.3 Video is a video generation API provided by API Models. It exposes Lightricks' open-source LTX-2.3 model, which generates video and synchronized audio in a single pass, through the platform's unified API. Billing is per second by resolution: 480p $0.02/s, 720p $0.04/s, 1080p $0.045/s. That works out to $0.20 for a 5-second 720p clip and $0.72 for a roughly 16-second 1080p clip, below the official $0.06/s rate at every tier.

Use Cases

Marketing Videos

Quickly generate brand promotion videos for ad campaigns and social media marketing.

Social Media Content

Create compelling short-form video content for platforms like TikTok, Instagram, and YouTube.

Product Demos

Generate product feature demonstrations and tutorials to improve user conversion.

Educational Content

Produce course explanations, knowledge explainers, and training videos at low cost.

Why API Models

Unified API -- One API key to access all models, no need to register on multiple platforms
Cost Savings -- 60-95% cheaper than official pricing, ideal for indie developers and startups
Instant Access -- Start using immediately after signup, supports Stripe and Alipay payments
Full Documentation -- Detailed API docs with code examples in cURL, Python, and Node.js

Frequently Asked Questions

How much does LTX-2.3 Video cost?

LTX-2.3 Video is available through API Models at $0.02/s (480p), $0.04/s (720p), and $0.045/s (1080p). For example, a 5-second 720p clip costs $0.20 and a roughly 16-second 1080p clip costs $0.72. That is 25-67% below the official $0.06/s rate, depending on resolution.
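Per-second billing is plain multiplication of rate by clip length. As a minimal sketch, the example prices listed on this page can be reproduced as follows. The `RATES_PER_SECOND` table and the `estimate_cost` helper are illustrative names, not part of any API Models SDK.

```python
from decimal import Decimal

# Per-second rates quoted on this page, in USD.
RATES_PER_SECOND = {
    "480p": Decimal("0.02"),
    "720p": Decimal("0.04"),
    "1080p": Decimal("0.045"),
}


def estimate_cost(resolution: str, seconds: int) -> Decimal:
    """Return the estimated charge in USD for one successful clip."""
    if resolution not in RATES_PER_SECOND:
        raise ValueError(f"unknown resolution: {resolution!r}")
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    # Decimal avoids binary floating-point rounding on currency values.
    return RATES_PER_SECOND[resolution] * seconds


if __name__ == "__main__":
    print(estimate_cost("720p", 5))    # 0.20
    print(estimate_cost("1080p", 16))  # 0.720
```

Since only successful requests are charged, this is an estimate of the charge for a clip that actually renders.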

How do I use the LTX-2.3 Video API?

Sign up at API Models, get your API key, and call our unified API endpoint. We provide detailed API documentation with code examples in cURL, Python, and Node.js.
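The API docs define the actual endpoint and request schema. Purely as an illustration of the auto-routing described on this page, the sketch below builds a request body and checks it against the limits quoted here. Only the model name `ltx-2.3` comes from this page. The field names (`prompt`, `image_url`, `resolution`, `aspect_ratio`, `duration`, `fps`) and the `build_request` helper are assumptions, so verify them against the docs before use.

```python
from typing import Optional

# Limits quoted on this page. All field names below are hypothetical.
RESOLUTIONS = {"480p", "720p", "1080p"}
ASPECT_RATIOS = {"16:9", "9:16"}
FRAME_RATES = {24, 48}
MIN_SECONDS = 5
MAX_SECONDS = {"text-to-video": 15, "image-to-video": 20}


def build_request(
    prompt: str,
    duration: int,
    resolution: str = "720p",
    aspect_ratio: str = "16:9",
    fps: int = 24,
    image_url: Optional[str] = None,
) -> dict:
    """Build a request body; a reference image selects image-to-video."""
    mode = "image-to-video" if image_url else "text-to-video"
    if not prompt.strip():
        raise ValueError("prompt is required")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(RESOLUTIONS)}")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"aspect_ratio must be one of {sorted(ASPECT_RATIOS)}")
    if fps not in FRAME_RATES:
        raise ValueError(f"fps must be one of {sorted(FRAME_RATES)}")
    if not MIN_SECONDS <= duration <= MAX_SECONDS[mode]:
        raise ValueError(
            f"{mode} duration must be {MIN_SECONDS}-{MAX_SECONDS[mode]}s"
        )
    body = {
        "model": "ltx-2.3",  # model name given on this page
        "prompt": prompt,
        "duration": duration,
        "resolution": resolution,
        "aspect_ratio": aspect_ratio,
        "fps": fps,
    }
    if image_url:
        body["image_url"] = image_url  # hypothetical field name
    return body
```

Validating on the client side before sending avoids a round trip for requests that would be rejected anyway, for example a 20-second text-to-video clip.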

What is the difference between API Models and the official LTX-2.3 API?

API Models offers the same LTX-2.3 Video model through an aggregation platform, at 25-67% lower per-second cost than the official rate depending on resolution. The unified API interface means you do not need separate accounts for each provider: one API key gives access to all models.

What is LTX-2.3?

LTX-2.3 is Lightricks' open-source text-to-video foundation model, released March 2026. It's a diffusion-transformer (DiT) that generates video and synchronized audio from one model in a single pass — no separate dubbing. Versus the previous generation it adds a 4× larger gated-attention text connector that markedly improves complex-prompt fidelity (multi-subject scenes, spatial relationships, style instructions), a remastered VAE for sharper detail and cleaner edges, and an upgraded vocoder for clearer, better-synced audio.

What makes LTX-2.3 special?

(1) High complex-prompt fidelity via the 4× text connector; (2) native synchronized audio — picture and sound generated together, ready to ship without post-dubbing; (3) native 1080p in both portrait (9:16) and landscape (16:9), no cropping; (4) selectable 24 / 48 fps; (5) full 5–20s clip with audio in one pass; (6) open source with LoRA fine-tuning. On API Models it is also a unified text-to-video + image-to-video endpoint (prompt only → T2V, add an image → I2V).

What is LTX-2.3 best for?

Use it where you need picture and sound in one shot: vertical short-form video (TikTok / Reels / Shorts / Xiaohongshu), social ads and product clips, trailers and intros with built-in music and SFX, talking or voiceover content, game and anime-style shorts, and high-volume content pipelines. Native portrait 1080p plus synchronized audio make it especially strong for mobile short-form, and open source plus LoRA fine-tuning suits teams that need custom styles or characters.

Is LTX-2.3 cheaper than the official API?

Yes. The official LTX-2.3 rate is $0.06 per second at 1080p (about 16 seconds for $1). On API Models it is billed per second by resolution — 480p $0.02/s, 720p $0.04/s, 1080p $0.045/s — cheaper than the official $0.06/s at every tier (about 67% lower at 480p, and ~25% lower even at 1080p). You also get one key for every model, China-reachable access, no verification, only-successful-requests billing, and $0.3 free on signup.
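The quoted percentages follow directly from the listed rates. Here is a quick check, assuming (as this page does) that the official $0.06/s rate is the baseline for every tier. The `percent_below_official` helper is an illustrative name.

```python
from decimal import Decimal, ROUND_HALF_UP

OFFICIAL_RATE = Decimal("0.06")  # USD per second, as quoted on this page
API_MODELS_RATES = {
    "480p": Decimal("0.02"),
    "720p": Decimal("0.04"),
    "1080p": Decimal("0.045"),
}


def percent_below_official(resolution: str) -> Decimal:
    """Return how far a tier sits below the official rate, in whole percent."""
    rate = API_MODELS_RATES[resolution]
    saving = (OFFICIAL_RATE - rate) / OFFICIAL_RATE * 100
    return saving.quantize(Decimal("1"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    for resolution in API_MODELS_RATES:
        print(resolution, f"{percent_below_official(resolution)}% lower")
    # 480p 67% lower
    # 720p 33% lower
    # 1080p 25% lower
```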

What can LTX-2.3 Video do?

LTX-2.3 Video supports: Text & Image to Video, Native Synchronized Audio, 1080p Portrait & Landscape, 24 / 48 fps, 5-20s, Open Source (Lightricks). See the API Models docs for full parameters and call examples.

Can I access the LTX-2.3 Video API from anywhere (incl. China)?

Yes. API Models exposes LTX-2.3 Video through a single unified API and one key — no separate provider accounts, and no need to handle each provider's regional network access yourself.

What payment methods are supported?

We support Stripe (Visa, Mastercard, and other international cards) and Alipay. Credits are available instantly after payment.