85+ AI APIs Compare & Try Live — 95% Off Official | API Models

Catálogo de modelos

Preços Saldo Convide e ganhe Chaves de API Documentação da API Registros Perfil

Catálogo de modelos

Preços Saldo Convide e ganhe Chaves de API Documentação da API Registros Perfil

apimodels.app

Uma API unificada para modelos de imagem, vídeo, LLM e áudio — 60–95% mais barata que a oficial.

Produto

Modelos
Preços
Documentação da API
Integrações
Chaves de API

Empresa

Sobre
Blog
Contato
GitHub

Jurídico

Termos de Serviço
Política de Privacidade

© 2026 apimodels.app · Todos os direitos reservados.

All

Seedance 2.0 Real Person

ByteDance

NEWReal-PersonMultimodal

Seedance 2.0 Real Person — full multimodal video generation that ACCEPTS real faces (unlike Ark-direct Seedance, which rejects them). Combine up to 3 reference images (character / scene / lighting), a driving video for camera and motion transfer, and an audio track for voice, plus a text prompt. 4-15s, 480p to 4k. With a driving video, billing follows the minimum-billing table (input + output seconds). Use only with consented subjects.

Real-PersonUp to 3 ImagesDriving Video + Audio4-15s480p–4k

$0.15/s+ / call

Seedance 2.0 Real Person Fast

ByteDance

NEWReal-PersonFast

Seedance 2.0 Real Person Fast — faster, lower-cost real-person video. One reference image plus an optional driving video for motion/camera transfer, with a text prompt. Real-person supported. 4-15s, 480p to 4k. Use only with consented subjects.

Real-PersonImage + Driving VideoFast4-15s480p–4k

$0.12/s+ / call

LTX-2.3 Video

Lightricks

LTX-2.3 unified text-to-video and image-to-video. Send a prompt for T2V, or add one reference image for I2V — fast, with 480p / 720p / 1080p output. Billed per second by resolution.

T2V & I2V480p / 720p / 1080p16:9 / 9:165-20s

$0.02/s+ / call

Grok Imagine Video 1.5 (Beta)

xAI

NEWBetaFlat Price

Grok Imagine Video 1.5 (Beta) — an alternative RunningHub channel for xAI Grok image-to-video. Turn one reference image into a cinematic clip with an optional prompt. Simple flat per-clip pricing by duration (5 / 8 / 10 / 12 / 15s), 480p or 720p.

Image to VideoFlat per-clip price480p / 720p5-15s

Grok Imagine Video 1.5

xAI

NEW#1 I2V ArenaNative Audio

xAI Grok Imagine Video 1.5 Preview — image-to-video with native synchronized audio. #1 on the Image-to-Video Arena, with lifelike motion, strong prompt adherence and consistent characters. 480p / 720p, 1-15s.

Image to VideoNative Audio480p / 720p1-15s

$0.098/s+ / call

Omni Flash (Stable)

Google

NEWStableFull Suite

Omni Flash (Stable) — lower-cost, full-suite Gemini Omni video. Text / image (up to 7 refs) / video-to-video, plus reusable voices and consistent characters. 720p / 1080p / 4k, 4 / 6 / 8 / 10s, 16:9 or 9:16, optional seed.

T2V / I2V / V2VUp to 7 Ref ImagesVoice + Character720p / 1080p / 4k

Seedance 2.0

ByteDance

NEWDirect OfficialStable

ByteDance Seedance 2.0 cinematic video — direct official Volcengine API, stable and high-concurrency. Text, image and multimodal generation with friendly per-second pricing.

T2V & I2VMultimodal Reference4-15s480p/720p/1080p

$0.092/s+ / call

Omni Flash (Gemini)

Google

NEWUnified T2V + I2V4K

Gemini Omni Flash — unified video generator for both text-to-video and image-to-video (1 or 3 reference images). 720p / 1080p / 4k, 4 / 6 / 8 / 10s, optional 16:9 or 9:16 framing. One slug, two modes — drop in a prompt, optionally drop in images.

T2V & I2V720p / 1080p / 4k4-10s1 or 3 Ref Images

Seedance 2.0 Cinematic

ByteDance

Real-Person RefCinematicHigh Quality

Film-grade edition of Seedance 2.0 — cinematic lighting, mood and camera motion, and it ACCEPTS real-person / realistic human reference images (unlike Ark-direct Seedance 2.0, which rejects real faces). Up to 4 reference images for identity-locked image-to-video — ideal for film-grade portrait and character work. Quality tier sits above the Ark variants; generation takes longer (typically 60-180s). Use only with consented subjects.

Real-Person ReferenceFilm-GradeT2V & I2VUp to 4 Ref Images

Seedance 2.0 Fast

ByteDance

NEWDirect OfficialStable

Seedance 2.0 Fast — direct official Volcengine API, faster and lower-cost. Text, image and multimodal video generation, stable under high concurrency.

Text to VideoImage to VideoMultimodal Reference4-15s480p/720p

$0.071/s+ / call

DreamActor V2 (Motion Transfer)

ByteDance

ByteDance DreamActor V2 motion transfer. Drive any character image with reference video motion, supporting multi-person, anime and pets.

Motion TransferMulti-PersonAnime SupportLip SyncMax 30s

Kling Lip-Sync Video

Kling

Kling AI lip-sync video generation. Frame-level lip synchronization with audio for real humans, 3D and 2D characters.

Lip SyncMulti-CharacterAudio AlignmentMinute-Level Duration

$0.065/5s / call

Kling Lip-Sync TTS

Kling

Kling text-to-speech synthesis with multi-language support, voice cloning, speed control and emotion styles.

Text to SpeechMulti-LanguageVoice CloneSpeed Control

Omni Video V3.1-Lite (Start-End, Official Stable)

Google

Smooth cinematic transitions between a required first frame and required last frame. Outputs 720p or 1080p with native audio. Official stable channel — pricier than V3.1-fast but reliable, ideal for production.

Start + End FrameBoth Required720p / 1080p16:9 / 9:16Native AudioOfficial Stable

$0.72-$1.152 / call

VEO 3.1 Fast HD

Google

VEO 3.1 Fast HD (720p) video generation. 8s fixed duration, 16:9 aspect ratio, reference image support.

Text to VideoImage to Video720p8s

VEO 3.1 Fast Full HD

Google

VEO 3.1 Fast Full HD (1080p) video generation. 8s fixed duration, 16:9 aspect ratio, reference image support.

Text to VideoImage to Video1080p8s

Grok Video 3 (Official)

xAI

Grok Video 3 (alias of grok-video-3, same upstream). Per-second pricing $0.01/s, 6-30s output, T2V + I2V supported.

Text to VideoImage to Video6-30s720p / 480p$0.01/s

$0.06-$0.30 / call

VEO 3.1 Fast 4K (Beta)

Google

VEO 3.1 Fast 4K video generation. Requires start frame image. Supports start-end frame video generation.

Image to VideoStart-End Frame4K1080p

$0.07-$0.10 / call

P-Video

Pruna AI

Fast video generation in ~10 seconds. Text/image/audio-to-video with draft mode for 4x faster previews. Built-in audio generation, up to 1080p 48FPS.

T2V & I2VDraft ModeBuilt-in Audio720p/1080p1-20sMulti-Aspect

Kling Motion Control

Kling

Generate videos with character motion control. Provide a reference image and motion video to create animated content.

Motion Control720p / 1080pUp to 30s Video

$0.06/s+ / call

Kling V3 Omni

Kling

Kling V3 Omni-Video with extended duration and keep-original-sound support for video editing. Flat $0.15/s billing.

Multi-ModalText/Image/Video Input5-15sAudioKeep Original Sound

Kling V3

Kling

Latest Kling V3 video generation. Supports 3-15s flexible duration, text-to-video and image-to-video with optional audio.

Text to VideoImage to Video3-15sAudioPer-Second Pricing

$0.12/s+ / call

Omni Video V3.1-fast (Start-End Frame, Budget)

Google

High-performance start-end frame video. Provide first + optional last frame, the model interpolates motion between them in seconds. Budget channel — cheaper than the official VEO, less stable.

Start + End Frame8s720p / 1080p / 4K16:9 / 9:16Budget Channel

$0.08-$0.20 / call

Grok Video 3 (10s)

xAI

Grok Video 3. 10-second video at $0.01/s. Supports text-to-video and image-to-video (up to 7 reference images).

Text to VideoImage to Video10s720p / 480p$0.01/s

Grok Video 3

xAI

Grok Video 3. Per-second pricing $0.01/s, 6-30 second output. Both T2V (omit images) and I2V (1-7 reference images) supported.

Text to VideoImage to Video6-30s720p / 480p$0.01/s

$0.06-$0.30 / call

Video Generation

All ByteDance Lightricks xAI Google Kling Pruna AI