85+ AI APIs: Compare & Try Live, 95% Off Official

Text to Image | Image to Image | 1K/2K

Kling V3 image generation. Text-to-image and single-reference image-to-image, 1K/2K resolution. $0.05 per image.

$0.05 / call

Kling V3 Omni

Kling

Multi-Image | Element Consistency | Series Output | 1K/2K/4K

Kling V3 Omni image generation. Multi-image reference and fusion, element consistency, single or series output at 1K/2K/4K. 1K/2K costs $0.05 per image; 4K costs $0.10 per image.

$0.05+ / call
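The resolution-based pricing above can be sketched as a small lookup. This is only an illustration: the helper name `estimate_image_cost` is hypothetical, and it assumes each image in a series is billed at the listed per-image rate, which the listing does not state explicitly.

```python
from decimal import Decimal

# Per-image prices copied from the Kling V3 Omni listing (USD).
PRICE_PER_IMAGE = {
    "1K": Decimal("0.05"),
    "2K": Decimal("0.05"),
    "4K": Decimal("0.10"),
}


def estimate_image_cost(resolution: str, count: int = 1) -> Decimal:
    """Hypothetical helper: estimated USD cost for `count` images.

    Assumes every image in a series is billed at the per-image rate.
    """
    key = resolution.upper()
    if key not in PRICE_PER_IMAGE:
        raise ValueError(f"unsupported resolution: {resolution!r}")
    if count < 0:
        raise ValueError("count must be non-negative")
    return PRICE_PER_IMAGE[key] * count


if __name__ == "__main__":
    print(estimate_image_cost("2K"))
    print(estimate_image_cost("4K", count=3))
```

Using `Decimal` rather than `float` keeps the small currency amounts exact when they are multiplied or summed.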

Kling Lip-Sync Video

Kling

Lip Sync | Multi-Character | Audio Alignment | Minute-Level Duration

Kling AI lip-sync video generation. Frame-level lip synchronization with audio for real humans, 3D and 2D characters.

$0.065 / 5s

Kling Lip-Sync TTS

Kling

Text to Speech | Multi-Language | Voice Clone | Speed Control

Kling text-to-speech synthesis with multi-language support, voice cloning, speed control and emotion styles.

$0.01 / call

Kling Custom Voice

Kling

Custom Voice | Audio Upload | Video Reference | For TTS/Lip Sync

Create custom voice profiles from audio samples. Upload .mp3/.wav/.mp4/.mov (5-30s) or reference a video ID.

$0.006 / call
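The upload constraints above (file type and 5-30s length) can be checked locally before a sample is submitted. The sketch below is a minimal, hypothetical pre-flight check; `check_voice_sample` is not part of any Kling API, and the caller is assumed to already know the clip duration.

```python
from pathlib import Path

# Constraints copied from the Kling Custom Voice listing.
ALLOWED_EXTENSIONS = {".mp3", ".wav", ".mp4", ".mov"}
MIN_SECONDS = 5.0
MAX_SECONDS = 30.0


def check_voice_sample(filename: str, duration_seconds: float) -> list[str]:
    """Hypothetical pre-flight check; returns a list of problems (empty if OK)."""
    problems = []
    extension = Path(filename).suffix.lower()
    if extension not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported file type: {extension or '(none)'}")
    if not MIN_SECONDS <= duration_seconds <= MAX_SECONDS:
        problems.append(
            f"duration {duration_seconds}s is outside {MIN_SECONDS}-{MAX_SECONDS}s"
        )
    return problems


if __name__ == "__main__":
    print(check_voice_sample("narrator.wav", 12.0))
    print(check_voice_sample("narrator.ogg", 42.0))
```

Returning a list of problems instead of raising on the first one lets a caller report every issue with a sample at once.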

Kling Motion Control

Kling

Motion Control | 720p / 1080p | Up to 30s Video

-70% | Active

Generate videos with character motion control. Provide a reference image and motion video to create animated content.

$0.06+ / s

Kling Face Recognition

Kling

Face Detection | Video Input | Session-based

Identify faces in a video and return a session ID and face IDs for Kling lip-sync video generation.

$0.01 / call

Kling Omni-Image

Kling

Text to Image | Image Editing | 1K/2K Quality | Multi-Image Input

AI image generation and editing by Kling (omni-image, model kling-image-o1). Supports 1K/2K resolution and multi-image input. $0.05 per image.

$0.05 / call

Kling Sound Effects

Kling

Text to SFX | 3-10s | Sound Effects

-85% | Active

Generate sound effects from text descriptions. Produces 3-10 second, natural-sounding audio clips.

$0.03 / call

Kling Video-to-Audio

Kling

Video Dubbing | SFX + BGM | ASMR Mode

-85% | Active

Auto-generate sound effects and background music for videos. Supports ASMR mode for immersive content.

$0.003 / call

Kling TTS

Kling

TTS | Multiple Voices | Speed Control

-85% | Active

Text-to-speech with multiple voice options. Adjustable speed and multi-language support.

$0.01 / call

Kling V3 Omni

Kling

Multi-Modal | Text/Image/Video Input | 5-15s | Audio | Keep Original Sound

Kling V3 Omni-Video with extended duration and keep-original-sound support for video editing. Flat $0.15/s billing.

$0.15 / s
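Flat per-second billing makes the cost of a clip a simple product of rate and duration. The sketch below illustrates that arithmetic for the listed $0.15/s rate and 5-15s range; the function name is hypothetical, and it assumes durations are billed as whole seconds, which the listing does not confirm.

```python
from decimal import Decimal

# Rate and duration range copied from the Kling V3 Omni video listing.
RATE_PER_SECOND = Decimal("0.15")
MIN_SECONDS = 5
MAX_SECONDS = 15


def estimate_video_cost(seconds: int) -> Decimal:
    """Hypothetical helper: estimated USD cost of one clip at the flat rate.

    Assumes whole-second billing with no rounding or minimum charge.
    """
    if not MIN_SECONDS <= seconds <= MAX_SECONDS:
        raise ValueError(
            f"duration must be between {MIN_SECONDS} and {MAX_SECONDS} seconds"
        )
    return RATE_PER_SECOND * seconds


if __name__ == "__main__":
    for duration in (5, 10, 15):
        print(duration, estimate_video_cost(duration))
```

Under these assumptions a clip costs between $0.75 (5s) and $2.25 (15s).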

Kling V3

Kling

Text to Video | Image to Video | 3-15s | Audio | Per-Second Pricing

Latest Kling V3 video generation. Supports text-to-video and image-to-video with a flexible 3-15s duration and optional audio.

$0.12+ / s