Kling
Kling V3 image generation. Text-to-image and single-reference image-to-image, 1K/2K resolution. $0.05 per image.
Kling
Kling V3 Omni image generation. Multi-image reference & fusion, element consistency, single/series output, 1K/2K/4K — 1K/2K $0.05, 4K $0.10 per image.
Kling
Kling AI lip-sync video generation. Frame-level lip synchronization with audio for real humans, 3D and 2D characters.
Kling
Kling text-to-speech synthesis with multi-language support, voice cloning, speed control and emotion styles.
Kling
Create custom voice profiles from audio samples. Upload .mp3/.wav/.mp4/.mov (5-30s) or reference a video ID.
Kling
Generate videos with character motion control. Provide a reference image and motion video to create animated content.
Kling
Identify faces in a video and return a session ID and face IDs for Kling lip-sync video generation.
Kling
AI image generation and editing by Kling (omni-image, model kling-image-o1). Supports 1K/2K resolution and multi-image input. $0.05 per image.
Kling
Generate sound effects from text descriptions. 3-10 second audio with natural quality.
Kling
Auto-generate sound effects and background music for videos. Supports ASMR mode for immersive content.
Kling
Text-to-speech with multiple voice options. Adjustable speed and multi-language support.
Kling
Kling V3 Omni-Video with extended duration and keep-original-sound support for video editing. Flat $0.15/s billing.
Kling
Latest Kling V3 video generation. Supports 3-15s flexible duration, text-to-video and image-to-video with optional audio.