Kling
Kling AI lip-sync video generation. Frame-level lip synchronization with audio for real humans and for 3D and 2D characters.
Kling
Kling text-to-speech synthesis with multi-language support, voice cloning, speed control and emotion styles.
Kling
Kling V3 video via Stable-QN channel. Supports text-to-video and image-to-video, producing 3-15s clips with optional audio.
Kling
Kling V3 Omni-Video via Stable-QN channel. Multi-modal input with image_list, video_list and keep-original-sound.
Kling
Create custom voice profiles from audio samples. Upload a .mp3, .wav, .mp4 or .mov file (5-30s) or reference a video ID.
Kling
Sync one or multiple faces in a video with custom audio. Supports precise timing control.
Kling
Generate videos with character motion control. Provide a reference image and motion video to create animated content.
Kling
Identify faces in video for advanced lip-sync. Returns session ID and face IDs.
Kling
AI image generation and editing by Kling. Supports 1K/2K resolution and multi-image input for creative editing.
Kling
Generate sound effects from text descriptions. Produces natural-sounding audio clips of 3-10 seconds.
Kling
Kling OmniImage via Stable-QN channel. Text-to-image with reference image support, 1K/2K resolution.
Kling
Auto-generate sound effects and background music for videos. Supports ASMR mode for immersive content.
Kling
Text-to-speech with multiple voice options. Adjustable speed and multi-language support.