
kling-video-to-audioKling Video-to-Audio auto-generates matching sound effects and background music (SFX + BGM) for a video, supporting 3–20 second clips, with an ASMR mode. It's a fast way to score silent footage with fitting ambience and music, turning raw or generated clips into immersive, ready-to-share content.
Add audio to any video
Sound effects and music
Enhanced detail sounds
Video ID or URL
Your generated audio will appear here
Kling Video-to-Audio is a Audio & Speech API provided by Kling. Kling Video-to-Audio auto-generates matching sound effects and background music (SFX + BGM) for a video, supporting 3–20 second clips, with an ASMR mode. It's a fast way to score silent footage with fitting ambience and music, turning raw or generated clips into immersive, ready-to-share content. Through API Models platform, you can access this model via a unified API at prices significantly lower than official rates. Current pricing: per call: $0.003.
Generate professional-grade voiceovers for videos, animations, and ads with diverse voice options.
Quickly produce podcast audio content with support for multi-character dialogue.
Convert text content into natural, fluid speech for audiobook production.
AI-powered multilingual dubbing and translation to help content reach global audiences.
Kling Video-to-Audio is available through API Models at: per call: $0.003. This is up to 95% cheaper than official pricing.
Sign up at API Models, get your API key, and call our unified API endpoint. We provide detailed API documentation with code examples in cURL, Python, and Node.js.
API Models offers the same Kling Video-to-Audio model at 60-95% lower cost through our aggregation platform. We provide a unified API interface so you do not need separate accounts for each provider - one API key to access all models.
It auto-generates matching sound effects and background music (SFX + BGM) for a video, supporting 3–20 second clips, with an ASMR mode. Good for scoring silent footage with ambience and quickly adding a soundtrack to short videos for immersive content.
On API Models, Kling Video-to-Audio runs alongside 60+ models on one API key and one balance, so choosing is about fit, not lock-in. It supports Video Dubbing, SFX + BGM, ASMR Mode, 3-20s Video, and you can weigh it on price and capability against other Audio & Speech models, then switch by changing a single model-name string — no new account or integration. Browse every Audio & Speech option with live pricing at apimodels.app/models.
Kling Video-to-Audio supports: Video Dubbing, SFX + BGM, ASMR Mode, 3-20s Video. See the API Models docs for full parameters and call examples.
Yes. API Models exposes Kling Video-to-Audio through a single unified API and one key — no separate provider accounts, and no need to handle each provider's regional network access yourself.
We support Stripe (Visa, Mastercard, and other international cards) and Alipay. Credits are available instantly after payment.