
kling-custom-voiceCreate custom voice profiles from audio samples. Upload .mp3/.wav/.mp4/.mov (5-30s, clean single voice) or reference a historical video ID. Custom voices can be used in TTS and Lip Sync models.
Upload .mp3/.wav/.mp4/.mov samples
Use a historical video ID as source
Create reusable voice profiles
$0.006 per voice creation
Clean single voice, 5-30 seconds, no background noise
Create a custom voice to see the result
Kling Custom Voice is a Audio & Speech API provided by Kling. Create custom voice profiles from audio samples. Upload .mp3/.wav/.mp4/.mov (5-30s, clean single voice) or reference a historical video ID. Custom voices can be used in TTS and Lip Sync models. Through API Models platform, you can access this model via a unified API at prices significantly lower than official rates.
Generate professional-grade voiceovers for videos, animations, and ads with diverse voice options.
Quickly produce podcast audio content with support for multi-character dialogue.
Convert text content into natural, fluid speech for audiobook production.
AI-powered multilingual dubbing and translation to help content reach global audiences.
Kling Custom Voice is available through API Models at significantly lower prices than official rates. Visit the model page for current pricing.
Sign up at API Models, get your API key, and call our unified API endpoint. We provide detailed API documentation with code examples in cURL, Python, and Node.js.
API Models offers the same Kling Custom Voice model at 60-95% lower cost through our aggregation platform. We provide a unified API interface so you do not need separate accounts for each provider - one API key to access all models.
We support Stripe (Visa, Mastercard, and other international cards) and Alipay. Credits are available instantly after payment.