Models/Kling Face Recognition

Kling Face Recognition

kling-identify-face

Kling Face Recognition detects faces in a video — pass a videoUrl or videoId and it returns a sessionId plus a list of faceIds. Those IDs feed Kling Lip-Sync Video: it's the first step of the lip-sync flow, letting you target exactly which face to sync in a multi-person clip before aligning audio to that face.

Face DetectionVideo InputSession IDFace ID List

per call$0.010/s

Face Detection

Auto-detect all faces in a video

Session-based

Returns sessionId + faceId for lip-sync

Video Input

Accepts videoUrl or videoId

Per-Call Pricing

$0.01 per call

API Docs

Kling lip-sync is a 3-step flow — run them in order; intermediate values carry forward automatically.

Step 1 · Face Recognition

Source video (MP4/MOV, 2-60s, 720p/1080p, clear face)

Upload

Upload or paste a public video URL; recognition returns sessionId + faceId.

Step 2 · Prepare Audio

Step 3 · Generate Lip-Sync Video

soundStartTime (ms)

soundEndTime (ms)

soundInsertTime (ms)

Trimmed audio must be ≥2s; the insert window must overlap the face window by ≥2s.

Last updated: 2026-06-21

TL;DR Kling Face Recognition is a Kling video generation model, callable via API Models' unified API (model name `kling-identify-face`). Pricing: per call: $0.01. One API key for all image / video / LLM / audio models — 60-95% cheaper than official.

About Kling Face Recognition

Kling Face Recognition is a Video Generation API provided by Kling. Kling Face Recognition detects faces in a video — pass a videoUrl or videoId and it returns a sessionId plus a list of faceIds. Those IDs feed Kling Lip-Sync Video: it's the first step of the lip-sync flow, letting you target exactly which face to sync in a multi-person clip before aligning audio to that face. Through API Models platform, you can access this model via a unified API at prices significantly lower than official rates. Current pricing: per call: $0.01.

Key Features

Face Detection -- Auto-detect all faces in a video
Session-based -- Returns sessionId + faceId for lip-sync
Video Input -- Accepts videoUrl or videoId
Per-Call Pricing -- $0.01 per call

Use Cases

Marketing Videos

Quickly generate brand promotion videos for ad campaigns and social media marketing.

Social Media Content

Create compelling short-form video content for platforms like TikTok, Instagram, and YouTube.

Product Demos

Generate product feature demonstrations and tutorials to improve user conversion.

Educational Content

Produce course explanations, knowledge explainers, and training videos at low cost.

Why API Models

Unified API -- One API key to access all models, no need to register on multiple platforms
Cost Savings -- 60-95% cheaper than official pricing, ideal for indie developers and startups
Instant Access -- Start using immediately after signup, supports Stripe and Alipay payments
Full Documentation -- Detailed API docs with code examples in cURL, Python, and Node.js

Frequently Asked Questions

How much does Kling Face Recognition cost?

Kling Face Recognition is available through API Models at: per call: $0.01. This is up to 95% cheaper than official pricing.

How to use Kling Face Recognition API?

Sign up at API Models, get your API key, and call our unified API endpoint. We provide detailed API documentation with code examples in cURL, Python, and Node.js.

What is the difference between API Models and the official Kling API?

API Models offers the same Kling Face Recognition model at 60-95% lower cost through our aggregation platform. We provide a unified API interface so you do not need separate accounts for each provider - one API key to access all models.

What is Kling Face Recognition?

It detects faces in a video: pass a videoUrl or videoId and it returns a sessionId plus a list of faceIds. Those IDs feed Kling Lip-Sync Video — identify the face you want to lip-sync, then sync that specific face.

What role does it play in the lip-sync flow?

It's the first step of lip-sync: detect faces → get a faceId → specify that faceId plus audio in Kling Lip-Sync Video to produce the synced clip. In multi-person videos it lets you target exactly which face to lip-sync.

How does Kling Face Recognition compare to other Video Generation models?

On API Models, Kling Face Recognition runs alongside 60+ models on one API key and one balance, so choosing is about fit, not lock-in. It supports Face Detection, Video Input, Session ID, Face ID List, and you can weigh it on price and capability against other Video Generation models, then switch by changing a single model-name string — no new account or integration. Browse every Video Generation option with live pricing at apimodels.app/models.

What can Kling Face Recognition do?

Kling Face Recognition supports: Face Detection, Video Input, Session ID, Face ID List. See the API Models docs for full parameters and call examples.

Can I access the Kling Face Recognition API from anywhere (incl. China)?

Yes. API Models exposes Kling Face Recognition through a single unified API and one key — no separate provider accounts, and no need to handle each provider's regional network access yourself.

What payment methods are supported?

We support Stripe (Visa, Mastercard, and other international cards) and Alipay. Credits are available instantly after payment.