session_id that must be used in the subsequent lip sync generation request.
Workflow Overview
session_id binds the video to the face analysis result. Without it, lip sync generation cannot proceed.
Video Requirements
- The video must contain at least one clearly visible, front-facing face
- Faces that are well-lit and without heavy obstruction produce better results
- The video must be accessible via a public URL, or use a previously generated Kling video ID (
video_id)
Quick Example
Request Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
video_url | string | One of the two | Public URL of the source video |
video_id | string | One of the two | ID of a previously generated Kling video |
Response
This endpoint is synchronous — results are returned immediately with no polling needed.| Field | Description |
|---|---|
data.session_id | Session ID to pass to the lip sync endpoint |
data.face_list[] | All faces detected in the video |
face_list[].face_id | Unique identifier for each face |
face_list[].face_rect | Face bounding box {x, y, width, height} (pixels) |
face_list will contain multiple entries.
Next Step: Lip Sync Generation
Use the session_id to generate a lip sync video.
API Reference
View the interactive API documentation for Kling Face Identification.

