Key Capabilities
- Multi-image input — Use up to 4 reference images as character/scene sources
- Text guidance — Combine images and text prompts to control video narrative
- Flexible duration — Generate 5-second or 10-second videos
- Aspect ratio — Supports 16:9, 9:16, and 1:1 output formats
- Standard & Pro modes — Choose standard mode or high-quality pro mode
Quick Example
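As a minimal sketch, a request body can be assembled from the parameters documented on this page. The image URLs and prompt text are placeholders. The endpoint URL and authentication are not covered in this section, so the sketch only builds and prints the JSON body rather than sending it.

```python
import json

# Placeholder image URLs (hypothetical); replace them with real URLs or base64 strings.
payload = {
    "model_name": "kling-v1-6",
    "image_list": [
        {"image": "https://example.com/character.png"},
        {"image": "https://example.com/scene.png"},
    ],
    "prompt": "The character walks through the scene at sunset.",
    "mode": "std",           # "std" or "pro"
    "duration": "5",         # string: "5" or "10"
    "aspect_ratio": "16:9",  # "16:9", "9:16", or "1:1"
}

# Serialize the body; sending it is left out because the endpoint is not given here.
body = json.dumps(payload, indent=2)
print(body)
```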
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| `model_name` | string | No | Must be `kling-v1-6` |
| `image_list` | array | Yes | Reference images (1–4); each item has the format `{"image": "url_or_base64"}` |
| `prompt` | string | Yes | Text prompt (up to 2500 characters) |
| `negative_prompt` | string | No | Negative prompt (up to 2500 characters) |
| `mode` | string | No | `std` or `pro`; default `std` |
| `duration` | string | No | `5` or `10` (seconds); default `5` |
| `aspect_ratio` | string | No | `16:9`, `9:16`, or `1:1`; default `16:9` |
| `callback_url` | string | No | Task status callback URL |
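The constraints in the table can be checked on the client before a request is submitted. The sketch below is one possible way to do that; `validate_request` is a hypothetical helper, not part of any SDK, and it assumes the 2500-character limit counts characters the way Python's `len()` does, which the API may or may not match.

```python
ALLOWED_MODES = {"std", "pro"}
ALLOWED_DURATIONS = {"5", "10"}  # documented as strings
ALLOWED_ASPECT_RATIOS = {"16:9", "9:16", "1:1"}
MAX_PROMPT_CHARS = 2500


def validate_request(payload: dict) -> list:
    """Return a list of human-readable problems; an empty list means the payload looks valid."""
    errors = []

    # image_list: required, 1 to 4 items, each shaped like {"image": "<url or base64>"}
    images = payload.get("image_list")
    if not isinstance(images, list) or not 1 <= len(images) <= 4:
        errors.append("image_list must contain 1 to 4 items")
    elif not all(
        isinstance(item, dict) and isinstance(item.get("image"), str) and item["image"]
        for item in images
    ):
        errors.append('each image_list item must look like {"image": "url_or_base64"}')

    # prompt: required, at most 2500 characters
    prompt = payload.get("prompt")
    if not isinstance(prompt, str) or not prompt:
        errors.append("prompt is required")
    elif len(prompt) > MAX_PROMPT_CHARS:
        errors.append("prompt exceeds 2500 characters")

    # negative_prompt: optional, at most 2500 characters
    negative = payload.get("negative_prompt")
    if negative is not None and (
        not isinstance(negative, str) or len(negative) > MAX_PROMPT_CHARS
    ):
        errors.append("negative_prompt must be a string of at most 2500 characters")

    # Optional fields fall back to the documented defaults when absent.
    if payload.get("model_name", "kling-v1-6") != "kling-v1-6":
        errors.append("model_name must be kling-v1-6")
    if payload.get("mode", "std") not in ALLOWED_MODES:
        errors.append("mode must be std or pro")
    if payload.get("duration", "5") not in ALLOWED_DURATIONS:
        errors.append('duration must be the string "5" or "10"')
    if payload.get("aspect_ratio", "16:9") not in ALLOWED_ASPECT_RATIOS:
        errors.append("aspect_ratio must be 16:9, 9:16, or 1:1")

    return errors
```

Calling `validate_request({"image_list": [{"image": "https://example.com/a.png"}], "prompt": "hi"})` returns an empty list, since every omitted optional field falls back to its documented default. Passing `duration` as the integer `5` is reported as a problem because the table documents the field as a string.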
API Reference
View the interactive API Playground for Kling 1.6 Multi-Image-to-Video.

