Key Capabilities
- OpenAI compatible — Drop-in replacement for the OpenAI SDK with no code changes required
- 1M token context — Supports large-scale documents and multi-turn conversations, up to 66K output tokens
- Multimodal input — Supports text, image, and video input
- Agentic coding — Significant improvement on coding agent benchmarks
- Reasoning mode — Enable built-in chain-of-thought via the
enable_thinkingparameter - Built-in tools — Supports web search, code interpreter, webpage scraping, and image search via the Responses API
Quick Example
Note: Bothimage_urlandvideo_urlsupport remote URLs (https://...) and base64 data URIs (data:image/png;base64,.../data:video/mp4;base64,...). Image tokens and video tokens are reflected inusage.prompt_tokens_details.
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | Must be qwen3.6-flash |
messages | array | Yes | Array of { role, content } objects. Supports image_url and video_url multimodal input |
max_completion_tokens | integer | No | Maximum number of tokens to generate |
temperature | float | No | 0–2, controls randomness, default 1 |
stream | boolean | No | Enable SSE streaming, default false |
top_p | float | No | Nucleus sampling threshold, default 1 |
stop | array | No | Stop sequences. Must be array format, default null |
enable_thinking | boolean | No | Enable reasoning mode via extra_body, default false |
API Reference
View the interactive API Playground for Qwen3.6-Flash.

