Key Capabilities
- OpenAI-compatible — Drop-in replacement for the OpenAI SDK, no other code changes needed
- Best coding model — Top-performing for code generation, debugging, and refactoring
- Fast and efficient — Lower latency and cost compared to Opus models
- Powerful reasoning — Expert at multi-step problem solving and analysis
- Long context — Supports large document processing and multi-turn conversations
- Streaming — Real-time token streaming via SSE
Quick Example
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | Fixed value: claude-sonnet-4-20250514 |
messages | array | Yes | Array of { role, content } objects |
max_tokens | integer | No | Maximum number of tokens to generate |
temperature | float | No | 0–2, controls randomness, default 1 |
stream | boolean | No | Enable SSE streaming, default false |
top_p | float | No | Nucleus sampling threshold, default 1 |
stop | string / array | No | Sequences where generation stops |
API Reference
View the interactive API Playground for Claude Sonnet 4.

