Skip to main content
Claude Opus 4.8 is Anthropic’s most capable general-purpose model, available through Starrise AI via OpenAI-compatible interface. Built on Claude Opus 4.7, it features improvements in long-horizon agentic coding, tool-trigger reliability, and compression handling.

Key Capabilities

  • OpenAI-compatible — Drop-in replacement for the OpenAI SDK, no other code changes needed
  • 1M context window — 128K max output tokens
  • Adaptive thinking — Triggers reasoning only when the task requires it, reducing unnecessary thinking tokens
  • Fast mode — Up to 2.5× output speed (research preview, premium pricing)
  • In-conversation system messages — Append updated instructions without resending the full system prompt
  • Lower cache threshold — Minimum cacheable prompt length reduced to 1,024 tokens
  • Streaming — Real-time token streaming via SSE

Quick Example

curl https://ai.alad.com/v1/messages \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-opus-4-8",
    "messages": [
      { "role": "user", "content": "Explain quantum entanglement in simple terms." }
    ]
  }'

Parameters

ParameterTypeRequiredDescription
modelstringYesFixed value: claude-opus-4-8
messagesarrayYesArray of { role, content } objects
max_tokensintegerNoMaximum number of tokens to generate
streambooleanNoEnable SSE streaming, default false
stopstring / arrayNoSequences where generation stops
Claude Opus 4.8 does not support temperature, top_p, and top_k parameters. Setting non-default values returns a 400 error — guide model behavior through prompts instead.

API Reference

View the interactive API Playground for Claude Opus 4.8.