Skip to main content
Gemini 2.5 Flash Image (Streaming) is available through Starrise AI via the native Gemini API, supporting real-time SSE streaming of image generation results. Thinking chunks are pushed first, followed immediately by the final image chunk.

Key Capabilities

  • SSE streaming — Real-time delivery of thinking chunks and image chunks
  • Thinking mode — Internal reasoning chunks (thought: true) streamed before the image
  • Text-to-image — Generate images from text descriptions
  • Image editing — Pass a reference image via inline_data combined with text instructions
  • Aspect ratio control1:1, 4:3, 3:4, 16:9, 9:16
  • Resolution control1K (~1024px), 2K (~2048px), 4K (~4096px, by longest side)

SSE Response Format

The streaming endpoint returns newline-delimited SSE data lines, each starting with data: followed by a JSON object. There are three chunk types:
  1. Thinking chunk — Arrives first; parts[0].thought is true
  2. Image chunk — Contains parts[0].inlineData with mimeType and base64 data (note: camelCase in streaming responses)
  3. Final usage chunk — Contains top-level usageMetadata with thoughtsTokenCount and per-modality token details
data: {"candidates":[{"content":{"role":"model","parts":[{"text":"...","thought":true}]}}],"usageMetadata":{"trafficType":"ON_DEMAND"},"modelVersion":"gemini-2.5-flash-image","createTime":"...","responseId":"..."}

data: {"candidates":[{"content":{"role":"model","parts":[{"inlineData":{"mimeType":"image/png","data":"<base64>"}}]}}],...}

data: {"usageMetadata":{"promptTokenCount":8,"candidatesTokenCount":1120,"totalTokenCount":1392,"trafficType":"ON_DEMAND","promptTokensDetails":[{"modality":"TEXT","tokenCount":8}],"candidatesTokensDetails":[{"modality":"IMAGE","tokenCount":1120}],"thoughtsTokenCount":264}}
In streaming responses, the image field is inlineData (camelCase), while in the request body it is inline_data (snake_case). This is native Gemini API behavior.

Text-to-Image Example

curl "https://ai.alad.com/v1beta/models/gemini-2.5-flash-image:streamGenerateContent?key=YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [
      {
        "role": "user",
        "parts": [
          { "text": "Generate an image of a mountain sunset" }
        ]
      }
    ],
    "generationConfig": {
      "responseModalities": ["TEXT", "IMAGE"],
      "imageConfig": {
        "aspectRatio": "16:9",
        "imageSize": "1K"
      }
    }
  }'

Image Editing Example (with Reference Image)

Pass both a text instruction and an inline_data reference image in the same parts array.
# First convert image to base64:
# BASE64=$(base64 -i your_photo.jpg)
#
# Then send the request:
curl "https://ai.alad.com/v1beta/models/gemini-2.5-flash-image:streamGenerateContent?key=YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "contents": [
      {
        "role": "user",
        "parts": [
          {
            "text": "This is a photo of me, please add an alpaca beside me"
          },
          {
            "inline_data": {
              "mime_type": "image/jpeg",
              "data": "<YOUR_BASE64_ENCODED_IMAGE>"
            }
          }
        ]
      }
    ],
    "generationConfig": {
      "responseModalities": ["TEXT", "IMAGE"],
      "imageConfig": {
        "aspectRatio": "1:1",
        "imageSize": "1K"
      }
    }
  }'

Parameters

ParameterTypeRequiredDescription
keystringYesAPI key (query parameter)
altstringNoSet to sse to explicitly enable SSE mode (optional, streaming is the default behavior)
contents[].parts[].textstringYesText prompt or instruction
contents[].parts[].inline_data.mime_typestringNoReference image type: image/jpeg, image/png, image/webp
contents[].parts[].inline_data.datastringNoBase64-encoded reference image data
generationConfig.responseModalitiesarrayYes["IMAGE"] or ["TEXT", "IMAGE"]
generationConfig.imageConfig.aspectRatiostringNo1:1 / 4:3 / 3:4 / 16:9 / 9:16
generationConfig.imageConfig.imageSizestringNo1K / 2K / 4K (default 1K)

API Reference

View the interactive API Playground for Gemini 2.5 Flash Image (Streaming).