Key Capabilities
- Text-to-video — Generate videos from natural language descriptions
- Image-to-video — Use a reference image as the first frame
- Video remixing — Reuse the structure, motion, and framing from a previous video
- Flexible duration — 4, 8, or 12 seconds
- Multiple resolutions — Portrait (720x1280), landscape (1280x720), and more
Workflow
The Sora 2 API is asynchronous. Follow these steps:- Create task —
POST /v1/videos - Query status —
GET /v1/videos/{id}(poll untilstatusiscompleted) - Download video —
GET /v1/videos/{id}/content
Quick Example
Step 1: Create Task
Step 2: Query Status
Step 3: Download Video
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
model | string | Yes | Must be sora-2 |
prompt | string | Yes | Natural language description of the video. Include shot type, subject, action, scene, and lighting. Keep to a single purpose for best results. |
seconds | string | No | 4, 8, 12. Default: 4 |
size | string | No | 720x1280, 1280x720, 1024x1792, 1792x1024. Default: 720x1280 |
input_reference | file | No | Reference image for the first frame. Accepts image/jpeg, image/png, image/webp. |
remix_video_id | string | No | ID of a completed video to reuse its structure, motion, and framing. |
API Reference
View the interactive API Playground for Sora 2.

