Sora 2 is OpenAI’s flagship text-to-video and audio generation system, designed to produce short, controllable cinematic clips with synchronized audio (dialogue and sound effects) and persistent scene state. Compared with earlier text-to-video systems, it offers markedly improved physical plausibility (motion, momentum, buoyancy) and stronger safety controls.
Model Type: Video