Technical Specifications of `doubao-seedance-1-5-pro-251215`

Specification	Details
Model ID	`doubao-seedance-1-5-pro-251215`
Provider family	Doubao / ByteDance Seed
Model type	Native audio-video generation model
Core modality	Joint video-and-audio generation from a single model
Primary use cases	Text-to-video, image-to-video, and prompt-driven cinematic video generation
Input formats	Text prompts; image-conditioned generation is also supported
Output formats	Video with synchronized audio, including voices and sound effects
Strengths	Complex prompt following, audio-visual synchronization, multilingual speech/lip-sync support, cinematic motion and storytelling
Deployment context	Available through ByteDance/Volcengine model platforms and exposed on CometAPI under this model ID
Commercial availability	Commercial/API availability is indicated by provider and platform listings

What is `doubao-seedance-1-5-pro-251215`?

doubao-seedance-1-5-pro-251215 is CometAPI’s platform identifier for ByteDance’s Seedance 1.5 Pro model, a native audio-visual generation model designed to create video and sound together rather than treating audio as a separate post-processing step. ByteDance describes Seedance 1.5 Pro as a joint audio-video model that follows complex instructions accurately and supports both text-prompted and image-driven generation.

In practice, this means the model is aimed at creators and developers who want short-form generated videos with synchronized dialogue, ambient sound, effects, and motion coherence in one workflow. ByteDance’s technical materials describe it as a foundation model for native joint audio-video generation built with a dual-branch Diffusion Transformer architecture and cross-modal modeling for tighter synchronization between what appears on screen and what is heard.

The model is also positioned within the broader Doubao/Seed ecosystem as ByteDance’s video generation offering, with Volcengine listing Doubao-Seedance-1.5-pro among its production models and highlighting “audio and image generated together” as a core capability.

Main features of `doubao-seedance-1-5-pro-251215`

Native audio-video generation: The model generates video and audio jointly, which helps produce more natural timing between speech, sound effects, and visual action than pipelines that stitch audio on afterward.
Strong prompt adherence: ByteDance explicitly presents Seedance 1.5 Pro as a model that can accurately follow complex instructions, making it suitable for structured creative prompts and scenario-controlled outputs.
Text-to-video and image-driven workflows: The model supports synthesizing audio-video content from text prompts and also generating from image-driven inputs, giving developers flexibility for ideation, storyboarding, and first-frame-based creation.
Voices, spatial sound, and effects: Provider materials highlight diverse voices and spatial sound effects coordinated with visuals, which is useful for richer storytelling and more immersive clips.
Multilingual and dialect support: Seedance 1.5 Pro is described as supporting a wide range of languages and dialects with strong lip-sync and motion alignment, which can matter for global-facing media applications.
Research-backed synchronization design: The published technical report emphasizes cross-modal joint modeling and a specialized training pipeline to improve audio-visual synchronization and generation quality.
Production ecosystem availability: The model appears in official ByteDance/Volcengine model listings and is exposed through CometAPI as doubao-seedance-1-5-pro-251215, making it practical for API-based integration into apps and media workflows.

How to access and integrate `doubao-seedance-1-5-pro-251215`

Sign up on CometAPI and create an API key from your dashboard. After you have an active key, store it securely as an environment variable so your application can authenticate requests to the API.

Step 2: Send Requests to `doubao-seedance-1-5-pro-251215` API

Use CometAPI's ByteDance/Volcengine-compatible endpoint at POST /volc/v3/contents/generations/tasks.

curl https://api.cometapi.com/volc/v3/contents/generations/tasks \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $COMETAPI_API_KEY" \
  -d '{
    "model": "doubao-seedance-1-5-pro-251215",
    "content": [
      {
        "type": "text",
        "text": "A paper boat drifts on a quiet lake at sunrise. --duration 5 --ratio 16:9"
      }
    ]
  }'

Step 3: Retrieve and Verify Results

The API returns a task ID. Poll GET /volc/v3/contents/generations/tasks/{task_id} to check generation status and retrieve the output video URL when the task completes.

Item	Official Price	⚡️ Our Price (20% OFF)
Video Generation (with Audio)	$2.40	$1.92
Video Generation (Silent)	$1.20	$0.96
Batch Video Generation (with Audio)	$1.20	$0.96
Batch Video Generation (Silent)	$0.60	$0.48