Can Wan 2.6 API generate videos from text, images, and references?

Yes. Wan 2.6 supports text-to-video, image-to-video, and reference-to-video workflows within the same model family.

How long can Wan 2.6 video generations be?

Wan 2.6 generally supports clips between 2 and 15 seconds depending on mode.

Does Wan 2.6 API support native audio generation and lip sync?

Yes. Native audio generation, voice references, and synchronized lip-sync workflows are major features.

When should I use Wan 2.6 instead of Wan 2.7?

Choose Wan 2.6 for established multimodal workflows and Wan 2.7 for stronger controllability and planning.

Can Wan 2.6 maintain character consistency across scenes?

Yes. Reference workflows preserve appearance and continuity more reliably than earlier versions.

Is Wan 2.6 suitable for cinematic multi-shot storytelling?

Yes. Multi-shot generation supports narrative workflows for ads and short-form content.

What are the biggest limitations of Wan 2.6 video generation?

Main limitations include short durations, sparse benchmark reporting, and occasional motion instability.

Technical Specifications of Wan 2.6

Item	Wan 2.6 Video Suite
Provider	Alibaba / Tongyi Lab
Model family	Wan 2.6
Release timeframe	December 2025 generation
Input types	Text, images, reference videos, audio inputs
Output type	Video with optional synchronized audio
Core modes	Text-to-Video (T2V), Image-to-Video (I2V), Reference-to-Video (R2V)
Flash variants	I2V Flash, R2V Flash
Resolution support	720P and 1080P
Duration support	2–15 seconds (workflow dependent)
Audio capabilities	Native audio generation, voice references, lip sync
Multi-shot support	2–8 scene segments in a single workflow
Reference support	Up to 5 references (mixed image/video depending on workflow)
API workflow	Async task creation + polling
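
Because generation is asynchronous and billed, it can help to check a request against the limits in the table above before submitting it. The following is a minimal sketch, not part of any SDK: `validate_request` and its constants are hypothetical names, and the limits are copied from the table, which itself notes that they depend on the workflow.

```python
# Limits copied from the specification table above; treat them as assumptions,
# since the table notes that they vary by workflow.
ALLOWED_RESOLUTIONS = {"720P", "1080P"}
MIN_SECONDS, MAX_SECONDS = 2, 15
MIN_SHOTS, MAX_SHOTS = 2, 8
MAX_REFERENCES = 5


def validate_request(resolution, seconds, shots=None, references=0):
    """Return a list of problems; an empty list means the request looks valid."""
    problems = []
    if resolution.upper() not in ALLOWED_RESOLUTIONS:
        problems.append(f"unsupported resolution: {resolution}")
    if not MIN_SECONDS <= seconds <= MAX_SECONDS:
        problems.append(f"duration {seconds}s is outside {MIN_SECONDS}-{MAX_SECONDS}s")
    if shots is not None and not MIN_SHOTS <= shots <= MAX_SHOTS:
        problems.append(f"{shots} shots is outside {MIN_SHOTS}-{MAX_SHOTS}")
    if references > MAX_REFERENCES:
        problems.append(f"{references} references exceeds the limit of {MAX_REFERENCES}")
    return problems


print(validate_request("720p", 5))          # no problems reported
print(validate_request("4K", 20, shots=9))  # resolution, duration, and shot count flagged
```

Returning a list of problems rather than raising on the first one lets a caller show every issue at once.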

What is Wan 2.6?

Wan 2.6 is Alibaba’s multimodal video generation system focused on controllable short-form production. Rather than being purely prompt-driven, the model combines text prompts, image references, reference videos, audio conditioning, and scene chaining for creator workflows. The major upgrade over prior Wan releases was the introduction of stronger reference-driven consistency and longer narrative generation.

Main Features of Wan 2.6

Reference-to-video workflows: Users can feed image or video references to maintain character identity, style, and voice continuity across generations.
Multi-shot narrative generation: Supports chaining multiple prompts together for scene transitions and story progression in a single generation workflow.
Native audio synchronization: Built-in support for generated audio, custom audio uploads, and lip synchronization workflows.
Flexible input modes: Supports prompt-only generation, first-frame animation, and reference-driven workflows.
Flash variants for iteration: Faster versions enable rapid testing before final high-quality renders.
Longer clips: Extended clip duration compared with earlier generations, supporting narrative content creation.
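
For multi-shot generation, the sample prompt on this page labels each shot with a time range, such as `Shot 1 [0-2s]: ...`. The sketch below builds a prompt in that style from a list of shot durations and descriptions. `build_multishot_prompt` is a hypothetical helper, and the labeling convention is taken only from the sample prompt; this page does not state that the model requires it.

```python
def build_multishot_prompt(premise, shots):
    """Join a premise and timed shots into one prompt string.

    shots is a list of (duration_seconds, description) pairs.
    Returns the prompt and the total duration in seconds.
    """
    parts = [premise.strip()]
    start = 0
    for index, (duration, description) in enumerate(shots, start=1):
        if duration <= 0:
            raise ValueError("each shot needs a positive duration")
        end = start + duration
        # Label format mirrors the sample prompt: "Shot N [start-ends]: text"
        parts.append(f"Shot {index} [{start}-{end}s]: {description.strip()}")
        start = end
    return " ".join(parts), start


prompt, total_seconds = build_multishot_prompt(
    "Create a cinematic multi-shot chase across a moonlit desert market.",
    [
        (2, "a wide establishing view of lanterns and dust in the air."),
        (2, "a small brass robot darts between fabric stalls."),
        (1, "close-up on the robot finding a glowing compass."),
    ],
)
print(total_seconds)
print(prompt)
```

The returned total can be passed as the `seconds` value of the request so that the prompt's time ranges and the requested clip length stay in agreement.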

Benchmark Performance of Wan 2.6

Formal benchmark transparency for Wan 2.6 remains limited; Alibaba has published fewer standardized benchmark numbers than text LLM providers. Most evaluation comes from workflow testing and ecosystem comparisons rather than public leaderboards. Community testing consistently highlights:

Improved character consistency versus older Wan releases.
Better audio-video synchronization.
Stronger multi-shot continuity.
More reliable reference conditioning.

Because benchmark publication is sparse, production testing remains important before deployment.

Wan 2.6 vs Other Video Models

Feature	Wan 2.6	Wan 2.7	Veo-family models
Native audio generation	Strong	Stronger	Strong
Multi-shot workflow	Yes	Improved	Moderate
Reference-to-video	Strong emphasis	Stronger controls	Moderate
Clip duration	Up to 15s	Similar / workflow dependent	Varies
Multi-reference support	Up to 5 refs	Expanded workflows	Moderate
Editing workflows	Moderate	Better editing support	Strong

Limitations of Wan 2.6

Short clip duration still limits long-form production.
High-motion scenes may still show temporal instability.
Reference-heavy workflows increase setup complexity.
Public benchmark reporting remains limited.
Async generation pipelines increase integration complexity.
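
One way to contain the async integration complexity is to isolate polling in a single function with a timeout. The sketch below assumes the status values used in this page's sample code (`FAILURE`, `failed`, `error`, `completed`, `success`, and a `progress` field); `wait_for_task` and its parameters are hypothetical, and `fetch_status` stands in for whatever call returns the task's JSON as a dict.

```python
import time

# Status values taken from the sample code on this page; other values are
# treated as "still running".
FAILED_STATUSES = {"FAILURE", "failed", "error"}
DONE_STATUSES = {"completed", "success"}


def wait_for_task(fetch_status, timeout_s=600.0, interval_s=10.0,
                  sleep=time.sleep, clock=time.monotonic):
    """Poll fetch_status() until the task finishes, fails, or times out."""
    deadline = clock() + timeout_s
    while True:
        payload = fetch_status()
        # Some responses nest the task under "data"; fall back to the top level.
        data = payload.get("data") or payload
        status = data.get("status", "unknown")
        progress = data.get("progress", "0%")
        if status in FAILED_STATUSES:
            raise RuntimeError(f"video generation failed: {payload}")
        if status in DONE_STATUSES or progress in ("100%", 100):
            return data
        if clock() >= deadline:
            raise TimeoutError(f"task still '{status}' after {timeout_s}s")
        sleep(interval_s)
```

Injecting `sleep` and `clock` keeps the function testable without real delays. In production the defaults apply, and `fetch_status` would wrap the HTTP GET on the task's status URL.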

Representative Use Cases

Character-consistent marketing videos.
Multi-scene social media clips.
Creator avatar animation.
Reference-driven product videos.
AI storytelling with synchronized audio.
Brand content requiring identity preservation.

Pricing for Wan2.6

Wan2.6 is billed by usage: you pay per second of generated video, and the rate depends on the output resolution.

Wan Video Generation Pricing

Pricing (Per Second)

Model	720p	1080p
`wan2.6`	$0.08	$0.12
`wan2.7`	$0.08	$0.12

💡 Billed per second. Total cost = price per second × video duration (seconds).
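
As a worked example of the billing rule above, the sketch below multiplies the table's per-second rate by the clip duration. `estimate_cost` and `PRICE_PER_SECOND` are illustrative names, not part of any SDK, and the rates are copied from the table and may change.

```python
from decimal import Decimal

# Per-second rates in USD, copied from the pricing table above (assumed current).
PRICE_PER_SECOND = {
    ("wan2.6", "720p"): Decimal("0.08"),
    ("wan2.6", "1080p"): Decimal("0.12"),
    ("wan2.7", "720p"): Decimal("0.08"),
    ("wan2.7", "1080p"): Decimal("0.12"),
}


def estimate_cost(model, resolution, seconds):
    """Return price per second multiplied by the clip duration in seconds."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return PRICE_PER_SECOND[(model, resolution)] * seconds


print(estimate_cost("wan2.6", "720p", 5))    # 0.40
print(estimate_cost("wan2.6", "1080p", 15))  # 1.80
```

`Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.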

Sample code and API for Wan2.6

The examples below show the full request flow for Wan2.6 in Python, JavaScript, and curl: submit a generation task, poll until it finishes, then download the resulting video.

Python Code Example

# Create a video with wan2.6 using raw HTTP requests
import os
import time
import requests

api_key = os.environ.get("COMETAPI_KEY")
base_url = "https://api.cometapi.com/v1"
headers = {"Authorization": f"Bearer {api_key}"}

# Step 1: Submit the video generation request
print("Submitting video generation request...")
response = requests.post(
    f"{base_url}/videos",
    headers=headers,
    files={
        "model": (None, "wan2.6"),
        "prompt": (None, "Create a cinematic multi-shot chase across a moonlit desert market. Shot 1 [0-2s]: a wide establishing view of lanterns and dust in the air. Shot 2 [2-4s]: a small brass robot darts between fabric stalls. Shot 3 [4-5s]: close-up on the robot finding a glowing compass."),
        "seconds": (None, "5"),
        "size": (None, "1280x720"),
    },
)

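response.raise_for_status()  # Stop early on HTTP errors (bad key, invalid parameters)
result = response.json()
print(f"Response: {result}")

video_id = result.get("id") or result.get("task_id")
if not video_id:
    # Without an ID the polling loop below would never finish
    raise SystemExit("No video ID in response; cannot poll for progress")
print(f"Video ID: {video_id}")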

# Step 2: Poll for progress until 100%
print("\nChecking video generation progress...")
while True:
    try:
        status_response = requests.get(f"{base_url}/videos/{video_id}", headers=headers)
        status_result = status_response.json()

        data = status_result.get("data") or status_result
        progress = data.get("progress", "0%")
        status = data.get("status", "unknown")

        print(f"Progress: {progress}, Status: {status}")

        if status in ["FAILURE", "failed", "error"]:
            print("Video generation failed!")
            print(status_result)
            exit(1)

        if progress == "100%" or progress == 100 or status in ["completed", "success"]:
            print("Video generation completed!")
            break
    except Exception as e:
        print(f"Temporary error: {e}, retrying...")

    time.sleep(10)

# Step 3: Download the video to output directory
print(f"\nDownloading video to ./output/{video_id}.mp4...")
os.makedirs("./output", exist_ok=True)

video_response = requests.get(f"{base_url}/videos/{video_id}/content", headers=headers)
video_response.raise_for_status()  # Avoid saving an error body as an .mp4 file

output_path = f"./output/{video_id}.mp4"
with open(output_path, "wb") as f:
    f.write(video_response.content)

if os.path.exists(output_path):
    file_size = os.path.getsize(output_path)
    print(f"Video saved to {output_path}")
    print(f"File size: {file_size} bytes")
else:
    print("Failed to download video")
    exit(1)

JavaScript Code Example

// Create a video with wan2.6 using raw HTTP requests
import fs from "fs";
import path from "path";

const apiKey = process.env.COMETAPI_KEY;
const baseUrl = "https://api.cometapi.com/v1";
const headers = { Authorization: `Bearer ${apiKey}` };

function sleep(ms) {
  return new Promise((resolve) => setTimeout(resolve, ms));
}

// Step 1: Submit the video generation request
console.log("Submitting video generation request...");
const formData = new FormData();
formData.append("model", "wan2.6");
formData.append("prompt", "Create a cinematic multi-shot chase across a moonlit desert market. Shot 1 [0-2s]: a wide establishing view of lanterns and dust in the air. Shot 2 [2-4s]: a small brass robot darts between fabric stalls. Shot 3 [4-5s]: close-up on the robot finding a glowing compass.");
formData.append("seconds", "5");
formData.append("size", "1280x720");

const submitResponse = await fetch(`${baseUrl}/videos`, {
  method: "POST",
  headers,
  body: formData,
});

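if (!submitResponse.ok) {
  // Stop early on HTTP errors (bad key, invalid parameters)
  console.log(`Request failed with HTTP ${submitResponse.status}`);
  process.exit(1);
}

const result = await submitResponse.json();
console.log("Response:", JSON.stringify(result, null, 2));

const videoId = result.id || result.task_id;
if (!videoId) {
  // Without an ID the polling loop below would never finish
  console.log("No video ID in response; cannot poll for progress");
  process.exit(1);
}
console.log("Video ID:", videoId);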

// Step 2: Poll for progress until 100%
console.log("\nChecking video generation progress...");
while (true) {
  try {
    const statusResponse = await fetch(`${baseUrl}/videos/${videoId}`, { headers });
    const statusResult = await statusResponse.json();
    const data = statusResult.data || statusResult;
    const progress = data.progress || "0%";
    const status = data.status || "unknown";

    console.log(`Progress: ${progress}, Status: ${status}`);

    if (status === "FAILURE" || status === "failed" || status === "error") {
      console.log("Video generation failed!");
      console.log(JSON.stringify(statusResult, null, 2));
      process.exit(1);
    }

    if (progress === "100%" || progress === 100 || status === "completed" || status === "success") {
      console.log("Video generation completed!");
      break;
    }
  } catch (e) {
    console.log(`Temporary error: ${e.message}, retrying...`);
  }

  await sleep(10000);
}

// Step 3: Download the video to output directory
console.log(`\nDownloading video to ./output/${videoId}.mp4...`);
fs.mkdirSync("./output", { recursive: true });

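const videoResponse = await fetch(`${baseUrl}/videos/${videoId}/content`, { headers });
if (!videoResponse.ok) {
  // Avoid saving an error body as an .mp4 file
  console.log(`Download failed with HTTP ${videoResponse.status}`);
  process.exit(1);
}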
const outputPath = path.join("./output", `${videoId}.mp4`);
fs.writeFileSync(outputPath, Buffer.from(await videoResponse.arrayBuffer()));

if (fs.existsSync(outputPath)) {
  const stats = fs.statSync(outputPath);
  console.log(`Video saved to ${outputPath}`);
  console.log(`File size: ${stats.size} bytes`);
} else {
  console.log("Failed to download video");
  process.exit(1);
}

Curl Code Example

# Create a video with wan2.6
# Step 1: Submit the video generation request
echo "Submitting video generation request..."
response=$(curl -s https://api.cometapi.com/v1/videos \
  -H "Authorization: Bearer $COMETAPI_KEY" \
  -F "model=wan2.6" \
  -F "prompt=Create a cinematic multi-shot chase across a moonlit desert market. Shot 1 [0-2s]: a wide establishing view of lanterns and dust in the air. Shot 2 [2-4s]: a small brass robot darts between fabric stalls. Shot 3 [4-5s]: close-up on the robot finding a glowing compass." \
  -F "seconds=5" \
  -F "size=1280x720")

echo "Response: $response"

# Extract video_id from response (handle JSON with spaces like "id": "xxx")
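video_id=$(echo "$response" | tr -d '\n' | sed -n 's/.*"id"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p')
if [ -z "$video_id" ]; then
  # sed -n ... p prints nothing when no "id" field is found
  echo "No video ID in response; cannot poll for progress"
  exit 1
fi
echo "Video ID: $video_id"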

# Step 2: Poll for progress until 100%
echo ""
echo "Checking video generation progress..."
while true; do
  status_response=$(curl -s "https://api.cometapi.com/v1/videos/$video_id" \
    -H "Authorization: Bearer $COMETAPI_KEY")
  
  progress=$(echo "$status_response" | grep -o '"progress"[[:space:]]*:[[:space:]]*"\?[^",}]*"\?' | head -1 | sed 's/.*:[[:space:]]*"\?//;s/"$//')
  status=$(echo "$status_response" | grep -o '"status"[[:space:]]*:[[:space:]]*"[^"]*"' | head -1 | sed 's/.*"status"[[:space:]]*:[[:space:]]*"//;s/"$//')
  
  echo "Progress: $progress, Status: $status"
  
  if [ "$status" = "FAILURE" ] || [ "$status" = "failed" ] || [ "$status" = "error" ]; then
    echo "Video generation failed!"
    exit 1
  fi
  
  if [ "$progress" = "100%" ] || [ "$progress" = "100" ] || [ "$status" = "completed" ] || [ "$status" = "success" ]; then
    echo "Video generation completed!"
    break
  fi
  
  sleep 10
done

# Step 3: Download the video to output directory
echo ""
echo "Downloading video to ./output/$video_id.mp4..."
mkdir -p ./output
curl -s "https://api.cometapi.com/v1/videos/$video_id/content" \
  -H "Authorization: Bearer $COMETAPI_KEY" \
  -o "./output/$video_id.mp4"

if [ -f "./output/$video_id.mp4" ]; then
  echo "Video saved to ./output/$video_id.mp4"
  ls -la "./output/$video_id.mp4"
else
  echo "Failed to download video"
  exit 1
fi

Versions of Wan2.6

Wan2.6 may be offered as multiple snapshots for several reasons. Outputs can change after an update, so older snapshots are kept for consistency. Snapshots also give developers a transition period in which to adapt and migrate. Finally, different snapshots may correspond to global or regional endpoints. For detailed differences between versions, refer to the official documentation.

Version
wan2.6
