Claude Fable 5 is now on CometAPI — state-of-the-art performance in coding, agents, and scientific research. Try it now
Q

Wan2.7

Per Second:$0.08
Wan2.7 is a video generation model designed for high-quality visual synthesis and improved motion consistency. It is suitable for cinematic content creation and professional video production workflows.
New
Commercial Use

Technical Specifications of Wan 2.7

ItemWan 2.7 (Video Suite)
ProviderAlibaba Tongyi Lab
Model familyWan 2.7 Video Suite
Architecture27B parameter Mixture-of-Experts (MoE)
Input typesText, images, videos, audio references
Output typesGenerated / edited video clips with optional audio
Supported modesText-to-video (T2V), Image-to-video (I2V), Reference-to-video (R2V), Video Editing
Resolution720P and 1080P outputs
Video duration2–15 seconds
Audio supportNative audio generation, voice references, lip-sync workflows
Reference capabilityMulti-reference images/videos, identity consistency
Character consistencySupports up to multiple reference subjects depending on workflow
Release generationMajor successor to Wan 2.6

What is Wan 2.7?

Wan 2.7 is Alibaba’s flagship multimodal video generation suite built for controllable AI filmmaking workflows rather than simple prompt-to-video creation. The model family combines generation, editing, continuation, and reference-driven consistency into one system, enabling creators to build short cinematic clips with stronger subject preservation and scene control.

Unlike earlier video generators that focused mostly on prompt quality, Wan 2.7 emphasizes controllability through frame anchoring, reference inputs, audio synchronization, and structured multi-shot workflows.

Main Features of Wan 2.7

  • Thinking Mode planning pipeline: The model plans scene composition and motion before rendering, improving prompt adherence and reducing coherence failures.
  • First-frame and last-frame control: Users can define opening and ending frames so the system interpolates motion between them.
  • Reference-driven identity consistency: Maintain character appearance, clothing, objects, and style across multiple shots.
  • Native multimodal workflows: Supports text, image, audio, and video references inside the same workflow.
  • Integrated audio generation: Background music, environmental sounds, and voice synchronization can be generated alongside visuals.
  • Editing and continuation support: Existing videos can be extended, transformed, or re-styled without rebuilding from scratch.

Benchmark Performance of Wan 2.7

Public benchmark disclosure for Wan 2.7 remains limited compared with text LLMs, but third-party evaluations and community testing indicate notable improvements in motion stability, prompt adherence, and controllability over Wan 2.6.

Reported ecosystem observations include:

  • Stronger motion continuity versus earlier Wan releases.
  • Higher leaderboard placement in third-party text-to-video evaluations.
  • Improved multi-subject consistency and reference preservation.
  • Better audio integration than many earlier open video models.

Formal benchmark transparency is still limited, so performance claims should be interpreted cautiously.

Wan 2.7 vs Other Video Models

FeatureWan 2.7Veo 3.1Seedance 2.0
Native audio workflowsStrongStrongModerate
Reference-driven consistencyStrongModerateModerate
First + Last Frame controlYesPartialLimited
Video editing workflowsYesYesLimited
Max common resolution1080PHigher-end cinematic output1080P
Multi-reference supportStrong emphasisModerateModerate

Limitations of Wan 2.7

  • Short clip duration compared with long-form production tools.
  • 1080P maximum output limits ultra-high-resolution workflows.
  • Fast motion scenes may still produce instability artifacts.
  • Multi-reference workflows increase complexity and prompt engineering requirements.
  • Public benchmark reporting remains relatively sparse.

Representative Use Cases

  1. Character-consistent short films and storyboards.
  2. Marketing clips with audio synchronization.
  3. Social media video generation.
  4. Product visualization and concept trailers.
  5. Video continuation and scene interpolation workflows.
  6. Reference-based avatar and character animation.

How to Use the WAN 2.7 Video API in CometAPI

Step 1: Try the WAN 2.7 Video API in the Kie Al Playground

First, test the WAN 2.7 functionality using the WAN 2.7 Video API in the CometAPI Playground. Upload images, add prompts, or use references to preview the generated WAN video before integrating the WAN 2.7 AI video into your production workflow.

Step 2: Obtain the WAN 2.7 API key and review the API documentation

Obtain the WAN 2.7 API key from the CometAPI console and review the documentation. Understand the WAN 2.7 Video API endpoints, authentication, and parameters to support text-to-video, image-to-video, and WAN video workflows.

Step 3: Generate WAN 2.7 AI videos and integrate them into your workflow

Use the WAN 2.7 Video API to generate WAN 2.7 AI videos with prompts, images, or references. Integrate WAN 2.7 outputs into product workflows, content pipelines, or AI video tools to enable scalable video creation.

FAQ