Claude Fable 5 is now on CometAPI, with state-of-the-art performance in coding, agents, and scientific research.

Models

Claude Fable 5

Input:$8/M
Output:$40/M
Anthropic's most capable, widely released model, for the most demanding reasoning and long-horizon agentic work
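As a rough illustration of how the listed rates translate into a bill, the sketch below assumes that "/M" means "per million tokens" and that input and output tokens are billed linearly with no minimums or discounts. The function name and the token counts are hypothetical; only the $8 and $40 rates come from the listing.

```python
def token_cost_usd(input_tokens, output_tokens, input_per_m, output_per_m):
    """Estimate the cost in USD of one request priced per million tokens."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


# Listed Claude Fable 5 rates: $8/M input, $40/M output.
# The token counts below are made-up example values.
cost = token_cost_usd(200_000, 50_000, input_per_m=8.0, output_per_m=40.0)
print(f"Estimated cost: ${cost:.2f}")
```

The same function can be reused for any other token-priced entry on this page by swapping in its Input and Output rates.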
GPT Image 2

Input:$4/M
Output:$24/M
GPT Image 2 is OpenAI's state-of-the-art image generation model for fast, high-quality image generation and editing. It supports flexible image sizes and high-fidelity image inputs.
Doubao-Seedance-2-0

Per Second:$0.063
Seedance 2.0 is ByteDance’s next-generation multimodal video foundation model focused on cinematic, multi-shot narrative video generation. Unlike single-shot text-to-video demos, Seedance 2.0 emphasizes reference-based control (images, short clips, audio), coherent character/style consistency across shots, and native audio/video synchronization — aiming to make AI video useful for professional creative and previsualization workflows.
Happy Horse 1.0

Per Second:$0.112
Happy Horse 1.0 is a high-quality audio-video generation model that supports text-to-video and image-to-video creation. It can generate synchronized visuals, audio, and lip movements, making it suitable for short films, advertising creatives, and product showcases.
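For the video models priced per second, a quick comparison can be sketched as follows. This assumes billing is linear in the length of the generated clip, which the listing does not confirm; the dictionary and function names are hypothetical, and the 10-second clip length is an arbitrary example.

```python
# Per-second rates as listed on this page (USD).
PER_SECOND_USD = {
    "Doubao-Seedance-2-0": 0.063,
    "Happy Horse 1.0": 0.112,
}


def clip_cost_usd(model, seconds):
    """Estimate the cost of one clip, assuming simple per-second billing."""
    return round(PER_SECOND_USD[model] * seconds, 3)


# Compare the estimated cost of a 10-second clip on each model.
for model in PER_SECOND_USD:
    print(f"{model}: ${clip_cost_usd(model, 10):.2f}")
```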
Claude Opus 4.8

Input:$4/M
Output:$20/M
Claude Opus 4.8 is a premium AI model designed for advanced reasoning, deep analysis, and high-quality content generation. It excels at handling complex instructions, long-context understanding, and sophisticated problem-solving across professional and technical domains.
Gemini 3.5 Flash

Input:$1.2/M
Output:$7.2/M
Gemini 3.5 Flash is a high-speed AI model designed for fast response and efficient coding performance. It delivers significantly improved generation speed while maintaining strong reasoning ability, making it suitable for real-time applications and developer workflows.
Gemini 3.1 Pro

Input:$1.6/M
Output:$9.6/M
Gemini 3.1 Pro is the next generation in the Gemini series of models, a suite of highly capable, natively multimodal reasoning models. Gemini 3.1 Pro is now Google’s most advanced model for complex tasks, and it can comprehend vast datasets and challenging problems drawn from different information sources, including text, audio, images, video, and entire code repositories.
Kimi K2.7 Code

Input:$0.76/M
Output:$3.2/M
Kimi K2.7 Code is Kimi's most intelligent coding model to date, reliably following instructions in long contexts and completing programming tasks with a higher success rate. It supports text, image, and video input, and it supports only thinking mode, dialogue, and agent tasks.
Claude Mythos 5

Coming soon
Input:$8/M
Output:$40/M
Anthropic's most capable, widely released model, for the most demanding reasoning and long-horizon agentic work
Claude Opus 4.7

Input:$4/M
Output:$20/M
Claude Opus 4.7 is a hybrid reasoning model designed specifically for frontier-level coding, AI agents, and complex multi-step professional work. Unlike lighter models (e.g., Sonnet or Haiku variants), Opus 4.7 prioritizes depth, consistency, and autonomy on the hardest tasks.
MiniMax-M3

Input:$0.48/M
Output:$1.92/M
MiniMax-M3 is a multimodal AI model designed for strong reasoning, natural conversation, and creative content generation. It provides balanced performance across text and visual understanding tasks, making it suitable for general-purpose AI applications.
Grok 4.3

Input:$1/M
Output:$2/M
Grok 4.3 is a general-purpose AI model designed for strong reasoning, real-time information processing, and conversational intelligence. It delivers improved accuracy and responsiveness, making it suitable for coding, analysis, and everyday productivity tasks.
GPT 5.5 Pro

Input:$24/M
Output:$144/M
GPT-5.5 Pro combines state-of-the-art intelligence, precision, and efficiency to tackle sophisticated challenges. From software development and data analysis to research and decision support, it delivers expert-level assistance with speed and consistency.
GPT 5.5

Input:$4/M
Output:$24/M
GPT 5.5 is a next-generation AI model designed for stronger reasoning, faster responses, and improved accuracy across a wide range of tasks. It excels at understanding complex instructions, generating high-quality content, and assisting with coding, analysis, and problem-solving.
DeepSeek V4 Pro

Input:$0.416/M
Output:$0.832/M
DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding, and long-horizon agent workflows, with strong performance across knowledge, math, and software engineering benchmarks.
DeepSeek V4 Flash

Input:$0.12/M
Output:$0.24/M
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and high-throughput workloads, while maintaining strong reasoning and coding performance.
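The parameter counts quoted for the two DeepSeek V4 models imply that only a small share of each model's weights is activated per token: roughly 3.1% for V4 Pro and 4.6% for V4 Flash. The sketch below only reproduces that arithmetic from the figures in the descriptions; the names are hypothetical and nothing here reflects how the models are actually served.

```python
# (total parameters, activated parameters) as quoted in the descriptions.
MOE_MODELS = {
    "DeepSeek V4 Pro": (1.6e12, 49e9),
    "DeepSeek V4 Flash": (284e9, 13e9),
}


def active_fraction(total_params, active_params):
    """Share of a Mixture-of-Experts model's parameters activated per token."""
    return active_params / total_params


for name, (total, active) in MOE_MODELS.items():
    print(f"{name}: {active_fraction(total, active):.1%} of parameters activated")
```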
MiniMax-M2.7

Input:$0.24/M
Output:$0.96/M
MiniMax-M2.7 offers the same top-tier intelligence as the standard version—including recursive self-evolution and expert-level office productivity—but is designed for applications requiring sub-second latency and high-speed token generation. Leveraging an enhanced inference backbone architecture, its output speed is 66% faster than the standard model (reaching 100 tps). It is the preferred choice for interactive programming assistants, real-time agent loop execution, and high-throughput enterprise pipelines with stringent completion time requirements.
GPT-5.4 nano

Context:400,000
Input:$0.16/M
Output:$1/M
GPT-5.4 Nano is an ultra-lightweight AI model built for maximum speed and efficiency. It is optimized for simple tasks, real-time interactions, and large-scale deployments where low latency and minimal resource consumption are essential.
GPT-5.4 mini

Context:400,000
Input:$0.6/M
Output:$3.6/M
GPT-5.4 Mini is a lightweight and efficient AI model optimized for speed and everyday productivity. It provides reliable conversational capabilities, content generation, and task assistance while maintaining low latency and resource usage.
GPT-5.4 pro

Context:1,050,000
Input:$24/M
Output:$144/M
GPT-5.4 Pro is a high-performance AI model designed for professional and business applications. It offers strong reasoning, reliable accuracy, and efficient execution across tasks such as content creation, coding, research, and data analysis.
Nano Banana 2

Input:$0.4/M
Output:$2.4/M
Core capabilities overview:
Resolution: Up to 4K (4096×4096), on par with Pro.
Reference Image Consistency: Up to 14 reference images (10 objects + 4 characters), maintaining style/character consistency.
Extreme Aspect Ratios: New 1:4, 4:1, 1:8, 8:1 ratios added, suitable for long images, posters, and banners.
Text Rendering: Advanced text generation, suitable for infographics and marketing poster layouts.
Search Enhancement: Integrated Google Search + Image Search.
Grounding: Built-in thinking process; complex prompts are reasoned before generation.
Claude Sonnet 4.6

Input:$2.4/M
Output:$12/M
Claude Sonnet 4.6 is Anthropic's most capable Sonnet model yet. It’s a full upgrade of the model’s skills across coding, computer use, long-context reasoning, agent planning, knowledge work, and design. Sonnet 4.6 also features a 1M token context window in beta.
Qwen3.7 Plus

Input:$0.32/M
Output:$1.28/M
Qwen3.7 Plus is a high-performance large language model developed by Alibaba Cloud. It supports long-context understanding up to 128K tokens, function calling, and multilingual tasks. It is designed for complex reasoning, coding, and instruction-following scenarios.
Gemini omni fast

Per Request:$0.4
Gemini Omni Fast is a lightweight multimodal video generation model designed for fast and flexible content creation. It enables efficient video generation with support for multiple input types, making it suitable for interactive and iterative workflows.
GPT 5.6

Coming soon
Input:$60/M
Output:$480/M
coming soon
Qwen3.7-Max

Input:$1.36/M
Output:$4.08/M
Qwen3.7-Max's core strength lies in the breadth and depth of its agentic capabilities. In coding, it handles everything from front-end prototyping to complex multi-file engineering projects. For office and productivity work, it enables workflow automation through MCP integration and multi-agent collaboration. In long-horizon autonomous execution, it maintained coherent reasoning throughout a 35-hour, fully autonomous kernel optimization experiment involving over 1,000 tool calls — convincingly demonstrating its sustained, stable execution. Furthermore, it delivers consistently strong cross-framework generalization, performing reliably whether deployed in Claude Code, OpenClaw, Qwen Code, or other frameworks.
GPT Image 2 ALL

Per Request:$0.04
GPT Image 2 ALL is a comprehensive image generation model designed to handle a wide range of creative and professional visual tasks. It combines high-quality image creation, advanced prompt understanding, and versatile style support to deliver exceptional results across diverse use cases.
GPT 5.5 ALL

Input:$2.4/M
Output:$14.4/M
GPT-5.5 excels in code writing, online research, data analysis, and cross-tool operations. The model handles complex multi-step tasks more autonomously and significantly improves reasoning capabilities and execution efficiency while maintaining the same latency as its predecessor, marking an important step toward AI-driven office automation.
Grok 4.20

Context:2,000,000
Input:$1/M
Output:$2/M
The Grok 4.20 release introduces a multi-agent architecture (multiple specialized agents coordinated in real time), expanded context modes, and focused improvements to instruction-following, hallucination reduction, and structured/tooled outputs.
Wan2.7

Per Second:$0.08
Wan2.7 is a video generation model designed for high-quality visual synthesis and improved motion consistency. It is suitable for cinematic content creation and professional video production workflows.