CometAPI Update

2026-07-17

CometAPI Adds Kimi K3

Model ID: kimi-k3

kimi-k3 is Kimi’s flagship model, designed for long-range programming and end-to-end knowledge work. It features a 1M token context window and leading-edge comprehensive intelligence.

Developer Documentation Quick Integration Guide (Chat Format)

2026-07-14

DeepSeek, Qwen, and Doubao Model Retirement & Migration Notice

Following upstream lifecycle changes and updates to the CometAPI model catalog, the following models will be retired according to the schedule below.

Deprecated Model	Retirement Date	Recommended Replacement
DeepSeek models outside the V4 series	2026-07-17	deepseek-v4-flash / deepseek-v4-pro
Qwen3.5 series	2026-07-17	qwen3.7-plus / qwen3.7-max / qwen3.6-plus
Qwen3 series	2026-07-17	qwen3.7-plus / qwen3.7-max / qwen3.6-plus
doubao-seedance-1-0-pro	2026-07-17	doubao-seedance-2-0 / doubao-seedance-2-0-fast
doubao-seedream-4-0-250828	2026-07-23	doubao-seedream-5-0-260128 / seedream-5-0-pro-260628
doubao-seed-1-8	2026-09-21	doubao-seed-2-1-turbo-260628 / doubao-seed-2-1-pro-260628
doubao-seedance-1-5-pro	2026-09-21	doubao-seedance-2-0 / doubao-seedance-2-0-fast

Each model will become unavailable after 23:59 on its listed retirement date.

Some routes may experience capacity limits or become unavailable earlier due to upstream lifecycle and capacity adjustments. Please complete migration and production validation in advance.

2026-07-10

CometAPI Adds GPT-5.6 Series

Model IDs:

gpt-5.6
gpt-5.6-sol
gpt-5.6-terra
gpt-5.6-luna

Overview

The GPT-5.6 series is now available on CometAPI with support for both Chat and Responses API formats.

gpt-5.6-sol and gpt-5.6 are flagship models for complex reasoning and coding.

gpt-5.6-terra balances intelligence and cost for general-purpose workloads.

gpt-5.6-luna is designed for cost-sensitive, high-volume use cases.

Developer Documentation

Chat API Guide

Responses API Guide

2026-07-09

grok-4.5 Now Available

Model ID: grok-4.5

CometAPI now supports grok-4.5 in chat format. It is our flagship model for coding, agentic tool use, low-hallucination responses, configurable reasoning, and general-purpose workflows.

Developer Documentation:

https://apidoc.cometapi.com/api/text/chat

Seedream 5.0 Pro Now Available

Model ID: seedream-5-0-pro-260628

CometAPI now supports seedream-5-0-pro-260628 for image generation. Seedream 5.0 Pro is built for high-precision generation scenarios, with improved control over composition, positioning, and visual elements. It supports text-to-image generation, single-image reference generation, and multi-image reference generation.

Developer Documentation:

https://apidoc.cometapi.com/api/image/seededit-seedream/bytedance-image-generation

Azure OpenAI GPT-4 Series Deprecation & Migration

Following upstream Azure OpenAI model lifecycle updates, the following Azure-backed GPT-4 series models have entered the Deprecated stage and are scheduled for retirement in October 2026.

Impact Time: Starting from 2026-07-08, subject to actual upstream provider availability and change timing.

Deprecated Model	Retirement Date	Recommended Replacement
gpt-4o	2026-10-01	gpt-5.1 / gpt-5-mini
gpt-4o-mini	2026-10-01	gpt-5-mini
gpt-4.1	2026-10-14	gpt-5.1 / gpt-5-mini
gpt-4.1-mini	2026-10-14	gpt-5-mini
gpt-4.1-nano	2026-10-14	gpt-5-mini

Some legacy Azure OpenAI routes may become capacity-limited or unavailable before the official retirement date. Please migrate production traffic in advance to avoid service interruption.

2026-07-02

Claude Fable 5 Access Restored

Model ID: claude-fable-5

Access to claude-fable-5 has been restored. It brings 5th-generation intelligence to ambitious coding and professional workflows.

Developer Documentation

Anthropic Messages API Guide

2026-07-01

CometAPI Adds Gemini 3.1 Flash Lite Image and Claude Sonnet 5

Model IDs: gemini-3.1-flash-lite-image
claude-sonnet-5 gemini-3.1-flash-lite-image is an efficient image generation model designed for ultra-low latency and cost-effective image generation and editing. claude-sonnet-5 is a high-performance model for coding and agent workflows.

Developer Documentation Gemini Image Generation API Guide Anthropic Messages API Guide

2026-06-26

CometAPI now fully integrates HappyHorse 1.1 and Kling 3.0!

We now integrate HappyHorse 1.1 & Update Kling 3.0 and Kling 3.0 Omni HappyHorse 1.1 (happyhorse-1.1): Significant upgrades in motion smoothness, multi-reference consistency (up to 9 images), prompt adherence for complex scenes, facial realism, and native audio-video synchronization with multilingual lip-sync. Ideal for production-quality short videos, branding, e-commerce, and storytelling. Kling 3.0（kling-3.0 and kling-3.0-omni）: Advanced multimodal capabilities with Omni One architecture, director-level camera control, physics-aware motion, multi-shot storytelling, and native audio for hyper-realistic 1080p/4K cinematic output. Access both via one unified OpenAI-compatible API. Perfect for cinematic shorts, ads, e-commerce, and storytelling. Read our new blog comparing What is HappyHorse 1.1 and HappyHorse 1.1 vs 1.0: Sign up for free credits → cometapi.com Developer Documentation HappyHorse Video API Guide

Kling Text-to-Video API Guide

2026-06-23

CometAPI Adds New Doubao Seed Models

Model IDs:

doubao-seed-2-1-pro-260628
doubao-seed-2-1-turbo-260628
doubao-seed-evolving

Overview

doubao-seed-2-1-pro-260628 is a next-generation production-ready model with upgraded coding, agent, and multimodal capabilities for complex enterprise tasks.

doubao-seed-2-1-turbo-260628 balances performance and cost, with upgraded coding, agent, and multimodal capabilities for real-world complex tasks.

doubao-seed-evolving is designed for coding and agent scenarios, delivering weekly capability updates through a unified model ID.

Developer Documentation

Quick Integration Guide (Chat Format)

2026-06-17

CometAPI Adds GLM-5.2

Model ID: glm-5.2

GLM-5.2 supports a 1M context window, 128K max output, streaming tool-call output, forced deep thinking, and improved coding and reasoning capabilities.

Pricing

Model	Input	Output
GLM 5.2	`$1.12/M`	`$3.528/M`
GLM 5.1	`$1.12/M`	`$3.528/M`

Developer Documentation

Quick Integration Guide (Chat Format)

2026-06-13

CometAPI Removes Claude Fable 5

claude-fable-5

claude-fable-5: Due to changes in upstream model provider access policies and compliance requirements, Claude Fable 5 has been removed from CometAPI. Effective immediately, this model is no longer available for API aggregation service calls.

2026-06-12

CometAPI Adds Kimi K2.7 Code

kimi-k2.7-code

kimi-k2.7-code: Kimi's most intelligent coding model to date, reliably follows instructions in long contexts and completes programming tasks with a higher success rate.

Developer Documentation

Quick Integration Guide (Chat Format)

🌟 2026-06-10

🎉 CometAPI Adds Claude Fable 5! 🎉

🔹 claude-fable-5

claude-fable-5:Anthropic's most capable, widely released model, for the most demanding reasoning and long-horizon agentic work

📚 Developer Documentation

🎉 CometAPI Adds Qwen3.7 Plus! 🎉

🔹 qwen3.7-plus

qwen3.7-plus: Alibaba Cloud's high-performance large language model, supporting up to 128K-token long-context understanding, function calling, multilingual tasks, complex reasoning, coding, and instruction-following scenarios.

📚 Developer Documentation

Quick Integration Guide (OpenAI Format)

⚠️ Model Deprecation Notice

Impact Time: 2026-09-08, subject to the actual change time.

To continuously improve service quality and optimize underlying model resources, CometAPI will delist the following Qwen models on 2026-09-08. Please switch to the recommended replacement models in advance.

Deprecated Model	Recommended Replacement
qwen3.6-max-preview	qwen3.7-max
qwen3-max-preview	qwen3.7-max
qwen3-max	qwen3.7-max
qwen3-coder-plus	qwen3.7-plus

🎬 CometAPI: New Video Models Now Support `/v1/videos`

✨ Feature Update

CometAPI now supports several new video generation models, including wan2.6, wan2.7, happyhorse-1.0, viduq3-turbo, and viduq3. These models support the /v1/videos API format. Pricing varies by model and resolution and is billed by generated video duration, in USD / second.

Model Series	Supported Resolutions	Pricing
`wan2.6` / `wan2.7` series	720p, 1080p	Billed per second
`happyhorse-1.0`	720p, 1080p	Billed per second
`viduq3` / `viduq3-turbo` series	360p, 540p, 720p, 1080p	Billed per second

🤖 CometAPI: `minimax-m3` Model Now Available

✨ Model Update

CometAPI now supports the minimax-m3 model.

📚 Developer Documentation

Note: This model only supports the Chat Completions endpoint: /v1/chat/completions
Quick integration guide: https://apidoc.cometapi.com/chat

⚠️ CometAPI: Veo3 Series Supply and Billing Adjustment

✨ Adjustment Details

Due to ongoing resource stability challenges with the veo3 series, Comet will adjust the resource channels and billing rules for the veo3 series.

Previously, the veo3 series was billed per generation. It will now be adjusted to official same-price per-second billing, with the website’s default 20% discount applied.

Model	Resolution	Official Per-second Price
`veo3.1`	720p	$0.4 / second
`veo3.1`	1080p	$0.4 / second
`veo3.1`	4K	$0.6 / second
`veo3.1-fast`	720p	$0.1 / second
`veo3.1-fast`	1080p	$0.12 / second
`veo3.1-fast`	4K	$0.3 / second

For example, when generating a 1080p video with veo3.1, the price after the 20% discount is:

$0.4 × 0.8 = $0.32 / second

For detailed pricing and available groups, please refer to the model details page: https://www.cometapi.com/models/

🌟 2026-05-29

🎉 CometAPI Adds Claude Opus 4.8! 🎉

🔹 claude-opus-4-8

claude-opus-4-8: Most intelligent model for agents and coding

📚 Developer Documentation

🎨 CometAPI: Midjourney V8 Support

CometAPI now supports Midjourney V8. When using the Midjourney image generation API, you can enable V8 by adding the --v 8 parameter directly in the prompt.

For more details about the Midjourney image generation API, please refer to the documentation: https://apidoc.cometapi.com/api/image/midjourney/imagine

⚠️ CometAPI: Model Deprecation Notice

Following official model updates and iterations, Comet will retire the following models. Please switch to the latest available models as soon as possible.

Details page: https://www.cometapi.com/models/

May 29, 2026 — 12:00 PM UTC

Deprecated Models
`glm-4.x` series
`kimi-k2` series
`minimax-m2.1`
`minimax-m2`
`o1-mini`
`gpt-3.5-turbo`
`gemini-3.1-flash-lite-preview`
`gemini-3-pro-preview`
`gemini-2.5-flash-preview-09-25`
`gemini-2.5-flash-lite-preview-09-2025`
`gemini-2.5-flash-image-preview`

June 15, 2026

Deprecated Models
`claude-opus-4`
`claude-sonnet-4-20250514`

July 23, 2026

Deprecated Models
`gpt-5-codex`
`gpt-5.1-codex`
`gpt-5.2-codex`
`gpt-5-chat-latest`
`gpt-5.1-chat-latest`
`gpt-5.2-chat-latest`
`gpt-4o-realtime`
`gpt-realtime-mini`
`gpt-audio-mini`
`gpt-4o-mini-search-preview`

October 23, 2026

Deprecated Models
`o4-mini`
`gpt-4.1-nano`
`o1-pro`
`o3-mini`

🌟 2026-05-22

🎉 CometAPI Adds Qwen3.7-Max! 🎉

🔹 qwen3.7-max

qwen3.7-max: Qwen3.7-Max's core strength lies in the breadth and depth of its agentic capabilities, excelling at tool use, task planning, and complex instruction execution.

📚 Supported Endpoints

Chat Completions (/v1/chat/completions) ｜ Reference Guide

🌟 2026-05-20

🎉 CometAPI Adds Gemini-3.5-Flash! 🎉

🔹 gemini-3.5-flash

gemini-3.5-flash: Google's most intelligent model, built for speed, combining frontier intelligence with outstanding search and factual grounding.

📚 Supported Endpoints

Chat Completions (/v1/chat/completions) ｜ Reference Guide
Gemini Generating Content (/v1beta/models/{model}:{operator}) ｜ Reference Guide

📅 2026-05-06

🚀 🧠 CometAPI: Grok-4.3 Now Available

✨ Change Details

New model: grok-4.3 is now live — excels at autonomous reasoning, knowledge work, and tool use. Ideal for complex agent workflows and deep analysis tasks.

🍡 Recommended Models

Model name: grok-4.3

📚 Developer Documentation

⚠️ Note: This model only supports the Chat Completions endpoint (/v1/chat/completions).
Quick Integration Guide (OpenAI Format)

💰 📉 CometAPI: Grok-4.2 Series Price Reduction

✨ Pricing Changes

The grok-4.2 series pricing has been reduced to match grok-4.3 pricing.
Changes take effect immediately. No action required on your end.

⚠️ 🔄 CometAPI: Grok Legacy Model Deprecation Notice

✨ Change Details

Deprecating soon: The following models will be retired on May 15, 2026 at 12:00 PM Pacific Time (PT). Please refer to the migration table below:

Deprecated Model	Recommended Replacement
`grok-4-1-fast-reasoning`	`grok-4.3`
`grok-4-1-fast-non-reasoning`	`grok-4.20-non-reasoning`
`grok-4-fast-reasoning`	`grok-4.3`
`grok-4-fast-non-reasoning`	`grok-4.20-non-reasoning`
`grok-4-0709`	`grok-4.3`
`grok-code-fast-1`	`grok-4.3`

Migration guide: Reasoning models → grok-4.3; Non-reasoning models → grok-4.20-non-reasoning.

📅 2026-04-29

⚠️ 🔄 CometAPI: DeepSeek-Chat / DeepSeek-Reasoner Deprecation Notice

✨ Change Details

Deprecating soon: Per the official announcement, the deepseek-chat and deepseek-reasoner model families are being phased out and will cease API service.
Migration guide: Please migrate to the new DeepSeek V4 series as soon as possible.
- General / high-throughput workloads → deepseek-v4-flash
- Advanced reasoning / coding / long-horizon agent workflows → deepseek-v4-pro

🍡 Recommended Models

Model names: deepseek-v4-flash, deepseek-v4-pro

📚 Developer Documentation

⚠️ Note: These models only support the Chat Completions endpoint (/v1/chat/completions).
Quick Integration Guide (OpenAI Format)

💰 🎨 CometAPI: Doubao Seedream Image Model Pricing Update

✨ Pricing Changes

Model	Before (List)	After (List)	Discounted
`doubao-seedream-4-5-251128`	$0.04 / image	$0.04 / image (unchanged)	$0.032 / image
`doubao-seedream-4-0-250828`	$0.03 / image	$0.04 / image	$0.032 / image
`doubao-seedream-5-0-260128`	$0.035 / image	$0.04 / image	$0.032 / image

List prices are now unified at $0.04 / image; discounted price unified at $0.032 / image (20% off).
Pricing changes take effect immediately. No action required on your end.

🌟 2026-04-25

🎉 CometAPI Adds GPT-5.5 & GPT-5.5-Pro Models! 🎉

🔹 GPT-5.5

Model: gpt-5.5
Details: A next-generation multimodal flagship model balancing exceptional performance with efficient response, dedicated to providing comprehensive and stable general-purpose AI services.
⚠️ Note: This model supports the standard Chat interaction format.
- 👉 Chat Guide

🔹 GPT-5.5-Pro

Model: gpt-5.5-pro
Details: An advanced model engineered for extremely complex logic and professional demands, representing the highest standard of deep reasoning and precise analytical capabilities.
⚠️ Note: This model supports the Response interaction format only.
- 👉 Response Guide

🌟 2026-04-24

🎉 CometAPI Adds GPT-5.5 & GPT-Image-2 Series! 🎉

🔹 GPT-5.5 Series

Models: gpt-5.5-all / gpt-5.5-medium-all / gpt-5.5-high-all / gpt-5.5-xhigh-all / gpt-5.5-low-all
Details: Designed to cover varying levels of task complexity.
⚠️ Note: All models above support standard Chat and Response interaction formats.
- 👉 Chat Guide ｜ Response Guide

🔹 GPT-Image-2 Series

Model: gpt-image-2-all (Multimodal and image generation model)
Endpoints:
- 💬 Chat Completions (/v1/chat/completions): Supports multimodal inputs (image-to-image) and complex instructions. ⚠️ Note: supports stream: true only. Reference Guide
- 🎨 Image Generations (/v1/images/generations): Supports standard text-to-image generation. Reference Guide

🎉 CometAPI Adds DeepSeek V4 Series! 🎉

🔹 DeepSeek V4

deepseek-v4-pro: A 1.6T parameter MoE model supporting a 1M-token context. Designed for advanced reasoning, coding, and long-horizon agent workflows.
deepseek-v4-flash: A 284B parameter efficiency-optimized MoE model supporting a 1M-token context. Designed for fast inference and high-throughput workloads.

📚 Developer Documentation

⚠️ Note: These models currently only support the Chat Completions endpoint (/v1/chat/completions).
Quick Integration Guide (OpenAI Format)

🌟 2026-04-21

🎉 CometAPI Adds GPT-Image-2! 🎉

🔹 gpt-image-2

gpt-image-2: Adopts a new autoregressive multimodal architecture with a core breakthrough in near-perfect text rendering capabilities. It promises generation via natural language while preserving character, lighting, and scene context, capable of directly outputting 4K resolution commercial design materials.

📚 Supported Endpoints This model supports following standard OpenAI format:

Image Generations (/v1/images/generations): Supports standard text-to-image generation.Reference Guide

🌟 2026-04-20

🎉 CometAPI Adds Doubao Seedance 2.0 Video Generation Models! 🎉

🔹 doubao-seedance-2-0 / doubao-seedance-2-0-fast

doubao-seedance-2-0: ByteDance's latest high-quality video generation model, supporting both text-to-video & image-to-video.
doubao-seedance-2-0-fast: The accelerated version of Seedance 2.0 — faster generation, same powerful quality.

⚠️ Important Notice

The legacy Seedance-series official API format is deprecated. All Seedance models (including doubao-seedance-1-5-pro and doubao-seedance-1-0-pro) should now use the unified v1/videos endpoint going forward.

Note: doubao-seedance-1-5-pro and doubao-seedance-1-0-pro do not support image-to-video (input_reference is not available for these models).

📋 Parameters

Parameter	Description
Duration (seconds)	4–15 seconds, default 5
Aspect Ratio (size)	`21:9` / `16:9` / `4:3` / `1:1` / `3:4` / `9:16`, default `16:9`
Resolution (resolution)	`480p` / `720p` / `1080p`*, default `720p`

*1080p only available for doubao-seedance-1-5-pro and doubao-seedance-1-0-pro

🚀 Usage Example

curl --location --request POST 'https://api.cometapi.com/v1/videos' \
--header 'Authorization: sk-your-key' \
--header 'Content-Type: multipart/form-data' \
--form 'prompt="a cat running on the beach"' \
--form 'model="doubao-seedance-2-0"' \
--form 'seconds="5"' \
--form 'size="16:9"' \
--form 'resolution="720p"' \
--form 'input_reference=@"your_image.png"'

💡 input_reference is optional — include a reference image for image-to-video, or omit it for text-to-video. Only supported by doubao-seedance-2-0 and doubao-seedance-2-0-fast.

📖 Documentation

For full API details, please refer to: https://apidoc.cometapi.com/api/video/seedance/create

🌟 2026-04-16

🎉 CometAPI Adds Claude Opus 4.7! 🎉

🔹 claude-opus-4-7

claude-opus-4-7: The smartest agentic and coding model.

📚 Developer Documentation

🎉 CometAPI Adds Kimi K2.6, Qwen3.6-Plus, and GLM-5.1! 🎉

🔹 kimi-k2.6

kimi-k2.6: Kimi K2.6 preview version is now open for integration testing.

🔹 qwen3.6-plus

qwen3.6-plus: Newly launched, featuring comprehensively enhanced code development capabilities and synchronized improvements in multimodal recognition and reasoning efficiency, delivering an outstanding Vibe Coding experience.

🔹 glm-5.1

glm-5.1: Zhipu's latest flagship model, with greatly enhanced coding capabilities and significantly improved performance in long-horizon tasks.

The above models all follow the standard OpenAI chat format call. For details, refer to: https://apidoc.cometapi.com/chat

🌟 2026-03-27

🎉 CometAPI Now Supports Suno v5.5! 🎉

Superior Audio Quality: Significantly enhanced audio clarity, vocal performance, and mixing precision.
Immersive Experience: Delivers lifelike vocals and powerful creative control.
Professional Creation: Generates emotionally rich, genre-accurate, high-quality songs.

🛠 Usage

Set the request parameter mv to chirp-fenix.

{
    "mv": "chirp-fenix",
    "gpt_description_prompt": "cat"
}

🌟 2026-03-25

🎉 CometAPI Adds Xiaomi MiMo-V2 Series! 🎉

🔹 mimo-v2-flash , mimo-v2-omni , mimo-v2-pro

Xiaomi MiMo-V2 Model Series: Towards the Agentic Era. Integrating trillion-scale parameters, omni-modal perception, and human-like interaction—unifying understanding and action, from the present into the future.
- These models follow the standard OpenAI chat format call. For details, refer to: https://apidoc.cometapi.com/chat
- These models also support the messages format call. For details, refer to: https://apidoc.cometapi.com/anthropic-messages

🌟 2026-03-18

🎉 cometapi Adds GPT-5.4 mini/nano, GLM-5 Turbo, and Qwen3.5 Series! 🎉

🔹 gpt-5.4-mini 🔹 gpt-5.4-mini-2026-03-17 🔹 gpt-5.4-nano 🔹 gpt-5.4-nano-2026-03-17

gpt-5.4-mini, gpt-5.4-mini-2026-03-17: OpenAI's most powerful small model to date. It brings the capabilities of GPT-5.4 to a faster, more efficient architecture, designed specifically for coding, computer operations, and high-volume workloads.
gpt-5.4-nano, gpt-5.4-nano-2026-03-17: The most affordable GPT-5.4 class model, designed specifically for simple, massive-scale tasks where speed and cost are prioritized (such as classification, data extraction, and sorting).

The above models follow the standard OpenAI chat format for calls. For details, refer to: https://apidoc.cometapi.com/chat
The above models support the OpenAI response format for calls. For details, refer to: https://apidoc.cometapi.com/responses

🔹 glm-5-turbo

glm-5-turbo: GLM-5-Turbo is a base model deeply optimized for OpenClaw Lobster scenarios, delivering excellent performance and precision in domain-specific tasks.

This model follows the standard OpenAI chat format for calls. For details, refer to: https://apidoc.cometapi.com/chat

🔹 qwen3.5-122b-a10b 🔹 qwen3.5-27b 🔹 qwen3.5-35b-a3b 🔹 qwen3.5-flash

qwen3.5 Series: The latest generation of model families from Alibaba Cloud Qwen (Tongyi Qianwen). The entire series features significant improvements in coding, mathematics, and logical reasoning capabilities. The Flash version offers ultimate inference speed, while the MoE architecture versions (122B-A10B/35B-A3B) significantly reduce computational overhead while maintaining flagship-level performance, perfectly balancing performance and cost.

The above models follow the standard OpenAI chat format for calls. For details, refer to: https://apidoc.cometapi.com/chat

📣 Log Retention Notice

To ensure service stability and manage storage costs, logs on this site will only be retained for 3 months. Expired logs will be automatically deleted and cannot be recovered!

If you need long-term retention, please download or back up your logs within 3 months!

⏰ Effective Date: Immediately

Thank you for your understanding and support!

🌟 2026-03-12

🎉 CometAPI Adds Grok 4.20 Beta Series! 🎉

🔹 grok-4.20-multi-agent-beta-0309, grok-4.20-beta-0309-reasoning, grok-4.20-beta-0309-non-reasoning

grok-4.20-beta series: Grok 4.20 Beta is X.ai's newest flagship model with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherence, delivering consistently precise and truthful responses.
grok-4.20-multi-agent-beta-0309:

This model supports OpenAI response format calls. For details, refer to: https://apidoc.cometapi.com/responses

grok-4.20-beta-0309-reasoning, grok-4.20-beta-0309-non-reasoning:

These models follow the standard OpenAI chat format call and response format calls. For details, refer to: https://apidoc.cometapi.com/chat https://apidoc.cometapi.com/responses

🌟 2026-03-06

🎉 CometAPI Adds gpt-5.4 Series! 🎉

🔹 gpt-5.4-pro-2026-03-05, gpt-5.4-2026-03-05, gpt-5.4-pro, gpt-5.4

gpt-5.4-pro, gpt-5.4-pro-2026-03-05: GPT-5.4 Pro utilizes more powerful computing capabilities for deeper thinking to consistently deliver superior answers, designed to solve complex problems. GPT-5.4 Pro supports reasoning.effort: medium, high, xhigh.
- This model only supports OpenAI response format calls. For details, refer to: https://apidoc.cometapi.com/responses
gpt-5.4-2026-03-05, gpt-5.4: GPT-5.4 is a frontier model for handling complex professional tasks. reasoning.effort supports the following options: none (default), low, medium, high, and xhigh.
- The above models follow the standard OpenAI chat format call. For details, refer to: https://apidoc.cometapi.com/chat
- This model supports OpenAI response format calls. For details, refer to: https://apidoc.cometapi.com/responses

🌟 2026-03-04

🎉 CometAPI Adds gpt-5.3 chat! 🎉

🔹 gpt-5.3-chat-latest

gpt-5.3-chat-latest: This model not only provides more accurate answers but also delivers richer, more contextually relevant results. It focuses on the experience details users perceive most in daily use: tone, response relevance, and conversational flow.

📚 Developer Documentation

Quick Integration Guide (OpenAI Format)

🌟 2026-03-04

🎉 CometAPI Adds gemini-3.1-flash-lite! 🎉

🔹 gemini-3.1-flash-lite-preview 🔹 gemini-3.1-flash-lite

gemini-3.1-flash-lite is the most cost-effective model in the Gemini series, optimized for high-volume agent tasks, translation, and simple data processing.

📚 Developer Documentation

🌟 2026-02-28

🎉 CometAPI Adds grok-imagine-video! 🎉

grok-imagine-video: The latest video generation model from xAI, supporting Text-to-Video and Video Editing tasks.
- Configurable: Supports custom duration (e.g., 5s, 10s), aspect_ratio (e.g., 16:9, 1:1), and resolution (e.g., 720p) via simple parameters.
- Async API: The endpoint operates asynchronously. It returns a request_id immediately upon submission; use the GET endpoint to check status and retrieve the generated video.

For details, please refer to: https://apidoc.cometapi.com/video/grok/create

🌟 2026-02-27

🚀 Nano Banana 2 (Gemini 3.1) Flagship Image Model Released! Designed for professional workflows, integrating reasoning with high-fidelity illustration.

💎 Available Models: gemini-3.1-flash-image-preview gemini-3.1-flash-image

🔥 Core Evolutions:

Ultimate Quality: Native 4K (4096px) + Supports extreme 1:8 / 8:1 aspect ratios.
Superior Consistency: Supports 14 Reference Images (10 objects + 4 characters), perfectly replicating styles and character consistency.
Intelligent Reasoning: Built-in Thinking Process (Chain of Thought) to understand complex Prompts.
All-around Enhancements: Advanced text rendering (poster-ready) + Google Web Search verification.

🔗 Integration Docs: https://apidoc.cometapi.com/gemini-image-generation 📘 Usage Guide: https://apidoc.cometapi.com/guide-nanobanana

⚠️ Important Migration Notice: Gemini 3 Pro Deprecation

Affected Models: gemini-3-pro-preview, gemini-3-pro-preview-thinking Current Status: ❌ Deprecated Shutdown Date: March 9, 2026

🚨 Action Required: To avoid service interruption, please migrate to Gemini 3.1 Pro Preview as soon as possible.

🌟 2026-02-25

🎉 CometAPI Adds gpt-5.3-codex! 🎉

🔹 gpt-5.3-codex,gpt-audio-1.5,gpt-realtime-1.5

gpt-5.3-codex: GPT-5.3-Codex is the most powerful agent programming model to date. Optimized for agent programming tasks in Codex or similar environments. GPT-5.3-Codex supports reasoning parameters set to Low, Medium, High, and Ultra-High.

This model only supports the OpenAI response format. For details, please refer to: https://apidoc.cometapi.com/responses

gpt-audio-1.5: The best voice model for audio input and audio output in Chat Completions.

This model follows the standard OpenAI chat format. For details, please refer to: https://apidoc.cometapi.com/chat

gpt-realtime-1.5: The best voice model for audio input and audio output.

This model follows the OpenAI Realtime API format.

🌟 2026-02-24

🎉 CometAPI Adds doubao-seedream-5-0-260128! 🎉

doubao-seedream-5-0-260128 - Doubao-Seedream-5.0-lite is ByteDance's latest image generation model. This model is the first to feature web retrieval capabilities, integrating real-time online information to enhance the timeliness of generated images. Additionally, the model's intelligence has been upgraded, enabling it to accurately parse complex prompts and visual content. Furthermore, it boasts enhancements in the breadth of world knowledge, reference consistency, and generation quality for professional scenarios, better satisfying enterprise-level visual creation needs.
- Model ID: doubao-seedream-5-0-260128

📚 Developer Documentation

Quick Integration Guide (doubao-image)

🌟 2026-02-19

🎉 CometAPI Adds Gemini 3.1 Series! 🎉

🔹 Gemini 3.1 Series

gemini-3.1-pro-preview, gemini-3.1-pro-preview-thinking: Gemini 3.1 Pro is the next generation in the Gemini series of models, a suite of highly-capable, natively multimodal, reasoning models. Gemini 3 Pro is now Google’s most advanced model for complex tasks, and can comprehend vast datasets, challenging problems from different information sources, including text, audio, images, video, and entire code repositories

📚 Developer Documentation

🌟 2026-02-18

🚀 CometAPI: Support for Claude Sonnet 4.6

✨ Core Features

Most Powerful All-Round Model: Claude Sonnet 4.6 delivers a world-class experience in coding and logical reasoning.
Dual-Protocol Support: Seamlessly compatible with both the OpenAI Standard Format and the Anthropic Native Format.

🔌 Call Parameters

Model Names: claude-sonnet-4-6, cometapi-sonnet-4-6

📚 Developer Documentation

🌟 2026-02-17

🎉 CometAPI Adds Qwen3.5 Series! 🎉

🔹 Qwen3.5 Series

qwen3.5-397b-a17b: The Qwen3.5 series 397B-A17B native vision-language model is based on a hybrid architecture design, fusing linear attention mechanisms with sparse Mixture-of-Experts (MoE) models to achieve higher inference efficiency. In various tasks such as language understanding, logical reasoning, code generation, agent tasks, image understanding, video understanding, and graphical user interfaces (GUI), it demonstrates excellent performance comparable to current top-tier frontier models. It possesses powerful code generation and agent capabilities, with good generalization for various agent scenarios.
qwen3.5-plus, qwen3.5-plus-2026-02-15, qwen3.5-plus-thinking: The Qwen3.5 native vision-language series Plus model is based on a hybrid architecture design, fusing linear attention mechanisms with sparse Mixture-of-Experts (MoE) models to achieve higher inference efficiency. In multiple task evaluations, the 3.5 series demonstrates excellent performance comparable to current top-tier frontier models, achieving a leap in performance in both pure text and multimodal aspects compared to the 3 series. This version is a snapshot from February 15, 2026.

📚 Developer Documentation

Quick Integration Guide (OpenAI Format)

🌟 2026-02-14

🎉 CometAPI Now Supports Doubao Seed 2.0 Series! 🎉

🔹 Doubao Seed 2.0 Series

doubao-seed-2-0-code-preview-260215 Focuses on long-chain reasoning capabilities and complex task stability, adapted for complex scenarios in real business environments. As the coding-enhanced version of Seed 2.0, it is better suited for Agentic Coding.
doubao-seed-2-0-lite-260215 Balances generation quality with response speed, making it suitable as a general-purpose production-grade model.
doubao-seed-2-0-mini-260215 Designed for low-latency, high-concurrency, and cost-sensitive scenarios. It emphasizes rapid response and flexible inference deployment, supporting four-level thinking and multimodal understanding capabilities.

📚 Developer Documentation

Quick Integration Guide (OpenAI Format)

🌟 2026-02-13

🎉 CometAPI Adds minimax-m2.5! 🎉

🔹 minimax-m2.5

The world's first production-grade model natively designed for Agents. Its Coding & Agentic performance benchmarks directly against Claude Opus 4.6.

Full-Stack Coding: Supports PC, App, and cross-platform application development.
Office SOTA: Leads the industry in core productivity scenarios such as advanced Excel processing, in-depth research, and PPT generation.

📚 Developer Documentation

Quick Integration Guide (OpenAI Format)

🌟 2026-02-12

🎉 CometAPI Adds glm-5! 🎉

🔹 glm-5

Zhipu's new generation flagship base model, built for Agentic Engineering. It provides reliable productivity in complex system engineering and long-horizon Agent tasks; the usage experience in real-world coding scenarios approaches Claude Opus 4.5.

📚 Developer Documentation

Quick Integration Guide (OpenAI Format)

🌟 2026-02-06

🚀 cometapi: Supports Claude Opus 4.6

✨ Core Features

Ultimate Intelligence Model: Claude Opus 4.6 delivers world-class programming and logical reasoning experience.
Dual Protocol Support: Perfectly compatible with OpenAI Standard Format and Anthropic Native Format.

🔌 Call Parameters

Model Names: claude-opus-4-6 ,cometapi-opus-4-6

📚 Developer Documentation

⚠️ 2026-02-05

🔄 CometAPI: chatgpt-4o-latest Deprecation Notice

✨ Change Details

Upcoming Shutdown: In accordance with the official schedule, chatgpt-4o-latest will be discontinued on Feb 17, 2026.
Migration: Please migrate to the latest flagship GPT-5.2 Series. We recommend gpt-5.2 for most use cases or gpt-5.2-chat-latest for the newest chat improvements.

🔌 Recommended Models

Model Names: gpt-5.2 , gpt-5.2-chat-latest

📚 Developer Documentation

👉 API Docs

⚠️ 2026-02-04 🔄 CometAPI: Doubao Model Update Notice ✨ Change Details

Legacy Deprecation: In compliance with official policy, the Doubao 1.5 / 1.6 Series have been discontinued.

Migration: Please switch to doubao-seed-1.8.

🔌 Recommended Model

Model Name: doubao-seed-1.8

📚 Developer Documentation

👉 API Docs

🌟 2026-01-28

🦌 Comet Update: Qwen3 Flagship / Kimi Long Context / OCR v2

🚀 New Models

qwen3-max-2026-01-23 (General Flagship)
The strongest snapshot of the Qwen3 series, introducing a Deep Reasoning module. Improves complex logic deduction and code refactoring capabilities by 40%. Ideal for research assistance and system-level instructions.
kimi-k2.5 (Long Context)
Kimi's smartest model to date. Built on a native multimodal architecture, supporting both vision and text inputs simultaneously.
deepseek-ocr-2 (Visual Extraction)
Specialized in handwriting and complex table restoration. Eliminates hallucinations in dense formulas and supports direct Markdown/JSON structured output.
👉 API Docs

🌟 2026-01-19

🎉 Major Update! CometAPI Now Supports gpt-5.2-codex ！ 🎉

🚀 Available Models & Usage Guide

🔹 `gpt-5.2-codex` (For Professional Code Tasks)

Model ID:gpt-5.2-codex
Description: Optimized for coding tasks like code generation, completion, and analysis to leverage its best-in-class coding capabilities.
Required Endpoint: /v1/responses (Note: This endpoint must be used for this model.)
Documentation: 👉 Check out the Responses API documentation

2026-01-08

1️⃣ Doubao-Seed-1.8 (Multimodal)

Deep reasoning and powerful multimodal understanding

Model ID: doubao-seed-1-8-251228
Endpoint: /v1/chat/completions
👉 API Docs

2️⃣ Kling 2.6 (Video Generation)

Cinematic quality with native audio synthesis and audio-visual synchronization

Model ID: kling-v2-6
Features: Text/Image-to-Video | 5s/10s | Multiple Audio Modes
👉 API Docs

📅 2026-01-04

🌟 CometAPI Major Release: FLUX 2 MAX is Now Live 🎉

🚀 Multiple Access Methods Now Available:

🔹 Compatible Format

Model Name: black-forest-labs/flux-2-max
👇 Integration Docs: Create Predictions - API Doc (Replicate Format)

🔹 BFL Native Format

Model Name: flux-2-max
👇 Integration Docs: Flux Generate Image - API Doc (Native Format)

💡 FLUX 2 MAX Core Highlights: 🎯 Ultimate Complex Editing Capabilities 🛍️ E-commerce Photography Revolution: From 0 to 1 🎬 Cinematic-Grade Keyframe Generation 🎨 Hex-Code Level Color Control 👓 Single-Image 3D View Generation 🌐 Real-Time Info Driven Creation

🌟 2025-12-17

🔥 Comet New Release: Gemini 3 Flash — lightweight, efficient multimodal model & GPT-Image-1.5 — state-of-the-art image generation model

1️⃣ Multimodal Conversation Model

gemini-3-flash ⚡️ Key Features:
Fast response
Ultra-low latency
Multimodal understanding and generation
Lightweight and efficient, ideal for real-time scenarios

✅ Recommended Endpoint:

/v1/chat/completions

👉 Integration Guide

1️⃣ Image Generation Model

gpt-image-1.5 ⚡️ Key Features:
Ultra-fast generation
Strong prompt understanding
High-fidelity image quality
Stable faces and identity consistency
✅ Recommended Endpoint:
/v1/images/generations

👉 Integration Guide

🌟 2025-12-12

🔥 Comet New Release: GPT-5.2 Series

⚡️ Key Features: Comprehensive performance upgrade! Pro version delivers ultimate logical reasoning & stability; Chat Latest features an up-to-date knowledge base.

💎 New Models & Integration Guide:

1️⃣ Standard / Latest

gpt-5.2
gpt-5.2-chat-latest ✅ Recommended Endpoints: /v1/chat , /v1/responses 👉 Chat API Docs 👉 Responses API Docs

2️⃣ Pro Version

gpt-5.2-pro ⚠️ Required Endpoint: /v1/responses (MUST use this endpoint) 👉 Responses API Docs

🚀 Fully available now. Happy coding!

🌟 2025-12-04

✨ New Models

deepseek-v3.2 - Official stable version now available
- Model ID: deepseek-v3.2
ByteDance Seedream 4.5 - Advanced image generation
- Model ID: doubao-seedream-4-5-251128
- Key improvements: Better quality, precise detail control, multi-image support
- Documentation: CometAPI - ByteDance Image Generation
Sora - Now supports character creation
- Documentation: CometAPI - sora

🔄 Model Deprecation - Action Required

⚠️ gpt-4o-realtime-preview-2024-10-01 will be deprecated on December 3, 2025.

Please migrate to: gpt-realtime
New features: Improved reliability, better tool calling, enhanced interruption handling, and 2 new voices (Cedar & Marin).

🌟 2025-11-27

🚨 [URGENT] Announcement: Deprecation and Upgrade of Claude 3 Series & Gemini 2.5 Preview Models

According to the latest official notifications from Anthropic and Google, our platform will officially deprecate the legacy Claude 3 Series and Gemini 2.5 Preview Series Seriesmodels on December 1st at 00:00. To avoid API call failures, please ensure you switch to the following Model IDs before the deadline:

1. Claude Series (Upgrade to 4.5)

Version	Please replace with new Model ID
Intelligent (Sonnet)	`claude-sonnet-4-5-20250929`
Most Powerful (Opus)	`claude-opus-4-5-20251101`
Fastest (Haiku)	`claude-haiku-4-5-20251001`

2. Gemini Series (Upgrade to 2.5 Stable / 3.0 Preview)

Version	Please replace with new Model ID
Standard (Flash)	`gemini-2.5-flash`
Image Enhanced	`gemini-2.5-flash-image` or `gemini-3-pro-image-preview`
Professional (Pro)	`gemini-2.5-pro` or `gemini-3-pro-preview`

⚠️ Note: The old models will cease to function immediately after December 1st. Please migrate as soon as possible to ensure business continuity.

📅 2025-11-26

🌟 CometAPI Major Launch: FLUX.2 Series - Limited Time Offer 🎉

🚀 Now Supporting Asynchronous Format Models: 🔹 black-forest-labs/flux-2-pro 🔹 black-forest-labs/flux-2-dev 🔹 black-forest-labs/flux-2-flex

💰 Limited Time Promotion: Lower than Replicate Official Pricing!

💡 FLUX.2 Key Highlights: 🖼️ Multi-Reference Editing: Supports 8-10 reference images to satisfy complex character generation needs. 📸 Ultra-High Quality: Up to 4MP resolution for ultimate natural realism. ⚡ Flexible Selection: • Pro: Designed for high-efficiency production and fast delivery. • Flex: Maximizes image quality with adjustable parameters. • Dev: Developer-friendly optimization.

👇 Start Building Now Create Predictions - API Doc

🌟 2025-11-25 🎉 CometAPI Launches Claude Opus 4.5 Series!

🚀 Available Models: 🔹 claude-opus-4-5-20251101-thinking 🔹 claude-opus-4-5-20251101 🔹 cometapi-opus-4-5-20251101-thinking 🔹 cometapi-opus-4-5-20251101

💡 Why Claude Opus 4.5? Top choice for intensive reasoning, code automation, and complex Agent systems.

✨ Key Highlights: 🧠 Superior Reasoning: Handles complex logic. 📝 Automation: Enterprise-grade efficiency. 🤖 Agents: Advanced tool integration. ⚡ Stability: Reliable long-context performance.

📖 Documentation: 👉 Chat - API Doc-CometAPI 👉 Anthropic Messages - API Doc-CometAPI

Experience world-class AI capabilities today! 🚀

🌟 2025-11-20

🎉 CometAPI Launches Nano Banana Pro ! 🎉

🔹 gemini-3-pro-image-preview,gemini-3-pro-image Gemini 3 Pro Image (also known as nanobanana pro) is Google’s flagship image generation model designed for high-fidelity professional workflows. This release introduces "Deep-Context" understanding for highly complex prompts, perfects in-image typography generation, offers distinct object editing without manual masking, and significantly enhances photorealism and lighting physics. Follows the Google standard format. See details: CometAPI Chat Documentation https://apidoc.cometapi.com/gemini-generates-image-20873272e0 GUIDE：https://apidoc.cometapi.com/guide-to-calling-gemini-2-5-flash-image-1425263m0

🎉 CometAPI Launches Grok 4.1 Fast Series Models! 🎉

🚀 Available Models:

🔹 grok-4-1-fast-reasoning, grok-4-1-fast-non-reasoning

A cutting-edge multimodal model designed specifically for high-performance tool calling and complex interaction scenarios. It delivers exceptional logical processing capabilities while maintaining ultra-fast response speeds. Supports a maximum context of 2M tokens.

Flexible Dual Modes:
- reasoning: Enhanced logical reasoning, ideal for complex problem-solving.
- non-reasoning: Optimized for extreme speed, ideal for high-concurrency tasks.

Format Support: Chat format

Documentation: 👉 Check out the Chat API documentation

🌟 2025-11-19

🎉 CometAPI Launches Gemini 3 Pro Model! 🎉

🔹 gemini-3-pro-preview,gemini-3-pro-preview-thinking

Google's most intelligent model with SOTA (state-of-the-art) reasoning and multimodal understanding capabilities, featuring powerful agentic and vibe coding abilities. Max context: 2M tokens; Knowledge cutoff: January 1, 2025.

Key Features:

Unified Multimodal: Text, image, audio, and video processing with real-time analysis
Million-Token Context: Handle massive documents and codebases
Advanced Reasoning: Multi-step problem-solving with RL optimization
High Performance: Sparse MoE architecture + Google TPU v6

Best For: AI agents, code generation, multimodal understanding

Format Support: Chat format

Documentation: 👉 Check out the Chat API documentation

🌟 2025-11-14

🎉 Major Update! CometAPI Now Supports the Full GPT-5.1 Model Series! 🎉

🚀 Available Models & Usage Guide

GPT-5.1 is OpenAI's latest flagship model, designed for advanced coding and agent tasks.

General Specs: 400k context window, 128k max output, with a knowledge cutoff of September 30, 2024.

🔹 `gpt-5.1` & `gpt-5.1-chat-latest` (For Dialogue & General Tasks)

Model IDs: gpt-5.1, gpt-5.1-chat-latest
Description: OpenAI's flagship models, ideal for building multi-turn conversational applications that demand powerful reasoning and comprehension.
Recommended Endpoint: /v1/chat
Documentation: 👉 Check out the Chat API documentation

🔹 `gpt-5.1-codex` (For Professional Code Tasks)

Model ID: gpt-5.1-codex
Description: Optimized for coding tasks like code generation, completion, and analysis to leverage its best-in-class coding capabilities.
Required Endpoint: /v1/responses (Note: This endpoint must be used for this model.)
Documentation: 👉 Check out the Responses API documentation

🎉 CometAPI Grand Launch of qwen-image and qwen-image-edit! 🎉

🚀 Available Models:

🔹 `qwen-image`

🔹 `qwen-image-edit`

qwen-image: It is a universal image generation model, mainly used to generate completely new images based on text, emphasizing the ability to create from scratch. It is suitable for scenarios such as creative generation, stylized drawing, and more.

It is trained on large-scale vision-language models, supports multi-language prompts, but its core focus is on generation rather than editing.

qwen-image-edit: An optimized version based on Qwen-Image, specifically tailored for image editing tasks. It features stronger capabilities in local modifications and consistency preservation. It can not only generate new images but also perform precise edits on existing images.

The above models follow the OpenAI standard image generation format for calls. For details, refer to: Text-to-Image, Image-to-Image

🌟 2025-11-12

🎉 CometAPI proudly launches the new gpt-image-1-mini model! 🎉

🚀 Available Models:

🔹 `gpt-image-1-mini`

gpt-image-1-mini: OpenAI's cost-effective image generation model, supporting text/image as input and outputting images; suitable for large-scale, cost-sensitive generation scenarios.
The above models follow the OpenAI standard image generation format for calls. For details, refer to: Text-to-Image, Image-to-Image

📢 Additional Announcements:

CometAPI Partners with Bria! CometAPI has reached a cooperation with Bria, and in November 2025, bria all interfaces will be freely open to all users for calls. Have a try!
Sora Asynchronous Format Update: CometAPI has completed the replacement of Sora's asynchronous format and no longer supports the open chat format.
- Please use sora-2-pro or sora-2 models, which call this interface (official per-second billing): Sora API.
- Use sora-2-all or sora-2-pro-all models, which call this interface (billed per item, after discount: sora-2-all: 0.08, sora-2-pro-all: 0.8): Sora All API.

🌟 2025-11-10

🎉 CometAPI Excitingly Launches K2-Thinking Series New Models! 🎉

🚀 Available Models:

🔹 `k2-thinking`

🔹 `k2-thinking-turbo`

k2-thinking: Moonshot AI's most advanced open reasoning model, extending the K2 series. It is a thinking model with universal Agentic capabilities and reasoning abilities. Supports 256K tokens context window.
k2-thinking-turbo: Based on k2-thinking, it provides faster response speeds and higher concurrency capabilities, supporting the same 256K context and reasoning functions, suitable for high-efficiency scenarios.
The above models follow the OpenAI Chat standard format for invocation. For details, refer to: https://apidoc.cometapi.com/chat

🌟 2025.11.07

Comet Major Update Announcement: Sora-2 Invocation Method Optimization

To improve efficiency and stability, we will optimize the Sora-2 invocation method starting from UTC 2025-11-11 8:00.

Key Changes

No longer supported: Using the OpenAI reverse-engineered Chat format for invocation.
New asynchronous format: Model name switches to sora-2-all or sora-2-pro-all to call the asynchronous interface format (notification will be sent as soon as it's live).
Pricing remains unchanged.

Recommended Actions

Please complete the interface switch by the update time to avoid service interruption.
We will provide the new format as soon as possible for testing to ensure a smooth transition.
Currently, you can continue using the official format (billed per second). For details, see the documentation: https://apidoc.cometapi.com/create-video-22425640e0.

If you have any questions, please contact customer service. We are committed to providing a better experience—thank you for your support!

🌟 2025.11.03

🎬 KLING New Features

🧑‍💻 Digital Human Tasks: Supports creating digital human tasks (first perform speech synthesis to obtain taskid and audioid), generating highly realistic digital human videos to enhance interactivity.
🚀 Advanced Model: Added kling-v2-5-turbo, providing faster speed and higher video quality (Pro mode only).
🛠️ Other Features: Including image recognition, face recognition (lip-sync), task creation (lip-sync), speech synthesis, seamlessly integrated into the video generation workflow.
📚 API Calls: View API Documentation

💥 Sora-2 Pricing Update (CometAPI Official Format)

📉 Price Reduction: Sora-2 model calls via CometAPI are now discounted to 80% of the official price, making high-quality video generation more accessible and cost-effective for all users.
🛠️ Integration Details: Seamlessly integrate Sora-2 into your workflows with standard CometAPI calls. For full API reference, check the updated documentation.
🚀 Availability: This pricing update is live now—start saving on your Sora-2 tasks today!

🔥 All New Features Are Now Fully Available, Welcome to Test and Experience! 🔥

⭐️ 2025-10-17

🎉 CometAPI Model Update Announcement 🎉

We've added three powerful AI models, all supporting chat format calls to accelerate your AI application development!

🚀 New Models

Claude Haiku 4.5

Model ID: claude-haiku-4-5-20251001 / cometapi-haiku-4-5-20251001
⚡ Low Latency & High Throughput: Optimized for real-time, high-concurrency scenarios
🧠 Configurable Reasoning Depth: Supports "extended thinking" mode
📄 Massive Context: Up to 200K input tokens, 8K output tokens
💻 Strong Code Capabilities: Code generation, debugging, tool calling
💰 Cost Advantage: ~1/3 the cost of Sonnet 4
🔧 Format Support: Claude native message format + chat format

GLM-4.6

Zhipu AI's latest flagship model with 355B total params, 32B active
💻 Coding Excellence: Aligns with Claude Sonnet 4, best in China
📚 Extended Context: Expanded from 128K to 200K tokens
🧠 Enhanced Reasoning: Supports tool calling during inference
🔍 Search Optimization: Improved tool calling and agent performance
✍️ Better Writing: Enhanced style, readability, and role-playing alignment
🌍 Multilingual: Boosted cross-language translation capabilities
🔧 Format Support: Chat format

Veo3.1 & Veo3.1-Pro

Google's latest AI video generation models for high-quality video creation
🎬 High Resolution: 1080p video generation
🎵 Synchronized Audio: Dialogue, ambient sounds, effects with native lip-sync
⏱️ Video Length: Generate seamless clips up to 8 seconds
🎨 Creative Control: Reference image support, first/last frame setting, cinematic presets
⚡ Dual Variants: Veo3.1 (standard quality) + Veo3.1-Pro (maximum quality)
🔧 Format Support: Async calls + chat format

All models support chat format calls, with Claude models additionally supporting native message format for maximum integration flexibility!

🌟 2025-10-10

🎉 Major Model Update - 3 New AI Services! 🎉

🔥 GPT-5 Complete Series (7 Models)

World's most advanced reasoning models with 400k context window

gpt-5-minimal - Lightning-fast for simple tasks
gpt-5-low - Speed-optimized (212 tokens/sec)
gpt-5-medium - Balanced performance for general use
gpt-5-high - Maximum "deep thinking" mode
gpt-5-codex-low/medium/high - Specialized for coding & software engineering
✨ Features: State-of-the-art coding, mathematics, visual perception & complex reasoning

🎬 Sora-2 & Sora-2 Pro

Official OpenAI video generation with synced audio
Realistic physics & object interactions
Professional-grade cinematic quality
Same pricing as OpenAI official rates

📋 API Documentation:

GPT-5 Series: https://apidoc.cometapi.com/response
Sora-2: https://apidoc.cometapi.com/create-video
🚀 All models live now!

🌟 2025-10-06

🎉 CometAPI Now Supports GPT-5 Pro! 🎉

We're excited to announce that GPT-5 Pro - OpenAI's most advanced AI model - is now available on CometAPI!

🚀 Key Features:

Enhanced Reasoning: Superior performance on complex tasks
Advanced Problem Solving: Unparalleled accuracy and depth
Multi-Domain Excellence: Exceptional capabilities across all fields

🛠 Usage

Use the following model names in your API calls:

gpt-5-pro-2025-10-06
gpt-5-pro

📖 API Documentation:

https://apidoc.cometapi.com/response-18535147e0

Ready to experience the next generation of AI? Start building with GPT-5 Pro today!

🌟 2025-10-01

CometAPI Now Supports Sora 2 API Calls

We're excited to announce that CometAPI now fully supports OpenAI's latest Sora 2 video generation model! Developers can now easily access this groundbreaking AI video generation technology through our unified API interface.

Sora 2 Features:

✨ Highly Realistic Video Generation: Creates physics-accurate, visually stunning video sequences perfect for short-form content
🎵 Synchronized Audio & Video: Supports synchronized audio and dialogue generation for complete video content
⏱️ Temporal Consistency: Ensures objects and scenes remain coherent throughout the video duration
🎬 Multi-Style Support: From cinematic quality to anime styles, meeting diverse creative needs
👤 Real-World Cameo Feature: Inject real people, animals, or objects with accurate likeness reproduction
🎯 Advanced Control: Precise editing controls for re-rendering specific objects or scenes
🛡️ Built-in Safety: Comprehensive safety measures and content moderation

Important Notes:

⚠️ Due to limited official compute capacity during the initial launch, you may experience some instability - we appreciate your patience
📡 For video generation using chat format, please use streaming output

API Integration:

Sora-2 is now live and compatible with OpenAI Chat Completions. Switch the base URL to CometAPI and use the key obtained from the CometAPI console to make calls.

Minimal example (replace Authorization with your CometAPI key):

bash curl --location --request POST 'https://api.cometapi.com/v1/chat/completions'
--header 'Authorization: sk-'
--header 'Content-Type: application/json'
--header 'Accept: /'
--header 'Host: api.cometapi.com'
--header 'Connection: keep-alive'
--data-raw '{ "model": "sora-2", "stream": true, "messages": [ { "role": "user", "content": "Generate a cute kitten sitting on a cloud, cartoon style" } ] }'

👉 Visit https://www.cometapi.com to start experiencing Sora 2's powerful capabilities! For questions, join our Discord community: https://discord.com/invite/HMpuV6FCrG

🌟 2025-09-30

🎉 CometAPI Now Supports Claude Sonnet 4.5, DeepSeek-V3.2-Exp, and Gemini 2.5 Flash New Versions! 🎉

🚀 Claude Sonnet 4.5

Available Model Names: claude-sonnet-4-5-20250929-thinking, claude-sonnet-4-5-20250929, claude-sonnet-4-5, cometapi-sonnet-4-5-20250929-thinking, cometapi-sonnet-4-5-20250929, cometapi-sonnet-4-5
Claude Sonnet 4.5 has world-leading coding capabilities (SOTA-Level Coding). It achieved an astonishing 77.2% accuracy on the authoritative SWE-bench benchmark, which measures real-world software engineering abilities, making it the world's strongest coding model. This means it has made a qualitative leap in handling complex programming tasks, debugging, and even architectural design.

🚀 DeepSeek-V3.2-Exp Highlights

The DeepSeek-V3.2-Exp model is an experimental (Experimental) version. As an intermediate step towards the next-generation architecture, V3.2-Exp introduces DeepSeek Sparse Attention (a sparse attention mechanism) based on V3.1, and conducts exploratory optimization and verification for the training and inference efficiency of long texts.

🚀 Gemini 2.5 Flash Highlights

gemini-2.5-flash-preview-09-2025: A model that excels in cost-effectiveness and provides comprehensive features. 2.5 Flash is best suited for large-scale processing of low-latency, high-data-volume tasks that require thinking, as well as agent application scenarios.
gemini-2.5-flash-lite-preview-09-2025: The fastest Flash model, specially optimized for cost-benefit and high throughput.
Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025-09-23

🚀 New and Updated Models：

🔹 grok-4-fast-non-reasoning

grok-4-fast-non-reasoning: The non-reasoning variant of xAI's Grok-4 Fast series, with a unified architecture for handling fast responses, suitable for real-time search and simple queries. It possesses extremely powerful technical parameters and ecosystem capabilities: context window supports up to 2,000,000 tokens, cost-efficient (input $0.20/million tokens), leading mainstream models.

🔹 grok-4-fast-reasoning

grok-4-fast-reasoning: The reasoning variant of xAI's Grok-4 Fast series, supporting long-chain thinking and tool calls, suitable for complex tasks such as mathematical reasoning and agent workflows. Ranked first in the LMArena search arena (1163 Elo), it possesses extremely powerful technical parameters and ecosystem capabilities: context window supports up to 2,000,000 tokens, leading mainstream models.

🔹 grok-code-fast-1

grok-code-fast-1: xAI's fast model specifically designed for agent coding, optimized for tool integration such as grep and file editing, achieving 70.8% performance on SWE-Bench-Verified, suitable for automated code generation and debugging. Currently supports text modality, with vision and other features coming soon. It possesses extremely powerful technical parameters and ecosystem capabilities: context window supports up to 256,000 tokens, leading coding-specific models.
Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025-09-11

🚀 New and Updated Models: minimax-hailuo-02, bytedance-seedream-4-0-250828, VEO3 Updated!

🔹 minimax-hailuo-02

Support for minimax-hailuo-02 model, which is MiniMax's latest masterpiece, an AI video generation model aimed at completely transforming the video creation process. It not only inherits the advantages of the previous generation Hailuo 01, but also achieves a qualitative leap in core technology and user experience.
Click the link to experience it now: https://apidoc.cometapi.com/minimax-conch-generation-14660582e0

🔹 bytedance-seedream-4-0-250828

Support for bytedance-seedream-4-0-250828, as a new-generation image creation model, Seedream 4.0 integrates image generation and image editing capabilities into a unified architecture. This enables it to flexibly handle complex multimodal tasks, including knowledge-based generation, complex reasoning, and reference consistency. Compared to its predecessor, it has faster inference speed and can produce stunning high-definition images up to 4K resolution.
Click the link to experience it now: https://apidoc.cometapi.com/bytedance-image-generation-19773064e0

🔹 VEO3

The entire VEO3 series follows the official price reduction, with comet prices reduced to half of the original, welcome to call.
VEO3 now supports asynchronous interfaces for task processing, optimizing the calling efficiency of long-duration tasks and enhancing the overall experience.
Click the link to experience it now: https://apidoc.cometapi.com/submit-video-generation-task-18941528e0

🌟 2025-09-07

🎉 CometAPI Heavyweight Launch: kimi-k2-250905 and qwen3-max-preview! 🎉

🔹 kimi-k2-250905

kimi-k2-250905: Moonshot AI's Kimi K2 series 0905 version, supporting ultra-long context (up to 256k tokens, frontend and tool calling).
🧠 Enhanced Tool Calling: 100% accuracy, seamless integration, suitable for complex tasks and integration optimization.
⚡️ More Efficient Performance: TPS up to 60-100 (standard API), up to 600-100 in Turbo mode, providing faster responses and improved reasoning capabilities, with knowledge cutoff to mid-2025.

🔹 qwen3-max-preview

qwen3-max-preview: Alibaba's Tongyi Qianwen team's latest developed Qwen3-Max-Preview model, positioned as the peak performance in the series.
🧠 Powerful Multimodal and Reasoning: Supports ultra-long context (up to 128k tokens) and multimodal input, excels in complex reasoning, code generation, translation, and creative content.
⚡️ Breakthrough Improvements: Significant optimization in multiple technical indicators, faster response speed, knowledge cutoff to 2025, suitable for enterprise-level high-precision AI applications.

✅ All models belong to the default group, with seamless integration. It is recommended to choose the most suitable version based on your specific business scenarios (performance, speed, cost) to maximize application value.

Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🌟 2025-08-27

🔹 gemini-2.5-flash-image-preview, gemini-2.5-flash-image

gemini-2.5-flash-image-preview, gemini-2.5-flash-image: Gemini 2.5 Flash Image (also known as nano-banana) is Google’s most advanced image generation and editing model. This update enables you to blend multiple images into a single image, maintain character consistency to tell richer stories, perform targeted transformations using natural language, and use Gemini’s world knowledge to generate and edit images.
Please click: https://apidoc.cometapi.com/gemini-generates-image-20873272e0

🌟 2025-08-22

🔹 deepseek-v3.1, deepseek-v3-1-250821

deepseek-v3.1, deepseek-v3-1-250821: DeepSeek-V3.1 is DeepSeek's all-new hybrid inference model.
🧠 Hybrid inference: Think & Non-Think — one model, two modes
⚡️ Faster thinking: DeepSeek-V3.1 reaches answers in less time vs. DeepSeek-R1-0528
Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

🔹 Kling

✨ Massive Video Effects Library Expansion: Added 63 new video effects (62 single-subject effects and 1 two-person interactive effect), bringing the total to 80 available effects for more creative choices.
🔊 Video-to-Audio Optimization: The video-to-audio generation feature now supports full-resolution video uploads for more precise sound effect matching.
📈 Multi-Image to Video Performance Skyrockets: Experience a 102% improvement over the previous version! See significant enhancements in subject consistency, dynamic quality, and interaction naturalness. This is a seamless upgrade with no code changes required.
🎬 Text-to-Video Quality Upgrade: Version 1.6 now supports the generation of higher-quality videos.
- Parameter Example: "mode": "pro"
Documentation: Kling Video Generation
🎨 Image Generation Model Update: The new kling-v2-new model is now live, supporting nearly 300 image styles to maximize your creativity!
Documentation: Kling Image Generation

🌟 2025-08-18

🚀 New and Updated Models: Runway, VEO3, Hunyuan3D, Midjourney Fully Updated!

🔹 Runway

Runway model adds multiple core functions, expanding video and image generation capabilities:
Video to Video: Video to video generation.
Text to Image: Text to image generation.
Video Upscale: Video super-resolution enhancement.
Control a Character: Character control function.
Click the link to experience it now: https://apidoc.cometapi.com/generate-a-video-from-a-video-20308134e0

🔹 VEO3

VEO3 now supports asynchronous interface for task processing, optimizing the calling efficiency of long-duration tasks and enhancing the overall experience.
Click the link to experience it now: https://apidoc.cometapi.com/submit-video-generation-task-18941528e0

🔹 Hunyuan3D

Supports Hunyuan3D-2, providing powerful 3D content creation capabilities to assist in efficiently generating high-quality 3D models.
Click the link to experience it now: https://apidoc.cometapi.com/hunyuan3d-20073774e0

🌟 2025-08-08

🔹 GPT-5 Series

gpt-5, gpt-5-2025-08-07: OpenAI's flagship model, widely recognized as the industry's most powerful for coding, reasoning, and agentic tasks. It is designed to handle the most complex cross-domain challenges and excels in code generation, advanced reasoning, and autonomous agents, making it the premier choice for users demanding peak performance.
gpt-5-chat-latest: The continuously updated version of GPT-5. It always incorporates the latest features and optimizations, recommended for applications that need to stay current with the latest model capabilities.

🔹 GPT-5 Mini Series

gpt-5-mini, gpt-5-mini-2025-08-07: The cost-effective version of GPT-5, specifically optimized for speed and cost. It strikes an excellent balance between performance and affordability, making it the ideal choice for everyday tasks like general chat, content creation, and routine Q&A.

🔹 GPT-5 Nano Series

gpt-5-nano, gpt-5-nano-2025-08-07: The fastest and most cost-effective lightweight version in the GPT-5 family. It is perfect for scenarios requiring high throughput and instant responses, such as text classification, sentiment analysis, summary extraction, and data formatting.

API Call Instructions:

gpt-5-chat-latest should be called using the standard /v1/chat/completions format.
For other models (gpt-5, gpt-5-mini, gpt-5-nano, and their dated versions), using the /v1/responses format is recommended.
For details, please refer to: https://apidoc.cometapi.com/api-13851472

Note

Important: top_p is not supported by this series of models.
Temperature Settings
- gpt-5-chat-latest: Supports custom temperature values between 0 and 1 (inclusive).
- All other GPT-5 models: The temperature is fixed at 1. You may set it to 1 or omit it (defaults to 1).
When calling the GPT-5 series models (excluding gpt-5-chat-latest), the max_tokens field should be changed to max_completion_tokens.

🌟 2025-08-06

🔹 claude-opus-4-1-20250805

claude-opus-4-1-20250805: Anthropic's flagship Claude Opus 4.1 model, achieving major breakthroughs in programming, reasoning, and agentic tasks, with SWE-bench Verified reaching 74.5%.
Significantly enhanced multi-file code refactoring, debugging precision, and detail-oriented reasoning capabilities. This model is suitable for demanding programming and reasoning scenarios.
We have also added cometapi-opus-4-1-20250805 specifically for Cursor integration.

🔹 claude-opus-4-1-20250805-thinking

claude-opus-4-1-20250805-thinking: Claude Opus 4.1 version with extended thinking capabilities, providing up to 64K tokens of deep reasoning capacity.
Optimized for research, data analysis, and tool-assisted reasoning tasks, with powerful detail-oriented reasoning abilities.
We have also added cometapi-opus-4-1-20250805-thinking specifically for Cursor integration.

🔹 gpt-oss-120b

gpt-oss-120b: OpenAI's released 117B parameter Mixture of Experts (MoE) open-source model, designed for high-level reasoning, agentic, and general production use cases.

🔹 gpt-oss-20b

gpt-oss-20b: 21B parameter open-source MoE model with 3.6B active parameter architecture, optimized for low-latency inference and consumer-grade hardware deployment.
All above models follow the OpenAI chat standard format for API calls. For details, please refer to: https://apidoc.cometapi.com/api-13851472

🌟 2025-08-05

🚀 Feature Updates: gemini-2.5-flash-lite, o3 & o4-mini Deep Research, Volcano Engine Generation Models

- gemini-2.5-flash-lite - Google's most cost-effective model, built for large-scale tasks!

⚡️ High Efficiency: Designed for large-scale, low-latency applications.
🔧 Standard Format: Follows the OpenAI chat standard format, see details: CometAPI Chat Documentation

- o3 & o4-mini Deep Research Agents - Get in-depth analysis reports with web-connected research agents!

🧠 Advanced Analysis: Supports multi-step reasoning and provides reports with citations.
🤖 Available Models: o3-deep-research, o3-deep-research-2025-06-26, o4-mini-deep-research, o4-mini-deep-research-2025-06-26
📚 How to Call: The four deep research models above must be called using the following format:

curl --location 'https://api.cometapi.com/v1/responses'
--header 'Authorization: Bearer sk-xxxxx'
--header 'Content-Type: application/json'
--data '{ "model": "o3-deep-research-2025-06-26", "stream": true, "reasoning": { "summary": "detailed" }, "tools": [ { "type": "web_search_preview" } ], "input": "who are you" }'

- Volcano Engine Video & Image Models - Experience powerful new video and image models!

🎬 Video Generation: Create videos from images (bytedance-seedance-1-0-pro, bytedance-seedance-1-0-lite-i2v-250428) or text (bytedance-seedance-1-0-lite-t2v-250428).
🎨 Image Generation & Editing: Generate images with bytedance-seedream-3.0-t2i or edit them using prompts with bytedance-seedEdit-3.0-i2i.

🌟 2025-07-31

🚀 Feature Updates: MJ Video Generation, Flux-Kontext Multi-Image Reference, Kling-v1-6 Multi-Image Reference

- MJ Video Generation - Transform static images into dynamic video effects!

🎬 New: /mj/submit/imagine endpoint now supports video generation
🎨 Use cases: Animated effects, creative video generation
📚 View Documentation

- Flux-Kontext Multi-Image Reference - Enhanced AI creation!

🖼️ Update: Now supports up to 4 reference images (previously 1)
🔧 Models: flux-kontext-max and flux-kontext-pro only
📚 View Documentation

- Kling-v1-6 Multi-Image Reference - Better video quality!

📸 Feature: Up to 4 reference images for improved generation
🎯 Model: kling-v1-6 only
📚 View Documentation

🌟 2025-07-11

🚀 CometAPI supports Claude Code!

Add power to your development workflow. We're excited to announce that CometAPI now fully supports the powerful Claude Code.

What does this mean for you?

Top Artificial Intelligence features: Easily generate, debug and optimize code using models built specifically for developers.
⚙️ Flexible Model Selection: Our comprehensive range of models allows you to develop more seamlessly.
Seamless Integration: APIs are always available. Integrate Claude Code directly into your existing workflow in minutes.

Ready to build faster? Please click on the link below to make a call.

Click: https://apidoc.cometapi.com/doc-1266358

Endringslogg

CometAPI Update

Developer Documentation Quick Integration Guide (Chat Format)

DeepSeek, Qwen, and Doubao Model Retirement & Migration Notice

Some routes may experience capacity limits or become unavailable earlier due to upstream lifecycle and capacity adjustments. Please complete migration and production validation in advance.

2026-07-10

CometAPI Adds GPT-5.6 Series

2026-07-09

grok-4.5 Now Available

Seedream 5.0 Pro Now Available

Azure OpenAI GPT-4 Series Deprecation & Migration

2026-07-02

Claude Fable 5 Access Restored

2026-07-01

CometAPI Adds Gemini 3.1 Flash Lite Image and Claude Sonnet 5

2026-06-26

CometAPI now fully integrates HappyHorse 1.1 and Kling 3.0!

2026-06-23

2026-06-17

2026-06-13

CometAPI Removes Claude Fable 5

claude-fable-5

2026-06-12

CometAPI Adds Kimi K2.7 Code

kimi-k2.7-code

🌟 2026-06-10

🎉 CometAPI Adds Claude Fable 5! 🎉

🔹 claude-fable-5

🎉 CometAPI Adds Qwen3.7 Plus! 🎉

🔹 qwen3.7-plus

⚠️ Model Deprecation Notice

🎬 CometAPI: New Video Models Now Support /v1/videos

🤖 CometAPI: minimax-m3 Model Now Available

⚠️ CometAPI: Veo3 Series Supply and Billing Adjustment

🌟 2026-05-29

🎉 CometAPI Adds Claude Opus 4.8! 🎉

🔹 claude-opus-4-8

🎨 CometAPI: Midjourney V8 Support

⚠️ CometAPI: Model Deprecation Notice

May 29, 2026 — 12:00 PM UTC

June 15, 2026

July 23, 2026

October 23, 2026

🌟 2026-05-22

🎉 CometAPI Adds Qwen3.7-Max! 🎉

🔹 qwen3.7-max

🌟 2026-05-20

🎉 CometAPI Adds Gemini-3.5-Flash! 🎉

🔹 gemini-3.5-flash

🚀 🧠 CometAPI: Grok-4.3 Now Available

💰 📉 CometAPI: Grok-4.2 Series Price Reduction

⚠️ 🔄 CometAPI: Grok Legacy Model Deprecation Notice

⚠️ 🔄 CometAPI: DeepSeek-Chat / DeepSeek-Reasoner Deprecation Notice

💰 🎨 CometAPI: Doubao Seedream Image Model Pricing Update

🌟 2026-04-25

🎉 CometAPI Adds GPT-5.5 & GPT-5.5-Pro Models! 🎉

🔹 GPT-5.5

🔹 GPT-5.5-Pro

🌟 2026-04-24

🎉 CometAPI Adds GPT-5.5 & GPT-Image-2 Series! 🎉

🔹 GPT-5.5 Series

🔹 GPT-Image-2 Series

🎉 CometAPI Adds DeepSeek V4 Series! 🎉

🔹 DeepSeek V4

🌟 2026-04-21

🎉 CometAPI Adds GPT-Image-2! 🎉

🔹 gpt-image-2

🌟 2026-04-20

🎉 CometAPI Adds Doubao Seedance 2.0 Video Generation Models! 🎉

🔹 doubao-seedance-2-0 / doubao-seedance-2-0-fast

⚠️ Important Notice

📋 Parameters

🚀 Usage Example

📖 Documentation

🌟 2026-04-16

🎉 CometAPI Adds Claude Opus 4.7! 🎉

🔹 claude-opus-4-7

🎉 CometAPI Adds Kimi K2.6, Qwen3.6-Plus, and GLM-5.1! 🎉

🔹 kimi-k2.6

🔹 qwen3.6-plus

🎬 CometAPI: New Video Models Now Support `/v1/videos`

🤖 CometAPI: `minimax-m3` Model Now Available

🔹 `gpt-5.2-codex` (For Professional Code Tasks)