128K

Chat

MoonshotAI

Kimi K2 API

Kimi K2 is an open‑source, trillion‑parameter Mixture‑of‑Experts language model with a 128K‑token context window, optimized for high‑performance coding, agentic reasoning, and efficient inference.

Get Free API Key

import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.cometapi.com/v1",
    api_key="<YOUR_API_KEY>",    
)

response = client.chat.completions.create(
    model="Kimi K2",
    messages=[
        {
            "role": "system",
            "content": "You are an AI assistant who knows everything.",
        },
        {
            "role": "user",
            "content": "Tell me, why is the sky blue?"
        },
    ],
)

message = response.choices[0].message.content

print(f"Assistant: {message}")

All AI Models in One API
500+ AI Models

Free For A Limited Time! Register Now

Get 1M Free Token Instantly！

Kimi K2 API

Kimi K2 API is an open‑source, trillion‑parameter Mixture‑of‑Experts language model with a 128K‑token context window, optimized for high‑performance coding, agentic reasoning, and efficient inference.

Basic Information & Features

Model Name: Kimi K2
Architecture: Mixture‑of‑Experts (MoE)
Total Parameters: 1 trillion
Activated Parameters: 32 billion per forward pass
Context Length: 128 K tokens
Vocabulary Size: 160 K tokens
Primary Use Cases: Coding, tool integration, complex task decomposition, general reasoning.

Technical Architecture

Kimi K2 employs a 384‑expert MoE design, selecting 8 experts per token to balance performance with inference efficiency. It comprises 61 layers, including 1 dense layer, and utilizes Multi‑Layer Attention (MLA) alongside the SwiGLU activation function. Training leveraged the Muon optimizer over 15.5 trillion tokens, ensuring stability and high throughput across diverse benchmarks.

Benchmark Performance

SWE‑bench Verified: 65.8% single‑attempt accuracy—surpassing GPT‑4.1’s 54.6% and trailing only Claude Sonnet 4 among top models.
Multilingual SWE‑bench: 47.3% accuracy, leading open‑source contenders.
LiveCodeBench: 53.7%, the highest among non‑proprietary models.
EvalPlus: 80.3, outperforming DeepSeek‑V3 and Qwen 2.5 series.

These results underscore Kimi K2’s state‑of‑the‑art coding proficiency and robust reasoning capabilities.

How to call Kimi K2 API from CometAPI

`Kimi K2` API Pricing in CometAPI，20% off the official price:

Input Tokens: $0.11/ M tokens
Output Tokens: $1.99/ M tokens

Required Steps

Log in to cometapi.com. If you are not our user yet, please register first
Get the access credential API key of the interface. Click “Add Token” at the API token in the personal center, get the token key: sk-xxxxx and submit.
Get the url of this site: https://api.cometapi.com/

Use Method

Select the “kimi-k2-0711-preview” endpoint to send the API request and set the request body. The request method and request body are obtained from our website API doc. Our website also provides Apifox test for your convenience.
Replace <YOUR_API_KEY> with your actual CometAPI key from your account.
Insert your question or request into the content field—this is what the model will respond to.
. Process the API response to get the generated answer.

CometAPI provides a fully compatible REST API—for seamless migration. Key details to API doc:

Base URL: https://api.cometapi.com/v1/chat/completions
Model Names: “kimi-k2-0711-preview“
Authentication: Bearer token via Authorization: Bearer YOUR_CometAPI_API_KEY header
Content-Type: application/json .

API Integration & Examples

Kimi K2 is accessible via CometAPI (OpenAI‑compatible) and the Moonshot AI API. Below is a Python snippet for a ChatCompletion call through CometAPI:

Python snippet for a ChatCompletion call through OpenRouter:

pythonimport openai

openai.api_key = "YOUR_OPENROUTER_API_KEY"
openai.api_base = "https://api.cometapi.com/v1/chat/completions"

messages = [
    {"role": "system",  "content": "You are a helpful assistant."},
    {"role": "user",    "content": "Summarize Kimi K2's main features."}
]

response = openai.ChatCompletion.create(
    model="moonshotai/kimi-k2",
    messages=messages,
    temperature=0.7,
    max_tokens=500
)

print(response.choices[0].message['content'])

model: "kimi-k2-0711-preview"
temperature: Controls response randomness (0.0–1.0)
max_tokens: Response length limit

Start Today

One API
Access 500+ AI Models!

Free For A Limited Time! Register Now
Get 1M Free Token Instantly！

Get Free API Key

API Docs

128K

Chat

MoonshotAI

Kimi K2 API

All AI Models in One API
500+ AI Models

Kimi K2 API

Basic Information & Features

Technical Architecture

Benchmark Performance

How to call Kimi K2 API from CometAPI

`Kimi K2` API Pricing in CometAPI，20% off the official price:

Required Steps

Use Method

API Integration & Examples

Start Today

One API
Access 500+ AI Models!

Models API

Developer

Resources

Get in touch

128K

Chat

MoonshotAI

Kimi K2 API

All AI Models in One API 500+ AI Models

Kimi K2 API

Basic Information & Features

Technical Architecture

Benchmark Performance

How to call Kimi K2 API from CometAPI

Kimi K2 API Pricing in CometAPI，20% off the official price:

Required Steps

Use Method

API Integration & Examples

Start Today

One API Access 500+ AI Models!

Related posts

What is Kimi K2? How to Access it?

Moonshot ‘s Kimi K2: A Overview of Next‑Generation Mixture‑of‑Experts Model

Models API

Developer

Resources

Get in touch

All AI Models in One API
500+ AI Models

How to call Kimi K2 API from CometAPI

`Kimi K2` API Pricing in CometAPI，20% off the official price:

One API
Access 500+ AI Models!