Home/Models/Anthropic/Claude Haiku 4.5
C

Claude Haiku 4.5

Input:$0.8/M
Output:$4/M
Context:200K
Max Output:8K
Fastest, most cost-effective model.
New
Commercial Use
Playground
Overview
Features
Pricing
API
Versions

Claude Haiku 4.5 is a purpose-optimized, smaller-class language model from Anthropic, released in mid-October 2025. It’s positioned as a fast, low-cost option in the Claude lineup that preserves strong capability on tasks like coding, agent orchestration, and interactive “computer-use” workflows while enabling much higher throughput and lower unit cost for enterprise deployments.

Key features

  • Speed & cost-efficiency: Haiku 4.5 is described as more than twice as fast as Sonnet 4 and about one-third the cost of Sonnet 4 (and much cheaper than Opus), making it attractive for scaled usage.
  • Extended thinking: First Haiku model to support extended thinking (summarized / interleaved thought, configurable thinking budgets) for deeper multi-step reasoning while balancing latency.
  • Tools & computer use: Full support for Claude tools (bash, code execution, text editor, web search, and computer-use automation). Designed for agentic workflows and sub-agent architectures.
  • Large context window: 200k token context window (with 1M context options available on larger models as beta for other model classes).

Technical details

  • Training data & cutoff: Haiku 4.5 was trained on a proprietary mix of public and licensed data with a training cutoff around February 2025.
  • Extended-thinking (a hybrid reasoning mode) is supported so the model can trade latency for deeper reasoning when requested.
  • Context window at release is 200,000 tokens, and the model is explicitly context-aware (it tracks how much of the window has been used).
  • Performance / throughput: Early community reports and Anthropic testing cite very high OTPS (output tokens/sec) and anecdotal speeds around ~200+ tokens/sec in some internal/early tests —far faster than many comparable mid-tier models.

Benchmark performance

SWE-Bench (coding): Haiku 4.5 scored ~73.3% on SWE-Bench Verified — a result Anthropic highlights as placing Haiku 4.5 among the world’s best coding models for its class.

Terminal / command-line / tool tests: Anthropic reported ~41% on Terminal-Bench (command-line focused) and comparable results to Sonnet 4 and several competing mid-range frontier models on many tool-use benchmarks.

Instruction-following & slide text: internal Anthropic examples claim Haiku 4.5 outperformed previous models on some instruction-following tasks (e.g., slide text generation: 65% vs 44% for a prior premium model in their benchmark).

Real-world automation / agent tasks: third-party evaluations and early adopters report competitive success rates on automated UI/agent tasks (for example, OSWorld-style or agent benchmarks reporting ≈50% success on complex automation in some tests), showing usefulness for scaled workflows though with nontrivial failure modes.

Limitations & safety notes

  • Not a frontier model: Anthropic explicitly classifies Haiku 4.5 as not frontier-advancing; it is optimized for efficiency rather than pushing the absolute state of the art. (Anthropic)
  • Occasional sensitive-topic behavior: in some scientific / bio-safety related prompts Haiku 4.5 sometimes returns high-level information with caveats rather than strict refusals; Anthropic flags that as an area under ongoing improvement.
  • Extended-thinking can change behavior (it sometimes increases asymmetry in responses).

Recommended use cases

  • Agentic coding & multi-agent orchestration: fast sub-agents, iterative code refactor, autotests and patch generation. (Good fit.)
  • Real-time, high-volume customer workflows: chat assistants, internal automation where cost-per-request matters. (Good fit.)
  • Tool-enabled workflows & computer control: automating GUI/CLI tasks, document workflows and tool chains where low latency helps. (Good fit.)
  • Not recommended (without controls): standalone roles that require frontier-level scientific sequence design or high-assurance biosecurity tasks. (Exercise caution.)

How to access Claude Haiku 4.5 API

Step 1: Sign Up for API Key

Log in to cometapi.com. If you are not our user yet, please register first. Sign into your CometAPI console. Get the access credential API key of the interface. Click “Add Token” at the API token in the personal center, get the token key: sk-xxxxx and submit.

cometapi key

Step 2: Send Requests to Claude Haiku 4.5 API

Select the “claude-haiku-4-5-20251001” endpoint to send the API request and set the request body. The request method and request body are obtained from our website API doc. Our website also provides Apifox test for your convenience. Replace <YOUR_API_KEY> with your actual CometAPI key from your account. base url is Anthropic Messages and Chat.

Insert your question or request into the content field—this is what the model will respond to . Process the API response to get the generated answer.

Step 3: Retrieve and Verify Results

Process the API response to get the generated answer. After processing, the API responds with the task status and output data.

FAQ

How does Claude Haiku 4.5 compare to Claude Sonnet 4 in coding performance?

Claude Haiku 4.5 matches Sonnet 4's performance on coding, computer use, and agent tasks while running 4-5x faster at a fraction of the cost. It scores 73.3% on SWE-bench Verified.

Is Claude Haiku 4.5 fast enough for real-time customer service agents?

Yes, Haiku 4.5 is optimized for latency-sensitive applications. It delivers near-instant responses ideal for customer service agents, chatbots, and interactive experiences where response time is critical.

Can Claude Haiku 4.5 power sub-agents in multi-agent coding systems?

Yes, Haiku 4.5 excels at parallelized execution and sub-agent orchestration, enabling multi-agent systems that tackle complex refactors, migrations, and large feature builds with quality and speed.

When should I use Claude Haiku 4.5 instead of Claude Sonnet 4.5?

Choose Haiku 4.5 for high-volume operations, free tier products, and parallelized sub-agents where speed and cost matter most. Use Sonnet 4.5 for complex reasoning that requires frontier intelligence.

Does Claude Haiku 4.5 support computer use and browser automation?

Yes, Haiku 4.5 delivers strong performance on computer use tasks, making it suitable for browser-based automation workflows that need speed over maximum accuracy.

Features for Claude Haiku 4.5

Explore the key features of Claude Haiku 4.5, designed to enhance performance and usability. Discover how these capabilities can benefit your projects and improve user experience.

Pricing for Claude Haiku 4.5

Explore competitive pricing for Claude Haiku 4.5, designed to fit various budgets and usage needs. Our flexible plans ensure you only pay for what you use, making it easy to scale as your requirements grow. Discover how Claude Haiku 4.5 can enhance your projects while keeping costs manageable.
Comet Price (USD / M Tokens)Official Price (USD / M Tokens)Discount
Input:$0.8/M
Output:$4/M
Per Second:$1
-20%

Sample code and API for Claude Haiku 4.5

Claude Haiku 4.5 is a purpose-optimized, smaller-class language model from Anthropic, released in mid-October 2025. It’s positioned as a fast, low-cost option in the Claude lineup that preserves strong capability on tasks like coding, agent orchestration, and interactive “computer-use” workflows while enabling much higher throughput and lower unit cost for enterprise deployments.
Python
JavaScript
Curl
import anthropic
import os

# Get your CometAPI key from https://api.cometapi.com/console/token, and paste it here
COMETAPI_KEY = os.environ.get("COMETAPI_KEY") or "<YOUR_COMETAPI_KEY>"
BASE_URL = "https://api.cometapi.com"

message = anthropic.Anthropic(
    base_url=BASE_URL,
    api_key=COMETAPI_KEY,
)
messages = message.messages.create(
    model="claude-haiku-4-5-20251001",
    max_tokens=1024,
    messages=[{"role": "user", "content": "Hello, Claude"}],
)
print(messages.content[0].text)

Versions of Claude Haiku 4.5

The reason Claude Haiku 4.5 has multiple snapshots may include potential factors such as variations in output after updates requiring older snapshots for consistency, providing developers a transition period for adaptation and migration, and different snapshots corresponding to global or regional endpoints to optimize user experience. For detailed differences between versions, please refer to the official documentation.
VersionDescription
claude-haiku-4-5-20251001
cometapi-haiku-4-5-20251001cursor special version

More Models