Claude 4.5 is now on CometAPI

  • Home
  • Models
    • Grok 4 API
    • Suno v4.5
    • GPT-image-1 API
    • GPT-4.1 API
    • Qwen 3 API
    • Llama 4 API
    • GPT-4o API
    • GPT-4.5 API
    • Claude Opus 4 API
    • Claude Sonnet 4 API
    • DeepSeek R1 API
    • Gemini2.5 pro
    • Runway Gen-3 Alpha API
    • FLUX 1.1 API
    • Kling 1.6 Pro API
    • All Models
  • Enterprise
  • Pricing
  • API Docs
  • Blog
  • Contact
Sign Up
Log in
Technology

OpenAI Responses API gets a major upgrade instead of Assistants API

2025-05-28 anna No comments yet

OpenAI has rolled out a significant upgrade to its Responses API, introducing a suite of powerful tools and enterprise-grade features that transform how developers build agentic applications. Announced on May 21, 2025, this release builds upon the initial Responses API launched in March 2025, which replaced the Assistants API and has already processed trillions of tokens across models like GPT-4o and the o-series reasoning engines.

How It Differs from the Traditional ChatGPT (Chat Completions) API

  • Agent-First Primitive vs. Text-Only Completions: Unlike the Chat Completions API, which returns plain text based on prompts, the Responses API is designed as a core primitive for building “agentic” experiences—allowing models to plan and execute multi-step tasks by calling external tools directly within their chain-of-thought.
  • Built-In Tool Orchestration: While Chat Completions offers function-calling capability, Responses unifies tool invocation—such as image generation or code execution—into a single, streamlined API call, reducing boilerplate and improving developer productivity.
  • Preserved Reasoning State: Models like o3 and o4-mini maintain reasoning tokens across calls and tool invocations, yielding richer contextual understanding and lower latency compared to stateless completions.
  • Enterprise-Grade Reliability: Features such as background mode for asynchronous tasks, reasoning summaries for auditability, and encrypted reasoning items for Zero Data Retention customers deliver stronger SLAs and privacy controls than the standard Chat Completions endpoint.

New Capabilities

  1. Remote MCP Server Support: Connect any Model Context Protocol server—Shopify, Stripe, Twilio, and more—to extend model context with third-party data sources via just a few lines of code .
  2. Native Image Generation: Access the gpt-image-1 model as a tool within Responses, enabling streamed previews and multi-turn edits without separate API calls .
  3. Integrated Code Interpreter: Perform data analysis, complex computations, and image manipulations directly within the agentic flow, boosting performance on industry benchmarks .
  4. Enhanced File Search: Query across multiple vector stores with attribute filters to pull relevant document snippets into context, simplifying knowledge-base integrations.
  5. Enterprise Features: Background mode to manage long-running reasoning tasks, automatic reasoning summaries for debugging, and encrypted reasoning items for compliant deployments .

Pricing and Availability

All new tools and features are available immediately in the Responses API for GPT-4o, GPT-4.1, and the o-series models (o1, o3, o3-mini, o4-mini); image generation is supported on o3 only. Pricing remains consistent with existing tool rates:

  • Image Generation: $5.00 per 1 M text input tokens, $10.00 per 1 M image input tokens, $40.00 per 1 M image output tokens (75% off cached inputs)
  • Code Interpreter: $0.03 per container execution
  • File Search: $0.10 per GB of vector storage per day(first GB free); $2.50 per 1 K tool calls
  • Remote MCP Servers: No extra fee—standard output token billing applies.

No separate Responses API fee—tokens are billed at the input/output rates of the selected model (e.g., GPT-4.1 at its published per-token rates).

Developers and enterprises can begin integrating these capabilities today via the client.responses.create endpoint. With these enhancements, OpenAI aims to empower more intelligent, reliable, and secure AI-driven applications across industries.The Responses API is live for all developers today, and the legacy Assistants API will be fully deprecated by mid-2026. Existing Assistants integrations can be migrated with minimal code changes, thanks to compatible request and response schemas .

Getting Started

CometAPI provides a unified REST interface that aggregates hundreds of AI models—including ChatGPT family—under a consistent endpoint, with built-in API-key management, usage quotas, and billing dashboards. Instead of juggling multiple vendor URLs and credentials.

Developers can access latest chatgpt API GPT-4.1 API through CometAPI. To begin, explore the model’s capabilities in the Playground and consult the API guide for detailed instructions. Before accessing, please make sure you have logged in to CometAPI and obtained the API key.

  • OpenAI
  • Responses API

One API
Access 500+ AI Models!

Free For A Limited Time! Register Now
Get Free Token Instantly!

Get Free API Key
API Docs
anna

Anna, an AI research expert, focuses on cutting-edge exploration of large language models and generative AI, and is dedicated to analyzing technical principles and future trends with academic depth and unique insights.

Post navigation

Previous
Next

Search

Start Today

One API
Access 500+ AI Models!

Free For A Limited Time! Register Now
Get Free Token Instantly!

Get Free API Key
API Docs

Categories

  • AI Company (2)
  • AI Comparisons (64)
  • AI Model (122)
  • guide (17)
  • Model API (29)
  • new (27)
  • Technology (508)

Tags

Anthropic API Black Forest Labs ChatGPT Claude Claude 3.7 Sonnet Claude 4 claude code Claude Opus 4 Claude Opus 4.1 Claude Sonnet 4 cometapi deepseek DeepSeek R1 DeepSeek V3 Gemini Gemini 2.0 Flash Gemini 2.5 Flash Gemini 2.5 Flash Image Gemini 2.5 Pro Google GPT-4.1 GPT-4o GPT -4o Image GPT-5 GPT-Image-1 GPT 4.5 gpt 4o grok 3 grok 4 Midjourney Midjourney V7 Minimax o3 o4 mini OpenAI Qwen Qwen 2.5 Qwen3 runway sora Stable Diffusion Suno Veo 3 xAI

Contact Info

Blocksy: Contact Info

Related posts

How Many Parameters does GPT-5 have
Technology

How Many Parameters does GPT-5 have

2025-10-18 anna No comments yet

OpenAI has not published an official parameter count for GPT-5 — from around 1.7–1.8 trillion parameters (dense-model style estimates) to tens of trillions if you count the total capacity of Mixture-of-Experts (MoE) style architectures. None of these numbers are officially confirmed, and differences in architecture (dense vs. MoE), parameter sharing, sparsity and quantization make a […]

How Many GPUs to train gpt-5
Technology

How Many GPUs to train gpt-5? All You Need to Know

2025-10-14 anna No comments yet

Training a state-of-the-art large language model (LLM) like GPT-5 is a massive engineering, logistical, and financial undertaking. Headlines and rumors about how many GPUs were used vary wildly — from a few tens of thousands to several hundreds of thousands — and part of that variance comes from changing hardware generations, efficiency gains in software, […]

How to Access Sora 2 — The latest complete guide to omnichannel
Technology

How to Access Sora 2 — The latest complete guide to omnichannel

2025-10-14 anna No comments yet

Sora 2 is one of the fastest-moving AI products of 2025: a next-generation video + audio generation system from OpenAI that produces short cinematic clips with synchronized audio, multi-shot coherence, improved physics, and a “cameos” system for inserting people into generated scenes. Because Sora 2 is new and evolving rapidly — launched in late September […]

500+ AI Model API,All In One API. Just In CometAPI

Models API
  • GPT API
  • Suno API
  • Luma API
  • Sora API
Developer
  • Sign Up
  • API DashBoard
  • Documentation
  • Quick Start
Resources
  • Pricing
  • Enterprise
  • Blog
  • AI Model API Articles
  • Discord Community
Get in touch
  • support@cometapi.com

© CometAPI. All Rights Reserved.  

  • Terms & Service
  • Privacy Policy