
OpenAI's Responses API Gets a Major Upgrade as the Successor to the Assistants API

2025-05-28 anna

OpenAI has rolled out a significant upgrade to its Responses API, introducing a suite of new tools and enterprise-grade features for developers building agentic applications. Announced on May 21, 2025, this release builds upon the initial Responses API launched in March 2025, which was introduced as the successor to the Assistants API and has already processed trillions of tokens across models like GPT-4o and the o-series reasoning models.

How It Differs from the Traditional ChatGPT (Chat Completions) API

  • Agent-First Primitive vs. Text-Only Completions: Unlike the Chat Completions API, which returns plain text based on prompts, the Responses API is designed as a core primitive for building “agentic” experiences—allowing models to plan and execute multi-step tasks by calling external tools directly within their chain-of-thought.
  • Built-In Tool Orchestration: While Chat Completions offers function-calling capability, Responses unifies tool invocation—such as image generation or code execution—into a single, streamlined API call, reducing boilerplate and improving developer productivity.
  • Preserved Reasoning State: Models like o3 and o4-mini maintain reasoning tokens across calls and tool invocations, yielding richer contextual understanding and lower latency compared to stateless completions.
  • Enterprise-Grade Reliability: Features such as background mode for asynchronous tasks, reasoning summaries for auditability, and encrypted reasoning items for Zero Data Retention customers deliver stronger SLAs and privacy controls than the standard Chat Completions endpoint.
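
To make the contrast above concrete, here is a minimal sketch of how the two request shapes differ. It only builds the request payloads, so it runs offline; the `send_responses` helper is an assumption about how you might wire it to the official `openai` Python package, and the response ID `resp_123` is a placeholder rather than a real identifier.

```python
from typing import Any, Optional


def chat_completions_payload(model: str, history: list[dict[str, str]]) -> dict[str, Any]:
    """Chat Completions is stateless: each call resends the full message history."""
    return {"model": model, "messages": list(history)}


def responses_payload(
    model: str,
    user_input: str,
    previous_response_id: Optional[str] = None,
    tools: Optional[list[dict[str, Any]]] = None,
) -> dict[str, Any]:
    """Responses can chain turns by ID and declare built-in tools in the same call."""
    payload: dict[str, Any] = {"model": model, "input": user_input}
    if previous_response_id is not None:
        # Lets the server reuse the earlier turn's state instead of resending it.
        payload["previous_response_id"] = previous_response_id
    if tools:
        payload["tools"] = tools
    return payload


def send_responses(payload: dict[str, Any]) -> Any:
    """Send a payload. Requires the `openai` package and OPENAI_API_KEY to be set."""
    from openai import OpenAI  # imported lazily so the builders above work offline

    return OpenAI().responses.create(**payload)


# First turn declares a built-in tool; the follow-up only references the prior turn.
first = responses_payload(
    "o4-mini",
    "Plot monthly revenue from sales.csv",
    tools=[{"type": "code_interpreter", "container": {"type": "auto"}}],
)
follow_up = responses_payload(
    "o4-mini", "Now add a trend line", previous_response_id="resp_123"  # placeholder ID
)
```

In practice the value passed as `previous_response_id` would come from the `id` of the response returned by the first call.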

New Capabilities

  1. Remote MCP Server Support: Connect any Model Context Protocol server (Shopify, Stripe, Twilio, and more) to extend model context with third-party data sources in just a few lines of code.
  2. Native Image Generation: Access the gpt-image-1 model as a tool within Responses, enabling streamed previews and multi-turn edits without separate API calls.
  3. Integrated Code Interpreter: Perform data analysis, complex computations, and image manipulations directly within the agentic flow, boosting performance on industry benchmarks.
  4. Enhanced File Search: Query across multiple vector stores with attribute filters to pull relevant document snippets into context, simplifying knowledge-base integrations.
  5. Enterprise Features: Background mode to manage long-running reasoning tasks, automatic reasoning summaries for debugging, and encrypted reasoning items for compliant deployments.
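
The capabilities listed above are mostly switched on through the `tools` list and a few request options. The sketch below shows one way those fragments might be assembled. The field names reflect the OpenAI documentation as best I recall it and should be checked against the current API reference; the server label, server URL, vector store ID, and filter key are all hypothetical placeholders.

```python
from typing import Any


def mcp_tool(label: str, url: str) -> dict[str, Any]:
    """Remote MCP server entry (label and URL are caller-supplied)."""
    return {"type": "mcp", "server_label": label, "server_url": url}


def file_search_tool(vector_store_ids: list[str], key: str, value: str) -> dict[str, Any]:
    """File search across several vector stores with a simple equality filter."""
    return {
        "type": "file_search",
        "vector_store_ids": list(vector_store_ids),
        "filters": {"type": "eq", "key": key, "value": value},
    }


IMAGE_TOOL: dict[str, Any] = {"type": "image_generation"}
CODE_TOOL: dict[str, Any] = {"type": "code_interpreter", "container": {"type": "auto"}}


def agent_request(model: str, prompt: str, tools: list[dict[str, Any]]) -> dict[str, Any]:
    """Base request that also asks for an automatic reasoning summary."""
    return {
        "model": model,
        "input": prompt,
        "tools": tools,
        "reasoning": {"summary": "auto"},
    }


def with_background(payload: dict[str, Any]) -> dict[str, Any]:
    """Copy of the payload marked to run asynchronously (poll for the result later)."""
    return {**payload, "background": True}


def with_encrypted_reasoning(payload: dict[str, Any]) -> dict[str, Any]:
    """Copy of the payload that skips storage and returns encrypted reasoning items."""
    return {**payload, "store": False, "include": ["reasoning.encrypted_content"]}


# Hypothetical values throughout: replace with your own server, store ID, and filter.
tools = [
    mcp_tool("example_shop", "https://mcp.example.com/sse"),
    file_search_tool(["vs_123"], "region", "emea"),
    IMAGE_TOOL,
    CODE_TOOL,
]
request = agent_request("o3", "Summarise last quarter's orders and chart them", tools)
long_running = with_background(request)
private = with_encrypted_reasoning(request)
```

Each payload could then be passed to `client.responses.create(**payload)` with the official SDK. Building the options as small functions keeps each capability independently testable before any network call is made.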

Pricing and Availability

All new tools and features are available immediately in the Responses API for GPT-4o, GPT-4.1, and the o-series models (o1, o3, o3-mini, o4-mini); among the o-series reasoning models, image generation is supported on o3 only. Pricing remains consistent with existing tool rates:

  • Image Generation: $5.00 per 1M text input tokens, $10.00 per 1M image input tokens, $40.00 per 1M image output tokens (75% off cached inputs)
  • Code Interpreter: $0.03 per container execution
  • File Search: $0.10 per GB of vector storage per day (first GB free); $2.50 per 1K tool calls
  • Remote MCP Servers: No extra fee; standard output token billing applies.

No separate Responses API fee—tokens are billed at the input/output rates of the selected model (e.g., GPT-4.1 at its published per-token rates).
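
As a rough illustration of how the tool rates above add up, the following sketch estimates tool charges for a hypothetical workload. It uses only the figures quoted in this article, ignores the cached-input discount and the model's own token charges, and assumes the free first GB of file search storage applies to each day's total; the workload numbers are invented for the example.

```python
# Rates as quoted in this article (USD).
IMAGE_TEXT_IN_PER_M = 5.00
IMAGE_IMAGE_IN_PER_M = 10.00
IMAGE_OUT_PER_M = 40.00
CODE_INTERPRETER_PER_CONTAINER = 0.03
FILE_SEARCH_PER_GB_DAY = 0.10
FILE_SEARCH_PER_1K_CALLS = 2.50
FILE_SEARCH_FREE_GB = 1.0


def image_generation_cost(text_in: int, image_in: int, image_out: int) -> float:
    """Cost of image generation from token counts (no cached-input discount)."""
    return (
        text_in * IMAGE_TEXT_IN_PER_M
        + image_in * IMAGE_IMAGE_IN_PER_M
        + image_out * IMAGE_OUT_PER_M
    ) / 1_000_000


def code_interpreter_cost(containers: int) -> float:
    """Flat fee per container execution."""
    return containers * CODE_INTERPRETER_PER_CONTAINER


def file_search_cost(storage_gb: float, days: int, tool_calls: int) -> float:
    """Daily storage beyond the free GB, plus a per-call fee."""
    billable_gb = max(0.0, storage_gb - FILE_SEARCH_FREE_GB)
    storage = billable_gb * FILE_SEARCH_PER_GB_DAY * days
    calls = tool_calls / 1000 * FILE_SEARCH_PER_1K_CALLS
    return storage + calls


# Invented workload: 2,000 text tokens in and 4,000 image tokens out,
# 10 containers, and 5 GB stored for 30 days with 2,000 search calls.
total = (
    image_generation_cost(2_000, 0, 4_000)
    + code_interpreter_cost(10)
    + file_search_cost(5.0, 30, 2_000)
)
print(f"{total:.2f}")  # prints 17.47
```

Here storage dominates: 4 billable GB over 30 days comes to $12.00, the 2,000 search calls add $5.00, and image generation and code execution together account for the remaining $0.47.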

Developers and enterprises can begin integrating these capabilities today through the client.responses.create method of the OpenAI SDK. With these enhancements, OpenAI aims to enable more intelligent, reliable, and secure AI-driven applications across industries.

The Responses API is live for all developers today, and the legacy Assistants API will be fully deprecated by mid-2026. Existing Assistants integrations can be migrated with minimal code changes, thanks to compatible request and response schemas.

Getting Started

CometAPI provides a unified REST interface that aggregates hundreds of AI models, including the ChatGPT family, under a consistent endpoint, with built-in API-key management, usage quotas, and billing dashboards, so you do not have to juggle multiple vendor URLs and credentials.

Developers can access the latest ChatGPT APIs, such as the GPT-4.1 API, through CometAPI. To begin, explore the model's capabilities in the Playground and consult the API guide for detailed instructions. Before making requests, make sure you have logged in to CometAPI and obtained an API key.
