Technical Specifications of `gpt-realtime-mini`

Specification	Details
Model ID	`gpt-realtime-mini`
Model type	Realtime multimodal model
Description	An economical version of the real-time GPT—capable of responding to audio and text inputs in realtime via WebRTC, WebSocket, or SIP connections.
Input modalities	Text, audio, image
Output modalities	Text, audio
Context window	32,000 tokens
Max output tokens	4,096 tokens
Supported interfaces	WebRTC, WebSocket, SIP
Supported features	Function calling supported; structured outputs, fine-tuning, distillation, and predicted outputs not supported
Recommended use	Low-latency voice agents, realtime multimodal applications, and cost-sensitive interactive experiences

What is `gpt-realtime-mini`?

gpt-realtime-mini is a cost-efficient realtime model designed for applications that need fast, natural interaction with users through live audio and text. It is intended for low-latency multimodal experiences, allowing developers to build assistants that can listen, respond, and stream output in realtime rather than relying on slower multi-step pipelines.

Compared with larger realtime variants, gpt-realtime-mini is positioned as the economical option for developers who want realtime speech and text capabilities while managing cost and maintaining responsive performance. It works across browser, server, and telephony-style connection patterns through WebRTC, WebSocket, and SIP.

Main features of `gpt-realtime-mini`

Realtime audio and text interaction: Supports low-latency conversations with streaming input and output, making it suitable for live assistants, voice bots, and interactive agents.
Cost-efficient deployment: Positioned as an economical version of the realtime model family, making it attractive for high-volume or budget-sensitive applications.
Multiple connection methods: Can be integrated through WebRTC for browser clients, WebSocket for server-side systems, and SIP for telephony or VoIP scenarios.
Multimodal input support: Accepts text, audio, and image input, enabling richer user interactions and more flexible application design.
Speech-capable output: Produces both text and audio output, which is useful for conversational interfaces and spoken response systems.
Function calling support: Supports function calling, allowing applications to connect the model to tools, workflows, or backend actions during realtime sessions.
Built for voice agents: Well suited for speech-to-speech assistants and realtime customer interaction experiences where interruption handling and fast turn-taking matter.

How to access and integrate `gpt-realtime-mini`

To get started, sign up on CometAPI and generate your API key from the dashboard. Once you have your key, keep it secure and store it in your environment variables for server-side use.

Step 2: Connect to `gpt-realtime-mini` API

The Realtime API uses WebSocket connections. Connect to CometAPI's WebSocket endpoint:

const ws = new WebSocket(
  "wss://api.cometapi.com/v1/realtime?model=gpt-realtime-mini",
  {
    headers: {
      "Authorization": "Bearer " + process.env.COMETAPI_API_KEY,
      "OpenAI-Beta": "realtime=v1"
    }
  }
);

ws.on("open", () => {
  ws.send(JSON.stringify({
    type: "session.update",
    session: {
      modalities: ["text", "audio"],
      instructions: "You are a helpful assistant."
    }
  }));
});

ws.on("message", (data) => {
  console.log(JSON.parse(data));
});

Step 3: Retrieve and Verify Results

The Realtime API streams responses through the WebSocket connection as server-sent events. Listen for response.audio.delta events for audio output and response.text.delta for text. Verify the session is established and responses are streaming correctly.

คุณสมบัติสำหรับ gpt-realtime-mini

สำรวจคุณสมบัติหลักของ gpt-realtime-mini ที่ออกแบบมาเพื่อเพิ่มประสิทธิภาพและความสะดวกในการใช้งาน ค้นพบว่าความสามารถเหล่านี้สามารถเป็นประโยชน์ต่อโครงการของคุณและปรับปรุงประสบการณ์ของผู้ใช้ได้อย่างไร

ราคาสำหรับ gpt-realtime-mini

สำรวจราคาที่แข่งขันได้สำหรับ gpt-realtime-mini ที่ออกแบบมาให้เหมาะสมกับงบประมาณและความต้องการการใช้งานที่หลากหลาย แผนการบริการที่ยืดหยุ่นของเรารับประกันว่าคุณจะจ่ายเฉพาะสิ่งที่คุณใช้เท่านั้น ทำให้สามารถขยายขนาดได้ง่ายเมื่อความต้องการของคุณเพิ่มขึ้น ค้นพบว่า gpt-realtime-mini สามารถยกระดับโปรเจกต์ของคุณได้อย่างไรในขณะที่ควบคุมต้นทุนให้อยู่ในระดับที่จัดการได้

ราคา Comet (USD / M Tokens)	ราคาทางการ (USD / M Tokens)	ส่วนลด
อินพุต:$0.48/M เอาต์พุต:$0.96/M	อินพุต:$0.6/M เอาต์พุต:$1.2/M	-20%

โค้ดตัวอย่างและ API สำหรับ gpt-realtime-mini

เข้าถึงโค้ดตัวอย่างที่ครอบคลุมและทรัพยากร API สำหรับ gpt-realtime-mini เพื่อปรับปรุงกระบวนการผสานรวมของคุณ เอกสารประกอบที่มีรายละเอียดของเราให้คำแนะนำทีละขั้นตอน ช่วยให้คุณใช้ประโยชน์จากศักยภาพเต็มรูปแบบของ gpt-realtime-mini ในโครงการของคุณ

รุ่นของ gpt-realtime-mini

เหตุผลที่ gpt-realtime-mini มีสแนปช็อตหลายตัวอาจรวมถึงปัจจัยที่อาจเกิดขึ้น เช่น ความแปรผันของผลลัพธ์หลังการอัปเดตที่ต้องการสแนปช็อตรุ่นเก่าสำหรับความสม่ำเสมอ การให้ช่วงเวลาเปลี่ยนผ่านสำหรับนักพัฒนาเพื่อการปรับตัวและการย้ายข้อมูล และสแนปช็อตที่แตกต่างกันซึ่งสอดคล้องกับเอนด์พอยต์ระดับโลกหรือระดับภูมิภาคเพื่อเพิ่มประสิทธิภาพประสบการณ์ผู้ใช้ สำหรับความแตกต่างโดยละเอียดระหว่างเวอร์ชัน โปรดอ้างอิงเอกสารทางการ

version
gpt-realtime-mini

Technical Specifications of `gpt-realtime-mini`

Specification	Details
Model ID	`gpt-realtime-mini`
Model type	Realtime multimodal model
Description	An economical version of the real-time GPT—capable of responding to audio and text inputs in realtime via WebRTC, WebSocket, or SIP connections.
Input modalities	Text, audio, image
Output modalities	Text, audio
Context window	32,000 tokens
Max output tokens	4,096 tokens
Supported interfaces	WebRTC, WebSocket, SIP
Supported features	Function calling supported; structured outputs, fine-tuning, distillation, and predicted outputs not supported
Recommended use	Low-latency voice agents, realtime multimodal applications, and cost-sensitive interactive experiences

What is `gpt-realtime-mini`?

Main features of `gpt-realtime-mini`

Realtime audio and text interaction: Supports low-latency conversations with streaming input and output, making it suitable for live assistants, voice bots, and interactive agents.
Cost-efficient deployment: Positioned as an economical version of the realtime model family, making it attractive for high-volume or budget-sensitive applications.
Multiple connection methods: Can be integrated through WebRTC for browser clients, WebSocket for server-side systems, and SIP for telephony or VoIP scenarios.
Multimodal input support: Accepts text, audio, and image input, enabling richer user interactions and more flexible application design.
Speech-capable output: Produces both text and audio output, which is useful for conversational interfaces and spoken response systems.
Function calling support: Supports function calling, allowing applications to connect the model to tools, workflows, or backend actions during realtime sessions.
Built for voice agents: Well suited for speech-to-speech assistants and realtime customer interaction experiences where interruption handling and fast turn-taking matter.

How to access and integrate `gpt-realtime-mini`

To get started, sign up on CometAPI and generate your API key from the dashboard. Once you have your key, keep it secure and store it in your environment variables for server-side use.

Step 2: Connect to `gpt-realtime-mini` API

The Realtime API uses WebSocket connections. Connect to CometAPI's WebSocket endpoint:

const ws = new WebSocket(
  "wss://api.cometapi.com/v1/realtime?model=gpt-realtime-mini",
  {
    headers: {
      "Authorization": "Bearer " + process.env.COMETAPI_API_KEY,
      "OpenAI-Beta": "realtime=v1"
    }
  }
);

ws.on("open", () => {
  ws.send(JSON.stringify({
    type: "session.update",
    session: {
      modalities: ["text", "audio"],
      instructions: "You are a helpful assistant."
    }
  }));
});

ws.on("message", (data) => {
  console.log(JSON.parse(data));
});

gpt-realtime-mini

Technical Specifications of `gpt-realtime-mini`

What is `gpt-realtime-mini`?

Main features of `gpt-realtime-mini`

How to access and integrate `gpt-realtime-mini`

Step 2: Connect to `gpt-realtime-mini` API

Step 3: Retrieve and Verify Results

คุณสมบัติสำหรับ gpt-realtime-mini

ราคาสำหรับ gpt-realtime-mini

โค้ดตัวอย่างและ API สำหรับ gpt-realtime-mini

รุ่นของ gpt-realtime-mini

โมเดลเพิ่มเติม

Nano Banana 2

Claude Opus 4.6

Claude Sonnet 4.6

GPT-5.4 nano

GPT-5.4 mini

Claude Mythos Preview

Claude Mythos Preview

gpt-realtime-mini

Technical Specifications of `gpt-realtime-mini`

What is `gpt-realtime-mini`?

Main features of `gpt-realtime-mini`

How to access and integrate `gpt-realtime-mini`

Step 2: Connect to `gpt-realtime-mini` API

Step 3: Retrieve and Verify Results

คุณสมบัติสำหรับ gpt-realtime-mini

ราคาสำหรับ gpt-realtime-mini

โค้ดตัวอย่างและ API สำหรับ gpt-realtime-mini

รุ่นของ gpt-realtime-mini

โมเดลเพิ่มเติม

Nano Banana 2

Claude Opus 4.6

Claude Sonnet 4.6

GPT-5.4 nano

GPT-5.4 mini

Claude Mythos Preview

Claude Mythos Preview

gpt-realtime-mini

Technical Specifications of gpt-realtime-mini

What is gpt-realtime-mini?

Main features of gpt-realtime-mini

How to access and integrate gpt-realtime-mini

Step 1: Sign Up for API Key

Step 2: Connect to gpt-realtime-mini API

Step 3: Retrieve and Verify Results

โมเดลเพิ่มเติม

Nano Banana 2

Claude Opus 4.6

Claude Sonnet 4.6

GPT-5.4 nano

GPT-5.4 mini

Claude Mythos Preview

Claude Mythos Preview

gpt-realtime-mini

Technical Specifications of gpt-realtime-mini

What is gpt-realtime-mini?

Main features of gpt-realtime-mini

How to access and integrate gpt-realtime-mini

Step 1: Sign Up for API Key

Step 2: Connect to gpt-realtime-mini API

Step 3: Retrieve and Verify Results

โมเดลเพิ่มเติม

Nano Banana 2

Claude Opus 4.6

Claude Sonnet 4.6

GPT-5.4 nano

GPT-5.4 mini

Claude Mythos Preview

Claude Mythos Preview

Technical Specifications of `gpt-realtime-mini`

What is `gpt-realtime-mini`?

Main features of `gpt-realtime-mini`

How to access and integrate `gpt-realtime-mini`

Step 2: Connect to `gpt-realtime-mini` API

Technical Specifications of `gpt-realtime-mini`

What is `gpt-realtime-mini`?

Main features of `gpt-realtime-mini`

How to access and integrate `gpt-realtime-mini`

Step 2: Connect to `gpt-realtime-mini` API