ModelsSupportEnterpriseBlog
500+ AI Model API, All In One API.Just In CometAPI
Models API
Developer
Quick StartDocumentationAPI Dashboard
Resources
AI ModelsBlogEnterpriseChangelogAbout
2025 CometAPI. All right reserved.Privacy PolicyTerms of Service
Home/Models/OpenAI/TTS
O

TTS

Input:$12/M
Output:$12/M
OpenAI Text-to-Speech
New
Commercial Use
Overview
Features
Pricing
API
Versions

Technical Specifications of tts

AttributeDetails
Model Nametts
ProviderOpenAI
CategoryText-to-Speech
InputText
OutputAudio
Context WindowN/A
Typical Use CasesVoice generation, narration, accessibility, audio content creation
PricingRefer to CometAPI pricing for tts

What is tts?

tts is CometAPI’s platform identifier for OpenAI Text-to-Speech. It enables developers to convert written text into natural-sounding audio for applications such as assistants, audiobooks, customer support tools, accessibility features, and media workflows.

Through CometAPI, tts can be accessed using a unified API format, making it easier to integrate speech synthesis into products without managing multiple provider-specific interfaces. It is suited for teams that want reliable text-to-audio generation in a straightforward API workflow.

Main features of tts

  • Natural speech generation: Converts input text into clear, human-like spoken audio suitable for many application scenarios.
  • Unified API access: Available through CometAPI’s standardized interface, simplifying integration and provider management.
  • Developer-friendly workflow: Designed for quick implementation in apps, services, and backend pipelines that need speech output.
  • Broad application fit: Useful for accessibility tools, virtual assistants, content narration, training materials, and interactive experiences.
  • Scalable deployment: Can be used in prototypes as well as production systems that require consistent text-to-speech generation.

How to access and integrate tts

Step 1: Sign Up for API Key

First, sign up for a CometAPI account and obtain your API key. After logging into the dashboard, create or copy your API key and store it securely. This key will be used to authenticate all requests to the tts API.

Step 2: Send Requests to tts API

Use CometAPI’s standard API endpoint and specify the model as tts. Then send your input text in the request body together with any required parameters.

curl --request POST \
  --url https://api.cometapi.com/v1/audio/speech \
  --header "Authorization: Bearer YOUR_COMETAPI_KEY" \
  --header "Content-Type: application/json" \
  --data '{
    "model": "tts",
    "input": "Hello! This is a text-to-speech request using CometAPI."
  }'

Step 3: Retrieve and Verify Results

After submitting the request, CometAPI will return the generated audio result or a response containing the audio output details, depending on your integration method. Verify that the response is successful, confirm the audio content matches the input text, and handle any API errors or retries as needed in your application.

Features for TTS

Explore the key features of TTS, designed to enhance performance and usability. Discover how these capabilities can benefit your projects and improve user experience.

Pricing for TTS

Explore competitive pricing for TTS, designed to fit various budgets and usage needs. Our flexible plans ensure you only pay for what you use, making it easy to scale as your requirements grow. Discover how TTS can enhance your projects while keeping costs manageable.
Comet Price (USD / M Tokens)Official Price (USD / M Tokens)Discount
Input:$12/M
Output:$12/M
Input:$15/M
Output:$15/M
-20%

Sample code and API for TTS

Access comprehensive sample code and API resources for TTS to streamline your integration process. Our detailed documentation provides step-by-step guidance, helping you leverage the full potential of TTS in your projects.

Versions of TTS

The reason TTS has multiple snapshots may include potential factors such as variations in output after updates requiring older snapshots for consistency, providing developers a transition period for adaptation and migration, and different snapshots corresponding to global or regional endpoints to optimize user experience. For detailed differences between versions, please refer to the official documentation.
version
kling_tts
tts-1-1106
tts-1-hd
tts-1-hd-1106
qwen3-tts-flash-2025-11-27
qwen3-tts-vc-realtime-2026-01-15
gpt-4o-mini-tts
tts-1
qwen-tts-realtime
qwen3-tts-flash-realtime-2025-11-27
qwen3-tts-vd-realtime-2025-12-16

More Models

O

gpt-realtime-1.5

Input:$3.2/M
Output:$12.8/M
The best voice model for audio in, audio out.
O

gpt-audio-1.5

Input:$2/M
Output:$8/M
The best voice model for audio in, audio out with Chat Completions.
O

Whisper-1

Input:$24/M
Output:$24/M
Speech to text, creating translations
K

Kling TTS

Per Request:$0.006608
[Speech Synthesis] Newly launched: text-to-broadcast audio online, with preview function ● Can simultaneously generate audio_id, usable with any Keling API.
K

Kling video-to-audio

K

Kling video-to-audio

Per Request:$0.03304
Kling video-to-audio
K

Kling text-to-audio

K

Kling text-to-audio

Per Request:$0.03304
Kling text-to-audio

Related Blog

Can ChatGPT Do Text to Speech? The Latest 2026 Guide to Voice, TTS Models
Apr 2, 2026

Can ChatGPT Do Text to Speech? The Latest 2026 Guide to Voice, TTS Models

ChatGPT can do text to speech, but the answer depends on what you mean. In the ChatGPT app, Voice lets ChatGPT speak aloud and has recently been updated to follow instructions better and use tools like web search more effectively. For developers, OpenAI also provides a dedicated text-to-speech API via the audio/speech endpoint, with models including gpt-4o-mini-tts, tts-1, and tts-1-hd. OpenAI says its latest TTS snapshot delivered roughly 35% lower word error rate on Common Voice and FLEURS compared with the previous generation.