模型支持企业博客
500+ AI 模型 API,一次搞定,就在 CometAPI
模型 API
开发者
快速入门文档API 仪表板
资源
AI 模型博客企业更新日志关于
2025 CometAPI。保留所有权利。隐私政策服务条款
Home/Models/OpenAI/TTS
O

TTS

输入:$12/M
输出:$12/M
OpenAI Text-to-Speech
新
商用
概览
功能亮点
定价
API
版本

Technical Specifications of tts

AttributeDetails
Model Nametts
ProviderOpenAI
CategoryText-to-Speech
InputText
OutputAudio
Context WindowN/A
Typical Use CasesVoice generation, narration, accessibility, audio content creation
PricingRefer to CometAPI pricing for tts

What is tts?

tts is CometAPI’s platform identifier for OpenAI Text-to-Speech. It enables developers to convert written text into natural-sounding audio for applications such as assistants, audiobooks, customer support tools, accessibility features, and media workflows.

Through CometAPI, tts can be accessed using a unified API format, making it easier to integrate speech synthesis into products without managing multiple provider-specific interfaces. It is suited for teams that want reliable text-to-audio generation in a straightforward API workflow.

Main features of tts

  • Natural speech generation: Converts input text into clear, human-like spoken audio suitable for many application scenarios.
  • Unified API access: Available through CometAPI’s standardized interface, simplifying integration and provider management.
  • Developer-friendly workflow: Designed for quick implementation in apps, services, and backend pipelines that need speech output.
  • Broad application fit: Useful for accessibility tools, virtual assistants, content narration, training materials, and interactive experiences.
  • Scalable deployment: Can be used in prototypes as well as production systems that require consistent text-to-speech generation.

How to access and integrate tts

Step 1: Sign Up for API Key

First, sign up for a CometAPI account and obtain your API key. After logging into the dashboard, create or copy your API key and store it securely. This key will be used to authenticate all requests to the tts API.

Step 2: Send Requests to tts API

Use CometAPI’s standard API endpoint and specify the model as tts. Then send your input text in the request body together with any required parameters.

curl --request POST \
  --url https://api.cometapi.com/v1/audio/speech \
  --header "Authorization: Bearer YOUR_COMETAPI_KEY" \
  --header "Content-Type: application/json" \
  --data '{
    "model": "tts",
    "input": "Hello! This is a text-to-speech request using CometAPI."
  }'

Step 3: Retrieve and Verify Results

After submitting the request, CometAPI will return the generated audio result or a response containing the audio output details, depending on your integration method. Verify that the response is successful, confirm the audio content matches the input text, and handle any API errors or retries as needed in your application.

TTS 的功能

了解 TTS 的核心能力,帮助提升性能与可用性,并改善整体体验。

TTS 的定价

查看 TTS 的竞争性定价,满足不同预算与使用需求,灵活方案确保随需求扩展。
Comet 价格 (USD / M Tokens)官方定价 (USD / M Tokens)折扣
输入:$12/M
输出:$12/M
输入:$15/M
输出:$15/M
-20%

TTS 的示例代码与 API

获取完整示例代码与 API 资源,简化 TTS 的集成流程,我们提供逐步指导,助你发挥模型潜能。

TTS 的版本

TTS 可能存在多个快照,原因包括:更新后保持一致性需要保留旧版、给开发者留出迁移窗口,以及全球/区域端点提供的优化差异。具体差异请参考官方文档。
version
kling_tts
tts-1-1106
tts-1-hd-1106
qwen-tts-realtime
qwen3-tts-flash-realtime-2025-11-27
qwen3-tts-vc-realtime-2026-01-15
qwen3-tts-vd-realtime-2025-12-16
tts-1
tts-1-hd
qwen3-tts-flash-2025-11-27
gpt-4o-mini-tts

更多模型

O

gpt-realtime-1.5

输入:$3.2/M
输出:$12.8/M
用于音频输入与音频输出的最佳语音模型。
O

gpt-audio-1.5

输入:$2/M
输出:$8/M
用于在 Chat Completions 中实现音频输入与音频输出的最佳语音模型。
O

Whisper-1

输入:$24/M
输出:$24/M
Speech to text, creating translations
K

Kling TTS

每次请求:$0.006608
[Speech Synthesis] Newly launched: text-to-broadcast audio online, with preview function ● Can simultaneously generate audio_id, usable with any Keling API.
K

Kling video-to-audio

K

Kling video-to-audio

每次请求:$0.03304
Kling video-to-audio
K

Kling text-to-audio

K

Kling text-to-audio

每次请求:$0.03304
Kling text-to-audio

相关博客

ChatGPT 能进行文本转语音吗?2026 年最新语音与 TTS 模型指南
Apr 2, 2026

ChatGPT 能进行文本转语音吗?2026 年最新语音与 TTS 模型指南

ChatGPT 可以进行文本转语音,不过答案取决于你所指的具体含义。在 ChatGPT 应用中,Voice 让 ChatGPT 能够语音朗读,并且近期已更新,使其更好地遵循指令并更有效地使用诸如网页搜索之类的工具。对于开发者,OpenAI 还通过 audio/speech 端点提供专用的文本转语音 API,包含的模型有 gpt-4o-mini-tts、tts-1 和 tts-1-hd。OpenAI 称,其最新的 TTS 快照在 Common Voice 和 FLEURS 上的词错误率相比上一代降低了约 35%。