Technical Specifications of tts-1-1106
| Attribute | Details |
|---|---|
| Model ID | tts-1-1106 |
| Provider family | OpenAI text-to-speech model family |
| Primary capability | Converts text input into natural-sounding spoken audio |
| Typical endpoint | /v1/audio/speech |
| Optimization focus | Low-latency, realtime-oriented speech generation |
| Input modality | Text |
| Output modality | Audio |
| Supported output formats | mp3, opus, aac, flac, wav, pcm |
| Voice support | Compatible with OpenAI’s built-in TTS voices; the tts-1 / tts-1-hd family supports a smaller subset including alloy, ash, coral, echo, fable, nova, onyx, sage, and shimmer |
| Max input length | 4096 characters per request |
| Pricing reference | OpenAI lists TTS speech generation pricing at $15.00 per 1M characters for the TTS category |
| Compliance note | End users should be clearly informed when the voice they hear is AI-generated |
What is tts-1-1106?
tts-1-1106 is CometAPI’s platform identifier for an OpenAI text-to-speech model in the tts-1 family, designed to transform written text into spoken audio. OpenAI documents tts-1 as a model optimized for speed and realtime use cases, making it suitable for interactive applications that need fast speech generation rather than maximum offline rendering quality.
In practice, this model class is used for scenarios such as narration, voice assistants, accessibility features, conversational interfaces, and automated audio playback. It is accessed through the speech-generation workflow of the Audio API, where developers provide text, select a supported voice, and receive audio in a chosen output format.
Because the exact -1106 suffix appears to be a platform-side identifier rather than the public OpenAI model alias, the safest interpretation is that tts-1-1106 maps to the behavior and integration pattern of OpenAI’s tts-1 generation family. That means developers should expect a fast TTS model focused on responsive synthesis, standard speech endpoint usage, and built-in voice selection.
Main features of tts-1-1106
- Realtime-oriented speech generation: The underlying
tts-1family is optimized for speed, which makes it well suited for live applications, assistants, and other latency-sensitive audio experiences. - Natural-sounding text-to-audio conversion: It converts plain text into lifelike spoken output for playback, narration, and voice-enabled product features.
- Multiple output formats: Developers can request audio in
mp3,opus,aac,flac,wav, or rawpcm, which supports both consumer playback and lower-latency system integration. - Built-in voice options: The model family supports a set of preset voices, letting teams choose a delivery style that fits their product tone without training a custom speaker model.
- Straightforward API integration: The model is designed to work through the standard speech endpoint, reducing implementation complexity for teams already using OpenAI-compatible audio APIs.
- Language flexibility: OpenAI states its TTS stack generally follows Whisper language support, enabling speech generation across many languages even though voices are primarily optimized for English.
- Streaming-friendly usage: OpenAI’s speech API supports streamed audio delivery, allowing playback to begin before the full file is finished in suitable implementations.
- Practical for production apps: With documented rate limits, standardized endpoint behavior, and usage-policy guidance around AI voice disclosure, the model family is suitable for real application deployment.
How to access and integrate tts-1-1106
Step 1: Sign Up for API Key
Sign up on CometAPI and generate your API key from the dashboard. Store the key securely and configure it as an environment variable in your application so your backend can authenticate requests to the tts-1-1106 API.
Step 2: Send Requests to tts-1-1106 API
Send a POST request to the OpenAI-compatible audio speech endpoint through CometAPI, setting model to tts-1-1106 and including the input text plus any supported options such as voice and response_format.
curl --request POST \
--url https://api.cometapi.com/v1/audio/speech \
--header "Authorization: Bearer $COMETAPI_API_KEY" \
--header "Content-Type: application/json" \
--data '{
"model": "tts-1-1106",
"input": "Welcome to CometAPI text to speech.",
"voice": "alloy",
"response_format": "mp3"
}' \
--output speech.mp3
Step 3: Retrieve and Verify Results
Save the returned audio file or stream the response directly in your application, then verify that the speech content, selected voice, format, and playback quality match your expected output for tts-1-1106.