H

hunyuan-turbos-20250313

Ingresso:$0.08912/M
Uscita:$0.2228/M
Uso commerciale

Technical Specifications of hunyuan-turbos-20250313

SpecificationDetails
Model IDhunyuan-turbos-20250313
Provider familyTencent Hunyuan TurboS
Release dateMarch 13, 2025
Model typeLarge language model for text generation, understanding, reasoning, and chat-style interactions
Architecture lineageHunyuan-TurboS, a hybrid Transformer-Mamba Mixture-of-Experts model family
Parameter scale56B activated parameters, 560B total parameters, according to Tencent’s Hunyuan-TurboS technical report repository
Context lengthHunyuan-TurboS is reported by Tencent as supporting up to 256K context; CometAPI lists hunyuan-turbos-20250313 with a maximum context length of 128,000 tokens on its model page, so integrators should verify the effective limit in their deployed endpoint configuration
Training scalePretrained on 16T high-quality tokens in the Hunyuan-TurboS technical description
Capability focusImproved math, logic, code, bilingual Chinese/English performance, text creation, text understanding, knowledge Q&A, and conversation
Commercial availabilityAvailable on CometAPI with commercial use listed on the model page
CometAPI pricingInput: $0.08912 per 1M tokens; Output: $0.2228 per 1M tokens

What is hunyuan-turbos-20250313?

hunyuan-turbos-20250313 is CometAPI’s platform identifier for the March 13, 2025 release of Tencent’s Hunyuan-TurboS flagship model. Tencent describes this version as an updated TurboS release with stronger reasoning ability, better user experience, upgraded pretraining scale, and improvements in math, logic, coding, and general Chinese/English usage quality.

At the model-family level, Hunyuan-TurboS is positioned as a hybrid Transformer-Mamba MoE large language model designed to combine efficient long-sequence processing with strong contextual understanding. Tencent’s public technical materials say the family uses adaptive long/short chain-of-thought behavior so it can respond quickly to simpler prompts while allocating deeper reasoning for harder tasks.

In practice, that makes hunyuan-turbos-20250313 a general-purpose API model suited to chatbots, knowledge assistants, writing tools, code-related workflows, multilingual applications, and other text-centric products that need a balance of speed, reasoning, and cost efficiency. This positioning is also consistent with CometAPI’s listing, which exposes the model as a commercial API endpoint.

Main features of hunyuan-turbos-20250313

  • Improved reasoning performance: Tencent states that the March 13, 2025 TurboS release delivers stronger thinking ability, with explicit gains in mathematics, logic, and coding-related tasks.
  • Hybrid Transformer-Mamba design: The Hunyuan-TurboS family combines Transformer contextual modeling with Mamba-based long-sequence efficiency, aiming to improve throughput and long-context handling.
  • Mixture-of-Experts scalability: Tencent’s technical description reports a 560B total-parameter MoE design with 56B activated parameters, supporting high capability without activating the full parameter set for every token.
  • Adaptive response mode: Hunyuan-TurboS is described as using adaptive long-short chain-of-thought, which helps the model shift between fast answers and deeper multi-step reasoning depending on task complexity.
  • Long-context support: Tencent reports up to 256K context support for the Hunyuan-TurboS family, which is useful for lengthy documents, multi-turn conversations, and large prompt windows; however, CometAPI’s listing for this specific endpoint shows 128,000 tokens, so developers should confirm the runtime ceiling in their account.
  • Strong multilingual and general text capability: Tencent highlights improved Chinese and English experience, including text creation, text understanding, knowledge answering, and open-ended conversation.
  • Efficiency-oriented deployment profile: Tencent’s public Hunyuan-TurboS materials emphasize strong benchmark results combined with lower inference cost than many reasoning-heavy models, making the family attractive for production workloads.
  • Commercial API availability through CometAPI: CometAPI exposes hunyuan-turbos-20250313 as an API model with listed token pricing and commercial-use availability, which simplifies access for developers who want to integrate Tencent’s model family through a unified API platform.

How to access and integrate hunyuan-turbos-20250313

Step 1: Sign Up for API Key

Sign up on CometAPI and create an API key from your dashboard. After that, store the key securely as an environment variable so your application can authenticate requests to the API. CometAPI lists hunyuan-turbos-20250313 as an available commercial model endpoint.

Step 2: Send Requests to hunyuan-turbos-20250313 API

Send OpenAI-compatible requests to CometAPI’s chat completions endpoint and set the model field to hunyuan-turbos-20250313.

curl https://api.cometapi.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $COMETAPI_API_KEY" \
  -d '{
    "model": "hunyuan-turbos-20250313",
    "messages": [
      {
        "role": "user",
        "content": "Explain the main strengths of this model in a few bullet points."
      }
    ]
  }'
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_COMETAPI_API_KEY",
    base_url="https://api.cometapi.com/v1"
)

response = client.chat.completions.create(
    model="hunyuan-turbos-20250313",
    messages=[
        {"role": "user", "content": "Explain the main strengths of this model in a few bullet points."}
    ]
)

print(response.choices[0].message.content)

Step 3: Retrieve and Verify Results

Read the response text from the first completion choice, then validate output quality for your use case with prompt tests that cover reasoning, multilingual tasks, long-context inputs, and code or logic workloads. Because Tencent’s public materials and CometAPI’s listing show different context-window figures for the broader TurboS family versus this endpoint listing, it is also wise to verify token limits, latency, and output behavior in your own integration environment before production rollout.