O

GPT-4o mini Search Preview

輸入:$60/M
輸出:$240/M
GPT-4o mini Search Preview 是 GPT-4o 家族中面向搜尋導向互動與檢索工作流程的緊湊型多模態模型。它能解讀與重述查詢、綜合精煉答案,並在透過工具/函式呼叫整合外部搜尋時,為回應提供依據。典型用途包括產品內搜尋助理、知識庫問答、電商探索,以及用於排序與路由的查詢理解。技術亮點包括文字與影像輸入、遵循指令、結構化輸出格式,以及面向 RAG 管線的工具使用整合。
商業用途

Technical Specifications of gpt-4o-mini-search-preview

SpecificationDetails
Model IDgpt-4o-mini-search-preview
Model familyGPT-4o mini
Primary modalityMultimodal
Supported inputsText, image
Core strengthsSearch-oriented interactions, query understanding, concise answer synthesis, retrieval workflow support
Instruction followingStrong support for guided prompting and task formatting
Structured outputsSuitable for JSON and other schema-based response formats
Tool useDesigned to work well with external search and function/tool calling
Typical latency/cost profileCompact model optimized for lighter-weight deployments and high-throughput use cases
Common use casesIn-product search assistants, knowledge-base QA, e-commerce discovery, ranking/routing query understanding, RAG pipelines

What is gpt-4o-mini-search-preview?

gpt-4o-mini-search-preview is a compact multimodal model in the GPT-4o family built for search-centric experiences and retrieval-enhanced applications. It is well suited for systems that need to interpret user intent, rewrite or decompose queries, synthesize concise responses from retrieved information, and support grounded workflows through external search integration.

Because it accepts both text and image inputs, the model can participate in broader discovery and assistance experiences beyond plain text search. It is especially useful in applications where fast query understanding, controlled response formatting, and tool-enabled orchestration matter more than long-form generation. Common examples include customer-facing search copilots, internal knowledge assistants, product discovery flows, and retrieval pipelines that depend on query classification, ranking assistance, and answer generation.

Main features of gpt-4o-mini-search-preview

  • Search-oriented reasoning: Helps interpret ambiguous user intent, reformulate queries, and support retrieval-focused interactions.
  • Multimodal input support: Accepts both text and image inputs, enabling richer search and discovery workflows.
  • Concise answer synthesis: Produces short, useful summaries and direct responses appropriate for search-style UX.
  • Tool integration readiness: Works effectively with function calling and external tools for search, browsing, and RAG orchestration.
  • Structured output compatibility: Can generate responses in organized formats such as JSON for downstream systems.
  • Instruction-following behavior: Handles guided prompts reliably for classification, routing, extraction, and answer formatting tasks.
  • Knowledge-base QA support: Fits well in systems that retrieve documents first and then ask the model to produce grounded answers.
  • E-commerce and catalog discovery: Useful for interpreting shopping intent, refining filters, and improving product search interactions.
  • Ranking and routing assistance: Can help classify queries and prepare them for retrieval, ranking, or workflow branching logic.
  • Efficient deployment profile: As a compact model, it is appropriate for scalable, cost-aware integrations that still need multimodal and tool-aware behavior.

How to access and integrate gpt-4o-mini-search-preview

Step 1: Sign Up for API Key

To get started, create an account on CometAPI and generate your API key from the dashboard. After that, store the key securely and use it in the Authorization header for all requests.

Step 2: Send Requests to gpt-4o-mini-search-preview API

Use CometAPI’s OpenAI-compatible endpoint and specify the model as gpt-4o-mini-search-preview.

curl https://api.cometapi.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $COMETAPI_API_KEY" \
  -d '{
    "model": "gpt-4o-mini-search-preview",
    "messages": [
      {
        "role": "user",
        "content": "Summarize the main intent behind this search query: best running shoes for flat feet"
      }
    ]
  }'
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_COMETAPI_API_KEY",
    base_url="https://api.cometapi.com/v1"
)

response = client.chat.completions.create(
    model="gpt-4o-mini-search-preview",
    messages=[
        {
            "role": "user",
            "content": "Summarize the main intent behind this search query: best running shoes for flat feet"
        }
    ]
)

print(response.choices[0].message.content)

Step 3: Retrieve and Verify Results

Parse the model output in your application and, when needed, chain it with retrieval, reranking, or verification steps. For production search and RAG systems, it is a good practice to validate outputs against trusted sources and log responses for quality monitoring.