ModelsSupportEnterpriseBlog
500+ AI Model API, All In One API.Just In CometAPI
Models API
Developer
Quick StartDocumentationAPI Dashboard
Resources
AI ModelsBlogEnterpriseChangelogAbout
2025 CometAPI. All right reserved.Privacy PolicyTerms of Service
Home/Models/OpenAI/GPT-4o mini Search Preview
O

GPT-4o mini Search Preview

Input:$60/M
Output:$240/M
GPT-4o mini Search Preview is a compact multimodal model in the GPT-4o family geared toward search-oriented interactions and retrieval workflows. It interprets and reformulates queries, synthesizes concise answers, and can ground responses via external search when integrated through tool/function calling. Typical uses include in-product search assistants, knowledge-base QA, e-commerce discovery, and query understanding for ranking and routing. Technical highlights include text-and-image inputs, instruction following, structured output formats, and tool use integration for RAG pipelines.
Commercial Use
Overview
Features
Pricing
API
Versions

Technical Specifications of gpt-4o-mini-search-preview

SpecificationDetails
Model IDgpt-4o-mini-search-preview
Model familyGPT-4o mini
Primary modalityMultimodal
Supported inputsText, image
Core strengthsSearch-oriented interactions, query understanding, concise answer synthesis, retrieval workflow support
Instruction followingStrong support for guided prompting and task formatting
Structured outputsSuitable for JSON and other schema-based response formats
Tool useDesigned to work well with external search and function/tool calling
Typical latency/cost profileCompact model optimized for lighter-weight deployments and high-throughput use cases
Common use casesIn-product search assistants, knowledge-base QA, e-commerce discovery, ranking/routing query understanding, RAG pipelines

What is gpt-4o-mini-search-preview?

gpt-4o-mini-search-preview is a compact multimodal model in the GPT-4o family built for search-centric experiences and retrieval-enhanced applications. It is well suited for systems that need to interpret user intent, rewrite or decompose queries, synthesize concise responses from retrieved information, and support grounded workflows through external search integration.

Because it accepts both text and image inputs, the model can participate in broader discovery and assistance experiences beyond plain text search. It is especially useful in applications where fast query understanding, controlled response formatting, and tool-enabled orchestration matter more than long-form generation. Common examples include customer-facing search copilots, internal knowledge assistants, product discovery flows, and retrieval pipelines that depend on query classification, ranking assistance, and answer generation.

Main features of gpt-4o-mini-search-preview

  • Search-oriented reasoning: Helps interpret ambiguous user intent, reformulate queries, and support retrieval-focused interactions.
  • Multimodal input support: Accepts both text and image inputs, enabling richer search and discovery workflows.
  • Concise answer synthesis: Produces short, useful summaries and direct responses appropriate for search-style UX.
  • Tool integration readiness: Works effectively with function calling and external tools for search, browsing, and RAG orchestration.
  • Structured output compatibility: Can generate responses in organized formats such as JSON for downstream systems.
  • Instruction-following behavior: Handles guided prompts reliably for classification, routing, extraction, and answer formatting tasks.
  • Knowledge-base QA support: Fits well in systems that retrieve documents first and then ask the model to produce grounded answers.
  • E-commerce and catalog discovery: Useful for interpreting shopping intent, refining filters, and improving product search interactions.
  • Ranking and routing assistance: Can help classify queries and prepare them for retrieval, ranking, or workflow branching logic.
  • Efficient deployment profile: As a compact model, it is appropriate for scalable, cost-aware integrations that still need multimodal and tool-aware behavior.

How to access and integrate gpt-4o-mini-search-preview

Step 1: Sign Up for API Key

To get started, create an account on CometAPI and generate your API key from the dashboard. After that, store the key securely and use it in the Authorization header for all requests.

Step 2: Send Requests to gpt-4o-mini-search-preview API

Use CometAPI’s OpenAI-compatible endpoint and specify the model as gpt-4o-mini-search-preview.

curl https://api.cometapi.com/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer $COMETAPI_API_KEY" \
  -d '{
    "model": "gpt-4o-mini-search-preview",
    "messages": [
      {
        "role": "user",
        "content": "Summarize the main intent behind this search query: best running shoes for flat feet"
      }
    ]
  }'
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_COMETAPI_API_KEY",
    base_url="https://api.cometapi.com/v1"
)

response = client.chat.completions.create(
    model="gpt-4o-mini-search-preview",
    messages=[
        {
            "role": "user",
            "content": "Summarize the main intent behind this search query: best running shoes for flat feet"
        }
    ]
)

print(response.choices[0].message.content)

Step 3: Retrieve and Verify Results

Parse the model output in your application and, when needed, chain it with retrieval, reranking, or verification steps. For production search and RAG systems, it is a good practice to validate outputs against trusted sources and log responses for quality monitoring.

Features for GPT-4o mini Search Preview

Explore the key features of GPT-4o mini Search Preview, designed to enhance performance and usability. Discover how these capabilities can benefit your projects and improve user experience.

Pricing for GPT-4o mini Search Preview

Explore competitive pricing for GPT-4o mini Search Preview, designed to fit various budgets and usage needs. Our flexible plans ensure you only pay for what you use, making it easy to scale as your requirements grow. Discover how GPT-4o mini Search Preview can enhance your projects while keeping costs manageable.
Comet Price (USD / M Tokens)Official Price (USD / M Tokens)Discount
Input:$60/M
Output:$240/M
Input:$75/M
Output:$300/M
-20%

Sample code and API for GPT-4o mini Search Preview

Access comprehensive sample code and API resources for GPT-4o mini Search Preview to streamline your integration process. Our detailed documentation provides step-by-step guidance, helping you leverage the full potential of GPT-4o mini Search Preview in your projects.

Versions of GPT-4o mini Search Preview

The reason GPT-4o mini Search Preview has multiple snapshots may include potential factors such as variations in output after updates requiring older snapshots for consistency, providing developers a transition period for adaptation and migration, and different snapshots corresponding to global or regional endpoints to optimize user experience. For detailed differences between versions, please refer to the official documentation.
version
gpt-4o-mini-search-preview
gpt-4o-mini-search-preview-2025-03-11

More Models

G

Nano Banana 2

Input:$0.4/M
Output:$2.4/M
Core Capabilities Overview: Resolution: Up to 4K (4096×4096), on par with Pro. Reference Image Consistency: Up to 14 reference images (10 objects + 4 characters), maintaining style/character consistency. Extreme Aspect Ratios: New 1:4, 4:1, 1:8, 8:1 ratios added, suitable for long images, posters, and banners. Text Rendering: Advanced text generation, suitable for infographics and marketing poster layouts. Search Enhancement: Integrated Google Search + Image Search. Grounding: Built-in thinking process; complex prompts are reasoned before generation.
A

Claude Opus 4.6

Input:$4/M
Output:$20/M
Claude Opus 4.6 is Anthropic’s “Opus”-class large language model, released February 2026. It is positioned as a workhorse for knowledge-work and research workflows — improving long-context reasoning, multi-step planning, tool use (including agentic software workflows), and computer-use tasks such as automated slide and spreadsheet generation.
A

Claude Sonnet 4.6

Input:$2.4/M
Output:$12/M
Claude Sonnet 4.6 is our most capable Sonnet model yet. It’s a full upgrade of the model’s skills across coding, computer use, long-context reasoning, agent planning, knowledge work, and design. Sonnet 4.6 also features a 1M token context window in beta.
O

GPT-5.4 nano

Input:$0.16/M
Output:$1/M
GPT-5.4 nano is designed for tasks where speed and cost matter most like classification, data extraction, ranking, and sub-agents.
O

GPT-5.4 mini

Input:$0.6/M
Output:$3.6/M
GPT-5.4 mini brings the strengths of GPT-5.4 to a faster, more efficient model designed for high-volume workloads.
A

Claude Mythos Preview

A

Claude Mythos Preview

Coming soon
Input:$60/M
Output:$240/M
Claude Mythos Preview is our most capable frontier model to date, and shows a striking leap in scores on many evaluation benchmarks compared to our previous frontier model, Claude Opus 4.6.