Hurry! 1M Free Tokens Waiting for You – Register Today!

  • Home
  • Models
    • Suno v4.5
    • GPT-image-1 API
    • GPT-4.1 API
    • Qwen 3 API
    • Grok-3-Mini
    • Llama 4 API
    • GPT-4o API
    • GPT-4.5 API
    • Claude 3.7-Sonnet API
    • Grok 3 API
    • DeepSeek R1 API
    • Gemini2.5 pro
    • Runway Gen-3 Alpha API
    • FLUX 1.1 API
    • Kling 1.6 Pro API
    • All Models
  • Enterprise
  • Pricing
  • API Docs
  • Blog
  • Contact
Sign Up
Log in
Technology

Google launches new Gemini 2.5 Flash-Lite model

2025-06-18 anna No comments yet

Google DeepMind has today announced significant expansions to its Gemini 2.5 family, unveiling the stable releases of Gemini 2.5 Pro and Gemini 2.5 Flash alongside a preview of the all‑new Gemini 2.5 Flash‑Lite model. These updates reflect Google’s continued commitment to offering a spectrum of AI models that balance cost, speed, and performance for diverse workloads .

Stable Releases: Gemini 2.5 Pro & Flash

On June 17, 2025, Google marked the general availability of Gemini 2.5 Pro and Gemini 2.5 Flash. The Pro variant delivers maximum reasoning power and is tailored for high‑complexity tasks such as advanced code generation, scientific analysis, and large‑scale data synthesis. In contrast, Gemini 2.5 Flash offers a mid‑tier option optimized for everyday uses that demand low latency—ideal for chatbots, summarization, and content creation at scale.

Overview: Three Models in the Gemini -2.5 Family

ModelStatusStrengthsIdeal Use Cases
Gemini 2.5 Flash‑Lite (preview)PreviewFastest & cheapest; multimodal; controllable reasoning; tool-enabledHigh-volume tasks like chatbots, summarization, search
Gemini 2.5 FlashStableBalanced: low latency, good reasoning, multimodalReal-time conversations, customer support
Gemini 2.5 ProStableMost capable: deep reasoning, huge context, multimodalResearch, complex coding, scientific tasks

Gemini 2.5 Flash‑Lite: Preview Highlights

Ultra‑low latency & cost savings:Designed for high-volume, real-time applications like translation, classification, and summarization. Boasts faster inference and lower cost per call compared to both 2.0 Flash‑Lite and the full Flash version .

Improved foundational performance: Outperforms earlier Flash‑Lite models across benchmarks in code generation, logic, math, multimodal reasoning, and science.

Cost and efficiency: Flash‑Lite pricing (preview): ~\$0.10 per 1M input tokens and ~\$0.40 per 1M output tokens—significantly cheaper than Flash (\$0.30/\$2.50) and Pro (\$1.25/\$10) .

Full Gemini -2.5 capabilities:

  • Controllable Thinking: Users can set “thinking budgets” (token limits) to trade speed for depth—Flash‑Lite can toggle this on as needed.
  • Multimodal Input: Supports text, image, audio, and video (including hour-long clips), with abilities to parse charts, UI, scenes, event summaries .
  • Tool Integration: Includes Google Search, code execution and a million-token context window, matching the capabilities of Flash and Pro.

Positioning on the Price‑Performance Curve

Google positions Flash‑Lite’s high speed and low cost at the Pareto frontier, meaning it’s among the most cost‑efficient-yet-capable models worldwide ([36kr.com][3]). In comparative evaluations, Flash‑Lite represents the best value: smart yet affordable .


About Flash and Pro

  • Gemini 2.5 Flash: Stable, low-latency, multi-modal thinking model. Positioned below Pro but roughly on par with GPT-4o in capability, with superior speed and cost efficiency ([ai.google.dev][10]).
  • Gemini 2.5 Pro: Google’s most advanced model. Renowned for handling hours-long video/audio, complex code and math, and huge-context reasoning. Also introduces selective “thinking budgets” and improved code quality to serve as a long-term stable flagship AI .

Deployment & Pricing

  • Availability: All three models are accessible through Google AI Studio, Google Cloud Vertex AI, and the Gemini app .
  • Cost structure (Vertex AI pricing from June 16, 2025):
  • Pro: \$1.25/1M input, \$10/1M output (higher beyond 200K tokens)
  • Flash: \$0.15/1M input, \$3.50/1M output in “thinking” mode—and includes 1,500 free grounded prompts daily ([cloud.google.com][8])
  • Flash‑Lite (preview): ~\$0.10/\$0.40 per 1M tokens

Getting Started

CometAPI provides a unified REST interface that aggregates hundreds of AI models—under a consistent endpoint, with built-in API-key management, usage quotas, and billing dashboards. Instead of juggling multiple vendor URLs and credentials.

Developers can access Gemini-2.5 Pro Preview API and Gemini-2.5 Flash Pre API through CometAPI, the latest models listed are as of the article’s publication date. To begin, explore the model’s capabilities in the Playground and consult the API guide for detailed instructions. Before accessing, please make sure you have logged in to CometAPI and obtained the API key. CometAPI offer a price far lower than the official price to help you integrate.

The latest integration Gemini 2.5 Flash‑LiteI will soon appear on CometAPI, so stay tuned!While we finalize Gemini 2.5 Flash‑Lite Model upload, explore our other models on the Models page or try them in the AI Playground.

  • Gemini 2.5 Flash
  • Gemini 2.5 Flash‑LiteI
  • Gemini 2.5 Pro
anna

Post navigation

Previous
Next

Search

Categories

  • AI Company (2)
  • AI Comparisons (40)
  • AI Model (81)
  • Model API (29)
  • Technology (314)

Tags

Alibaba Cloud Anthropic Black Forest Labs ChatGPT Claude Claude 3.7 Sonnet Claude 4 Claude Sonnet 4 Codex cometapi DALL-E 3 deepseek DeepSeek R1 DeepSeek V3 FLUX Gemini Gemini 2.0 Gemini 2.0 Flash Gemini 2.5 Flash Gemini 2.5 Pro Google GPT-4.1 GPT-4o GPT -4o Image GPT-Image-1 GPT 4.5 gpt 4o grok 3 Midjourney Midjourney V7 Minimax o3 o4 mini OpenAI Qwen Qwen 2.5 Qwen3 sora Stable AI Stable Diffusion Stable Diffusion 3.5 Large Suno Suno Music Veo 3 xAI

Related posts

Technology, AI Comparisons

Gemini 2.5 Pro vs OpenAI’s GPT-4.1: A Complete Comparison

2025-06-12 anna No comments yet

The competition between leading AI developers has intensified with Google’s launch of Gemini 2.5 Pro and OpenAI’s introduction of GPT-4.1. These cutting-edge models promise significant advancements in areas ranging from coding and long-context comprehension to cost-efficiency and enterprise readiness. This in-depth comparison explores the latest features, benchmark results, and practical considerations for selecting the right […]

Technology, AI Comparisons

Gemini 2.5 Pro vs Claude Sonnet 4: A Comprehensive Comparison

2025-06-09 anna No comments yet

In the rapidly evolving landscape of large language models (LLMs), Google’s Gemini 2.5 Pro and Anthropic’s Claude Sonnet 4 represent two of the latest contenders, each touting groundbreaking improvements in reasoning, coding, and user customization. While Gemini 2.5 Pro focuses on delivering enterprise-grade stability, configurable compute, and deep reasoning enhancements, Claude Sonnet 4 emphasizes cost-effective […]

Technology

Google Unveils Gemini 2.5 Pro Preview-0605

2025-06-06 anna No comments yet

Google yestoday announced the launch of the Gemini 2.5 Pro(The version is gemini-2.5-pro-preview-06-05 in CometAPI.) upgraded preview, the latest evolution of its powerful AI model. Designed to be smarter, faster, more reliable, and more creative, Gemini 2.5 Pro delivers state-of-the-art performance and marks a new milestone in Google’s AI capabilities. The model is currently available […]

500+ AI Model API,All In One API. Just In CometAPI

Models API
  • GPT API
  • Suno API
  • Luma API
  • Sora API
Developer
  • Sign Up
  • API DashBoard
  • Documentation
  • Quick Start
Resources
  • Pricing
  • Enterprise
  • Blog
  • AI Model API Articles
  • Discord Community
Get in touch
  • [email protected]

© CometAPI. All Rights Reserved.   EFoxTech LLC.

  • Terms & Service
  • Privacy Policy