Google unveils Gemini 3.1 Flash-Lite — a fast, low-cost LLM
Google introduced Gemini 3.1 Flash-Lite, the newest member of the Gemini 3 family designed specifically as a high-throughput, low-latency, cost-efficient engine for developer and enterprise workloads. Google positions Flash-Lite as the “fastest and most cost-efficient” model in the Gemini 3 line: a lightweight variant that aims to deliver streaming interactions, large-scale background processing, and high-frequency production tasks (for example, translation, extraction, UI generation, and large-volume classification) at a much lower price point than its Pro counterparts