Kimi K2.7 Code is now on CometAPI — Kimi's most intelligent coding model to date, reliably follows instructions in long contexts and completes programming tasks with a higher success rate. Try it now
Models
Pricing
Enterprise
Resources
Integrations
Quickstart
CometAPI vs Competitors
Compare
Support
Blog
English
繁體中文
日本語
한국어
Français
Deutsch
Español
Italiano
Português
Русский
العربية
ไทย
Tiếng Việt
Bahasa Indonesia
Bahasa Melayu
Türkçe
Polski
Nederlands
Danish
Norsk
Қазақ
اردو
Start Free
Start Free
Mistral Small 4 Blog
Mistral Small 4 Blog
Mar 23, 2026
Mistral Small 4
How to Run Mistral Small 4 Locally
Mistral Small 4 is a newly released open-weight multimodal AI model by Mistral AI (March 2026) that combines reasoning, coding, and vision capabilities in a single architecture. It can be deployed locally using frameworks like Ollama, vLLM, or llama.cpp (quantized), requiring GPUs (≥24GB VRAM recommended) or high-end CPUs with quantization. Its key advantage is high performance at significantly lower inference cost and latency, making it ideal for on-device AI applications.