Q

qwen3-vl-32b

Input:$0.24/M
Output:$0.96/M
Qwen3-VL-32B is the 32-billion-parameter dense variant in Alibaba’s Qwen3 vision-language model family. It is a multimodal (vision + language + video) transformer designed for unified perception, long-context reasoning, robust OCR and visual grounding, and agentic/toolified workflows.
New
Commercial Use