Context:2,000,000
Input:$1.6/M
Output:$4.8/M
Grok 4.20 release introduces a multi-agent architecture (multiple specialized agents coordinated in real time), expanded context modes, and focused improvements to instruction-following, hallucination reduction, and structured/tooled outputs.Per Second:$0.04
Generate videos from text prompts, animate still images, or edit existing videos with natural language. The API supports configurable duration, aspect ratio, and resolution for generated videos — with the SDK handling the asynchronous polling automatically.Context:2M
Input:$0.16/M
Output:$0.4/M
Grok 4.1 Fast is xAI’s production-focused large model, optimized for agentic tool-calling, long-context workflows, and low-latency inference. It’s a multimodal, two-variant family designed to run autonomous agents that search, execute code, call services, and reason over extremely large contexts (up to 2 million tokens).Context:256K
Input:$0.16/M
Output:$1.2/M
Grok Code Fast 1 is an AI programming model launched by xAI, designed for fast and efficient basic coding tasks. The model can process 92 tokens per second, has a 256k context window, and is suitable for rapid prototyping, code debugging, and generating simple visual elements.Context:2M
Input:$0.16/M
Output:$0.4/M
Grok 4 Fast is a new artificial intelligence model launched by xAI, integrating Inference and non-Inference capabilities into a single architecture. This model has a 2 million token context window and is designed for high-throughput applications such as search and coding. The model offers two versions: Grok-4-Fast-Reasoning and Grok-4-Fast-Non-Reasoning, optimized for different tasks.Context:256K
Grok 4 is an artificial intelligence model provided by XAI. Currently supports text modality, with vision, image generation, and other features coming soon. Possesses extremely powerful technical parameters and ecosystem capabilities: Context Window: Supports context processing of up to 256,000 tokens, leading mainstream models.