Grok 3 Fast

JSON →
xai llm
text

The latest fast inference variant of the Grok 3 large language model, optimized for speed.

context window 131K tokens
max output 131K tokens
input price $5 / 1M tokens
output price $25 / 1M tokens
streamingreasoningcode-generationfunction-callingtool-useprompt-caching