Gemini 2.5 Flash
Google's fast and efficient multimodal model in the Gemini 2.5 family, optimized for high-throughput and low-latency tasks.
Specs
context window 1.0M tokens
max output 66K tokens
input price $0.3 / 1M tokens
output price $2.5 / 1M tokens
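As a rough illustration of how the two per-token prices combine, the sketch below estimates the cost of a single request. The function name `estimate_cost` and the constants are hypothetical names chosen for this example. The calculation assumes flat per-token billing at the listed rates and ignores any cached-token discounts or tiered pricing a provider may apply.

```python
# Prices copied from the Specs list (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.30
OUTPUT_PRICE_PER_M = 2.50
TOKENS_PER_M = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request under flat per-token pricing.

    Assumption: no caching discounts, no tiered rates, no minimum charges.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens * INPUT_PRICE_PER_M / TOKENS_PER_M
    output_cost = output_tokens * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 200K-token prompt with a 4K-token reply.
    print(f"${estimate_cost(200_000, 4_000):.4f}")
```

Because the output rate is roughly eight times the input rate, long generations tend to dominate the bill even when the prompt is much larger than the reply.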
Capabilities
streaming, vision, code-generation, reasoning, tool-use, json-mode, function-calling, prompt-caching
Dates
released Apr 2025
Resources
API
full doc /v1/models/gemini-2.5-flash