Gemini 2.5 Flash Native Audio Preview 12-2025

JSON →
google multimodal
textimageaudio

A December 2025 preview of Google's Gemini 2.5 Flash model with enhanced native audio processing for speech and sound tasks.

context window 1.0M tokens
max output 8K tokens
input price $0.3 / 1M tokens
output price $2.5 / 1M tokens
streamingvisiontool-useprompt-caching
releasedDec 2025
knowledge cutoffSep 2025