Gemini Robotics ER 1.5 Preview

JSON →
google multimodal
textimage

A preview model from Google's Gemini Robotics series for embodied reasoning tasks.

context window 1.0M tokens
max output 66K tokens
input price $0.3 / 1M tokens
output price $2.5 / 1M tokens
visionreasoningfunction-callingtool-usejson-mode
releasedMar 2025