Gemini Robotics ER 1.5 Preview

google multimodal

text, image

A preview model from Google's Gemini Robotics series for embodied reasoning tasks.

Specs

context window 1.0M tokens

max output 66K tokens

input price $0.3 / 1M tokens

output price $2.5 / 1M tokens
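
The two per-token prices above combine into a simple per-request cost estimate. The sketch below is illustrative only: the function name `estimate_cost` and the constant names are hypothetical, the rates are copied from this listing, and actual billing may differ (for example, if reasoning tokens are billed as output).

```python
from decimal import Decimal

# Rates copied from the listing above (USD per 1M tokens); assumed current.
INPUT_PRICE_PER_M = Decimal("0.3")
OUTPUT_PRICE_PER_M = Decimal("2.5")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    )
    return total / TOKENS_PER_M
```

Under these assumptions, a request with 200,000 input tokens and 4,000 output tokens comes to $0.07. `Decimal` is used instead of `float` so that small per-request amounts add up without rounding drift.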

vision, reasoning, function-calling, tool-use, json-mode

released Mar 2025