GPT Realtime

JSON →
openai llm
textaudio

A low-latency model optimized for real-time conversational applications and streaming interactions.

context window 32K tokens
max output 4K tokens
input price $4 / 1M tokens
output price $16 / 1M tokens
streamingtool-usefunction-calling
releasedDec 2024
knowledge cutoffOct 2024
full doc /v1/models/gpt-realtime