GPT-3.5 Turbo 16K

openai llm deprecated
text

An extended-context variant of GPT-3.5 Turbo supporting up to 16,384 tokens for longer conversations and documents.

context window 16K tokens
max output 4K tokens
input price $3 / 1M tokens
output price $4 / 1M tokens
streaming, function-calling, json-mode, tool-use, prompt-caching
released Jun 2023
knowledge cutoff Sep 2021
deprecated Jul 2024
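
The limits and prices listed above can be turned into a quick budget check. The following is a minimal sketch, not an official calculator: the constant and function names are hypothetical, "4K" is read as 4,096 tokens, and the context window is assumed to be shared between the prompt and the completion.

```python
# Figures copied from the listing above; all names are illustrative.
CONTEXT_WINDOW = 16_384       # tokens, assumed to cover prompt plus completion
MAX_OUTPUT = 4_096            # "4K" read as 4,096 tokens (assumption)
INPUT_PRICE_PER_M = 3.00      # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 4.00     # USD per 1M output tokens


def fits(prompt_tokens: int, output_tokens: int) -> bool:
    """Return True if a request respects the output cap and the context window."""
    return (output_tokens <= MAX_OUTPUT
            and prompt_tokens + output_tokens <= CONTEXT_WINDOW)


def estimate_cost(prompt_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed prices."""
    return (prompt_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000
```

For example, a request with 12,000 prompt tokens and 1,000 output tokens fits within the window and would cost roughly $0.04 at these rates, while 14,000 prompt tokens with a 4,000-token completion would exceed the 16,384-token window.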