Llama 3.1 8B Instant

JSON →
meta llm
text

A fast, lightweight instruction-tuned language model from Meta optimized for low-latency text generation and chat applications.

context window 128K tokens
max output 8K tokens
input price $0.05 / 1M tokens
output price $0.08 / 1M tokens
streamingcode-generationfunction-callingtool-usejson-mode
releasedJul 2024
knowledge cutoffDec 2023