Voyage Multimodal 3

JSON →
voyageai embedding
textimage

A multimodal embedding model that encodes both text and images into a shared vector space for cross-modal retrieval.

context window 32K tokens
input price $0.12 / 1M tokens
streaming
releasedJan 2024