CLVP Dev

susnato multimodal
text audio

A development version of the CLVP (Contrastive Language-Voice Pretraining) model for speech-text alignment.
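As a rough illustration of what contrastive speech-text alignment means in general, the sketch below scores candidate speech embeddings against a text embedding by cosine similarity and ranks them. It is not CLVP's actual code or API: the function names and the toy three-dimensional vectors are hypothetical, and a real model would produce much larger embeddings from its own text and speech encoders.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_speech_candidates(text_embedding, speech_embeddings):
    """Return candidate indices ordered from best to worst match."""
    scores = [cosine_similarity(text_embedding, s) for s in speech_embeddings]
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


# Toy, made-up embeddings (assumption: both modalities share one vector space).
text_embedding = [1.0, 0.0, 1.0]
speech_embeddings = [
    [0.0, 1.0, 0.0],  # orthogonal to the text embedding
    [2.0, 0.0, 2.0],  # same direction as the text embedding
    [1.0, 1.0, 0.0],  # partially aligned
]

ranking = rank_speech_candidates(text_embedding, speech_embeddings)
print(ranking)
```

In a contrastive setup of this kind, training pushes matching text and speech pairs toward high similarity and mismatched pairs toward low similarity, so the ranking can be used to pick the speech candidate that best fits a given text.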

open-weights