VibeVoice 1.5B

microsoft audio

A 1.5-billion-parameter text-to-speech model from Microsoft, focused on generating expressive, natural-sounding speech.

streaming
released Nov 2024