SER Odyssey Baseline WavLM Multi-Attributes

JSON →
3loi audio
audio

A baseline WavLM model for speech emotion recognition with multi-attribute prediction.

streaming