checklist.day
Models
320 audio models
3loi/SER-Odyssey-Baseline-WavLM-Multi-Attributes
audio
3loi
7wolf/wav2vec2-base-gender-classification
audio
7wolf
AI-Music-Detection/ai_music_detection_large_60s
audio
AI-Music-Detection
AbelZimba/whisper-bemba-stt
audio
abelzimba
Aniemore/wavlm-emotion-russian-resd
audio
Aniemore
Aratako/MioTTS-2.6B
audio
aratako
Beijuka/voice-gender-classifier
audio
Beijuka
CohereLabs/cohere-transcribe-03-2026
audio
cohere
DBD-research-group/AudioProtoPNet-20-BirdSet-XCL
audio
DBD-research-group
DBD-research-group/AudioProtoPNet-5-BirdSet-XCL
audio
DBD-research-group
DBD-research-group/Bird-MAE-Base
audio
DBD-research-group
DBD-research-group/Bird-MAE-Huge
audio
DBD-research-group
DBD-research-group/Bird-MAE-Large
audio
DBD-research-group
Dpngtm/wav2vec2-emotion-recognition
audio
dpngtm
FluidInference/parakeet-tdt-0.6b-v3-coreml
audio
FluidInference
FunAudioLLM/Fun-CosyVoice3-0.5B-2512
audio
FunAudioLLM
Gustking/wav2vec2-large-xlsr-deepfake-audio-classification
audio
Gustking
Hemgg/Deepfake-audio-detection
audio
Hemgg
HowMannyMore/wav2vec2-lg-xlsr-ur-speech-emotion-recognition
audio
HowMannyMore
HumeAI/tada-1b
audio
humeai
HumeAI/tada-3b-ml
audio
HumeAI
IndexTeam/IndexTTS-2
audio
IndexTeam
JaesungHuh/voice-gender-classifier
audio
JaesungHuh
Jzuluaga/accent-id-commonaccent_ecapa
audio
Jzuluaga
Jzuluaga/accent-id-commonaccent_xlsr-en-english
audio
Jzuluaga
KBLab/wav2vec2-large-voxrex-swedish
audio
KBLab
KELONMYOSA/wav2vec2-xls-r-300m-emotion-ru
audio
KELONMYOSA
Khalsuu/filipino-wav2vec2-l-xls-r-300m-official
audio
Khalsuu
Krithika-p/my_awesome_emotions_model
audio
Krithika-p
Lajavaness/wav2vec2-lg-xlsr-fr-speech-emotion-recognition
audio
Lajavaness
MIT/ast-finetuned-audioset-10-10-0.4593
audio
MIT
MIT/ast-finetuned-audioset-14-14-0.443
audio
mit
MIT/ast-finetuned-audioset-16-16-0.442
audio
MIT
MIT/ast-finetuned-speech-commands-v2
audio
mit
4K ctx
MahmoudAshraf/mms-300m-1130-forced-aligner
audio
MahmoudAshraf
MelodyMachine/Deepfake-audio-detection-V2
audio
melodymachine
Misha24-10/F5-TTS_RUSSIAN
audio
Misha24-10
NbAiLab/nb-wav2vec2-1b-bokmaal-v2
audio
NbAiLab
NbAiLab/nb-wav2vec2-1b-nynorsk
audio
nbailab
OpenMOSS-Team/MOSS-TTS
audio
openmoss
OpenMOSS-Team/MOSS-TTS-Local-Transformer
audio
OpenMOSS-Team
OpenMOSS-Team/MOSS-TTS-Nano-100M
audio
openmoss
OpenMOSS-Team/MOSS-TTS-Realtime
audio
OpenMOSS-Team
OpenMOSS-Team/MOSS-TTS-v1.5
audio
openmoss-team
OpenMOSS-Team/MOSS-TTSD-v1.0
audio
OpenMOSS-Team
OpenMOSS-Team/MOSS-VoiceGenerator
audio
OpenMOSS-Team
OpenMuQ/MuQ-MuLan-large
audio
OpenMuQ
OpenMuQ/MuQ-large-msd-iter
audio
OpenMuQ
Qwen/Qwen3-ASR-0.6B
audio
Qwen
Qwen/Qwen3-ASR-1.7B
audio
Qwen