checklist.day
Models
320 audio models
Qwen/Qwen3-ForcedAligner-0.6B
audio
qwen
Qwen/Qwen3-TTS-12Hz-0.6B-Base
audio
qwen
Qwen/Qwen3-TTS-12Hz-0.6B-CustomVoice
audio
qwen
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
audio
Qwen
Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign
audio
qwen
ResembleAI/chatterbox
audio
ResembleAI
Revai/reverb-diarization-v1
audio
revai
SWivid/E2-TTS
audio
SWivid
SWivid/F5-TTS
audio
swivid
Serveurperso/OmniVoice-GGUF
audio
Serveurperso
Serveurperso/Qwen3-TTS-GGUF
audio
Serveurperso
Speech-Arena-2025/DF_Arena_1B_V_1
audio
speech-arena-2025
Speech-Arena-2025/DF_Arena_500M_V_1
audio
speech-arena-2025
Sunbird/spark-tts-salt
audio
Sunbird
Supertone/supertonic-3
audio
supertone
Systran/faster-whisper-base
audio
Systran
Systran/faster-whisper-large-v2
audio
Systran
Systran/faster-whisper-large-v3
audio
Systran
Systran/faster-whisper-medium
audio
systran
Systran/faster-whisper-small
audio
Systran
Systran/faster-whisper-small.en
audio
Systran
Systran/faster-whisper-tiny
audio
systran
Systran/faster-whisper-tiny.en
audio
Systran
TalTechNLP/voxlingua107-epaca-tdnn
audio
TalTechNLP
Xenova/ast-finetuned-audioset-10-10-0.4593
audio
Xenova
Xenova/speecht5_tts
audio
Xenova
YatharthS/LuxTTS
audio
YatharthS
Yehor/w2v-xls-r-uk
audio
yehor
Zyphra/Zonos-v0.1-transformer
audio
Zyphra
african-low-resource/omnivoice-amharic
audio
african-low-resource
ai4bharat/IndicF5
audio
ai4bharat
ai4bharat/indic-parler-tts
audio
ai4bharat
airesearch/wav2vec2-large-xlsr-53-th
audio
airesearch
alefiury/wav2vec2-large-xlsr-53-gender-recognition-librispeech
audio
alefiury
alvanlii/wav2vec2-BERT-cantonese
audio
alvanlii
alvanlii/whisper-small-cantonese
audio
alvanlii
anton-l/wav2vec2-random-tiny-classifier
audio
anton-l
anuragshas/wav2vec2-large-xlsr-53-telugu
audio
anuragshas
argmaxinc/parakeetkit-pro
audio
argmaxinc
argmaxinc/speakerkit-coreml
audio
argmaxinc
argmaxinc/speakerkit-pro
audio
argmaxinc
argmaxinc/whisperkit-coreml
audio
argmaxinc
audeering/wav2vec2-large-robust-12-ft-emotion-msp-dim
audio
audeering
audeering/wav2vec2-large-robust-24-ft-age-gender
audio
audeering
audeering/wav2vec2-large-robust-6-ft-age-gender
audio
audeering
aufklarer/Qwen3-ForcedAligner-0.6B-4bit
audio
aufklarer
aufklarer/WeSpeaker-ResNet34-LM-CoreML
audio
aufklarer
aufklarer/WeSpeaker-ResNet34-LM-MLX
audio
aufklarer
awsaf49/sonics-spectttra-alpha-120s
audio
awsaf49
awsaf49/sonics-spectttra-gamma-5s
audio
awsaf49