Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
baojian1024
's Collections
Video
Audio
Image
OCR
Comfyui
LTX-2.3
3D models
Audio
updated
7 days ago
Upvote
-
microsoft/VibeVoice-ASR
Automatic Speech Recognition
•
9B
•
Updated
Jan 27
•
732k
•
1.05k
CohereLabs/cohere-transcribe-03-2026
Automatic Speech Recognition
•
Updated
5 days ago
•
304k
•
912
JiongzeYu/SparkVSR
Updated
22 days ago
•
1.44k
•
57
smthem/SparkVSR-GGUF
6B
•
Updated
Mar 25
•
73
•
3
microsoft/VibeVoice-1.5B
Text-to-Speech
•
3B
•
Updated
Jan 22
•
183k
•
2.34k
microsoft/VibeVoice-Realtime-0.5B
Text-to-Speech
•
1B
•
Updated
Dec 12, 2025
•
1.23M
•
1.21k
meituan-longcat/LongCat-AudioDiT-3.5B
4B
•
Updated
23 days ago
•
5.99k
•
65
openbmb/VoxCPM2
Text-to-Speech
•
Updated
10 days ago
•
98k
•
1.24k
k2-fsa/OmniVoice
Text-to-Speech
•
Updated
4 days ago
•
1.48M
•
702
Upvote
-
Share collection
View history
Collection guide
Browse collections