Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
baojian1024 's Collections
Video
Audio
Image
OCR
Comfyui
LTX-2.3
3D models

Audio

updated 7 days ago
Upvote
-

  • microsoft/VibeVoice-ASR

    Automatic Speech Recognition • 9B • Updated Jan 27 • 732k • 1.05k

  • CohereLabs/cohere-transcribe-03-2026

    Automatic Speech Recognition • Updated 5 days ago • 304k • 912

  • JiongzeYu/SparkVSR

    Updated 22 days ago • 1.44k • 57

  • smthem/SparkVSR-GGUF

    6B • Updated Mar 25 • 73 • 3

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 183k • 2.34k

  • microsoft/VibeVoice-Realtime-0.5B

    Text-to-Speech • 1B • Updated Dec 12, 2025 • 1.23M • 1.21k

  • meituan-longcat/LongCat-AudioDiT-3.5B

    4B • Updated 23 days ago • 5.99k • 65

  • openbmb/VoxCPM2

    Text-to-Speech • Updated 10 days ago • 98k • 1.24k

  • k2-fsa/OmniVoice

    Text-to-Speech • Updated 4 days ago • 1.48M • 702
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs