HJ.Chen

HJChen

https://harryxd2018.github.io

HarryXD2018

Recent Activity

upvoted a paper 12 days ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published 13 days ago • 163

upvoted a paper 16 days ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published 19 days ago • 36

liked a Space 19 days ago

C4G-HKUST/AnyTalker

🎬

Let your character interact naturally

upvoted a paper about 1 month ago

AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement

Paper • 2511.23475 • Published Nov 28 • 41

upvoted 2 papers 4 months ago

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Paper • 2509.09595 • Published Sep 11 • 48

MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation

Paper • 2508.19320 • Published Aug 26 • 29

liked a dataset 7 months ago

SALT-Research/DeepDialogue-xtts

Viewer • Updated May 28 • 243k • 5.6k • 6

liked 3 models over 1 year ago

Saire2023/wav2vec2-base-finetuned-Speaker-Classification

Audio Classification • 94.6M • Updated Apr 16, 2024 • 10 • 2

trpakov/vit-face-expression

Image Classification • 85.8M • Updated Feb 20 • 491k • 85

facebook/seamless-streaming

Text-to-Speech • Updated Jan 4, 2024 • 273

New activity in ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition over 2 years ago

How to run pre-trained model on local audio file?

👍 5

#2 opened over 3 years ago by risau

liked 2 models almost 3 years ago

harshit345/xlsr-wav2vec-speech-emotion-recognition

Audio Classification • Updated Dec 12, 2021 • 990 • 62

ehcalabres/wav2vec2-lg-xlsr-en-speech-emotion-recognition

Audio Classification • 0.3B • Updated Oct 24, 2024 • 54k • 237

liked a model about 3 years ago

speechbrain/emotion-recognition-wav2vec2-IEMOCAP

Audio Classification • Updated Jul 23, 2024 • 682k • 168