Kenny1004

AI & ML interests

AI NLP

Recent Activity

upvoted a paper 7 days ago

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Paper • 2604.02097 • Published 8 days ago • 30

upvoted a paper 17 days ago

Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model

Paper • 2603.21986 • Published 18 days ago • 121

liked a Space 18 days ago

SII-GAIR/daVinci-MagiHuman

Space • 🎬 • 143 • Generate short videos from an image and text prompt

liked a model 19 days ago

GAIR/daVinci-MagiHuman

Image-to-Video • Updated 16 days ago • 902 • 310

upvoted a paper 25 days ago

daVinci-Env: Open SWE Environment Synthesis at Scale

Paper • 2603.13023 • Published 28 days ago • 30

upvoted a paper 2 months ago

daVinci-Dev: Agent-native Mid-training for Software Engineering

Paper • 2601.18418 • Published Jan 26 • 126

upvoted a paper 3 months ago

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published Dec 29, 2025 • 66

upvoted a paper 7 months ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22, 2025 • 104

updated a model 11 months ago

GAIR/Anole-7b

7B • Updated May 26, 2025 • 14 • 6

published a model 11 months ago