Chengsong Huang

ChengsongHuang

https://chengsong-huang.github.io/

hcscctv

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Guided Self-Evolving LLMs with Minimal Human Supervision

commented on a paper 4 days ago

Guided Self-Evolving LLMs with Minimal Human Supervision

upvoted a paper 7 days ago

Video Generation Models Are Good Latent Reward Models

Organizations

upvoted a paper 4 days ago

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published 5 days ago • 47

upvoted a paper 7 days ago

Video Generation Models Are Good Latent Reward Models

Paper • 2511.21541 • Published 11 days ago • 44

upvoted 2 papers 10 days ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published 12 days ago • 111

Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning

Paper • 2511.19900 • Published 12 days ago • 46

upvoted 2 papers 13 days ago

Insights from the ICLR Peer Review and Rebuttal Process

Paper • 2511.15462 • Published 18 days ago • 6

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published 17 days ago • 105

upvoted a paper 15 days ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published 17 days ago • 104

upvoted 3 papers 16 days ago

upvoted a paper 17 days ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published 18 days ago • 42

upvoted a paper 19 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published 20 days ago • 132

upvoted a paper 20 days ago

MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism

Paper • 2511.11373 • Published 23 days ago • 12

upvoted a paper 23 days ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published 24 days ago • 46

upvoted a paper 24 days ago

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published 25 days ago • 110

upvoted a paper 26 days ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published about 1 month ago • 52

upvoted a paper 30 days ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 80

upvoted 3 papers about 1 month ago

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published Nov 6 • 26

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30 • 29

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31 • 70
