Yangyi Chen

YangyiYY

https://yangyi-chen.github.io/

AI & ML interests

Multimodal, Large Language Models

Recent Activity

liked a model 8 days ago

nvidia/Nemotron-Cascade-8B-Intermediate-ckpts

authored a paper 9 days ago

CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets

authored a paper 9 days ago

R-Tuning: Teaching Large Language Models to Refuse Unknown Questions

View all activity

Organizations

None yet

liked a model 8 days ago

nvidia/Nemotron-Cascade-8B-Intermediate-ckpts

Text Generation • Updated 9 days ago • 8

authored 8 papers 9 days ago

MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback

Paper • 2309.10691 • Published Sep 19, 2023 • 4

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Paper • 2512.13607 • Published 12 days ago • 26

upvoted a paper 9 days ago

Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models

Paper • 2512.13607 • Published 12 days ago • 26

liked a dataset 11 days ago

nvidia/Nemotron-Cascade-RL-SWE

Viewer • Updated 11 days ago • 110k • 405 • 22

liked 3 models 11 days ago

nvidia/Nemotron-Cascade-14B-Thinking

Text Generation • 15B • Updated 9 days ago • 2.41k • 44

nvidia/Nemotron-Cascade-8B-Thinking

Text Generation • 8B • Updated 9 days ago • 1.19k • 26

nvidia/Nemotron-Cascade-8B

Text Generation • 8B • Updated 9 days ago • 2.41k • 43

upvoted a collection 12 days ago

Nemotron-Cascade

Collection

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 17 items • Updated 4 days ago • 38

upvoted a paper 24 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26 • 109

upvoted a paper 6 months ago

Perception-Aware Policy Optimization for Multimodal Reasoning

Paper • 2507.06448 • Published Jul 8 • 47

liked a Space 7 months ago

The Ultra-Scale Playbook

🌌

3.6k

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 7 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30 • 143

upvoted a paper 8 months ago

RM-R1: Reward Modeling as Reasoning

Paper • 2505.02387 • Published May 5 • 79

Yangyi Chen

AI & ML interests

Recent Activity

Organizations

YangyiYY's activity

The Ultra-Scale Playbook