Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
Fan Zhou
koalazf99
AI & ML interests
Deep Learning; Natural Language Processing; Foundation Models
Recent Activity
upvoted
a
paper
about 1 month ago
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic,
and Long-Horizon Task Execution
upvoted
a
paper
about 1 month ago
VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos
upvoted
a
paper
3 months ago
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents