73 61 67

Ziyang Luo

Ziyang

https://chiyeunglaw.github.io/

AI & ML interests

Agents, LLMs, Multimodal ML

Recent Activity

upvoted a paper 11 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 11 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

updated a Space 11 days ago

HKBU-NLP/README

View all activity

Organizations

upvoted 2 papers 11 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 13 days ago • 141

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 12 days ago • 82

updated a Space 11 days ago

README

🚀

upvoted a paper 12 days ago

Towards Comprehensive Stage-wise Benchmarking of Large Language Models in Fact-Checking

Paper • 2601.02669 • Published 21 days ago • 3

authored a paper 17 days ago

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Paper • 2601.03559 • Published 20 days ago • 13

upvoted a paper 17 days ago

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Paper • 2601.03559 • Published 20 days ago • 13

liked 2 datasets 27 days ago

nvidia/Nemotron-Post-Training-Dataset-v1

Viewer • Updated Aug 25, 2025 • 25.7M • 4.68k • 172

ScaleAI/MCP-Atlas

Viewer • Updated Dec 19, 2025 • 500 • 497 • 6

upvoted a paper 28 days ago

MAI-UI Technical Report: Real-World Centric Foundation GUI Agents

Paper • 2512.22047 • Published Dec 26, 2025 • 28

upvoted an article 28 days ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

Oct 23, 2025

•

145

upvoted a paper about 2 months ago

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Paper • 2509.19736 • Published Sep 24, 2025 • 12

upvoted 2 collections about 2 months ago

GTA1

Collection

A collection of GUI grounding models trained with GRPO. • 5 items • Updated Oct 31, 2025 • 4

Elastic-Reasoning