3 28 4

minghao

Liam-Liu

liam-liu-1b262631a

AI & ML interests

LLM, AD

Recent Activity

authored a paper 11 days ago

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

upvoted a paper 2 months ago

Reasoning with Sampling: Your Base Model is Smarter Than You Think

upvoted a paper 3 months ago

ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems

View all activity

Organizations

authored a paper 11 days ago

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published Dec 14, 2025 • 43

upvoted a paper 2 months ago

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published Oct 16, 2025 • 47

upvoted a paper 3 months ago

ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems

Paper • 2510.11652 • Published Oct 13, 2025 • 29

authored 10 papers 3 months ago

OAgents: An Empirical Study of Building Effective Agents

Paper • 2506.15741 • Published Jun 17, 2025 • 35

IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video?

Paper • 2509.24709 • Published Sep 29, 2025 • 6

ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems

Paper • 2510.11652 • Published Oct 13, 2025 • 29

Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures

Paper • 2510.14616 • Published Oct 16, 2025 • 12

COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes

Paper • 2510.14763 • Published Oct 16, 2025 • 13

A$^2$FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

Paper • 2510.12838 • Published Oct 13, 2025 • 24

upvoted 2 papers 3 months ago

COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes

Paper • 2510.14763 • Published Oct 16, 2025 • 13

SimKO: Simple Pass@K Policy Optimization

Paper • 2510.14807 • Published Oct 16, 2025 • 10

upvoted 2 papers 4 months ago

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Paper • 2509.26346 • Published Sep 30, 2025 • 18

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 141

authored a paper 4 months ago

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 149

upvoted 2 papers 4 months ago

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Paper • 2509.04292 • Published Sep 4, 2025 • 57

Reverse-Engineered Reasoning for Open-Ended Generation

Paper • 2509.06160 • Published Sep 7, 2025 • 149

minghao

AI & ML interests

Recent Activity

Organizations

Liam-Liu's activity