Xing Han Lù

xhluca

https://xinghanlu.com

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Would you still call this Dax? Novel Visual References in VLMs and Humans

Paper • 2606.05409 • Published 12 days ago • 8

upvoted a paper 19 days ago

CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents

Paper • 2605.25624 • Published 21 days ago • 33

upvoted a paper 24 days ago

Forecasting Downstream Performance of LLMs With Proxy Metrics

Paper • 2605.18607 • Published 28 days ago • 14

updated a Space 28 days ago

Agent Reward Bench Leaderboard

🥇

Leaderboard for AgentRewardBench

liked a model about 2 months ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 7 days ago • 2.93M • 4.84k

New activity in McGill-NLP/A3-Qwen3.5-2B about 2 months ago

nice work

#2 opened about 2 months ago by kalle07

updated 3 models about 2 months ago

liked a model 2 months ago

McGill-NLP/A3-Qwen3.5-9B

Image-Text-to-Text • 9B • Updated Apr 16 • 410 • 6

New activity in huggingface/InferenceSupport 2 months ago

McGill-NLP/A3-Qwen3.5-9B

👍 1

#9270 opened 2 months ago by xhluca

updated a collection 2 months ago

A3: Agent-as-Annotators

Collection

Models and data from "Structured Distillation of Web Agent Capabilities Enables Generalization" (arXiv:2604.07776) • 6 items • Updated Apr 14 • 1

New activity in ServiceNow/browsergym-leaderboard 2 months ago

Add A3-Qwen3.5-9B WorkArena-L2 results (10.6%) and update all comments

#14 opened 2 months ago by xhluca

Add A3-Qwen3.5-9B WorkArena-L2 results (9.7%)

#13 opened 2 months ago by xhluca

updated a dataset 2 months ago

xhluca/a3-qwen-3.5-9b-trajectories

Updated Apr 13 • 14

New activity in McGill-NLP/A3-Qwen3.5-2B 2 months ago

Improve model card: add metadata, library name, and citation

#1 opened 2 months ago by nielsr

New activity in McGill-NLP/A3-Qwen3.5-4B 2 months ago

Update metadata and improve model card

#1 opened 2 months ago by nielsr

New activity in McGill-NLP/A3-Qwen3.5-9B 2 months ago

Update pipeline tag to image-text-to-text and add transformers metadata

#1 opened 2 months ago by nielsr

authored a paper 2 months ago

Structured Distillation of Web Agent Capabilities Enables Generalization

Paper • 2604.07776 • Published Apr 9 • 23

published a dataset 2 months ago

xhluca/a3-qwen-3.5-9b-trajectories

Updated Apr 13 • 14
