Datasets and models for ACL 2026 paper: Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems.
AI & ML interests
Natural Language Processing at Yale
Recent Activity
Papers
Step-level Optimization for Efficient Computer-use Agents
RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation
models 95
yale-nlp/RTriever-4B
Feature Extraction • 4B • Updated • 26 • 1
yale-nlp/AgentTrek-1.0-32B_webarena-verified_milestone-bert
0.1B • Updated • 9
yale-nlp/gpt-oss-20b_webarena-verified_stuck-bert
0.1B • Updated • 11
yale-nlp/AgentTrek-1.0-32B_webarena-verified_stuck-bert
0.1B • Updated • 13
yale-nlp/gpt-oss-20b_webarena-verified_milestone-bert
0.1B • Updated • 13
yale-nlp/modernbert-evocua-milestone-detector
0.1B • Updated • 13
yale-nlp/modernbert-evocua-stuck-detector
0.1B • Updated • 12
yale-nlp/modernbert-qwen-milestone-detector
0.1B • Updated • 14
yale-nlp/modernbert-qwen-stuck-detector
0.1B • Updated • 13
yale-nlp/Qwen3-VL-8B-Anchor-Windows
770k • Updated • 2
datasets 29
yale-nlp/Bright-Pro
Viewer • Updated • 530k • 101 • 1
yale-nlp/Anchor
Viewer • Updated • 30.6k • 40
yale-nlp/MedTutor
Updated • 270 • 2
yale-nlp/SciArena
Viewer • Updated • 13.2k • 52 • 25
yale-nlp/SciReas-Pro
Viewer • Updated • 1.36k • 16 • 1
yale-nlp/MSRS
Viewer • Updated • 2.44k • 84 • 2
yale-nlp/SciArena-Eval
Viewer • Updated • 2k • 6
yale-nlp/SciArena-with-paperbank
Viewer • Updated • 15.2k • 14
yale-nlp/SciDQA
Viewer • Updated • 2.94k • 137 • 2
yale-nlp/AbGen
Viewer • Updated • 3.3k • 24 • 3