CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 5 days ago • 79
G-Zero: Self-Play for Open-Ended Generation from Zero Data Paper • 2605.09959 • Published 7 days ago • 16
Dual-View Training for Instruction-Following Information Retrieval Paper • 2604.18845 • Published 28 days ago • 10
GameWorld: Towards Standardized and Verifiable Evaluation of Multimodal Game Agents Paper • 2604.07429 • Published Apr 8 • 121
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 503
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 628