GAD-Models Collection Model checkpoints of Black-Box On-Policy Distillation of Large Language Models • 5 items • Updated 19 days ago • 6
Black-Box On-Policy Distillation of Large Language Models Paper • 2511.10643 • Published 23 days ago • 46
Black-Box On-Policy Distillation of Large Language Models Paper • 2511.10643 • Published 23 days ago • 46
Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective Paper • 2509.22613 • Published Sep 26 • 9
DocReward: A Document Reward Model for Structuring and Stylizing Paper • 2510.11391 • Published Oct 13 • 27
Information-Preserving Reformulation of Reasoning Traces for Antidistillation Paper • 2510.11545 • Published Oct 13 • 1
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs Paper • 2510.24514 • Published Oct 28 • 21
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published Oct 30 • 26
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published Oct 30 • 26
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution Paper • 2510.25726 • Published Oct 29 • 45