AnomalyVFM -- Transforming Vision Foundation Models into Zero-Shot Anomaly Detectors Paper • 2601.20524 • Published 3 days ago • 2
Faithful GRPO: Improving Visual Spatial Reasoning in Multimodal Language Models via Constrained Policy Optimization Paper • 2604.08476 • Published 2 days ago • 3
MolmoWeb: Open Visual Web Agent and Open Data for the Open Web Paper • 2604.08516 • Published 3 days ago • 29
Test-Time Scaling Makes Overtraining Compute-Optimal Paper • 2604.01411 • Published 11 days ago • 25
Token Warping Helps MLLMs Look from Nearby Viewpoints Paper • 2604.02870 • Published 9 days ago • 30
Less Detail, Better Answers: Degradation-Driven Prompting for VQA Paper • 2604.04838 • Published 6 days ago • 11
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression Paper • 2604.04921 • Published 6 days ago • 99
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published 10 days ago • 211
REAM: Merging Improves Pruning of Experts in LLMs Paper • 2604.04356 • Published 6 days ago • 3
DARE: Diffusion Large Language Models Alignment and Reinforcement Executor Paper • 2604.04215 • Published 7 days ago • 18
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 4 days ago • 28
IMU-1: Sample-Efficient Pre-training of Small Language Models Paper • 2602.02522 • Published Jan 25 • 7
COSMOS: Predictable and Cost-Effective Adaptation of LLMs Paper • 2505.01449 • Published Apr 30, 2025 • 4
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models Paper • 2403.07384 • Published Mar 12, 2024 • 3
Less is More: Improving LLM Alignment via Preference Data Selection Paper • 2502.14560 • Published Feb 20, 2025 • 1