CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published 23 days ago • 97
Lost in Backpropagation: The LM Head is a Gradient Bottleneck Paper • 2603.10145 • Published 12 days ago • 11
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights Paper • 2603.12228 • Published 10 days ago • 11
Prism-Δ: Differential Subspace Steering for Prompt Highlighting in Large Language Models Paper • 2603.10705 • Published 12 days ago • 11
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers Paper • 2603.12245 • Published 10 days ago • 18
WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing Paper • 2603.11593 • Published 11 days ago • 25
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published 17 days ago • 34
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published 11 days ago • 63
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs Paper • 2603.09906 • Published 13 days ago • 71
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs Paper • 2603.05890 • Published 17 days ago • 91
Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning Paper • 2603.04597 • Published 18 days ago • 204
Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents Paper • 2603.12634 • Published 10 days ago • 16
Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation Paper • 2603.12793 • Published 10 days ago • 37
Implicit Intelligence -- Evaluating Agents on What Users Don't Say Paper • 2602.20424 • Published 27 days ago • 4
Dropping Anchor and Spherical Harmonics for Sparse-view Gaussian Splatting Paper • 2602.20933 • Published 27 days ago • 4
Causal Motion Diffusion Models for Autoregressive Motion Generation Paper • 2602.22594 • Published 25 days ago • 7