-
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Paper • 2603.12180 • Published • 65 -
Flow-OPD: On-Policy Distillation for Flow Matching Models
Paper • 2605.08063 • Published • 98 -
Normalizing Trajectory Models
Paper • 2605.08078 • Published • 14 -
STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation
Paper • 2605.08029 • Published • 12
Collections
Discover the best community collections!
Collections including paper arxiv:2603.12180
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 153 -
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
Paper • 2603.15594 • Published • 149 -
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper • 2603.13398 • Published • 155 -
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper • 2603.06569 • Published • 120
-
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
Paper • 2602.17100 • Published • 4 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
Paper • 2603.00618 • Published -
Heterogeneous Agent Collaborative Reinforcement Learning
Paper • 2603.02604 • Published • 197
-
The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models
Paper • 2507.23313 • Published -
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
Paper • 2508.03448 • Published • 7 -
C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor
Paper • 2508.01311 • Published • 2 -
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model
Paper • 2505.21179 • Published • 13
-
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use
Paper • 2603.08262 • Published • 42 -
On-Policy Context Distillation for Language Models
Paper • 2602.12275 • Published • 4 -
Online Experiential Learning for Language Models
Paper • 2603.16856 • Published • 60 -
Mixture-of-Depths Attention
Paper • 2603.15619 • Published • 80
-
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Paper • 2603.12180 • Published • 65 -
In-Context Reinforcement Learning for Tool Use in Large Language Models
Paper • 2603.08068 • Published • 43 -
Generative Recursive Reasoning
Paper • 2605.19376 • Published • 29
-
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation
Paper • 2601.15369 • Published • 22 -
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model
Paper • 2601.15892 • Published • 54 -
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
Paper • 2601.16208 • Published • 55 -
NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
Paper • 2601.11004 • Published • 30
-
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Paper • 2505.13227 • Published • 46 -
facebook/natural_reasoning
Viewer • Updated • 1.15M • 2.49k • 570 -
nvidia/OpenMathReasoning
Viewer • Updated • 5.68M • 13.6k • 464 -
Search Arena: Analyzing Search-Augmented LLMs
Paper • 2506.05334 • Published • 19
-
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Paper • 2603.12180 • Published • 65 -
Flow-OPD: On-Policy Distillation for Flow Matching Models
Paper • 2605.08063 • Published • 98 -
Normalizing Trajectory Models
Paper • 2605.08078 • Published • 14 -
STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation
Paper • 2605.08029 • Published • 12
-
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use
Paper • 2603.08262 • Published • 42 -
On-Policy Context Distillation for Language Models
Paper • 2602.12275 • Published • 4 -
Online Experiential Learning for Language Models
Paper • 2603.16856 • Published • 60 -
Mixture-of-Depths Attention
Paper • 2603.15619 • Published • 80
-
dLLM: Simple Diffusion Language Modeling
Paper • 2602.22661 • Published • 153 -
OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data
Paper • 2603.15594 • Published • 149 -
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence
Paper • 2603.13398 • Published • 155 -
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders
Paper • 2603.06569 • Published • 120
-
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Paper • 2603.12180 • Published • 65 -
In-Context Reinforcement Learning for Tool Use in Large Language Models
Paper • 2603.08068 • Published • 43 -
Generative Recursive Reasoning
Paper • 2605.19376 • Published • 29
-
AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation
Paper • 2602.17100 • Published • 4 -
GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant
Paper • 2603.01059 • Published • 1 -
Multi-Domain Riemannian Graph Gluing for Building Graph Foundation Models
Paper • 2603.00618 • Published -
Heterogeneous Agent Collaborative Reinforcement Learning
Paper • 2603.02604 • Published • 197
-
OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation
Paper • 2601.15369 • Published • 22 -
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model
Paper • 2601.15892 • Published • 54 -
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
Paper • 2601.16208 • Published • 55 -
NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
Paper • 2601.11004 • Published • 30
-
The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models
Paper • 2507.23313 • Published -
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
Paper • 2508.03448 • Published • 7 -
C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor
Paper • 2508.01311 • Published • 2 -
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model
Paper • 2505.21179 • Published • 13
-
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Paper • 2505.13227 • Published • 46 -
facebook/natural_reasoning
Viewer • Updated • 1.15M • 2.49k • 570 -
nvidia/OpenMathReasoning
Viewer • Updated • 5.68M • 13.6k • 464 -
Search Arena: Analyzing Search-Augmented LLMs
Paper • 2506.05334 • Published • 19