Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception Paper • 2503.13587 • Published Mar 17, 2025
More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models Paper • 2510.23574 • Published Oct 27, 2025
Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching Paper • 2507.02860 • Published Jul 3, 2025
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published 4 days ago • 131
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models Paper • 2603.25716 • Published 4 days ago • 131
Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding Paper • 2603.19235 • Published 11 days ago • 93
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously Paper • 2603.12262 • Published 18 days ago • 30
Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution Paper • 2511.19430 • Published Nov 24, 2025 • 7
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 237
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems Paper • 2508.07407 • Published Aug 10, 2025 • 99