SpatialTree: How Spatial Abilities Branch Out in MLLMs Paper • 2512.20617 • Published 5 days ago • 42
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling Paper • 2512.14614 • Published 12 days ago • 65
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 13 days ago • 101
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics Paper • 2512.12602 • Published 14 days ago • 40
LEO-RobotAgent: A General-purpose Robotic Agent for Language-driven Embodied Operator Paper • 2512.10605 • Published 17 days ago • 6
Task adaptation of Vision-Language-Action model: 1st Place Solution for the 2025 BEHAVIOR Challenge Paper • 2512.06951 • Published 21 days ago • 3
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing Paper • 2509.22186 • Published Sep 26 • 139
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published about 1 month ago • 214