Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems Paper • 2512.24385 • Published 7 days ago • 7
Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future Paper • 2512.16760 • Published 19 days ago • 12
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion Paper • 2512.04926 • Published Dec 4, 2025 • 41
RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning Paper • 2510.02240 • Published Oct 2, 2025 • 17
ReasonMap Collection A fine-grained visual reasoning benchmark (We show more question types in the extension dataset.) • 3 items • Updated Oct 1, 2025 • 8
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding Paper • 2508.01197 • Published Aug 2, 2025 • 5
Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport Paper • 2308.01779 • Published Aug 3, 2023 • 1
Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning Paper • 2503.00513 • Published Mar 1, 2025 • 1
Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction Paper • 2503.23109 • Published Mar 29, 2025