oguzhanercan 's Collections MultiModal Reasoning
updated
Perception-Aware Policy Optimization for Multimodal Reasoning
Paper
• 2507.06448
• Published • 48
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based
Reinforcement Learning
Paper
• 2507.05920
• Published • 12
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility,
Reasoning, and Efficiency
Paper
• 2508.18265
• Published • 217
Latent Chain-of-Thought for Visual Reasoning
Paper
• 2510.23925
• Published • 10
Thinking with Video: Video Generation as a Promising Multimodal
Reasoning Paradigm
Paper
• 2511.04570
• Published • 242
V-Thinker: Interactive Thinking with Images
Paper
• 2511.04460
• Published • 98
NVIDIA Nemotron Nano V2 VL
Paper
• 2511.03929
• Published • 30
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
Paper
• 2512.02014
• Published • 74
Latent Implicit Visual Reasoning
Paper
• 2512.21218
• Published • 69
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding
Paper
• 2512.17532
• Published • 68