ENPIRE: Agentic Robot Policy Self-Improvement in the Real World Paper • 2606.19980 • Published 2 days ago • 7
Retrieve, Don't Retrain: Extending Vision Language Action Models to New Tasks at Test Time Paper • 2606.15631 • Published 6 days ago • 15
Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories Paper • 2606.11176 • Published 11 days ago • 118
JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence Paper • 2606.14777 • Published 10 days ago • 190
HarnessX: A Composable, Adaptive, and Evolvable Agent Harness Foundry Paper • 2606.14249 • Published 8 days ago • 41
From Chatbot to Digital Colleague: The Paradigm Shift Toward Persistent Autonomous AI Paper • 2606.14502 • Published 8 days ago • 51
view article Article Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action nvidia • 19 days ago • 83
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 23 days ago • 145
AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security Paper • 2605.29801 • Published 23 days ago • 143
ARIS: Autonomous Research via Adversarial Multi-Agent Collaboration Paper • 2605.03042 • Published May 4 • 136
view article Article NEO-unify: Building Native Multimodal Unified Models End to End sensenova • Mar 5 • 165
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 229
V^{2}-SAM: Marrying SAM2 with Multi-Prompt Experts for Cross-View Object Correspondence Paper • 2511.20886 • Published Nov 25, 2025 • 1