Measuring Epistemic Resilience of LLMs Under Misleading Medical Context Paper • 2606.12291 • Published 21 days ago • 60
Data Journalist Agent: Transforming Data into Verifiable Multimodal Stories Paper • 2606.11176 • Published 22 days ago • 127
The Alignment Curse: Modality Alignment Supercharges Audio Attacks via Text Transfer Paper • 2602.02557 • Published May 29 • 21
D^2-Monitor: Dynamic Safety Monitoring for Diffusion LLMs via Hesitation-Aware Routing Paper • 2605.25893 • Published May 25 • 39
Forecasting Scientific Progress with Artificial Intelligence Paper • 2605.22681 • Published May 21 • 45
Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks Paper • 2511.15065 • Published Nov 19, 2025 • 78
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks Paper • 2509.01396 • Published Sep 1, 2025 • 58
LLM4SR: A Survey on Large Language Models for Scientific Research Paper • 2501.04306 • Published Jan 8, 2025 • 35