Submitted by Kaicheng Yang
UniDoc-RL: Coarse-to-Fine Visual RAG with Hierarchical Actions and Dense Rewards
DeepGlint