G$^2$RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance Paper • 2508.13023 • Published Aug 18, 2025 • 1
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows Paper • 2512.05150 • Published 29 days ago • 74
UniMedVL: Unifying Medical Multimodal Understanding and Generation Through Observation-Knowledge-Analysis Paper • 2510.15710 • Published Oct 17, 2025 • 6
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark Paper • 2402.02242 • Published Feb 3, 2024
dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models Paper • 2512.19433 • Published 10 days ago • 3
Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling Paper • 2507.17801 • Published Jul 23, 2025 • 1
Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation Paper • 2507.13032 • Published Jul 17, 2025
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding Paper • 2510.06308 • Published Oct 7, 2025 • 54
Enhancing Long Video Understanding via Hierarchical Event-Based Memory Paper • 2409.06299 • Published Sep 10, 2024
Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM Paper • 2505.18110 • Published May 23, 2025 • 1
No Fear of Classifier Biases: Neural Collapse Inspired Federated Learning with Synthetic and Fixed Classifier Paper • 2303.10058 • Published Mar 17, 2023
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models Paper • 2004.12406 • Published Apr 26, 2020 • 1
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology Paper • 2401.16355 • Published Jan 29, 2024 • 2
FedBR: Improving Federated Learning on Heterogeneous Data via Local Learning Bias Reduction Paper • 2205.13462 • Published May 26, 2022