Asking like Socrates: Socrates helps VLMs understand remote sensing images Paper • 2511.22396 • Published about 1 month ago • 4
Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning Paper • 2512.05591 • Published 22 days ago • 16
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards Paper • 2512.00473 • Published 28 days ago • 25
SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning Paper • 2512.03244 • Published 25 days ago • 16
TreeGRPO: Tree-Advantage GRPO for Online RL Post-Training of Diffusion Models Paper • 2512.08153 • Published 19 days ago • 6
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder Paper • 2512.11749 • Published 15 days ago • 36
Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models Paper • 2512.13607 • Published 12 days ago • 26
REGLUE Your Latents with Global and Local Semantics for Entangled Diffusion Paper • 2512.16636 • Published 9 days ago • 25