Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models Paper • 2606.25041 • Published 9 days ago • 110
TryOnCrafter: Unleashing Camera Trajectories for Realistic Video Virtual Try-on via a Renderable 4D Try-on Proxy Paper • 2606.26092 • Published 8 days ago • 6
LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing Paper • 2606.06042 • Published 28 days ago • 24
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published May 28 • 59
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation Paper • 2605.25874 • Published May 25 • 103
Rethinking Cross-Layer Information Routing in Diffusion Transformers Paper • 2605.20708 • Published May 20 • 111
DVAO: Dynamic Variance-adaptive Advantage Optimization for Multi-reward Reinforcement Learning Paper • 2605.25604 • Published May 25 • 138
Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration Paper • 2605.17423 • Published May 17 • 34
FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization Paper • 2605.15824 • Published May 15 • 67
D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models Paper • 2605.05204 • Published May 6 • 28
CEPO: RLVR Self-Distillation using Contrastive Evidence Policy Optimization Paper • 2605.19436 • Published May 19 • 14
Stress-Testing the Reasoning Competence of LLMs With Proofs Under Minimal Formalism Paper • 2605.12524 • Published Apr 7 • 4
No One Knows the State of the Art in Geospatial Foundation Models Paper • 2605.12678 • Published May 12 • 5
AuralSAM2: Enabling SAM2 Hear Through Pyramid Audio-Visual Feature Prompting Paper • 2506.01015 • Published May 14 • 3
Raster2Seq: Polygon Sequence Generation for Floorplan Reconstruction Paper • 2602.09016 • Published May 11 • 5
Physics-R1: An Audited Olympiad Corpus and Recipe for Visual Physics Reasoning Paper • 2605.14040 • Published May 13 • 5
Forgetting That Sticks: Quantization-Permanent Unlearning via Circuit Attribution Paper • 2605.15138 • Published May 14 • 7
Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization Paper • 2605.15980 • Published May 15 • 36
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published Apr 21 • 252
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications Paper • 2508.16279 • Published Aug 22, 2025 • 67