Collections
Discover the best community collections!
Collections including paper arxiv:2506.09113
-
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 106 -
Video World Models with Long-term Spatial Memory
Paper • 2506.05284 • Published • 55 -
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
Paper • 2504.01724 • Published • 68 -
Kwai Keye-VL Technical Report
Paper • 2507.01949 • Published • 131
-
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 106 -
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Paper • 2506.08009 • Published • 30 -
Seeing Voices: Generating A-Roll Video from Audio with Mirage
Paper • 2506.08279 • Published • 27 -
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Paper • 2506.07848 • Published • 4
-
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 441 -
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 140 -
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
Paper • 2409.12576 • Published • 16 -
Transformer Explainer: Interactive Learning of Text-Generative Models
Paper • 2408.04619 • Published • 175
-
FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing
Paper • 2506.05046 • Published • 2 -
Image Editing As Programs with Diffusion Models
Paper • 2506.04158 • Published • 24 -
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Paper • 2506.07848 • Published • 4 -
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice
Paper • 2503.05978 • Published • 36
-
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 106 -
Video World Models with Long-term Spatial Memory
Paper • 2506.05284 • Published • 55 -
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
Paper • 2504.01724 • Published • 68 -
Kwai Keye-VL Technical Report
Paper • 2507.01949 • Published • 131
-
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 441 -
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 140 -
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
Paper • 2409.12576 • Published • 16 -
Transformer Explainer: Interactive Learning of Text-Generative Models
Paper • 2408.04619 • Published • 175
-
Seedance 1.0: Exploring the Boundaries of Video Generation Models
Paper • 2506.09113 • Published • 106 -
Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion
Paper • 2506.08009 • Published • 30 -
Seeing Voices: Generating A-Roll Video from Audio with Mirage
Paper • 2506.08279 • Published • 27 -
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Paper • 2506.07848 • Published • 4
-
FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing
Paper • 2506.05046 • Published • 2 -
Image Editing As Programs with Diffusion Models
Paper • 2506.04158 • Published • 24 -
PolyVivid: Vivid Multi-Subject Video Generation with Cross-Modal Interaction and Enhancement
Paper • 2506.07848 • Published • 4 -
MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice
Paper • 2503.05978 • Published • 36