One Pass Is Not Enough: Recursive Latent Refinement for Generative Models Paper • 2605.15309 • Published 9 days ago • 1
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation Paper • 2605.18739 • Published 5 days ago • 108
SANA-WM Collection SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer • 2 items • Updated 5 days ago • 3
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 9 days ago • 80
WorldJen: An End-to-End Multi-Dimensional Benchmark for Generative Video Models Paper • 2605.03475 • Published 18 days ago • 8
Article Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts NucleusAI • Apr 14 • 11
Article How I contributed a new model to the Transformers library using Codex nielsr • Mar 30 • 52
Article Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines +2 YiYiXu, OzzyGT, dn6, sayakpaul • Mar 5 • 51
6Bit-Diffusion: Inference-Time Mixed-Precision Quantization for Video Diffusion Models Paper • 2603.18742 • Published Mar 19 • 10
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published Mar 12 • 22
ID-LoRA: Identity-Driven Audio-Video Personalization with In-Context LoRA Paper • 2603.10256 • Published Mar 10 • 23
LiteAttention: A Temporal Sparse Attention for Diffusion Transformers Paper • 2511.11062 • Published Nov 14, 2025 • 33
Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published Mar 23 • 125
Article Make your ZeroGPU Spaces go brrr with ahead-of-time compilation +2 cbensimon, sayakpaul, linoyts, multimodalart • Sep 2, 2025 • 77
Article Fast LoRA inference for Flux with Diffusers and PEFT sayakpaul, BenjaminB • Jul 23, 2025 • 54
LTX-2.3 Collection LTX-2.3 base models, quantized models and accompanying LoRAs and IC-LoRAs • 10 items • Updated 12 days ago • 54
Article Ulysses Sequence Parallelism: Training with Million-Token Contexts kashif, stas • Mar 9 • 28