arxiv:2603.09229
andy-yang
andy-yang
AI & ML interests
None yet
Recent Activity
authored a paper 7 days ago
BlendServe: Optimizing Offline Inference for Auto-regressive Large
Models with Resource-aware Batching authored a paper 7 days ago
Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for
Long Video Generation authored a paper 7 days ago
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable
Sparse-Linear Attention