Laguna M.1 Collection Our most capable model to date, designed for long-horizon work. Apache 2.0. • 4 items • Updated about 7 hours ago • 17
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens Paper • 2603.23516 • Published Mar 6 • 53
view article Article Profiling in PyTorch (Part 2): From nn.Linear to a Fused MLP +3 ariG23498, ror, sergiopaniego, pcuenq, sayakpaul • 19 days ago • 50
view article Article How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent nvidia • 25 days ago • 66
PP-OCRv6 Collection From 1.5M to 34.5M Parameters, Surpassing Billion-Scale VLMs on OCR Tasks • 19 items • Updated 14 days ago • 98
view article Article Introducing North Mini Code: Cohere’s First Model For Developers CohereLabs • 20 days ago • 78
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 22 days ago • 64
Zamba2-VL Collection A suite of vision-language models based on Zamba2. • 3 items • Updated 21 days ago • 5
Multi-Faceted Interactivity Alignment in Full-Duplex Speech Models Paper • 2606.11167 • Published 20 days ago • 5
Interactivity Alignment Collection Full-duplex speech models post-trained with reinforcement learning for improved conversational interactivity. • 4 items • Updated 19 days ago • 6
Self-Evolving Vision-Language Models for Image Quality Assessment via Voting and Ranking Paper • 2509.25787 • Published Jan 27 • 3