Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9, 2025 • 140
view article Article The Open Source Community is backing OpenEnv for Agentic RL +16 burtenshaw, spisakjo, lysandre, darktex, willcb, qjoy, pawalt, cwing-nv, danielhanchen, andrewzhou, thegovind, shimmyshimmer, Hamid-Nazeri, Sanyam, zkwentz, emre0, lewtun, sergiopaniego • 14 days ago • 89
Qwen 3.x MTP Collection MLX MTP drafter checkpoints for Qwen 3.x speculative decoding with mlx-vlm. • 12 items • Updated 20 days ago • 9
SeedVR2 (MLX-Swift) Collection SeedVR2-3B (ByteDance, ICLR 2026) one-step diffusion super-resolution, MLX-Swift weights for on-device Apple Silicon. fp16 + int8. • 2 items • Updated 15 days ago • 3
LeRobot Pi0.5 - Robotics Foundation Model v0.5 Collection Hugging Face LeRobot Pi0.5 intermediate robotics model with improved action generation capabilities • 4 items • Updated Jan 16 • 1
BERT release Collection Regroups the original BERT models released by the Google team. Except for the models marked otherwise, the checkpoints support English. • 8 items • Updated Mar 12 • 44
view article Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 909
Scaling Laws for Code: Every Programming Language Matters Paper • 2512.13472 • Published Dec 15, 2025 • 17
Context as a Tool: Context Management for Long-Horizon SWE-Agents Paper • 2512.22087 • Published Dec 26, 2025 • 4
Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing Paper • 2512.23611 • Published Dec 29, 2025 • 7
Ming 2.0 Collection Ming is the multi-modal series of any-to-any models developed by Ant Ling team. • 14 items • Updated 6 days ago • 37
dqnCode Collection dqnCode is a set of small-sized LLMs that are capable of running on basic consumer hardware, precision trained on coding datasets. NOT FULLY RELEASED! • 2 items • Updated Feb 24 • 2
Claude 4.5 Sonnet Collection Distilled models and datasets for Claude 4.5 Sonnet. • 5 items • Updated Dec 20, 2025 • 14