Unlocking On-Policy Distillation for Any Model Family 📝 • Running • 98 • Visualize on-policy distillation for any model family
The Smol Training Playbook 📚 • Running on CPU Upgrade • Featured • 3.14k • The secrets to building world-class LLMs
Qwen3 Coder WebDev 🌍 • Paused • Agents • Featured • 1.06k • Generate HTML/React code from a web app description
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 416k • 1.6k
yentinglin/Mistral-Small-24B-Instruct-2501-reasoning Text Generation • 24B • Updated Apr 20, 2025 • 64 • 59
bartowski/DeepSeek-R1-Distill-Qwen-32B-abliterated-GGUF Text Generation • 33B • Updated Jan 25, 2025 • 36.5k • 142