Running on CPU Upgrade Featured 2.8k The Smol Training Playbook 📚 2.8k The secrets to building world-class LLMs
🦫 PIPer Collection All the resources for our paper "PIPer: On-Device Environment Setup via Online Reinforcement Learning"! • 9 items • Updated Oct 1, 2025 • 3
PIPer: On-Device Environment Setup via Online Reinforcement Learning Paper • 2509.25455 • Published Sep 29, 2025 • 37
view article Article CircleGuardBench: New Standard for Evaluating AI Moderation Models May 7, 2025 • 59
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29, 2025 • 92