Article • Transformers v5: Simple model definitions powering the AI ecosystem • about 1 month ago • 259
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains Paper • 2511.04962 • Published Nov 7 • 53
The Ultra-Scale Playbook 🌌 • 3.61k • The ultimate guide to training LLM on large GPU Clusters
The Smol Training Playbook 📚 • 2.75k • The secrets to building world-class LLMs
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 158
DiffGuard: Text-Based Safety Checker for Diffusion Models Paper • 2412.00064 • Published Nov 25, 2024 • 3