Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets Paper • 2201.02177 • Published Jan 6, 2022 • 6
Qwen 3.x MTP Collection MLX MTP drafter checkpoints for Qwen 3.x speculative decoding with mlx-vlm. • 12 items • Updated 18 days ago • 9
Apriel Collection ServiceNow Language Modeling Lab's first model family series • 5 items • Updated Dec 12, 2025 • 16
200+ Roleplay, Creative Writing, Uncensored, NSFW models. Collection Oldest models listed first, with Newest models at bottom of the page. Most repos have full examples, instructions, best settings and so on. • 280 items • Updated 4 days ago • 837
HyperCLOVA X SEED Collection HyperCLOVA X SEED is NAVER's lightweight open-source lineup with a strong focus on Korean language performance • 6 items • Updated Dec 24, 2025 • 42
Orpheus Multilingual Research Release Collection Beta Release of multilingual models. • 12 items • Updated Apr 10, 2025 • 114
OwO-ified Models V1.0 Collection This is a (better) series of experimental models fine-tuned for generating text in the "OwO/UwU" style, Are they smart? No, Are they fun? Mostly :3 • 7 items • Updated Feb 27, 2025 • 4