- deepseek-ai/DeepSeek-R1
  Text Generation • 685B • Updated • 3.82M • 13.3k
- Congliu/Chinese-DeepSeek-R1-Distill-data-110k
  Viewer • Updated • 110k • 1.09k • 744
- The Ultra-Scale Playbook
  🌌 3.83k • The ultimate guide to training LLM on large GPU Clusters
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
  Paper • 2402.17764 • Published • 629
Sunny Ratnani
SunnyRatnaniMD
AI & ML interests
None yet
Organizations
Medical License Exam