view changelog Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face Jul 30 β’ 201
view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 Jun 3 β’ 289
view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 β’ 118
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 β’ 171
view article Article Introduction to Quantization cooked in π€ with ππ§βπ³ Aug 25, 2023 β’ 37
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. β’ 4 items β’ Updated 5 days ago β’ 162
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Paper β’ 2406.06525 β’ Published Jun 10, 2024 β’ 71
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs Paper β’ 2403.20041 β’ Published Mar 29, 2024 β’ 34