Kartikey Rawat

carrycooldude

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

The Principles of Diffusion Models

upvoted a changelog 4 months ago

JSON Support in the Dataset Viewer

upvoted a changelog 4 months ago

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Organizations

upvoted a paper about 1 month ago

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24 • 59

upvoted 4 changelogs 4 months ago

Changelog

JSON Support in the Dataset Viewer

Jul 23 • 52

Changelog

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Jul 30 • 201

Changelog

Trending Papers

Jul 28 • 104

Changelog

Introducing a better Hugging Face CLI

Jul 25 • 93

upvoted 2 articles 6 months ago

Article

Why Maybe We're Measuring LLM Compression Wrong

Jun 21

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Jun 3 • 289

upvoted 4 articles 8 months ago

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022 • 118

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023 • 171

Article

Making LLMs lighter with AutoGPTQ and transformers

Aug 23, 2023

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

Aug 25, 2023

upvoted 2 collections over 1 year ago

Instruction Pre-Training

Collection

8 items • Updated Jun 21, 2024 • 26

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 5 days ago • 162

upvoted a paper over 1 year ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 71

upvoted 4 articles over 1 year ago

Article

A Dive into Vision-Language Models

Feb 3, 2023

Article

Vision Language Models Explained

Apr 11, 2024 • 496

Article

Fine-tune Llama 3 with ORPO

Apr 22, 2024 • 241

Article

CodeGemma - an official Google release for code LLMs

Apr 9, 2024 • 106

upvoted a paper over 1 year ago

Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Paper • 2403.20041 • Published Mar 29, 2024 • 34

upvoted a collection almost 2 years ago

Fellows Highlights Winter '23 (Dec) ❄️⛄️

Collection

14 items • Updated Dec 27, 2023 • 5
