24 5

Austin Liu

Austin362667

austin362667

AI & ML interests

None yet

Recent Activity

upvoted an article 6 days ago

The PR you would have opened yourself

upvoted a paper 21 days ago

Embarrassingly Simple Self-Distillation Improves Code Generation

updated a model about 1 month ago

Austin362667/Qwen3-1.7B-MLX-bf16-python-18k-alpaca

View all activity

Organizations

None yet

upvoted an article 6 days ago

Article

The PR you would have opened yourself

7 days ago

•

upvoted a paper 21 days ago

Embarrassingly Simple Self-Distillation Improves Code Generation

Paper • 2604.01193 • Published 22 days ago • 46

upvoted 2 articles about 2 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

May 11, 2023

•

Article

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

Aug 17, 2022

•

130

upvoted a collection about 2 months ago

SiliconMind-V1

Collection

4 items • Updated Feb 11 • 2

upvoted 2 articles 2 months ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

May 20, 2025

•

Article

KV Cache from scratch in nanoVLM

Jun 4, 2025

•

116

upvoted an article 3 months ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

•

upvoted an article 5 months ago

Article

Continuous batching from first principles

Nov 25, 2025

•

364

upvoted an article 6 months ago

Article

Key Insights into the Law of Vision Representations in MLLMs

Sep 2, 2024

•

upvoted an article 7 months ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

303

upvoted 6 articles 9 months ago

Article

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

Jun 11, 2024

•

Article

Parquet Content-Defined Chunking

Jul 25, 2025

•

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Jun 3, 2025

•

345

Article

TimeScope: How Long Can Your Video Large Multimodal Model Go?

Jul 23, 2025

•

Article

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

Jun 28, 2025

•

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20, 2025

•

337

upvoted an article 10 months ago

Article

Introducing Cosmos Predict-2: A Foundation For Your Own World Model

Jun 17, 2025

•

upvoted an article 11 months ago

Article

🐯 Liger GRPO meets TRL

May 25, 2025

•

upvoted an article about 1 year ago

Article

Introduction to 3D Gaussian Splatting

Sep 18, 2023

•

134

Austin Liu

AI & ML interests

Recent Activity

Organizations

Austin362667's activity

The PR you would have opened yourself

Assisted Generation: a new direction toward low-latency text generation

A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

KV Cache from scratch in nanoVLM

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Continuous batching from first principles

Key Insights into the Law of Vision Representations in MLLMs

KV Caching Explained: Optimizing Transformer Inference Efficiency

An Analysis of Chinese LLM Censorship and Bias with Qwen 2 Instruct

Parquet Content-Defined Chunking

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

TimeScope: How Long Can Your Video Large Multimodal Model Go?

⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch

SmolVLM2: Bringing Video Understanding to Every Device

Introducing Cosmos Predict-2: A Foundation For Your Own World Model

🐯 Liger GRPO meets TRL

Introduction to 3D Gaussian Splatting