view article Article DeepSeek-V4: a million-token context that agents can actually use about 20 hours ago • 13
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 54
view article Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models 3 days ago • 31
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 9 days ago • 63
view article Article Multimodal Embedding & Reranker Models with Sentence Transformers 16 days ago • 50
view article Article AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality Jan 21 • 33
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 133
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16, 2025 • 124
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 May 24, 2023 • 178
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 Dec 18, 2025 • 124
📝 Research & Long-Form Blog Posts Collection In-depth technical articles and research pieces published by Hugging Face • 12 items • Updated 11 days ago • 21
Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning Paper • 2512.20848 • Published Dec 23, 2025 • 42