view article Article How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs 4 days ago • 38
TurboQuant: Online Vector Quantization with Near-optimal Distortion Rate Paper • 2504.19874 • Published Apr 28, 2025 • 32
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents Paper • 2604.06132 • Published 4 days ago • 107
Qwopus3.5-v3 Collection 🌟Qwopus3.5-v3 is the latest model in the Claude series. • 12 items • Updated 1 day ago • 68
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26, 2025 • 188
PII & De-Identification Collection Models for extracting PII entities and de-identifying clinical text, with support for HIPAA and GDPR compliance. • 278 items • Updated Mar 10 • 35
Qianfan-OCR: A Unified End-to-End Model for Document Intelligence Paper • 2603.13398 • Published about 1 month ago • 153
AfriNLLB Collection AfriNLLB: Efficient Translation Models for African Languages • 11 items • Updated Feb 15 • 4
Nemotron-Terminal Collection We are releasing Nemotron-Terminal models and training datasets. • 5 items • Updated 5 days ago • 34
LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model Paper • 2603.01068 • Published Mar 1 • 22
SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published Feb 27 • 88
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens Paper • 2603.02138 • Published Mar 2 • 151
Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper Paper • 2511.04583 • Published Nov 6, 2025 • 5