SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation Article • 14 days ago • 16
Self-Improving Pretraining: using post-trained models to pretrain better models Paper • 2601.21343 • Published Jan 29 • 18
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries Article • 28 days ago • 107
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability Paper • 2601.18778 • Published Jan 26 • 42