Collections

Collections including paper arxiv:2504.11393
LLM - Pretraining Dataset Research
Collection by
Nov 28, 2025
Reading-Paper-List
Collection by
Apr 22, 2025
DataDecide
A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale.
Papers
Collection by
23 days ago
paper
Collection by
Jul 2, 2025
CAMV
Collection by
Jun 28, 2025
papers
Collection by
Jul 2, 2025
Datasets
Collection by
Sep 26, 2025