Paper: A UNIVERSITY-LEVEL BENCHMARK FOR EVALUATING MATHEMATICAL SKILLS IN LLMS
Toloka
company
Verified
AI & ML interests
Human In The Loop - data labeling, model training and hosting, human verification, and more
Organization Card
Hey, this is Toloka!
models
4
toloka/prompts_reward_model
Text Classification
•
82.1M
•
Updated
•
17
toloka/gpt2-large-supervised-prompt-writing
Text Generation
•
0.8B
•
Updated
•
10
toloka/gpt2-large-rl-prompt-writing
Text Generation
•
0.8B
•
Updated
•
10
•
3
toloka/t5-large-for-text-aggregation
Summarization
•
Updated
•
12
•
7
datasets
11
toloka/VOX-DUB
Viewer
•
Updated
•
7.58k
•
218
•
10
toloka/JEEM
Viewer
•
Updated
•
2.2k
•
108
•
11
toloka/beemo
Viewer
•
Updated
•
2.19k
•
435
•
18
toloka/u-math
Viewer
•
Updated
•
1.1k
•
463
•
24
toloka/mu-math
Viewer
•
Updated
•
1.08k
•
99
•
23
toloka/CLESC
Viewer
•
Updated
•
500
•
34
•
2
toloka/VoxDIY-RusNews
Updated
•
614
•
3
toloka/CrowdSpeech
Updated
•
172
•
5
toloka/crowdkit-datasets
Updated
•
2.6k
toloka/WSDMCup2023
Viewer
•
Updated
•
46.2k
•
221
•
5