Paraphrases of the max-500-token subset of the MMLU dataset. We train models on both paraphrased and non-paraphrased examples to increase robustness.
Róbert Belanec
rbelanec
AI & ML interests
Parameter-Efficient Fine-Tuning, Multi-Task Transfer-Learning, Model Merging, Efficient Training
Recent Activity
updated a model about 19 hours ago
rbelanec/pretrained_weights_42
published a model about 19 hours ago
rbelanec/pretrained_weights_42
updated a model about 20 hours ago
rbelanec/train_mnli_42_1775736849