230912GPT2_fine_tuned_SP_GPT2_config_ESM_tokenizer_6kSwissProt_20epochs

This model is a fine-tuned version of gpt2 on the None dataset. It achieves the following results on the evaluation set:

Model description

More information needed

More information needed

More information needed

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss
4.1505	0.99	41	2.7547
2.6987	2.0	83	2.6860
2.7257	2.99	124	2.6648
2.6391	4.0	166	2.6580
2.6872	4.99	207	2.6411
2.6053	6.0	249	2.6419
2.6612	6.99	290	2.6301
2.5815	8.0	332	2.6124
2.6307	8.99	373	2.5928
2.5507	10.0	415	2.5764
2.5782	10.99	456	2.5471
2.4578	12.0	498	2.4726
2.43	12.99	539	2.3949
2.2695	14.0	581	2.3361
2.201	14.99	622	2.2559
1.999	16.0	664	2.1804
1.8843	16.99	705	2.1191
1.6932	18.0	747	2.1015
1.6252	18.99	788	2.1163
1.5326	19.76	820	2.1215

Base model

Finetuned

(2041)

this model