pos-polish-gpt2-large

This model was trained from polish-gpt2-large on clarin-pl/nkjp-pos dataset. It achieves the following results on the evaluation set:

Model description

Part-of-speech tagging for Polish language. Tags description at the bottom of http://nkjp.pl/poliqarp/help/plse2.html

Datacollator:

from transformers import DataCollatorForTokenClassification
data_collator = DataCollatorForTokenClassification(tokenizer=tokenizer)

GPU: RTX 3090

Training time: 01:15:31

The following hyperparameters were used during training:

Training Loss	Epoch	Step	Validation Loss	Precision	Recall	F1	Accuracy
	0.0	0	3.8487	3.8487	3.8487	3.8487	3.8487
0.1952	1.0	2444	0.1942	0.8865	0.9304	0.9079	0.9426
0.1287	2.0	4889	0.1984	0.8903	0.9322	0.9108	0.9449
0.0832	3.0	7332	0.2290	0.8910	0.9328	0.9114	0.9450

Safetensors

Model size

0.8B params

Tensor type

F32