distilbert-emotion

distilbert-base-uncased fine-tuned on the emotion dataset to classify a short English sentence into one of six emotions: sadness, joy, love, anger, fear, surprise.

Built by Laela Zorana. Code, tests, and a live demo:

GitHub: https://github.com/LaelaZorana/distilbert-emotion
Demo Space: https://huggingface.co/spaces/LaelaZ/distilbert-emotion

Usage

from transformers import pipeline
clf = pipeline("text-classification", model="LaelaZ/distilbert-emotion", top_k=None)
clf("i can't stop smiling, today went better than i ever hoped")
# -> [{'label': 'joy', 'score': 0.99}, ...]

Evaluation

Evaluated on the held-out test split (2,000 examples the model never trained on). Macro F1 is reported alongside accuracy because the classes are imbalanced (joy and sadness dominate, surprise is rare), so accuracy alone would overstate performance on the rare classes.

metric	score
accuracy	0.920
macro F1	0.874
weighted F1	0.920

Per-class F1: sadness 0.96, joy 0.94, anger 0.92, fear 0.90, love 0.81, surprise 0.72. The two weakest classes are the two rarest (love n=159, surprise n=66), which is why macro F1 (0.874) sits below accuracy (0.920): macro F1 weights every class equally and exposes the rare-class weakness that accuracy hides.

The repository also surfaces the model's confidently wrong predictions (the loudest mistakes), which is where the model's real limits show.

Training

Base model: distilbert-base-uncased
Dataset: dair-ai/emotion (split config), 5,000-example training subset
Objective: cross-entropy over 6 classes
Optimizer: AdamW, lr 2e-5, linear warmup (10%), gradient clipping at 1.0
Max sequence length: 64, batch size 16, 3 epochs, CPU

Limitations

The emotion dataset is short, informal English (tweet-style). The model can be confidently wrong on sarcasm, mixed feelings, or text unlike the training distribution. It predicts exactly one of six emotions and has no "neutral" or "other" class.

License

MIT.

Downloads last month: 24

Safetensors

Model size

67M params

Tensor type

F32

Model tree for LaelaZ/distilbert-emotion

Base model

distilbert/distilbert-base-uncased

Finetuned

(11726)

this model

Dataset used to train LaelaZ/distilbert-emotion

Spaces using LaelaZ/distilbert-emotion 5

Evaluation results

Accuracy on emotion
test set self-reported

0.920
Macro F1 on emotion
test set self-reported

0.874