Bringing BERT into modernity via both architecture changes and scaling
-
answerdotai/ModernBERT-base
Fill-Mask • 0.1B • Updated • 792k • 962 -
answerdotai/ModernBERT-large
Fill-Mask • 0.4B • Updated • 79.2k • 435 -
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper • 2412.13663 • Published • 158