TinyStories SAE Regularization Comparison
Collection
Comparison of different regularization methods for training sparse autoencoder (SAE) models on the layer 1 MLP of TinyStories 2L 33M. • 18 items
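The page does not say which regularization methods the collection compares. As a rough illustration of what "regularization" means for an SAE, the sketch below computes a reconstruction loss plus an L1 sparsity penalty on the feature activations, which is one common choice. The L1 penalty, the function name `sae_loss`, the layer sizes, and the coefficient are all assumptions made for illustration and are not taken from these checkpoints.

```python
import numpy as np

def sae_loss(x, W_enc, b_enc, W_dec, b_dec, l1_coeff):
    """Return (total, reconstruction, sparsity) for a batch x of shape (batch, d_in).

    Hypothetical helper: a ReLU encoder, a linear decoder, and an L1 penalty
    on the feature activations. Not the training code behind this collection.
    """
    f = np.maximum(0.0, x @ W_enc + b_enc)             # feature activations (ReLU)
    x_hat = f @ W_dec + b_dec                          # reconstruction of the input
    recon = np.mean(np.sum((x - x_hat) ** 2, axis=1))  # squared error, averaged over the batch
    sparsity = np.mean(np.sum(np.abs(f), axis=1))      # L1 norm of activations, averaged over the batch
    return recon + l1_coeff * sparsity, recon, sparsity

# Toy sizes chosen for the example only (assumed, not the real MLP width).
rng = np.random.default_rng(0)
d_in, d_sae = 8, 32
x = rng.normal(size=(4, d_in))
W_enc = rng.normal(scale=0.1, size=(d_in, d_sae))
b_enc = np.zeros(d_sae)
W_dec = rng.normal(scale=0.1, size=(d_sae, d_in))
b_dec = np.zeros(d_in)

total, recon, sparsity = sae_loss(x, W_enc, b_enc, W_dec, b_dec, l1_coeff=1e-3)
print(f"total={total:.4f} recon={recon:.4f} sparsity={sparsity:.4f}")
```

Swapping the `sparsity` term for a different penalty is the kind of change a regularization comparison would vary, while the reconstruction term stays the same.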
How to use lovish/SAE-tiny-stories-2L-33M-L1-263 with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("lovish/SAE-tiny-stories-2L-33M-L1-263", dtype="auto")
No model card