Bartosz Cywiński

bcywinski

AI & ML interests

Mechanistic Interpretability

Recent Activity

updated a collection 2 days ago
Llama-3.1-8B-Instruct-taboo
updated a collection 2 days ago
Eliciting Secret Knowledge from Language Models
updated a model 11 days ago
bcywinski/gemma-2-9b-it-occupation-doctor
View all activity

Organizations

None yet