2 2 6

Joachim Baumann

joebaumann

https://joe-baumann.com/

AI & ML interests

Postdoc @ Stanford

Recent Activity

liked a dataset 4 days ago

SALT-NLP/SWE-chat

updated a dataset 6 days ago

SALT-NLP/SWE-chat

commentedon a paper 7 days ago

SWE-chat: Coding Agent Interactions From Real Users in the Wild

View all activity

Organizations

liked a dataset 4 days ago

SALT-NLP/SWE-chat

Viewer • Updated 5 days ago • 2.73M • 1.65k • 36

updated a dataset 6 days ago

SALT-NLP/SWE-chat

Viewer • Updated 5 days ago • 2.73M • 1.65k • 36

commented a paper 7 days ago

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Paper • 2604.20779 • Published 13 days ago • 14 •

published a dataset 7 days ago

SALT-NLP/SWE-chat

Viewer • Updated 5 days ago • 2.73M • 1.65k • 36

authored 2 papers 7 days ago

SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors

Paper • 2510.17516 • Published Oct 20, 2025 • 2

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Paper • 2604.20779 • Published 13 days ago • 14

upvoted a paper 11 days ago

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Paper • 2604.20779 • Published 13 days ago • 14

commented a paper 11 days ago

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Paper • 2604.20779 • Published 13 days ago • 14 •

liked a dataset 3 months ago

zai-org/terminal-bench-2-verified

Updated Feb 27 • 2.41k • 70

liked a dataset 4 months ago

aurman/GoogleTrendArchive

Viewer • Updated Mar 20 • 7.64M • 181 • 1

liked a dataset 7 months ago

pitehu/SimBench

Preview • Updated Oct 27, 2025 • 151 • 9

upvoted a paper 8 months ago

Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation

Paper • 2509.08825 • Published Sep 10, 2025 • 3

commented a paper 8 months ago

Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation

Paper • 2509.08825 • Published Sep 10, 2025 • 3 •

authored 2 papers 8 months ago

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons

Paper • 2503.05731 • Published Feb 19, 2025 • 3

Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation

Paper • 2509.08825 • Published Sep 10, 2025 • 3

commented a paper 8 months ago

Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation

Paper • 2509.08825 • Published Sep 10, 2025 • 3 •

liked a model almost 2 years ago

NousResearch/Genstruct-7B

Text Generation • 7B • Updated Jun 7, 2025 • 137 • • 406

liked a dataset about 2 years ago

vblagoje/cc_news

Viewer • Updated Jan 4, 2024 • 708k • 6.77k • 66

Joachim Baumann

AI & ML interests

Recent Activity

Organizations

joebaumann's activity