Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Mechanistic Interpretability Benchmark

university
https://mib-bench.github.io
Activity Feed

AI & ML interests

Principled evaluation of mechanistic interpretability methods.

Recent Activity

nepp1d0  authored a paper 4 days ago
InTraVisTo: Inside Transformer Visualisation Tool
amueller  updated a Space 26 days ago
mib-bench/leaderboard
hij  authored a paper 4 months ago
Blackbox Model Provenance via Palimpsestic Membership Inference
View all activity

Aaron Mueller's profile pictureSarah Wiegreffe's profile pictureIvan Arcuschin's profile pictureDana Arad's profile pictureYaniv Nikankin's profile pictureAruna S's profile pictureRohan Gupta's profile pictureMichael Hanna's profile pictureshun shao's profile pictureAdam Belfki's profile pictureAtticus Geiger's profile pictureYik Siu Chan's profile pictureAmir Zur's profile pictureAlessandro Stolfo's profile pictureNikhil Prakash's profile pictureJing's profile pictureHadas Orgad's profile pictureMartin Tutek's profile pictureYonatan Belinkov's profile pictureNicolò Brunello's profile picture

mib-bench 's collections 1

MIB Datasets
The tasks and counterfactuals from the Mechanistic Interpretability Benchmark.
  • mib-bench/ioi

    Viewer • Updated May 29, 2025 • 21k • 1.5k
  • mib-bench/copycolors_mcqa

    Viewer • Updated Jan 16, 2025 • 1.89k • 318
  • mib-bench/arithmetic_addition

    Viewer • Updated May 31, 2025 • 40.4k • 217
  • mib-bench/arithmetic_subtraction

    Viewer • Updated May 31, 2025 • 20.9k • 171
MIB Datasets
The tasks and counterfactuals from the Mechanistic Interpretability Benchmark.
  • mib-bench/ioi

    Viewer • Updated May 29, 2025 • 21k • 1.5k
  • mib-bench/copycolors_mcqa

    Viewer • Updated Jan 16, 2025 • 1.89k • 318
  • mib-bench/arithmetic_addition

    Viewer • Updated May 31, 2025 • 40.4k • 217
  • mib-bench/arithmetic_subtraction

    Viewer • Updated May 31, 2025 • 20.9k • 171
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs