Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Mechanistic Interpretability Benchmark

university
https://mib-bench.github.io
Activity Feed

AI & ML interests

Principled evaluation of mechanistic interpretability methods.

Nicolò Brunello's profile picture Aaron Mueller's profile picture Hadas Orgad's profile picture Yonatan Belinkov's profile picture Atticus Geiger's profile picture Amir Zur's profile picture Aruna S's profile picture Yaniv Nikankin's profile picture Michael Hanna's profile picture Dana Arad's profile picture Jing's profile picture Nikhil Prakash's profile picture Rohan Gupta's profile picture Ivan Arcuschin's profile picture Alessandro Stolfo's profile picture Martin Tutek's profile picture shun shao's profile picture Yik Siu Chan's profile picture Adam Belfki's profile picture Sarah Wiegreffe's profile picture

mib-bench 's models 3

mib-bench/mib-circuits-example

Updated Jul 23, 2025

mib-bench/mib-causalvariable-example

Updated May 29, 2025

mib-bench/interpbench

Updated May 17, 2025
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs