Conference Publications
In-Context Algebra
Eric Todd, Jannik Brinkmann, Rohit Gandikota, David Bau.
The Fourteenth International Conference on Learning Representations (ICLR), 2026.
Eric Todd, Jannik Brinkmann, Rohit Gandikota, David Bau.
The Fourteenth International Conference on Learning Representations (ICLR), 2026.
The Dual-Route Model of Induction
Sheridan Feucht, Eric Todd, Byron C. Wallace, David Bau.
The Second Conference on Language Modeling (COLM), 2025.
Sheridan Feucht, Eric Todd, Byron C. Wallace, David Bau.
The Second Conference on Language Modeling (COLM), 2025.
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals
Jaden Fiotto-Kaufman, Alexander R. Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla Brodley, Arjun Guha, Jonathan Bell, Byron C. Wallace, David Bau.
The Thirteenth International Conference on Learning Representations (ICLR), 2025.
Jaden Fiotto-Kaufman, Alexander R. Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla Brodley, Arjun Guha, Jonathan Bell, Byron C. Wallace, David Bau.
The Thirteenth International Conference on Learning Representations (ICLR), 2025.
Function Vectors in Large Language Models
Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau.
The Twelfth International Conference on Learning Representations (ICLR), 2024.
Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau.
The Twelfth International Conference on Learning Representations (ICLR), 2024.
Automatic detection of instances of
focused crowd involvement at recreational events
Eric Todd, Mylan R. Cook, Katrina Pedersen, David S. Woolworth, Brooks A. Butler, Xin Zhao, Colt Liu, Kent L. Gee, Mark K. Transtrum, Sean Warnick.
Proceedings of Meetings on Acoustics 39, 2019.
Eric Todd, Mylan R. Cook, Katrina Pedersen, David S. Woolworth, Brooks A. Butler, Xin Zhao, Colt Liu, Kent L. Gee, Mark K. Transtrum, Sean Warnick.
Proceedings of Meetings on Acoustics 39, 2019.
Classifying crowd behavior at collegiate basketball games using acoustic data.
Brooks A. Butler, Katrina Pedersen, Mylan R. Cook, Spencer G. Wadsworth, Eric Todd, Dallen Stark, Kent L. Gee, Mark K. Transtrum, Sean Warnick.
Proceedings of Meetings on Acoustics 35, 2018.
Brooks A. Butler, Katrina Pedersen, Mylan R. Cook, Spencer G. Wadsworth, Eric Todd, Dallen Stark, Kent L. Gee, Mark K. Transtrum, Sean Warnick.
Proceedings of Meetings on Acoustics 35, 2018.
Journal Articles
Open Problems in Mechanistic Interpretability
Lee Sharkey, Bilal Chughtai, Joshua Batson, Jack Lindsey, Jeff Wu, Lucius Bushnaq, Nicholas Goldowsky-Dill, Stefan Heimersheim, Alejandro Ortega, Joseph Bloom, Stella Biderman, Adria Garriga-Alonso, Arthur Conmy, Neel Nanda, Jessica Rumbelow, Martin Wattenberg, Nandi Schoots, Joseph Miller, Eric J. Michaud, Stephen Casper, Max Tegmark, William Saunders, David Bau, Eric Todd, Atticus Geiger, Mor Geva, Jesse Hoogland, Daniel Murfet, Tom McGrath.
Transactions on Machine Learning Research (TMLR), 2025.
Lee Sharkey, Bilal Chughtai, Joshua Batson, Jack Lindsey, Jeff Wu, Lucius Bushnaq, Nicholas Goldowsky-Dill, Stefan Heimersheim, Alejandro Ortega, Joseph Bloom, Stella Biderman, Adria Garriga-Alonso, Arthur Conmy, Neel Nanda, Jessica Rumbelow, Martin Wattenberg, Nandi Schoots, Joseph Miller, Eric J. Michaud, Stephen Casper, Max Tegmark, William Saunders, David Bau, Eric Todd, Atticus Geiger, Mor Geva, Jesse Hoogland, Daniel Murfet, Tom McGrath.
Transactions on Machine Learning Research (TMLR), 2025.
The Quest for the Right Mediator: Surveying Mechanistic Interpretability for NLP Through the Lens of Causal Mediation Analysis
Aaron Mueller, Jannik Brinkmann, Millicent Li, Samuel Marks, Koyena Pal, Nikhil Prakash, Can Rager, Aruna Sankaranarayanan, Arnab Sen Sharma, Jiuding Sun, Eric Todd, David Bau, Yonatan Belinkov.
Computational Linguistics, 2025.
Aaron Mueller, Jannik Brinkmann, Millicent Li, Samuel Marks, Koyena Pal, Nikhil Prakash, Can Rager, Aruna Sankaranarayanan, Arnab Sen Sharma, Jiuding Sun, Eric Todd, David Bau, Yonatan Belinkov.
Computational Linguistics, 2025.