Conference Publications
The Dual-Route Model of Induction
Sheridan Feucht, Eric Todd, Byron C. Wallace, David Bau.
The Second Conference on Language Modeling. (COLM 2025)
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals
Jaden Fiotto-Kaufman, Alexander R. Loftus, Eric Todd, Jannik Brinkmann, Koyena Pal, Dmitrii Troitskii, Michael Ripa, Adam Belfki, Can Rager, Caden Juang, Aaron Mueller, Samuel Marks, Arnab Sen Sharma, Francesca Lucchetti, Nikhil Prakash, Carla Brodley, Arjun Guha, Jonathan Bell, Byron C. Wallace, David Bau.
The Thirteenth International Conference on Learning Representations. (ICLR 2025)
Function Vectors in Large Language Models
Eric Todd, Millicent L. Li, Arnab Sen Sharma, Aaron Mueller, Byron C. Wallace, David Bau.
The Twelfth International Conference on Learning Representations. (ICLR 2024)
Automatic detection of instances of focused crowd involvement at recreational events
Eric Todd, Mylan R. Cook, Katrina Pedersen, David S. Woolworth, Brooks A. Butler, Xin Zhao, Colt Liu, Kent L. Gee, Mark K. Transtrum, Sean Warnick.
Proceedings of Meetings on Acoustics 39 (1) (2019)
Classifying crowd behavior at collegiate basketball games using acoustic data
Brooks A. Butler, Katrina Pedersen, Mylan R. Cook, Spencer G. Wadsworth, Eric Todd, Dallen Stark, Kent L. Gee, Mark K. Transtrum, Sean Warnick.
Proceedings of Meetings on Acoustics 35 (1) (2018)
Preprints and In Submission
Open Problems in Mechanistic Interpretability
Lee Sharkey, Bilal Chughtai, Joshua Batson, Jack Lindsey, Jeff Wu, Lucius Bushnaq, Nicholas Goldowsky-Dill, Stefan Heimersheim, Alejandro Ortega, Joseph Bloom, Stella Biderman, Adria Garriga-Alonso, Arthur Conmy, Neel Nanda, Jessica Rumbelow, Martin Wattenberg, Nandi Schoots, Joseph Miller, Eric J. Michaud, Stephen Casper, Max Tegmark, William Saunders, David Bau, Eric Todd, Atticus Geiger, Mor Geva, Jesse Hoogland, Daniel Murfet, Tom McGrath.
arXiv 2025.
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability
Aaron Mueller, Jannik Brinkmann, Millicent Li, Samuel Marks, Koyena Pal, Nikhil Prakash, Can Rager, Aruna Sankaranarayanan, Arnab Sen Sharma, Jiuding Sun, Eric Todd, David Bau, Yonatan Belinkov.
arXiv 2024.