Media Summary: MIT 6.S897 Machine Learning for Healthcare, Spring 2019 Instructor: Peter Szolovits View the complete course: ... How can we reverse engineer what a neural network is doing? In this IASEAI ' This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

25 Interpretability - Detailed Analysis & Overview

MIT 6.S897 Machine Learning for Healthcare, Spring 2019 Instructor: Peter Szolovits View the complete course: ... How can we reverse engineer what a neural network is doing? In this IASEAI ' This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? Quantitative Testing with Concept Activation Vectors (TCAV) Been Kim, Senior Research Scientist, Google Brain Presented at ... Stanford AI Lab Faculty Lunch, November 7, 2025. Updated version of 0:59 ... A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...

Today Lee Sharkey of Goodfire joins The Cognitive Revolution to discuss his research on parameter decomposition methods that ... Forough Poursabzi, Researcher, Microsoft Research Presented at MLconf 2018 Abstract: Machine learning is increasingly used to ... Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...

Photo Gallery

25. Interpretability
Lecture 25: Interpretability
An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025
What Matters Right Now In Mechanistic Interpretability?
Interpretability Beyond Feature Attribution
A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google
Assessing skeptical views of interpretability research
[XHRI 2025] Interpretability Analysis of Symbolic Representations for SDM Systems
What is interpretability?
Untangling Neural Network Mechanisms: Goodfire's Lee Sharkey on Parameter-based Interpretability
Manipulating and Measuring Model Interpretability
A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)
View Detailed Profile
25. Interpretability

25. Interpretability

MIT 6.S897 Machine Learning for Healthcare, Spring 2019 Instructor: Peter Szolovits View the complete course: ...

Lecture 25: Interpretability

Lecture 25: Interpretability

Machine Learning for Healthcare #MachineLearning #ArtificialIntelligence #AI #ML #DataScience #HealthcareAI #AIinHealthcare ...

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

An Introduction to Mechanistic Interpretability – Neel Nanda | IASEAI 2025

How can we reverse engineer what a neural network is doing? In this IASEAI '

What Matters Right Now In Mechanistic Interpretability?

What Matters Right Now In Mechanistic Interpretability?

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed?

Interpretability Beyond Feature Attribution

Interpretability Beyond Feature Attribution

Quantitative Testing with Concept Activation Vectors (TCAV) Been Kim, Senior Research Scientist, Google Brain Presented at ...

A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google

A Roadmap for the Rigorous Science of Interpretability | Finale Doshi-Velez | Talks at Google

With a growing interest in

Assessing skeptical views of interpretability research

Assessing skeptical views of interpretability research

Stanford AI Lab Faculty Lunch, November 7, 2025. Updated version of https://web.stanford.edu/~cgpotts/blog/interp/ 0:59 ...

[XHRI 2025] Interpretability Analysis of Symbolic Representations for SDM Systems

[XHRI 2025] Interpretability Analysis of Symbolic Representations for SDM Systems

Interpretability

What is interpretability?

What is interpretability?

A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ...

Untangling Neural Network Mechanisms: Goodfire's Lee Sharkey on Parameter-based Interpretability

Untangling Neural Network Mechanisms: Goodfire's Lee Sharkey on Parameter-based Interpretability

Today Lee Sharkey of Goodfire joins The Cognitive Revolution to discuss his research on parameter decomposition methods that ...

Manipulating and Measuring Model Interpretability

Manipulating and Measuring Model Interpretability

Forough Poursabzi, Researcher, Microsoft Research Presented at MLconf 2018 Abstract: Machine learning is increasingly used to ...

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

A Walkthrough of Progress Measures for Grokking via Mechanistic Interpretability: What? (Part 1/3)

Part 1 of a walkthrough of our paper, Progress Measures for Grokking via Mechanistic

The Dark Matter of AI [Mechanistic Interpretability]

The Dark Matter of AI [Mechanistic Interpretability]

Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...