Notes on Lecture Slides
You can access the lecture slides for this course on Canvas by navigating to the "Files" tab. These slides are currently only available to enrolled students, but we plan to release them to the public once the course has concluded.
Tentative schedule & reading
Week 1
3/31: Introduction
Reading: No reading
4/2: Background
Week 2
4/7: Removal-based explanations 1
4/9: Removal-based explanations 2
Week 3
4/14: Shapley values 1
4/16: Shapley values 2
Week 4
4/21: Propagation and gradient-based explanations 1
4/23: Propagation and gradient-based explanations 2 + Representation explainability
Week 5
4/28: Amortized optimization
4/30: Evaluating explanation methods
Week 6: Inherently interpretable models
5/5: Inherently interpretable models 1
5/7: Inherently interpretable models 2
Week 7
5/12: Concept-based explanations
Reading: Concept Bottleneck Models, (Optional) Feature Visualization
5/14: Sparse autoencoders
Week 8
5/19: Instance explanations 1
5/21: Counterfactual explanations
Week 9:
5/26: Hima Lakkaraju's Guest Lecture: Explainable AI for Real-World Decisions & Engineering Systems: Algorithmic Foundations and Practical Considerations
5/28: LLM explainability 1
Week 10
6/2: LLM explainability 2
6/4: XAI in practice II –Model improvement, applications to healthcare