AI Safety & Alignment
Lecture Slides
Lecture Slides
Slides by instructor / visiting lecturers:
Slides by instructor / visiting lecturers:
TBD
Slides by students:
Slides by students:
** disclaimer: these slides were created by students and may contain errors/omissions/inaccuracies of various severity**
** disclaimer: these slides were created by students and may contain errors/omissions/inaccuracies of various severity**
Capabilities and scaling
Criticism on safety and alignment
RLHF
Constitutional AI
Understanding and aligning ethics
Adversarial attacks
Game theoretic approaches
Interpretability
Economic impacts of AGI
![](https://www.google.com/images/icons/product/drive-32.png)
![](https://www.google.com/images/icons/product/drive-32.png)
![](https://www.google.com/images/icons/product/drive-32.png)
![](https://www.google.com/images/icons/product/drive-32.png)
![](https://www.google.com/images/icons/product/drive-32.png)
![](https://www.google.com/images/icons/product/drive-32.png)
![](https://www.google.com/images/icons/product/drive-32.png)
![](https://www.google.com/images/icons/product/drive-32.png)
![](https://www.google.com/images/icons/product/drive-32.png)