AI Safety and Alignment

Channel: #safety-and-alignment

Each session covers different topics in safety and alignment. The goal is for you to come out of each session with at least a high-level understanding of what we discussed.
Topics include (but are not limited to) RLHF, inner and outer misalignment, goal misgeneralization, inverse RL, scalable oversight, and interpretability.

AI Alignment Cohort

Together with the BIRDS group, Safety & Alignment is organizing the ARENA Cohort.

Find materials for the AI Alignment Cohort

Recent Recordings

AI SL.mp4

May 30, 2024

C4AI - AI Safety & Alignment (2024-05-02 10:07 GMT-7)

May 2, 2024

C4AI - AI Safety & Alignment (2024-04-18 10:06 GMT-7)

April 18, 2024

C4AI - AI Safety & Alignment (2024-05-30 10:05 GMT-7)

May 30, 2024

C4AI - AI Safety & Alignment (2024-03-07 10:22 GMT-8)

March 7, 2024

C4AI - AI Safety & Alignment (2024-04-04 10:05 GMT-7)
