AI Safety and Alignment
Channel: #safety-and-alignment
Co-leads:
Co-leads:
Alif - @biggmon on Discord
Sunitha - @prisca6117 on Discord
Goal:
Goal:
Each session covers different topics in safety and alignment. The goal is for you to come out of each session with at least a high-level understanding of what we discussed.
Topics include (but are not limited to) RLHF, inner & outer misalignment, goal mis-generalization, inverse RL, scalable oversight, interpretability.
Logistics:
Logistics:
Everyone is welcome to join! A basic ML background is sufficient.
We meet on Thursday at 10 am PST (on a bi-weekly basis).
Recent Recordings
AI SL.mp4
May 30, 2024
C4AI - AI Safety & Alignment (2024-05-02 10:07 GMT-7)
May 2, 2024
C4AI - AI Safety & Alignment (2024-04-18 10:06 GMT-7)
April 18, 2024
C4AI - AI Safety & Alignment (2024-05-30 10:05 GMT-7)
May 30, 2024
C4AI - AI Safety & Alignment (2024-03-07 10:22 GMT-8)
March 7, 2024
C4AI - AI Safety & Alignment (2024-04-04 10:05 GMT-7)