Bay Area Alignment Workshop

24-25 Oct 2024

The Bay Area Alignment Workshop was held on 24-25 Oct 2024 at Chaminade in Santa Cruz, featuring Anca Dragan on Optimised Misalignment. Participants also explored topics such as threat models, safety cases, monitoring and assurance, interpretability, robustness, and oversight.

Event Context

The Alignment Workshop series brings together top machine learning researchers and practitioners from industry, academia, and government. The workshops focus on discussing and debating critical topics in AI alignment, helping participants better understand the potential risks from advanced AI and strategies for addressing them. Key issues discussed include model evaluations, interpretability, robustness, and AI governance.

Bay Area Alignment Workshop sessions