San Diego Alignment Workshop
San Diego, California
1–2 December, 2025

Program Committee
San Diego Alignment Workshop
San Diego Alignment Workshop took place 1–2 December, 2025, immediately prior to the start of NeurIPS 2025 serving as a successor to the Alignment Workshops held in Singapore, San Francisco Bay Area, Vienna, New Orleans and San Francisco.
The program featured expert presentations, breakout discussions, and opportunities for one-on-one conversations.
The Alignment Workshop series brings together top machine learning researchers and practitioners from industry, academia, and government. The workshop focuses on discussing and debating critical topics related to AI alignment, enabling participants to better understand potential risks from advanced AI, and strategies for solving them. Key issues discussed include model evaluations, interpretability, robustness, and AI governance.










































