AI Safety in a World of Vulnerable Machine Learning Systems

All contemporary machine learning systems are vulnerable to adversarial attack. This poses serious problems for existing alignment proposals. We explore these issues and propose several research directions FAR.AI is pursuing to overcome this challenge.

Last updated on Oct 7, 2024