AI Safety in a World of Vulnerable Machine Learning Systems
All contemporary machine learning systems are vulnerable to adversarial attack. This poses serious problems for existing alignment proposals. We explore these issues and propose several research directions FAR is pursuing to overcome this challenge.
Last updated on Mar 7, 2023