AI Safety in a World of Vulnerable Machine Learning Systems
All contemporary machine learning systems are vulnerable to adversarial attack. This poses serious problems for existing alignment proposals. We explore these issues and propose several research directions FAR is pursuing to overcome this challenge.
Adam Gleave
Last updated on Feb 9, 2024
43 min read
News, Agenda