[Fireside Chat] White-box Methods for AI Control

Summary

Neel Nanda, Joshua Clymer

SESSION Transcript