Jacob Steinhardt is an Assistant Professor in Statistics at UC Berkeley. His work focuses on robustness, reward specification and scalable alignment of machine learning (ML) systems.
News & Publications
Eliciting Latent Predictions from Transformers with the Tuned Lens