/

Samuel Bowman

Alignment Research Lead

Anthropic

NEWs & publications

NEWs & publications

Open Problems in Mechanistic Interpretability

January 27, 2025

Inverse Scaling: When Bigger Isn't Better

June 15, 2023

Improving Code Generation by Training with Natural Language Feedback

March 28, 2023

Pretraining Language Models with Human Preferences

February 16, 2023

No items found.

Open Problems in Mechanistic Interpretability

January 27, 2025

open-problems-in-mechanistic-interpretability

Inverse Scaling: When Bigger Isn't Better

June 15, 2023

inverse-scaling-when-bigger-isnt-better

Improving Code Generation by Training with Natural Language Feedback

March 28, 2023

improving-code-generation-by-training-with-natural-language-feedback

Pretraining Language Models with Human Preferences

February 16, 2023

pretraining-language-models-with-human-preferences

publications:

Open Problems in Mechanistic Interpretability

January 27, 2025

Inverse Scaling: When Bigger Isn't Better

June 15, 2023

Improving Code Generation by Training with Natural Language Feedback

March 28, 2023

Pretraining Language Models with Human Preferences

February 16, 2023

Research

Our research explores a portfolio
of high-potential agendas.

Events

Our events bring together
global leaders in AI.

Programs

Our programs build the field of trustworthy and secure AI

Research

Our research explores a portfolio
of high-potential agendas.

Events

Our events bring together
global leaders in AI.

Programs

Our programs build the field of trustworthy and secure AI

Subscribe to our newsletter

Organization

About Team Programs News

Events

All Events Alignment Workshops Specialized Workshops All Event Recordings

Research

All Publications Research Overview

Robustness & Security

Red-Teaming & Evaluation

Get involved

Careers Contact Donate Newsletter

Financial Reports / 990s Privacy Policy Terms of Service

Cookies Notice: This website uses cookies to identify pages that are being used most frequently. This helps us analyze web page traffic and improve our website. We do not and will never sell user data. Read more about our cookie policy on our privacy policy. Please contact us if you have any questions.

© 2025 FAR AI, Inc.

Website by ODW