Stephen Casper

MIT

NEWs & publications

February 6, 2026

October 1, 2025

July 2, 2025

Layered AI Defenses Have Holes: Vulnerabilities and Key Recommendations

July 2, 2025

defense-in-depth

STACK: Adversarial Attacks on LLM Safeguard Pipelines

stack-adversarial-attacks-on-llm-safeguard-pipelines

TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering

February 6, 2026

tamperbench-systematically-stress-testing-llm-safety-under-fine-tuning-and-tampering

Open Technical Problems in Open-Weight AI Model Risk Management

October 1, 2025

open-technical-problems-in-open-weight-ai-model-risk-management

STACK: Adversarial Attacks on LLM Safeguard Pipelines

July 2, 2025

stack-adversarial-attacks-on-llm-safeguard-pipelines

Layered AI Defenses Have Holes: Vulnerabilities and Key Recommendations

defense-in-depth

February 6, 2026

October 1, 2025

Our research explores a portfolio
of high-potential agendas.

Our events bring together
global leaders in AI.

Our programs build the field of trustworthy and secure AI

Our research explores a portfolio
of high-potential agendas.

Our events bring together
global leaders in AI.

Our programs build the field of trustworthy and secure AI