Stephen Casper

MIT

Layered AI Defenses Have Holes: Vulnerabilities and Key Recommendations
July 2, 2025
TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering
February 6, 2026
Open Technical Problems in Open-Weight AI Model Risk Management
October 1, 2025
STACK: Adversarial Attacks on LLM Safeguard Pipelines
July 2, 2025