Punya Syon Pandey

News & Publications

Accidental Misalignment: Fine-Tuning Language Models Induces Unexpected Vulnerability
May 22, 2025