Oskar Hollinsworth

Member of Technical Staff

FAR.AI

Oskar Hollinsworth is a research engineer at FAR.AI, where he has worked on mitigating deception in LLMs, post-training infrastructure, and scaling laws for adversarial robustness. Previously, he studied how sentiment is represented in language models under Neel Nanda. Oskar began his career as an algorithmic trader at Susquehanna International Group in Dublin.

Layered AI Defenses Have Holes: Vulnerabilities and Key Recommendations
July 2, 2025
STACK: Adversarial Attacks on LLM Safeguard Pipelines
ClearHarm: A more challenging jailbreak dataset
June 23, 2025
Does Robustness Improve with Scale?
July 23, 2024
Exploring Scaling Trends in LLM Robustness
STACK: Adversarial Attacks on LLM Safeguard Pipelines
July 2, 2025
Layered AI Defenses Have Holes: Vulnerabilities and Key Recommendations
ClearHarm: A more challenging jailbreak dataset
June 23, 2025
Exploring Scaling Trends in LLM Robustness
July 26, 2024
Does Robustness Improve with Scale?
