Tom Tseng

Member of Technical Staff

FAR.AI

Tom Tseng is a research engineer at FAR.AI. Tom previously worked as a software engineer at Gather and Cruise. He has a master’s degree from MIT and a bachelor’s degree from Carnegie Mellon University.

NEWs & publications

Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution

February 19, 2026

TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering

February 6, 2026

Layered AI Defenses Have Holes: Vulnerabilities and Key Recommendations

July 2, 2025

ClearHarm: A more challenging jailbreak dataset

June 23, 2025

Does Robustness Improve with Scale?

July 23, 2024

Beyond the Board: Exploring AI Robustness Through Go

June 18, 2024

Even Superhuman Go AIs Have Surprising Failure Modes

July 15, 2023

Inverse Scaling: When Bigger Isn't Better

June 15, 2023

Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution

February 19, 2026

concept-data-attribution-02-2026

Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution

concept-influence-leveraging-interpretability-to-improve-performance-and-efficiency-in-training-data-attribution

Layered AI Defenses Have Holes: Vulnerabilities and Key Recommendations

July 2, 2025

defense-in-depth

STACK: Adversarial Attacks on LLM Safeguard Pipelines

stack-adversarial-attacks-on-llm-safeguard-pipelines

ClearHarm: A more challenging jailbreak dataset

June 23, 2025

clearharm-a-more-challenging-jailbreak-dataset

ClearHarm: A more challenging jailbreak dataset

clearharm-a-more-challenging-jailbreak-dataset

Does Robustness Improve with Scale?

July 23, 2024

does-robustness-improve-with-scale

Exploring Scaling Trends in LLM Robustness

exploring-scaling-trends-in-llm-robustness

Beyond the Board: Exploring AI Robustness Through Go

June 18, 2024

beyond-the-board-exploring-ai-robustness-through-go

Can Go AIs be adversarially robust?

can-go-ais-be-adversarially-robust

Even Superhuman Go AIs Have Surprising Failure Modes

July 15, 2023

even-superhuman-go-ais-have-surprising-failure-modes

Adversarial Policies Beat Superhuman Go AIs

adversarial-policies-beat-superhuman-go-ais

TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering

February 6, 2026

tamperbench-systematically-stress-testing-llm-safety-under-fine-tuning-and-tampering

Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution

February 19, 2026

concept-influence-leveraging-interpretability-to-improve-performance-and-efficiency-in-training-data-attribution

Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution

concept-data-attribution-02-2026

STACK: Adversarial Attacks on LLM Safeguard Pipelines

July 2, 2025

stack-adversarial-attacks-on-llm-safeguard-pipelines

Layered AI Defenses Have Holes: Vulnerabilities and Key Recommendations

defense-in-depth

ClearHarm: A more challenging jailbreak dataset

June 23, 2025

clearharm-a-more-challenging-jailbreak-dataset

ClearHarm: A more challenging jailbreak dataset

clearharm-a-more-challenging-jailbreak-dataset

Exploring Scaling Trends in LLM Robustness

July 26, 2024

exploring-scaling-trends-in-llm-robustness

Does Robustness Improve with Scale?

does-robustness-improve-with-scale

Can Go AIs be adversarially robust?

June 18, 2024

can-go-ais-be-adversarially-robust

Even Superhuman Go AIs Have Surprising Failure Modes

even-superhuman-go-ais-have-surprising-failure-modes

Inverse Scaling: When Bigger Isn't Better

June 15, 2023

inverse-scaling-when-bigger-isnt-better

Adversarial Policies Beat Superhuman Go AIs

January 9, 2023

adversarial-policies-beat-superhuman-go-ais

Beyond the Board: Exploring AI Robustness Through Go

beyond-the-board-exploring-ai-robustness-through-go

publications:

TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering

February 6, 2026

Inverse Scaling: When Bigger Isn't Better

June 15, 2023

Research

Our research explores a portfolio
of high-potential agendas.

Events

Our events bring together
global leaders in AI.

Programs

Our programs build the field of trustworthy and secure AI

Research

Our research explores a portfolio
of high-potential agendas.

Events

Our events bring together
global leaders in AI.

Programs

Our programs build the field of trustworthy and secure AI