Claudia Shi

PhD Candidate | Columbia University

Claudia Shi is a Ph.D. student in Computer Science at Columbia University, advised by David Blei. She is broadly interested in using insights from the causality and machine learning literature to approach AI alignment problems. Currently, she is working on making language models produce truthful and honest responses.

News & publications

Evaluating LLM Responses to Moral Scenarios
March 25, 2024

Evaluating the Moral Beliefs Encoded in LLMs
July 26, 2023
