Tomek Korbak

PhD Student

Tomas previously worked at FAR.AI with Ethan Perez and Sam Bowman on aligning language models with human preferences.

NEWs & publications

No items found.
Inverse Scaling: When Bigger Isn't Better
June 15, 2023
inverse-scaling-when-bigger-isnt-better
Improving Code Generation by Training with Natural Language Feedback
March 28, 2023
improving-code-generation-by-training-with-natural-language-feedback
Training Language Models with Language Feedback at Scale
March 28, 2023
training-language-models-with-language-feedback-at-scale
Pretraining Language Models with Human Preferences
February 16, 2023
pretraining-language-models-with-human-preferences
RL with KL penalties is better viewed as Bayesian inference
August 8, 2022
rl-with-kl-penalties-is-better-viewed-as-bayesian-inference