Preference Learning with Lie Detectors can Induce Honesty or Evasion

Abstract