Deep Reinforcement Learning from Human Preferences, Paul F Christiano, Jan Leike, Tom B Brown, Miljan Martic, Shane Legg, Dario Amodei, 2017Advances in Neural Information Processing Systems, Vol. 30 (Curran Associates, Inc.) - A foundational work introducing the method of training reinforcement learning agents directly from human preference comparisons, laying the groundwork for reward modeling.