Constitutional AI: Harmlessness from AI Feedback, Yuntao Bai, Saurav Kadavath, Sandipan Kundu, Amanda Askell, Jackson Kernion, Andy Jones, Anna Chen, Anna Goldie, Azalia Mirhoseini, Cameron McKinnon, Carol Chen, Catherine Olsson, Christopher Olah, Danny Hernandez, Dawn Drain, Deep Ganguli, Dustin Li, Eli Tran-Johnson, Ethan Perez, Jamie Kerr, Jared Mueller, Jeffrey Ladish, Joshua Landau, Kamal Ndousse, Kamile Lukosuite, Liane Lovitt, Michael Sellitto, Nelson Elhage, Nicholas Schiefer, Noemi Mercado, Nova DasSarma, Robert Lasenby, Robin Larson, Sam Ringer, Scott Johnston, Shauna Kravec, Sheer El Showk, Stanislav Fort, Tamera Lanham, Timothy Telleen-Lawton, Tom Conerly, Tom Henighan, Tristan Hume, Samuel R. Bowman, Zac Hatfield-Dodds, Ben Mann, Dario Amodei, Nicholas Joseph, Sam McCandlish, Tom Brown, Jared Kaplan, 2022arXiv preprint arXiv:2212.08073DOI: 10.48550/arXiv.2212.08073 - 描述了宪法式人工智能的原始方法,该方法构成了RLAIF的基础,详细说明了如何在没有人为标注的情况下使用AI反馈来使模型与人类价值观对齐。
ZeRO: Memory Optimizations Toward Training Trillion Parameter Models, Samyam Rajbhandari, Cong Guo, Eikan Lim, Sheng Li, Sam Ade Jacobs, Sam Davis, Saurabh Tiwary, Zhewei Yao, Minjia Zhang, Reza Yazdani, Elton Zheng, Jeff Huang, Deepay Roy, Yuxiong He, 2020SC20: International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE)DOI: 10.1109/SC41405.2020.00008 - 介绍了ZeRO优化器,这是一种内存高效的策略,通过在设备之间分区优化器状态、梯度和参数,对于扩展超大型模型的训练很有效。
Ray: A Distributed System for AI Applications, Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Gokalp, Eric Rosen, Joshua Rosen, Joseph E. Gonzalez, Ion Stoica, 2018Proceedings of the 7th ACM Symposium on Cloud Computing (ACM)DOI: 10.1145/3267809.3270319 - 描述了Ray,一个用于构建和运行分布式人工智能应用的开源框架,它提供了统一的分布式计算API,并被提及用于推理服务。