Overcoming catastrophic forgetting in neural networks, James Kirkpatrick, Razvan Pascanu, Gabriel Jimenez Rezende, Adria Puigdomenech Badia, Oriol Vinyals, Fabio Hubert, Zachary Li, Peter Battaglia, Laurent Sifre, Evan Zoph, Martin Reichstein, Dean Hassabis, Iordanis Antonoglou, Charles Blundell, 2017Proceedings of the National Academy of Sciences, Vol. 114 (National Academy of Sciences)DOI: 10.1073/pnas.1611835114 - 介绍了弹性权重整合(EWC),一种通过识别并保护对先前任务重要的参数来减轻灾难性遗忘的正则化方法。
Parameter-Efficient Transfer Learning for NLP, Neil Houlsby, Andrei Giurgiu, Stanislaw Jastrzebski, Bruna Morrone, Quentin De Laroussilhe, Andrea Gesmundo, Mona Attariyan, Sylvain Gelly, 2019Proceedings of the 36th International Conference on Machine Learning (ICML), Vol. 97 (PMLR)DOI: 10.48550/arXiv.1902.00751 - 介绍了适配器模块,这是一种参数高效的微调方法,可应用于参数隔离以减轻灾难性遗忘。