Learning to Communicate with Deep Reinforcement Learning, Jakob Foerster, Yannis Assael, Nando de Freitas, Shimon Whiteson, 2016Advances in Neural Information Processing Systems 29 (NIPS 2016) (NeurIPS) - Presents DIAL, a framework for learning discrete communication protocols in MARL, using a differentiable communication channel for gradient flow during training.
TarMAC: Targeted Multi-Agent Communication, Abhishek Das, Samyak Parajuli, Souvik Bhattacharya, Akshat Rastogi, Stefano Ermon, Joshua Meier, Andrew M. Saxe, Dhruv Batra, Devi Parikh, 2019Proceedings of the 36th International Conference on Machine Learning (ICML), Vol. PMLR 97 (Proceedings of Machine Learning Research)DOI: 10.48550/arXiv.1906.01220 - Introduces attention mechanisms for multi-agent communication, allowing agents to selectively attend to relevant messages and partners to improve coordination.
Multi-Agent Reinforcement Learning: A Review of Foundational Concepts and Recent Trends, Peter Kairouz, H. Brendan McMahan, Brendan Avent, Aurélien Bellet, Mehdi Bennis, Arjun Nitin Bhagoji, Kallista Bonawitz, Zachary Charles, Graham Cormode, Rachel Cummings, Rafael G. L. D’Oliveira, Hubert Eichner, Salim El Rouayheb, David Evans, Josh Gardner, Zachary Garrett, Adrià Gascón, Badih Ghazi, Phillip B. Gibbons, Marco Gruteser, Zaid Harchaoui, Chaoyang He, Lie He, Zhouyuan Huo, Ben Hutchinson, Justin Hsu, Martin Jaggi, Tara Javidi, Gauri Joshi, Mikhail Khodak, Jakub Konecný, Aleksandra Korolova, Farinaz Koushanfar, Sanmi Koyejo, Tancrède Lepoint, Yang Liu, Prateek Mittal, Mehryar Mohri, Richard Nock, Ayfer Özgür, Rasmus Pagh, Hang Qi, Daniel Ramage, Ramesh Raskar, Mariana Raykova, Dawn Song, Weikang Song, Sebastian U. Stich, Ziteng Sun, Ananda Theertha Suresh, Florian Tramèr, Praneeth Vepakomma, Jianyu Wang, Li Xiong, Zheng Xu, Qiang Yang, Felix X. Yu, Han Yu and Sen Zhao, 2021Foundations and Trends® in Machine Learning, Vol. 14 (Now Publishers)DOI: 10.1561/2200000083 - A comprehensive review of MARL, covering foundational concepts, communication challenges, and various solution approaches, useful for a broader context.