Multi-Agent Deep Deterministic Policy Gradient (MADDPG)
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments, Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, Igor Mordatch, 2017. Advances in Neural Information Processing Systems (NeurIPS). DOI: 10.48550/arXiv.1706.02275 - The original publication that introduced the MADDPG algorithm, detailing its architecture and the centralized training with decentralized execution framework for multi-agent settings.
Continuous Control with Deep Reinforcement Learning, Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra, 2016. International Conference on Learning Representations (ICLR). DOI: 10.48550/arXiv.1509.02971 - Presents the Deep Deterministic Policy Gradient (DDPG) algorithm, the single-agent actor-critic method on which MADDPG is built.