Challenge: The Non-Stationarity Problem

References

Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments, Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel, Igor Mordatch, 2017, Advances in Neural Information Processing Systems, Vol. 30 (Curran Associates, Inc.) - Introduces the MADDPG algorithm, which addresses non-stationarity in multi-agent systems through centralized training with decentralized execution.
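Reinforcement Learning: An Introduction (2nd Edition), Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - Provides the single-agent reinforcement learning foundations, including the treatment of nonstationary problems (Section 2.5), that help explain how interaction among learning agents gives rise to non-stationarity.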