基于模型的强化学习的理由

这部分内容有帮助吗？

参考文献

Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (The MIT Press) - 这本权威教材涵盖了强化学习的基础概念，包括基于模型的方法、规划和样本效率的概念。
Integrated Architectures for Learning, Planning, and Reacting Based on Automatically Constructed Predictive Models, Richard S. Sutton, 1990 SIGART Bulletin, Vol. 1 (ACM) DOI: 10.1145/104523.104528 - 这篇奠基性论文介绍了Dyna架构，阐明了学习环境模型如何通过生成模拟经验来更新策略，从而提高样本效率。