World Models, David Ha, Jürgen Schmidhuber, 2018. arXiv preprint arXiv:1803.10122. DOI: 10.48550/arXiv.1803.10122 - Presents a generative model that learns a compressed spatial and temporal representation of its environment using a VAE and a recurrent network, so that a compact controller for reinforcement learning can be trained efficiently in the latent space.