Deep Multimodal Learning: A Survey, Yanan Wang, Jun Yuan, Tingting Lin, and Qing Liao, 2020, ACM Computing Surveys (CSUR), Vol. 53 (Association for Computing Machinery), DOI: 10.1145/3371994 - This survey reviews deep learning approaches in multimodal AI, detailing advanced techniques for multimodal representation, alignment, and fusion.
A Survey of Multimodal Datasets for Machine Learning, Devamanyu Hazarika, Soujanya Poria, Erik Cambria, and Louis-Philippe Morency, 2020, ACM Computing Surveys (CSUR), Vol. 53 (Association for Computing Machinery), DOI: 10.1145/3386616 - This survey addresses the data challenge in multimodal AI by cataloging and discussing numerous datasets essential for training and evaluating multimodal systems.