Visual Question Answering: Interacting with Images Through Questions
Was this section helpful?
VQA: Visual Question Answering, Aishwarya Agrawal, Jiasen Lu, Stanislaw Antol, Margaret Mitchell, C. Lawrence Zitnick, Dhruv Batra, Devi Parikh, 2015International Conference on Computer Vision (ICCV)DOI: 10.48550/arXiv.1505.00468 - Introduces the Visual Question Answering (VQA) task and the first large-scale VQA dataset, establishing a benchmark for the field.