VQA: Visual Question Answering Agrawal et al. A model that takes an image and a free-form, open-ended natural language question about the image and outputs a natural-language answer. contribute Yin and Yang: Balancing and Answering Binary Visual Questions Zhang et al. Addresses VQA by convertin...