Answer Questions with Right Image Regions: A Visual Attention Regularization Approach — arXiv2