Towards Diverse and Natural Image Descriptions via a Conditional GAN — arXiv2