Shape-conditioned Image Generation by Learning Latent Appearance Representation from Unpaired Data — arXiv2