Probing Few-Shot Generalization with Attributes — arXiv2