R2SM: Referring and Reasoning for Selective Masks — arXiv2