ESCA: Contextualizing Embodied Agents via Scene-Graph Generation — arXiv2