"au:"Alexander Ku"" — arXiv2 Search

Showing 1–17 of 17 results

Oct 15, 2020 Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding
Mar 23, 2021 PanGEA: The Panoramic Graph Environment Annotation Toolkit
Feb 15, 2018 Image Transformer
Mar 17, 2025 Levels of Analysis for Large Language Models
May 14, 2025 An evolutionary perspective on modes of learning in Transformers
May 29, 2019 Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation
Oct 9, 2021 Vector-quantized Image Modeling with Improved VQGAN
Aug 9, 2019 Transferable Representation Learning in Vision-and-Language Navigation
Dec 27, 2023 Prompt Expansion for Adaptive Text-to-Image Generation
Jan 26, 2021 On the Evaluation of Vision-and-Language Navigation Instructions
Jun 22, 2022 Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Apr 30, 2024 DOCCI: Descriptions of Connected and Contrasting Images
Oct 31, 2024 Understanding the Limits of Vision Language Models Through the Lens of the Binding Problem
Oct 6, 2022 A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning
May 29, 2023 Gaussian Process Probes (GPP) for Uncertainty-Aware Probing
Jul 11, 2019 General Evaluation for Instruction Conditioned Navigation using Dynamic Time Warping
May 19, 2018 Capturing human category representations by sampling in deep feature spaces