"au:"Emanuele Bugliarello"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Emanuele Bugliarello"" — arXiv2 Search

Showing 1–20 of 28 results

/ Date/ Name

Jan 28, 2021The Role of Syntactic Planning in Compositional Image Captioning May 5, 2020It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information Oct 25, 2023On the Interplay between Fairness and Explainability Aug 22, 2023StoryBench: A Multifaceted Benchmark for Continuous Story Visualization Oct 24, 2022Multilingual Multimodal Learning with Machine Translated Text May 23, 2023Weakly-Supervised Learning of Visual Relations in Multimodal Pretraining Apr 22, 2022Mostra: A Flexible Balancing Framework to Trade-off User, Artist and Platform Objectives for Music Sequencing May 24, 2022Reassessing Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization Sep 28, 2021Visually Grounded Reasoning across Languages and Cultures May 30, 2019Matrix Completion in the Unit Hypercube via Structured Matrix Factorization Sep 6, 2019Enhancing Machine Translation with Dependency-Aware Self-Attention Jan 27, 2022IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages Mar 30, 2023A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision May 12, 2023Measuring Progress in Fine-grained Vision-and-Language Understanding Nov 30, 2020Multimodal Pretraining Unmasked: A Meta-Analysis and a Unified Framework of Vision-and-Language BERTs Sep 9, 2021Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers Mar 6, 2025What Are You Doing? A Closer Look at Controllable Human Video Generation Jun 9, 2022Ancestor-to-Creole Transfer is Not a Walk in the Park Sep 19, 2025Dynamic Classifier-Free Diffusion Guidance via Online Feedback Jul 14, 2022Language Modelling with Pixels