Showing 1–20 of 24 results
Dec 31, 2020 - XLM-T: Scaling up Multilingual Machine Translation with Pretrained Cross-lingual Transformer Encoders
Feb 18, 2023 - How Good Are GPT Models at Machine Translation? A Comprehensive Evaluation
Jun 25, 2021 - DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders
Apr 28, 2023 - ResiDual: Transformer with Dual Residual Connections
Nov 3, 2015 - Detecting Interrogative Utterances with Recurrent Neural Networks
Oct 24, 2023 - Dissecting In-Context Learning of Translations in GPTs
May 26, 2023 - Do GPTs Produce Less Literal Translations?
Jun 30, 2022 - Building Multilingual Machine Translation Systems That Serve Arbitrary X-Y Translations
Sep 22, 2021 - Scalable and Efficient MoE Training for Multitask Multilingual Models
Nov 3, 2021 - Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task
Oct 3, 2023 - Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness
Aug 16, 2023 - FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs
Aug 21, 2022 - Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
May 28, 2022 - Gating Dropout: Communication-efficient Regularization for Sparsely Activated Transformers
Mar 1, 2025 - Efficiently Editing Mixture-of-Experts Models with Compressed Experts
Aug 30, 2023 - Task-Based MoE for Multitask Multilingual Machine Translation
Nov 18, 2022 - Who Says Elephants Can't Run: Bringing Large Scale MoE Models into Cloud Scale Production
Sep 20, 2023 - A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models
Oct 26, 2020 - FastFormers: Highly Efficient Transformer Models for Natural Language Understanding
Oct 6, 2020 - Multi-task Learning for Multilingual Neural Machine Translation