Showing 1–20 of 84 results
/ Date/ Name
Oct 13, 2020Incorporating BERT into Parallel Sequence Decoding with AdaptersApr 16, 2021Towards Variable-Length Textual Adversarial AttacksAug 31, 2021Task-Oriented Dialogue System as Natural Language GenerationApr 13, 2022Efficient Cluster-Based k-Nearest-Neighbor Machine TranslationApr 9, 2022PSP: Pre-trained Soft Prompts for Few-Shot Abstractive SummarizationDec 3, 2019Cross-lingual Pre-training Based Transfer for Zero-shot Neural Machine TranslationFeb 18, 2023RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment RobustnessDec 18, 2023"Knowing When You Don't Know": A Multilingual Relevance Assessment Dataset for Robust Retrieval-Augmented GenerationJan 15, 2025Rethinking Post-Training Quantization: Introducing a Statistical Pre-Calibration ApproachFeb 27, 2025R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning LearningDec 7, 2024Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache CompressionMar 28, 2025Resona: Improving Context Copying in Linear Recurrence Models with RetrievalDec 2, 2024Adapting Large Language Models to Log Analysis with Interpretable Domain KnowledgeApr 17, 2026C-Mining: Unsupervised Discovery of Seeds for Cultural Data Synthesis via Geometric MisalignmentJul 6, 2020Bilingual Dictionary Based Neural Machine Translation without Using Parallel SentencesOct 26, 2020Exploiting Neural Query Translation into Cross Lingual Information RetrievalJul 25, 2018"Bilingual Expert" Can Find Translation ErrorsMar 24, 2023Mathematical Challenges in Deep LearningJan 15, 2024On the importance of Data Scale in Pretraining Arabic Language ModelsOct 18, 2022Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation