Showing 1–16 of 16 results
/ Date/ Name
Oct 21, 2020ProphetNet-Ads: A Looking Ahead Strategy for Generative Retrieval Models in Sponsored Search EngineDec 31, 2020BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale PretrainingApr 16, 2021ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code GenerationMay 23, 2022A Self-Paced Mixed Distillation Method for Non-Autoregressive GenerationJan 13, 2020ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-trainingJan 26, 2026Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended TasksApr 3, 2020XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and GenerationMay 26, 2021Improving Sign Language Translation with Monolingual Data by Sign Back-TranslationMay 11, 2021EL-Attention: Memory Efficient Lossless Attention for GenerationJan 26, 2022CodeRetriever: Unimodal and Bimodal Contrastive Learning for Code SearchMay 23, 2025Not All Tokens Are What You Need In ThinkingOct 21, 2022Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense ReasoningMar 8, 2023Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation ModelsNov 24, 2020GLGE: A New General Language Generation Evaluation BenchmarkMay 6, 2025Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language ModelsApr 27, 2022DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation