"au:"Weizhen Qi"" — arXiv2 Search

Showing 1–16 of 16 results

Oct 21, 2020 | ProphetNet-Ads: A Looking Ahead Strategy for Generative Retrieval Models in Sponsored Search Engine
Dec 31, 2020 | BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale Pretraining
Apr 16, 2021 | ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code Generation
May 23, 2022 | A Self-Paced Mixed Distillation Method for Non-Autoregressive Generation
Jan 13, 2020 | ProphetNet: Predicting Future N-gram for Sequence-to-Sequence Pre-training
Jan 26, 2026 | Yunjue Agent Tech Report: A Fully Reproducible, Zero-Start In-Situ Self-Evolving Agent System for Open-Ended Tasks
Apr 3, 2020 | XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation
May 26, 2021 | Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
May 11, 2021 | EL-Attention: Memory Efficient Lossless Attention for Generation
Jan 26, 2022 | CodeRetriever: Unimodal and Bimodal Contrastive Learning for Code Search
May 23, 2025 | Not All Tokens Are What You Need In Thinking
Oct 21, 2022 | Metric-guided Distillation: Distilling Knowledge from the Metric to Ranker and Retriever for Generative Commonsense Reasoning
Mar 8, 2023 | Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models
Nov 24, 2020 | GLGE: A New General Language Generation Evaluation Benchmark
May 6, 2025 | Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models
Apr 27, 2022 | DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation