Showing 1–20 of 39 results
Dec 23, 2021 · ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Dec 31, 2020 · ERNIE-M: Enhanced Multilingual Representation by Aligning Cross-lingual Semantics with Monolingual Corpora
Jul 5, 2021 · ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Dec 31, 2020 · ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
Oct 7, 2020 · Galileo at SemEval-2020 Task 12: Multi-lingual Learning for Offensive Language Identification using Pre-trained Language Models
Feb 9, 2023 · ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
Dec 7, 2024 · Mixture of Hidden-Dimensions Transformer
Dec 3, 2025 · V-ITI: Mitigating Hallucinations in Multimodal Large Language Models via Visual Inference-Time Intervention
Sep 26, 2025 · Elastic MoE: Unlocking the Inference-Time Scalability of Mixture-of-Experts
Nov 27, 2022 · X-PuDu at SemEval-2022 Task 7: A Replaced Token Detection Task Pre-trained Model with Pattern-aware Ensembling for Identifying Plausible Clarifications
Nov 7, 2022 · ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech
Aug 7, 2024 · NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time
Oct 3, 2024 · MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Mar 25, 2026 · Sparse Growing Transformer: Training-Time Sparse Depth Allocation via Progressive Attention Looping
Jul 29, 2019 · ERNIE 2.0: A Continual Pre-training Framework for Language Understanding
Nov 30, 2022 · X-PuDu at SemEval-2022 Task 6: Multilingual Learning for English and Arabic Sarcasm Detection
Dec 13, 2022 · ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
Apr 29, 2024 · HFT: Half Fine-Tuning for Large Language Models
Oct 2, 2024 · Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
Apr 16, 2024 · Autoregressive Pre-Training on Pixels and Texts