"au:"Xiaojun Meng"" — arXiv2 Search

/ Date/ Name

/ Date/ Name

"au:"Xiaojun Meng"" — arXiv2 Search

Showing 1–20 of 31 results

/ Date/ Name

Feb 14, 2022Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark Nov 26, 2022Lexicon-injected Semantic Parsing for Task-Oriented Dialog Sep 13, 2021UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation Mar 15, 2024HawkEye: Training Video-Text LLMs for Grounding Text in Videos Jan 23, 2025ReasVQA: Advancing VideoQA with Imperfect Reasoning Process Dec 23, 2024Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding Apr 10, 2025Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Jul 12, 2025ProactiveVideoQA: A Comprehensive Benchmark Evaluating Proactive Interactions in Video Large Language Models Oct 23, 2025Why Did Apple Fall: Evaluating Curiosity in Large Language Models Sep 19, 2020Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations Mar 8, 2022HyperPELT: Unified Parameter-Efficient Language Model Tuning for Both Language and Vision-and-Language Tasks Jul 21, 2024End-to-End Video Question Answering with Frame Scoring Mechanisms and Adaptive Sampling Nov 27, 2024VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format Dec 12, 2023Unsupervised Extractive Summarization with Learnable Length Control Strategies Mar 14, 2022Sememe Prediction for BabelNet Synsets using Multilingual and Multimodal Information Jun 12, 2024Prompt-Based Length Controlled Generation with Multiple Control Types May 27, 2025Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity May 7, 2025Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs May 8, 2023Learning Summary-Worthy Visual Representation for Abstractive Summarization in Video Nov 4, 2024Sparsing Law: Towards Large Language Models with Greater Activation Sparsity