Showing 1–20 of 99 results
/ Date/ Name
Dec 31, 2020Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuningSep 13, 2021UniMS: A Unified Framework for Multimodal Summarization with Knowledge DistillationNov 16, 2021CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and DiagnosisAug 31, 2019NEZHA: Neural Contextualized Representation for Chinese Language UnderstandingMar 10, 2022Compilable Neural Code Generation with Compiler FeedbackFeb 26, 2024UniRetriever: Multi-task Candidates Selection for Various Context-Adaptive Conversational RetrievalDec 21, 2022MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral DiscussionsJul 1, 2024SINKT: A Structure-Aware Inductive Knowledge Tracing Model with Large Language ModelJan 17, 2025A Survey on Multi-Turn Interaction Capabilities of Large Language ModelsApr 10, 2025Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUsMay 21, 2025InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented GenerationJul 3, 2024CoIR: A Comprehensive Benchmark for Code Information Retrieval ModelsAug 17, 2025Fast, Slow, and Tool-augmented Thinking for LLMs: A ReviewFeb 11, 2026RePO: Bridging On-Policy Learning and Off-Policy Knowledge through Rephrasing Policy OptimizationFeb 17, 2025Teaching LLMs According to Their Aptitude: Adaptive Reasoning for Mathematical Problem SolvingDec 18, 2019Multi-channel Reverse Dictionary ModelDec 31, 2020Unified Mandarin TTS Front-end Based on Distilled BERT ModelJan 27, 2022Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval GenerationMay 21, 2022Revisiting Pre-trained Language Models and their Evaluation for Arabic Natural Language UnderstandingMay 4, 2022CODE-MVP: Learning to Represent Source Code from Multiple Views with Contrastive Pre-Training