Date / Name
Sep 7, 2021 / Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression
Jan 2, 2021 / Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting
Apr 15, 2022 / Text Revision by On-the-Fly Representation Optimization
Jan 15, 2024 / Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding
Jul 8, 2024 / Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning
Jun 17, 2024 / Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning
Sep 13, 2019 / Sequence-to-sequence Pre-training with Data Augmentation for Sentence Rewriting
Jan 26, 2022 / A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model
Sep 29, 2023 / SCALE: Synergized Collaboration of Asymmetric Language Translation Engines
Jul 22, 2024 / Refining Corpora from a Model Calibration Perspective for Chinese Spelling Correction
Feb 2, 2024 / K-Level Reasoning: Establishing Higher Order Beliefs in Large Language Models for Strategic Reasoning
Jan 26, 2025 / OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas
Oct 13, 2025 / DocReward: A Document Reward Model for Structuring and Stylizing
Oct 17, 2024 / Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers
Apr 10, 2023 / Inference with Reference: Lossless Acceleration of Large Language Models
Jul 11, 2023 / Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration
Mar 15, 2019 / Formality Style Transfer with Hybrid Textual Annotations
Feb 7, 2020 / BERT-of-Theseus: Compressing BERT by Progressive Module Replacing
Apr 6, 2021 / Blow the Dog Whistle: A Chinese Dataset for Cant Understanding with Common Sense and World Knowledge
Oct 8, 2024 / ParallelSpec: Parallel Drafter for Efficient Speculative Decoding