Showing 1–20 of 24 results
/ Date/ Name
Nov 16, 2023FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language ModelsMar 14, 2025Joint Training And Decoding for Multilingual End-to-End Simultaneous Speech TranslationFeb 26, 2024A Comprehensive Evaluation of Quantization Strategies for Large Language ModelsSep 4, 2022Informative Language Representation Learning for Massively Multilingual Neural Machine TranslationJun 30, 2025TaP: A Taxonomy-Guided Framework for Automated and Scalable Preference Data GenerationNov 8, 2025Revisiting Entropy in Reinforcement Learning for Large Reasoning ModelsMar 19, 2024LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language ModelsJul 12, 2025Advancing Large Language Models for Tibetan with Curated Data and Continual Pre-TrainingJan 29, 2026SOUP: Token-level Single-sample Mix-policy Reinforcement Learning for Large Language ModelsApr 27, 2026Why Does Reinforcement Learning Generalize? A Feature-Level Mechanistic Study of Post-Training in Large Language ModelsMar 18, 2024OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and SafetyOct 30, 2023Evaluating Large Language Models: A Comprehensive SurveySep 26, 2023Large Language Model Alignment: A SurveyAug 12, 2024FuxiTranyu: A Multilingual Large Language Model Trained with Balanced DataJun 26, 2024IRCAN: Mitigating Knowledge Conflicts in LLM Generation via Identifying and Reweighting Context-Aware NeuronsMay 22, 2024ConTrans: Weak-to-Strong Alignment Engineering via Concept TransplantationMar 12, 2024FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language ModelsDec 23, 2024Large Language Model Safety: A Holistic SurveyFeb 28, 2025ProBench: Benchmarking Large Language Models in Competitive ProgrammingApr 14, 2026KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance