Showing 1–14 of 14 results
/ Date/ Name
Aug 21, 2023LatEval: An Interactive LLMs Evaluation Benchmark with Incomplete Information from Lateral Thinking PuzzlesJul 13, 2025MCEval: A Dynamic Framework for Fair Multilingual Cultural Evaluation of LLMsSep 28, 2025Beyond English-Centric Training: How Reinforcement Learning Improves Cross-Lingual Reasoning in LLMsFeb 22, 2025ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM ReasoningOct 29, 2022Towards Attribute-Entangled Controllable Text Generation: A Pilot Study of Blessing GenerationJul 27, 2023MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative EntitiesOct 19, 2022Learning from the Dictionary: Heterogeneous Knowledge Guided Fine-tuning for Chinese Spell CheckingOct 19, 2022Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error CorrectionJun 30, 2023Correct Like Humans: Progressive Learning Framework for Chinese Text Error CorrectionApr 7, 2023From Retrieval to Generation: Efficient and Effective Entity Set ExpansionJul 17, 2022Automatic Context Pattern Generation for Entity Set ExpansionApr 22, 2025Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference ModelJul 31, 2025How Far Are AI Scientists from Changing the World?Dec 25, 2023EcomGPT-CT: Continual Pre-training of E-commerce Large Language Models with Semi-structured Data