Date          Name
Oct 7, 2022   A Unified Framework for Multi-intent Spoken Language Understanding with prompting
Apr 14, 2023  API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs
Apr 7, 2022   Interacting with Non-Cooperative User: A New Paradigm for Proactive Dialogue Policy
Jun 30, 2023  Preference Ranking Optimization for Human Alignment
Mar 17, 2024  Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment
Feb 14, 2024  ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization
Sep 4, 2024   Towards a Unified View of Preference Learning for Large Language Models: A Survey
Oct 10, 2025  Mitigating Overthinking through Reasoning Shaping
Sep 5, 2023   Making Large Language Models Better Reasoners with Alignment
Aug 6, 2025   P-Aligner: Enabling Pre-Alignment of Language Models via Principled Instruction Synthesis
Oct 10, 2024  Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Jun 9, 2025   Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding
May 19, 2025  TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios
Mar 4, 2025   MPO: Boosting LLM Agents with Meta Plan Optimization
Feb 9, 2026   TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions
Mar 11, 2025  Odysseus Navigates the Sirens' Song: Dynamic Focus Decoding for Factual and Diverse Open-Ended Text Generation
Jul 28, 2025  Kimi K2: Open Agentic Intelligence
Jan 12, 2026  Two Pathways to Truthfulness: On the Intrinsic Encoding of LLM Hallucinations
May 20, 2024  Learning Spatial Similarity Distribution for Few-shot Object Counting
Feb 2, 2026   Kimi K2.5: Visual Agentic Intelligence