Showing 581–600 of 1,726 results
/ Date/ Name
Mar 21, 2025Sparse Logit Sampling: Accelerating Knowledge Distillation in LLMsMar 20, 2025Survey on Evaluation of LLM-based AgentsMar 20, 2025The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data ContaminationMar 20, 2025Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language ModelsMar 19, 2025EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?Mar 19, 2025Causal Discovery and Counterfactual Reasoning to Optimize Persuasive Dialogue PoliciesMar 18, 2025Calibrating Verbal Uncertainty as a Linear Feature to Reduce HallucinationsMar 17, 2025DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning PerspectiveMar 12, 2025Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language ModelsMar 10, 2025Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast AsiaMar 9, 2025Alignment for Efficient Tool Calling of Large Language ModelsMar 6, 2025PP-DocBee: Improving Multimodal Document Understanding Through a Bag of TricksMar 5, 2025Structured Outputs Enable General-Purpose LLMs to be Medical ExpertsMar 3, 2025Word Form Matters: LLMs' Semantic Reconstruction under TypoglycemiaMar 3, 2025Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of SummarizationMar 3, 2025Sherkala-Chat: Building a State-of-the-Art LLM for Kazakh in a Moderately Resourced SettingFeb 26, 2025LongEval: A Comprehensive Analysis of Long-Text Generation Through a Plan-based ParadigmFeb 25, 2025On Synthetic Data Strategies for Domain-Specific Generative RetrievalFeb 24, 2025Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal GesturesFeb 24, 2025Predicting Liquidity-Aware Bond Yields using Causal GANs and Deep Reinforcement Learning with LLM Evaluation