Showing 21–31 of 31 results
/ Date/ Name
Sep 30, 2024FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"Jun 5, 2025Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement LearningFeb 3, 2026MAS-ProVe: Understanding the Process Verification of Multi-Agent SystemsAug 20, 2022A Multi-Head Model for Continual Learning via Out-of-Distribution ReplayApr 20, 2023Open-World Continual Learning: Unifying Novelty Detection and Continual LearningDec 20, 2024In-context Continual Learning Assisted by an External Continual LearnerJun 3, 2025Helpful Agent Meets Deceptive Judge: Understanding Vulnerabilities in Agentic WorkflowsOct 16, 2025LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the WildSep 16, 2024SFR-RAG: Towards Contextually Faithful LLMsJan 14, 2026Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A SurveyDec 20, 2024Continual Learning Using Only Large Language Model Prompting