Showing 1–13 of 13 results
/ Date/ Name
Jul 1, 2024Enhancing the Capability and Robustness of Large Language Models through Reinforcement Learning-Driven Query RefinementJan 8, 2026Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent InteractionJan 7, 2026CSSG: Measuring Code Similarity with Semantic GraphsFeb 2, 2026TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World ScenariosAug 27, 2025IntentionReasoner: Facilitating Adaptive LLM Safeguards through Intent Reasoning and Selective Query RefinementApr 15, 2026Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, ChallengesOct 4, 2025SATER: A Self-Aware and Token-Efficient Approach to Routing and CascadingAug 26, 2025Enhancing Model Privacy in Federated Learning with Random Masking and QuantizationJan 30, 2026BatCoder: Self-Supervised Bidirectional Code-Documentation Learning via Back-TranslationFeb 3, 2026Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report GenerationJun 4, 2025Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical ReasoningJan 7, 2026Benchmark^2: Systematic Evaluation of LLM BenchmarksMay 25, 2025RECAST: Expanding the Boundaries of LLMs' Complex Instruction Following with Multi-Constraint Data