Showing 1–16 of 16 results
/ Date/ Name
Aug 26, 2025GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository LeveragingMay 27, 2025RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task SolvingJun 5, 2024SpikeLM: Towards General Spike-Driven Language Modeling via Elastic Bi-Spiking MechanismsAug 4, 2025SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based AgentsMay 31, 2023VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech RecognitionDec 18, 2024Tree-of-Code: A Hybrid Approach for Robust Complex Task Planning and ExecutionDec 19, 2024Tree-of-Code: A Tree-Structured Exploring Framework for End-to-End Code Generation and Execution in Complex Task HandlingFeb 16, 2025ShieldLearner: A New Paradigm for Jailbreak Attack Defense in LLMsOct 1, 2024Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter MergingMay 10, 2023Mixture of personality improved Spiking actor network for efficient multi-agent cooperationMar 2, 2023Matching-based Term Semantics Pre-training for Spoken Patient Query UnderstandingDec 15, 2025AOI: Context-Aware Multi-Agent Operations via Dynamic Scheduling and Hierarchical Memory CompressionApr 14, 2023nanoLM: an Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across ScalesDec 21, 2025Learning-Based Automated Adversarial Red-Teaming for Robustness Evaluation of Large Language ModelsJan 6, 2026TiMem: Temporal-Hierarchical Memory Consolidation for Long-Horizon Conversational AgentsJul 30, 2023A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction