Showing 1–20 of 28 results
/ Date/ Name
Dec 18, 2023Learning Top-k Subtask Planning Tree based on Discriminative Representation Pre-training for Decision MakingApr 12, 2026SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive WeightingJul 28, 2023Learning to Collaborate by Grouping: a Consensus-oriented Strategy for Multi-agent Reinforcement LearningSep 20, 2021Learning Multi-agent Action Coordination via Electing First-move AgentMay 27, 2024CoSLight: Co-optimizing Collaborator Selection and Decision-making to Enhance Traffic Signal ControlAug 7, 2023TPTU: Large Language Model-based AI Agents for Task Planning and Tool UsageJan 17, 2022GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement LearningJul 2, 2025Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling StrategyDec 22, 2023DuaLight: Enhancing Traffic Signal Control by Leveraging Scenario-Specific and Scenario-Shared KnowledgeMay 21, 2025When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient ReasoningMay 4, 2023Explainable Reinforcement Learning via a Causal World ModelNov 19, 2023TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world SystemsMay 21, 2024Learning Causal Dynamics Models in Object-Oriented EnvironmentsMay 10, 2023Mixture of personality improved Spiking actor network for efficient multi-agent cooperationJul 15, 2024GuideLight: "Industrial Solution" Guidance for More Practical Traffic Signal Control AgentsOct 13, 2025Revisiting Entropy Regularization: Adaptive Coefficient Unlocks Its Potential for LLM Reinforcement LearningOct 28, 2023Reboost Large Language Model-based Text-to-SQL, Text-to-Python, and Text-to-Function -- with Real Applications in Traffic DomainMar 2, 2026Harmonizing Dense and Sparse Signals in Multi-turn RL: Dual-Horizon Credit Assignment for Industrial Sales AgentsNov 23, 2023Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic ApproachJul 22, 2023Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs