Showing 1–20 of 26 results
/ Date/ Name
Mar 9, 2023Conceptual Reinforcement Learning for Language-Conditioned TasksOct 13, 2022Causality-driven Hierarchical Structure Discovery for Reinforcement LearningJun 14, 2025QiMeng-Attention: SOTA Attention Operator is generated by SOTA Attention AlgorithmMar 19, 2026DA-Mamba: Learning Domain-Aware State Space Model for Global-Local Alignment in Domain Adaptive Object DetectionJul 26, 2021Hindsight Value Function for Variance Reduction in Stochastic Dynamic EnvironmentJun 12, 2023Online Prototype Alignment for Few-shot Policy TransferNov 2, 2023Efficient Symbolic Policy Learning with Differentiable Symbolic ExpressionDec 9, 2024World-Consistent Data Generation for Vision-and-Language NavigationNov 26, 2025Efficient Diffusion Planning with Temporal DiffusionNov 25, 2025QiMeng-Kernel: Macro-Thinking Micro-Coding Paradigm for LLM-Based High-Performance GPU Kernel GenerationNov 8, 2023Emergent Communication for Rules ReasoningNov 2, 2023Contrastive Modules with Temporal Attention for Multi-Task Reinforcement LearningMay 29, 2023ANPL: Towards Natural Programming with Interactive DecompositionJan 12, 2026Segmental Advantage Estimation: Enhancing PPO for Long-Context LLM TrainingSep 4, 2021Eden: A Unified Environment Framework for Booming Reinforcement Learning AlgorithmsOct 13, 2022Object-Category Aware Reinforcement LearningSep 4, 2023Self-driven Grounding: Large Language Model Agents with Automatical Language-aligned Skill LearningNov 7, 2023Context Shift Reduction for Offline Meta-Reinforcement LearningJun 5, 2024Prompt-based Visual Alignment for Zero-shot Policy TransferAug 16, 2024Ex3: Automatic Novel Writing by Extracting, Excelsior and Expanding