Date / Name
Jul 30, 2024 / ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement Learning
Oct 15, 2024 / Unsupervised Training of Diffusion Models for Feasible Solution Generation in Neural Combinatorial Optimization
Jun 16, 2025 / K/DA: Automated Data Generation Pipeline for Detoxifying Implicitly Offensive Language in Korean
May 16, 2025 / FALCON: False-Negative Aware Learning of Contrastive Negatives in Vision-Language Alignment
Oct 15, 2024 / Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task
Feb 8, 2026 / Direct Soft-Policy Sampling via Langevin Dynamics
Feb 24, 2026 / ERA: Evidence-based Reliability Alignment for Honest Retrieval-Augmented Generation
Jun 21, 2021 / OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Oct 24, 2022 / Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
Jul 3, 2025 / Offline Reinforcement Learning with Penalized Action Noise Injection
May 16, 2025 / Prior-Guided Diffusion Planning for Offline Reinforcement Learning
Nov 15, 2024 / Adaptive Non-uniform Timestep Sampling for Accelerating Diffusion Model Training
Jun 9, 2025 / FairDICE: Fairness-Driven Offline Multi-Objective Reinforcement Learning
Sep 17, 2025 / Iterative Prompt Refinement for Safer Text-to-Image Generation
Sep 25, 2025 / Actor-Critic without Actor
Jun 10, 2025 / Semi-gradient DICE for Offline Constrained Reinforcement Learning
Sep 26, 2025 / Beyond RAG vs. Long-Context: Learning Distraction-Aware Retrieval for Efficient Knowledge Grounding
Feb 2, 2026 / TABX: A High-Throughput Sandbox Battle Simulator for Multi-Agent Reinforcement Learning
Jan 18, 2024 / Offline Imitation Learning by Controlling the Effective Planning Horizon
Jan 22, 2025 / NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations