Showing 1–20 of 56 results
/ Date/ Name
Mar 13, 2022DARA: Dynamics-Aware Reward Augmentation in Offline Reinforcement LearningOct 25, 2021Unsupervised Domain Adaptation with Dynamics-Aware Rewards in Reinforcement LearningApr 11, 2021Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep Reinforcement LearningJun 26, 2023CEIL: Generalized Contextual Imitation LearningMay 23, 2024DIDI: Diffusion-Guided Diversity for Offline Behavioral GenerationJun 26, 2023Design from Policies: Conservative Test-Time Adaptation for Offline Policy OptimizationAug 20, 2024Tracing Privacy Leakage of Language Models to Training Data via Adjusted Influence FunctionsFeb 22, 2023Behavior Proximal Policy OptimizationJun 23, 2023CLUE: Calibrated Latent Guidance for Offline Reinforcement LearningJun 22, 2023Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement LearningJun 3, 2020A no-gold-standard technique to objectively evaluate quantitative imaging methods using patient data: TheoryJun 24, 2018High-speed RF Switch Electronics for picking up of Electron-Positron Beam BunchesApr 7, 2022Machine Learning-Enabled IoT Security: Open Issues and Challenges Under Advanced Persistent ThreatsJan 18, 2024Deep Dict: Deep Learning-based Lossy Time Series Compressor for IoT DataOct 7, 2023Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy ExplorationJun 15, 2023KoLA: Carefully Benchmarking World Knowledge of Large Language ModelsOct 21, 2025WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-ReflectionApr 11, 2026Energy-Efficient Hybrid Data Computation via Coordinated AirComp and Edge OffloadingAug 8, 2025GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation ModelsJul 19, 2023STRAPPER: Preference-based Reinforcement Learning via Self-training Augmentation and Peer Regularization