Showing 1–18 of 18 results
/ Date/ Name
Mar 2, 2025Offline RLAIF: Piloting VLM Feedback for RL via SFOSep 10, 2025Bias in the Loop: How Humans Evaluate AI-Generated SuggestionsOct 10, 2024Metalic: Meta-Learning In-Context with Protein Language ModelsSep 26, 2023Recurrent Hypernetworks are Surprisingly Strong in Meta-RLOct 20, 2022Hypernetworks in Meta-Reinforcement LearningJul 29, 2018Neural Mesh: Introducing a Notion of Space and Conservation of Energy to Neural NetworksJan 16, 2019ReNeg and Backseat Driver: Learning from Demonstration with Continuous Human FeedbackMar 5, 2024SplAgger: Split Aggregation for Meta-Reinforcement LearningJan 19, 2023A Tutorial on Meta-Reinforcement LearningFeb 22, 2023Universal Morphology Control via Contextual ModulationNov 23, 2023Annotation Sensitivity: Training Data Collection Methods Affect Model PerformanceFeb 11, 2025A Survey of In-Context Reinforcement LearningAug 23, 2019Stackelberg Punishment and Bully-Proofing Autonomous VehiclesSep 22, 2022An Investigation of the Bias-Variance Tradeoff in Meta-GradientsFeb 9, 2024Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology ControlDec 1, 2021On the Practical Consistency of Meta-Reinforcement Learning AlgorithmsJan 31, 2022Trust Region Bounds for Decentralized PPO Under Non-stationarityDec 20, 2024Cracking the Code: Evaluating Zero-Shot Prompting Methods for Providing Programming Feedback