arXiv2
Search
Dark
/ Date
/ Name
Aa
W
/ Date
/ Name
"au:"Max Sobol Mark"" — arXiv2 Search
Showing 1–6 of 6 results
/ Date
/ Name
Oct 12, 2023
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
Dec 9, 2024
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone
Feb 16, 2026
BPP: Long-Context Robot Imitation Learning by Focusing on Key History Frames
Oct 23, 2023
Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning
May 28, 2019
Unsupervised Learning from Video with Deep Neural Embeddings
Mar 9, 2023
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning