Agent-Agnostic Human-in-the-Loop Reinforcement Learning — arXiv2