Reward Shaping via Meta-Learning — arXiv2