Self-Rewarding Language Models — arXiv2