Self-Generated Critiques Boost Reward Modeling for Language Models — arXiv2