GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning — arXiv2