Learning Without Critics? Revisiting GRPO in Classical Reinforcement Learning Environments — arXiv2