Reasoning-Driven Synthetic Data Generation and Evaluation — arXiv2