Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models — arXiv2