Benchmarking Real-World Medical Image Classification with Noisy Labels: Challenges, Practice, and Outlook — arXiv2