Current validation practice undermines surgical AI development (arXiv)