Analyzing Data Selection Techniques with Tools from the Theory of Information Losses (arXiv)