Anomaly Detection for Automated Data Quality Monitoring in the CMS Detector
/ Authors
A. Brinkerhoff, Chosila Sutantawibul, I. Suarez, R. White, C. Daumann, J. Guiang, C. Freer, S. May, B. Marsh, Darin Acosta
and 17 more authors
Alex Aubuchon, E. Barberis, A. Bundock, C. Campagnari, E. Collins, Preston Epps, J. Erdmann, H. Flaecher, Jun Huang, V. Nguyen, Ryan Nie, S. Paramesvaran, J. Rotter, K. Salyer, S. Sawant, T. Sheokand, Darien Wood
/ Abstract
Successful operation of large particle detectors like the Compact Muon Solenoid (CMS) at the CERN Large Hadron Collider requires rapid, in-depth assessment of data quality. We introduce the “AutoDQM” system for Automated Data Quality Monitoring using advanced statistical techniques and unsupervised machine learning. Anomaly detection algorithms based on the beta-binomial probability function and principal component analysis are tested on the full set of proton-proton collision data collected by CMS in 2022. AutoDQM identifies anomalous “bad” data affected by significant detector malfunction at a rate 4 – 6 times higher than “good” data, demonstrating its effectiveness as a general data quality monitoring tool.
Journal: Epj Research Infrastructures