Ł. Bibrzycki, L. Bibrzycki, D. Alvarez-Castillo, O. Bar, D. Góra, P. Homola, P. Kovács, M. Niedźwiecki, M. Piekarczyk, K. Rzecki, J. Stasielak, S. Stuglik, O. Sushchov, A. Tursunov, Credo
{"title":"Machine learning aided noise filtration and signal classification for CREDO experiment","authors":"Ł. Bibrzycki, L. Bibrzycki, D. Alvarez-Castillo, O. Bar, D. Góra, P. Homola, P. Kovács, M. Niedźwiecki, M. Piekarczyk, K. Rzecki, J. Stasielak, S. Stuglik, O. Sushchov, A. Tursunov, Credo","doi":"10.22323/1.395.0227","DOIUrl":null,"url":null,"abstract":"The wealth of smartphone data collected by the Cosmic Ray Extremely Distributed Observatory (CREDO) greatly surpasses the capabilities of manual analysis. So, efficient means of rejecting the non-cosmic-ray noise and identification of signals attributable to extensive air showers are necessary. To address these problemswe discuss a Convolutional Neural Network-basedmethod of artefact rejection and complementarymethod of particle identification based on common statistical classifiers aswell as their ensemble extensions. These approaches are based on supervised learning, so we need to provide a representative subset of the CREDO dataset for training and validation. According to this approach over 2300 images were chosen and manually labeled by 5 judges. The images were split into spot, track, worm (collectively named signals) and artefact classes. Then the preprocessing consisting of luminance summation of RGB channels (grayscaling) and background removal by adaptive thresholding was performed. For purposes of artefact rejection the binary CNN-based classifier was proposed which is able to distinguish between artefacts and signals. The classifier was fed with input data in the form of Daubechies wavelet transformed images. In the case of cosmic ray signal classification, the well-known feature-based classifiers were considered. As feature descriptors, we used Zernike moments with additional feature related to total image luminance. For the problem of artefact rejection, we obtained an accuracy of 99%. For the 4-class signal classification, the best performing classifiers achieved a recognition rate of 88%.","PeriodicalId":20473,"journal":{"name":"Proceedings of 37th International Cosmic Ray Conference — PoS(ICRC2021)","volume":"17 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 37th International Cosmic Ray Conference — PoS(ICRC2021)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22323/1.395.0227","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The wealth of smartphone data collected by the Cosmic Ray Extremely Distributed Observatory (CREDO) greatly surpasses the capabilities of manual analysis. So, efficient means of rejecting the non-cosmic-ray noise and identification of signals attributable to extensive air showers are necessary. To address these problemswe discuss a Convolutional Neural Network-basedmethod of artefact rejection and complementarymethod of particle identification based on common statistical classifiers aswell as their ensemble extensions. These approaches are based on supervised learning, so we need to provide a representative subset of the CREDO dataset for training and validation. According to this approach over 2300 images were chosen and manually labeled by 5 judges. The images were split into spot, track, worm (collectively named signals) and artefact classes. Then the preprocessing consisting of luminance summation of RGB channels (grayscaling) and background removal by adaptive thresholding was performed. For purposes of artefact rejection the binary CNN-based classifier was proposed which is able to distinguish between artefacts and signals. The classifier was fed with input data in the form of Daubechies wavelet transformed images. In the case of cosmic ray signal classification, the well-known feature-based classifiers were considered. As feature descriptors, we used Zernike moments with additional feature related to total image luminance. For the problem of artefact rejection, we obtained an accuracy of 99%. For the 4-class signal classification, the best performing classifiers achieved a recognition rate of 88%.