Khaiwal Ravindra, Sahil Kumar, Abhishek Kumar, Suman Mor
{"title":"Enhancing accuracy of air quality sensors with machine learning to augment large-scale monitoring networks","authors":"Khaiwal Ravindra, Sahil Kumar, Abhishek Kumar, Suman Mor","doi":"10.1038/s41612-024-00833-9","DOIUrl":null,"url":null,"abstract":"Low-cost sensors have revolutionized air quality monitoring, however, precision is questioned compared to reference instruments. Hence, the performance of two widely used PM2.5 Sensors, Purple Air (PA) and ATMOS, were evaluated over a 10-month period in the North Western-Indo Gangetic Plains (NW-IGP). In-field collocation with Beta Attenuation Monitor found low R2 values; 0.40 for ATMOS and 0.43 for PA. To calibrate and improve the accuracy of sensors, five Machine Learning (ML) models and an empirical relative humidity correction methodology were used separately for both sensors. Out of these, the Decision Tree outperformed others, and R2 values improved to 0.996 for ATMOS and 0.999 for PA. Root mean square error reduced from 34.6 µg/m3 to 0.731 µg/m3 for ATMOS and from 77.7 µg/m3 to 0.61 µg/m3 for PA, while using DT as a calibrating model. The study reveals the best-performing ML model for correcting PM2.5 sensor data, enhancing the accuracy of air quality monitoring systems.","PeriodicalId":19438,"journal":{"name":"npj Climate and Atmospheric Science","volume":" ","pages":"1-11"},"PeriodicalIF":8.5000,"publicationDate":"2024-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41612-024-00833-9.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"npj Climate and Atmospheric Science","FirstCategoryId":"89","ListUrlMain":"https://www.nature.com/articles/s41612-024-00833-9","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"METEOROLOGY & ATMOSPHERIC SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Low-cost sensors have revolutionized air quality monitoring, however, precision is questioned compared to reference instruments. Hence, the performance of two widely used PM2.5 Sensors, Purple Air (PA) and ATMOS, were evaluated over a 10-month period in the North Western-Indo Gangetic Plains (NW-IGP). In-field collocation with Beta Attenuation Monitor found low R2 values; 0.40 for ATMOS and 0.43 for PA. To calibrate and improve the accuracy of sensors, five Machine Learning (ML) models and an empirical relative humidity correction methodology were used separately for both sensors. Out of these, the Decision Tree outperformed others, and R2 values improved to 0.996 for ATMOS and 0.999 for PA. Root mean square error reduced from 34.6 µg/m3 to 0.731 µg/m3 for ATMOS and from 77.7 µg/m3 to 0.61 µg/m3 for PA, while using DT as a calibrating model. The study reveals the best-performing ML model for correcting PM2.5 sensor data, enhancing the accuracy of air quality monitoring systems.
低成本传感器已经彻底改变了空气质量监测,然而,与参考仪器相比,精度受到质疑。因此,在西北印度恒河平原(NW-IGP)对两种广泛使用的PM2.5传感器Purple Air (PA)和ATMOS的性能进行了为期10个月的评估。与Beta衰减监视器现场搭配发现R2值较低;ATMOS 0.40, PA 0.43。为了校准和提高传感器的精度,分别对两个传感器使用了五种机器学习(ML)模型和经验相对湿度校正方法。其中,决策树的表现优于其他决策树,ATMOS的R2值提高到0.996,PA的R2值提高到0.999。当使用DT作为校准模型时,ATMOS的均方根误差从34.6µg/m3降至0.731µg/m3, PA的均方根误差从77.7µg/m3降至0.61µg/m3。该研究揭示了用于校正PM2.5传感器数据的最佳ML模型,提高了空气质量监测系统的准确性。
期刊介绍:
npj Climate and Atmospheric Science is an open-access journal encompassing the relevant physical, chemical, and biological aspects of atmospheric and climate science. The journal places particular emphasis on regional studies that unveil new insights into specific localities, including examinations of local atmospheric composition, such as aerosols.
The range of topics covered by the journal includes climate dynamics, climate variability, weather and climate prediction, climate change, ocean dynamics, weather extremes, air pollution, atmospheric chemistry (including aerosols), the hydrological cycle, and atmosphere–ocean and atmosphere–land interactions. The journal welcomes studies employing a diverse array of methods, including numerical and statistical modeling, the development and application of in situ observational techniques, remote sensing, and the development or evaluation of new reanalyses.