Reek Majumder, Jacquan Pollard, M Sabbir Salek, David Werth, Gurcan Comert, Adrian Gale, Sakib Mahmud Khan, Samuel Darko, Mashrur Chowdhury
{"title":"Development and Evaluation of Ensemble Learning-based Environmental Methane Detection and Intensity Prediction Models.","authors":"Reek Majumder, Jacquan Pollard, M Sabbir Salek, David Werth, Gurcan Comert, Adrian Gale, Sakib Mahmud Khan, Samuel Darko, Mashrur Chowdhury","doi":"10.1177/11786302241227307","DOIUrl":null,"url":null,"abstract":"<p><p>The environmental impacts of global warming driven by methane (CH<sub>4</sub>) emissions have catalyzed significant research initiatives in developing novel technologies that enable proactive and rapid detection of CH<sub>4</sub>. Several data-driven machine learning (ML) models were tested to determine how well they identified fugitive CH<sub>4</sub> and its related intensity in the affected areas. Various meteorological characteristics, including wind speed, temperature, pressure, relative humidity, water vapor, and heat flux, were included in the simulation. We used the ensemble learning method to determine the best-performing weighted ensemble ML models built upon several weaker lower-layer ML models to (i) detect the presence of CH<sub>4</sub> as a classification problem and (ii) predict the intensity of CH<sub>4</sub> as a regression problem. The classification model performance for CH<sub>4</sub> detection was evaluated using accuracy, F1 score, Matthew's Correlation Coefficient (MCC), and the area under the receiver operating characteristic curve (AUC ROC), with the top-performing model being 97.2%, 0.972, 0.945 and 0.995, respectively. The <i>R</i><sup> 2</sup> score was used to evaluate the regression model performance for CH<sub>4</sub> intensity prediction, with the <i>R</i><sup> 2</sup> score of the best-performing model being 0.858. The ML models developed in this study for fugitive CH<sub>4</sub> detection and intensity prediction can be used with fixed environmental sensors deployed on the ground or with sensors mounted on unmanned aerial vehicles (UAVs) for mobile detection.</p>","PeriodicalId":11827,"journal":{"name":"Environmental Health Insights","volume":"18 ","pages":"11786302241227307"},"PeriodicalIF":2.3000,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10901066/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Health Insights","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11786302241227307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
The environmental impacts of global warming driven by methane (CH4) emissions have catalyzed significant research initiatives in developing novel technologies that enable proactive and rapid detection of CH4. Several data-driven machine learning (ML) models were tested to determine how well they identified fugitive CH4 and its related intensity in the affected areas. Various meteorological characteristics, including wind speed, temperature, pressure, relative humidity, water vapor, and heat flux, were included in the simulation. We used the ensemble learning method to determine the best-performing weighted ensemble ML models built upon several weaker lower-layer ML models to (i) detect the presence of CH4 as a classification problem and (ii) predict the intensity of CH4 as a regression problem. The classification model performance for CH4 detection was evaluated using accuracy, F1 score, Matthew's Correlation Coefficient (MCC), and the area under the receiver operating characteristic curve (AUC ROC), with the top-performing model being 97.2%, 0.972, 0.945 and 0.995, respectively. The R 2 score was used to evaluate the regression model performance for CH4 intensity prediction, with the R 2 score of the best-performing model being 0.858. The ML models developed in this study for fugitive CH4 detection and intensity prediction can be used with fixed environmental sensors deployed on the ground or with sensors mounted on unmanned aerial vehicles (UAVs) for mobile detection.