Reek Majumder, Jacquan Pollard, M Sabbir Salek, David Werth, Gurcan Comert, Adrian Gale, Sakib Mahmud Khan, Samuel Darko, Mashrur Chowdhury
{"title":"基于集合学习的环境甲烷检测和强度预测模型的开发与评估。","authors":"Reek Majumder, Jacquan Pollard, M Sabbir Salek, David Werth, Gurcan Comert, Adrian Gale, Sakib Mahmud Khan, Samuel Darko, Mashrur Chowdhury","doi":"10.1177/11786302241227307","DOIUrl":null,"url":null,"abstract":"<p><p>The environmental impacts of global warming driven by methane (CH<sub>4</sub>) emissions have catalyzed significant research initiatives in developing novel technologies that enable proactive and rapid detection of CH<sub>4</sub>. Several data-driven machine learning (ML) models were tested to determine how well they identified fugitive CH<sub>4</sub> and its related intensity in the affected areas. Various meteorological characteristics, including wind speed, temperature, pressure, relative humidity, water vapor, and heat flux, were included in the simulation. We used the ensemble learning method to determine the best-performing weighted ensemble ML models built upon several weaker lower-layer ML models to (i) detect the presence of CH<sub>4</sub> as a classification problem and (ii) predict the intensity of CH<sub>4</sub> as a regression problem. The classification model performance for CH<sub>4</sub> detection was evaluated using accuracy, F1 score, Matthew's Correlation Coefficient (MCC), and the area under the receiver operating characteristic curve (AUC ROC), with the top-performing model being 97.2%, 0.972, 0.945 and 0.995, respectively. The <i>R</i><sup> 2</sup> score was used to evaluate the regression model performance for CH<sub>4</sub> intensity prediction, with the <i>R</i><sup> 2</sup> score of the best-performing model being 0.858. The ML models developed in this study for fugitive CH<sub>4</sub> detection and intensity prediction can be used with fixed environmental sensors deployed on the ground or with sensors mounted on unmanned aerial vehicles (UAVs) for mobile detection.</p>","PeriodicalId":11827,"journal":{"name":"Environmental Health Insights","volume":"18 ","pages":"11786302241227307"},"PeriodicalIF":2.3000,"publicationDate":"2024-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10901066/pdf/","citationCount":"0","resultStr":"{\"title\":\"Development and Evaluation of Ensemble Learning-based Environmental Methane Detection and Intensity Prediction Models.\",\"authors\":\"Reek Majumder, Jacquan Pollard, M Sabbir Salek, David Werth, Gurcan Comert, Adrian Gale, Sakib Mahmud Khan, Samuel Darko, Mashrur Chowdhury\",\"doi\":\"10.1177/11786302241227307\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>The environmental impacts of global warming driven by methane (CH<sub>4</sub>) emissions have catalyzed significant research initiatives in developing novel technologies that enable proactive and rapid detection of CH<sub>4</sub>. Several data-driven machine learning (ML) models were tested to determine how well they identified fugitive CH<sub>4</sub> and its related intensity in the affected areas. Various meteorological characteristics, including wind speed, temperature, pressure, relative humidity, water vapor, and heat flux, were included in the simulation. We used the ensemble learning method to determine the best-performing weighted ensemble ML models built upon several weaker lower-layer ML models to (i) detect the presence of CH<sub>4</sub> as a classification problem and (ii) predict the intensity of CH<sub>4</sub> as a regression problem. The classification model performance for CH<sub>4</sub> detection was evaluated using accuracy, F1 score, Matthew's Correlation Coefficient (MCC), and the area under the receiver operating characteristic curve (AUC ROC), with the top-performing model being 97.2%, 0.972, 0.945 and 0.995, respectively. The <i>R</i><sup> 2</sup> score was used to evaluate the regression model performance for CH<sub>4</sub> intensity prediction, with the <i>R</i><sup> 2</sup> score of the best-performing model being 0.858. The ML models developed in this study for fugitive CH<sub>4</sub> detection and intensity prediction can be used with fixed environmental sensors deployed on the ground or with sensors mounted on unmanned aerial vehicles (UAVs) for mobile detection.</p>\",\"PeriodicalId\":11827,\"journal\":{\"name\":\"Environmental Health Insights\",\"volume\":\"18 \",\"pages\":\"11786302241227307\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-02-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10901066/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Health Insights\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/11786302241227307\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Health Insights","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11786302241227307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
摘要
甲烷(CH4)排放导致的全球变暖对环境的影响催生了大量研究项目,这些项目旨在开发能够主动、快速检测甲烷(CH4)的新型技术。我们测试了几种数据驱动的机器学习(ML)模型,以确定它们识别受影响地区逃逸的甲烷(CH4)及其相关强度的能力。模拟中包含了各种气象特征,包括风速、温度、压力、相对湿度、水蒸气和热通量。我们使用集合学习法来确定性能最佳的加权集合 ML 模型,该模型建立在多个较弱的低层 ML 模型之上,用于 (i) 作为分类问题检测 CH4 的存在,以及 (ii) 作为回归问题预测 CH4 的强度。使用准确率、F1 分数、马修相关系数(MCC)和接收器工作特征曲线下面积(AUC ROC)评估了检测 CH4 的分类模型性能,其中表现最好的模型分别为 97.2%、0.972、0.945 和 0.995。R 2 分数用于评估 CH4 强度预测回归模型的性能,表现最好的模型的 R 2 分数为 0.858。本研究开发的用于逃逸性 CH4 检测和强度预测的 ML 模型可与部署在地面上的固定环境传感器一起使用,也可与安装在无人机上的传感器一起用于移动检测。
Development and Evaluation of Ensemble Learning-based Environmental Methane Detection and Intensity Prediction Models.
The environmental impacts of global warming driven by methane (CH4) emissions have catalyzed significant research initiatives in developing novel technologies that enable proactive and rapid detection of CH4. Several data-driven machine learning (ML) models were tested to determine how well they identified fugitive CH4 and its related intensity in the affected areas. Various meteorological characteristics, including wind speed, temperature, pressure, relative humidity, water vapor, and heat flux, were included in the simulation. We used the ensemble learning method to determine the best-performing weighted ensemble ML models built upon several weaker lower-layer ML models to (i) detect the presence of CH4 as a classification problem and (ii) predict the intensity of CH4 as a regression problem. The classification model performance for CH4 detection was evaluated using accuracy, F1 score, Matthew's Correlation Coefficient (MCC), and the area under the receiver operating characteristic curve (AUC ROC), with the top-performing model being 97.2%, 0.972, 0.945 and 0.995, respectively. The R 2 score was used to evaluate the regression model performance for CH4 intensity prediction, with the R 2 score of the best-performing model being 0.858. The ML models developed in this study for fugitive CH4 detection and intensity prediction can be used with fixed environmental sensors deployed on the ground or with sensors mounted on unmanned aerial vehicles (UAVs) for mobile detection.