Jayanta Kumar Basak, Bhola Paudel, Nibas Chandra Deb, Dae Yeong Kang, Mohammed Abdus Salam, Sanjay Saha Sonet, Hyeon Tae Kim
{"title":"Application of Machine Learning Models for Predicting Methane Concentrations in Pig Barns Using Biophysical Data","authors":"Jayanta Kumar Basak, Bhola Paudel, Nibas Chandra Deb, Dae Yeong Kang, Mohammed Abdus Salam, Sanjay Saha Sonet, Hyeon Tae Kim","doi":"10.1007/s11270-025-07954-8","DOIUrl":null,"url":null,"abstract":"<div><p>The prediction of methane (CH<sub>4</sub>) concentration is important for pig farming due to its environmental impact on pigs and farm workers. This study examined the utilization of machine learning algorithms, specifically multiple linear regression (MLR), XGBoost regression (XGB), and random forest regression (RFR), to predict CH<sub>4</sub> concentrations in pig barns during the growing-finishing stage of pigs. The dataset included five key input biophysical variables: feed intake (FI), pig mass (MP), carbon dioxide (CO<sub>2</sub>) levels, temperature (T), and relative humidity (RH). Data was collected from three experimental pig barns during 2022 and 2023 to train and test the machine learning models. Among the three machine learning models, the RFR consistently outperformed both MLR and XGB in predicting CH<sub>4</sub> concentrations. The results demonstrated better performance by the RFR model in testing (R<sup>2</sup> > 0.81), with improvements in R<sup>2</sup> of up to 1.92% and 10.46%, as well as decreases in RMSE of up to 5.74% and 20.51%, compared to the XGB and MLR across the three input datasets. In terms of stability, MLR exhibited the maximum stability, followed by RFR and XGB. Sensitivity analysis found FI to be the most influential input variable for CH<sub>4</sub> concentration prediction, with the impact ranking being FI > MP > CO<sub>2</sub> > T > RH. This study emphasized the potential of machine learning models, particularly RFR, in predicting CH<sub>4</sub> concentrations using relevant input variables. These findings enhance understanding of CH<sub>4</sub> concentration, providing useful insights into pig production and environmental management.</p></div>","PeriodicalId":808,"journal":{"name":"Water, Air, & Soil Pollution","volume":"236 5","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water, Air, & Soil Pollution","FirstCategoryId":"6","ListUrlMain":"https://link.springer.com/article/10.1007/s11270-025-07954-8","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The prediction of methane (CH4) concentration is important for pig farming due to its environmental impact on pigs and farm workers. This study examined the utilization of machine learning algorithms, specifically multiple linear regression (MLR), XGBoost regression (XGB), and random forest regression (RFR), to predict CH4 concentrations in pig barns during the growing-finishing stage of pigs. The dataset included five key input biophysical variables: feed intake (FI), pig mass (MP), carbon dioxide (CO2) levels, temperature (T), and relative humidity (RH). Data was collected from three experimental pig barns during 2022 and 2023 to train and test the machine learning models. Among the three machine learning models, the RFR consistently outperformed both MLR and XGB in predicting CH4 concentrations. The results demonstrated better performance by the RFR model in testing (R2 > 0.81), with improvements in R2 of up to 1.92% and 10.46%, as well as decreases in RMSE of up to 5.74% and 20.51%, compared to the XGB and MLR across the three input datasets. In terms of stability, MLR exhibited the maximum stability, followed by RFR and XGB. Sensitivity analysis found FI to be the most influential input variable for CH4 concentration prediction, with the impact ranking being FI > MP > CO2 > T > RH. This study emphasized the potential of machine learning models, particularly RFR, in predicting CH4 concentrations using relevant input variables. These findings enhance understanding of CH4 concentration, providing useful insights into pig production and environmental management.
期刊介绍:
Water, Air, & Soil Pollution is an international, interdisciplinary journal on all aspects of pollution and solutions to pollution in the biosphere. This includes chemical, physical and biological processes affecting flora, fauna, water, air and soil in relation to environmental pollution. Because of its scope, the subject areas are diverse and include all aspects of pollution sources, transport, deposition, accumulation, acid precipitation, atmospheric pollution, metals, aquatic pollution including marine pollution and ground water, waste water, pesticides, soil pollution, sewage, sediment pollution, forestry pollution, effects of pollutants on humans, vegetation, fish, aquatic species, micro-organisms, and animals, environmental and molecular toxicology applied to pollution research, biosensors, global and climate change, ecological implications of pollution and pollution models. Water, Air, & Soil Pollution also publishes manuscripts on novel methods used in the study of environmental pollutants, environmental toxicology, environmental biology, novel environmental engineering related to pollution, biodiversity as influenced by pollution, novel environmental biotechnology as applied to pollution (e.g. bioremediation), environmental modelling and biorestoration of polluted environments.
Articles should not be submitted that are of local interest only and do not advance international knowledge in environmental pollution and solutions to pollution. Articles that simply replicate known knowledge or techniques while researching a local pollution problem will normally be rejected without review. Submitted articles must have up-to-date references, employ the correct experimental replication and statistical analysis, where needed and contain a significant contribution to new knowledge. The publishing and editorial team sincerely appreciate your cooperation.
Water, Air, & Soil Pollution publishes research papers; review articles; mini-reviews; and book reviews.