Juan Diaz, Z. Agioutantis, D. Hristopulos, S. Schafrik
{"title":"Managing and Utilizing Big Data in Atmospheric Monitoring Systems for Underground Coal Mines","authors":"Juan Diaz, Z. Agioutantis, D. Hristopulos, S. Schafrik","doi":"10.3390/materproc2021005078","DOIUrl":null,"url":null,"abstract":"Underground coal mining Atmospheric Monitoring Systems (AMS) have been implemented for real-time or near real-time monitoring and evaluation of the mine atmosphere and related parameters such as gas concentration (e.g., CH4, CO, O2), fan performance (e.g., power, speed), barometric pressure, ambient temperature, humidity, etc. Depending on the sampling frequency, AMS can collect and manage a tremendous amount of data, which mine operators typically consult for everyday operations as well as long-term planning and more effective management of ventilation systems. The raw data collected by AMS need considerable pre-processing and filtering before they can be used for analysis. This paper discusses different challenges related to filtering raw AMS data in order to identify and remove values due to sensor breakdowns, sensor calibration periods, transient values due to operational considerations, etc., as well as to homogenize time series for different variables. The statistical challenges involve the removal of faulty values and outliers (due to systematic problems) and transient effects, gap-filling (by means of interpolation methods), and homogenization (setting a common time reference and time step) of the respective time series. The objective is to derive representative and synchronous time series values that can subsequently be used to estimate summary statistics of AMS and to infer correlations or nonlinear dependence between different data streams. Identification and modeling of statistical dependencies can be further exploited to develop predictive equations based on time series models.","PeriodicalId":18729,"journal":{"name":"Materials Proceedings","volume":"195 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Materials Proceedings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/materproc2021005078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Underground coal mining Atmospheric Monitoring Systems (AMS) have been implemented for real-time or near real-time monitoring and evaluation of the mine atmosphere and related parameters such as gas concentration (e.g., CH4, CO, O2), fan performance (e.g., power, speed), barometric pressure, ambient temperature, humidity, etc. Depending on the sampling frequency, AMS can collect and manage a tremendous amount of data, which mine operators typically consult for everyday operations as well as long-term planning and more effective management of ventilation systems. The raw data collected by AMS need considerable pre-processing and filtering before they can be used for analysis. This paper discusses different challenges related to filtering raw AMS data in order to identify and remove values due to sensor breakdowns, sensor calibration periods, transient values due to operational considerations, etc., as well as to homogenize time series for different variables. The statistical challenges involve the removal of faulty values and outliers (due to systematic problems) and transient effects, gap-filling (by means of interpolation methods), and homogenization (setting a common time reference and time step) of the respective time series. The objective is to derive representative and synchronous time series values that can subsequently be used to estimate summary statistics of AMS and to infer correlations or nonlinear dependence between different data streams. Identification and modeling of statistical dependencies can be further exploited to develop predictive equations based on time series models.