Yang Li , Lluis Sanmiquel , Zhengxin Zhang , Guoyan Zhao , Marc Bascompta
{"title":"Discovering the underground coal mining accident patterns in Spain from 2003 to 2021: Insights through machine learning techniques","authors":"Yang Li , Lluis Sanmiquel , Zhengxin Zhang , Guoyan Zhao , Marc Bascompta","doi":"10.1016/j.ssci.2024.106677","DOIUrl":null,"url":null,"abstract":"<div><div>The safety of underground coal mining has always been a global concern, involving the stable supply of energy and stakes in miners’ lives. Lessons learned from historical accidents and transforming into practical experience help reduce the quantity and severity of accidents. In this study, six ensemble learning techniques, including AdaBoost, Extra Trees, GBDT, LightGBM, Random Forest, and XGBoost, were used to investigate the correlation between accident-causing factors and severity. Firstly, 39<!--> <!-->487 underground coal mine accidents data was obtained from Spain, variables were categorized and coded. To address the extreme class imbalance, a new dataset (2468 cases) was obtained by data sampling from the original database. Subsequently, the new dataset was randomly divided into training sets (75% of the data) and test sets (25% of the data), then the hyperparameters of each model were optimized and configured. Thirdly, the models’ performance was evaluated on the test data by five metrics (accuracy, Cohen’s Kappa, precision, recall, and <span><math><msub><mrow><mi>F</mi></mrow><mrow><mn>1</mn></mrow></msub></math></span>). Finally, accident patterns were derived from the identified variables along with preventive strategies. Results show that tree-based ensemble learning model performs better compared to the boosting model, and the relative importance of seven variables were determined, where previous cause (PC) and material agent (MA) are the most important factors, followed by the miner’s physical activity (PA), age (A), and experience (E), scale (S) and preventive organization (PO) are in the third tier. Furthermore, the type of accident and injury caused by PC were confirmed. Working with hand tools, younger age, lack of experience, small-scale coal mines, and unfit preventive organization increased the risk of accidents. This study not only facilitates the prediction of accident severity but also provides strategies for preventing and mitigating accidents.</div></div>","PeriodicalId":21375,"journal":{"name":"Safety Science","volume":"181 ","pages":"Article 106677"},"PeriodicalIF":4.7000,"publicationDate":"2024-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Safety Science","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925753524002674","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
The safety of underground coal mining has always been a global concern, involving the stable supply of energy and stakes in miners’ lives. Lessons learned from historical accidents and transforming into practical experience help reduce the quantity and severity of accidents. In this study, six ensemble learning techniques, including AdaBoost, Extra Trees, GBDT, LightGBM, Random Forest, and XGBoost, were used to investigate the correlation between accident-causing factors and severity. Firstly, 39 487 underground coal mine accidents data was obtained from Spain, variables were categorized and coded. To address the extreme class imbalance, a new dataset (2468 cases) was obtained by data sampling from the original database. Subsequently, the new dataset was randomly divided into training sets (75% of the data) and test sets (25% of the data), then the hyperparameters of each model were optimized and configured. Thirdly, the models’ performance was evaluated on the test data by five metrics (accuracy, Cohen’s Kappa, precision, recall, and ). Finally, accident patterns were derived from the identified variables along with preventive strategies. Results show that tree-based ensemble learning model performs better compared to the boosting model, and the relative importance of seven variables were determined, where previous cause (PC) and material agent (MA) are the most important factors, followed by the miner’s physical activity (PA), age (A), and experience (E), scale (S) and preventive organization (PO) are in the third tier. Furthermore, the type of accident and injury caused by PC were confirmed. Working with hand tools, younger age, lack of experience, small-scale coal mines, and unfit preventive organization increased the risk of accidents. This study not only facilitates the prediction of accident severity but also provides strategies for preventing and mitigating accidents.
期刊介绍:
Safety Science is multidisciplinary. Its contributors and its audience range from social scientists to engineers. The journal covers the physics and engineering of safety; its social, policy and organizational aspects; the assessment, management and communication of risks; the effectiveness of control and management techniques for safety; standardization, legislation, inspection, insurance, costing aspects, human behavior and safety and the like. Papers addressing the interfaces between technology, people and organizations are especially welcome.