Discovering the underground coal mining accident patterns in Spain from 2003 to 2021: Insights through machine learning techniques

IF 4.7 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Safety Science Pub Date : 2024-09-29 DOI:10.1016/j.ssci.2024.106677
Yang Li , Lluis Sanmiquel , Zhengxin Zhang , Guoyan Zhao , Marc Bascompta
{"title":"Discovering the underground coal mining accident patterns in Spain from 2003 to 2021: Insights through machine learning techniques","authors":"Yang Li ,&nbsp;Lluis Sanmiquel ,&nbsp;Zhengxin Zhang ,&nbsp;Guoyan Zhao ,&nbsp;Marc Bascompta","doi":"10.1016/j.ssci.2024.106677","DOIUrl":null,"url":null,"abstract":"<div><div>The safety of underground coal mining has always been a global concern, involving the stable supply of energy and stakes in miners’ lives. Lessons learned from historical accidents and transforming into practical experience help reduce the quantity and severity of accidents. In this study, six ensemble learning techniques, including AdaBoost, Extra Trees, GBDT, LightGBM, Random Forest, and XGBoost, were used to investigate the correlation between accident-causing factors and severity. Firstly, 39<!--> <!-->487 underground coal mine accidents data was obtained from Spain, variables were categorized and coded. To address the extreme class imbalance, a new dataset (2468 cases) was obtained by data sampling from the original database. Subsequently, the new dataset was randomly divided into training sets (75% of the data) and test sets (25% of the data), then the hyperparameters of each model were optimized and configured. Thirdly, the models’ performance was evaluated on the test data by five metrics (accuracy, Cohen’s Kappa, precision, recall, and <span><math><msub><mrow><mi>F</mi></mrow><mrow><mn>1</mn></mrow></msub></math></span>). Finally, accident patterns were derived from the identified variables along with preventive strategies. Results show that tree-based ensemble learning model performs better compared to the boosting model, and the relative importance of seven variables were determined, where previous cause (PC) and material agent (MA) are the most important factors, followed by the miner’s physical activity (PA), age (A), and experience (E), scale (S) and preventive organization (PO) are in the third tier. Furthermore, the type of accident and injury caused by PC were confirmed. Working with hand tools, younger age, lack of experience, small-scale coal mines, and unfit preventive organization increased the risk of accidents. This study not only facilitates the prediction of accident severity but also provides strategies for preventing and mitigating accidents.</div></div>","PeriodicalId":21375,"journal":{"name":"Safety Science","volume":"181 ","pages":"Article 106677"},"PeriodicalIF":4.7000,"publicationDate":"2024-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Safety Science","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925753524002674","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0

Abstract

The safety of underground coal mining has always been a global concern, involving the stable supply of energy and stakes in miners’ lives. Lessons learned from historical accidents and transforming into practical experience help reduce the quantity and severity of accidents. In this study, six ensemble learning techniques, including AdaBoost, Extra Trees, GBDT, LightGBM, Random Forest, and XGBoost, were used to investigate the correlation between accident-causing factors and severity. Firstly, 39 487 underground coal mine accidents data was obtained from Spain, variables were categorized and coded. To address the extreme class imbalance, a new dataset (2468 cases) was obtained by data sampling from the original database. Subsequently, the new dataset was randomly divided into training sets (75% of the data) and test sets (25% of the data), then the hyperparameters of each model were optimized and configured. Thirdly, the models’ performance was evaluated on the test data by five metrics (accuracy, Cohen’s Kappa, precision, recall, and F1). Finally, accident patterns were derived from the identified variables along with preventive strategies. Results show that tree-based ensemble learning model performs better compared to the boosting model, and the relative importance of seven variables were determined, where previous cause (PC) and material agent (MA) are the most important factors, followed by the miner’s physical activity (PA), age (A), and experience (E), scale (S) and preventive organization (PO) are in the third tier. Furthermore, the type of accident and injury caused by PC were confirmed. Working with hand tools, younger age, lack of experience, small-scale coal mines, and unfit preventive organization increased the risk of accidents. This study not only facilitates the prediction of accident severity but also provides strategies for preventing and mitigating accidents.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
发现 2003 至 2021 年西班牙地下煤矿事故模式:通过机器学习技术获得的启示
煤矿井下开采的安全问题一直是全球关注的焦点,它关系到能源的稳定供应和矿工的生命安全。从历史事故中吸取教训并转化为实践经验,有助于减少事故的数量和严重程度。本研究采用了六种集合学习技术,包括 AdaBoost、Extra Trees、GBDT、LightGBM、Random Forest 和 XGBoost,来研究事故致因与严重程度之间的相关性。首先,从西班牙获得了 39 487 个煤矿井下事故数据,对变量进行了分类和编码。为了解决类别极度不平衡的问题,我们从原始数据库中抽取数据,得到了一个新的数据集(2468 个案例)。随后,将新数据集随机分为训练集(占数据的 75%)和测试集(占数据的 25%),然后对每个模型的超参数进行优化和配置。第三,通过五个指标(准确率、科恩卡帕、精确度、召回率和 F1)评估模型在测试数据上的性能。最后,从识别出的变量中得出事故模式以及预防策略。结果表明,与提升模型相比,基于树的集合学习模型表现更好,并且确定了七个变量的相对重要性,其中先前原因(PC)和材料代理(MA)是最重要的因素,其次是矿工的体力活动(PA)、年龄(A)和经验(E),规模(S)和预防组织(PO)排在第三位。此外,PC 导致的事故类型和伤害也得到了证实。使用手工工具工作、年龄较小、缺乏经验、煤矿规模较小以及预防组织不合理都增加了事故风险。这项研究不仅有助于预测事故的严重程度,还为预防和减轻事故提供了策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Safety Science
Safety Science 管理科学-工程:工业
CiteScore
13.00
自引率
9.80%
发文量
335
审稿时长
53 days
期刊介绍: Safety Science is multidisciplinary. Its contributors and its audience range from social scientists to engineers. The journal covers the physics and engineering of safety; its social, policy and organizational aspects; the assessment, management and communication of risks; the effectiveness of control and management techniques for safety; standardization, legislation, inspection, insurance, costing aspects, human behavior and safety and the like. Papers addressing the interfaces between technology, people and organizations are especially welcome.
期刊最新文献
Editorial Board A systematic review of hazard recognition and risk perception research in the construction industry Investigating the predictive validity of subjective and objective measures for general aviation pilots’ risk propensity Risk perception and safety behaviors in high-risk workers: A systematic literature review Exploring the role of safety culture dimensions in patient safety using a Bayesian Belief Network model
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1