Ontology-based text convolution neural network (TextCNN) for prediction of construction accidents

IF 2.5 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Knowledge and Information Systems Pub Date : 2024-01-11 DOI:10.1007/s10115-023-02036-9

Donghui Shi, Zhigang Li, Jozef Zurada, Andrew Manikas, Jian Guan, Pawel Weichbroth

{"title":"Ontology-based text convolution neural network (TextCNN) for prediction of construction accidents","authors":"Donghui Shi, Zhigang Li, Jozef Zurada, Andrew Manikas, Jian Guan, Pawel Weichbroth","doi":"10.1007/s10115-023-02036-9","DOIUrl":null,"url":null,"abstract":"<p>The construction industry suffers from workplace accidents, including injuries and fatalities, which represent a significant economic and social burden for employers, workers, and society as a whole. The existing research on construction accidents heavily relies on expert evaluations, which often suffer from issues such as low efficiency, insufficient intelligence, and subjectivity. However, expert opinions provided in construction accident reports offer a valuable source of knowledge that can be extracted and utilized to enhance safety management. Today this valuable resource can be mined as the advent of artificial intelligence has opened up significant opportunities to advance construction site safety. Ontology represents an attractive representation scheme. Though ontology has been used in construction safety to solve the problem of information heterogeneity using formal conceptual specifications, the establishment and development of ontologies that utilize construction accident reports are currently in an early stage of development and require further improvements. Moreover, research on the exploration of incorporating deep learning methodologies into construction safety ontologies for predicting construction safety incidents is relatively limited. This paper describes a novel approach to improving the performance of accident prediction models by incorporating ontology into a deep learning model. First, a domain word discovery algorithm, based on mutual information and adjacency entropy, is used to analyze the causes of accidents mentioned in construction reports. This analysis is then combined with technical specifications and the literature in the field of construction safety to build an ontology encompassing unsafe factors related to construction accidents. By employing a Translating on Hyperplane (TransH) model, the reports are transformed into conceptual vectors using the constructed ontology. Building on this foundation, we propose a Text Convolutional Neural Network (TextCNN) model that incorporates the ontology specifically designed for construction accidents. We compared the performance of the TextCNN model against five traditional machine learning models, namely Naive Bayes, support vector machine, logistic regression, random forest, and multilayer perceptron, using three different data sets: One-Hot encoding, word vector, and conceptual vectors. The results indicate that the TextCNN model integrated with the ontology outperformed the other models in terms of performance achieving an impressive accuracy rate of 88% and AUC value of 0.92.</p>","PeriodicalId":54749,"journal":{"name":"Knowledge and Information Systems","volume":"209 1","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge and Information Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10115-023-02036-9","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

The construction industry suffers from workplace accidents, including injuries and fatalities, which represent a significant economic and social burden for employers, workers, and society as a whole. The existing research on construction accidents heavily relies on expert evaluations, which often suffer from issues such as low efficiency, insufficient intelligence, and subjectivity. However, expert opinions provided in construction accident reports offer a valuable source of knowledge that can be extracted and utilized to enhance safety management. Today this valuable resource can be mined as the advent of artificial intelligence has opened up significant opportunities to advance construction site safety. Ontology represents an attractive representation scheme. Though ontology has been used in construction safety to solve the problem of information heterogeneity using formal conceptual specifications, the establishment and development of ontologies that utilize construction accident reports are currently in an early stage of development and require further improvements. Moreover, research on the exploration of incorporating deep learning methodologies into construction safety ontologies for predicting construction safety incidents is relatively limited. This paper describes a novel approach to improving the performance of accident prediction models by incorporating ontology into a deep learning model. First, a domain word discovery algorithm, based on mutual information and adjacency entropy, is used to analyze the causes of accidents mentioned in construction reports. This analysis is then combined with technical specifications and the literature in the field of construction safety to build an ontology encompassing unsafe factors related to construction accidents. By employing a Translating on Hyperplane (TransH) model, the reports are transformed into conceptual vectors using the constructed ontology. Building on this foundation, we propose a Text Convolutional Neural Network (TextCNN) model that incorporates the ontology specifically designed for construction accidents. We compared the performance of the TextCNN model against five traditional machine learning models, namely Naive Bayes, support vector machine, logistic regression, random forest, and multilayer perceptron, using three different data sets: One-Hot encoding, word vector, and conceptual vectors. The results indicate that the TextCNN model integrated with the ontology outperformed the other models in terms of performance achieving an impressive accuracy rate of 88% and AUC value of 0.92.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于本体的文本卷积神经网络（TextCNN）用于建筑事故预测

建筑行业工伤事故频发，包括伤亡事故，给雇主、工人和整个社会带来了沉重的经济和社会负担。现有的建筑事故研究严重依赖专家评估，而专家评估往往存在效率低、智力不足和主观性等问题。然而，建筑事故报告中提供的专家意见提供了宝贵的知识来源，可以提取和利用这些知识来加强安全管理。如今，人工智能的出现为促进建筑工地安全带来了重大机遇，因此可以挖掘这一宝贵资源。本体论是一种极具吸引力的表示方案。虽然本体论已被用于建筑安全领域，利用正式的概念规范解决信息异构问题，但利用建筑事故报告的本体论的建立和发展目前还处于早期发展阶段，需要进一步改进。此外，探索将深度学习方法融入建筑安全本体以预测建筑安全事故的研究也相对有限。本文介绍了一种通过将本体融入深度学习模型来提高事故预测模型性能的新方法。首先，使用基于互信息和邻接熵的领域词发现算法来分析施工报告中提到的事故原因。然后，将分析结果与建筑安全领域的技术规范和文献结合起来，建立一个包含与建筑事故相关的不安全因素的本体。通过采用超平面转换（TransH）模型，利用所构建的本体将报告转换为概念向量。在此基础上，我们提出了一个文本卷积神经网络（TextCNN）模型，其中包含了专门为建筑事故设计的本体。我们使用三个不同的数据集，比较了 TextCNN 模型与五种传统机器学习模型（即 Naive Bayes、支持向量机、逻辑回归、随机森林和多层感知器）的性能：单热编码、词向量和概念向量。结果表明，集成了本体的 TextCNN 模型在性能上优于其他模型，准确率达到 88%，AUC 值达到 0.92。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Knowledge and Information Systems 工程技术-计算机：人工智能

CiteScore

5.70

自引率

7.40%

发文量

152

审稿时长

7.2 months

期刊介绍： Knowledge and Information Systems (KAIS) provides an international forum for researchers and professionals to share their knowledge and report new advances on all topics related to knowledge systems and advanced information systems. This monthly peer-reviewed archival journal publishes state-of-the-art research reports on emerging topics in KAIS, reviews of important techniques in related areas, and application papers of interest to a general readership.