A deep learning approach for robust traffic accident information extraction from online chinese news

IF 2.3 4区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC IET Intelligent Transport Systems Pub Date : 2024-02-22 DOI:10.1049/itr2.12493
Yancheng Ling, Zhenliang Ma, Xiaoxian Dong, Xiaoxiong Weng
{"title":"A deep learning approach for robust traffic accident information extraction from online chinese news","authors":"Yancheng Ling,&nbsp;Zhenliang Ma,&nbsp;Xiaoxian Dong,&nbsp;Xiaoxiong Weng","doi":"10.1049/itr2.12493","DOIUrl":null,"url":null,"abstract":"<p>Road traffic accidents are the leading causes of injuries and fatalities. Understanding the traffic accident occurrence pattern and its contributing factors are prerequisites for effective traffic safety management. The paper proposes a deep learning approach for traffic accident recognition and information extraction from online Chinese news to extract and organize traffic accidents automatically. The approach consists of three modules, including automated news collection, news classification, and traffic accident information extraction. The automated news collection module crawls news from online sources, cleans and organizes it into a general news database with different categories of news. The news classification module robustly recognizes the traffic accident news from all types of news by fusing the sentence-wise and context-wise semantic news information. The accident information extraction module extracts the key attributes of traffic accidents (e.g. causes, times, locations) from news text using the SoftLexicon-BiLSTM-CRF method. The proposed approach is validated by comparing it with state-of-the-art text mining methods using Chinese news data crawled online. The results show that the approach can achieve a high information extraction performance in terms of precision, recall, and F1-score. It improves the performance of the best benchmark model (BiLSTM-CRF) by 18.8% in precision and 12.08% in F1-score. In addition, the potential value of the automatically extracted accident data is illustrated from online news in complementing traditional authority accident data to drive more effective traffic safety management in practice.</p>","PeriodicalId":50381,"journal":{"name":"IET Intelligent Transport Systems","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2024-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/itr2.12493","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Intelligent Transport Systems","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/itr2.12493","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Road traffic accidents are the leading causes of injuries and fatalities. Understanding the traffic accident occurrence pattern and its contributing factors are prerequisites for effective traffic safety management. The paper proposes a deep learning approach for traffic accident recognition and information extraction from online Chinese news to extract and organize traffic accidents automatically. The approach consists of three modules, including automated news collection, news classification, and traffic accident information extraction. The automated news collection module crawls news from online sources, cleans and organizes it into a general news database with different categories of news. The news classification module robustly recognizes the traffic accident news from all types of news by fusing the sentence-wise and context-wise semantic news information. The accident information extraction module extracts the key attributes of traffic accidents (e.g. causes, times, locations) from news text using the SoftLexicon-BiLSTM-CRF method. The proposed approach is validated by comparing it with state-of-the-art text mining methods using Chinese news data crawled online. The results show that the approach can achieve a high information extraction performance in terms of precision, recall, and F1-score. It improves the performance of the best benchmark model (BiLSTM-CRF) by 18.8% in precision and 12.08% in F1-score. In addition, the potential value of the automatically extracted accident data is illustrated from online news in complementing traditional authority accident data to drive more effective traffic safety management in practice.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从在线中文新闻中提取稳健交通事故信息的深度学习方法
道路交通事故是造成人员伤亡的主要原因。了解交通事故的发生规律及其诱因是有效进行交通安全管理的前提。本文提出了一种从中文在线新闻中进行交通事故识别和信息提取的深度学习方法,以自动提取和整理交通事故。该方法由三个模块组成,包括新闻自动采集、新闻分类和交通事故信息提取。自动新闻采集模块从网络资源中抓取新闻,并将其清理和整理成一个包含不同类别新闻的通用新闻数据库。新闻分类模块通过融合句子和上下文的语义新闻信息,从各类新闻中稳健地识别出交通事故新闻。事故信息提取模块使用 SoftLexicon-BiLSTM-CRF 方法从新闻文本中提取交通事故的关键属性(如原因、时间、地点)。我们利用在线抓取的中文新闻数据,将所提出的方法与最先进的文本挖掘方法进行了比较,从而验证了所提出的方法。结果表明,该方法在精确度、召回率和 F1 分数方面都能达到较高的信息提取性能。它在精确度和 F1 分数上分别比最佳基准模型(BiLSTM-CRF)提高了 18.8% 和 12.08%。此外,从在线新闻中自动提取的事故数据也说明了其在补充传统权威事故数据方面的潜在价值,从而在实践中推动更有效的交通安全管理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IET Intelligent Transport Systems
IET Intelligent Transport Systems 工程技术-运输科技
CiteScore
6.50
自引率
7.40%
发文量
159
审稿时长
3 months
期刊介绍: IET Intelligent Transport Systems is an interdisciplinary journal devoted to research into the practical applications of ITS and infrastructures. The scope of the journal includes the following: Sustainable traffic solutions Deployments with enabling technologies Pervasive monitoring Applications; demonstrations and evaluation Economic and behavioural analyses of ITS services and scenario Data Integration and analytics Information collection and processing; image processing applications in ITS ITS aspects of electric vehicles Autonomous vehicles; connected vehicle systems; In-vehicle ITS, safety and vulnerable road user aspects Mobility as a service systems Traffic management and control Public transport systems technologies Fleet and public transport logistics Emergency and incident management Demand management and electronic payment systems Traffic related air pollution management Policy and institutional issues Interoperability, standards and architectures Funding scenarios Enforcement Human machine interaction Education, training and outreach Current Special Issue Call for papers: Intelligent Transportation Systems in Smart Cities for Sustainable Environment - https://digital-library.theiet.org/files/IET_ITS_CFP_ITSSCSE.pdf Sustainably Intelligent Mobility (SIM) - https://digital-library.theiet.org/files/IET_ITS_CFP_SIM.pdf Traffic Theory and Modelling in the Era of Artificial Intelligence and Big Data (in collaboration with World Congress for Transport Research, WCTR 2019) - https://digital-library.theiet.org/files/IET_ITS_CFP_WCTR.pdf
期刊最新文献
Exploring changes in residents' daily activity patterns through sequence visualization analysis ADWNet: An improved detector based on YOLOv8 for application in adverse weather for autonomous driving Creep slope estimation for assessing adhesion in the wheel/rail contact Evaluation of large-scale cycling environment by using the trajectory data of dockless shared bicycles: A data-driven approach Driver distraction and fatigue detection in images using ME-YOLOv8 algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1