Harnessing coloured Petri nets to enhance machine learning:A simulation-based method for healthcare and beyond

IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Simulation Modelling Practice and Theory Pub Date : 2025-02-07 DOI:10.1016/j.simpat.2025.103080
Andressa C.M. da Silveira , Álvaro Sobrinho , Leandro Dias da Silva , Danilo F.S. Santos , Muhammad Nauman , Angelo Perkusich
{"title":"Harnessing coloured Petri nets to enhance machine learning:A simulation-based method for healthcare and beyond","authors":"Andressa C.M. da Silveira ,&nbsp;Álvaro Sobrinho ,&nbsp;Leandro Dias da Silva ,&nbsp;Danilo F.S. Santos ,&nbsp;Muhammad Nauman ,&nbsp;Angelo Perkusich","doi":"10.1016/j.simpat.2025.103080","DOIUrl":null,"url":null,"abstract":"<div><div>Many industries use Machine Learning (ML) techniques to enhance systems’ performance. However, integrating ML into these systems poses challenges, often requiring improved explainability and accuracy. Using formal methods is a potential solution to address these challenges. This paper presents a simulation-based method using Coloured Petri Nets (CPN) to enhance the explainability and accuracy of Decision Tree (DT) and Random Forest (RF) models, which industries such as healthcare widely adopt. Our simulation-based method, named RuleXtract/CPN, provides procedures for the automatic extraction of decision rules from an implemented ML model, the generation of these decision rules into a CPN model, the analysis of the CPN model through simulations, and the adjustment of the CPN model to improve explainability and accuracy. Automating the transformation from DT/RF to a CPN model and the analysis procedures can reduce the time and effort needed for modeling tasks. We used web technologies and the Access/CPN framework to implement the procedures defined in our simulation-based method so that users would not need CPN expertise to generate and simulate models, running them in the background. An experiment with three datasets for COVID-19 and five for Influenza screening shows that applying our simulation-based method results in more explainable models. The experiment also shows improvement in accuracy measures for RF models. For instance, the accuracy of the RF model using the Influenza rapid test balanced dataset increased from 84.02% to 86.34%, and the unbalanced dataset from 84.78% to 87.53%. Our results underscore the importance of eliminating duplicated, poorly generalized, and incorrect rules to improve explainability and accuracy. These findings also emphasize the effectiveness of using CPN to improve the models, paving the way for future research.</div></div>","PeriodicalId":49518,"journal":{"name":"Simulation Modelling Practice and Theory","volume":"140 ","pages":"Article 103080"},"PeriodicalIF":3.5000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Simulation Modelling Practice and Theory","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569190X25000152","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Many industries use Machine Learning (ML) techniques to enhance systems’ performance. However, integrating ML into these systems poses challenges, often requiring improved explainability and accuracy. Using formal methods is a potential solution to address these challenges. This paper presents a simulation-based method using Coloured Petri Nets (CPN) to enhance the explainability and accuracy of Decision Tree (DT) and Random Forest (RF) models, which industries such as healthcare widely adopt. Our simulation-based method, named RuleXtract/CPN, provides procedures for the automatic extraction of decision rules from an implemented ML model, the generation of these decision rules into a CPN model, the analysis of the CPN model through simulations, and the adjustment of the CPN model to improve explainability and accuracy. Automating the transformation from DT/RF to a CPN model and the analysis procedures can reduce the time and effort needed for modeling tasks. We used web technologies and the Access/CPN framework to implement the procedures defined in our simulation-based method so that users would not need CPN expertise to generate and simulate models, running them in the background. An experiment with three datasets for COVID-19 and five for Influenza screening shows that applying our simulation-based method results in more explainable models. The experiment also shows improvement in accuracy measures for RF models. For instance, the accuracy of the RF model using the Influenza rapid test balanced dataset increased from 84.02% to 86.34%, and the unbalanced dataset from 84.78% to 87.53%. Our results underscore the importance of eliminating duplicated, poorly generalized, and incorrect rules to improve explainability and accuracy. These findings also emphasize the effectiveness of using CPN to improve the models, paving the way for future research.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Simulation Modelling Practice and Theory
Simulation Modelling Practice and Theory 工程技术-计算机:跨学科应用
CiteScore
9.80
自引率
4.80%
发文量
142
审稿时长
21 days
期刊介绍: The journal Simulation Modelling Practice and Theory provides a forum for original, high-quality papers dealing with any aspect of systems simulation and modelling. The journal aims at being a reference and a powerful tool to all those professionally active and/or interested in the methods and applications of simulation. Submitted papers will be peer reviewed and must significantly contribute to modelling and simulation in general or use modelling and simulation in application areas. Paper submission is solicited on: • theoretical aspects of modelling and simulation including formal modelling, model-checking, random number generators, sensitivity analysis, variance reduction techniques, experimental design, meta-modelling, methods and algorithms for validation and verification, selection and comparison procedures etc.; • methodology and application of modelling and simulation in any area, including computer systems, networks, real-time and embedded systems, mobile and intelligent agents, manufacturing and transportation systems, management, engineering, biomedical engineering, economics, ecology and environment, education, transaction handling, etc.; • simulation languages and environments including those, specific to distributed computing, grid computing, high performance computers or computer networks, etc.; • distributed and real-time simulation, simulation interoperability; • tools for high performance computing simulation, including dedicated architectures and parallel computing.
期刊最新文献
Harnessing coloured Petri nets to enhance machine learning:A simulation-based method for healthcare and beyond Editorial Board Analysis of oil capture characteristics and global sensitivity of radial oil scoops with different blade radius differences for high-speed bearings The Bevel Local Slope Approach: A method for mesh stiffness estimation in spur, helical and spiral bevel gears Application of the multi-grid modelling method to pedestrian social group dynamics through a bottleneck
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1