Optimizing Process Discovery Quality Criteria and Model Measurements using Receiver Operating Characteristic Analysis and Infrequent Inductive Miner

R. Budiraharjo, H. Prasetyo, R. Sarno, K. R. Sungkono
{"title":"Optimizing Process Discovery Quality Criteria and Model Measurements using Receiver Operating Characteristic Analysis and Infrequent Inductive Miner","authors":"R. Budiraharjo, H. Prasetyo, R. Sarno, K. R. Sungkono","doi":"10.1109/APWiMob51111.2021.9435217","DOIUrl":null,"url":null,"abstract":"Generating process models that reflect close behavioral resemblance to the actual process Standard Operating Procedure (SOP) in process mining can be challenging without taking the four quality criteria of process discovery into account. The four quality criteria, i.e. fitness, precision, generalization, and simplicity, should be well balanced in order to produce proper process models which are aligned to the real-life executions. This paper proposes a method to optimize process discovery quality criteria (PDQC) by implementing different thresholds and analyzing calculation results using Receiver Operating Characteristic (ROC) curve and Infrequent Inductive Miner algorithm. This paper sets up two experiments with different scenarios to measure the calculations of quality criteria and the quality of generated models. The experiments compare two SOPs to the process models discovered by Infrequent Inductive Miner algorithm; hence the SOPs serve as references to determine the generated models quality. The purpose of applying two different scenarios in the experiments is to discover how well the Infrequent Inductive Miner thresholds can produce predictive models under these two different scenarios circumstances. This paper has been successful in predicting the best-fit model in reference to the SOPs by optimizing the four quality criteria of process discovery using ROC thresholds settings and by using infrequent inductive miner algorithm for models generation, and also in improving the accuracy of models measurements. The accuracy rate of the prediction model from Experiment 1 is 83%, while Experiment 2 yields an accuracy rate of 88%. The most optimal threshold settings to generate the best model in this paper are threshold 0.4 in Experiment 1 and threshold 0.5 in Experiment 2.","PeriodicalId":325270,"journal":{"name":"2021 IEEE Asia Pacific Conference on Wireless and Mobile (APWiMob)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE Asia Pacific Conference on Wireless and Mobile (APWiMob)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APWiMob51111.2021.9435217","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Generating process models that reflect close behavioral resemblance to the actual process Standard Operating Procedure (SOP) in process mining can be challenging without taking the four quality criteria of process discovery into account. The four quality criteria, i.e. fitness, precision, generalization, and simplicity, should be well balanced in order to produce proper process models which are aligned to the real-life executions. This paper proposes a method to optimize process discovery quality criteria (PDQC) by implementing different thresholds and analyzing calculation results using Receiver Operating Characteristic (ROC) curve and Infrequent Inductive Miner algorithm. This paper sets up two experiments with different scenarios to measure the calculations of quality criteria and the quality of generated models. The experiments compare two SOPs to the process models discovered by Infrequent Inductive Miner algorithm; hence the SOPs serve as references to determine the generated models quality. The purpose of applying two different scenarios in the experiments is to discover how well the Infrequent Inductive Miner thresholds can produce predictive models under these two different scenarios circumstances. This paper has been successful in predicting the best-fit model in reference to the SOPs by optimizing the four quality criteria of process discovery using ROC thresholds settings and by using infrequent inductive miner algorithm for models generation, and also in improving the accuracy of models measurements. The accuracy rate of the prediction model from Experiment 1 is 83%, while Experiment 2 yields an accuracy rate of 88%. The most optimal threshold settings to generate the best model in this paper are threshold 0.4 in Experiment 1 and threshold 0.5 in Experiment 2.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用接收机工作特性分析和非频繁感应挖掘优化过程发现质量标准和模型测量
如果不考虑过程发现的四个质量标准,在过程挖掘中生成反映与实际过程标准操作程序(SOP)行为相似的过程模型可能是具有挑战性的。四个质量标准,即适应性、精确性、泛化和简单性,应该得到很好的平衡,以便生成与实际执行相一致的适当流程模型。本文提出了一种利用Receiver Operating Characteristic (ROC)曲线和infrequency inductminer算法实现不同阈值并分析计算结果的工艺发现质量准则(PDQC)优化方法。本文设置了两个不同场景的实验来衡量质量标准的计算和生成的模型的质量。实验比较了两种标准操作程序与非频繁感应Miner算法发现的过程模型;因此sop可以作为确定生成模型质量的参考。在实验中应用两种不同场景的目的是发现在这两种不同的场景情况下,不频繁感应挖掘器阈值可以产生预测模型的效果。本文通过使用ROC阈值设置优化过程发现的四个质量标准,并通过使用不频繁的归纳矿工算法进行模型生成,以及提高模型测量的准确性,成功地预测了参考sop的最佳拟合模型。实验1的预测模型准确率为83%,而实验2的准确率为88%。本文生成最佳模型的最优阈值设置为实验1中的阈值0.4和实验2中的阈值0.5。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Effect of Duration Heteroscedasticity to the Bottleneck in Business Process Discovered by Inductive Miner Algorithm Performance Evaluation of XGS-PON Optical Network Termination for Mobile Backhaul IoT Resiliency through Edge-located Container-based Virtualization and SDN Detection of the Key Actor of Issues Spreading Based on Social Network Analysis in Twitter Social Media Unsupervised Method for 3D Brain Magnetic Resonance Image Segmentation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1