Juan G. Colonna , Ahmed A. Fares , Márcio Duarte , Ricardo Sousa
{"title":"Process mining embeddings: Learning vector representations for Petri nets","authors":"Juan G. Colonna , Ahmed A. Fares , Márcio Duarte , Ricardo Sousa","doi":"10.1016/j.iswa.2024.200423","DOIUrl":null,"url":null,"abstract":"<div><p>Process Mining offers a powerful framework for uncovering, analyzing, and optimizing real-world business processes. Petri nets provide a versatile means of modeling process behavior. However, traditional methods often struggle to effectively compare complex Petri nets, hindering their potential for process enhancement. To address this challenge, we introduce PetriNet2Vec, an unsupervised methodology inspired by Doc2Vec. This approach converts Petri nets into embedding vectors, facilitating the comparison, clustering, and classification of process models. We validated our approach using the PDC Dataset, comprising 96 diverse Petri net models. The results demonstrate that PetriNet2Vec effectively captures the structural properties of process models, enabling accurate process classification and efficient process retrieval. Specifically, our findings highlight the utility of the learned embeddings in two key downstream tasks: process classification and process retrieval. In process classification, the embeddings allowed for accurate categorization of process models based on their structural properties. In process retrieval, the embeddings enabled efficient retrieval of similar process models using cosine distance. These results demonstrate the potential of PetriNet2Vec to significantly enhance process mining capabilities.</p></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"23 ","pages":"Article 200423"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667305324000978/pdfft?md5=c5510ffdcb881b0b3214984ffc9b29e1&pid=1-s2.0-S2667305324000978-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Systems with Applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667305324000978","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Process Mining offers a powerful framework for uncovering, analyzing, and optimizing real-world business processes. Petri nets provide a versatile means of modeling process behavior. However, traditional methods often struggle to effectively compare complex Petri nets, hindering their potential for process enhancement. To address this challenge, we introduce PetriNet2Vec, an unsupervised methodology inspired by Doc2Vec. This approach converts Petri nets into embedding vectors, facilitating the comparison, clustering, and classification of process models. We validated our approach using the PDC Dataset, comprising 96 diverse Petri net models. The results demonstrate that PetriNet2Vec effectively captures the structural properties of process models, enabling accurate process classification and efficient process retrieval. Specifically, our findings highlight the utility of the learned embeddings in two key downstream tasks: process classification and process retrieval. In process classification, the embeddings allowed for accurate categorization of process models based on their structural properties. In process retrieval, the embeddings enabled efficient retrieval of similar process models using cosine distance. These results demonstrate the potential of PetriNet2Vec to significantly enhance process mining capabilities.
流程挖掘为发现、分析和优化现实世界的业务流程提供了一个强大的框架。Petri 网为流程行为建模提供了一种多功能手段。然而,传统方法往往难以对复杂的 Petri 网进行有效比较,从而阻碍了 Petri 网在流程改进方面的潜力。为了应对这一挑战,我们引入了 PetriNet2Vec,这是一种受 Doc2Vec 启发的无监督方法。这种方法将 Petri 网转换为嵌入向量,便于流程模型的比较、聚类和分类。我们使用由 96 个不同 Petri 网模型组成的 PDC 数据集验证了我们的方法。结果表明,PetriNet2Vec 能有效捕捉流程模型的结构特性,从而实现准确的流程分类和高效的流程检索。具体来说,我们的研究结果强调了所学嵌入在流程分类和流程检索这两个关键下游任务中的实用性。在流程分类中,嵌入可以根据流程模型的结构特性对其进行准确分类。在流程检索中,嵌入可以利用余弦距离高效检索类似的流程模型。这些结果证明了 PetriNet2Vec 在显著提高流程挖掘能力方面的潜力。