{"title":"为您的流程挖掘分析获取数据:对预分析阶段的深入分析","authors":"Shameer K. Pradhan, Mieke Jans, Niels Martin","doi":"10.1145/3712587","DOIUrl":null,"url":null,"abstract":"Process mining enables organizations to analyze the data stored in their information systems and derive insights regarding their business processes. However, raw data needs to be converted into a format that can be fed into process mining algorithms. Various pre-analysis activities can be performed on the raw data, such as imperfection removal or granularity level change. Although pre-analysis activities play a crucial role in process mining, there is currently a limited overview available regarding their scope and the extent of their examination. This study presents a systematic literature review of the pre-analysis activities in process mining projects. To better understand this stage and its current state of research, we explore which activities constitute the pre-analysis stage, their goals, the applied research methodologies, the proposed research outcomes, and the data used to evaluate the research outcomes. We identify 15 pre-analysis activities and concepts, e.g., data extraction, generation, and cleaning. We also discover that design science research is the methodology and methods that are the primary research outcome in previous studies. We also realize that the proposed outcomes have been evaluated using only real-life data most of the time. This study reveals that research on pre-analysis is a growing field of interest in process mining.","PeriodicalId":50926,"journal":{"name":"ACM Computing Surveys","volume":"15 1","pages":""},"PeriodicalIF":23.8000,"publicationDate":"2025-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Getting the Data in Shape for Your Process Mining Analysis: An In-Depth Analysis of the Pre-Analysis Stage\",\"authors\":\"Shameer K. Pradhan, Mieke Jans, Niels Martin\",\"doi\":\"10.1145/3712587\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Process mining enables organizations to analyze the data stored in their information systems and derive insights regarding their business processes. However, raw data needs to be converted into a format that can be fed into process mining algorithms. Various pre-analysis activities can be performed on the raw data, such as imperfection removal or granularity level change. Although pre-analysis activities play a crucial role in process mining, there is currently a limited overview available regarding their scope and the extent of their examination. This study presents a systematic literature review of the pre-analysis activities in process mining projects. To better understand this stage and its current state of research, we explore which activities constitute the pre-analysis stage, their goals, the applied research methodologies, the proposed research outcomes, and the data used to evaluate the research outcomes. We identify 15 pre-analysis activities and concepts, e.g., data extraction, generation, and cleaning. We also discover that design science research is the methodology and methods that are the primary research outcome in previous studies. We also realize that the proposed outcomes have been evaluated using only real-life data most of the time. This study reveals that research on pre-analysis is a growing field of interest in process mining.\",\"PeriodicalId\":50926,\"journal\":{\"name\":\"ACM Computing Surveys\",\"volume\":\"15 1\",\"pages\":\"\"},\"PeriodicalIF\":23.8000,\"publicationDate\":\"2025-01-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Computing Surveys\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1145/3712587\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Computing Surveys","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3712587","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
Getting the Data in Shape for Your Process Mining Analysis: An In-Depth Analysis of the Pre-Analysis Stage
Process mining enables organizations to analyze the data stored in their information systems and derive insights regarding their business processes. However, raw data needs to be converted into a format that can be fed into process mining algorithms. Various pre-analysis activities can be performed on the raw data, such as imperfection removal or granularity level change. Although pre-analysis activities play a crucial role in process mining, there is currently a limited overview available regarding their scope and the extent of their examination. This study presents a systematic literature review of the pre-analysis activities in process mining projects. To better understand this stage and its current state of research, we explore which activities constitute the pre-analysis stage, their goals, the applied research methodologies, the proposed research outcomes, and the data used to evaluate the research outcomes. We identify 15 pre-analysis activities and concepts, e.g., data extraction, generation, and cleaning. We also discover that design science research is the methodology and methods that are the primary research outcome in previous studies. We also realize that the proposed outcomes have been evaluated using only real-life data most of the time. This study reveals that research on pre-analysis is a growing field of interest in process mining.
期刊介绍:
ACM Computing Surveys is an academic journal that focuses on publishing surveys and tutorials on various areas of computing research and practice. The journal aims to provide comprehensive and easily understandable articles that guide readers through the literature and help them understand topics outside their specialties. In terms of impact, CSUR has a high reputation with a 2022 Impact Factor of 16.6. It is ranked 3rd out of 111 journals in the field of Computer Science Theory & Methods.
ACM Computing Surveys is indexed and abstracted in various services, including AI2 Semantic Scholar, Baidu, Clarivate/ISI: JCR, CNKI, DeepDyve, DTU, EBSCO: EDS/HOST, and IET Inspec, among others.