S. Jablonski, Maximilian Röglinger, Stefan Schönig, K. Wyrtki
{"title":"过程执行轨迹的多视角聚类","authors":"S. Jablonski, Maximilian Röglinger, Stefan Schönig, K. Wyrtki","doi":"10.18417/emisa.14.2","DOIUrl":null,"url":null,"abstract":"Process mining techniques enable extracting process models from process event logs. Problems can arise if process mining is applied to event logs of flexible processes that are extremely heterogeneous. Here, trace clustering can be used to reduce the complexity of logs. Common techniques use isolated criteria such as activity profiles for clustering. Especially in flexible environments, however, additional data attributes stored in event logs are a source of unused knowledge for trace clustering. In this paper, we present a multi-perspective trace clustering approach that improves the homogeneity of trace subsets. Our approach provides an integrated definition of similarity between traces by defining a distance measure that combines information about executed activities, performing resources, and data values. The evaluation with real-life event logs, one from a hospital and one with traffic fine data, shows that the homogeneity of the resulting clusters can be significantly improved compared to existing techniques.","PeriodicalId":186216,"journal":{"name":"Enterp. Model. Inf. Syst. Archit. Int. J. Concept. Model.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":"{\"title\":\"Multi-Perspective Clustering of Process Execution Traces\",\"authors\":\"S. Jablonski, Maximilian Röglinger, Stefan Schönig, K. Wyrtki\",\"doi\":\"10.18417/emisa.14.2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Process mining techniques enable extracting process models from process event logs. Problems can arise if process mining is applied to event logs of flexible processes that are extremely heterogeneous. Here, trace clustering can be used to reduce the complexity of logs. Common techniques use isolated criteria such as activity profiles for clustering. Especially in flexible environments, however, additional data attributes stored in event logs are a source of unused knowledge for trace clustering. In this paper, we present a multi-perspective trace clustering approach that improves the homogeneity of trace subsets. Our approach provides an integrated definition of similarity between traces by defining a distance measure that combines information about executed activities, performing resources, and data values. The evaluation with real-life event logs, one from a hospital and one with traffic fine data, shows that the homogeneity of the resulting clusters can be significantly improved compared to existing techniques.\",\"PeriodicalId\":186216,\"journal\":{\"name\":\"Enterp. Model. Inf. Syst. Archit. Int. J. Concept. Model.\",\"volume\":\"15 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"9\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Enterp. Model. Inf. Syst. Archit. Int. J. Concept. Model.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.18417/emisa.14.2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Enterp. Model. Inf. Syst. Archit. Int. J. Concept. Model.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18417/emisa.14.2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-Perspective Clustering of Process Execution Traces
Process mining techniques enable extracting process models from process event logs. Problems can arise if process mining is applied to event logs of flexible processes that are extremely heterogeneous. Here, trace clustering can be used to reduce the complexity of logs. Common techniques use isolated criteria such as activity profiles for clustering. Especially in flexible environments, however, additional data attributes stored in event logs are a source of unused knowledge for trace clustering. In this paper, we present a multi-perspective trace clustering approach that improves the homogeneity of trace subsets. Our approach provides an integrated definition of similarity between traces by defining a distance measure that combines information about executed activities, performing resources, and data values. The evaluation with real-life event logs, one from a hospital and one with traffic fine data, shows that the homogeneity of the resulting clusters can be significantly improved compared to existing techniques.