首页 > 最新文献

Business & Information Systems Engineering最新文献

英文 中文
Predict COVID-19 Spreading With C-SMOTE 用C-SMOTE预测COVID-19的传播
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-07-02 DOI: 10.52825/bis.v1i.45
Alessio Bernardo, Emanuele Della Valle
Data continuously gathered monitoring the spreading of the COVID-19 pandemic form an unbounded flow of data. Accurately forecasting if the infections will increase or decrease has a high impact, but it is challenging because the pandemic spreads and contracts periodically. Technically, the flow of data is said to be imbalanced and subject to concept drifts because signs of decrements are the minority class during the spreading periods, while they become the majority class in the contraction periods and the other way round. In this paper, we propose a case study applying the Continuous Synthetic Minority Oversampling Technique (C-SMOTE), a novel meta-strategy to pipeline with Streaming Machine Learning (SML) classification algorithms, to forecast the COVID-19 pandemic trend. Benchmarking SML pipelinesthat use C-SMOTE against state-of-the-art methods on a COVID-19 dataset, we bring statistical evidence that models learned using C-SMOTE are better.
持续收集的监测COVID-19大流行传播的数据形成了无界的数据流。准确预测感染增加或减少的影响很大,但由于大流行的周期性传播和收缩,这一预测具有挑战性。从技术上说,数据的流动是不平衡的,并受到概念漂移的影响,因为减量的迹象是在扩大期是少数阶层,而在收缩期则是多数阶层,反之亦然。在本文中,我们提出了一个应用连续合成少数过采样技术(C-SMOTE)的案例研究,这是一种基于流机器学习(SML)分类算法的流水线元策略,用于预测COVID-19大流行趋势。通过在COVID-19数据集上对使用C-SMOTE的SML管道与最先进的方法进行基准测试,我们提供了统计证据,表明使用C-SMOTE学习的模型更好。
{"title":"Predict COVID-19 Spreading With C-SMOTE","authors":"Alessio Bernardo, Emanuele Della Valle","doi":"10.52825/bis.v1i.45","DOIUrl":"https://doi.org/10.52825/bis.v1i.45","url":null,"abstract":"Data continuously gathered monitoring the spreading of the COVID-19 pandemic form an unbounded flow of data. Accurately forecasting if the infections will increase or decrease has a high impact, but it is challenging because the pandemic spreads and contracts periodically. Technically, the flow of data is said to be imbalanced and subject to concept drifts because signs of decrements are the minority class during the spreading periods, while they become the majority class in the contraction periods and the other way round. In this paper, we propose a case study applying the Continuous Synthetic Minority Oversampling Technique (C-SMOTE), a novel meta-strategy to pipeline with Streaming Machine Learning (SML) classification algorithms, to forecast the COVID-19 pandemic trend. Benchmarking SML pipelinesthat use C-SMOTE against state-of-the-art methods on a COVID-19 dataset, we bring statistical evidence that models learned using C-SMOTE are better.","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82762125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Integrated Group Decision-Making Approach Considering Uncertainty Conditions 考虑不确定性条件的综合群体决策方法
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-07-02 DOI: 10.52825/bis.v1i.52
D. Borissova, Z. Dimitrova
The management of business information processes needs effective decision-making models. That means to involve different methods, techniques, and principles to improve competitiveness and to achieve the planned business results. In this context, the article deals with the problem of group decision-making under uncertain conditions. To cope with such problems some well-known optimization strategies of Wald, Laplace, Hurwitz, and Savage are modified to take into account the experts’ opinions with different importance when forming the final group decision. Numerical testing is based on a case study for CRM software selection. The results are discussed based on the proposed models under two different cases derived from the case study. The conducted numerical testing of the proposed models demonstrates their applicability to cope simultaneously with multiple experts’ evaluations and uncertainty conditions.
企业信息流程的管理需要有效的决策模型。这意味着涉及不同的方法,技术和原则,以提高竞争力和实现计划的业务成果。在此背景下,本文研究了不确定条件下的群体决策问题。为了解决这类问题,我们对Wald、Laplace、Hurwitz和Savage等著名的优化策略进行了改进,在形成最终群体决策时考虑了不同重要程度的专家意见。数值测试是基于客户关系管理软件选择的案例研究。在两个不同的案例下,对所建立的模型进行了讨论。对所提模型进行的数值试验表明,所提模型能够同时应对多专家评价和不确定性条件。
{"title":"An Integrated Group Decision-Making Approach Considering Uncertainty Conditions","authors":"D. Borissova, Z. Dimitrova","doi":"10.52825/bis.v1i.52","DOIUrl":"https://doi.org/10.52825/bis.v1i.52","url":null,"abstract":"The management of business information processes needs effective decision-making models. That means to involve different methods, techniques, and principles to improve competitiveness and to achieve the planned business results. In this context, the article deals with the problem of group decision-making under uncertain conditions. To cope with such problems some well-known optimization strategies of Wald, Laplace, Hurwitz, and Savage are modified to take into account the experts’ opinions with different importance when forming the final group decision. Numerical testing is based on a case study for CRM software selection. The results are discussed based on the proposed models under two different cases derived from the case study. The conducted numerical testing of the proposed models demonstrates their applicability to cope simultaneously with multiple experts’ evaluations and uncertainty conditions.","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81307127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Evaluation of Deep Learning Instance Segmentation Models for Pig Precision Livestock Farming 猪精准养殖中深度学习实例分割模型的评价
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-07-02 DOI: 10.52825/bis.v1i.59
J. Witte, Johann Gerberding, Christian Melching, J. Gómez
In this paper, the deep learning instance segmentation architectures DetectoRS, SOLOv2, DETR and Mask R-CNN were applied to data from the field of Pig Precision Livestock Farming to investigate whether these models can address the specific challenges of this domain. For this purpose, we created a custom dataset consisting of 731 images with high heterogeneity and high-quality segmentation masks. For evaluation, the standard metric for benchmarking instance segmentation models in computer vision, the mean average precision, was used. The results show that all tested models can be applied to the considered domain in terms of prediction accuracy. With a mAP of 0.848, DetectoRS achieves the best results on the test set, but is also the largest model with the greatest hardware requirements. It turns out that increasing model complexity and size does not have a large impact on prediction accuracy for instance segmentation of pigs. DETR, SOLOv2, and Mask R-CNN achieve similar results to DetectoRS with a parameter count almost three times smaller. Visual evaluation of predictions shows quality differences in terms of accuracy of segmentation masks. DetectoRS generates the best masks overall, while DETR has advantages in correctly segmenting the tail region. However, it can be observed that each of the tested models has problems in assigning segmentation masks correctly once a pig is overlapped. The results demonstrate the potential of deep learning instance segmentation models in Pig Precision Livestock Farming and lay the foundation for future research in this area.
本文将深度学习实例分割架构detector、SOLOv2、DETR和Mask R-CNN应用于养猪精准养殖领域的数据,研究这些模型是否能够解决该领域的具体挑战。为此,我们创建了一个由731张具有高异质性和高质量分割掩码的图像组成的自定义数据集。为了进行评估,使用了计算机视觉中对实例分割模型进行基准测试的标准度量,即平均精度。结果表明,从预测精度上看,所有测试模型都可以应用于所考虑的领域。检测器的mAP为0.848,在测试集上获得了最好的结果,但也是硬件要求最高的最大模型。结果表明,增加模型复杂度和模型大小对猪实例分割的预测精度影响不大。DETR, SOLOv2和Mask R-CNN在参数数量几乎小三倍的情况下实现了与检测器相似的结果。预测的视觉评估显示了分割掩码准确性方面的质量差异。检测器总体上产生最好的掩模,而DETR在正确分割尾部区域方面具有优势。然而,可以观察到,一旦猪重叠,每个测试模型在正确分配分割掩码方面都存在问题。研究结果显示了深度学习实例分割模型在养猪精准养殖中的潜力,为该领域未来的研究奠定了基础。
{"title":"Evaluation of Deep Learning Instance Segmentation Models for Pig Precision Livestock Farming","authors":"J. Witte, Johann Gerberding, Christian Melching, J. Gómez","doi":"10.52825/bis.v1i.59","DOIUrl":"https://doi.org/10.52825/bis.v1i.59","url":null,"abstract":"In this paper, the deep learning instance segmentation architectures DetectoRS, SOLOv2, DETR and Mask R-CNN were applied to data from the field of Pig Precision Livestock Farming to investigate whether these models can address the specific challenges of this domain. For this purpose, we created a custom dataset consisting of 731 images with high heterogeneity and high-quality segmentation masks. For evaluation, the standard metric for benchmarking instance segmentation models in computer vision, the mean average precision, was used. The results show that all tested models can be applied to the considered domain in terms of prediction accuracy. With a mAP of 0.848, DetectoRS achieves the best results on the test set, but is also the largest model with the greatest hardware requirements. It turns out that increasing model complexity and size does not have a large impact on prediction accuracy for instance segmentation of pigs. DETR, SOLOv2, and Mask R-CNN achieve similar results to DetectoRS with a parameter count almost three times smaller. Visual evaluation of predictions shows quality differences in terms of accuracy of segmentation masks. DetectoRS generates the best masks overall, while DETR has advantages in correctly segmenting the tail region. However, it can be observed that each of the tested models has problems in assigning segmentation masks correctly once a pig is overlapped. The results demonstrate the potential of deep learning instance segmentation models in Pig Precision Livestock Farming and lay the foundation for future research in this area.","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75204751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Mapping of ImageNet and Wikidata for Knowledge Graphs Enabled Computer Vision 面向知识图谱的ImageNet和Wikidata的映射
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-07-02 DOI: 10.52825/bis.v1i.65
D. Filipiak, A. Fensel, A. Filipowska
Knowledge graphs are used as a source of prior knowledge in numerous computer vision tasks. However, such an approach requires to have a mapping between ground truth data labels and the target knowledge graph. We linked the ILSVRC 2012 dataset (often simply referred to as ImageNet) labels to Wikidata entities. This enables using rich knowledge graph structure and contextual information for several computer vision tasks, traditionally benchmarked with ImageNet and its variations. For instance, in few-shot learning classification scenarios with neural networks, this mapping can be leveraged for weight initialisation, which can improve the final performance metrics value. We mapped all 1000 ImageNet labels – 461 were already directly linked with the exact match property (P2888), 467 have exact match candidates, and 72 cannot be matched directly. For these 72 labels, we discuss different problem categories stemming from the inability of finding an exact match. Semantically close non-exact match candidates are presented as well. The mapping is publicly available athttps://github.com/DominikFilipiak/imagenet-to-wikidata-mapping.
在许多计算机视觉任务中,知识图被用作先验知识的来源。然而,这种方法需要在真实数据标签和目标知识图之间有一个映射。我们将ILSVRC 2012数据集(通常简称为ImageNet)标签链接到维基数据实体。这使得可以使用丰富的知识图谱结构和上下文信息来完成几个计算机视觉任务,传统上使用ImageNet及其变体进行基准测试。例如,在使用神经网络的少量学习分类场景中,可以利用这种映射进行权重初始化,这可以提高最终的性能指标值。我们映射了所有1000个ImageNet标签——461个已经直接链接到精确匹配属性(P2888), 467个有精确匹配候选,72个不能直接匹配。对于这72个标签,我们讨论了由于无法找到精确匹配而产生的不同问题类别。语义上接近的非精确匹配候选也被提出。映射是公开的:https://github.com/DominikFilipiak/imagenet-to-wikidata-mapping。
{"title":"Mapping of ImageNet and Wikidata for Knowledge Graphs Enabled Computer Vision","authors":"D. Filipiak, A. Fensel, A. Filipowska","doi":"10.52825/bis.v1i.65","DOIUrl":"https://doi.org/10.52825/bis.v1i.65","url":null,"abstract":"Knowledge graphs are used as a source of prior knowledge in numerous computer vision tasks. However, such an approach requires to have a mapping between ground truth data labels and the target knowledge graph. We linked the ILSVRC 2012 dataset (often simply referred to as ImageNet) labels to Wikidata entities. This enables using rich knowledge graph structure and contextual information for several computer vision tasks, traditionally benchmarked with ImageNet and its variations. For instance, in few-shot learning classification scenarios with neural networks, this mapping can be leveraged for weight initialisation, which can improve the final performance metrics value. We mapped all 1000 ImageNet labels – 461 were already directly linked with the exact match property (P2888), 467 have exact match candidates, and 72 cannot be matched directly. For these 72 labels, we discuss different problem categories stemming from the inability of finding an exact match. Semantically close non-exact match candidates are presented as well. The mapping is publicly available athttps://github.com/DominikFilipiak/imagenet-to-wikidata-mapping.","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81874003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Post-Brexit power of European Union from the world trade network analysis 从世界贸易网络分析脱欧后欧盟的权力
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-05-03 DOI: 10.52825/bis.v1i.48
Justin Loye, K. Jaffrès-Runser, D. Shepelyansky
We develop the Google matrix analysis of the multiproduct world trade network obtained from the UN COMTRADE database in recent years. The comparison is done between this new approach and the usual Import-Export description of this world trade network. The Google matrix analysis takes into account the multiplicity of trade transactions thus highlighting in a better way the world influence of specific countries and products. It shows that after Brexit, the European Union of 27 countries has the leading position in the world trade network ranking, being ahead of USA and China. Our approach determines also a sensitivity of trade country balance to specific products showing the dominant role of machinery and mineral fuels in multiproduct exchanges. It also underlines the growing influence of Asian countries.
本文对近年来从联合国COMTRADE数据库中获得的多产品世界贸易网络进行了谷歌矩阵分析。将这种新方法与通常的世界贸易网络的进出口描述进行了比较。谷歌矩阵分析考虑到贸易交易的多样性,从而以更好的方式突出了特定国家和产品的世界影响力。数据显示,英国脱欧后,由27个国家组成的欧盟在世界贸易网络排名中处于领先地位,领先于美国和中国。我们的方法还决定了贸易国家平衡对特定产品的敏感性,显示了机械和矿物燃料在多产品交换中的主导作用。这也突显了亚洲国家日益增长的影响力。
{"title":"Post-Brexit power of European Union from the world trade network analysis","authors":"Justin Loye, K. Jaffrès-Runser, D. Shepelyansky","doi":"10.52825/bis.v1i.48","DOIUrl":"https://doi.org/10.52825/bis.v1i.48","url":null,"abstract":"We develop the Google matrix analysis of the multiproduct world trade network obtained from the UN COMTRADE database in recent years. The comparison is done between this new approach and the usual Import-Export description of this world trade network. The Google matrix analysis takes into account the multiplicity of trade transactions thus highlighting in a better way the world influence of specific countries and products. It shows that after Brexit, the European Union of 27 countries has the leading position in the world trade network ranking, being ahead of USA and China. Our approach determines also a sensitivity of trade country balance to specific products showing the dominant role of machinery and mineral fuels in multiproduct exchanges. It also underlines the growing influence of Asian countries.","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82270963","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Stream Processing Tools for Analyzing Objects in Motion Sending High-Volume Location Data 用于分析运动对象的流处理工具发送大量位置数据
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-01-01 DOI: 10.52825/bis.v1i.41
Krzysztof Węcel, Marcin Szmydt, Milena Stróżyna
Recently we observe a significant increase in the amount of easily accessible data on transport and mobility. This data is mostly massive streams of high velocity, magnitude, and heterogeneity, which represent a flow of goods, shipments and the movements of fleet. It is therefore necessary to develop a scalable framework and apply tools capable of handling these streams. In the paper we propose an approach for the selection of software for stream processing solutions that may be used in the transportation domain. We provide an overview of potential stream processing technologies, followed by the method for choosing the selected software for real-time analysis of data streams coming from objects in motion. We have selected two solutions: Apache Spark Streaming and Apache Flink, and benchmarked them on a real-world task. We identified the caveats and challenges when it comes to implementation of the solution in practice.
最近,我们观察到关于交通和流动性的易于获取的数据量显著增加。这些数据大多是高速、大规模和异构的大量数据流,它们代表了货物、货物和船队的流动。因此,有必要开发一个可伸缩的框架,并应用能够处理这些流的工具。在本文中,我们提出了一种可用于传输领域的流处理解决方案的软件选择方法。我们概述了潜在的流处理技术,然后介绍了选择用于实时分析来自运动对象的数据流的选定软件的方法。我们选择了两种解决方案:Apache Spark Streaming和Apache Flink,并在实际任务中对它们进行了基准测试。我们确定了在实践中实现解决方案时的注意事项和挑战。
{"title":"Stream Processing Tools for Analyzing Objects in Motion Sending High-Volume Location Data","authors":"Krzysztof Węcel, Marcin Szmydt, Milena Stróżyna","doi":"10.52825/bis.v1i.41","DOIUrl":"https://doi.org/10.52825/bis.v1i.41","url":null,"abstract":"Recently we observe a significant increase in the amount of easily accessible data on transport and mobility. This data is mostly massive streams of high velocity, magnitude, and heterogeneity, which represent a flow of goods, shipments and the movements of fleet. It is therefore necessary to develop a scalable framework and apply tools capable of handling these streams. In the paper we propose an approach for the selection of software for stream processing solutions that may be used in the transportation domain. We provide an overview of potential stream processing technologies, followed by the method for choosing the selected software for real-time analysis of data streams coming from objects in motion. We have selected two solutions: Apache Spark Streaming and Apache Flink, and benchmarked them on a real-world task. We identified the caveats and challenges when it comes to implementation of the solution in practice.","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73898859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Learning for Customer Churn Prediction in E-Commerce Decision Support 电子商务决策支持中客户流失预测的深度学习
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-01-01 DOI: 10.52825/bis.v1i.42
Maciej Pondel, Maciej Wuczynski, W. Gryncewicz, Lukasz Lysik, Marcin Hernes, Artur Rot, Agata Kozina
Churn prediction is a Big Data domain, one of the most demanding use cases of recent time. It is also one of the most critical indicators of a healthy and growing business, irrespective of the size or channel of sales. This paper aims to develop a deep learning model for customers’ churn prediction in e-commerce, which is the main contribution of the article. The experiment was performed over real e-commerce data where 75% of buyers are one-off customers. The prediction based on this business specificity (many one-off customers and very few regular ones) is extremely challenging and, in a natural way, must be inaccurate to a certain ex-tent. Looking from another perspective, correct prediction and subsequent actions resulting in a higher customer retention are very attractive for overall business performance. In such a case, predictions with 74% accuracy, 78% precision, and 68% recall are very promising. Also, the paper fills a research gap and contrib-utes to the existing literature in the area of developing a customer churn prediction method for the retail sector by using deep learning tools based on customer churn and the full history of each customer’s transactions.
流失预测是一个大数据领域,也是最近最苛刻的用例之一。无论销售规模或渠道如何,这也是衡量企业健康成长的最关键指标之一。本文旨在开发电子商务中客户流失预测的深度学习模型,这是本文的主要贡献。该实验是在真实的电子商务数据上进行的,其中75%的买家是一次性客户。基于这种业务特殊性(许多一次性客户和很少的常规客户)的预测是极具挑战性的,并且在一定程度上必然是不准确的。从另一个角度来看,正确的预测和后续行动导致更高的客户保留率对整体业务绩效非常有吸引力。在这种情况下,准确率为74%,准确率为78%,召回率为68%的预测是非常有希望的。此外,本文填补了研究空白,并通过使用基于客户流失和每个客户交易的完整历史的深度学习工具,为零售业开发客户流失预测方法的现有文献做出了贡献。
{"title":"Deep Learning for Customer Churn Prediction in E-Commerce Decision Support","authors":"Maciej Pondel, Maciej Wuczynski, W. Gryncewicz, Lukasz Lysik, Marcin Hernes, Artur Rot, Agata Kozina","doi":"10.52825/bis.v1i.42","DOIUrl":"https://doi.org/10.52825/bis.v1i.42","url":null,"abstract":"Churn prediction is a Big Data domain, one of the most demanding use cases of recent time. It is also one of the most critical indicators of a healthy and growing business, irrespective of the size or channel of sales. This paper aims to develop a deep learning model for customers’ churn prediction in e-commerce, which is the main contribution of the article. The experiment was performed over real e-commerce data where 75% of buyers are one-off customers. The prediction based on this business specificity (many one-off customers and very few regular ones) is extremely challenging and, in a natural way, must be inaccurate to a certain ex-tent. Looking from another perspective, correct prediction and subsequent actions resulting in a higher customer retention are very attractive for overall business performance. In such a case, predictions with 74% accuracy, 78% precision, and 68% recall are very promising. Also, the paper fills a research gap and contrib-utes to the existing literature in the area of developing a customer churn prediction method for the retail sector by using deep learning tools based on customer churn and the full history of each customer’s transactions.","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84919126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Data Quality Assessment of Comma Separated Values Using Linked Data Approach 使用关联数据方法评估逗号分隔值的数据质量
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-01-01 DOI: 10.1007/978-3-031-04216-4_22
Aparna Nayak, Bojan Bozic, L. Longo
{"title":"Data Quality Assessment of Comma Separated Values Using Linked Data Approach","authors":"Aparna Nayak, Bojan Bozic, L. Longo","doi":"10.1007/978-3-031-04216-4_22","DOIUrl":"https://doi.org/10.1007/978-3-031-04216-4_22","url":null,"abstract":"","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85686058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Challenges of Mining Twitter Data for Analyzing Service Performance: A Case Study of Transportation Service in Malaysia 挖掘Twitter数据分析服务绩效的挑战:以马来西亚交通服务为例
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-01-01 DOI: 10.1007/978-3-031-04216-4_21
Hui Na Chua, Alvin Wei Qiang Liao, Y. Low, A. Lee, M. Ismail
{"title":"Challenges of Mining Twitter Data for Analyzing Service Performance: A Case Study of Transportation Service in Malaysia","authors":"Hui Na Chua, Alvin Wei Qiang Liao, Y. Low, A. Lee, M. Ismail","doi":"10.1007/978-3-031-04216-4_21","DOIUrl":"https://doi.org/10.1007/978-3-031-04216-4_21","url":null,"abstract":"","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76843317","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Database-Less Extraction of Event Logs from Redo Logs 从重做日志中提取事件日志
IF 7.9 3区 管理学 Q1 Computer Science Pub Date : 2021-01-01 DOI: 10.52825/bis.v1i.66
Dorina Bano, Tom Lichtenstein, Finn Klessascheck, M. Weske
Process mining is widely adopted in organizations to gain deep insights about running business processes. This can be achieved by applying different process mining techniques like discovery, conformance checking, and performance analysis. These techniques are applied on event logs, which need to be extracted from the organization’s databases beforehand. This not only implies access to databases, but also detailed knowledge about the database schema, which is often not available. In many real-world scenarios, however, process execution data is available as redo logs. Such logs are used to bring a database into a consistent state in case of a system failure. This paper proposes a semi-automatic approach to extract an event log from redo logs alone. It does not require access to the database or knowledge of the databaseschema. The feasibility of the proposed approach is evaluated on two synthetic redo logs.
流程挖掘在组织中被广泛采用,以获得关于运行业务流程的深入见解。这可以通过应用不同的流程挖掘技术(如发现、一致性检查和性能分析)来实现。这些技术应用于事件日志,需要事先从组织的数据库中提取事件日志。这不仅意味着对数据库的访问,还意味着对数据库模式的详细了解,而这些通常是无法获得的。然而,在许多实际场景中,进程执行数据作为重做日志可用。这些日志用于在系统发生故障时使数据库保持一致状态。本文提出了一种半自动的从重做日志中提取事件日志的方法。它不需要访问数据库或了解数据库模式。在两个合成重做日志上对该方法的可行性进行了评价。
{"title":"Database-Less Extraction of Event Logs from Redo Logs","authors":"Dorina Bano, Tom Lichtenstein, Finn Klessascheck, M. Weske","doi":"10.52825/bis.v1i.66","DOIUrl":"https://doi.org/10.52825/bis.v1i.66","url":null,"abstract":"Process mining is widely adopted in organizations to gain deep insights about running business processes. This can be achieved by applying different process mining techniques like discovery, conformance checking, and performance analysis. These techniques are applied on event logs, which need to be extracted from the organization’s databases beforehand. This not only implies access to databases, but also detailed knowledge about the database schema, which is often not available. In many real-world scenarios, however, process execution data is available as redo logs. Such logs are used to bring a database into a consistent state in case of a system failure. This paper proposes a semi-automatic approach to extract an event log from redo logs alone. It does not require access to the database or knowledge of the databaseschema. The feasibility of the proposed approach is evaluated on two synthetic redo logs.","PeriodicalId":56020,"journal":{"name":"Business & Information Systems Engineering","volume":null,"pages":null},"PeriodicalIF":7.9,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80312091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Business & Information Systems Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1