首页 > 最新文献

Data Mining for Business Applications最新文献

英文 中文
A Technology Platform to Enable the Building of Corporate Radar Applications that Mine the Web for Business Insight 一个技术平台,使企业雷达应用程序的建设,挖掘网络的业务洞察力
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-149
P. Yeh, A. Kass
In this paper, we present a technology platform that can be customized to create a wide range of corporate radar applications that can turn the Web into a systematic source of business insight. This platform integrates a combination of established AI technologies --i.e. semantic models, natural language processing, and inference engines --in a novel way. We present two prototype corporate radars built using this platform: the Business Event Advisor, which detects threats and opportunities relevant to a decision maker's organization, and the Technology Investment Radar which assesses the maturity of technologies that impact a decision maker's business. The Technology Investment Radar has been piloted with business users, and we present encouraging initial results from this pilot.
在本文中,我们提出了一个技术平台,可以对其进行定制,以创建范围广泛的企业雷达应用程序,这些应用程序可以将Web转变为业务洞察的系统来源。该平台以一种新颖的方式整合了现有的人工智能技术,即语义模型、自然语言处理和推理引擎。我们展示了使用该平台构建的两个原型公司雷达:业务事件顾问,用于检测与决策者组织相关的威胁和机会,以及技术投资雷达,用于评估影响决策者业务的技术成熟度。技术投资雷达已经在商业用户中进行了试验,我们从试验中获得了令人鼓舞的初步结果。
{"title":"A Technology Platform to Enable the Building of Corporate Radar Applications that Mine the Web for Business Insight","authors":"P. Yeh, A. Kass","doi":"10.3233/978-1-60750-633-1-149","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-149","url":null,"abstract":"In this paper, we present a technology platform that can be customized to create a wide range of corporate radar applications that can turn the Web into a systematic source of business insight. This platform integrates a combination of established AI technologies --i.e. semantic models, natural language processing, and inference engines --in a novel way. We present two prototype corporate radars built using this platform: the Business Event Advisor, which detects threats and opportunities relevant to a decision maker's organization, and the Technology Investment Radar which assesses the maturity of technologies that impact a decision maker's business. The Technology Investment Radar has been piloted with business users, and we present encouraging initial results from this pilot.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129240954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards the Generic Framework for Utility Considerations in Data Mining Research 面向数据挖掘研究中实用考虑的通用框架
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-49
S. Puuronen, Mykola Pechenizkiy
Rigor data mining (DM) research has successfully developed advanced data mining techniques and algorithms, and many organizations have great expectations to take more benefit of their vast data warehouses in decision making. Even when there are some success stories the current status in practice is mainly including great expectations that have not yet been fulfilled. DM researchers have recently become interested in utility-based DM (UBDM) starting to consider some of the economic utility factors (like cost of data, cost of measurement, cost of class label and so forth), but yet many other utility factors are left outside the main directions of UBDM. The goal of this position paper is (1) to motivate researchers to consider utility from broader perspective than usually done in UBDM context and (2) to introduce a new generic framework for these broader utility considerations in DM research. Besides describing our multi-criteria utility based framework (MCUF) we present a few hypothetical examples showing how the framework might be used to consider utilities of some potential DM research stakeholders.
严谨的数据挖掘(DM)研究已经成功地开发了先进的数据挖掘技术和算法,许多组织都希望在决策中更多地利用其庞大的数据仓库。即使有一些成功的故事,目前的实践状况主要是包括尚未实现的巨大期望。最近,基于效用的数据管理研究人员开始对基于效用的数据管理(UBDM)感兴趣,开始考虑一些经济效用因素(如数据成本、测量成本、分类标签成本等),但许多其他效用因素被排除在UBDM的主要方向之外。本意见书的目标是:(1)激励研究人员从比通常在UBDM背景下更广泛的角度考虑效用;(2)为DM研究中这些更广泛的效用考虑引入一个新的通用框架。除了描述我们的基于多标准效用的框架(MCUF)之外,我们还提出了一些假设的例子,展示了如何使用该框架来考虑一些潜在的DM研究利益相关者的效用。
{"title":"Towards the Generic Framework for Utility Considerations in Data Mining Research","authors":"S. Puuronen, Mykola Pechenizkiy","doi":"10.3233/978-1-60750-633-1-49","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-49","url":null,"abstract":"Rigor data mining (DM) research has successfully developed advanced data mining techniques and algorithms, and many organizations have great expectations to take more benefit of their vast data warehouses in decision making. Even when there are some success stories the current status in practice is mainly including great expectations that have not yet been fulfilled. DM researchers have recently become interested in utility-based DM (UBDM) starting to consider some of the economic utility factors (like cost of data, cost of measurement, cost of class label and so forth), but yet many other utility factors are left outside the main directions of UBDM. The goal of this position paper is (1) to motivate researchers to consider utility from broader perspective than usually done in UBDM context and (2) to introduce a new generic framework for these broader utility considerations in DM research. Besides describing our multi-criteria utility based framework (MCUF) we present a few hypothetical examples showing how the framework might be used to consider utilities of some potential DM research stakeholders.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116259337","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Clustering of Adolescent Criminal Offenders using Psychological and Criminological Profiles 基于心理和犯罪学特征的青少年罪犯聚类研究
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-123
M. Breitenbach, T. Brennan, W. Dieterich, G. Grudic
In Criminology research the question arises if certain types of delinquents can be identified from data, and while there are many cases that can not be clearly labeled, overlapping taxonomies have been proposed in [1,2,3]. In a recent study Juvenile offenders (N = 1572) from three state systems were assessed on a battery of criminogenic risk and needs factors and their official criminal histories. Cluster analysis methods were applied. One problem we encountered is the large number of hybrid cases that have to belong to two or more classes. To eliminate these cases we propose a method that combines the results of Bagged K-Means and the consistency method [4], a semi-supervised learning technique. A manual interpretation of the results showed very interpretable patterns that were linked to existing criminologic research.
在犯罪学研究中,如果可以从数据中识别出某些类型的违法者,那么问题就出现了,尽管有许多案例无法明确标记,但在[1,2,3]中提出了重叠的分类。在最近的一项研究中,对来自三个州的青少年罪犯(N = 1572)进行了一系列犯罪风险和需求因素以及他们的正式犯罪历史的评估。采用聚类分析方法。我们遇到的一个问题是大量的混合案例必须属于两个或更多的类。为了消除这些情况,我们提出了一种结合Bagged K-Means结果和一致性方法[4]的方法,这是一种半监督学习技术。对结果的人工解释显示了与现有犯罪学研究相关的非常可解释的模式。
{"title":"Clustering of Adolescent Criminal Offenders using Psychological and Criminological Profiles","authors":"M. Breitenbach, T. Brennan, W. Dieterich, G. Grudic","doi":"10.3233/978-1-60750-633-1-123","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-123","url":null,"abstract":"In Criminology research the question arises if certain types of delinquents can be identified from data, and while there are many cases that can not be clearly labeled, overlapping taxonomies have been proposed in [1,2,3]. In a recent study Juvenile offenders (N = 1572) from three state systems were assessed on a battery of criminogenic risk and needs factors and their official criminal histories. Cluster analysis methods were applied. One problem we encountered is the large number of hybrid cases that have to belong to two or more classes. To eliminate these cases we propose a method that combines the results of Bagged K-Means and the consistency method [4], a semi-supervised learning technique. A manual interpretation of the results showed very interpretable patterns that were linked to existing criminologic research.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121948668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
An Integrated System to Support Electricity Tariff Contract Definition 支持电价合同定义的集成系统
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-99
F. Rodrigues, V. Figueiredo, Z. Vale
This paper presents an integrated system that helps both retail companies and electricity consumers on the definition of the best retail contracts and tariffs. This integrated system is composed by a Decision Support System (DSS) based on a Consumer Characterization Framework (CCF). The CCF is based on data mining techniques, applied to obtain useful knowledge about electricity consumers from large amounts of consumption data. This knowledge is acquired following an innovative and systematic approach able to identify different consumers' classes, represented by a load profile, and its characterization using decision trees. The framework generates inputs to use in the knowledge base and in the database of the DSS. The rule sets derived from the decision trees are integrated in the knowledge base of the DSS. The load profiles together with the information about contracts and electricity prices form the database of the DSS. This DSS is able to perform the classification of different consumers, present its load profile and test different electricity tariffs and contracts. The final outputs of the DSS are a comparative economic analysis between different contracts and advice about the most economic contract to each consumer class. The presentation of the DSS is completed with an application example using a real data base of consumers from the Portuguese distribution company.
本文提出了一个综合系统,可以帮助零售公司和电力消费者定义最佳零售合同和电价。该集成系统由基于消费者特征框架(CCF)的决策支持系统(DSS)组成。CCF基于数据挖掘技术,从大量的用电数据中获取有用的用电信息。这种知识是通过一种创新和系统的方法获得的,这种方法能够识别不同的消费者类别,由负载概况表示,并使用决策树进行表征。该框架生成输入,以便在决策支持系统的知识库和数据库中使用。决策树生成的规则集被集成到决策支持系统的知识库中。负荷概况连同有关合约及电价的资料,构成决策支援系统的资料库。该DSS能够对不同的用户进行分类,呈现其负荷概况,并测试不同的电价和合同。决策服务的最终产出是对不同合同的比较经济分析,并就每个消费者阶层最经济的合同提出建议。通过使用来自葡萄牙分销公司的真实消费者数据库的应用程序示例,完成了对决策支持系统的介绍。
{"title":"An Integrated System to Support Electricity Tariff Contract Definition","authors":"F. Rodrigues, V. Figueiredo, Z. Vale","doi":"10.3233/978-1-60750-633-1-99","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-99","url":null,"abstract":"This paper presents an integrated system that helps both retail companies and electricity consumers on the definition of the best retail contracts and tariffs. This integrated system is composed by a Decision Support System (DSS) based on a Consumer Characterization Framework (CCF). The CCF is based on data mining techniques, applied to obtain useful knowledge about electricity consumers from large amounts of consumption data. This knowledge is acquired following an innovative and systematic approach able to identify different consumers' classes, represented by a load profile, and its characterization using decision trees. The framework generates inputs to use in the knowledge base and in the database of the DSS. The rule sets derived from the decision trees are integrated in the knowledge base of the DSS. The load profiles together with the information about contracts and electricity prices form the database of the DSS. This DSS is able to perform the classification of different consumers, present its load profile and test different electricity tariffs and contracts. The final outputs of the DSS are a comparative economic analysis between different contracts and advice about the most economic contract to each consumer class. The presentation of the DSS is completed with an application example using a real data base of consumers from the Portuguese distribution company.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128579112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Best Practices for Predictive Analytics in B2B Financial Services B2B金融服务中预测分析的最佳实践
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-35
Raul Domingos, Thierry Van de Merckt
Predictive analytics is a well known practice among corporations having business with private consumers (B2C) as a means to achieve competitive advantage. The first part of this article intends to show that corporations operating in a business to business (B2B) setting have similar conditions to use predictive analytics on their favor. Predictive analytics can be applied to solve a myriad of business problems. The solutions to solve some of these problems are well known while the resolution of other problems requires quite an amount of research and innovation. However, predictive analytics professionals tend to solve similar problems in very different ways, even those to which there are known best practices. The second part of this article uses predictive analytics applications identified in a B2B context to describe a set of best practices to solve well known problems (the “let's not re-invent the wheel” attitude) and innovative practices to solve challenging problems.
预测分析在拥有私人消费者(B2C)业务的公司中是一种众所周知的实践,它是获得竞争优势的一种手段。本文的第一部分旨在表明,在企业对企业(B2B)环境中运营的公司也有类似的条件来使用对他们有利的预测分析。预测分析可以应用于解决无数的业务问题。其中一些问题的解决方案是众所周知的,而其他问题的解决需要相当多的研究和创新。然而,预测分析专业人员倾向于以非常不同的方式解决类似的问题,即使是那些已知的最佳实践。本文的第二部分使用在B2B上下文中确定的预测分析应用程序来描述一组解决众所周知问题的最佳实践(“让我们不要重新发明轮子”的态度)和解决具有挑战性问题的创新实践。
{"title":"Best Practices for Predictive Analytics in B2B Financial Services","authors":"Raul Domingos, Thierry Van de Merckt","doi":"10.3233/978-1-60750-633-1-35","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-35","url":null,"abstract":"Predictive analytics is a well known practice among corporations having business with private consumers (B2C) as a means to achieve competitive advantage. The first part of this article intends to show that corporations operating in a business to business (B2B) setting have similar conditions to use predictive analytics on their favor. Predictive analytics can be applied to solve a myriad of business problems. The solutions to solve some of these problems are well known while the resolution of other problems requires quite an amount of research and innovation. However, predictive analytics professionals tend to solve similar problems in very different ways, even those to which there are known best practices. The second part of this article uses predictive analytics applications identified in a B2B context to describe a set of best practices to solve well known problems (the “let's not re-invent the wheel” attitude) and innovative practices to solve challenging problems.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123149273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mining Medical Administrative Data - The PKB Suite 挖掘医疗管理数据- PKB套件
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-110
Aaron Ceglar, R. Morrall, J. Roddick
Hospitals are adept at capturing large volumes of highly multi-dimensional data about their activities including clinical, demographic, administrative, financial and, increasingly, outcome data (such as adverse events). Managing and understanding this data is difficult as hospitals typically do not have the staff and/or the expertise to assemble, query, analyse and report on the potential knowledge contained within such data. The Power Knowledge Builder (PKB) project investigated the adaption of data mining algorithms to the domain of patient costing, with the aim of helping practitioners better understand their data and therefore facilitate best practice.
医院擅长获取有关其活动的大量高度多维数据,包括临床、人口、行政、财务以及越来越多的结果数据(如不良事件)。管理和理解这些数据是困难的,因为医院通常没有工作人员和/或专业知识来收集、查询、分析和报告这些数据中包含的潜在知识。Power Knowledge Builder (PKB)项目调查了数据挖掘算法在患者成本计算领域的应用,目的是帮助从业者更好地理解他们的数据,从而促进最佳实践。
{"title":"Mining Medical Administrative Data - The PKB Suite","authors":"Aaron Ceglar, R. Morrall, J. Roddick","doi":"10.3233/978-1-60750-633-1-110","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-110","url":null,"abstract":"Hospitals are adept at capturing large volumes of highly multi-dimensional data about their activities including clinical, demographic, administrative, financial and, increasingly, outcome data (such as adverse events). Managing and understanding this data is difficult as hospitals typically do not have the staff and/or the expertise to assemble, query, analyse and report on the potential knowledge contained within such data. The Power Knowledge Builder (PKB) project investigated the adaption of data mining algorithms to the domain of patient costing, with the aim of helping practitioners better understand their data and therefore facilitate best practice.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127119572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Resource-bounded Outlier Detection using Clustering Methods 基于聚类方法的资源边界异常点检测
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-84
L. Torgo, Carlos Soares
This paper describes a methodology for the application of hierarchical clustering methods to the task of outlier detection. The methodology is tested on the problem of cleaning Official Statistics data. The goal is to detect erroneous foreign trade transactions in data collected by the Portuguese Institute of Statistics (INE). These transactions are a minority, but still they have an important impact on the statistics produced by the institute. The detectiong of these rare errors is a manual, time-consuming task. This type of tasks is usually constrained by a limited amount of available resources. Our proposal addresses this issue by producing a ranking of outlyingness that allows a better management of the available resources by allocating them to the cases which are most different from the other and, thus, have a higher probability of being errors. Our method is based on the output of standard agglomerative hierarchical clustering algorithms, resulting in no significant additional computational costs. Our results show that it enables large savings by selecting a small subset of suspicious transactions for manual inspection, which, nevertheless, includes most of the erroneous transactions. In this study we compare our proposal to a state of the art outlier ranking method (LOF) and show that our method achieves better results on this particular application. The results of our experiments are also competitive with previous results on the same data. Finally, the outcome of our experiments raises important questions concerning the method currently followed at INE concerning items with small number of transactions.
本文描述了一种将层次聚类方法应用于异常点检测任务的方法。在清理官方统计数据的问题上对该方法进行了检验。目标是在葡萄牙统计局(INE)收集的数据中发现错误的外贸交易。这些交易是少数,但它们仍然对研究所的统计数据产生重要影响。检测这些罕见的错误是一项手动且耗时的任务。这类任务通常受到可用资源数量有限的限制。我们的建议通过生成离群度排序来解决这个问题,通过将可用资源分配给与其他情况最不同的情况,从而允许更好地管理可用资源,从而具有更高的错误概率。我们的方法基于标准聚类分层聚类算法的输出,因此没有显著的额外计算成本。我们的结果表明,通过选择一小部分可疑事务进行人工检查,它可以节省大量费用,然而,其中包括大多数错误事务。在本研究中,我们将我们的建议与最先进的离群值排序方法(LOF)进行了比较,并表明我们的方法在此特定应用中取得了更好的结果。我们的实验结果与以往在相同数据上的结果也具有竞争力。最后,我们的实验结果提出了一些重要的问题,这些问题涉及INE目前所采用的涉及少量交易项目的方法。
{"title":"Resource-bounded Outlier Detection using Clustering Methods","authors":"L. Torgo, Carlos Soares","doi":"10.3233/978-1-60750-633-1-84","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-84","url":null,"abstract":"This paper describes a methodology for the application of hierarchical clustering methods to the task of outlier detection. The methodology is tested on the problem of cleaning Official Statistics data. The goal is to detect erroneous foreign trade transactions in data collected by the Portuguese Institute of Statistics (INE). These transactions are a minority, but still they have an important impact on the statistics produced by the institute. The detectiong of these rare errors is a manual, time-consuming task. This type of tasks is usually constrained by a limited amount of available resources. Our proposal addresses this issue by producing a ranking of outlyingness that allows a better management of the available resources by allocating them to the cases which are most different from the other and, thus, have a higher probability of being errors. Our method is based on the output of standard agglomerative hierarchical clustering algorithms, resulting in no significant additional computational costs. Our results show that it enables large savings by selecting a small subset of suspicious transactions for manual inspection, which, nevertheless, includes most of the erroneous transactions. In this study we compare our proposal to a state of the art outlier ranking method (LOF) and show that our method achieves better results on this particular application. The results of our experiments are also competitive with previous results on the same data. Finally, the outcome of our experiments raises important questions concerning the method currently followed at INE concerning items with small number of transactions.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126584611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Data Mining for Business Applications: Introduction 商业应用的数据挖掘:导论
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-1
Carlos Soares, R. Ghani
This chapter introduces the volume on Data Mining (DM) for Business Applications. The chapters in this book provide an overview of some of the major advances in the field, namely in terms of methodology and applications, both traditional and emerging. In this introductory paper, we provide a context for the rest of the book. The framework for discussing the contents of the book is the DM methodology, which is suitable both to organize and relate the diverse contributions of the chapters selected. The chapter closes with an overview of the chapters in the book to guide the reader.
本章介绍了商业应用的数据挖掘(DM)卷。本书的章节概述了该领域的一些主要进展,即在传统和新兴的方法论和应用方面。在这篇介绍性的论文中,我们为本书的其余部分提供了一个背景。讨论本书内容的框架是DM方法论,它既适合于组织,也适合于将所选章节的不同贡献联系起来。本章以本书各章的概述结束,以指导读者。
{"title":"Data Mining for Business Applications: Introduction","authors":"Carlos Soares, R. Ghani","doi":"10.3233/978-1-60750-633-1-1","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-1","url":null,"abstract":"This chapter introduces the volume on Data Mining (DM) for Business Applications. The chapters in this book provide an overview of some of the major advances in the field, namely in terms of methodology and applications, both traditional and emerging. In this introductory paper, we provide a context for the rest of the book. The framework for discussing the contents of the book is the DM methodology, which is suitable both to organize and relate the diverse contributions of the chapters selected. The chapter closes with an overview of the chapters in the book to guide the reader.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117262271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Customer churn prediction - a case study in retail banking 客户流失预测——零售银行案例研究
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-77
Teemu Mutanen, S. Nousiainen, J. Ahola
This work focuses on one of the central topics in customer relationship management (CRM): transfer of valuable customers to a competitor. Customer retention rate has a strong impact on customer lifetime value, and understanding the true value of a possible customer churn will help the company in its customer relationship management. Customer value analysis along with customer churn predictions will help marketing programs target more specific groups of customers. We predict customer churn with logistic regression techniques and analyze the churning and nonchurning customers by using data from a consumer retail banking company. The result of the case study show that using conventional statistical methods to identify possible churners can be successful.
本研究的重点是客户关系管理(CRM)的核心主题之一:将有价值的客户转移到竞争对手。客户保留率对客户终身价值有很大的影响,了解潜在客户流失的真正价值将有助于公司进行客户关系管理。客户价值分析和客户流失预测将有助于营销计划瞄准更具体的客户群体。本文运用logistic回归技术对客户流失进行预测,并利用某消费零售银行公司的数据对流失客户和非流失客户进行分析。案例研究的结果表明,使用传统的统计方法来识别可能的流失是成功的。
{"title":"Customer churn prediction - a case study in retail banking","authors":"Teemu Mutanen, S. Nousiainen, J. Ahola","doi":"10.3233/978-1-60750-633-1-77","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-77","url":null,"abstract":"This work focuses on one of the central topics in customer relationship management (CRM): transfer of valuable customers to a competitor. Customer retention rate has a strong impact on customer lifetime value, and understanding the true value of a possible customer churn will help the company in its customer relationship management. Customer value analysis along with customer churn predictions will help marketing programs target more specific groups of customers. We predict customer churn with logistic regression techniques and analyze the churning and nonchurning customers by using data from a consumer retail banking company. The result of the case study show that using conventional statistical methods to identify possible churners can be successful.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"132 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115461330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Forecasting Online Auctions using Dynamic Models 使用动态模型预测在线拍卖
Pub Date : 2010-08-07 DOI: 10.3233/978-1-60750-633-1-137
Wolfgang Jank, Galit Shmueli
We propose a dynamic forecasting model for price in online auctions. One of the key features of our model is that it operates during the live-auction, generating real-time forecasts which makes it different from previous static models. Our model is also different with respect to how information about price is incorporated. While one part of the model is based on the more traditional notion of an auction's price-level, another part incorporates its dynamics in the form of price-velocity and -acceleration. In that sense, it incorporates key features of a dynamic environment such as an online auction. The use of novel functional data methodology allows us to measure, and subsequently include, dynamic price characteristics. We illustrate our model on a diverse set of eBay auctions across many different book categories. It achieves significantly higher prediction accuracy compared to standard approaches.
提出了一种在线拍卖价格动态预测模型。我们的模型的一个关键特征是它在现场拍卖期间运行,生成实时预测,这使得它不同于以前的静态模型。我们的模型在如何纳入价格信息方面也有所不同。虽然该模型的一部分基于拍卖价格水平的更传统概念,但另一部分以价格速度和加速度的形式纳入了其动态。从这个意义上说,它结合了在线拍卖等动态环境的关键特征。使用新颖的功能数据方法使我们能够测量并随后包括动态价格特征。我们在许多不同图书类别的eBay拍卖中演示了我们的模型。与标准方法相比,该方法的预测精度显著提高。
{"title":"Forecasting Online Auctions using Dynamic Models","authors":"Wolfgang Jank, Galit Shmueli","doi":"10.3233/978-1-60750-633-1-137","DOIUrl":"https://doi.org/10.3233/978-1-60750-633-1-137","url":null,"abstract":"We propose a dynamic forecasting model for price in online auctions. One of the key features of our model is that it operates during the live-auction, generating real-time forecasts which makes it different from previous static models. Our model is also different with respect to how information about price is incorporated. While one part of the model is based on the more traditional notion of an auction's price-level, another part incorporates its dynamics in the form of price-velocity and -acceleration. In that sense, it incorporates key features of a dynamic environment such as an online auction. The use of novel functional data methodology allows us to measure, and subsequently include, dynamic price characteristics. We illustrate our model on a diverse set of eBay auctions across many different book categories. It achieves significantly higher prediction accuracy compared to standard approaches.","PeriodicalId":438467,"journal":{"name":"Data Mining for Business Applications","volume":"114 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120880402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Data Mining for Business Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1