首页 > 最新文献

2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)最新文献

英文 中文
IQ estimation for accurate time-series classification 用于精确时间序列分类的IQ估计
Pub Date : 2011-04-11 DOI: 10.1109/CIDM.2011.5949441
Krisztián Búza, A. Nanopoulos, L. Schmidt-Thieme
Due to its various applications, time-series classification is a prominent research topic in data mining and computational intelligence. The simple k-NN classifier using dynamic time warping (DTW) distance had been shown to be competitive to other state-of-the art time-series classifiers. In our research, however, we observed that a single fixed choice for the number of nearest neighbors k may lead to suboptimal performance. This is due to the complexity of time-series data, especially because the characteristic of the data may vary from region to region. Therefore, local adaptations of the classification algorithm is required. In order to address this problem in a principled way by, in this paper we introduce individual quality (IQ) estimation. This refers to estimating the expected classification accuracy for each time series and each k individually. Based on the IQ estimations we combine the classification results of several k-NN classifiers as final prediction. In our framework of IQ, we develop two time-series classification algorithms, IQ-MAX and IQ-WV. In our experiments on 35 commonly used benchmark data sets, we show that both IQ-MAX and IQ-WV outperform two baselines.
由于其广泛的应用,时间序列分类是数据挖掘和计算智能领域的一个重要研究课题。使用动态时间翘曲(DTW)距离的简单k-NN分类器已被证明与其他最先进的时间序列分类器具有竞争力。然而,在我们的研究中,我们观察到,对于最近邻居的数量k的单一固定选择可能会导致次优性能。这是由于时间序列数据的复杂性,特别是因为数据的特征可能因地区而异。因此,需要对分类算法进行局部适应。为了原则性地解决这一问题,本文引入了个体素质(IQ)估计。这是指分别估计每个时间序列和每个k的期望分类精度。在IQ估计的基础上,我们将几个k-NN分类器的分类结果结合起来作为最终的预测。在我们的IQ框架中,我们开发了IQ- max和IQ- wv两种时间序列分类算法。在我们对35个常用基准数据集的实验中,我们表明IQ-MAX和IQ-WV都优于两个基线。
{"title":"IQ estimation for accurate time-series classification","authors":"Krisztián Búza, A. Nanopoulos, L. Schmidt-Thieme","doi":"10.1109/CIDM.2011.5949441","DOIUrl":"https://doi.org/10.1109/CIDM.2011.5949441","url":null,"abstract":"Due to its various applications, time-series classification is a prominent research topic in data mining and computational intelligence. The simple k-NN classifier using dynamic time warping (DTW) distance had been shown to be competitive to other state-of-the art time-series classifiers. In our research, however, we observed that a single fixed choice for the number of nearest neighbors k may lead to suboptimal performance. This is due to the complexity of time-series data, especially because the characteristic of the data may vary from region to region. Therefore, local adaptations of the classification algorithm is required. In order to address this problem in a principled way by, in this paper we introduce individual quality (IQ) estimation. This refers to estimating the expected classification accuracy for each time series and each k individually. Based on the IQ estimations we combine the classification results of several k-NN classifiers as final prediction. In our framework of IQ, we develop two time-series classification algorithms, IQ-MAX and IQ-WV. In our experiments on 35 commonly used benchmark data sets, we show that both IQ-MAX and IQ-WV outperform two baselines.","PeriodicalId":211565,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132110255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
On the use of decision trees for ICU outcome prediction in sepsis patients treated with statins 决策树在他汀类药物治疗脓毒症患者ICU预后预测中的应用
Pub Date : 2011-04-11 DOI: 10.1109/CIDM.2011.5949439
V. Ribas, J. Lopez, J. Ruiz-Rodríguez, Adolf Ruiz-Sanmartin, J. Rello, A. Vellido
Sepsis is one of the main causes of death for noncoronary ICU (Intensive Care Unit) patients and has become the tenth most common cause of death in western societies. This is a transversal condition affecting immunocompromised patients, critically ill patients, post-surgery patients, patients with AIDS, and the elderly. In western countries, septic patients account for as much as 25% of ICU bed utilization and the pathology affects 1% – 2% of all hospitalizations. Its mortality rates range from 12.8% for sepsis to 45.7% for septic shock. Early administration of antibiotics is known to be crucial for ICU outcomes. In this regard, statins, a class of drug, have been shown to present good anti-inflammatory properties beyond their regulation of the biosynthesis of cholesterol. In this brief paper, we hypothesize that preadmission use of statins improves ICU outcomes. We test this hypothesis in a prospective study in patients admitted with severe sepsis and multiorgan failure at the ICU of Vall d' Hebron University Hospital (Barcelona, Spain), using statistic algebraic models and regression trees.
脓毒症是非冠状动脉重症监护病房(ICU)患者死亡的主要原因之一,在西方社会已成为第十大常见死因。这是一种横向疾病,影响免疫功能低下患者、危重患者、手术后患者、艾滋病患者和老年人。在西方国家,脓毒症患者占ICU床位使用率的25%,病理影响了所有住院患者的1% - 2%。其死亡率从败血症的12.8%到感染性休克的45.7%不等。众所周知,早期使用抗生素对ICU的预后至关重要。在这方面,他汀类药物已被证明具有良好的抗炎特性,而不仅仅是对胆固醇生物合成的调节。在这篇简短的文章中,我们假设入院前使用他汀类药物可以改善ICU的预后。我们在Vall d’Hebron大学医院(Barcelona, Spain) ICU收治的严重脓毒症和多器官功能衰竭患者中进行了一项前瞻性研究,使用统计代数模型和回归树来验证这一假设。
{"title":"On the use of decision trees for ICU outcome prediction in sepsis patients treated with statins","authors":"V. Ribas, J. Lopez, J. Ruiz-Rodríguez, Adolf Ruiz-Sanmartin, J. Rello, A. Vellido","doi":"10.1109/CIDM.2011.5949439","DOIUrl":"https://doi.org/10.1109/CIDM.2011.5949439","url":null,"abstract":"Sepsis is one of the main causes of death for noncoronary ICU (Intensive Care Unit) patients and has become the tenth most common cause of death in western societies. This is a transversal condition affecting immunocompromised patients, critically ill patients, post-surgery patients, patients with AIDS, and the elderly. In western countries, septic patients account for as much as 25% of ICU bed utilization and the pathology affects 1% – 2% of all hospitalizations. Its mortality rates range from 12.8% for sepsis to 45.7% for septic shock. Early administration of antibiotics is known to be crucial for ICU outcomes. In this regard, statins, a class of drug, have been shown to present good anti-inflammatory properties beyond their regulation of the biosynthesis of cholesterol. In this brief paper, we hypothesize that preadmission use of statins improves ICU outcomes. We test this hypothesis in a prospective study in patients admitted with severe sepsis and multiorgan failure at the ICU of Vall d' Hebron University Hospital (Barcelona, Spain), using statistic algebraic models and regression trees.","PeriodicalId":211565,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120947973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
About the analysis of time series with temporal association rule mining 关于时间序列分析的时间关联规则挖掘
Pub Date : 2011-04-11 DOI: 10.1109/CIDM.2011.5949303
Tim Schlüter, Stefan Conrad
This paper addresses the issue of analyzing time series with temporal association rule mining techniques. Since originally association rule mining was developed for the analysis of transactional data, as it occurs for instance in market basket analysis, algorithms and time series have to be adapted in order to apply these techniques gainfully to the analysis of time series in general. Continuous time series of different origins can be discretized in order to mine several temporal association rules, what reveals interesting coherences in one and between pairs of time series. Depending on the domain, the knowledge about these coherences can be used for several purposes, e.g. for the prediction of future values of time series. We present a short review on different standard and temporal association rule mining approaches and on approaches that apply association rule mining to time series analysis. In addition to that, we explain in detail how some of the most interesting kinds of temporal association rules can be mined from continuous time series and present an prototype implementation. We demonstrate and evaluate our implementation on two large datasets containing river level measurement and stock data.
本文讨论了用时序关联规则挖掘技术分析时间序列的问题。由于最初的关联规则挖掘是为分析事务数据而开发的,例如它发生在市场购物篮分析中,因此必须调整算法和时间序列,以便将这些技术有效地应用于一般的时间序列分析。不同起源的连续时间序列可以离散化,从而挖掘出若干时间关联规则,这些规则揭示了时间序列对之间的有趣的一致性。根据不同的领域,关于这些相干性的知识可以用于多种目的,例如用于预测时间序列的未来值。我们简要回顾了不同的标准和时态关联规则挖掘方法,以及将关联规则挖掘应用于时间序列分析的方法。除此之外,我们还详细解释了如何从连续时间序列中挖掘一些最有趣的时间关联规则,并给出了一个原型实现。我们在两个包含水位测量和库存数据的大型数据集上演示并评估了我们的实现。
{"title":"About the analysis of time series with temporal association rule mining","authors":"Tim Schlüter, Stefan Conrad","doi":"10.1109/CIDM.2011.5949303","DOIUrl":"https://doi.org/10.1109/CIDM.2011.5949303","url":null,"abstract":"This paper addresses the issue of analyzing time series with temporal association rule mining techniques. Since originally association rule mining was developed for the analysis of transactional data, as it occurs for instance in market basket analysis, algorithms and time series have to be adapted in order to apply these techniques gainfully to the analysis of time series in general. Continuous time series of different origins can be discretized in order to mine several temporal association rules, what reveals interesting coherences in one and between pairs of time series. Depending on the domain, the knowledge about these coherences can be used for several purposes, e.g. for the prediction of future values of time series. We present a short review on different standard and temporal association rule mining approaches and on approaches that apply association rule mining to time series analysis. In addition to that, we explain in detail how some of the most interesting kinds of temporal association rules can be mined from continuous time series and present an prototype implementation. We demonstrate and evaluate our implementation on two large datasets containing river level measurement and stock data.","PeriodicalId":211565,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131335548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
A framework for semi-automated process instance discovery from decorative attributes 用于从装饰性属性发现半自动化流程实例的框架
Pub Date : 2011-04-11 DOI: 10.1109/CIDM.2011.5949450
Andrea Burattin, R. Vigo
Process mining is a relatively new field of research: its final aim is to bridge the gap between data mining and business process modelling. In particular, the assumption underpinning this discipline is the availability of data coming from business process executions. In business process theory, once the process has been defined, it is possible to have a number of instances of the process running at the same time. Usually, the identification of different instances is referred to a specific “case id” field in the log exploited by process mining techniques. The software systems that support the execution of a business process, however, often do not record explicitly such information. This paper presents an approach that faces the absence of the “case id” information: we have a set of extra fields, decorating each single activity log, that are known to carry the information on the process instance. A framework is addressed, based on simple relational algebra notions, to extract the most promising case ids from the extra fields. The work is a generalization of a real business case.
流程挖掘是一个相对较新的研究领域:其最终目标是弥合数据挖掘和业务流程建模之间的差距。特别是,支撑这一原则的假设是来自业务流程执行的数据的可用性。在业务流程理论中,一旦定义了流程,就有可能同时运行多个流程实例。通常,不同实例的标识指的是流程挖掘技术利用的日志中的特定“case id”字段。然而,支持业务流程执行的软件系统通常不会明确地记录这些信息。本文提出了一种解决缺少“case id”信息的方法:我们有一组额外的字段,装饰每个单独的活动日志,已知这些字段携带流程实例上的信息。基于简单的关系代数概念,解决了一个框架,以便从额外字段中提取最有希望的case id。这项工作是对真实商业案例的概括。
{"title":"A framework for semi-automated process instance discovery from decorative attributes","authors":"Andrea Burattin, R. Vigo","doi":"10.1109/CIDM.2011.5949450","DOIUrl":"https://doi.org/10.1109/CIDM.2011.5949450","url":null,"abstract":"Process mining is a relatively new field of research: its final aim is to bridge the gap between data mining and business process modelling. In particular, the assumption underpinning this discipline is the availability of data coming from business process executions. In business process theory, once the process has been defined, it is possible to have a number of instances of the process running at the same time. Usually, the identification of different instances is referred to a specific “case id” field in the log exploited by process mining techniques. The software systems that support the execution of a business process, however, often do not record explicitly such information. This paper presents an approach that faces the absence of the “case id” information: we have a set of extra fields, decorating each single activity log, that are known to carry the information on the process instance. A framework is addressed, based on simple relational algebra notions, to extract the most promising case ids from the extra fields. The work is a generalization of a real business case.","PeriodicalId":211565,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122461430","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Geodesic distances for web document clustering web文档聚类的测地线距离
Pub Date : 2011-04-11 DOI: 10.1109/CIDM.2011.5949449
Selma Tekir, Florian Mansmann, D. Keim
While traditional distance measures are often capable of properly describing similarity between objects, in some application areas there is still potential to fine-tune these measures with additional information provided in the data sets. In this work we combine such traditional distance measures for document analysis with link information between documents to improve clustering results. In particular, we test the effectiveness of geodesic distances as similarity measures under the space assumption of spherical geometry in a 0-sphere. Our proposed distance measure is thus a combination of the cosine distance of the term-document matrix and some curvature values in the geodesic distance formula. To estimate these curvature values, we calculate clustering coefficient values for every document from the link graph of the data set and increase their distinctiveness by means of a heuristic as these clustering coefficient values are rough estimates of the curvatures. To evaluate our work, we perform clustering tests with the k-means algorithm on the English Wikipedia hyperlinked data set with both traditional cosine distance and our proposed geodesic distance. The effectiveness of our approach is measured by computing micro-precision values of the clusters based on the provided categorical information of each article.
虽然传统的距离测量通常能够正确地描述物体之间的相似性,但在某些应用领域,仍然有可能根据数据集中提供的额外信息对这些测量进行微调。在这项工作中,我们将这种传统的文档分析距离度量与文档之间的链接信息相结合,以提高聚类结果。特别地,我们在0球的球面几何空间假设下测试了测地线距离作为相似度量的有效性。因此,我们提出的距离度量是术语-文档矩阵的余弦距离和测地线距离公式中的一些曲率值的组合。为了估计这些曲率值,我们从数据集的链接图中计算每个文档的聚类系数值,并通过启发式方法增加它们的独特性,因为这些聚类系数值是曲率的粗略估计。为了评估我们的工作,我们使用k-means算法对英语维基百科超链接数据集进行聚类测试,该数据集具有传统的余弦距离和我们提出的测地线距离。我们的方法的有效性是通过基于每篇文章提供的分类信息计算聚类的微精度值来衡量的。
{"title":"Geodesic distances for web document clustering","authors":"Selma Tekir, Florian Mansmann, D. Keim","doi":"10.1109/CIDM.2011.5949449","DOIUrl":"https://doi.org/10.1109/CIDM.2011.5949449","url":null,"abstract":"While traditional distance measures are often capable of properly describing similarity between objects, in some application areas there is still potential to fine-tune these measures with additional information provided in the data sets. In this work we combine such traditional distance measures for document analysis with link information between documents to improve clustering results. In particular, we test the effectiveness of geodesic distances as similarity measures under the space assumption of spherical geometry in a 0-sphere. Our proposed distance measure is thus a combination of the cosine distance of the term-document matrix and some curvature values in the geodesic distance formula. To estimate these curvature values, we calculate clustering coefficient values for every document from the link graph of the data set and increase their distinctiveness by means of a heuristic as these clustering coefficient values are rough estimates of the curvatures. To evaluate our work, we perform clustering tests with the k-means algorithm on the English Wikipedia hyperlinked data set with both traditional cosine distance and our proposed geodesic distance. The effectiveness of our approach is measured by computing micro-precision values of the clusters based on the provided categorical information of each article.","PeriodicalId":211565,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124873197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Logistic sub-models for small size populations in credit scoring 信用评分中小群体Logistic子模型
Pub Date : 2011-04-11 DOI: 10.1109/CIDM.2011.5949425
Bouaguel Waad, F. Beninel, G. B. Mufti
The credit scoring risk management is a fast growing field due to consumer's credit requests. Credit requests, of new and existing customers, are often evaluated by classical discrimination rules based on customers information. However, these kinds of strategies have serious limits and don't take into account the characteristics difference between current customers and the future ones. The aim of this paper is to measure credit worthiness for non customers borrowers and to model potential risk given a heterogeneous population formed by borrowers customers of the bank and others who are not. We hold on previous works done in generalized discrimination and transpose them into the logistic model to bring out efficient discrimination rules for non customers' subpopulation. Therefore we obtain seven simple models of connection between parameters of both logistic models associated respectively to the two subpopulations. The German credit data set is selected as the experimental data to compare the seven models. Experimental results show that the use of links between the two subpopulations improve the classification accuracy for the new loan applicants.
信用评分风险管理是由于消费者的信用需求而迅速发展起来的领域。无论是新客户还是老客户的信用请求,通常都是通过基于客户信息的经典歧视规则来评估的。然而,这些策略有严重的局限性,并且没有考虑到当前客户和未来客户的特征差异。本文的目的是衡量非客户借款人的信用价值,并对由借款人、银行客户和其他非客户组成的异质人口构成的潜在风险进行建模。我们在继承前人在广义判别方面所做的工作的基础上,将其转化为logistic模型,提出了针对非顾客子群体的有效判别规则。因此,我们得到了分别与两个子总体相关的两个逻辑模型的参数之间的七个简单连接模型。选取德国信用数据集作为实验数据,对七个模型进行比较。实验结果表明,使用两个子种群之间的链接提高了新贷款申请人的分类精度。
{"title":"Logistic sub-models for small size populations in credit scoring","authors":"Bouaguel Waad, F. Beninel, G. B. Mufti","doi":"10.1109/CIDM.2011.5949425","DOIUrl":"https://doi.org/10.1109/CIDM.2011.5949425","url":null,"abstract":"The credit scoring risk management is a fast growing field due to consumer's credit requests. Credit requests, of new and existing customers, are often evaluated by classical discrimination rules based on customers information. However, these kinds of strategies have serious limits and don't take into account the characteristics difference between current customers and the future ones. The aim of this paper is to measure credit worthiness for non customers borrowers and to model potential risk given a heterogeneous population formed by borrowers customers of the bank and others who are not. We hold on previous works done in generalized discrimination and transpose them into the logistic model to bring out efficient discrimination rules for non customers' subpopulation. Therefore we obtain seven simple models of connection between parameters of both logistic models associated respectively to the two subpopulations. The German credit data set is selected as the experimental data to compare the seven models. Experimental results show that the use of links between the two subpopulations improve the classification accuracy for the new loan applicants.","PeriodicalId":211565,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128005799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Opening black box Data Mining models using Sensitivity Analysis 利用敏感性分析打开黑匣子数据挖掘模型
Pub Date : 2011-04-01 DOI: 10.1109/CIDM.2011.5949423
P. Cortez, M. Embrechts
There are several supervised learning Data Mining (DM) methods, such as Neural Networks (NN), Support Vector Machines (SVM) and ensembles, that often attain high quality predictions, although the obtained models are difficult to interpret by humans. In this paper, we open these black box DM models by using a novel visualization approach that is based on a Sensitivity Analysis (SA) method. In particular, we propose a Global SA (GSA), which extends the applicability of previous SA methods (e.g. to classification tasks), and several visualization techniques (e.g. variable effect characteristic curve), for assessing input relevance and effects on the model's responses. We show the GSA capabilities by conducting several experiments, using a NN ensemble and SVM model, in both synthetic and real-world datasets.
有几种监督学习数据挖掘(DM)方法,如神经网络(NN),支持向量机(SVM)和集成,通常可以获得高质量的预测,尽管获得的模型很难被人类解释。在本文中,我们使用一种新的基于灵敏度分析(SA)方法的可视化方法打开这些黑盒DM模型。特别地,我们提出了一个全局情景分析(GSA),它扩展了以前情景分析方法的适用性(例如分类任务),以及几种可视化技术(例如可变效果特征曲线),用于评估输入相关性和对模型响应的影响。我们通过在合成和真实数据集中使用NN集成和SVM模型进行几个实验来展示GSA的能力。
{"title":"Opening black box Data Mining models using Sensitivity Analysis","authors":"P. Cortez, M. Embrechts","doi":"10.1109/CIDM.2011.5949423","DOIUrl":"https://doi.org/10.1109/CIDM.2011.5949423","url":null,"abstract":"There are several supervised learning Data Mining (DM) methods, such as Neural Networks (NN), Support Vector Machines (SVM) and ensembles, that often attain high quality predictions, although the obtained models are difficult to interpret by humans. In this paper, we open these black box DM models by using a novel visualization approach that is based on a Sensitivity Analysis (SA) method. In particular, we propose a Global SA (GSA), which extends the applicability of previous SA methods (e.g. to classification tasks), and several visualization techniques (e.g. variable effect characteristic curve), for assessing input relevance and effects on the model's responses. We show the GSA capabilities by conducting several experiments, using a NN ensemble and SVM model, in both synthetic and real-world datasets.","PeriodicalId":211565,"journal":{"name":"2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2011-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115835704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 99
期刊
2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1