首页 > 最新文献

2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)最新文献

英文 中文
On the Use of Ontology as A Priori Knowledge into Constrained Clustering 本体作为先验知识在约束聚类中的应用
Hatim Chahdi, Nistor Grozavu, I. Mougenot, Laure Berti-Équille, Younès Bennani
Recent studies have shown that the use of a priori knowledge can significantly improve the results of unsupervised classification. However, capturing and formatting such knowledge as constraints is not only very expensive requiring the sustained involvement of an expert but it is also very difficult because some valuable information can be lost when it cannot be encoded as constraints. In this paper, we propose a new constraint-based clustering approach based on ontology reasoning for automatically generating constraints and bridging the semantic gap in satellite image labeling. The use of ontology as a priori knowledge has many advantages that we leverage in the context of satellite image interpretation. The experiments we conduct have shown that our proposed approach can deal with incomplete knowledge while completely exploiting the available one.
近年来的研究表明,使用先验知识可以显著提高无监督分类的结果。然而,捕获和格式化这些知识作为约束不仅非常昂贵,需要专家的持续参与,而且也非常困难,因为一些有价值的信息在不能编码为约束时可能会丢失。本文提出了一种基于本体推理的约束聚类方法,用于自动生成约束,弥合卫星图像标注中的语义鸿沟。本体作为先验知识的使用在卫星图像解译中具有许多优势。我们所做的实验表明,我们提出的方法可以处理不完整的知识,同时完全利用可用的知识。
{"title":"On the Use of Ontology as A Priori Knowledge into Constrained Clustering","authors":"Hatim Chahdi, Nistor Grozavu, I. Mougenot, Laure Berti-Équille, Younès Bennani","doi":"10.1109/DSAA.2016.72","DOIUrl":"https://doi.org/10.1109/DSAA.2016.72","url":null,"abstract":"Recent studies have shown that the use of a priori knowledge can significantly improve the results of unsupervised classification. However, capturing and formatting such knowledge as constraints is not only very expensive requiring the sustained involvement of an expert but it is also very difficult because some valuable information can be lost when it cannot be encoded as constraints. In this paper, we propose a new constraint-based clustering approach based on ontology reasoning for automatically generating constraints and bridging the semantic gap in satellite image labeling. The use of ontology as a priori knowledge has many advantages that we leverage in the context of satellite image interpretation. The experiments we conduct have shown that our proposed approach can deal with incomplete knowledge while completely exploiting the available one.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"132 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124511817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Causal Structure Learning with Reduced Partial Correlation Thresholding 减少部分相关阈值的因果结构学习
A. Sondhi, A. Shojaie
Directed acyclic graphs (DAGs) are commonly used to represent causal relations within a large number of random variables. Estimating DAGs from observational data is a difficult task, it is often impossible to uniquely determine edge direction. The skeleton of the graph, where directions are removed from edges, is often estimated instead. We consider the task of estimating the skeleton of a potentially high-dimensional DAG consisting of Gaussian random variables. A drawback of existing methods is that a prohibitively large number of conditional independence relations need to be tested for. By exploiting properties of common random graph families, we develop a new algorithm that requires conditioning only on small sets of variables. By extending previous theoretical results for undirected graphs to the setting of directed graphs, we prove the consistency of our algorithm, and demonstrate improvements over the state-of-the-art alternative in low and high-dimensional simulation settings. We conclude by applying our proposed algorithm on a real gene expression data set.
有向无环图(dag)通常用于表示大量随机变量之间的因果关系。从观测数据估计dag是一项困难的任务,通常不可能唯一地确定边缘方向。图的骨架,即从边缘上去除方向,通常是估计的。我们考虑估计由高斯随机变量组成的潜在高维DAG的骨架的任务。现有方法的一个缺点是需要测试大量的条件独立关系。通过利用常见的随机图族的性质,我们开发了一种新的算法,它只需要对小的变量集进行调节。通过将先前无向图的理论结果扩展到有向图的设置,我们证明了算法的一致性,并证明了在低维和高维模拟设置中优于最先进的替代方案的改进。最后,我们将提出的算法应用于真实的基因表达数据集。
{"title":"Causal Structure Learning with Reduced Partial Correlation Thresholding","authors":"A. Sondhi, A. Shojaie","doi":"10.1109/DSAA.2016.68","DOIUrl":"https://doi.org/10.1109/DSAA.2016.68","url":null,"abstract":"Directed acyclic graphs (DAGs) are commonly used to represent causal relations within a large number of random variables. Estimating DAGs from observational data is a difficult task, it is often impossible to uniquely determine edge direction. The skeleton of the graph, where directions are removed from edges, is often estimated instead. We consider the task of estimating the skeleton of a potentially high-dimensional DAG consisting of Gaussian random variables. A drawback of existing methods is that a prohibitively large number of conditional independence relations need to be tested for. By exploiting properties of common random graph families, we develop a new algorithm that requires conditioning only on small sets of variables. By extending previous theoretical results for undirected graphs to the setting of directed graphs, we prove the consistency of our algorithm, and demonstrate improvements over the state-of-the-art alternative in low and high-dimensional simulation settings. We conclude by applying our proposed algorithm on a real gene expression data set.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121507959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Query Sample Selection Bias on Information Retrieval System Ranking 查询样本选择偏差对信息检索系统排序的影响
M. Melucci
Information Retrieval (IR) effectiveness measures commonly assume that the experimental query sets consist of randomly drawn queries that represent the population of queries submitted to IR systems. In many practical situations, however, this assumption is violated, in a problem known as sample selection bias. It follows that the systems participating in evaluation campaigns are ranked by biased estimators of effectiveness. In this paper, we address the problem of query sample selection bias in machine learning terms and study experimentally how retrieval system rankings are affected by it. To this end, we apply a number of retrieval effectiveness measures and query probability estimation methods useful to correct sample selection bias. We report that the ranking of the most effective systems and that of the least effective systems is fairly affected by query sample selection bias, while the ranking of the average systems is much more affected. We also report that the measure of bias depends on the retrieval measure used to rank systems and eventually on the search task being evaluated.
信息检索(IR)有效性度量通常假设实验查询集由随机抽取的查询组成,这些查询代表提交给IR系统的查询的总体。然而,在许多实际情况下,这种假设是违反的,这是一个被称为样本选择偏差的问题。因此,参与评估活动的系统是由有偏见的有效性估计者进行排名的。在本文中,我们解决了机器学习术语中的查询样本选择偏差问题,并实验研究了它是如何影响检索系统排名的。为此,我们应用了一些有用的检索有效性度量和查询概率估计方法来纠正样本选择偏差。我们报告了最有效系统和最不有效系统的排名受到查询样本选择偏差的影响,而平均系统的排名受到更大的影响。我们还报告了偏差的度量取决于用于对系统进行排序的检索度量,并最终取决于正在评估的搜索任务。
{"title":"Impact of Query Sample Selection Bias on Information Retrieval System Ranking","authors":"M. Melucci","doi":"10.1109/DSAA.2016.43","DOIUrl":"https://doi.org/10.1109/DSAA.2016.43","url":null,"abstract":"Information Retrieval (IR) effectiveness measures commonly assume that the experimental query sets consist of randomly drawn queries that represent the population of queries submitted to IR systems. In many practical situations, however, this assumption is violated, in a problem known as sample selection bias. It follows that the systems participating in evaluation campaigns are ranked by biased estimators of effectiveness. In this paper, we address the problem of query sample selection bias in machine learning terms and study experimentally how retrieval system rankings are affected by it. To this end, we apply a number of retrieval effectiveness measures and query probability estimation methods useful to correct sample selection bias. We report that the ranking of the most effective systems and that of the least effective systems is fairly affected by query sample selection bias, while the ranking of the average systems is much more affected. We also report that the measure of bias depends on the retrieval measure used to rank systems and eventually on the search task being evaluated.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122733011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Trend Detection Based Regret Minimization for Bandit Problems 基于趋势检测的强盗问题后悔最小化
Paresh Nakhe, Rebecca Reiffenhäuser
We study a variation of the classical multi-armed bandits problem. In this problem, the learner has to make a sequence of decisions, picking from a fixed set of choices. In each round, she receives as feedback only the loss incurred from the chosen action. Conventionally, this problem has been studied when losses of the actions are drawn from an unknown distribution or when they are adversarial. In this paper, we study this problem when the losses of the actions also satisfy certain structural properties, and especially, do show a trend structure. When this is true, we show that using trend detection, we can achieve regret of order Õ (N √TK) with respect to a switching strategy for the version of the problem where a single action is chosen in each round and Õ (Nm √TK) when m actions are chosen each round. This guarantee is a significant improvement over the conventional benchmark. Our approach can, as a framework, be applied in combination with various well-known bandit algorithms, like Exp3. For both versions of the problem, we give regret guarantees also for the anytime setting, i.e. when length of the choice-sequence is not known in advance. Finally, we pinpoint the advantages of our method by comparing it to some well-known other strategies.
本文研究了经典多臂土匪问题的一种变体。在这个问题中,学习者必须做出一系列决定,从一组固定的选择中进行选择。在每一轮中,她只会收到选择行动所造成的损失的反馈。传统上,当行动的损失来自未知分布或它们是对抗性的时候,这个问题已经被研究过了。本文研究了当动作的损失也满足一定的结构性质,特别是表现为趋势结构时的问题。当这是正确的,我们表明使用趋势检测,我们可以在每轮选择一个动作的问题版本的切换策略中实现Õ (N√TK)阶的遗憾,在每轮选择m个动作的问题版本中实现Õ (Nm√TK)阶的遗憾。这种保证是对传统基准的重大改进。我们的方法可以作为一个框架,与各种知名的强盗算法(如Exp3)结合应用。对于这两个版本的问题,我们也给出了任何时间设置的遗憾保证,即当选择序列的长度事先未知时。最后,通过与其他一些知名策略的比较,我们指出了我们的方法的优势。
{"title":"Trend Detection Based Regret Minimization for Bandit Problems","authors":"Paresh Nakhe, Rebecca Reiffenhäuser","doi":"10.1109/DSAA.2016.35","DOIUrl":"https://doi.org/10.1109/DSAA.2016.35","url":null,"abstract":"We study a variation of the classical multi-armed bandits problem. In this problem, the learner has to make a sequence of decisions, picking from a fixed set of choices. In each round, she receives as feedback only the loss incurred from the chosen action. Conventionally, this problem has been studied when losses of the actions are drawn from an unknown distribution or when they are adversarial. In this paper, we study this problem when the losses of the actions also satisfy certain structural properties, and especially, do show a trend structure. When this is true, we show that using trend detection, we can achieve regret of order Õ (N √TK) with respect to a switching strategy for the version of the problem where a single action is chosen in each round and Õ (Nm √TK) when m actions are chosen each round. This guarantee is a significant improvement over the conventional benchmark. Our approach can, as a framework, be applied in combination with various well-known bandit algorithms, like Exp3. For both versions of the problem, we give regret guarantees also for the anytime setting, i.e. when length of the choice-sequence is not known in advance. Finally, we pinpoint the advantages of our method by comparing it to some well-known other strategies.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"112 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124125434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Uniqueness and Greedy Method for Quadratic Compressive Sensing 二次压缩感知的唯一性和贪心方法
Jun Fan, Lingchen Kong, Liqun Wang, N. Xiu
Quadratic compressive sensing, as a nonlinear extension of compressive sensing, has attracted considerable attention in optical image, X-ray crystallography, transmission electron microscopy, etc. We introduce the concept of uniform s-regularity to study the uniqueness in quadratic compressive sensing and propose a greedy algorithm for the corresponding numerical optimization. Moreover, we prove the convergence of the proposed algorithm under the uniform s-regularity condition. Finally, we present numerical results to demonstrate the efficiency of the proposed method.
二次压缩感知作为压缩感知的非线性扩展,在光学图像、x射线晶体学、透射电镜等领域引起了广泛的关注。我们引入一致s规则的概念来研究二次压缩感知的唯一性,并提出了一种贪心算法来进行相应的数值优化。此外,我们还证明了该算法在一致s规则条件下的收敛性。最后,给出了数值结果来验证所提方法的有效性。
{"title":"The Uniqueness and Greedy Method for Quadratic Compressive Sensing","authors":"Jun Fan, Lingchen Kong, Liqun Wang, N. Xiu","doi":"10.1109/DSAA.2016.94","DOIUrl":"https://doi.org/10.1109/DSAA.2016.94","url":null,"abstract":"Quadratic compressive sensing, as a nonlinear extension of compressive sensing, has attracted considerable attention in optical image, X-ray crystallography, transmission electron microscopy, etc. We introduce the concept of uniform s-regularity to study the uniqueness in quadratic compressive sensing and propose a greedy algorithm for the corresponding numerical optimization. Moreover, we prove the convergence of the proposed algorithm under the uniform s-regularity condition. Finally, we present numerical results to demonstrate the efficiency of the proposed method.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128032662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Exploratory Statistical Cusp Catastrophe Model 一个探索性的统计尖点突变模型
D. Chen, X. Chen, Kai Zhang
The Cusp Catastrophe Model provides a promising approach for health and behavioral researchers to investigate both continuous and quantum changes in one modeling framework. However, application of the model is hindered by unresolved issues around a statistical model fitting to the data. This paper reports our exploratory work in developing a new approach to statistical cusp catastrophe modeling. In this new approach, the Cusp Catastrophe Model is cast into a statistical nonlinear regression for parameter estimation. The algorithms of the delayed convention and Maxwell convention are applied to obtain parameter estimates using maximum likelihood estimation. Through a series of simulation studies, we demonstrate that (a) parameter estimation of this statistical cusp model is unbiased, and (b) use of a bootstrapping procedure enables efficient statistical inference. To test the utility of this new method, we analyze survey data collected for an NIH-funded project providing HIV-prevention education to adolescents in the Bahamas. We found that the results can be more reasonably explained by our approach than other existing methods. Additional research is needed to establish this new approach as the most reliable method for fitting the cusp catastrophe model. Further research should focus on additional theoretical analysis, extension of the model for analyzing categorical and counting data, and additional applications in analyzing different data types.
Cusp突变模型为健康和行为研究人员在一个建模框架内研究连续和量子变化提供了一种很有前途的方法。然而,模型的应用受到与数据拟合的统计模型周围未解决的问题的阻碍。本文报告了我们在开发统计尖点突变建模新方法方面的探索性工作。该方法将尖点突变模型转化为统计非线性回归模型进行参数估计。采用延迟约定算法和麦克斯韦约定算法,利用最大似然估计获得参数估计。通过一系列的仿真研究,我们证明(a)该统计尖点模型的参数估计是无偏的,(b)使用自举过程可以实现有效的统计推断。为了测试这种新方法的效用,我们分析了为美国国立卫生研究院资助的一个项目收集的调查数据,该项目向巴哈马的青少年提供艾滋病毒预防教育。我们发现用我们的方法可以比其他现有方法更合理地解释结果。要使这种新方法成为最可靠的尖突变模型拟合方法,还需要进一步的研究。进一步的研究应侧重于进一步的理论分析,扩展模型以分析分类和计数数据,以及在分析不同数据类型方面的其他应用。
{"title":"An Exploratory Statistical Cusp Catastrophe Model","authors":"D. Chen, X. Chen, Kai Zhang","doi":"10.1109/DSAA.2016.17","DOIUrl":"https://doi.org/10.1109/DSAA.2016.17","url":null,"abstract":"The Cusp Catastrophe Model provides a promising approach for health and behavioral researchers to investigate both continuous and quantum changes in one modeling framework. However, application of the model is hindered by unresolved issues around a statistical model fitting to the data. This paper reports our exploratory work in developing a new approach to statistical cusp catastrophe modeling. In this new approach, the Cusp Catastrophe Model is cast into a statistical nonlinear regression for parameter estimation. The algorithms of the delayed convention and Maxwell convention are applied to obtain parameter estimates using maximum likelihood estimation. Through a series of simulation studies, we demonstrate that (a) parameter estimation of this statistical cusp model is unbiased, and (b) use of a bootstrapping procedure enables efficient statistical inference. To test the utility of this new method, we analyze survey data collected for an NIH-funded project providing HIV-prevention education to adolescents in the Bahamas. We found that the results can be more reasonably explained by our approach than other existing methods. Additional research is needed to establish this new approach as the most reliable method for fitting the cusp catastrophe model. Further research should focus on additional theoretical analysis, extension of the model for analyzing categorical and counting data, and additional applications in analyzing different data types.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130345474","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Robust Online Time Series Prediction with Recurrent Neural Networks 基于循环神经网络的鲁棒在线时间序列预测
T. Guo, Zhao Xu, X. Yao, Hai-Ming Chen, K. Aberer, K. Funaya
Time series forecasting for streaming data plays an important role in many real applications, ranging from IoT systems, cyber-networks, to industrial systems and healthcare. However the real data is often complicated with anomalies and change points, which can lead the learned models deviating from the underlying patterns of the time series, especially in the context of online learning mode. In this paper we present an adaptive gradient learning method for recurrent neural networks (RNN) to forecast streaming time series in the presence of anomalies and change points. We explore the local features of time series to automatically weight the gradients of the loss of the newly available observations with distributional properties of the data in real time. We perform extensive experimental analysis on both synthetic and real datasets to evaluate the performance of the proposed method.
流数据的时间序列预测在许多实际应用中发挥着重要作用,从物联网系统、网络网络到工业系统和医疗保健。然而,真实数据往往是复杂的,有异常和变化点,这可能导致学习模型偏离时间序列的基本模式,特别是在在线学习模式的背景下。本文提出了一种用于递归神经网络(RNN)的自适应梯度学习方法来预测存在异常和变化点的流时间序列。我们利用时间序列的局部特征,实时地将新观测值损失的梯度与数据的分布特性自动加权。我们对合成和真实数据集进行了广泛的实验分析,以评估所提出方法的性能。
{"title":"Robust Online Time Series Prediction with Recurrent Neural Networks","authors":"T. Guo, Zhao Xu, X. Yao, Hai-Ming Chen, K. Aberer, K. Funaya","doi":"10.1109/DSAA.2016.92","DOIUrl":"https://doi.org/10.1109/DSAA.2016.92","url":null,"abstract":"Time series forecasting for streaming data plays an important role in many real applications, ranging from IoT systems, cyber-networks, to industrial systems and healthcare. However the real data is often complicated with anomalies and change points, which can lead the learned models deviating from the underlying patterns of the time series, especially in the context of online learning mode. In this paper we present an adaptive gradient learning method for recurrent neural networks (RNN) to forecast streaming time series in the presence of anomalies and change points. We explore the local features of time series to automatically weight the gradients of the loss of the newly available observations with distributional properties of the data in real time. We perform extensive experimental analysis on both synthetic and real datasets to evaluate the performance of the proposed method.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"206 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133365515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 121
Online Collaborative Prediction of Regional Vote Results 区域投票结果的在线协同预测
Vincent Etter, M. E. Khan, M. Grossglauser, Patrick Thiran
We consider online predictions of vote results, where regions across a country vote on an issue under discussion. Such online predictions before and during the day of the vote are useful to media agencies, polling institutes, and political parties, e.g., to identify regions that are crucial in determining the national outcome of a vote. We analyze a unique dataset from Switzerland. The dataset contains 281 votes from 2352 regions over a period of 34 years. We make several contributions towards improving online predictions. First, we show that these votes exhibit a bi-clustering of the vote results, i.e., regions that are spatially close tend to vote similarly, and issues that discuss similar topics show similar global voting patterns. Second, we develop models that can exploit this bi-clustering, as well as the features associated with the votes and regions. Third, we show that, when combining vote results and features together, Bayesian methods are essential to obtaining good performance. Our results show that Bayesian methods give better estimates of the hyperparameters than non-Bayesian methods such as cross-validation. The resulting models generalize well to many different tasks, produce robust predictions, and are easily interpretable.
我们考虑对投票结果的在线预测,即全国各地对正在讨论的问题进行投票。这种在线预测在投票前和投票当天对媒体机构、民调机构和政党都很有用,例如,确定对决定全国投票结果至关重要的地区。我们分析了一个来自瑞士的独特数据集。该数据集包含了34年间2352个地区的281张选票。我们为改进在线预测做出了一些贡献。首先,我们发现这些投票表现出投票结果的双聚类,即空间上接近的区域倾向于投票相似,讨论类似主题的问题表现出类似的全球投票模式。其次,我们开发了可以利用这种双聚类的模型,以及与投票和地区相关的特征。第三,我们表明,当将投票结果和特征结合在一起时,贝叶斯方法对于获得良好的性能至关重要。我们的结果表明,贝叶斯方法比非贝叶斯方法(如交叉验证)给出了更好的超参数估计。由此产生的模型可以很好地推广到许多不同的任务,产生可靠的预测,并且易于解释。
{"title":"Online Collaborative Prediction of Regional Vote Results","authors":"Vincent Etter, M. E. Khan, M. Grossglauser, Patrick Thiran","doi":"10.1109/DSAA.2016.31","DOIUrl":"https://doi.org/10.1109/DSAA.2016.31","url":null,"abstract":"We consider online predictions of vote results, where regions across a country vote on an issue under discussion. Such online predictions before and during the day of the vote are useful to media agencies, polling institutes, and political parties, e.g., to identify regions that are crucial in determining the national outcome of a vote. We analyze a unique dataset from Switzerland. The dataset contains 281 votes from 2352 regions over a period of 34 years. We make several contributions towards improving online predictions. First, we show that these votes exhibit a bi-clustering of the vote results, i.e., regions that are spatially close tend to vote similarly, and issues that discuss similar topics show similar global voting patterns. Second, we develop models that can exploit this bi-clustering, as well as the features associated with the votes and regions. Third, we show that, when combining vote results and features together, Bayesian methods are essential to obtaining good performance. Our results show that Bayesian methods give better estimates of the hyperparameters than non-Bayesian methods such as cross-validation. The resulting models generalize well to many different tasks, produce robust predictions, and are easily interpretable.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"83 S366","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132226896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Hyperparameter Optimization Machines 超参数优化机
Martin Wistuba, Nicolas Schilling, L. Schmidt-Thieme
Algorithm selection and hyperparameter tuning are omnipresent problems for researchers and practitioners. Hence, it is not surprising that the efforts in automatizing this process using various meta-learning approaches have been increased. Sequential model-based optimization (SMBO) is ne of the most popular frameworks for finding optimal hyperparameter configurations. Originally designed for black-box optimization, researchers have contributed different meta-learning approaches to speed up the optimization process. We create a generalized framework of SMBO and its recent additions which gives access to adaptive hyperparameter transfer learning with simple surrogates (AHT), a new class of hyperparameter optimization strategies. AHT provides less time-overhead for the optimization process by replacing time-and space-consuming transfer surrogate models with simple surrogates that employ adaptive transfer learning. In an empirical comparison on two different meta-data sets, we can show that AHT outperforms various instances of the SMBO framework in the scenarios of hyperparameter tuning and algorithm selection.
算法选择和超参数调优是研究人员和实践者普遍存在的问题。因此,使用各种元学习方法使这一过程自动化的努力已经增加,这并不奇怪。基于序列模型的优化(SMBO)是寻找最优超参数配置的最流行框架之一。最初设计用于黑盒优化,研究人员贡献了不同的元学习方法来加速优化过程。我们创建了一个广义的SMBO框架及其最近添加的内容,该框架提供了使用简单代理(AHT)的自适应超参数迁移学习,这是一类新的超参数优化策略。AHT通过使用使用自适应迁移学习的简单代理取代耗时和占用空间的迁移代理模型,为优化过程提供了更少的时间开销。在两个不同元数据集的经验比较中,我们可以证明AHT在超参数调优和算法选择的情况下优于SMBO框架的各种实例。
{"title":"Hyperparameter Optimization Machines","authors":"Martin Wistuba, Nicolas Schilling, L. Schmidt-Thieme","doi":"10.1109/DSAA.2016.12","DOIUrl":"https://doi.org/10.1109/DSAA.2016.12","url":null,"abstract":"Algorithm selection and hyperparameter tuning are omnipresent problems for researchers and practitioners. Hence, it is not surprising that the efforts in automatizing this process using various meta-learning approaches have been increased. Sequential model-based optimization (SMBO) is ne of the most popular frameworks for finding optimal hyperparameter configurations. Originally designed for black-box optimization, researchers have contributed different meta-learning approaches to speed up the optimization process. We create a generalized framework of SMBO and its recent additions which gives access to adaptive hyperparameter transfer learning with simple surrogates (AHT), a new class of hyperparameter optimization strategies. AHT provides less time-overhead for the optimization process by replacing time-and space-consuming transfer surrogate models with simple surrogates that employ adaptive transfer learning. In an empirical comparison on two different meta-data sets, we can show that AHT outperforms various instances of the SMBO framework in the scenarios of hyperparameter tuning and algorithm selection.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"244 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114440240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Correcting Relational Bias to Improve Classification in Sparsely-Labeled Networks 修正关系偏差以改善稀疏标记网络的分类
J. R. King, Luke K. McDowell
Many classification problems involve nodes that have a natural connection between them, such as links between people, pages, or social network accounts. Recent work has demonstrated how to learn relational dependencies from these links, then leverage them as predictive features. However, while this can often improve accuracy, the use of linked information can also lead to cascading prediction errors, especially in the common-case when a network is only sparsely-labeled. In response, this paper examines several existing and new methods for correcting the "relational bias" that leads to such errors. First, we explain how existing approaches can be divided into "resemblance-based" and "assignment-based" methods, and provide the first experimental comparison between them. We demonstrate that all of these methods can improve accuracy, but that the former type typically leads to better accuracy. Moreover, we show that the more flexible methods typically perform best, motivating a new assignment-based method that often improves accuracy vs. a more rigid method. In addition, we demonstrate for the first time that some of these methods can also improve accuracy when combined with Gibbs sampling for inference. However, we show that, with Gibbs, correcting relational bias also requires improving label initialization, and present two new initialization methods that yield large accuracy gains. Finally, we evaluate the effects of relational bias when "neighbor attributes," recently-proposed additions that can provide more stability during inference, are included as model features. We show that such attributes reduce the negative impact of bias, but that using some form of bias correction remains important for achieving maximal accuracy.
许多分类问题涉及节点之间具有自然连接,例如人、页面或社交网络帐户之间的链接。最近的工作演示了如何从这些链接中学习关系依赖,然后利用它们作为预测特性。然而,虽然这通常可以提高准确性,但链接信息的使用也可能导致级联预测错误,特别是在网络只有稀疏标记的常见情况下。作为回应,本文研究了几种现有的和新的方法来纠正导致这种错误的“关系偏差”。首先,我们解释了现有的方法如何分为“基于相似性”和“基于作业”的方法,并提供了它们之间的第一个实验比较。我们证明了所有这些方法都可以提高准确性,但前一种方法通常会带来更好的准确性。此外,我们表明,更灵活的方法通常表现最好,激发了一种新的基于作业的方法,这种方法通常比更严格的方法提高准确性。此外,我们首次证明了其中一些方法在与Gibbs抽样相结合进行推理时也可以提高准确性。然而,我们表明,通过Gibbs,纠正关系偏差也需要改进标签初始化,并提出了两种新的初始化方法,可以产生很大的精度增益。最后,我们评估了当“邻居属性”(最近提出的可以在推理过程中提供更多稳定性的添加)作为模型特征包含时关系偏差的影响。我们表明,这些属性减少了偏差的负面影响,但使用某种形式的偏差校正对于实现最大精度仍然很重要。
{"title":"Correcting Relational Bias to Improve Classification in Sparsely-Labeled Networks","authors":"J. R. King, Luke K. McDowell","doi":"10.1109/DSAA.2016.11","DOIUrl":"https://doi.org/10.1109/DSAA.2016.11","url":null,"abstract":"Many classification problems involve nodes that have a natural connection between them, such as links between people, pages, or social network accounts. Recent work has demonstrated how to learn relational dependencies from these links, then leverage them as predictive features. However, while this can often improve accuracy, the use of linked information can also lead to cascading prediction errors, especially in the common-case when a network is only sparsely-labeled. In response, this paper examines several existing and new methods for correcting the \"relational bias\" that leads to such errors. First, we explain how existing approaches can be divided into \"resemblance-based\" and \"assignment-based\" methods, and provide the first experimental comparison between them. We demonstrate that all of these methods can improve accuracy, but that the former type typically leads to better accuracy. Moreover, we show that the more flexible methods typically perform best, motivating a new assignment-based method that often improves accuracy vs. a more rigid method. In addition, we demonstrate for the first time that some of these methods can also improve accuracy when combined with Gibbs sampling for inference. However, we show that, with Gibbs, correcting relational bias also requires improving label initialization, and present two new initialization methods that yield large accuracy gains. Finally, we evaluate the effects of relational bias when \"neighbor attributes,\" recently-proposed additions that can provide more stability during inference, are included as model features. We show that such attributes reduce the negative impact of bias, but that using some form of bias correction remains important for achieving maximal accuracy.","PeriodicalId":193885,"journal":{"name":"2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115770200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1