
Proceedings of the 23rd international conference on Machine learning: Latest Publications

Fast time series classification using numerosity reduction
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143974
X. Xi, Eamonn J. Keogh, C. Shelton, Li Wei, C. Ratanamahatana
Many algorithms have been proposed for the problem of time series classification. However, it is clear that one-nearest-neighbor with Dynamic Time Warping (DTW) distance is exceptionally difficult to beat. This approach has one weakness, however; it is computationally too demanding for many realtime applications. One way to mitigate this problem is to speed up the DTW calculations. Nonetheless, there is a limit to how much this can help. In this work, we propose an additional technique, numerosity reduction, to speed up one-nearest-neighbor DTW. While the idea of numerosity reduction for nearest-neighbor classifiers has a long history, we show here that we can leverage off an original observation about the relationship between dataset size and DTW constraints to produce an extremely compact dataset with little or no loss in accuracy. We test our ideas with a comprehensive set of experiments, and show that it can efficiently produce extremely fast accurate classifiers.
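To make the baseline concrete, the following is a rough sketch of one-nearest-neighbor classification under a warping-constrained DTW distance; the function names, the Sakoe-Chiba-style `window` parameter, and the toy data are illustrative assumptions, not the authors' code.

```python
import numpy as np

def dtw_distance(a, b, window=None):
    """DTW distance between 1-D series a and b, with an optional
    Sakoe-Chiba-style band `window` constraining the warping path."""
    n, m = len(a), len(b)
    w = max(window, abs(n - m)) if window is not None else max(n, m)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(max(1, i - w), min(m, i + w) + 1):
            cost = (a[i - 1] - b[j - 1]) ** 2
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return np.sqrt(D[n, m])

def one_nn_dtw(query, train_series, train_labels, window=None):
    """Label `query` with the class of its nearest training series under DTW."""
    dists = [dtw_distance(query, s, window) for s in train_series]
    return train_labels[int(np.argmin(dists))]

# toy usage: two training series, one noisy sine query
train = [np.sin(np.linspace(0, 6, 50)), np.linspace(0, 1, 50)]
labels = ["sine", "ramp"]
print(one_nn_dtw(np.sin(np.linspace(0, 6, 50)) + 0.05, train, labels, window=5))
```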
Citations: 646
Efficient co-regularised least squares regression
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143862
Ulf Brefeld, Thomas Gärtner, T. Scheffer, S. Wrobel
In many applications, unlabelled examples are inexpensive and easy to obtain. Semi-supervised approaches try to utilise such examples to reduce the predictive error. In this paper, we investigate a semi-supervised least squares regression algorithm based on the co-learning approach. Similar to other semi-supervised algorithms, our base algorithm has cubic runtime complexity in the number of unlabelled examples. To be able to handle larger sets of unlabelled examples, we devise a semi-parametric variant that scales linearly in the number of unlabelled examples. Experiments show a significant error reduction by co-regularisation and a large runtime improvement for the semi-parametric approximation. Last but not least, we propose a distributed procedure that can be applied without collecting all data at a single site.
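As a rough illustration of the co-regularisation idea (not the paper's closed-form or semi-parametric algorithm), the sketch below alternates ridge-style updates for two linear regressors on two feature views, coupled by a disagreement penalty on unlabelled points; all names and the update scheme are assumptions.

```python
import numpy as np

def coregularised_ls(X1_l, X2_l, y, X1_u, X2_u, nu=1.0, lam=1.0, iters=20):
    """Two linear regressors (one per view) fit by alternating closed-form
    ridge-style updates; `lam` penalises disagreement on unlabelled points."""
    d1, d2 = X1_l.shape[1], X2_l.shape[1]
    w1, w2 = np.zeros(d1), np.zeros(d2)
    for _ in range(iters):
        # view 1: fit the labels and view 2's current unlabelled predictions
        t2 = X2_u @ w2
        A = X1_l.T @ X1_l + nu * np.eye(d1) + lam * X1_u.T @ X1_u
        b = X1_l.T @ y + lam * X1_u.T @ t2
        w1 = np.linalg.solve(A, b)
        # view 2: the symmetric update
        t1 = X1_u @ w1
        A = X2_l.T @ X2_l + nu * np.eye(d2) + lam * X2_u.T @ X2_u
        b = X2_l.T @ y + lam * X2_u.T @ t1
        w2 = np.linalg.solve(A, b)
    return w1, w2
```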
Citations: 173
Collaborative prediction using ensembles of Maximum Margin Matrix Factorizations
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143876
D. DeCoste
Fast gradient-based methods for Maximum Margin Matrix Factorization (MMMF) were recently shown to have great promise (Rennie & Srebro, 2005), including significantly outperforming the previous state-of-the-art methods on some standard collaborative prediction benchmarks (including MovieLens). In this paper, we investigate ways to further improve the performance of MMMF, by casting it within an ensemble approach. We explore and evaluate a variety of alternative ways to define such ensembles. We show that our resulting ensembles can perform significantly better than a single MMMF model, along multiple evaluation metrics. In fact, we find that ensembles of partially trained MMMF models can sometimes even give better predictions in total training time comparable to a single MMMF model.
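Purely to illustrate the ensembling idea, the sketch below averages the reconstructions of several differently-seeded factorizations; a plain squared-loss factorization stands in for MMMF, so none of this reflects the paper's actual base model or training procedure.

```python
import numpy as np

def factorize(R, mask, rank=5, lr=0.01, steps=500, seed=0):
    """Gradient descent on squared error over observed entries (mask == 1)."""
    rng = np.random.default_rng(seed)
    U = rng.normal(scale=0.1, size=(R.shape[0], rank))
    V = rng.normal(scale=0.1, size=(R.shape[1], rank))
    for _ in range(steps):
        E = mask * (U @ V.T - R)        # residuals on observed entries only
        gU, gV = E @ V, E.T @ U
        U -= lr * gU
        V -= lr * gV
    return U @ V.T

def ensemble_predict(R, mask, n_models=5):
    """Average the reconstructions of differently-seeded factorizations."""
    return np.mean([factorize(R, mask, seed=s) for s in range(n_models)], axis=0)
```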
Citations: 103
An empirical comparison of supervised learning algorithms
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143865
R. Caruana, Alexandru Niculescu-Mizil
A number of supervised learning methods have been introduced in the last decade. Unfortunately, the last comprehensive empirical evaluation of supervised learning was the Statlog Project in the early 90's. We present a large-scale empirical comparison between ten supervised learning methods: SVMs, neural nets, logistic regression, naive bayes, memory-based learning, random forests, decision trees, bagged trees, boosted trees, and boosted stumps. We also examine the effect that calibrating the models via Platt Scaling and Isotonic Regression has on their performance. An important aspect of our study is the use of a variety of performance criteria to evaluate the learning methods.
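One of the calibration methods examined, Platt Scaling, can be sketched as fitting a logistic model that maps raw classifier scores to probabilities on a held-out set; the helper below is a generic illustration using scikit-learn, not the paper's exact procedure, and the variable names in the usage comment are hypothetical.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def platt_calibrate(scores_cal, y_cal):
    """Fit sigma(A * s + B) on held-out (score, label) pairs and return a
    function mapping new scores to calibrated probabilities."""
    lr = LogisticRegression()
    lr.fit(np.asarray(scores_cal).reshape(-1, 1), y_cal)
    return lambda s: lr.predict_proba(np.asarray(s).reshape(-1, 1))[:, 1]

# usage sketch (names hypothetical):
#   calibrate = platt_calibrate(model_scores_heldout, y_heldout)
#   probs = calibrate(model_scores_test)
```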
Citations: 2511
Algorithms for portfolio management based on the Newton method
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143846
A. Agarwal, Elad Hazan, Satyen Kale, R. Schapire
We experimentally study on-line investment algorithms first proposed by Agarwal and Hazan and extended by Hazan et al. which achieve almost the same wealth as the best constant-rebalanced portfolio determined in hindsight. These algorithms are the first to combine optimal logarithmic regret bounds with efficient deterministic computability. They are based on the Newton method for offline optimization which, unlike previous approaches, exploits second order information. After analyzing the algorithm using the potential function introduced by Agarwal and Hazan, we present extensive experiments on actual financial data. These experiments confirm the theoretical advantage of our algorithms, which yield higher returns and run considerably faster than previous algorithms with optimal regret. Additionally, we perform financial analysis using mean-variance calculations and the Sharpe ratio.
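The comparator mentioned in the abstract, the constant-rebalanced portfolio (CRP), has a simple wealth formula: holding fixed weights b and rebalancing every period multiplies wealth by the inner product of b with that period's price relatives. The sketch below computes only that benchmark quantity, with illustrative names, not the online algorithms themselves.

```python
import numpy as np

def crp_wealth(price_relatives, b):
    """price_relatives: (T, n) array, entry [t, i] = price_i(t+1) / price_i(t).
    b: fixed portfolio weights (non-negative, summing to one)."""
    returns = price_relatives @ np.asarray(b, dtype=float)  # per-period growth factors
    return float(np.prod(returns))

# e.g. crp_wealth(X, np.ones(n) / n) is the wealth of the uniform CRP on X.
```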
Citations: 212
Concept boundary detection for speeding up SVMs
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143930
Navneet Panda, E. Chang, Gang Wu
Support Vector Machines (SVMs) suffer from an O(n2) training cost, where n denotes the number of training instances. In this paper, we propose an algorithm to select boundary instances as training data to substantially reduce n. Our proposed algorithm is motivated by the result of (Burges, 1999) that, removing non-support vectors from the training set does not change SVM training results. Our algorithm eliminates instances that are likely to be non-support vectors. In the concept-independent preprocessing step of our algorithm, we prepare nearest-neighbor lists for training instances. In the concept-specific sampling step, we can then effectively select useful training data for each target concept. Empirical studies show our algorithm to be effective in reducing n, outperforming other competing downsampling algorithms without significantly compromising testing accuracy.
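A crude way to picture the instance-selection idea (not the paper's actual algorithm) is to keep only points whose nearest neighbours contain an opposite-class point and train the SVM on that subset; the sketch below uses scikit-learn, and the k parameter and function names are assumptions.

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors
from sklearn.svm import SVC

def boundary_subset(X, y, k=10):
    """Keep points whose k nearest neighbours include an opposite-class point,
    a crude proxy for 'likely support vectors'."""
    nn = NearestNeighbors(n_neighbors=k + 1).fit(X)
    _, idx = nn.kneighbors(X)            # idx[:, 0] is (usually) the point itself
    keep = np.array([np.any(y[idx[i, 1:]] != y[i]) for i in range(len(X))])
    return X[keep], y[keep]

def train_reduced_svm(X, y, k=10):
    Xb, yb = boundary_subset(X, y, k)
    return SVC(kernel="rbf").fit(Xb, yb)
```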
Citations: 59
Pachinko allocation: DAG-structured mixture models of topic correlations
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143917
Wei Li, A. McCallum
Latent Dirichlet allocation (LDA) and other related topic models are increasingly popular tools for summarization and manifold discovery in discrete data. However, LDA does not capture correlations between topics. In this paper, we introduce the pachinko allocation model (PAM), which captures arbitrary, nested, and possibly sparse correlations between topics using a directed acyclic graph (DAG). The leaves of the DAG represent individual words in the vocabulary, while each interior node represents a correlation among its children, which may be words or other interior nodes (topics). PAM provides a flexible alternative to recent work by Blei and Lafferty (2006), which captures correlations only between pairs of topics. Using text data from newsgroups, historic NIPS proceedings and other research paper corpora, we show improved performance of PAM in document classification, likelihood of held-out data, the ability to support finer-grained topics, and topical keyword coherence.
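As a rough picture of the DAG idea, the toy generator below samples one document from a four-level pachinko-allocation-style structure (root, super-topics, sub-topics, words); the hyperparameters, sizes, and per-document sampling of the word distributions are simplifications, not the paper's specification.

```python
import numpy as np

def sample_document(n_words, vocab_size=1000, n_super=5, n_sub=20,
                    alpha=0.1, beta=0.01, seed=0):
    rng = np.random.default_rng(seed)
    # sub-topic word distributions (conceptually corpus-wide; sampled here for the toy)
    phi = rng.dirichlet(beta * np.ones(vocab_size), size=n_sub)
    # document-specific multinomials: root over super-topics, each super-topic over sub-topics
    theta_root = rng.dirichlet(alpha * np.ones(n_super))
    theta_super = rng.dirichlet(alpha * np.ones(n_sub), size=n_super)
    words = []
    for _ in range(n_words):
        s = rng.choice(n_super, p=theta_root)                 # root -> super-topic
        t = rng.choice(n_sub, p=theta_super[s])               # super-topic -> sub-topic
        words.append(int(rng.choice(vocab_size, p=phi[t])))   # sub-topic -> word
    return words
```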
Citations: 703
Automatic basis function construction for approximate dynamic programming and reinforcement learning
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143901
Philipp W. Keller, Shie Mannor, Doina Precup
We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results by Bertsekas and Castañon (1989), who proposed a method for automatically aggregating states to speed up value iteration. We propose to use neighborhood component analysis (Goldberger et al., 2005), a dimensionality reduction technique created for supervised learning, in order to map a high-dimensional state space to a low-dimensional space, based on the Bellman error, or on the temporal difference (TD) error. We then place basis functions in the lower-dimensional space. These are added as new features for the linear function approximator. This approach is applied to a high-dimensional inventory control problem.
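The driving quantity here, the Bellman or TD error of a linear value function, is straightforward to compute from sampled transitions; the sketch below shows only that ingredient, with illustrative names, not the basis-construction procedure itself.

```python
import numpy as np

def td_errors(phi_s, rewards, phi_s_next, w, gamma=0.95):
    """One TD (Bellman) error per sampled transition for V(s) = phi(s) . w.
    phi_s, phi_s_next: (N, d) feature matrices; rewards: length-N vector."""
    v, v_next = phi_s @ w, phi_s_next @ w
    return rewards + gamma * v_next - v
```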
Citations: 182
Multiclass boosting with repartitioning
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143916
Ling Li
A multiclass classification problem can be reduced to a collection of binary problems with the aid of a coding matrix. The quality of the final solution, which is an ensemble of base classifiers learned on the binary problems, is affected by both the performance of the base learner and the error-correcting ability of the coding matrix. A coding matrix with strong error-correcting ability may not be overall optimal if the binary problems are too hard for the base learner. Thus a trade-off between error-correcting and base learning should be sought. In this paper, we propose a new multiclass boosting algorithm that modifies the coding matrix according to the learning ability of the base learner. We show experimentally that our algorithm is very efficient in optimizing the multiclass margin cost, and outperforms existing multiclass algorithms such as AdaBoost.ECC and one-vs-one. The improvement is especially significant when the base learner is not very powerful.
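The coding-matrix reduction that the paper starts from can be sketched as follows: each column of a matrix with entries in {+1, -1} defines one binary problem, and a test point is decoded to the class whose code row agrees with the most binary predictions. The base learner and matrix below are placeholders, and the sketch omits the proposed repartitioning step.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

def train_ecoc(X, y, code):
    """code: (n_classes, n_learners) matrix with +/-1 entries; one binary
    learner is trained per column, on labels relabelled by each class's code bit."""
    learners = []
    for j in range(code.shape[1]):
        yj = code[y, j]                   # +/-1 label for every example under column j
        learners.append(DecisionTreeClassifier(max_depth=3).fit(X, yj))
    return learners

def predict_ecoc(X, code, learners):
    preds = np.column_stack([clf.predict(X) for clf in learners])  # (N, n_learners)
    agreement = preds @ code.T            # higher = more bits agree with that class's row
    return np.argmax(agreement, axis=1)
```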
Citations: 58
Regression with the optimised combination technique
Pub Date : 2006-06-25 DOI: 10.1145/1143844.1143885
J. Garcke
We consider the sparse grid combination technique for regression, which we regard as a problem of function reconstruction in some given function space. We use a regularised least squares approach, discretised by sparse grids and solved using the so-called combination technique, where a certain sequence of conventional grids is employed. The sparse grid solution is then obtained by addition of the partial solutions with combination co-efficients dependent on the involved grids. This approach shows instabilities in certain situations and is not guaranteed to converge with higher discretisation levels. In this article we apply the recently introduced optimised combination technique, which repairs these instabilities. Now the combination coefficients also depend on the function to be reconstructed, resulting in a non-linear approximation method which achieves very competitive results. We show that the computational complexity of the improved method still scales only linear in regard to the number of data.
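For orientation, the classical (non-optimised) two-dimensional combination formula adds partial solutions on anisotropic grids of level (l1, l2) with coefficient +1 on the diagonal l1 + l2 = n and -1 on l1 + l2 = n - 1. The sketch below assumes a user-supplied solve_on_grid placeholder and does not implement the optimised, data-dependent coefficients the paper proposes.

```python
def combine_2d(solve_on_grid, n):
    """Classical 2-D combination: sum partial solutions with coefficient +1 on
    level pairs l1 + l2 = n and -1 on l1 + l2 = n - 1. `solve_on_grid(l1, l2)`
    is a placeholder for a regularised least squares fit on one anisotropic grid,
    returning something that can be summed (e.g. values on a common fine grid)."""
    combined = None
    for l1 in range(n + 1):
        for sign, l2 in ((+1, n - l1), (-1, n - 1 - l1)):
            if l2 < 0:
                continue
            u = sign * solve_on_grid(l1, l2)
            combined = u if combined is None else combined + u
    return combined
```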
Citations: 43