首页 > 最新文献

2010 Ninth International Conference on Machine Learning and Applications最新文献

英文 中文
Speeding Up Greedy Forward Selection for Regularized Least-Squares 加速正则化最小二乘的贪婪正向选择
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.55
T. Pahikkala, A. Airola, T. Salakoski
We propose a novel algorithm for greedy forward feature selection for regularized least-squares (RLS) regression and classification, also known as the least-squares support vector machine or ridge regression. The algorithm, which we call greedy RLS, starts from the empty feature set, and on each iteration adds the feature whose addition provides the best leave-one-out cross-validation performance. Our method is considerably faster than the previously proposed ones, since its time complexity is linear in the number of training examples, the number of features in the original data set, and the desired size of the set of selected features. Therefore, as a side effect we obtain a new training algorithm for learning sparse linear RLS predictors which can be used for large scale learning. This speed is possible due to matrix calculus based short-cuts for leave-one-out and feature addition. We experimentally demonstrate the scalability of our algorithm compared to previously proposed implementations.
我们提出了一种用于正则化最小二乘(RLS)回归和分类的贪婪前向特征选择算法,也称为最小二乘支持向量机或脊回归。我们称之为贪婪RLS的算法从空特征集开始,并在每次迭代中添加特征,这些特征的添加提供了最佳的留一交叉验证性能。我们的方法比之前提出的方法要快得多,因为它的时间复杂度在训练样例的数量、原始数据集中的特征数量和所选特征集的期望大小之间是线性的。因此,作为一种副作用,我们获得了一种新的训练算法,用于学习稀疏线性RLS预测器,可用于大规模学习。这种速度是可能的,因为基于矩阵演算的略去和特征添加的捷径。与之前提出的实现相比,我们通过实验证明了算法的可扩展性。
{"title":"Speeding Up Greedy Forward Selection for Regularized Least-Squares","authors":"T. Pahikkala, A. Airola, T. Salakoski","doi":"10.1109/ICMLA.2010.55","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.55","url":null,"abstract":"We propose a novel algorithm for greedy forward feature selection for regularized least-squares (RLS) regression and classification, also known as the least-squares support vector machine or ridge regression. The algorithm, which we call greedy RLS, starts from the empty feature set, and on each iteration adds the feature whose addition provides the best leave-one-out cross-validation performance. Our method is considerably faster than the previously proposed ones, since its time complexity is linear in the number of training examples, the number of features in the original data set, and the desired size of the set of selected features. Therefore, as a side effect we obtain a new training algorithm for learning sparse linear RLS predictors which can be used for large scale learning. This speed is possible due to matrix calculus based short-cuts for leave-one-out and feature addition. We experimentally demonstrate the scalability of our algorithm compared to previously proposed implementations.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132189739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
A Parallel Algorithm for Predicting the Secondary Structure of Polycistronic MicroRNAs 预测多顺反子microrna二级结构的并行算法
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.80
Dianwei Han, G. Tang, Jun Zhang
MicroRNAs (miRNAs) are newly discovered endogenous small non-coding RNAs (21-25nt) that target their complementary gene transcripts for degradation or translational repression. The biogenesis of a functional miRNA is largely dependent on the secondary structure of the miRNA precursor (pre-miRNA). Recently, it has been shown that miRNAs are present in the genome as the form of polycistronic transcriptional units in plants and animals. It will be important to design methods to predict such structures for miRNA discovery and its applications in gene silencing. In this paper, we propose a parallel algorithm based on the master-slave architecture to predict the secondary structure from an input sequence. First, the master processor partitions the input sequence into subsequences and distributes them to the slave processors. The slave processors will then predict the secondary structure based on their individual task. Afterward, the slave processors will return their results to the master processor. Finally, the master processor will merge the partial structures from the slave processors into a whole candidate secondary structure. The optimal structure is obtained by sorting the candidate structures according to their scores. Our experimental results indicate that the actual speed-ups match the trend of theoretic values.
MicroRNAs (miRNAs)是新发现的内源性小非编码rna (21-25nt),其靶向其互补基因转录物进行降解或翻译抑制。功能性miRNA的生物发生在很大程度上取决于miRNA前体(pre-miRNA)的二级结构。近年来,已有研究表明,mirna以多顺反子转录单位的形式存在于植物和动物基因组中。设计预测这些结构的方法对于miRNA的发现及其在基因沉默中的应用具有重要意义。本文提出了一种基于主从结构的并行算法,用于从输入序列中预测二级结构。首先,主处理器将输入序列划分为子序列,并将其分发给从处理器。然后,从处理器将根据各自的任务预测二级结构。然后,从处理器将它们的结果返回给主处理器。最后,主处理器将从处理器的部分结构合并为一个完整的候选二级结构。根据候选结构的得分对候选结构进行排序,得到最优结构。实验结果表明,实际加速速度与理论值的趋势相吻合。
{"title":"A Parallel Algorithm for Predicting the Secondary Structure of Polycistronic MicroRNAs","authors":"Dianwei Han, G. Tang, Jun Zhang","doi":"10.1109/ICMLA.2010.80","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.80","url":null,"abstract":"MicroRNAs (miRNAs) are newly discovered endogenous small non-coding RNAs (21-25nt) that target their complementary gene transcripts for degradation or translational repression. The biogenesis of a functional miRNA is largely dependent on the secondary structure of the miRNA precursor (pre-miRNA). Recently, it has been shown that miRNAs are present in the genome as the form of polycistronic transcriptional units in plants and animals. It will be important to design methods to predict such structures for miRNA discovery and its applications in gene silencing. In this paper, we propose a parallel algorithm based on the master-slave architecture to predict the secondary structure from an input sequence. First, the master processor partitions the input sequence into subsequences and distributes them to the slave processors. The slave processors will then predict the secondary structure based on their individual task. Afterward, the slave processors will return their results to the master processor. Finally, the master processor will merge the partial structures from the slave processors into a whole candidate secondary structure. The optimal structure is obtained by sorting the candidate structures according to their scores. Our experimental results indicate that the actual speed-ups match the trend of theoretic values.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132600453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Extreme Volume Detection for Managed Print Services 托管打印服务的极限体积检测
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.95
J. Handley, Marie-Luise Schneider, Victor Ciriza, J. Earl
A managed print service (MPS) manages the printing, scanning and facsimile devices in an enterprise to control cost and improve availability. Services include supplies replenishment, maintenance, repair, and use reporting. Customers are billed per page printed. Data are collected from a network of devices to facilitate management. The number of pages printed per device must be accurately counted to fairly bill the customer. Software errors, hardware changes, repairs, and human error all contribute to “meter reads” that are exceptionally high and are apt to be challenged by the customer were they to be billed. Account managers periodically review data for each device in an account. This process is tedious and time consuming and an automated solution is desired. Exceptional print volumes are not always salient and detecting them statistically is prone to errors owing to nonstationarity of the data. Mean levels and variances change over time and usage is highly auto correlated which precludes simple detection methods based on deviations from an average background. A solution must also be computationally inexpensive and require little auxiliary storage because hundreds of thousands of streams of device data must be processed. We present an algorithm and system for online detection of extreme print volumes that uses dynamic linear models (DLM) with variance learning. A DLM is a state space time series model comprising a random mean level system process and a random observation process. Both components are updated using Bayesian statistics. After each update, a forecasted value and its estimated variance are calculated. A read is flagged as exceptionally high if its value is highly unlikely with respect to a forecasted value and its standard deviation. We provide implementation details and results of a field test in which error rate was decreased from 26.4% to 0.5% on 728 observed meter reads.
管理打印服务(MPS)管理企业中的打印、扫描和传真设备,以控制成本并提高可用性。服务包括补给品、维护、修理和使用报告。客户按打印页数收费。从设备网络中收集数据,方便管理。必须准确计算每台设备打印的页数,以便公平地向客户收费。软件错误、硬件更改、维修和人为错误都会导致“仪表读数”异常高,并且很容易受到客户的质疑。客户经理定期审查客户中每个设备的数据。这个过程冗长且耗时,需要一个自动化的解决方案。由于数据的非平稳性,异常的印刷量并不总是显著的,并且在统计上检测它们容易出错。平均水平和方差随时间变化,使用情况高度自相关,这使得基于平均背景偏差的简单检测方法无法实现。解决方案还必须在计算上便宜,并且需要很少的辅助存储,因为必须处理数十万个设备数据流。我们提出了一种使用动态线性模型(DLM)和方差学习的在线检测极端打印量的算法和系统。DLM是一个由随机平均水平系统过程和随机观测过程组成的状态空间时间序列模型。这两个组件都使用贝叶斯统计更新。每次更新后,计算预测值及其估计方差。如果读数的值相对于预测值及其标准偏差极不可能,则将其标记为异常高。我们提供了现场测试的实施细节和结果,在728个观察到的仪表读数中,错误率从26.4%下降到0.5%。
{"title":"Extreme Volume Detection for Managed Print Services","authors":"J. Handley, Marie-Luise Schneider, Victor Ciriza, J. Earl","doi":"10.1109/ICMLA.2010.95","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.95","url":null,"abstract":"A managed print service (MPS) manages the printing, scanning and facsimile devices in an enterprise to control cost and improve availability. Services include supplies replenishment, maintenance, repair, and use reporting. Customers are billed per page printed. Data are collected from a network of devices to facilitate management. The number of pages printed per device must be accurately counted to fairly bill the customer. Software errors, hardware changes, repairs, and human error all contribute to “meter reads” that are exceptionally high and are apt to be challenged by the customer were they to be billed. Account managers periodically review data for each device in an account. This process is tedious and time consuming and an automated solution is desired. Exceptional print volumes are not always salient and detecting them statistically is prone to errors owing to nonstationarity of the data. Mean levels and variances change over time and usage is highly auto correlated which precludes simple detection methods based on deviations from an average background. A solution must also be computationally inexpensive and require little auxiliary storage because hundreds of thousands of streams of device data must be processed. We present an algorithm and system for online detection of extreme print volumes that uses dynamic linear models (DLM) with variance learning. A DLM is a state space time series model comprising a random mean level system process and a random observation process. Both components are updated using Bayesian statistics. After each update, a forecasted value and its estimated variance are calculated. A read is flagged as exceptionally high if its value is highly unlikely with respect to a forecasted value and its standard deviation. We provide implementation details and results of a field test in which error rate was decreased from 26.4% to 0.5% on 728 observed meter reads.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"212 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133610229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Improved Unsupervised Clustering over Watershed-Based Clustering 基于分水岭聚类的改进无监督聚类
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.44
Sai Venu Gopal Lolla, L. L. Hoberock
This paper improves upon an existing Watershed algorithm-based clustering method. The existing method uses an experimentally determined parameter to construct a density function. A better method for evaluating the cell/window size (used in the construction of the density function) is proposed, eliminating the need for arbitrary parameters. The algorithm has been tested on both published and unpublished synthetic data, and the results demonstrate that the proposed approach is able to accurately estimate the number of clusters present in the data.
本文改进了现有的基于Watershed算法的聚类方法。现有的方法使用实验确定的参数来构造密度函数。提出了一种更好的方法来评估单元/窗口大小(用于密度函数的构造),消除了对任意参数的需要。该算法在已发表和未发表的合成数据上进行了测试,结果表明该方法能够准确地估计数据中存在的聚类数量。
{"title":"Improved Unsupervised Clustering over Watershed-Based Clustering","authors":"Sai Venu Gopal Lolla, L. L. Hoberock","doi":"10.1109/ICMLA.2010.44","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.44","url":null,"abstract":"This paper improves upon an existing Watershed algorithm-based clustering method. The existing method uses an experimentally determined parameter to construct a density function. A better method for evaluating the cell/window size (used in the construction of the density function) is proposed, eliminating the need for arbitrary parameters. The algorithm has been tested on both published and unpublished synthetic data, and the results demonstrate that the proposed approach is able to accurately estimate the number of clusters present in the data.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"80 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121330345","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Parallel Projections for Manifold Learning 流形学习的并行投影
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.54
H. Strange, R. Zwiggelaar
Manifold learning is a widely used statistical tool which reduces the dimensionality of a data set while aiming to maintain both local and global properties of the data. We present a novel manifold learning technique which aligns local hyper planes to build a global representation of the data. A Minimum Spanning Tree provides the skeleton needed to traverse the manifold so that the local hyper planes can be merged using parallel projections to build a global hyper plane of the data. We show state of the art results when compared against existing manifold learning algorithm on both artificial and real world image data.
流形学习是一种广泛使用的统计工具,它可以降低数据集的维数,同时保持数据的局部和全局属性。我们提出了一种新的流形学习技术,通过对齐局部超平面来构建数据的全局表示。最小生成树提供了遍历流形所需的骨架,以便使用并行投影合并局部超平面以构建数据的全局超平面。我们展示了与现有的流形学习算法在人工和真实世界图像数据上的比较结果。
{"title":"Parallel Projections for Manifold Learning","authors":"H. Strange, R. Zwiggelaar","doi":"10.1109/ICMLA.2010.54","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.54","url":null,"abstract":"Manifold learning is a widely used statistical tool which reduces the dimensionality of a data set while aiming to maintain both local and global properties of the data. We present a novel manifold learning technique which aligns local hyper planes to build a global representation of the data. A Minimum Spanning Tree provides the skeleton needed to traverse the manifold so that the local hyper planes can be merged using parallel projections to build a global hyper plane of the data. We show state of the art results when compared against existing manifold learning algorithm on both artificial and real world image data.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115405944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Bayesian Inferences and Forecasting in Spatial Time Series Models 空间时间序列模型中的贝叶斯推断与预测
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.170
Sung Duck Lee, Duck-Ki Kim
The spatial time series data can be viewed as a set of time series collected simultaneously at a number of spatial locations with time. For example, The Mumps data have a feature to infect adjacent broader regions in accordance with spatial location and time. Therefore, The spatial time series models have many parameters of space and time. In this paper, We propose the method of bayesian inferences and prediction in spatial time series models with a Gibbs Sampler in order to overcome convergence problem in numerical methods. Our results are illustrated by using the data set of mumps cases reported from the Korea Center for Disease Control and Prevention monthly over the years 2001-2009, as well as a simulation study.
空间时间序列数据可以看作是在多个空间位置随时间同时采集的一组时间序列。例如,腮腺炎数据具有根据空间位置和时间感染邻近更广泛区域的特征。因此,空间时间序列模型具有许多时空参数。为了克服数值方法的收敛性问题,提出了基于Gibbs采样器的空间时间序列模型的贝叶斯推理和预测方法。我们的结果是通过使用2001-2009年韩国疾病控制和预防中心每月报告的腮腺炎病例数据集以及模拟研究来说明的。
{"title":"Bayesian Inferences and Forecasting in Spatial Time Series Models","authors":"Sung Duck Lee, Duck-Ki Kim","doi":"10.1109/ICMLA.2010.170","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.170","url":null,"abstract":"The spatial time series data can be viewed as a set of time series collected simultaneously at a number of spatial locations with time. For example, The Mumps data have a feature to infect adjacent broader regions in accordance with spatial location and time. Therefore, The spatial time series models have many parameters of space and time. In this paper, We propose the method of bayesian inferences and prediction in spatial time series models with a Gibbs Sampler in order to overcome convergence problem in numerical methods. Our results are illustrated by using the data set of mumps cases reported from the Korea Center for Disease Control and Prevention monthly over the years 2001-2009, as well as a simulation study.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"522 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114623439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multilayer Ferns: A Learning-based Approach of Patch Recognition and Homography Extraction 多层蕨类植物:基于学习的斑块识别和单应性提取方法
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.36
Gao Ce, Song Yixu, Jia Pei-fa
While local patches recognition is a key component of modern approaches to affine transformation detection and object detection, existing learning-based approaches just identify the patches based on a set of randomly picked and combined binary features, which will lose some strong correlations between features and can not provide stable and remarkable identification ability. In this paper, we proposed a method that select and organize the features in a Multilayer Ferns structure, and show that it is both faster in the run-time processing and more powerful in the identification ability than state-of-the-art ad hoc approaches.
局部斑块识别是现代仿射变换检测和目标检测方法的关键组成部分,但现有的基于学习的方法仅仅是基于一组随机选取和组合的二值特征来识别斑块,这将失去特征之间的一些强相关性,无法提供稳定而显著的识别能力。在本文中,我们提出了一种在多层蕨类结构中选择和组织特征的方法,并表明它在运行时处理速度和识别能力上都比目前最先进的特别方法要快。
{"title":"Multilayer Ferns: A Learning-based Approach of Patch Recognition and Homography Extraction","authors":"Gao Ce, Song Yixu, Jia Pei-fa","doi":"10.1109/ICMLA.2010.36","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.36","url":null,"abstract":"While local patches recognition is a key component of modern approaches to affine transformation detection and object detection, existing learning-based approaches just identify the patches based on a set of randomly picked and combined binary features, which will lose some strong correlations between features and can not provide stable and remarkable identification ability. In this paper, we proposed a method that select and organize the features in a Multilayer Ferns structure, and show that it is both faster in the run-time processing and more powerful in the identification ability than state-of-the-art ad hoc approaches.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"78 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129662200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Patient-Specific Seizure Detection from Intra-cranial EEG Using High Dimensional Clustering 基于高维聚类的颅内脑电图患者特异性癫痫检测
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.119
Haimonti Dutta, D. Waltz, Karthik M. Ramasamy, Philip Gross, Ansaf Salleb-Aouissi, H. Diab, Manoj Pooleery, C. Schevon, R. Emerson
Automatic seizure detection is becoming popular in modern epilepsy monitoring units since it assists diagnostic monitoring and reduces manual review of large volumes of EEG recordings. In this paper, we describe the application of machine learning algorithms for building patient-specific seizure detectors on multiple frequency bands of intra-cranial electroencephalogram (iEEG) recorded by a dense Micro-Electrode Array (MEA). The MEA is capable of recording at a very high sampling rate (30 KHz) producing an avalanche of time series data. We explore subsets of this data to build seizure detectors – we discuss several methods for extracting univariate and bivariate features from the channels and study the effectiveness of using high dimensional clustering algorithms such as K-means and Subspace clustering for constructing the model. Future work involves design of more robust seizure detectors using other features and non-parametric clustering techniques, detection of artifacts and understanding the generalization properties of the models.
自动发作检测在现代癫痫监测装置中越来越流行,因为它有助于诊断监测,减少了大量脑电图记录的人工审查。在本文中,我们描述了机器学习算法在密集微电极阵列(MEA)记录的颅内脑电图(iEEG)的多个频段上构建患者特定癫痫检测器的应用。MEA能够以非常高的采样率(30 KHz)记录,产生雪崩的时间序列数据。我们探索这些数据的子集来构建癫痫检测器-我们讨论了几种从通道中提取单变量和双变量特征的方法,并研究了使用高维聚类算法(如K-means和子空间聚类)构建模型的有效性。未来的工作包括使用其他特征和非参数聚类技术设计更健壮的癫痫检测器,检测工件和理解模型的泛化属性。
{"title":"Patient-Specific Seizure Detection from Intra-cranial EEG Using High Dimensional Clustering","authors":"Haimonti Dutta, D. Waltz, Karthik M. Ramasamy, Philip Gross, Ansaf Salleb-Aouissi, H. Diab, Manoj Pooleery, C. Schevon, R. Emerson","doi":"10.1109/ICMLA.2010.119","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.119","url":null,"abstract":"Automatic seizure detection is becoming popular in modern epilepsy monitoring units since it assists diagnostic monitoring and reduces manual review of large volumes of EEG recordings. In this paper, we describe the application of machine learning algorithms for building patient-specific seizure detectors on multiple frequency bands of intra-cranial electroencephalogram (iEEG) recorded by a dense Micro-Electrode Array (MEA). The MEA is capable of recording at a very high sampling rate (30 KHz) producing an avalanche of time series data. We explore subsets of this data to build seizure detectors – we discuss several methods for extracting univariate and bivariate features from the channels and study the effectiveness of using high dimensional clustering algorithms such as K-means and Subspace clustering for constructing the model. Future work involves design of more robust seizure detectors using other features and non-parametric clustering techniques, detection of artifacts and understanding the generalization properties of the models.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125448492","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Bayesian Nonparametric Model for Joint Relation Integration and Domain Clustering 联合关系集成与域聚类的贝叶斯非参数模型
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.168
Dazhuo Li, Fahim Mohammad, E. Rouchka
Relational databases provide unprecedented opportunities for knowledge discovery. Various approaches have been proposed to infer structures over entity types and predict relationships among elements of these types. However, discovering structures beyond the entity type level, e.g. clustering over relation concepts, remains a challenging task. We present a Bayesian nonparametric model for joint relation and domain clustering. The model can automatically infer the number of relation clusters, which is particularly important in novel cases where little prior knowledge is known about the number of relation clusters. The approach is applied to clustering various relations in a gene database. Keywords-relational learning; clustering; Bayesian non- parametric
关系数据库为知识发现提供了前所未有的机会。已经提出了各种方法来推断实体类型的结构并预测这些类型的元素之间的关系。然而,发现实体类型级别以外的结构,例如关系概念上的聚类,仍然是一项具有挑战性的任务。提出了一种用于联合关系和域聚类的贝叶斯非参数模型。该模型可以自动推断出关系簇的数量,这在对关系簇数量知之甚少的新情况下尤为重要。将该方法应用于基因数据库中各种关系的聚类。Keywords-relational学习;聚类;贝叶斯非参数
{"title":"A Bayesian Nonparametric Model for Joint Relation Integration and Domain Clustering","authors":"Dazhuo Li, Fahim Mohammad, E. Rouchka","doi":"10.1109/ICMLA.2010.168","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.168","url":null,"abstract":"Relational databases provide unprecedented opportunities for knowledge discovery. Various approaches have been proposed to infer structures over entity types and predict relationships among elements of these types. However, discovering structures beyond the entity type level, e.g. clustering over relation concepts, remains a challenging task. We present a Bayesian nonparametric model for joint relation and domain clustering. The model can automatically infer the number of relation clusters, which is particularly important in novel cases where little prior knowledge is known about the number of relation clusters. The approach is applied to clustering various relations in a gene database. Keywords-relational learning; clustering; Bayesian non- parametric","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125565441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multi-Agent Inverse Reinforcement Learning 多智能体逆强化学习
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.65
Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah, Prasad Tadepalli, K. Kersting, J. Shavlik
Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship learning. We introduce the problem of multi-agent inverse reinforcement learning, where reward functions of multiple agents are learned by observing their uncoordinated behavior. A centralized controller then learns to coordinate their behavior by optimizing a weighted sum of reward functions of all the agents. We evaluate our approach on a traffic-routing domain, in which a controller coordinates actions of multiple traffic signals to regulate traffic density. We show that the learner is not only able to match but even significantly outperform the expert.
通过观察智能体的行为来学习其奖励函数被称为逆强化学习,并在示范学习或学徒学习中有应用。我们引入了多智能体逆强化学习问题,其中多智能体的奖励函数通过观察它们的不协调行为来学习。然后,集中式控制器通过优化所有代理的奖励函数的加权和来学习协调它们的行为。我们在交通路由域上评估我们的方法,其中控制器协调多个交通信号的动作来调节交通密度。我们的研究表明,学习者不仅能够匹配甚至明显优于专家。
{"title":"Multi-Agent Inverse Reinforcement Learning","authors":"Sriraam Natarajan, Gautam Kunapuli, Kshitij Judah, Prasad Tadepalli, K. Kersting, J. Shavlik","doi":"10.1109/ICMLA.2010.65","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.65","url":null,"abstract":"Learning the reward function of an agent by observing its behavior is termed inverse reinforcement learning and has applications in learning from demonstration or apprenticeship learning. We introduce the problem of multi-agent inverse reinforcement learning, where reward functions of multiple agents are learned by observing their uncoordinated behavior. A centralized controller then learns to coordinate their behavior by optimizing a weighted sum of reward functions of all the agents. We evaluate our approach on a traffic-routing domain, in which a controller coordinates actions of multiple traffic signals to regulate traffic density. We show that the learner is not only able to match but even significantly outperform the expert.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125044165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 77
期刊
2010 Ninth International Conference on Machine Learning and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1