首页 > 最新文献

2010 Ninth International Conference on Machine Learning and Applications最新文献

英文 中文
Kernel-based Approaches for Collaborative Filtering 基于核的协同过滤方法
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.41
Zhonghang Xia, Wenke Zhang, Manghui Tu, I. Yen
In a large-scale collaborative filtering system, pair wise similarity between users is usually measured by users' ratings on the whole set of items. However, this measurement may not be well defined due to the sparsity problem, i.e., the lack of adequate ratings on items for calculating accurate predictions. In fact, most correlated users have similar ratings only on a subset of items. In this paper, we consider a kernel-based classification approach for collaborative filtering and propose several kernel matrix construction methods by using biclusters to capture pair wise similarity between users. In order to characterize accurate correlation among users, we embed both local information and global information into the similarity matrix. However, this similarity matrix may not be a kernel matrix. Our solution is to approximate it with the matrix close to it and use low rank constraints to control the complexity of the matrix.
在大规模协同过滤系统中,用户之间的配对相似度通常通过用户对整个项目集的评分来衡量。然而,由于稀疏性问题,这种度量可能不能很好地定义,即,缺乏对计算准确预测的项目的适当评级。事实上,大多数相关用户只对一小部分商品有相似的评分。在本文中,我们考虑了一种基于核的协同过滤分类方法,并提出了几种基于双聚类的核矩阵构建方法来捕获用户之间的对相似度。为了准确表征用户之间的相关性,我们将局部信息和全局信息嵌入到相似矩阵中。然而,这个相似矩阵可能不是核矩阵。我们的解决方法是用它附近的矩阵来近似它,并使用低秩约束来控制矩阵的复杂度。
{"title":"Kernel-based Approaches for Collaborative Filtering","authors":"Zhonghang Xia, Wenke Zhang, Manghui Tu, I. Yen","doi":"10.1109/ICMLA.2010.41","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.41","url":null,"abstract":"In a large-scale collaborative filtering system, pair wise similarity between users is usually measured by users' ratings on the whole set of items. However, this measurement may not be well defined due to the sparsity problem, i.e., the lack of adequate ratings on items for calculating accurate predictions. In fact, most correlated users have similar ratings only on a subset of items. In this paper, we consider a kernel-based classification approach for collaborative filtering and propose several kernel matrix construction methods by using biclusters to capture pair wise similarity between users. In order to characterize accurate correlation among users, we embed both local information and global information into the similarity matrix. However, this similarity matrix may not be a kernel matrix. Our solution is to approximate it with the matrix close to it and use low rank constraints to control the complexity of the matrix.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125459368","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Public Goods Game Simulator with Reinforcement Learning Agents 具有强化学习代理的公共物品博弈模拟器
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.14
U. ManChon, Z. Li
As a famous game in the domain of game theory, both pervasive empirical studies as well as intensive theoretical analysis have been conducted and performed worldwide to research different public goods game scenarios. At the same time, computer game simulators are utilized widely for better research of game theory by providing easy but powerful visualization and statistics functionalities. However, although solutions of public goods game have been widely discussed with empirical studies or theoretical approaches, no computational and automatic simulation approaches has been adopted. For this reason, we have implemented a computer simulator with reinforcement learning agents module for public goods game, and we have utilized this simulator to further study the characteristics of public goods game. Furthermore, in this article, we have also presented a bunch of interesting experimental results with respect to the strategies that agents used and the profits they earned.
作为博弈论领域的著名博弈,世界范围内对不同的公共物品博弈场景进行了广泛的实证研究和深入的理论分析。与此同时,计算机游戏模拟器通过提供简单而强大的可视化和统计功能,被广泛用于更好地研究博弈论。然而,虽然公共产品博弈的解决方案已经通过实证研究或理论方法进行了广泛的讨论,但尚未采用计算和自动模拟的方法。为此,我们为公共物品博弈实现了一个带有强化学习代理模块的计算机模拟器,并利用该模拟器进一步研究了公共物品博弈的特点。此外,在本文中,我们还介绍了一些关于代理使用的策略和他们获得的利润的有趣实验结果。
{"title":"Public Goods Game Simulator with Reinforcement Learning Agents","authors":"U. ManChon, Z. Li","doi":"10.1109/ICMLA.2010.14","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.14","url":null,"abstract":"As a famous game in the domain of game theory, both pervasive empirical studies as well as intensive theoretical analysis have been conducted and performed worldwide to research different public goods game scenarios. At the same time, computer game simulators are utilized widely for better research of game theory by providing easy but powerful visualization and statistics functionalities. However, although solutions of public goods game have been widely discussed with empirical studies or theoretical approaches, no computational and automatic simulation approaches has been adopted. For this reason, we have implemented a computer simulator with reinforcement learning agents module for public goods game, and we have utilized this simulator to further study the characteristics of public goods game. Furthermore, in this article, we have also presented a bunch of interesting experimental results with respect to the strategies that agents used and the profits they earned.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133461732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Intelligent Classification System Using a Pruned Bayes Fuzzy Rule Set 基于修剪贝叶斯模糊规则集的智能分类系统
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.98
I. Yin, Estevam Hruschka, H. Camargo
Hybrid intelligent systems which take advantage of the Bayesian/Fuzzy collaboration have been explored in the literature in the last years. Such collaboration can play an important role mainly in real intelligent systems applications, where accuracy and comprehensibility are crucial aspects to be considered. This paper further explore the Bayes Fuzzy method proposing a classification method specially designed to be used in intelligent systems for data analysis. The main idea is to enhance comprehensibility while maintaining accuracy by decreasing the number of fuzzy rules used to explain a Bayesian Classifier (BC). The proposed Pruned Bayes Fuzzy 2 (PBF2) method is based on a new feature selection method named Selection by Markov Blanket Relation Strength (SMBRS). In the performed experiments, PBF2 is empirically applied to a real world police records problem in order to extract a comprehensible and accurate set of rules which can help in crime prevention. The obtained results show PBF2, when used with proper parameters, brings better precision and comprehensibility compared to other Bayesian/Fuzzy-based methods and to C4.5 algorithm.
利用贝叶斯/模糊协同的混合智能系统在过去几年中已经在文献中进行了探索。这种协作主要在真实的智能系统应用中发挥重要作用,其中准确性和可理解性是需要考虑的关键方面。本文进一步探讨了贝叶斯模糊方法,提出了一种专门用于智能系统数据分析的分类方法。其主要思想是通过减少用于解释贝叶斯分类器(BC)的模糊规则的数量来提高可理解性,同时保持准确性。提出的修剪贝叶斯模糊2 (PBF2)方法是基于一种新的特征选择方法——马尔可夫毯关系强度选择(SMBRS)。在所进行的实验中,PBF2被经验地应用于现实世界的警察记录问题,以提取一套可理解和准确的规则,有助于预防犯罪。结果表明,与其他基于贝叶斯/模糊的方法和C4.5算法相比,在适当的参数下使用PBF2具有更好的精度和可理解性。
{"title":"Intelligent Classification System Using a Pruned Bayes Fuzzy Rule Set","authors":"I. Yin, Estevam Hruschka, H. Camargo","doi":"10.1109/ICMLA.2010.98","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.98","url":null,"abstract":"Hybrid intelligent systems which take advantage of the Bayesian/Fuzzy collaboration have been explored in the literature in the last years. Such collaboration can play an important role mainly in real intelligent systems applications, where accuracy and comprehensibility are crucial aspects to be considered. This paper further explore the Bayes Fuzzy method proposing a classification method specially designed to be used in intelligent systems for data analysis. The main idea is to enhance comprehensibility while maintaining accuracy by decreasing the number of fuzzy rules used to explain a Bayesian Classifier (BC). The proposed Pruned Bayes Fuzzy 2 (PBF2) method is based on a new feature selection method named Selection by Markov Blanket Relation Strength (SMBRS). In the performed experiments, PBF2 is empirically applied to a real world police records problem in order to extract a comprehensible and accurate set of rules which can help in crime prevention. The obtained results show PBF2, when used with proper parameters, brings better precision and comprehensibility compared to other Bayesian/Fuzzy-based methods and to C4.5 algorithm.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132592279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aggregating Multiple Biological Measurements Per Patient 汇总每位患者的多项生物学测量
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.120
V. Zubek, F. Khan
Many machine learning algorithms require a single value per feature per record for modeling. However, there are applications, in the medical domain particularly, where a single record may have multiple observations for the same feature. For example, a patient could have the same gene analyzed in multiple tissue slides of a biopsy, or could have the same genetic test performed on multiple subsequent biopsies. The challenge in these applications is how to integrate multiple observations of the same predictor feature per record. In this paper, two data aggregation methods are compared, one method is a simple median aggregation of feature values, while the other is a novel method which constructs intervals of values for each feature. The aggregated features are passed as input to a novel support vector regression method for modeling survival data in a prostate cancer setting. The performance of both methods was similar in predicting prostate cancer progression on three data cohorts.
许多机器学习算法需要每个特征每个记录的单个值进行建模。然而,在某些应用中,特别是在医学领域,单个记录可能对同一特征有多个观察结果。例如,患者可以在活检的多个组织切片中分析相同的基因,或者可以在随后的多个活检中进行相同的基因检测。这些应用程序中的挑战是如何将每个记录的相同预测器特征的多个观察结果集成在一起。本文比较了两种数据聚合方法,一种是简单的特征值中值聚合,另一种是为每个特征构造值区间的新方法。将聚合的特征作为输入传递给一种新的支持向量回归方法,用于在前列腺癌设置中建模生存数据。在三个数据队列中,两种方法在预测前列腺癌进展方面的表现相似。
{"title":"Aggregating Multiple Biological Measurements Per Patient","authors":"V. Zubek, F. Khan","doi":"10.1109/ICMLA.2010.120","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.120","url":null,"abstract":"Many machine learning algorithms require a single value per feature per record for modeling. However, there are applications, in the medical domain particularly, where a single record may have multiple observations for the same feature. For example, a patient could have the same gene analyzed in multiple tissue slides of a biopsy, or could have the same genetic test performed on multiple subsequent biopsies. The challenge in these applications is how to integrate multiple observations of the same predictor feature per record. In this paper, two data aggregation methods are compared, one method is a simple median aggregation of feature values, while the other is a novel method which constructs intervals of values for each feature. The aggregated features are passed as input to a novel support vector regression method for modeling survival data in a prostate cancer setting. The performance of both methods was similar in predicting prostate cancer progression on three data cohorts.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"230 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115751914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Binocular Framework for Face Liveness Verification under Unconstrained Localization 无约束定位下的双目人脸活动性验证框架
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.37
Qi Li, Zhonghang Xia, Guangming Xing
In this paper, we propose a binocular framework for face liveness verification under unconstrained localization. The proposed framework contains two components: the first component localizes imbalanced points in face regions of an input pair of stereo images and the second component detects whether an imaging face is a 2D object or a 3D object. We test the propose framework on a publicly available stereo face database, which demonstrated its potential.
本文提出了一种无约束定位下的双目人脸活体验证框架。该框架包含两个组件:第一个组件定位输入对立体图像的人脸区域中的不平衡点,第二个组件检测成像的人脸是2D对象还是3D对象。我们在一个公开的立体人脸数据库上测试了该框架,证明了它的潜力。
{"title":"A Binocular Framework for Face Liveness Verification under Unconstrained Localization","authors":"Qi Li, Zhonghang Xia, Guangming Xing","doi":"10.1109/ICMLA.2010.37","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.37","url":null,"abstract":"In this paper, we propose a binocular framework for face liveness verification under unconstrained localization. The proposed framework contains two components: the first component localizes imbalanced points in face regions of an input pair of stereo images and the second component detects whether an imaging face is a 2D object or a 3D object. We test the propose framework on a publicly available stereo face database, which demonstrated its potential.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"161 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115996449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Spatial Based Feature Generation for Machine Learning Based Optimization Compilation 基于空间特征生成的机器学习优化编译
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.147
A. Malik
Modern compilers provide optimization options to obtain better performance for a given program. Effective selection of optimization options is a challenging task. Recent work has shown that machine learning can be used to select the best compiler optimization options for a given program. Machine learning techniques rely upon selecting features which represent a program in the best way. The quality of these features is critical to the performance of machine learning techniques. Previous work on feature selection for program representation is based on code size, mostly executed parts, parallelism and memory access patterns with-in a program. Spatial based information–how instructions are distributed with-in a program–has never been studied to generate features for the best compiler options selection using machine learning techniques. In this paper, we present a framework that address how to capture the spatial information with-in a program and transform it to features for machine learning techniques. An extensive experimentation is done using the SPEC2006 and MiBench benchmark applications. We compare our work with the IBM Milepost-gcc framework. The Milepost work gives a comprehensive set of features for using machine learning techniques for the best compiler options selection problem. Results show that the performance of machine learning techniques using spatial based features is better than the performance using the Milepost framework. With 66 available compiler options, we are also able to achieve 70% of the potential speed up obtained through an iterative compilation.
现代编译器提供优化选项,以获得给定程序的更好性能。优化方案的有效选择是一项具有挑战性的任务。最近的研究表明,机器学习可以用来为给定的程序选择最佳的编译器优化选项。机器学习技术依赖于选择以最佳方式代表程序的特征。这些特征的质量对机器学习技术的性能至关重要。以前关于程序表示的特征选择的工作是基于代码大小、主要执行部分、并行性和程序内部的内存访问模式。基于空间的信息——指令是如何在程序中分布的——从未被研究过,以使用机器学习技术生成最佳编译器选项选择的特征。在本文中,我们提出了一个框架,该框架解决了如何捕获程序中的空间信息并将其转换为机器学习技术的特征。使用SPEC2006和MiBench基准测试应用程序进行了广泛的实验。我们将我们的工作与IBM milestone -gcc框架进行比较。milestone的工作为使用机器学习技术解决最佳编译器选项选择问题提供了一组全面的特性。结果表明,使用基于空间特征的机器学习技术的性能优于使用里程碑框架的性能。有了66个可用的编译器选项,我们还能够实现通过迭代编译获得的70%的潜在速度提升。
{"title":"Spatial Based Feature Generation for Machine Learning Based Optimization Compilation","authors":"A. Malik","doi":"10.1109/ICMLA.2010.147","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.147","url":null,"abstract":"Modern compilers provide optimization options to obtain better performance for a given program. Effective selection of optimization options is a challenging task. Recent work has shown that machine learning can be used to select the best compiler optimization options for a given program. Machine learning techniques rely upon selecting features which represent a program in the best way. The quality of these features is critical to the performance of machine learning techniques. Previous work on feature selection for program representation is based on code size, mostly executed parts, parallelism and memory access patterns with-in a program. Spatial based information–how instructions are distributed with-in a program–has never been studied to generate features for the best compiler options selection using machine learning techniques. In this paper, we present a framework that address how to capture the spatial information with-in a program and transform it to features for machine learning techniques. An extensive experimentation is done using the SPEC2006 and MiBench benchmark applications. We compare our work with the IBM Milepost-gcc framework. The Milepost work gives a comprehensive set of features for using machine learning techniques for the best compiler options selection problem. Results show that the performance of machine learning techniques using spatial based features is better than the performance using the Milepost framework. With 66 available compiler options, we are also able to achieve 70% of the potential speed up obtained through an iterative compilation.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117231964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Peptide Sequence Tag-Based Blind Identification-based SVM Model 基于肽序列标签的盲识别SVM模型
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.156
Hui Li, Chunmei Liu, Xumin Liu, M. Diakite, L. Burge, A. Yakubu, W. Southerland
Identifying the ion types for a mass spectrum is essential for interpreting the spectrum and deriving its peptide sequence. In this paper, we proposed a novel method for identifying ion types and deriving matched peptide sequences for tandem mass spectra. We first divided our dataset into a training set and a testing set and then preprocessed the data using a Support Vector Machine and a 5-fold cross validation based dual denoting model. Then we constructed a syntax tree and generated a rule set to match the mass values from experimental mass spectra with the mass spectral values from corresponding theoretical mass spectra. Finally we applied the proposed algorithm to a tandem mass spectral dataset consisting of 2656 spectra from yeast. Compared with other methods, the experimental results showed that the proposed method can effectively filter noise and successfully derive peptide sequences.
确定质谱中的离子类型对于解释谱和推导其肽序列至关重要。在本文中,我们提出了一种新的方法来识别离子类型和衍生匹配肽序列的串联质谱。我们首先将数据集分为训练集和测试集,然后使用支持向量机和基于5倍交叉验证的对偶表示模型对数据进行预处理。然后构建语法树,生成规则集,将实验质谱的质量值与相应理论质谱的质量值进行匹配。最后,我们将该算法应用于一个包含2656个酵母谱的串联质谱数据集。实验结果表明,该方法能够有效地滤除噪声,并成功地推导出肽序列。
{"title":"Peptide Sequence Tag-Based Blind Identification-based SVM Model","authors":"Hui Li, Chunmei Liu, Xumin Liu, M. Diakite, L. Burge, A. Yakubu, W. Southerland","doi":"10.1109/ICMLA.2010.156","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.156","url":null,"abstract":"Identifying the ion types for a mass spectrum is essential for interpreting the spectrum and deriving its peptide sequence. In this paper, we proposed a novel method for identifying ion types and deriving matched peptide sequences for tandem mass spectra. We first divided our dataset into a training set and a testing set and then preprocessed the data using a Support Vector Machine and a 5-fold cross validation based dual denoting model. Then we constructed a syntax tree and generated a rule set to match the mass values from experimental mass spectra with the mass spectral values from corresponding theoretical mass spectra. Finally we applied the proposed algorithm to a tandem mass spectral dataset consisting of 2656 spectra from yeast. Compared with other methods, the experimental results showed that the proposed method can effectively filter noise and successfully derive peptide sequences.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129629517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semi-Supervised Anomaly Detection for EEG Waveforms Using Deep Belief Nets 基于深度信念网的脑电信号半监督异常检测
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.71
Drausin Wulsin, Justin A. Blanco, R. Mani, B. Litt
Clinical electroencephalography (EEG) is routinely used to monitor brain function in critically ill patients, and specific EEG waveforms are recognized by clinicians as signatures of abnormal brain. These pathologic EEG waveforms, once detected, often necessitate accute clinincal interventions, but these events are typically rare, highly variable between patients, and often hard to separate from background, making them difficult to reliably detect. We show that Deep Belief Nets (DBNs), a type of multi-layer generative neural network, can be used effectively for such EEG anomaly detection. We compare this technique to the state-of-the-art, a one-class Support Vector Machine (SVM), showing that the DBN outperforms the SVM by the F1 measure for our EEG dataset. We also show how the outputs of a DBN-based detector can be used to aid visualization of anomalies in large EEG data sets and propose a method for using DBNs to gain insight into which features of signals are characteristically anomalous. These findings show that Deep Belief Nets can facilitate human review of large amounts of clinical EEG as well as mining new EEG features that may be indicators of unusual activity.
临床脑电图(EEG)通常用于监测危重患者的脑功能,特定的脑电图波形被临床医生识别为大脑异常的标志。一旦检测到这些病理性脑电图波形,通常需要进行急性临床干预,但这些事件通常罕见,患者之间差异很大,并且通常难以与背景分离,因此难以可靠地检测到。研究表明,深度信念网络(Deep Belief Nets, DBNs)是一种多层生成神经网络,可以有效地用于这种EEG异常检测。我们将该技术与最先进的单类支持向量机(SVM)进行比较,结果表明DBN在EEG数据集的F1度量上优于SVM。我们还展示了如何使用基于dbn的检测器的输出来帮助可视化大型脑电图数据集中的异常,并提出了一种使用dbn来深入了解信号的哪些特征是典型异常的方法。这些发现表明,深度信念网络可以促进人类对大量临床脑电图的回顾,以及挖掘可能是异常活动指标的新脑电图特征。
{"title":"Semi-Supervised Anomaly Detection for EEG Waveforms Using Deep Belief Nets","authors":"Drausin Wulsin, Justin A. Blanco, R. Mani, B. Litt","doi":"10.1109/ICMLA.2010.71","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.71","url":null,"abstract":"Clinical electroencephalography (EEG) is routinely used to monitor brain function in critically ill patients, and specific EEG waveforms are recognized by clinicians as signatures of abnormal brain. These pathologic EEG waveforms, once detected, often necessitate accute clinincal interventions, but these events are typically rare, highly variable between patients, and often hard to separate from background, making them difficult to reliably detect. We show that Deep Belief Nets (DBNs), a type of multi-layer generative neural network, can be used effectively for such EEG anomaly detection. We compare this technique to the state-of-the-art, a one-class Support Vector Machine (SVM), showing that the DBN outperforms the SVM by the F1 measure for our EEG dataset. We also show how the outputs of a DBN-based detector can be used to aid visualization of anomalies in large EEG data sets and propose a method for using DBNs to gain insight into which features of signals are characteristically anomalous. These findings show that Deep Belief Nets can facilitate human review of large amounts of clinical EEG as well as mining new EEG features that may be indicators of unusual activity.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127475327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 86
On the Scalability of Supervised Learners in Metagenomics 元基因组学中监督学习器的可扩展性研究
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.123
U. ManChon, Vasim Mahamuda, K. Rasheed
Metagenomics deals with the study of micro-organisms such as prokaryotes that are found in samples from natural environments. The samples obtained from the environment may contain DNA from many different species of micro-organisms including bacteria and archea. Micro-organisms are responsible for most of the symbiotic activity on earth. They are also responsible for the complex chemical reactions which take place on the surface of the earth, which help maintain earth’s ecological balance. With the increase in genome sequencing projects there has been a considerable increase in the amount of assembled sequencing data. In this article, we apply supervised learners namely decision trees, Bayesian networks and decision tables to see how the performance degrades when the number of species present in the metagenomic sample increases. We also try to see how the performance of the metagenomic sample changes as the percentage of unknown sequences in the metagenomic sample is varied.
宏基因组学研究的是微生物,如在自然环境样本中发现的原核生物。从环境中获得的样品可能含有许多不同种类的微生物的DNA,包括细菌和古细菌。微生物负责地球上大部分的共生活动。它们还负责地球表面发生的复杂化学反应,这些化学反应有助于维持地球的生态平衡。随着基因组测序项目的增加,组装测序数据的数量也有了相当大的增加。在本文中,我们应用监督学习器即决策树、贝叶斯网络和决策表来观察当宏基因组样本中存在的物种数量增加时,性能是如何下降的。我们还试图了解宏基因组样品的性能如何随着宏基因组样品中未知序列的百分比的变化而变化。
{"title":"On the Scalability of Supervised Learners in Metagenomics","authors":"U. ManChon, Vasim Mahamuda, K. Rasheed","doi":"10.1109/ICMLA.2010.123","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.123","url":null,"abstract":"Metagenomics deals with the study of micro-organisms such as prokaryotes that are found in samples from natural environments. The samples obtained from the environment may contain DNA from many different species of micro-organisms including bacteria and archea. Micro-organisms are responsible for most of the symbiotic activity on earth. They are also responsible for the complex chemical reactions which take place on the surface of the earth, which help maintain earth’s ecological balance. With the increase in genome sequencing projects there has been a considerable increase in the amount of assembled sequencing data. In this article, we apply supervised learners namely decision trees, Bayesian networks and decision tables to see how the performance degrades when the number of species present in the metagenomic sample increases. We also try to see how the performance of the metagenomic sample changes as the percentage of unknown sequences in the metagenomic sample is varied.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129150593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Power Iteration Denoising 幂次迭代去噪
Pub Date : 2010-12-12 DOI: 10.1109/ICMLA.2010.131
Panganai Gomo, Mike Spann
We present a simple method for image denoising called power iteration denoising (PID). PID finds a low dimensional embedding of the image data using a truncated power iteration on a normalized pair-wise similarity matrix generated from the image. This embedding turns out to be an effective denoising algorithm outperforming the widely used non-local means algorithm. We apply this method to the denoising of noisy digital camera images producing visually pleasing results.
提出了一种简单的图像去噪方法——功率迭代去噪(PID)。PID通过对由图像生成的归一化成对相似性矩阵进行截断幂次迭代,找到图像数据的低维嵌入。结果表明,该嵌入算法是一种有效的去噪算法,优于广泛使用的非局部均值算法。我们将这种方法应用于噪声数码相机图像的去噪,产生视觉上令人满意的效果。
{"title":"Power Iteration Denoising","authors":"Panganai Gomo, Mike Spann","doi":"10.1109/ICMLA.2010.131","DOIUrl":"https://doi.org/10.1109/ICMLA.2010.131","url":null,"abstract":"We present a simple method for image denoising called power iteration denoising (PID). PID finds a low dimensional embedding of the image data using a truncated power iteration on a normalized pair-wise similarity matrix generated from the image. This embedding turns out to be an effective denoising algorithm outperforming the widely used non-local means algorithm. We apply this method to the denoising of noisy digital camera images producing visually pleasing results.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130626034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2010 Ninth International Conference on Machine Learning and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1