Proceedings of the ... International Conference on Machine Learning and Applications: Latest Articles
"A textual representation scheme for identifying clinical relationships in patient records."
Rezarta Islamaj Doğan, Aurélie Névéol, Zhiyong Lu. DOI: 10.1109/ICMLA.2010.164. ICMLA 2010, pp. 995–998; published 2011-02-04.

Identifying relationships between clinical concepts in patient records is a preliminary step for many important applications in medical informatics, ranging from quality of care to hypothesis generation. This work describes an approach that facilitates the automatic recognition of relationships defined between two different concepts in text. Unlike the traditional bag-of-words representation, a relationship is represented here with a scheme of five distinct context blocks based on the position of the concepts in the text. The scheme was applied to eight different relationships between medical problems, treatments, and tests on a set of 349 patient records from the 4th i2b2 challenge. The context-block representation clearly outperformed the bag-of-words model (F-measure 0.775 vs. 0.402). Its advantage lies in the correct handling of word-position information, which can be critical for identifying certain relationships.
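The five context blocks are easy to picture: the two concept spans partition the sentence. A minimal sketch (the block names are our own; the paper defines the blocks by concept position but does not publish code):

```python
def context_blocks(tokens, c1, c2):
    """Split a tokenized sentence into five context blocks around two
    concept spans c1=(start, end) and c2=(start, end), end-exclusive.
    Assumes c1 appears before c2 in the sentence."""
    (s1, e1), (s2, e2) = c1, c2
    return {
        "before":   tokens[:s1],    # words preceding the first concept
        "concept1": tokens[s1:e1],  # the first concept itself
        "between":  tokens[e1:s2],  # words between the two concepts
        "concept2": tokens[s2:e2],  # the second concept itself
        "after":    tokens[e2:],    # words following the second concept
    }

tokens = "the patient received aspirin to treat the headache".split()
blocks = context_blocks(tokens, (3, 4), (7, 8))
```

Features extracted per block (rather than from one undifferentiated bag) are what let a classifier distinguish, say, "treatment administered for problem" from "treatment causing problem".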
"Arabic Handwriting Recognition Using Concavity Features and Classifier Fusion."
S. Abdelazeem, Maha El Meseery. DOI: 10.1109/ICMLA.2011.36. ICMLA 2011, pp. 200–203.
"Nonlinear RANSAC Optimization for Parameter Estimation with Applications to Phagocyte Transmigration."
Mingon Kang, Jean Gao, Liping Tang. ICMLA 2011, pp. 501–504. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3516195/pdf/nihms336836.pdf

Developing rigorous mathematical equations and estimating accurate parameters within feasible computational time are two indispensable parts of building system models that reliably represent the biological properties of a system and produce trustworthy simulations. For a complex biological system with limited observations, one daunting task is the large number of unknown parameters in the mathematical model, whose values directly determine the performance of the computational modeling. To tackle this problem, we developed nonlinear RANSAC, a data-driven global optimization method based on RANdom SAmple Consensus (RANSAC), for parameter estimation in nonlinear system models. The conventional RANSAC method is sound and simple, but it is oriented toward linear system models. We not only adopt the strengths of RANSAC but also extend the method to nonlinear systems with excellent performance. As a specific application, we target phagocyte transmigration, which is involved in the fibrosis process around implanted biomedical devices. With well-defined nonlinear equations of the system, nonlinear RANSAC is performed for the parameter estimation. To evaluate the general performance of the method, we also applied it to signalling pathways expressed as ordinary differential equations.
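The core loop of a RANSAC-style estimator for a nonlinear model can be sketched in a few lines. This toy version fits y = a·exp(b·x) from minimal two-point samples and keeps the parameters with the most inliers; the model, tolerance, and data are illustrative, not the paper's phagocyte system:

```python
import math
import random

def fit_exp(p1, p2):
    """Closed-form fit of y = a*exp(b*x) through two sample points."""
    (x1, y1), (x2, y2) = p1, p2
    b = math.log(y2 / y1) / (x2 - x1)
    a = y1 / math.exp(b * x1)
    return a, b

def nonlinear_ransac(points, n_iter=200, tol=0.5, seed=0):
    """Toy RANSAC for a nonlinear model: repeatedly fit from a minimal
    random sample and keep the parameters with the largest inlier set."""
    rng = random.Random(seed)
    best, best_inliers = None, -1
    for _ in range(n_iter):
        p1, p2 = rng.sample(points, 2)
        if p1[0] == p2[0]:
            continue  # same x: no unique exponential through the pair
        try:
            a, b = fit_exp(p1, p2)
        except (ValueError, ZeroDivisionError, OverflowError):
            continue  # e.g. negative ratio from an outlier sample
        inliers = sum(1 for x, y in points
                      if abs(a * math.exp(b * x) - y) < tol)
        if inliers > best_inliers:
            best, best_inliers = (a, b), inliers
    return best, best_inliers

# synthetic data: y = 2*exp(0.5*x) plus two gross outliers
pts = [(x, 2.0 * math.exp(0.5 * x)) for x in range(8)] + [(1, 40.0), (2, -10.0)]
(a, b), n_in = nonlinear_ransac(pts)
```

The outliers never enter a winning consensus set, which is the property that makes RANSAC-style estimation attractive when observations are scarce and noisy.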
"A Classification Approach for Risk Prognosis of Patients on Mechanical Ventricular Assistance."
Yajuan Wang, Carolyn Penstein Rosé, Antonio Ferreira, Dennis M. McNamara, Robert L. Kormos, James F. Antaki. DOI: 10.1109/ICMLA.2010.50. ICMLA 2010, pp. 293–298; published 2010-12-12.

The identification of optimal candidates for ventricular assist device (VAD) therapy is of great importance for future widespread application of this life-saving technology. In recent years, numerous traditional statistical models have been developed for this task. In this study, we compared three supervised machine learning techniques for risk prognosis of patients on VAD support: Decision Tree, Support Vector Machine (SVM), and Bayesian Tree-Augmented Network. A predictive (C4.5) decision tree model was ultimately developed from six features identified by the SVM with recursive feature elimination. The model outperformed the popular risk score of Lietz et al. in identifying high-risk patients and in differentiating the survival of high- and low-risk candidates earlier.
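The split-selection criterion behind C4.5-style trees is information gain (C4.5 proper normalizes it by the split information to get the gain ratio). A small, self-contained version with a toy example; the feature values below are hypothetical, not the six clinical features the paper selected:

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy of a label multiset, in bits."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def info_gain(rows, labels, feat):
    """Information gain of splitting on categorical feature `feat`:
    entropy of the labels minus the size-weighted entropy remaining
    after the split. C4.5-style trees pick splits by this family of
    criteria."""
    n = len(rows)
    groups = {}
    for row, lab in zip(rows, labels):
        groups.setdefault(row[feat], []).append(lab)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - remainder

# toy records: feature 0 separates the outcomes perfectly, feature 1 not at all
rows = [('low', 'x'), ('low', 'y'), ('high', 'x'), ('high', 'y')]
labels = ['survived', 'survived', 'died', 'died']
```

Here `info_gain(rows, labels, 0)` is 1.0 bit (a perfect split of a balanced binary outcome) while feature 1 gains nothing, which is exactly the ranking a tree learner needs.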
"The Upper and Lower Bounds of the Prediction Accuracies of Ensemble Methods for Binary Classification."
Xueyi Wang, Nicholas J. Davidson. DOI: 10.1109/ICMLA.2010.62. ICMLA 2010, pp. 373–378; published 2010-12-12.

Ensemble methods have been widely used to improve prediction accuracy over individual classifiers. In this paper, we establish several results about the prediction accuracies of ensemble methods for binary classification that previous literature has missed or misinterpreted. First, we derive the upper and lower bounds of the prediction accuracies (i.e., the best and worst possible prediction accuracies) of ensemble methods. Next, we show that an ensemble method can achieve a prediction accuracy above 0.5 even when every individual classifier has an accuracy below 0.5. Furthermore, for individual classifiers with different prediction accuracies, the average of the individual accuracies determines the upper and lower bounds. Two experiments verify these results and show that the bounds are hard to reach with random individual classifiers, so better algorithms need to be developed.
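The existence claim is easy to verify by construction. Below, three classifiers are each 40% accurate, yet their errors are arranged so that majority voting reaches 60%; the correctness pattern is ours, built only to illustrate the effect:

```python
def majority_vote(preds):
    """Majority vote across classifiers, instance by instance."""
    return [1 if sum(col) > len(col) / 2 else 0 for col in zip(*preds)]

truth = [1] * 10

# each classifier is correct on exactly 4 of 10 instances (accuracy 0.4),
# and the correct votes are packed in pairs onto instances 0-5
correct_sets = [{0, 1, 2, 3}, {0, 1, 4, 5}, {2, 3, 4, 5}]
preds = [[1 if i in s else 0 for i in range(10)] for s in correct_sets]

def accuracy(pred):
    return sum(p == t for p, t in zip(pred, truth)) / len(truth)

indiv = [accuracy(p) for p in preds]          # three accuracies below 0.5
ens = accuracy(majority_vote(preds))          # ensemble accuracy above 0.5
```

With three voters of mean accuracy 0.4 there are only 12 correct votes to distribute, and a majority needs 2 per instance, so at most 6 of the 10 instances can be won; the construction hits that ceiling exactly, which is the flavor of upper bound the paper studies.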
"Non-Alignment Features Based Enzyme/Non-Enzyme Classification Using an Ensemble Method."
Nicholas J. Davidson, Xueyi Wang. DOI: 10.1109/ICMLA.2010.167. ICMLA 2010, pp. 546–551; published 2010-12-12.

As a growing number of protein structures are resolved without known functions, computational methods for predicting protein function from structure become increasingly important. Some methods predict function by aligning to homologous proteins with known functions, but they fail when no such homology can be identified. In this paper, we classify enzymes/non-enzymes using non-alignment features. We propose a new ensemble method that combines three support vector machines (SVMs) and two k-nearest-neighbor (k-NN) classifiers under a simple majority voting rule. On a data set of 697 enzymes and 480 non-enzymes adapted from Dobson and Doig, it achieves 85.59% accuracy in 10-fold cross-validation and 86.49% accuracy in leave-one-out validation. The prediction accuracy is much better than that of other methods based on non-alignment features, and even slightly better than that of alignment-based methods. To our knowledge, this is the first use of ensemble methods for enzyme/non-enzyme classification, and the ensemble is superior to any of its single classifiers.
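Two ingredients of the evaluation, k-NN voting and leave-one-out validation, fit in a short stdlib sketch; the two-dimensional points below are placeholders for the paper's structural features:

```python
def knn_predict(train, query, k=3):
    """k-NN with squared Euclidean distance and majority label vote.
    `train` is a list of (feature_tuple, label) pairs."""
    ranked = sorted(train,
                    key=lambda p: sum((a - b) ** 2 for a, b in zip(p[0], query)))
    labels = [lab for _, lab in ranked[:k]]
    return max(set(labels), key=labels.count)

def leave_one_out_accuracy(data, k=3):
    """Leave-one-out validation: hold out each example in turn and
    classify it using all the others."""
    hits = 0
    for i, (x, y) in enumerate(data):
        rest = data[:i] + data[i + 1:]
        hits += knn_predict(rest, x, k) == y
    return hits / len(data)

# toy stand-in for structural feature vectors
data = [((0.0, 0.0), 'non-enzyme'), ((0.1, 0.2), 'non-enzyme'),
        ((0.2, 0.1), 'non-enzyme'), ((1.0, 1.0), 'enzyme'),
        ((0.9, 1.1), 'enzyme'), ((1.1, 0.9), 'enzyme')]
acc = leave_one_out_accuracy(data, k=3)
```

In the paper the k-NN classifiers are two of five voters; a heterogeneous majority vote over them and the SVMs follows the same counting logic as `knn_predict`'s label vote.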
"Empowering Simultaneous Feature and Instance Selection in Classification Problems through the Adaptation of Two Selection Algorithms."
R. D. Carmo, F. Freitas, J. Souza. DOI: 10.1109/ICMLA.2010.121. ICMLA 2010, pp. 793–796; published 2010-12-12.

This paper proposes a new approach to data selection, a key issue in classification problems. Built from one feature selection algorithm and one instance selection algorithm, the approach reduces the original dataset in two dimensions, selecting relevant features and retaining important instances simultaneously. The searches for the best feature subset and the best instance subset run separately, yet, because features influence the importance of instances and vice versa, they bias one another. Experiments validate the approach, showing that this relation between features and instances can be exploited when constructing data selection algorithms and that it yields a quality improvement over running the two algorithms sequentially.
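One plausible reading of the idea, sketched with deliberately simple criteria (class-mean gap for features, same-class nearest-neighbor distance for instance redundancy); the paper's actual two algorithms differ, and this only illustrates how the two searches can feed each other:

```python
def feature_scores(X, y, feats):
    """Score each feature by the gap between its class means,
    a simple filter criterion standing in for the paper's."""
    scores = {}
    for f in feats:
        pos = [x[f] for x, lab in zip(X, y) if lab == 1]
        neg = [x[f] for x, lab in zip(X, y) if lab == 0]
        scores[f] = abs(sum(pos) / len(pos) - sum(neg) / len(neg))
    return scores

def co_select(X, y, n_feats, n_inst):
    """Alternately drop the weakest feature and the most redundant
    instance; each step is scored on the other's current selection,
    so the two searches bias one another."""
    feats = list(range(len(X[0])))
    idx = list(range(len(X)))
    while len(feats) > n_feats or len(idx) > n_inst:
        if len(feats) > n_feats:
            s = feature_scores([X[i] for i in idx], [y[i] for i in idx], feats)
            feats.remove(min(feats, key=lambda f: s[f]))
        if len(idx) > n_inst:
            def redundancy(i):
                # distance to the closest same-class instance,
                # measured only over the surviving features
                return min(sum((X[i][f] - X[j][f]) ** 2 for f in feats)
                           for j in idx if j != i and y[j] == y[i])
            idx.remove(min(idx, key=redundancy))
    return feats, idx

# features 0 and 1 carry the class signal; feature 2 is noise;
# instances 0 and 1 are duplicates under the informative features
X = [(1.0, 1.0, 0.5), (1.0, 1.0, 0.4), (1.1, 0.9, 0.6),
     (0.0, 0.0, 0.5), (0.1, -0.1, 0.6), (-0.1, 0.1, 0.4)]
y = [1, 1, 1, 0, 0, 0]
feats, idx = co_select(X, y, n_feats=2, n_inst=5)
```

Note the coupling: the noise feature is pruned first, and only then is instance redundancy judged, so the duplicate pair is detected in the cleaned feature space.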
"Boundary Constrained Manifold Unfolding."
Bo Liu, Hongbin Zhang, Wenan Chen. DOI: 10.1109/ICMLA.2008.65. ICMLA 2008, pp. 174–181.
"Supervised Reinforcement Learning Using Behavior Models."
Victor Uc-Cetina. DOI: 10.1109/ICMLA.2007.102. ICMLA 2007, pp. 336–341; published 2007-12-13.

We introduce a supervised reinforcement learning (SRL) architecture for robot control problems with high-dimensional state spaces, and propose two new SRL algorithms based on it. In our algorithms, a behavior model learned from examples dynamically reduces the set of actions available in each state during the early reinforcement learning (RL) process. These action subsets lead the agent to exploit relevant parts of the action space and avoid selecting irrelevant actions. Once the agent has exploited the information provided by the behavior model, it keeps improving its value function unaided, selecting subsequent actions from the complete action space. Our experiments show clearly how this approach can dramatically speed up learning.
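A toy version of the idea on a six-state chain: Q-learning is restricted to the behavior model's suggested action during early episodes, then released to the full action set. The environment, behavior model, and hyperparameters here are ours, for illustration only:

```python
import random

N_STATES, GOAL = 6, 5
ACTIONS = [0, 1, 2, 3]          # action 1 moves right; the rest move left

def step(s, a):
    s2 = min(s + 1, GOAL) if a == 1 else max(s - 1, 0)
    return s2, (1.0 if s2 == GOAL else 0.0)

def train(allowed_early, n_early=200, n_late=300,
          alpha=0.5, gamma=0.9, eps=0.2, seed=0):
    """Q-learning where, for the first n_early episodes, action choice is
    restricted to the subset suggested by a behavior model; afterwards
    the full action set is explored."""
    rng = random.Random(seed)
    Q = [[0.0] * len(ACTIONS) for _ in range(N_STATES)]
    for ep in range(n_early + n_late):
        acts = allowed_early if ep < n_early else ACTIONS
        s = 0
        for _ in range(20):
            if rng.random() < eps:
                a = rng.choice(acts)                       # explore
            else:
                a = max(acts, key=lambda a: Q[s][a])       # exploit
            s2, r = step(s, a)
            Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
            s = s2
            if s == GOAL:
                break
    return Q

# behavior model learned from demonstrations suggests "move right"
Q = train(allowed_early=[1])
```

The restricted phase guarantees early episodes reach the goal and seed the value function; the later unrestricted phase can still revise the policy, matching the paper's two-stage scheme in miniature.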
"Improving Gene Expression Programming Performance by Using Differential Evolution."
Qiongyun Zhang, Chi Zhou, Weimin Xiao, Peter C. Nelson. DOI: 10.1109/ICMLA.2007.55. ICMLA 2007, pp. 31–37; published 2007-12-13.

Gene Expression Programming (GEP) is an evolutionary algorithm that combines the simple, linear, fixed-length chromosome of Genetic Algorithms (GAs) with the variably sized and shaped tree structures of Genetic Programming (GP). As with other GP algorithms, GEP has difficulty finding appropriate numeric constants for the terminal nodes of its expression trees. In this work, we describe a new approach to constant generation using Differential Evolution (DE), a real-valued genetic algorithm that is robust and efficient at parameter optimization. Experimental results on two symbolic regression problems show that the approach significantly improves the performance of GEP. The proposed approach extends easily to other Genetic Programming variants.
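The constant-tuning step can be sketched with a bare-bones DE/rand/1/bin loop: the expression structure is fixed (as if produced by GEP) and DE searches only the numeric constants. The expression, data, and control parameters are illustrative:

```python
import random

def de_optimize(loss, bounds, pop_size=20, n_gen=100, F=0.6, CR=0.9, seed=0):
    """Minimal differential evolution (DE/rand/1/bin) for tuning the
    numeric constants of a fixed expression tree."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    fit = [loss(ind) for ind in pop]
    for _ in range(n_gen):
        for i in range(pop_size):
            # mutate: base vector plus scaled difference of two others
            a, b, c = rng.sample([p for j, p in enumerate(pop) if j != i], 3)
            jr = rng.randrange(dim)  # at least one dimension crosses over
            trial = [a[d] + F * (b[d] - c[d])
                     if (rng.random() < CR or d == jr) else pop[i][d]
                     for d in range(dim)]
            f = loss(trial)
            if f <= fit[i]:          # greedy one-to-one selection
                pop[i], fit[i] = trial, f
    best = min(range(pop_size), key=fit.__getitem__)
    return pop[best], fit[best]

# suppose GEP produced the tree  c0 * x**2 + c1  and left c0, c1 to tune
xs = [i / 10 for i in range(-10, 11)]
target = [3.0 * x * x - 1.5 for x in xs]

def loss(c):
    return sum((c[0] * x * x + c[1] - t) ** 2 for x, t in zip(xs, target))

consts, err = de_optimize(loss, bounds=[(-10, 10), (-10, 10)])
```

Embedding such a loop as a local search inside GEP's generation cycle is the shape of the hybrid the paper evaluates: the tree search explores structures while DE polishes each structure's constants.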