首页 > 最新文献

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)最新文献

英文 中文
Multi-norm constrained optimization methods for calling copy number variants in single cell sequencing data 单细胞测序数据中拷贝数变量调用的多规范约束优化方法
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822511
Changsheng Zhang, Hongmin Cai, Jingying Huang, Bo Xu
The revolutionary invention of single-cell sequencing technology carves out a new way to delineate intra tumor heterogeneity and the evolution of single cells at the molecular level. Since single-cell sequencing requires a special genome amplification step to accumulate enough samples, a large number of bias were introduced, making the calling of copy number variants rather challenging. Accurately modeling this process and effectively detecting copy number variations (CNVs) are the major roadblock for single-cell sequencing data analysis. Recent advances manifested that the underlying copy numbers are corrupted by noise, which could be approximated by negative binomial distribution. In this paper, we formulated a general mathematical model for copy number reconstruction from read depth signal, and presented its two specific variants, namely Poisson-CNV and NB-CNV to catering for various reads distribution. Efficient numerical solution based on the classical alternating direction minimization method was designed to solve the proposed models. Extensive experiments on both synthetic datasets and empirical single-cell sequencing datasets were conducted to compare the performance of the two models. The results show that the proposed model of NB-CNV achieved superior performance in calling the CNV for single-cell sequencing data.
单细胞测序技术的革命性发明为在分子水平上描述肿瘤内异质性和单细胞进化开辟了新的途径。由于单细胞测序需要一个特殊的基因组扩增步骤来积累足够的样本,因此引入了大量的偏倚,使得拷贝数变异的调用相当具有挑战性。准确地模拟这一过程并有效地检测拷贝数变异(CNVs)是单细胞测序数据分析的主要障碍。最近的研究表明,潜在的拷贝数受到噪声的破坏,噪声可以近似为负二项分布。本文建立了从读取深度信号重构拷贝数的通用数学模型,并针对不同的读取分布,提出了该模型的两种具体变体泊松- cnv和NB-CNV。基于经典的交替方向最小化方法,设计了求解该模型的高效数值解。在合成数据集和经验单细胞测序数据集上进行了大量实验,比较了两种模型的性能。结果表明,所提出的NB-CNV模型在调用单细胞测序数据的CNV方面取得了优异的性能。
{"title":"Multi-norm constrained optimization methods for calling copy number variants in single cell sequencing data","authors":"Changsheng Zhang, Hongmin Cai, Jingying Huang, Bo Xu","doi":"10.1109/BIBM.2016.7822511","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822511","url":null,"abstract":"The revolutionary invention of single-cell sequencing technology carves out a new way to delineate intra tumor heterogeneity and the evolution of single cells at the molecular level. Since single-cell sequencing requires a special genome amplification step to accumulate enough samples, a large number of bias were introduced, making the calling of copy number variants rather challenging. Accurately modeling this process and effectively detecting copy number variations (CNVs) are the major roadblock for single-cell sequencing data analysis. Recent advances manifested that the underlying copy numbers are corrupted by noise, which could be approximated by negative binomial distribution. In this paper, we formulated a general mathematical model for copy number reconstruction from read depth signal, and presented its two specific variants, namely Poisson-CNV and NB-CNV to catering for various reads distribution. Efficient numerical solution based on the classical alternating direction minimization method was designed to solve the proposed models. Extensive experiments on both synthetic datasets and empirical single-cell sequencing datasets were conducted to compare the performance of the two models. The results show that the proposed model of NB-CNV achieved superior performance in calling the CNV for single-cell sequencing data.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116640622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A representational analysis of a temporal indeterminancy display in clinical events 临床事件中时间不确定性表现的代表性分析
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822673
M. Madkour, Hsing-yi Song, Jingcheng Du, Cui Tao
This paper describes a proposition for representing temporal indeterminacy in events from clinical narratives using fuzzy sets membership functions. This approach leverages both temporal and semantic information of events and has been proved by representational analysis evaluation method. We demonstrate that membership functions' graphs can be used for representing temporal approximation and granularity of events. We also show that this approach is helpful for the construction of fine timeline of clinical events, and can be used for calculating accurate metrics for ordering events.
本文描述了一个用模糊集隶属函数表示临床叙述事件时间不确定性的命题。该方法充分利用了事件的时间信息和语义信息,并通过表征分析评价方法得到了验证。我们证明了隶属函数图可以用来表示事件的时间逼近和粒度。我们还表明,该方法有助于构建临床事件的精细时间线,并可用于计算精确的事件排序指标。
{"title":"A representational analysis of a temporal indeterminancy display in clinical events","authors":"M. Madkour, Hsing-yi Song, Jingcheng Du, Cui Tao","doi":"10.1109/BIBM.2016.7822673","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822673","url":null,"abstract":"This paper describes a proposition for representing temporal indeterminacy in events from clinical narratives using fuzzy sets membership functions. This approach leverages both temporal and semantic information of events and has been proved by representational analysis evaluation method. We demonstrate that membership functions' graphs can be used for representing temporal approximation and granularity of events. We also show that this approach is helpful for the construction of fine timeline of clinical events, and can be used for calculating accurate metrics for ordering events.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116688591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Unconstrained optimization in projection method for indefinite SVMs 不定支持向量机投影法的无约束优化
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822585
Hao Jiang, W. Ching, Yushan Qiu, Xiaoqing Cheng
Positive semi-definiteness is a critical property in Support Vector Machine (SVM) methods to ensure efficient solutions through convex quadratic programming. In this paper, we introduce a projection matrix on indefinite kernels to formulate a positive semi-definite one. The proposed model can be regarded as a generalized version of the spectrum method (denoising method and flipping method) by varying parameter λ. In particular, our suggested optimal λ under the Bregman matrix divergence theory can be obtained using unconstrained optimization. Experimental results on 4 real world data sets ranging from glycan classification to cancer prediction show that the proposed model can achieve better or competitive performance when compared to the related indefinite kernel methods. This may suggest a new way in motif extractions or cancer predictions.
正半确定性是支持向量机方法中保证凸二次规划有效解的关键性质。本文引入不定核上的投影矩阵,从而得到一个正的半定投影矩阵。该模型可以看作是通过改变参数λ的谱法(去噪法和翻转法)的广义版本。特别地,我们建议的最优λ在Bregman矩阵散度理论下可以使用无约束优化得到。从聚糖分类到癌症预测的4个真实数据集的实验结果表明,与相关的不确定核方法相比,所提出的模型可以获得更好的或有竞争力的性能。这可能为基序提取或癌症预测提供一种新的方法。
{"title":"Unconstrained optimization in projection method for indefinite SVMs","authors":"Hao Jiang, W. Ching, Yushan Qiu, Xiaoqing Cheng","doi":"10.1109/BIBM.2016.7822585","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822585","url":null,"abstract":"Positive semi-definiteness is a critical property in Support Vector Machine (SVM) methods to ensure efficient solutions through convex quadratic programming. In this paper, we introduce a projection matrix on indefinite kernels to formulate a positive semi-definite one. The proposed model can be regarded as a generalized version of the spectrum method (denoising method and flipping method) by varying parameter λ. In particular, our suggested optimal λ under the Bregman matrix divergence theory can be obtained using unconstrained optimization. Experimental results on 4 real world data sets ranging from glycan classification to cancer prediction show that the proposed model can achieve better or competitive performance when compared to the related indefinite kernel methods. This may suggest a new way in motif extractions or cancer predictions.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116833785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Differential Co-Expression Networks using RNA-seq and microarrays in Alzheimer's disease 使用RNA-seq和微阵列的阿尔茨海默病差异共表达网络
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822811
Hyojin Kang, Junehawk Lee, S. Yu
Differential Co-Expression Networks (DCENs) are graphical representations of genes showing differential co-expression pattern in response to experimental conditions or genetic changes. They have been successfully applied to identify condition-specific modules and provide a picture of the dynamic changes in gene regulatory networks. DCENs analysis investigates the differences among gene interconnections by calculating the expression correlation change of each gene pair between conditions. In this study, we collected many different datasets from NCBI GEO including 25 RNA-seq and 2,102 microarray samples derived from human brain and blood in Alzheimer's disease and performed differential co-expression analyses to identify functional modules responsible for the characterization of Alzheimer's disease. The DCENs were generated using Pearson correlation coefficient and meta-analysis was conducted using rank-based method. The preliminary results show that the structural characteristics of DCENs can provide new insights into the underlying gene regulatory dynamics in Alzheimer's disease. There is low size overlap between microarray- and RNA-seq-derived DCENs however, DCENs from RNA-seq would complement ones from microarray due to the higher coverage and dynamic range of RNA-seq.
差异共表达网络(dcen)是基因在实验条件或遗传变化下表现出差异共表达模式的图形表示。它们已被成功地应用于识别条件特异性模块,并提供了基因调控网络动态变化的图片。dcns分析通过计算各基因对在不同条件下的表达相关性变化来研究基因互连之间的差异。在这项研究中,我们从NCBI GEO收集了许多不同的数据集,包括来自阿尔茨海默病患者的人类大脑和血液的25个RNA-seq和2102个微阵列样本,并进行了差异共表达分析,以确定负责表征阿尔茨海默病的功能模块。采用Pearson相关系数生成dcn,采用基于秩的方法进行meta分析。初步结果表明,dcns的结构特征可以为阿尔茨海默病的潜在基因调控动力学提供新的见解。微阵列和RNA-seq衍生的dccn之间存在较小的大小重叠,但由于RNA-seq的覆盖率和动态范围更高,来自RNA-seq的dccn可以补充来自微阵列的dccn。
{"title":"Differential Co-Expression Networks using RNA-seq and microarrays in Alzheimer's disease","authors":"Hyojin Kang, Junehawk Lee, S. Yu","doi":"10.1109/BIBM.2016.7822811","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822811","url":null,"abstract":"Differential Co-Expression Networks (DCENs) are graphical representations of genes showing differential co-expression pattern in response to experimental conditions or genetic changes. They have been successfully applied to identify condition-specific modules and provide a picture of the dynamic changes in gene regulatory networks. DCENs analysis investigates the differences among gene interconnections by calculating the expression correlation change of each gene pair between conditions. In this study, we collected many different datasets from NCBI GEO including 25 RNA-seq and 2,102 microarray samples derived from human brain and blood in Alzheimer's disease and performed differential co-expression analyses to identify functional modules responsible for the characterization of Alzheimer's disease. The DCENs were generated using Pearson correlation coefficient and meta-analysis was conducted using rank-based method. The preliminary results show that the structural characteristics of DCENs can provide new insights into the underlying gene regulatory dynamics in Alzheimer's disease. There is low size overlap between microarray- and RNA-seq-derived DCENs however, DCENs from RNA-seq would complement ones from microarray due to the higher coverage and dynamic range of RNA-seq.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131161465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Modular reconfiguration of metabolic brain networks in health and cancer: A resting-state PET study 健康和癌症中代谢脑网络的模块化重构:静息状态PET研究
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822665
Zhijun Yao, Bin Hu, Xuejiao Chen, Yuanwei Xie, Lei Fang
Recent studies suggested that cognitive impairments and memory difficulties in cancer survivors were associated with topology changes of brain network, particularly in terms of the functional and structural abnormalities. However, little is known about the modular reconfiguration of metabolic brain network among this population. In this study, we recruited 78 patients with pre-treatment cancer and 80 age- and gender-matched normal controls (NCs), and constructed the metabolic brain networks derived from resting-state 18F-fluorodeoxyglucose positron emission tomography (FDG-PET) to assess the alters of modularity pattern in cancer. The measurements of the participation index (PI) and mutual information (MI) were calculated for the cancer and NC groups. Compared with NC group, one module composed by the hippocampus, the amygdala and frontal and temporal regions was absented in cancer group. Moreover, cancer patients showed abnormal topology pattern in their metabolic networks (i.e., increased local efficiency and reduced global efficiency). Although node-wise PI shared positive correlated with normalized metabolism uptake in both groups, the more energy consumption were observed in metabolism network of cancer group that might be indicative of reduced capability of information processing. In addition, the between-group MIs were gradually increased over a range of thresholds. Our results suggested that modular pattern of the metabolic brain network seemed to re-shape its organization in cancer, which might uncover the neurobiological mechanisms underlying cancer-related cognitive dysfunction.
近年来的研究表明,癌症幸存者的认知障碍和记忆困难与大脑网络的拓扑变化有关,特别是在功能和结构方面的异常。然而,对这一人群中代谢脑网络的模块化重构知之甚少。在这项研究中,我们招募了78名治疗前的癌症患者和80名年龄和性别匹配的正常对照(nc),并构建了静息状态18f -氟脱氧葡萄糖正电子发射断层扫描(FDG-PET)衍生的代谢脑网络,以评估癌症中模块模式的变化。计算癌症组和NC组的参与指数(PI)和相互信息(MI)。与NC组相比,癌症组海马、杏仁核、额颞叶区组成的一个模块缺失。此外,癌症患者的代谢网络拓扑结构出现异常(局部效率提高,整体效率降低)。尽管两组的节点智慧PI与标准化代谢摄取呈正相关,但癌症组代谢网络中观察到的能量消耗越多,可能表明信息处理能力降低。此外,组间MIs在一定阈值范围内逐渐升高。我们的研究结果表明,代谢脑网络的模块化模式似乎在癌症中重塑了其组织,这可能揭示癌症相关认知功能障碍的神经生物学机制。
{"title":"Modular reconfiguration of metabolic brain networks in health and cancer: A resting-state PET study","authors":"Zhijun Yao, Bin Hu, Xuejiao Chen, Yuanwei Xie, Lei Fang","doi":"10.1109/BIBM.2016.7822665","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822665","url":null,"abstract":"Recent studies suggested that cognitive impairments and memory difficulties in cancer survivors were associated with topology changes of brain network, particularly in terms of the functional and structural abnormalities. However, little is known about the modular reconfiguration of metabolic brain network among this population. In this study, we recruited 78 patients with pre-treatment cancer and 80 age- and gender-matched normal controls (NCs), and constructed the metabolic brain networks derived from resting-state 18F-fluorodeoxyglucose positron emission tomography (FDG-PET) to assess the alters of modularity pattern in cancer. The measurements of the participation index (PI) and mutual information (MI) were calculated for the cancer and NC groups. Compared with NC group, one module composed by the hippocampus, the amygdala and frontal and temporal regions was absented in cancer group. Moreover, cancer patients showed abnormal topology pattern in their metabolic networks (i.e., increased local efficiency and reduced global efficiency). Although node-wise PI shared positive correlated with normalized metabolism uptake in both groups, the more energy consumption were observed in metabolism network of cancer group that might be indicative of reduced capability of information processing. In addition, the between-group MIs were gradually increased over a range of thresholds. Our results suggested that modular pattern of the metabolic brain network seemed to re-shape its organization in cancer, which might uncover the neurobiological mechanisms underlying cancer-related cognitive dysfunction.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132394965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Effects of propafenone on KCNH2-linked short QT syndrome: A modelling study 普罗帕酮对kcnh2相关短QT综合征的影响:一项模型研究
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822744
Cunjin Luo, Kuanquan Wang, Henggui Zhang
The identified genetic short QT syndrome (SQTS) is associated with an increased risk of arrhythmia and sudden death. This study was to investigate the potential effects of propafenone on KCNH2-linked short QT syndrome (SQT1) using a multi-scale biophysically detailed model of the heart developed by ten Tusscher and Panfilov. The ion electrical conductivities were reduced by propafenone in order to simulate the pharmacological effects in healthy and SQT1 cells. Based on the experimental data of McPate et al., the pharmacological effect of propafenone was modelled by dose-dependent IKr blocking. Action potential (AP) profiles and 1D tissue level were analyzed to predict the effects of propafenone on SQT1. Both low- and high- dose of propafenone prolonged APD and QT interval in SQT1 cells. It suggests the superior efficacy of high dose of propafenone on SQT1. However, propafenone did not significantly alter the healthy APD or QT interval at low dose, whereas markedly shortened them at high dose. Our simulation data show that propafenone has a dose-dependently anti-arrhythmic effect on SQT1, and a pro-arrhythmic effect on healthy cells. These computer simulations help to better understand the underlying mechanisms responsible for the initiation or termination of arrhythmias in healthy or SQT1 patients using propafenone.
已确定的遗传性短QT综合征(SQTS)与心律失常和猝死的风险增加有关。本研究旨在利用ten Tusscher和Panfilov建立的心脏多尺度生物物理详细模型,探讨普罗帕酮对kcnh2相关的短QT综合征(SQT1)的潜在影响。为了模拟正常细胞和SQT1细胞的药理作用,普罗帕酮降低了离子电导率。基于McPate等人的实验数据,采用剂量依赖性IKr阻断法模拟普罗帕酮的药理作用。通过分析动作电位(AP)谱和1D组织水平来预测普罗帕酮对SQT1的影响。低、高剂量普罗帕酮均可延长SQT1细胞APD和QT间期。提示大剂量普罗帕酮治疗SQT1疗效优越。然而,普罗帕酮在低剂量时没有显著改变APD或QT间期,而在高剂量时明显缩短。我们的模拟数据显示,普罗帕酮对SQT1具有剂量依赖性的抗心律失常作用,对健康细胞具有促心律失常作用。这些计算机模拟有助于更好地理解使用普罗帕酮的健康或SQT1患者心律失常发生或终止的潜在机制。
{"title":"Effects of propafenone on KCNH2-linked short QT syndrome: A modelling study","authors":"Cunjin Luo, Kuanquan Wang, Henggui Zhang","doi":"10.1109/BIBM.2016.7822744","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822744","url":null,"abstract":"The identified genetic short QT syndrome (SQTS) is associated with an increased risk of arrhythmia and sudden death. This study was to investigate the potential effects of propafenone on KCNH2-linked short QT syndrome (SQT1) using a multi-scale biophysically detailed model of the heart developed by ten Tusscher and Panfilov. The ion electrical conductivities were reduced by propafenone in order to simulate the pharmacological effects in healthy and SQT1 cells. Based on the experimental data of McPate et al., the pharmacological effect of propafenone was modelled by dose-dependent IKr blocking. Action potential (AP) profiles and 1D tissue level were analyzed to predict the effects of propafenone on SQT1. Both low- and high- dose of propafenone prolonged APD and QT interval in SQT1 cells. It suggests the superior efficacy of high dose of propafenone on SQT1. However, propafenone did not significantly alter the healthy APD or QT interval at low dose, whereas markedly shortened them at high dose. Our simulation data show that propafenone has a dose-dependently anti-arrhythmic effect on SQT1, and a pro-arrhythmic effect on healthy cells. These computer simulations help to better understand the underlying mechanisms responsible for the initiation or termination of arrhythmias in healthy or SQT1 patients using propafenone.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132437700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
COLT: COnstrained Lineage Tree Generation from sequence data 从序列数据生成约束谱系树
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822500
Keke Chen, Venkata Sai Abhishek Gogu, Di Wu, Jiang Ning
Lineage analysis has been an important method for understanding the mutation patterns and the diversity of genes, such as antibodies. A mutation lineage is typically represented as a tree structure, describing the possible mutation paths. Generating lineage trees from sequence data imposes two unique challenges: (1) Types of constraints might be defined on top of sequence data and tree structures, which have to be appropriately formulated and maintained by the algorithms. (2) Enumerating all possible trees that satisfy constraints is typically computationally intractable. In this paper, we present a COnstrained Lineage Tree generation framework (COLT) that builds lineage trees from sequences, based on local and global constraints specified by domain experts and heuristics derived from the mutation processes. Our formal analysis and experimental results show that this framework can efficiently generate valid lineage trees, while strictly satisfying the constraints specified by domain experts.
谱系分析已成为了解基因(如抗体)突变模式和多样性的重要方法。突变谱系通常以树状结构表示,描述可能的突变路径。从序列数据中生成谱系树有两个独特的挑战:(1)约束类型可能在序列数据和树结构之上定义,这些约束类型必须由算法适当地表述和维护。(2)枚举满足约束条件的所有可能的树在计算上通常是难以处理的。在本文中,我们提出了一个约束谱系树生成框架(COLT),该框架基于领域专家指定的局部和全局约束以及来自突变过程的启发式,从序列中构建谱系树。形式分析和实验结果表明,该框架能够有效地生成有效的谱系树,同时严格满足领域专家指定的约束条件。
{"title":"COLT: COnstrained Lineage Tree Generation from sequence data","authors":"Keke Chen, Venkata Sai Abhishek Gogu, Di Wu, Jiang Ning","doi":"10.1109/BIBM.2016.7822500","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822500","url":null,"abstract":"Lineage analysis has been an important method for understanding the mutation patterns and the diversity of genes, such as antibodies. A mutation lineage is typically represented as a tree structure, describing the possible mutation paths. Generating lineage trees from sequence data imposes two unique challenges: (1) Types of constraints might be defined on top of sequence data and tree structures, which have to be appropriately formulated and maintained by the algorithms. (2) Enumerating all possible trees that satisfy constraints is typically computationally intractable. In this paper, we present a COnstrained Lineage Tree generation framework (COLT) that builds lineage trees from sequences, based on local and global constraints specified by domain experts and heuristics derived from the mutation processes. Our formal analysis and experimental results show that this framework can efficiently generate valid lineage trees, while strictly satisfying the constraints specified by domain experts.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"104 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132477300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
GIDAC: A prototype for bioimages annotation and clinical data integration GIDAC:生物图像注释和临床数据整合的原型
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822663
P. Vizza, P. Guzzi, P. Veltri, G. Cascini, R. Curia, Loredana Sisca
The analysis of bioimages and their correlated clinical patient information allows to investigate specific diseases and define the corresponding medical protocols. To perform a correct diagnosis and apply a precise therapy, bioimages must be collected and studied together with others relevant data as well as laboratory results, medical annotations and patient history. Today, the management of these data is performed by single systems inside hospital departments that often do not provide dedicated data integration platforms among different departments as well as different health structures to exchange of relevant clinical information. Also, images cannot be annotated or enriched by physicians to trace temporal studies for patients or even among patients with similar diseases. In this contribution, we report the results of a research project called GIDAC (standing for Gestione Integrata DAti Clinici) that aims to define a general purpose framework for the bioimages management and annotations as well as clinical data view and integration in a simple-to-use information system. The proposed framework does not substitute any existing clinical information system but is able in gathering and integrating data by using a XML-based module. The novelty also consists in allowing annotations on DICOM images by means of simple user-interface to take trace of changes intra images as well as comparisons among patients. This system supports oncologists in the management of DICOM images from different devices (e.g., ecograph or PACS) to extract relevant information necessary to query (annotate) images and study similar clinical cases.
生物图像及其相关临床患者信息的分析允许调查特定疾病并确定相应的医疗方案。为了进行正确的诊断和应用精确的治疗,必须收集生物图像并与其他相关数据以及实验室结果、医学注释和患者病史一起研究。目前,这些数据的管理是由医院部门内部的单一系统完成的,这些系统往往没有在不同部门和不同医疗机构之间提供专用的数据集成平台来交换相关的临床信息。此外,医生无法对图像进行注释或丰富,以追踪患者甚至患有类似疾病的患者的时间研究。在这篇文章中,我们报告了一个名为GIDAC (Gestione Integrata DAti Clinici)的研究项目的结果,该项目旨在定义一个通用框架,用于生物图像管理和注释,以及临床数据视图和集成在一个简单易用的信息系统中。该框架不替代任何现有的临床信息系统,而是能够使用基于xml的模块收集和集成数据。其新颖之处还在于允许通过简单的用户界面对DICOM图像进行注释,以跟踪图像内的变化以及患者之间的比较。该系统支持肿瘤学家管理来自不同设备(如ecograph或PACS)的DICOM图像,以提取查询(注释)图像和研究类似临床病例所需的相关信息。
{"title":"GIDAC: A prototype for bioimages annotation and clinical data integration","authors":"P. Vizza, P. Guzzi, P. Veltri, G. Cascini, R. Curia, Loredana Sisca","doi":"10.1109/BIBM.2016.7822663","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822663","url":null,"abstract":"The analysis of bioimages and their correlated clinical patient information allows to investigate specific diseases and define the corresponding medical protocols. To perform a correct diagnosis and apply a precise therapy, bioimages must be collected and studied together with others relevant data as well as laboratory results, medical annotations and patient history. Today, the management of these data is performed by single systems inside hospital departments that often do not provide dedicated data integration platforms among different departments as well as different health structures to exchange of relevant clinical information. Also, images cannot be annotated or enriched by physicians to trace temporal studies for patients or even among patients with similar diseases. In this contribution, we report the results of a research project called GIDAC (standing for Gestione Integrata DAti Clinici) that aims to define a general purpose framework for the bioimages management and annotations as well as clinical data view and integration in a simple-to-use information system. The proposed framework does not substitute any existing clinical information system but is able in gathering and integrating data by using a XML-based module. The novelty also consists in allowing annotations on DICOM images by means of simple user-interface to take trace of changes intra images as well as comparisons among patients. This system supports oncologists in the management of DICOM images from different devices (e.g., ecograph or PACS) to extract relevant information necessary to query (annotate) images and study similar clinical cases.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132232367","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Weighted multiview learning for predicting drug-disease associations 用于预测药物-疾病关联的加权多视图学习
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822603
S. N. Chandrasekaran, Jun Huan
The paradigm of drug discovery has moved from finding new drugs that exhibit therapeutic properties for a disease to reusing existing approved drugs for a newer disease. The association between a drug and a disease involves a complex network of targets and pathways. In order to provide new insights, there has been a constant need for sophisticated tools that have the potential to discover new associations from the underlying drugs-disease interactions. In addition to computational tools, there has been an explosion of data available in terms of drugs, disease and their activity profiles. On one hand, researchers have been using existing machine learning tools that have shown great promise in predicting associations but on the other hand there has been a void in exploiting advance machine learning frameworks to handle this kind of data integration. In this paper, we propose a learning framework called weighted multi-view learning that is a variant of the Multi-view learning framework in which the views are assumed to contribute equally to the prediction whereas our method learns a weight for each view since we hypothesize that certain views might have better prediction capability than others.
药物发现的范式已经从寻找对某种疾病具有治疗特性的新药转变为重新使用现有的已批准药物治疗一种新疾病。药物和疾病之间的联系涉及一个复杂的靶点和途径网络。为了提供新的见解,一直需要有可能从潜在的药物-疾病相互作用中发现新的关联的复杂工具。除了计算工具之外,关于药物、疾病及其活动概况的可用数据也出现了爆炸式增长。一方面,研究人员一直在使用现有的机器学习工具,这些工具在预测关联方面显示出很大的希望,但另一方面,在利用先进的机器学习框架来处理这种数据集成方面一直存在空白。在本文中,我们提出了一种称为加权多视图学习的学习框架,它是多视图学习框架的一种变体,其中假设视图对预测的贡献相同,而我们的方法为每个视图学习权重,因为我们假设某些视图可能比其他视图具有更好的预测能力。
{"title":"Weighted multiview learning for predicting drug-disease associations","authors":"S. N. Chandrasekaran, Jun Huan","doi":"10.1109/BIBM.2016.7822603","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822603","url":null,"abstract":"The paradigm of drug discovery has moved from finding new drugs that exhibit therapeutic properties for a disease to reusing existing approved drugs for a newer disease. The association between a drug and a disease involves a complex network of targets and pathways. In order to provide new insights, there has been a constant need for sophisticated tools that have the potential to discover new associations from the underlying drugs-disease interactions. In addition to computational tools, there has been an explosion of data available in terms of drugs, disease and their activity profiles. On one hand, researchers have been using existing machine learning tools that have shown great promise in predicting associations but on the other hand there has been a void in exploiting advance machine learning frameworks to handle this kind of data integration. In this paper, we propose a learning framework called weighted multi-view learning that is a variant of the Multi-view learning framework in which the views are assumed to contribute equally to the prediction whereas our method learns a weight for each view since we hypothesize that certain views might have better prediction capability than others.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"100 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134190462","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Multi-label classification for intelligent health risk prediction 智能健康风险预测的多标签分类
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822657
Runzhi Li, Hongling Zhao, Yusong Lin, Andrew S. Maxwell, Chaoyang Zhang
A Multi-Label Problem Transformation Joint Classification (MLPTJC) method is developed to solve the multi-label classification problem for the health and disease risk prediction based on physical examination records. We adopt a multi-class classification problem transformation method to transform the multi-label classification problem to a multi-class classification problem. Then We propose a Joint Decomposition Subset Classifier method to reduce the infrequent label sets to deal with the imbalance learning problem. Based on MLPTJC, existing cost-sensitive multi-class classification algorithms can be used to train the prediction models. We conduct some experiments to evaluate the performance of the MLPTJC method. The Support Vector Machine (SVM) and Random Forest (RF) algorithms are used for multi-class classification learning. We use the 10-fold cross-validation and metrics such as Average Accuracy, Precision, Recall and F-measure to evaluate the performance. The real physical examination records were employed, which include 62 examination items and 110, 300 anonymous patients. 8 types of diseases were predicted. The experimental results show that the MLPTJC method has better performance in terms of accuracy.
针对基于体检记录的健康与疾病风险预测的多标签分类问题,提出了一种多标签问题转换联合分类方法(MLPTJC)。我们采用多类分类问题转换方法,将多标签分类问题转化为多类分类问题。然后,我们提出了一种联合分解子集分类器方法来减少不频繁的标签集,以解决不平衡学习问题。基于MLPTJC,现有的代价敏感多类分类算法可用于训练预测模型。我们进行了一些实验来评估MLPTJC方法的性能。采用支持向量机(SVM)和随机森林(RF)算法进行多类分类学习。我们使用10倍交叉验证和指标,如平均准确度,精度,召回率和F-measure来评估性能。采用真实体检记录,共62项检查项目,匿名患者110300人。预测了8种疾病。实验结果表明,MLPTJC方法在精度方面具有较好的性能。
{"title":"Multi-label classification for intelligent health risk prediction","authors":"Runzhi Li, Hongling Zhao, Yusong Lin, Andrew S. Maxwell, Chaoyang Zhang","doi":"10.1109/BIBM.2016.7822657","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822657","url":null,"abstract":"A Multi-Label Problem Transformation Joint Classification (MLPTJC) method is developed to solve the multi-label classification problem for the health and disease risk prediction based on physical examination records. We adopt a multi-class classification problem transformation method to transform the multi-label classification problem to a multi-class classification problem. Then We propose a Joint Decomposition Subset Classifier method to reduce the infrequent label sets to deal with the imbalance learning problem. Based on MLPTJC, existing cost-sensitive multi-class classification algorithms can be used to train the prediction models. We conduct some experiments to evaluate the performance of the MLPTJC method. The Support Vector Machine (SVM) and Random Forest (RF) algorithms are used for multi-class classification learning. We use the 10-fold cross-validation and metrics such as Average Accuracy, Precision, Recall and F-measure to evaluate the performance. The real physical examination records were employed, which include 62 examination items and 110, 300 anonymous patients. 8 types of diseases were predicted. The experimental results show that the MLPTJC method has better performance in terms of accuracy.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134600494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
期刊
2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1