首页 > 最新文献

2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)最新文献

英文 中文
Overcoming drug resistance by co-targeting 通过共同靶向克服耐药性
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706562
M. Ayati, Golnaz Taheri, S. Arab, L. Wong, C. Eslahchi
Removal or suppression of key proteins in an essential pathway of a pathogen is expected to disrupt the pathway and prohibit the pathogen from performing a vital function. Thus disconnecting multiple essential pathways should disrupt the survival of a pathogen even when it has multiple pathways to drug resistance. We consider a scenario where the drug-resistance pathways are unknown. To disrupt these pathways, we consider a cut set S of G, where G is a connected simple graph representing the protein interaction network of the pathogen, so that G-S splits to two partitions such that the endpoints of each pathway are in different partitions. If the difference between the sizes of the two partitions is high, the probability of existence of a functioning pathway in one partition is increased. Thus, we need to partition the graph into two balanced partitions. We approximate the balanced bipartitioning problem with spectral bipartitioning since finding (2,1)-separator is NP-complete. We test our technique on E. coli and C. jejuni. We show that over 50% of genes in the cut sets are essential. Moreover, all proteins in the cut sets have fundamental roles in cell and inhibition of each of them is harmful for cell survival. Also, 20% and 17% of known targets are in the vertex cut of E. coli and C. jejuni. Hence our approach has produced plausible “co-targets” whose inhibition should counter a pathogen's drug resistance.
去除或抑制病原体基本途径中的关键蛋白质有望破坏该途径并禁止病原体执行重要功能。因此,切断多种基本途径应该会破坏病原体的生存,即使它有多种产生耐药性的途径。我们考虑一种耐药性途径未知的情况。为了破坏这些通路,我们考虑G的切集S,其中G是表示病原体蛋白质相互作用网络的连通简单图,因此G-S分裂为两个分区,使得每个通路的端点位于不同的分区。如果两个分区之间的大小差异很大,则在一个分区中存在有效路径的概率会增加。因此,我们需要将图划分为两个平衡的分区。由于找到(2,1)-分离器是np完全的,我们用谱双分区近似平衡双分区问题。我们在大肠杆菌和空肠杆菌上测试了我们的技术。我们发现,切割集中超过50%的基因是必需的。此外,切割集中的所有蛋白质在细胞中都具有基础作用,对它们中的任何一种的抑制都对细胞存活有害。此外,20%和17%的已知靶点位于大肠杆菌和空肠杆菌的顶点切口。因此,我们的方法产生了似是而非的“共同靶点”,其抑制作用应该能够对抗病原体的耐药性。
{"title":"Overcoming drug resistance by co-targeting","authors":"M. Ayati, Golnaz Taheri, S. Arab, L. Wong, C. Eslahchi","doi":"10.1109/BIBM.2010.5706562","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706562","url":null,"abstract":"Removal or suppression of key proteins in an essential pathway of a pathogen is expected to disrupt the pathway and prohibit the pathogen from performing a vital function. Thus disconnecting multiple essential pathways should disrupt the survival of a pathogen even when it has multiple pathways to drug resistance. We consider a scenario where the drug-resistance pathways are unknown. To disrupt these pathways, we consider a cut set S of G, where G is a connected simple graph representing the protein interaction network of the pathogen, so that G-S splits to two partitions such that the endpoints of each pathway are in different partitions. If the difference between the sizes of the two partitions is high, the probability of existence of a functioning pathway in one partition is increased. Thus, we need to partition the graph into two balanced partitions. We approximate the balanced bipartitioning problem with spectral bipartitioning since finding (2,1)-separator is NP-complete. We test our technique on E. coli and C. jejuni. We show that over 50% of genes in the cut sets are essential. Moreover, all proteins in the cut sets have fundamental roles in cell and inhibition of each of them is harmful for cell survival. Also, 20% and 17% of known targets are in the vertex cut of E. coli and C. jejuni. Hence our approach has produced plausible “co-targets” whose inhibition should counter a pathogen's drug resistance.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123102193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Hierarchical decomposition of vessel skeletons for graph creation and feature extraction 用于图形创建和特征提取的血管骨架分层分解
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706609
K. Drechsler, C. O. Laura
Graphs are useful representations of the liver vas-culature. They support tree matching algorithms in landmark-based registration algorithms, they are useful to separate connected vessels from two different vessel systems and are the basis of vessel annotation tools. In this paper, we propose a hierarchical decomposition of vessel skeletons into sub-branches. This simplifies the process of creating labeled graphs and extracting features. Furthermore, we propose a measure to classify voxels as branch voxels. We applied our method to several datasets with satisfying results and found that the number of sub-branches is normal distributed under rotation.
图表是肝脏血管培养的有用表示。它们支持基于地标的配准算法中的树匹配算法,有助于从两个不同的船舶系统中分离连接的船舶,并且是船舶注释工具的基础。在本文中,我们提出了一种将血管骨架分层分解成子分支的方法。这简化了创建标记图和提取特征的过程。此外,我们提出了一种将体素分类为分支体素的方法。我们将该方法应用于多个数据集,结果令人满意,并且发现子分支的数量在旋转下是正态分布的。
{"title":"Hierarchical decomposition of vessel skeletons for graph creation and feature extraction","authors":"K. Drechsler, C. O. Laura","doi":"10.1109/BIBM.2010.5706609","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706609","url":null,"abstract":"Graphs are useful representations of the liver vas-culature. They support tree matching algorithms in landmark-based registration algorithms, they are useful to separate connected vessels from two different vessel systems and are the basis of vessel annotation tools. In this paper, we propose a hierarchical decomposition of vessel skeletons into sub-branches. This simplifies the process of creating labeled graphs and extracting features. Furthermore, we propose a measure to classify voxels as branch voxels. We applied our method to several datasets with satisfying results and found that the number of sub-branches is normal distributed under rotation.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123718607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Decomposing protein interactome networks by graph entropy 用图熵分解蛋白质相互作用组网络
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706633
Hao Lian, C. Song, Young-Rae Cho
Recent high-throughput experimental methods have generated protein-protein interaction data in the genome scale, called interactome. Various graph clustering algorithms have been applied to the protein interactome networks for identifying protein complexes and predicting functional modules. Although the previous algorithms are scalable and robust, their accuracy is still limited because of complex connectivity of the networks. In this study, we propose a novel information-theoretic definition, Graph Entropy, as a measure of structural complexity of a graph. Loss of graph entropy represents an increase in modularity of the graph. Based on this concept, we present a graph clustering algorithm. Starting from a random seed vertex and its neighbors as a seed cluster, the algorithm iteratively adds or removes vertices on the border of the cluster to minimize graph entropy. We make an additional improvement on the algorithm for generating overlapping clusters. In the experiments with the yeast protein interactome network, we show the graph entropy-based approach has higher accuracy in predicting functional modules than other competing methods.
最近的高通量实验方法产生了基因组尺度的蛋白质-蛋白质相互作用数据,称为相互作用组。各种图聚类算法已应用于蛋白质相互作用组网络,用于识别蛋白质复合物和预测功能模块。虽然现有的算法具有可扩展性和鲁棒性,但由于网络的复杂连通性,其精度仍然受到限制。在这项研究中,我们提出了一个新的信息论定义,图熵,作为一个图的结构复杂性的度量。图熵的损失表示图的模块化程度的增加。基于这一概念,我们提出了一种图聚类算法。该算法从随机的种子顶点及其相邻点作为种子聚类,迭代地增加或删除聚类边界上的顶点,使图熵最小化。我们对生成重叠聚类的算法做了额外的改进。在酵母蛋白相互作用组网络的实验中,我们表明基于图熵的方法在预测功能模块方面比其他竞争方法具有更高的准确性。
{"title":"Decomposing protein interactome networks by graph entropy","authors":"Hao Lian, C. Song, Young-Rae Cho","doi":"10.1109/BIBM.2010.5706633","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706633","url":null,"abstract":"Recent high-throughput experimental methods have generated protein-protein interaction data in the genome scale, called interactome. Various graph clustering algorithms have been applied to the protein interactome networks for identifying protein complexes and predicting functional modules. Although the previous algorithms are scalable and robust, their accuracy is still limited because of complex connectivity of the networks. In this study, we propose a novel information-theoretic definition, Graph Entropy, as a measure of structural complexity of a graph. Loss of graph entropy represents an increase in modularity of the graph. Based on this concept, we present a graph clustering algorithm. Starting from a random seed vertex and its neighbors as a seed cluster, the algorithm iteratively adds or removes vertices on the border of the cluster to minimize graph entropy. We make an additional improvement on the algorithm for generating overlapping clusters. In the experiments with the yeast protein interactome network, we show the graph entropy-based approach has higher accuracy in predicting functional modules than other competing methods.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"63 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116259630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
An Iterated Conditional Modes solution for sparse Bayesian factor modeling of transcriptional regulatory networks 转录调控网络稀疏贝叶斯因子建模的迭代条件模式解
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706587
Jia Meng, Jianqiu Zhang, Yidong Chen, Yufei Huang
The problem of uncovering transcriptional regulation by transcription factors (TFs) based on microarray data is considered. A novel Bayesian sparse correlated rectified factor model (BSCRFM) coupled with its ICM solution is proposed. BSCRFM models the unknown TF protein level activity, the correlated regulations between TFs, and the sparse nature of TF regulated genes and it admits prior knowledge from existing database regarding TF regulated target genes. An efficient Iterated Conditional Modes (ICM) algorithm is developed, and a maximum a posterior (MAP) solution is calculated from multiple ICM results to avoid the local maximum problem, a context-specific transcriptional regulatory network specific to the experimental condition of the microarray data can then be obtained. The proposed model's ICM algorithm and MAP solution are evaluated on the simulated systems and results demonstrated the validity and effectiveness of the proposed approach. The proposed model is also applied to the breast cancer microarray data and a TF regulated network is obtained.
研究了基于微阵列数据揭示转录因子转录调控的问题。提出一种新的贝叶斯稀疏相关校正因子模型(BSCRFM)及其ICM解。BSCRFM对未知的TF蛋白水平活性、TF之间的相关调控以及TF调控基因的稀疏性进行建模,并从现有数据库中获取有关TF调控靶基因的先验知识。提出了一种高效的迭代条件模式(ICM)算法,并从多个ICM结果中计算最大后验(MAP)解,以避免局部极大值问题,从而获得特定于微阵列数据实验条件的上下文特异性转录调控网络。在仿真系统上对所提模型的ICM算法和MAP解进行了评估,结果证明了所提方法的有效性。该模型还应用于乳腺癌微阵列数据,并获得了TF调节网络。
{"title":"An Iterated Conditional Modes solution for sparse Bayesian factor modeling of transcriptional regulatory networks","authors":"Jia Meng, Jianqiu Zhang, Yidong Chen, Yufei Huang","doi":"10.1109/BIBM.2010.5706587","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706587","url":null,"abstract":"The problem of uncovering transcriptional regulation by transcription factors (TFs) based on microarray data is considered. A novel Bayesian sparse correlated rectified factor model (BSCRFM) coupled with its ICM solution is proposed. BSCRFM models the unknown TF protein level activity, the correlated regulations between TFs, and the sparse nature of TF regulated genes and it admits prior knowledge from existing database regarding TF regulated target genes. An efficient Iterated Conditional Modes (ICM) algorithm is developed, and a maximum a posterior (MAP) solution is calculated from multiple ICM results to avoid the local maximum problem, a context-specific transcriptional regulatory network specific to the experimental condition of the microarray data can then be obtained. The proposed model's ICM algorithm and MAP solution are evaluated on the simulated systems and results demonstrated the validity and effectiveness of the proposed approach. The proposed model is also applied to the breast cancer microarray data and a TF regulated network is obtained.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121528279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring a multi-source fusion approach for genomics information retrieval 探索基因组学信息检索的多源融合方法
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706649
Qinmin Hu, Xiangji Huang, Jun Miao
In this paper, we focus on the biomedicine domain to propose a multi-source fusion approach for improving information retrieval performance. First, we consider a common scenario for a metasearch system that has access to multiple baselines with retrieving and ranking documents/passages by their own models. Second, given selected baselines from multiple sources, we employ two modified fusion rules in the proposed approach, reciprocal and combMNZ, to rerank the candidates as the output for evaluation. Third, our empirical study on both 2007 and 2006 genomics data sets demonstrates the viability of the proposed approach to better performance fusion. Fourth, the experimental results show that the reciprocal method provides notable improvements on the individual baseline, especially on the effective passage MAP, the passage2-level and the diversity MAP, the aspect-level.
本文以生物医学领域为研究对象,提出了一种多源信息融合方法来提高信息检索性能。首先,我们考虑一个元搜索系统的常见场景,该系统可以访问多个基线,并通过自己的模型检索和排序文档/段落。其次,给定来自多个来源的选定基线,我们在所提出的方法中使用两个改进的融合规则,互惠和组合,将候选基线重新排序作为评估的输出。第三,我们对2007年和2006年基因组学数据集的实证研究表明,该方法具有更好的性能融合的可行性。第四,实验结果表明,互反方法在个体基线上有显著的改进,特别是在有效通道MAP(通道2级)和多样性MAP(方面级)上。
{"title":"Exploring a multi-source fusion approach for genomics information retrieval","authors":"Qinmin Hu, Xiangji Huang, Jun Miao","doi":"10.1109/BIBM.2010.5706649","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706649","url":null,"abstract":"In this paper, we focus on the biomedicine domain to propose a multi-source fusion approach for improving information retrieval performance. First, we consider a common scenario for a metasearch system that has access to multiple baselines with retrieving and ranking documents/passages by their own models. Second, given selected baselines from multiple sources, we employ two modified fusion rules in the proposed approach, reciprocal and combMNZ, to rerank the candidates as the output for evaluation. Third, our empirical study on both 2007 and 2006 genomics data sets demonstrates the viability of the proposed approach to better performance fusion. Fourth, the experimental results show that the reciprocal method provides notable improvements on the individual baseline, especially on the effective passage MAP, the passage2-level and the diversity MAP, the aspect-level.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"2009 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125647373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Prediction of low coverage prone regions for Illumina sequencing projects using a support vector machine 利用支持向量机预测Illumina测序项目的低覆盖易发区域
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706527
Zejun Zheng, B. Schmidt, G. Bourque
Applications of next-generation sequencing technologies have the potential to bring revolutionary changes to medicine and biology. However, coverage bias can pose a challenge to short read data analysis tools, which rely on high coverage. To address this issue we have developed a support vector machine (SVM) based method for predicting low coverage prone (LCP) regions on a given genome. The developed SVM-based prediction of LCP regions on a given genome can assist data processing procedures based on Illumina sequencing technology, such as de novo sequencing and transcriptome analysis.
下一代测序技术的应用有可能给医学和生物学带来革命性的变化。然而,覆盖率偏差会对依赖于高覆盖率的短读数据分析工具构成挑战。为了解决这个问题,我们开发了一种基于支持向量机(SVM)的方法来预测给定基因组上的低覆盖易发区(LCP)。基于支持向量机的LCP区域预测可以帮助基于Illumina测序技术的数据处理程序,如从头测序和转录组分析。
{"title":"Prediction of low coverage prone regions for Illumina sequencing projects using a support vector machine","authors":"Zejun Zheng, B. Schmidt, G. Bourque","doi":"10.1109/BIBM.2010.5706527","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706527","url":null,"abstract":"Applications of next-generation sequencing technologies have the potential to bring revolutionary changes to medicine and biology. However, coverage bias can pose a challenge to short read data analysis tools, which rely on high coverage. To address this issue we have developed a support vector machine (SVM) based method for predicting low coverage prone (LCP) regions on a given genome. The developed SVM-based prediction of LCP regions on a given genome can assist data processing procedures based on Illumina sequencing technology, such as de novo sequencing and transcriptome analysis.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121745490","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Utilizing Cox regression model to assess the relations between predefined gene sets and the survival outcome of lung adenocarcinoma 利用Cox回归模型评估预定义基因集与肺腺癌生存结局的关系
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706566
Jo-Yang Lu, E. Chuang, C. K. Hsiao, M. Tsai, L. Lai, Pei-Chun Chen
The risks of relapse for lung adenocarcinoma patients were still higher than 30%, even after complete surgical resections in early stages. Although lots of prognosis studies using genome-wide profiling had been published, biological meaning and interactions among the prognostic genes were poorly understood. Therefore, we developed a novel method integrating gene set enrichment analysis and Cox-hazard regression model to investigate the relations between predefined gene sets and the survival outcome in lung cancer. The method was able to select gene sets associated with the survival outcome, clustering of the prognostic genes sets, and selection of a representative gene set from each cluster. Furthermore, kernel matrix was used to visualize the similarities between those representative gene sets. In addition to survival outcome, our method can also use other continuous variables to explore other biological interpretation concealed in the predefined gene sets.
肺腺癌患者的复发风险仍高于30%,即使在早期完全手术切除后。尽管已经发表了许多使用全基因组分析的预后研究,但对预后基因的生物学意义和相互作用知之甚少。因此,我们开发了一种结合基因集富集分析和Cox-hazard回归模型的新方法来研究预定义基因集与肺癌生存结局的关系。该方法能够选择与生存结果相关的基因集,对预后基因集进行聚类,并从每个聚类中选择具有代表性的基因集。此外,核矩阵用于可视化这些代表性基因集之间的相似性。除了生存结果,我们的方法还可以使用其他连续变量来探索隐藏在预定义基因集中的其他生物学解释。
{"title":"Utilizing Cox regression model to assess the relations between predefined gene sets and the survival outcome of lung adenocarcinoma","authors":"Jo-Yang Lu, E. Chuang, C. K. Hsiao, M. Tsai, L. Lai, Pei-Chun Chen","doi":"10.1109/BIBM.2010.5706566","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706566","url":null,"abstract":"The risks of relapse for lung adenocarcinoma patients were still higher than 30%, even after complete surgical resections in early stages. Although lots of prognosis studies using genome-wide profiling had been published, biological meaning and interactions among the prognostic genes were poorly understood. Therefore, we developed a novel method integrating gene set enrichment analysis and Cox-hazard regression model to investigate the relations between predefined gene sets and the survival outcome in lung cancer. The method was able to select gene sets associated with the survival outcome, clustering of the prognostic genes sets, and selection of a representative gene set from each cluster. Furthermore, kernel matrix was used to visualize the similarities between those representative gene sets. In addition to survival outcome, our method can also use other continuous variables to explore other biological interpretation concealed in the predefined gene sets.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132857419","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Concurrent analysis of copy number variations and expression profiles to identify genes associated with tumorigenesis and survival outcome in lung adenocarcinoma 拷贝数变异和表达谱的并发分析,以确定与肺腺癌肿瘤发生和生存结果相关的基因
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706578
T. Lu, L. Lai, C. K. Hsiao, Pei-Chun Chen, M. Tsai, E. Chuang
Lung cancer has been one of the major causes of cancer-related death worldwide. To predict survival outcomes of lung cancer patients, many prognosis gene sets were identified by using gene expression microarrays. However, these gene sets were often inconsistent across independent cohorts. To identify genes with more consistency, we combined gene expression and copy number variations (CNVs). Affymetrix SNP 6.0 and u133plus2.0 microarrays were performed on 42 pairs of lung adenocarcinoma patients. The copy number varied regions (CNVR) existed in more than 30% samples were identified and 475 differentially expressed genes with concordant changes were selected for pathway analysis. Thirteen pathways were significantly enriched among the 475 CNV-associated genes, and survival analyses showed these pathways had generally consistent and significant prediction probabilities across three independent microarray studies. Therefore, integration between gene expression and copy number may help to lower false discovery rate and identify genes used to predict survival outcomes.
肺癌一直是全球癌症相关死亡的主要原因之一。为了预测肺癌患者的生存结果,许多预后基因集通过基因表达芯片被鉴定出来。然而,这些基因集在独立的队列中往往不一致。为了鉴定一致性更高的基因,我们将基因表达和拷贝数变异(CNVs)结合起来。对42对肺腺癌患者进行Affymetrix SNP 6.0和u133plus2.0微阵列检测。鉴定了30%以上的样本中存在拷贝数变化区(拷贝数变化区,CNVR),并选择475个具有一致性变化的差异表达基因进行通路分析。在475个cnv相关基因中,有13个通路显著富集,生存分析表明,这些通路在三个独立的微阵列研究中具有普遍一致和显著的预测概率。因此,整合基因表达和拷贝数可能有助于降低错误发现率和识别用于预测生存结果的基因。
{"title":"Concurrent analysis of copy number variations and expression profiles to identify genes associated with tumorigenesis and survival outcome in lung adenocarcinoma","authors":"T. Lu, L. Lai, C. K. Hsiao, Pei-Chun Chen, M. Tsai, E. Chuang","doi":"10.1109/BIBM.2010.5706578","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706578","url":null,"abstract":"Lung cancer has been one of the major causes of cancer-related death worldwide. To predict survival outcomes of lung cancer patients, many prognosis gene sets were identified by using gene expression microarrays. However, these gene sets were often inconsistent across independent cohorts. To identify genes with more consistency, we combined gene expression and copy number variations (CNVs). Affymetrix SNP 6.0 and u133plus2.0 microarrays were performed on 42 pairs of lung adenocarcinoma patients. The copy number varied regions (CNVR) existed in more than 30% samples were identified and 475 differentially expressed genes with concordant changes were selected for pathway analysis. Thirteen pathways were significantly enriched among the 475 CNV-associated genes, and survival analyses showed these pathways had generally consistent and significant prediction probabilities across three independent microarray studies. Therefore, integration between gene expression and copy number may help to lower false discovery rate and identify genes used to predict survival outcomes.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"44 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128870111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Template-based scoring functions for visualizing biological insights of H-2Kb-peptide-TCR complexes 基于模板的评分功能,用于可视化h - 2kb肽- tcr复合物的生物学见解
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706550
I. Liu, Yu-Shu Lo, Jinn-Moon Yang
Class-I major histocompatibility complex (MHC), peptide, and T-cell receptor (TCR) play an essential role of adaptive immune responses. Many prediction servers are available for identification of peptides that bind to MHC class I molecules. These servers are often lack of detailed interacting residues and binding models for analyzing MHC-peptide-TCR interaction mechanisms. This study numerously enhanced the template-based scoring function derived from protein-protein interactions for identifying MHC-peptide-TCR binding models. The scoring function considers both the template similarity and interacting force to ensure the statistically significant interface similarity between the peptide candidates and structure templates. The result shows that our scoring function is comparative to the public websites for identifying MHC binding peptides. Our model, considering both the MHC-peptide and peptide-TCR interfaces, is able to provide visualization and the biological insights of MHC-peptide-TCR binding models. We believe that our model is useful for the development of peptide-based vaccines.
一类主要组织相容性复合体(MHC)、肽和t细胞受体(TCR)在适应性免疫应答中起着重要作用。许多预测服务器可用于鉴定结合MHC I类分子的肽。这些服务器通常缺乏详细的相互作用残基和结合模型来分析mhc -肽- tcr相互作用机制。这项研究大大增强了基于模板的评分功能,该功能来源于蛋白质-蛋白质相互作用,用于鉴定mhc -肽- tcr结合模型。评分函数同时考虑模板相似性和相互作用力,以确保候选肽与结构模板之间的界面相似性具有统计学意义。结果表明,我们的评分函数与公共网站的MHC结合肽鉴定具有可比性。我们的模型考虑了mhc -肽和肽- tcr界面,能够提供mhc -肽- tcr结合模型的可视化和生物学见解。我们相信我们的模型对肽基疫苗的开发是有用的。
{"title":"Template-based scoring functions for visualizing biological insights of H-2Kb-peptide-TCR complexes","authors":"I. Liu, Yu-Shu Lo, Jinn-Moon Yang","doi":"10.1109/BIBM.2010.5706550","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706550","url":null,"abstract":"Class-I major histocompatibility complex (MHC), peptide, and T-cell receptor (TCR) play an essential role of adaptive immune responses. Many prediction servers are available for identification of peptides that bind to MHC class I molecules. These servers are often lack of detailed interacting residues and binding models for analyzing MHC-peptide-TCR interaction mechanisms. This study numerously enhanced the template-based scoring function derived from protein-protein interactions for identifying MHC-peptide-TCR binding models. The scoring function considers both the template similarity and interacting force to ensure the statistically significant interface similarity between the peptide candidates and structure templates. The result shows that our scoring function is comparative to the public websites for identifying MHC binding peptides. Our model, considering both the MHC-peptide and peptide-TCR interfaces, is able to provide visualization and the biological insights of MHC-peptide-TCR binding models. We believe that our model is useful for the development of peptide-based vaccines.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115235352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accurate prediction of ATP-binding residues using sequence and sequence-derived structural descriptors 使用序列和序列衍生的结构描述子准确预测atp结合残基
Pub Date : 2010-12-01 DOI: 10.1109/BIBM.2010.5706533
Ke Chen, M. Mizianty, Lukasz Kurgan
ATP is a ubiquitous nucleotide that provides energy for cellular activities, catalyzes chemical reactions, and is involved in cellular signaling. The knowledge of the ATP-protein interactions helps with annotation of protein functions and finds applications in drug design. We propose a high-throughput machine learning-based predictor, ATPsite, which identifies ATP-binding residues from protein sequences. Statistical tests show that ATPsite significantly outperforms existing ATPint predictor and other solutions which utilize sequence alignment and residue conservation scoring. The improvements stem from the usage of novel custom-designed input features that are based on the sequence, evolutionary profiles, and the sequence-predicted structural descriptors including secondary structure, solvent accessibility, and dihedral angles. A simple consensus of the ATPsite with the sequence-alignment based predictor is shown to give further improvements.
ATP是一种普遍存在的核苷酸,为细胞活动提供能量,催化化学反应,并参与细胞信号传导。atp -蛋白质相互作用的知识有助于蛋白质功能的注释,并在药物设计中找到应用。我们提出了一个基于机器学习的高通量预测器,ATPsite,它可以从蛋白质序列中识别atp结合残基。统计测试表明,ATPsite显著优于现有的ATPint预测器和其他利用序列比对和残基守恒评分的解决方案。这些改进源于使用了基于序列、进化剖面和序列预测的结构描述符(包括二级结构、溶剂可及性和二面角)的新型定制设计的输入特征。atp位点与基于序列比对的预测器的简单一致性显示出进一步的改进。
{"title":"Accurate prediction of ATP-binding residues using sequence and sequence-derived structural descriptors","authors":"Ke Chen, M. Mizianty, Lukasz Kurgan","doi":"10.1109/BIBM.2010.5706533","DOIUrl":"https://doi.org/10.1109/BIBM.2010.5706533","url":null,"abstract":"ATP is a ubiquitous nucleotide that provides energy for cellular activities, catalyzes chemical reactions, and is involved in cellular signaling. The knowledge of the ATP-protein interactions helps with annotation of protein functions and finds applications in drug design. We propose a high-throughput machine learning-based predictor, ATPsite, which identifies ATP-binding residues from protein sequences. Statistical tests show that ATPsite significantly outperforms existing ATPint predictor and other solutions which utilize sequence alignment and residue conservation scoring. The improvements stem from the usage of novel custom-designed input features that are based on the sequence, evolutionary profiles, and the sequence-predicted structural descriptors including secondary structure, solvent accessibility, and dihedral angles. A simple consensus of the ATPsite with the sequence-alignment based predictor is shown to give further improvements.","PeriodicalId":275098,"journal":{"name":"2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129706115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1