首页 > 最新文献

ISRN bioinformatics最新文献

英文 中文
Hierarchical ensemble methods for protein function prediction. 蛋白质功能预测的分层集合方法。
Pub Date : 2014-05-04 eCollection Date: 2014-01-01 DOI: 10.1155/2014/901419
Giorgio Valentini

Protein function prediction is a complex multiclass multilabel classification problem, characterized by multiple issues such as the incompleteness of the available annotations, the integration of multiple sources of high dimensional biomolecular data, the unbalance of several functional classes, and the difficulty of univocally determining negative examples. Moreover, the hierarchical relationships between functional classes that characterize both the Gene Ontology and FunCat taxonomies motivate the development of hierarchy-aware prediction methods that showed significantly better performances than hierarchical-unaware "flat" prediction methods. In this paper, we provide a comprehensive review of hierarchical methods for protein function prediction based on ensembles of learning machines. According to this general approach, a separate learning machine is trained to learn a specific functional term and then the resulting predictions are assembled in a "consensus" ensemble decision, taking into account the hierarchical relationships between classes. The main hierarchical ensemble methods proposed in the literature are discussed in the context of existing computational methods for protein function prediction, highlighting their characteristics, advantages, and limitations. Open problems of this exciting research area of computational biology are finally considered, outlining novel perspectives for future research.

蛋白质功能预测是一个复杂的多类别多标签分类问题,其特点是存在多种问题,如可用注释的不完整性、高维生物分子数据多种来源的整合、多个功能类别的不平衡以及难以统一确定负面示例。此外,作为基因本体论和 FunCat 分类法的特点,功能类之间的分层关系促使我们开发了分层感知预测方法,其性能明显优于无分层感知的 "平面 "预测方法。在本文中,我们全面回顾了基于学习机器集合的蛋白质功能预测分层方法。根据这种一般方法,先训练一个单独的学习机来学习一个特定的功能项,然后在考虑到类之间的层次关系的情况下,将所得到的预测结果汇总到一个 "共识 "集合决策中。本文结合现有的蛋白质功能预测计算方法,讨论了文献中提出的主要分层集合方法,强调了这些方法的特点、优势和局限性。最后还讨论了计算生物学这一令人兴奋的研究领域中尚未解决的问题,并概述了未来研究的新视角。
{"title":"Hierarchical ensemble methods for protein function prediction.","authors":"Giorgio Valentini","doi":"10.1155/2014/901419","DOIUrl":"10.1155/2014/901419","url":null,"abstract":"<p><p>Protein function prediction is a complex multiclass multilabel classification problem, characterized by multiple issues such as the incompleteness of the available annotations, the integration of multiple sources of high dimensional biomolecular data, the unbalance of several functional classes, and the difficulty of univocally determining negative examples. Moreover, the hierarchical relationships between functional classes that characterize both the Gene Ontology and FunCat taxonomies motivate the development of hierarchy-aware prediction methods that showed significantly better performances than hierarchical-unaware \"flat\" prediction methods. In this paper, we provide a comprehensive review of hierarchical methods for protein function prediction based on ensembles of learning machines. According to this general approach, a separate learning machine is trained to learn a specific functional term and then the resulting predictions are assembled in a \"consensus\" ensemble decision, taking into account the hierarchical relationships between classes. The main hierarchical ensemble methods proposed in the literature are discussed in the context of existing computational methods for protein function prediction, highlighting their characteristics, advantages, and limitations. Open problems of this exciting research area of computational biology are finally considered, outlining novel perspectives for future research. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2014 ","pages":"901419"},"PeriodicalIF":0.0,"publicationDate":"2014-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393075/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of merging and meta-analysis as alternative approaches for integrative gene expression analysis. 合并和荟萃分析作为整合基因表达分析的替代方法的比较。
Pub Date : 2014-01-12 eCollection Date: 2014-01-01 DOI: 10.1155/2014/345106
Jonatan Taminau, Cosmin Lazar, Stijn Meganck, Ann Nowé

An increasing amount of microarray gene expression data sets is available through public repositories. Their huge potential in making new findings is yet to be unlocked by making them available for large-scale analysis. In order to do so it is essential that independent studies designed for similar biological problems can be integrated, so that new insights can be obtained. These insights would remain undiscovered when analyzing the individual data sets because it is well known that the small number of biological samples used per experiment is a bottleneck in genomic analysis. By increasing the number of samples the statistical power is increased and more general and reliable conclusions can be drawn. In this work, two different approaches for conducting large-scale analysis of microarray gene expression data-meta-analysis and data merging-are compared in the context of the identification of cancer-related biomarkers, by analyzing six independent lung cancer studies. Within this study, we investigate the hypothesis that analyzing large cohorts of samples resulting in merging independent data sets designed to study the same biological problem results in lower false discovery rates than analyzing the same data sets within a more conservative meta-analysis approach.

越来越多的微阵列基因表达数据集可以通过公共存储库获得。它们在做出新发现方面的巨大潜力尚未通过将它们用于大规模分析而得到释放。为了做到这一点,有必要将针对类似生物学问题设计的独立研究整合起来,以便获得新的见解。在分析单个数据集时,这些见解仍未被发现,因为众所周知,每个实验使用的少量生物样本是基因组分析的瓶颈。通过增加样本数量,可以提高统计能力,得出更普遍、更可靠的结论。在这项工作中,通过分析六项独立的肺癌研究,在癌症相关生物标志物鉴定的背景下,比较了两种不同的微阵列基因表达大规模分析方法——荟萃分析和数据合并。在本研究中,我们调查了这样一个假设,即分析大型样本队列导致合并设计用于研究相同生物学问题的独立数据集的结果比使用更保守的元分析方法分析相同数据集的错误发现率更低。
{"title":"Comparison of merging and meta-analysis as alternative approaches for integrative gene expression analysis.","authors":"Jonatan Taminau,&nbsp;Cosmin Lazar,&nbsp;Stijn Meganck,&nbsp;Ann Nowé","doi":"10.1155/2014/345106","DOIUrl":"https://doi.org/10.1155/2014/345106","url":null,"abstract":"<p><p>An increasing amount of microarray gene expression data sets is available through public repositories. Their huge potential in making new findings is yet to be unlocked by making them available for large-scale analysis. In order to do so it is essential that independent studies designed for similar biological problems can be integrated, so that new insights can be obtained. These insights would remain undiscovered when analyzing the individual data sets because it is well known that the small number of biological samples used per experiment is a bottleneck in genomic analysis. By increasing the number of samples the statistical power is increased and more general and reliable conclusions can be drawn. In this work, two different approaches for conducting large-scale analysis of microarray gene expression data-meta-analysis and data merging-are compared in the context of the identification of cancer-related biomarkers, by analyzing six independent lung cancer studies. Within this study, we investigate the hypothesis that analyzing large cohorts of samples resulting in merging independent data sets designed to study the same biological problem results in lower false discovery rates than analyzing the same data sets within a more conservative meta-analysis approach. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2014 ","pages":"345106"},"PeriodicalIF":0.0,"publicationDate":"2014-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/345106","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 52
NucVoter: A Voting Algorithm for Reliable Nucleosome Prediction Using Next-Generation Sequencing Data. NucVoter:使用下一代测序数据进行可靠核小体预测的投票算法。
Pub Date : 2013-11-07 eCollection Date: 2013-01-01 DOI: 10.1155/2013/174064
Boseon Byeon

Nucleosomes, which consist of DNA wrapped around histone octamers, are dynamic, and their structure, including their location, size, and occupancy, can be transformed. Nucleosomes can regulate gene expression by controlling the DNA accessibility of proteins. Using next-generation sequencing techniques along with such laboratory methods as micrococcal nuclease digestion, predicting the genomic locations of nucleosomes is possible. However, the true locations of nucleosomes are unknown, and it is difficult to determine their exact locations using next-generation sequencing data. This paper proposes a novel voting algorithm, NucVoter, for the reliable prediction of nucleosome locations. Multiple models verify the consensus areas in which nucleosomes are placed by the model with the highest priority. NucVoter significantly improves the performance of nucleosome prediction.

核小体由包裹在组蛋白八聚体周围的DNA组成,是动态的,它们的结构,包括它们的位置、大小和占用,都是可以改变的。核小体可以通过控制蛋白质的DNA可及性来调节基因表达。利用下一代测序技术以及微球菌核酸酶消化等实验室方法,预测核小体的基因组位置是可能的。然而,核小体的真实位置是未知的,并且很难利用下一代测序数据确定它们的确切位置。本文提出了一种新的投票算法NucVoter,用于可靠地预测核小体的位置。多个模型验证了核小体被模型以最高优先级放置的共识区域。NucVoter显著提高核小体预测的性能。
{"title":"NucVoter: A Voting Algorithm for Reliable Nucleosome Prediction Using Next-Generation Sequencing Data.","authors":"Boseon Byeon","doi":"10.1155/2013/174064","DOIUrl":"https://doi.org/10.1155/2013/174064","url":null,"abstract":"<p><p>Nucleosomes, which consist of DNA wrapped around histone octamers, are dynamic, and their structure, including their location, size, and occupancy, can be transformed. Nucleosomes can regulate gene expression by controlling the DNA accessibility of proteins. Using next-generation sequencing techniques along with such laboratory methods as micrococcal nuclease digestion, predicting the genomic locations of nucleosomes is possible. However, the true locations of nucleosomes are unknown, and it is difficult to determine their exact locations using next-generation sequencing data. This paper proposes a novel voting algorithm, NucVoter, for the reliable prediction of nucleosome locations. Multiple models verify the consensus areas in which nucleosomes are placed by the model with the highest priority. NucVoter significantly improves the performance of nucleosome prediction. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2013 ","pages":"174064"},"PeriodicalIF":0.0,"publicationDate":"2013-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2013/174064","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Discovery of YopE Inhibitors by Pharmacophore-Based Virtual Screening and Docking. 基于药物团的虚拟筛选和对接发现YopE抑制剂。
Pub Date : 2013-10-21 eCollection Date: 2013-01-01 DOI: 10.1155/2013/640518
Gizem Ozbuyukkaya, Elif Ozkirimli Olmez, Kutlu O Ulgen

Gram-negative bacteria Yersinia secrete virulence factors that invade eukaryotic cells via type III secretion system. One particular virulence member, Yersinia outer protein E (YopE), targets Rho family of small GTPases by mimicking regulator GAP protein activity, and its secretion mainly induces cytoskeletal disruption and depolymerization of actin stress fibers within the host cell. In this work, potent drug-like inhibitors of YopE are investigated with virtual screening approaches. More than 500,000 unique small molecules from ZINC database were screened with a five-point pharmacophore, comprising three hydrogen acceptors, one hydrogen donor, and one ring, and derived from different salicylidene acylhydrazides. Binding modes and features of these molecules were investigated with a multistep molecular docking approach using Glide software. Virtual screening hits were further analyzed based on their docking score, chemical similarity, pharmacokinetic properties, and the key Arg144 interaction along with other active site residue interactions with the receptor. As a final outcome, a diverse set of ligands with inhibitory potential were proposed.

革兰氏阴性菌耶尔森氏菌通过III型分泌系统分泌毒力因子侵入真核细胞。一种特殊的毒力成员,耶尔森菌外蛋白E (YopE),通过模拟调节GAP蛋白的活性来靶向Rho家族的小gtpase,其分泌主要诱导宿主细胞内细胞骨架破坏和肌动蛋白应激纤维的解聚。在这项工作中,通过虚拟筛选方法研究了YopE的有效药物样抑制剂。从锌数据库中筛选了50多万个独特的小分子,包括三个氢受体、一个氢供体和一个环,这些小分子来自不同的水杨基酰肼。利用Glide软件对这些分子的结合模式和特征进行了多步分子对接研究。根据对接评分、化学相似性、药代动力学特性以及关键的Arg144与其他活性位点残基与受体的相互作用,进一步分析虚拟筛选命中。作为最后的结果,提出了一系列具有抑制潜力的配体。
{"title":"Discovery of YopE Inhibitors by Pharmacophore-Based Virtual Screening and Docking.","authors":"Gizem Ozbuyukkaya,&nbsp;Elif Ozkirimli Olmez,&nbsp;Kutlu O Ulgen","doi":"10.1155/2013/640518","DOIUrl":"https://doi.org/10.1155/2013/640518","url":null,"abstract":"<p><p>Gram-negative bacteria Yersinia secrete virulence factors that invade eukaryotic cells via type III secretion system. One particular virulence member, Yersinia outer protein E (YopE), targets Rho family of small GTPases by mimicking regulator GAP protein activity, and its secretion mainly induces cytoskeletal disruption and depolymerization of actin stress fibers within the host cell. In this work, potent drug-like inhibitors of YopE are investigated with virtual screening approaches. More than 500,000 unique small molecules from ZINC database were screened with a five-point pharmacophore, comprising three hydrogen acceptors, one hydrogen donor, and one ring, and derived from different salicylidene acylhydrazides. Binding modes and features of these molecules were investigated with a multistep molecular docking approach using Glide software. Virtual screening hits were further analyzed based on their docking score, chemical similarity, pharmacokinetic properties, and the key Arg144 interaction along with other active site residue interactions with the receptor. As a final outcome, a diverse set of ligands with inhibitory potential were proposed. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2013 ","pages":"640518"},"PeriodicalIF":0.0,"publicationDate":"2013-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2013/640518","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Stormbow: A Cloud-Based Tool for Reads Mapping and Expression Quantification in Large-Scale RNA-Seq Studies. Stormbow:一种基于云的工具,用于大规模RNA-Seq研究中的Reads映射和表达定量。
Pub Date : 2013-09-11 eCollection Date: 2013-01-01 DOI: 10.1155/2013/481545
Shanrong Zhao, Kurt Prenger, Lance Smith

RNA-Seq is becoming a promising replacement to microarrays in transcriptome profiling and differential gene expression study. Technical improvements have decreased sequencing costs and, as a result, the size and number of RNA-Seq datasets have increased rapidly. However, the increasing volume of data from large-scale RNA-Seq studies poses a practical challenge for data analysis in a local environment. To meet this challenge, we developed Stormbow, a cloud-based software package, to process large volumes of RNA-Seq data in parallel. The performance of Stormbow has been tested by practically applying it to analyse 178 RNA-Seq samples in the cloud. In our test, it took 6 to 8 hours to process an RNA-Seq sample with 100 million reads, and the average cost was $3.50 per sample. Utilizing Amazon Web Services as the infrastructure for Stormbow allows us to easily scale up to handle large datasets with on-demand computational resources. Stormbow is a scalable, cost effective, and open-source based tool for large-scale RNA-Seq data analysis. Stormbow can be freely downloaded and can be used out of box to process Illumina RNA-Seq datasets.

RNA-Seq正在成为转录组分析和差异基因表达研究中微阵列的有希望的替代品。技术的进步降低了测序成本,因此,RNA-Seq数据集的大小和数量迅速增加。然而,大规模RNA-Seq研究的数据量不断增加,对局部环境下的数据分析提出了实际挑战。为了应对这一挑战,我们开发了基于云的软件包Stormbow,以并行处理大量RNA-Seq数据。Stormbow的性能已经通过实际应用它来分析云中178个RNA-Seq样本进行了测试。在我们的测试中,处理1亿个reads的RNA-Seq样本需要6到8个小时,每个样本的平均成本为3.5美元。利用Amazon Web Services作为Stormbow的基础设施,我们可以通过按需计算资源轻松扩展以处理大型数据集。Stormbow是一个可扩展的、成本有效的、基于开源的大规模RNA-Seq数据分析工具。Stormbow可以免费下载,并且可以开箱即用来处理Illumina RNA-Seq数据集。
{"title":"Stormbow: A Cloud-Based Tool for Reads Mapping and Expression Quantification in Large-Scale RNA-Seq Studies.","authors":"Shanrong Zhao,&nbsp;Kurt Prenger,&nbsp;Lance Smith","doi":"10.1155/2013/481545","DOIUrl":"https://doi.org/10.1155/2013/481545","url":null,"abstract":"<p><p>RNA-Seq is becoming a promising replacement to microarrays in transcriptome profiling and differential gene expression study. Technical improvements have decreased sequencing costs and, as a result, the size and number of RNA-Seq datasets have increased rapidly. However, the increasing volume of data from large-scale RNA-Seq studies poses a practical challenge for data analysis in a local environment. To meet this challenge, we developed Stormbow, a cloud-based software package, to process large volumes of RNA-Seq data in parallel. The performance of Stormbow has been tested by practically applying it to analyse 178 RNA-Seq samples in the cloud. In our test, it took 6 to 8 hours to process an RNA-Seq sample with 100 million reads, and the average cost was $3.50 per sample. Utilizing Amazon Web Services as the infrastructure for Stormbow allows us to easily scale up to handle large datasets with on-demand computational resources. Stormbow is a scalable, cost effective, and open-source based tool for large-scale RNA-Seq data analysis. Stormbow can be freely downloaded and can be used out of box to process Illumina RNA-Seq datasets. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2013 ","pages":"481545"},"PeriodicalIF":0.0,"publicationDate":"2013-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2013/481545","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Modern Computational Techniques for the HMMER Sequence Analysis. hmm序列分析的现代计算技术。
Pub Date : 2013-09-03 eCollection Date: 2013-01-01 DOI: 10.1155/2013/252183
Xiandong Meng, Yanqing Ji

This paper focuses on the latest research and critical reviews on modern computing architectures, software and hardware accelerated algorithms for bioinformatics data analysis with an emphasis on one of the most important sequence analysis applications-hidden Markov models (HMM). We show the detailed performance comparison of sequence analysis tools on various computing platforms recently developed in the bioinformatics society. The characteristics of the sequence analysis, such as data and compute-intensive natures, make it very attractive to optimize and parallelize by using both traditional software approach and innovated hardware acceleration technologies.

本文重点介绍了生物信息学数据分析的现代计算体系结构、软件和硬件加速算法的最新研究和评述,重点介绍了序列分析中最重要的应用之一——隐马尔可夫模型(HMM)。我们展示了在生物信息学社会最近开发的各种计算平台上的序列分析工具的详细性能比较。序列分析的特点,如数据和计算密集型的性质,使得它非常有吸引力的优化和并行利用传统的软件方法和创新的硬件加速技术。
{"title":"Modern Computational Techniques for the HMMER Sequence Analysis.","authors":"Xiandong Meng,&nbsp;Yanqing Ji","doi":"10.1155/2013/252183","DOIUrl":"https://doi.org/10.1155/2013/252183","url":null,"abstract":"<p><p>This paper focuses on the latest research and critical reviews on modern computing architectures, software and hardware accelerated algorithms for bioinformatics data analysis with an emphasis on one of the most important sequence analysis applications-hidden Markov models (HMM). We show the detailed performance comparison of sequence analysis tools on various computing platforms recently developed in the bioinformatics society. The characteristics of the sequence analysis, such as data and compute-intensive natures, make it very attractive to optimize and parallelize by using both traditional software approach and innovated hardware acceleration technologies. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2013 ","pages":"252183"},"PeriodicalIF":0.0,"publicationDate":"2013-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393056/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Construction and Analysis of the Cell Surface's Protein Network for Human Sperm-Egg Interaction. 人精卵相互作用细胞表面蛋白网络的构建与分析。
Pub Date : 2013-08-12 eCollection Date: 2013-01-01 DOI: 10.1155/2013/962760
Soudabeh Sabetian Fard Jahromi, Mohd Shahir Shamsir

Sperm-egg interaction is one of the most impressive processes in sexual reproduction, and understanding the molecular mechanism is crucial in solving problems in infertility and failed in vitro fertilization. The main purpose of this study is to map the sperm-egg interaction network between cell-surface proteins and perform an interaction analysis on this new network. We built the first protein interaction network of human sperm-egg binding and fusion proteins that consists of 84 protein nodes and 112 interactions. The gene ontology analysis identified a number of functional clusters that may be involved in the sperm-egg interaction. These include G-protein coupled receptor protein signaling pathway, cellular membrane fusion, and single fertilization. The PPI network showed a highly interconnected network and identified a set of candidate proteins: ADAM-ZP3, ZP3-CLGN, IZUMO1-CD9, and ADAM2-IZUMO1 that may have an important role in sperm-egg interaction. The result showed that the ADAM2 may mediate interaction between two essential factors CD9 and IZUMO1. The KEGG analysis showed 12 statistically significant pathways with 10 proteins associated with cancer, suggesting a common pathway between tumor fusion and sperm-egg fusion. We believe that the availability of this map will assist future researches in the fertilization mechanism and will also facilitate biological interpretation of sperm-egg interaction.

精卵相互作用是有性生殖过程中最令人印象深刻的过程之一,了解其分子机制对于解决不孕不育和体外受精失败问题至关重要。本研究的主要目的是绘制细胞表面蛋白之间的精卵相互作用网络,并对该网络进行相互作用分析。构建了首个包含84个蛋白节点、112个相互作用的人精卵结合融合蛋白相互作用网络。基因本体论分析确定了一些可能参与精子-卵子相互作用的功能簇。这些途径包括g蛋白偶联受体蛋白信号通路、细胞膜融合和单受精。PPI网络显示出一个高度互联的网络,并鉴定出一组可能在精卵相互作用中起重要作用的候选蛋白:ADAM-ZP3、ZP3-CLGN、IZUMO1-CD9和ADAM2-IZUMO1。结果表明,ADAM2可能介导两个关键因子CD9和IZUMO1之间的相互作用。KEGG分析显示,与癌症相关的10个蛋白中有12个具有统计学意义的通路,提示肿瘤融合和精卵融合之间存在共同的通路。我们相信这张图谱的可用性将有助于未来受精机制的研究,也将有助于对精子-卵子相互作用的生物学解释。
{"title":"Construction and Analysis of the Cell Surface's Protein Network for Human Sperm-Egg Interaction.","authors":"Soudabeh Sabetian Fard Jahromi,&nbsp;Mohd Shahir Shamsir","doi":"10.1155/2013/962760","DOIUrl":"https://doi.org/10.1155/2013/962760","url":null,"abstract":"<p><p>Sperm-egg interaction is one of the most impressive processes in sexual reproduction, and understanding the molecular mechanism is crucial in solving problems in infertility and failed in vitro fertilization. The main purpose of this study is to map the sperm-egg interaction network between cell-surface proteins and perform an interaction analysis on this new network. We built the first protein interaction network of human sperm-egg binding and fusion proteins that consists of 84 protein nodes and 112 interactions. The gene ontology analysis identified a number of functional clusters that may be involved in the sperm-egg interaction. These include G-protein coupled receptor protein signaling pathway, cellular membrane fusion, and single fertilization. The PPI network showed a highly interconnected network and identified a set of candidate proteins: ADAM-ZP3, ZP3-CLGN, IZUMO1-CD9, and ADAM2-IZUMO1 that may have an important role in sperm-egg interaction. The result showed that the ADAM2 may mediate interaction between two essential factors CD9 and IZUMO1. The KEGG analysis showed 12 statistically significant pathways with 10 proteins associated with cancer, suggesting a common pathway between tumor fusion and sperm-egg fusion. We believe that the availability of this map will assist future researches in the fertilization mechanism and will also facilitate biological interpretation of sperm-egg interaction. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2013 ","pages":"962760"},"PeriodicalIF":0.0,"publicationDate":"2013-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2013/962760","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A Computational Approach towards the Understanding of Plasmodium falciparum Multidrug Resistance Protein 1. 恶性疟原虫多药耐药蛋白研究的计算方法
Pub Date : 2013-08-01 eCollection Date: 2013-01-01 DOI: 10.1155/2013/437168
Saumya K Patel, Linz-Buoy George, Sivakumar Prasanth Kumar, Hyacinth N Highland, Yogesh T Jasrai, Himanshu A Pandya, Ketaki R Desai

The emergence of drug resistance in Plasmodium falciparum tremendously affected the chemotherapy worldwide while the intense distribution of chloroquine-resistant strains in most of the endemic areas added more complications in the treatment of malaria. The situation has even worsened by the lack of molecular mechanism to understand the resistance conferred by Plasmodia species. Recent studies have suggested the association of antimalarial resistance with P. falciparum multidrug resistance protein 1 (PfMDR1), an ATP-binding cassette (ABC) transporter and a homologue of human P-glycoprotein 1 (P-gp1). The present study deals about the development of PfMDR1 computational model and the model of substrate transport across PfMDR1 with insights derived from conformations relative to inward- and outward-facing topologies that switch on/off the transportation system. Comparison of ATP docked positions and its structural motif binding properties were found to be similar among other ATPases, and thereby contributes to NBD domains dimerization, a unique structural agreement noticed in Mus musculus Pgp and Escherichia coli MDR transporter homolog (MsbA). The interaction of leading antimalarials and phytochemicals within the active pocket of both wild-type and mutant-type PfMDR1 demonstrated the mode of binding and provided insights of less binding affinity thereby contributing to parasite's resistance mechanism.

恶性疟原虫耐药性的出现极大地影响了世界范围内的化疗,而氯喹耐药菌株在大多数疟疾流行地区的密集分布给疟疾的治疗增加了更多的并发症。由于缺乏分子机制来了解疟原虫物种赋予的耐药性,情况甚至恶化了。最近的研究表明,抗疟药耐药性与恶性疟原虫多药耐药蛋白1 (PfMDR1)、atp结合盒(ABC)转运蛋白和人p -糖蛋白1 (P-gp1)的同源物有关。本研究涉及PfMDR1计算模型的发展和PfMDR1的底物运输模型,并从与打开/关闭运输系统的内向和外向拓扑相关的构象中获得见解。比较其他ATP酶的停靠位置及其结构基基结合特性发现是相似的,从而有助于NBD结构域二聚化,这在小家鼠Pgp和大肠杆菌MDR转运体同源物(MsbA)中发现了独特的结构一致性。野生型和突变型PfMDR1活性口袋内的主要抗疟药物和植物化学物质的相互作用证明了结合模式,并提供了低结合亲和力的见解,从而有助于寄生虫的抗性机制。
{"title":"A Computational Approach towards the Understanding of Plasmodium falciparum Multidrug Resistance Protein 1.","authors":"Saumya K Patel, Linz-Buoy George, Sivakumar Prasanth Kumar, Hyacinth N Highland, Yogesh T Jasrai, Himanshu A Pandya, Ketaki R Desai","doi":"10.1155/2013/437168","DOIUrl":"10.1155/2013/437168","url":null,"abstract":"<p><p>The emergence of drug resistance in Plasmodium falciparum tremendously affected the chemotherapy worldwide while the intense distribution of chloroquine-resistant strains in most of the endemic areas added more complications in the treatment of malaria. The situation has even worsened by the lack of molecular mechanism to understand the resistance conferred by Plasmodia species. Recent studies have suggested the association of antimalarial resistance with P. falciparum multidrug resistance protein 1 (PfMDR1), an ATP-binding cassette (ABC) transporter and a homologue of human P-glycoprotein 1 (P-gp1). The present study deals about the development of PfMDR1 computational model and the model of substrate transport across PfMDR1 with insights derived from conformations relative to inward- and outward-facing topologies that switch on/off the transportation system. Comparison of ATP docked positions and its structural motif binding properties were found to be similar among other ATPases, and thereby contributes to NBD domains dimerization, a unique structural agreement noticed in Mus musculus Pgp and Escherichia coli MDR transporter homolog (MsbA). The interaction of leading antimalarials and phytochemicals within the active pocket of both wild-type and mutant-type PfMDR1 demonstrated the mode of binding and provided insights of less binding affinity thereby contributing to parasite's resistance mechanism. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2013 ","pages":"437168"},"PeriodicalIF":0.0,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393060/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272172","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
SUMOhunt: Combining Spatial Staging between Lysine and SUMO with Random Forests to Predict SUMOylation. SUMOhunt:将赖氨酸和 SUMO 之间的空间分期与随机森林相结合来预测 SUMOylation。
Pub Date : 2013-06-17 eCollection Date: 2013-01-01 DOI: 10.1155/2013/671269
Amna Ijaz

Modification with SUMO protein has many key roles in eukaryotic systems which renders the identification of its target proteins and sites of considerable importance. Information regarding the SUMOylation of a protein may tell us about its subcellular localization, function, and spatial orientation. This modification occurs at particular and not all lysine residues in a given protein. In competition with biochemical means of modified-site recognition, computational methods are strong contenders in the prediction of SUMOylation-undergoing sites on proteins. In this research, physicochemical properties of amino acids retrieved from AAIndex, especially those involved in docking of modifier and target proteins and optimal presentation of target lysine, in combination with sequence information and random forest-based classifier presented in WEKA have been used to develop a prediction model, SUMOhunt, with statistics significantly better than all previous predictors. In this model 97.56% accuracy, 100% sensitivity, 94% specificity, and 0.95 MCC have been achieved which shows that proposed amino acid properties have a significant role in SUMO attachment. SUMOhunt will hence bring great reliability and efficiency in SUMOylation prediction.

SUMO 蛋白修饰在真核生物系统中发挥着许多关键作用,因此识别其靶蛋白和靶点相当重要。有关蛋白质 SUMO 基化的信息可以让我们了解其亚细胞定位、功能和空间定向。这种修饰发生在特定蛋白质的特定而非全部赖氨酸残基上。在与生化修饰位点识别方法的竞争中,计算方法是预测蛋白质 SUMO 乙基化发生位点的有力竞争者。在这项研究中,我们利用从 AAIndex 中检索到的氨基酸理化特性,特别是涉及修饰蛋白和目标蛋白对接以及目标赖氨酸最佳呈现的理化特性,结合序列信息和 WEKA 中基于随机森林的分类器,开发了一个预测模型 SUMOhunt,其统计结果明显优于之前的所有预测器。该模型达到了 97.56% 的准确率、100% 的灵敏度、94% 的特异性和 0.95 的 MCC,这表明所提出的氨基酸特性在 SUMO 附着中具有重要作用。因此,SUMOhunt 将为 SUMOylation 预测带来极大的可靠性和效率。
{"title":"SUMOhunt: Combining Spatial Staging between Lysine and SUMO with Random Forests to Predict SUMOylation.","authors":"Amna Ijaz","doi":"10.1155/2013/671269","DOIUrl":"10.1155/2013/671269","url":null,"abstract":"<p><p>Modification with SUMO protein has many key roles in eukaryotic systems which renders the identification of its target proteins and sites of considerable importance. Information regarding the SUMOylation of a protein may tell us about its subcellular localization, function, and spatial orientation. This modification occurs at particular and not all lysine residues in a given protein. In competition with biochemical means of modified-site recognition, computational methods are strong contenders in the prediction of SUMOylation-undergoing sites on proteins. In this research, physicochemical properties of amino acids retrieved from AAIndex, especially those involved in docking of modifier and target proteins and optimal presentation of target lysine, in combination with sequence information and random forest-based classifier presented in WEKA have been used to develop a prediction model, SUMOhunt, with statistics significantly better than all previous predictors. In this model 97.56% accuracy, 100% sensitivity, 94% specificity, and 0.95 MCC have been achieved which shows that proposed amino acid properties have a significant role in SUMO attachment. SUMOhunt will hence bring great reliability and efficiency in SUMOylation prediction. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2013 ","pages":"671269"},"PeriodicalIF":0.0,"publicationDate":"2013-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4393069/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploiting identifiability and intergene correlation for improved detection of differential expression. 利用可识别性和基因间相关性来改进差异表达的检测。
Pub Date : 2013-06-03 eCollection Date: 2013-01-01 DOI: 10.1155/2013/404717
J R Deller, Hayder Radha, J Justin McCormick

Accurate differential analysis of microarray data strongly depends on effective treatment of intergene correlation. Such dependence is ordinarily accounted for in terms of its effect on significance cutoffs. In this paper, it is shown that correlation can, in fact, be exploited to share information across tests and reorder expression differentials for increased statistical power, regardless of the threshold. Significantly improved differential analysis is the result of two simple measures: (i) adjusting test statistics to exploit information from identifiable genes (the large subset of genes represented on a microarray that can be classified a priori as nondifferential with very high confidence], but (ii) doing so in a way that accounts for linear dependencies among identifiable and nonidentifiable genes. A method is developed that builds upon the widely used two-sample t-statistic approach and uses analysis in Hilbert space to decompose the nonidentified gene vector into two components that are correlated and uncorrelated with the identified set. In the application to data derived from a widely studied prostate cancer database, the proposed method outperforms some of the most highly regarded approaches published to date. Algorithms in MATLAB and in R are available for public download.

微阵列数据的准确差异分析在很大程度上取决于对基因间相关性的有效处理。这种依赖性通常是根据其对显著性截止点的影响来解释的。在本文中,它表明,相关性可以,事实上,被利用来共享信息跨测试和重新排序表达差异增加统计能力,而不考虑阈值。显著改进的差异分析是两个简单措施的结果:(i)调整测试统计以利用来自可识别基因的信息(在微阵列上表示的基因的大子集可以以非常高的置信度先验地分类为非差异),但(ii)以一种解释可识别和不可识别基因之间线性依赖关系的方式这样做。开发了一种方法,该方法建立在广泛使用的双样本t统计方法的基础上,并使用希尔伯特空间分析将未识别的基因载体分解为与识别集相关和不相关的两个组件。在应用于从广泛研究的前列腺癌数据库中获得的数据时,提出的方法优于迄今为止发表的一些最受推崇的方法。MATLAB和R中的算法可供公开下载。
{"title":"Exploiting identifiability and intergene correlation for improved detection of differential expression.","authors":"J R Deller,&nbsp;Hayder Radha,&nbsp;J Justin McCormick","doi":"10.1155/2013/404717","DOIUrl":"https://doi.org/10.1155/2013/404717","url":null,"abstract":"<p><p>Accurate differential analysis of microarray data strongly depends on effective treatment of intergene correlation. Such dependence is ordinarily accounted for in terms of its effect on significance cutoffs. In this paper, it is shown that correlation can, in fact, be exploited to share information across tests and reorder expression differentials for increased statistical power, regardless of the threshold. Significantly improved differential analysis is the result of two simple measures: (i) adjusting test statistics to exploit information from identifiable genes (the large subset of genes represented on a microarray that can be classified a priori as nondifferential with very high confidence], but (ii) doing so in a way that accounts for linear dependencies among identifiable and nonidentifiable genes. A method is developed that builds upon the widely used two-sample t-statistic approach and uses analysis in Hilbert space to decompose the nonidentified gene vector into two components that are correlated and uncorrelated with the identified set. In the application to data derived from a widely studied prostate cancer database, the proposed method outperforms some of the most highly regarded approaches published to date. Algorithms in MATLAB and in R are available for public download. </p>","PeriodicalId":90877,"journal":{"name":"ISRN bioinformatics","volume":"2013 ","pages":"404717"},"PeriodicalIF":0.0,"publicationDate":"2013-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2013/404717","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33272171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
ISRN bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1