首页 > 最新文献

Advances in Bioinformatics最新文献

英文 中文
How good are simplified models for protein structure prediction? 简化模型对蛋白质结构预测有多好?
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-04-29 DOI: 10.1155/2014/867179
Swakkhar Shatabda, M A Hakim Newton, Mahmood A Rashid, Duc Nghia Pham, Abdul Sattar

Protein structure prediction (PSP) has been one of the most challenging problems in computational biology for several decades. The challenge is largely due to the complexity of the all-atomic details and the unknown nature of the energy function. Researchers have therefore used simplified energy models that consider interaction potentials only between the amino acid monomers in contact on discrete lattices. The restricted nature of the lattices and the energy models poses a twofold concern regarding the assessment of the models. Can a native or a very close structure be obtained when structures are mapped to lattices? Can the contact based energy models on discrete lattices guide the search towards the native structures? In this paper, we use the protein chain lattice fitting (PCLF) problem to address the first concern; we developed a constraint-based local search algorithm for the PCLF problem for cubic and face-centered cubic lattices and found very close lattice fits for the native structures. For the second concern, we use a number of techniques to sample the conformation space and find correlations between energy functions and root mean square deviation (RMSD) distance of the lattice-based structures with the native structures. Our analysis reveals weakness of several contact based energy models used that are popular in PSP.

几十年来,蛋白质结构预测一直是计算生物学中最具挑战性的问题之一。挑战主要是由于全原子细节的复杂性和能量函数的未知性质。因此,研究人员使用简化的能量模型,只考虑离散晶格上接触的氨基酸单体之间的相互作用势。晶格和能量模型的有限性给模型的评估带来了双重问题。当结构映射到晶格时,能得到一个原生结构或一个非常接近的结构吗?离散晶格上基于接触的能量模型能指导对原生结构的搜索吗?在本文中,我们使用蛋白质链晶格拟合(PCLF)问题来解决第一个问题;我们开发了一种基于约束的局部搜索算法来解决立方和面心立方晶格的PCLF问题,并找到了非常接近的晶格拟合的本地结构。对于第二个问题,我们使用了许多技术来对构象空间进行采样,并找到基于晶格的结构与天然结构的能量函数和均方根偏差(RMSD)距离之间的相关性。我们的分析揭示了在PSP中流行的几种基于接触的能量模型的弱点。
{"title":"How good are simplified models for protein structure prediction?","authors":"Swakkhar Shatabda,&nbsp;M A Hakim Newton,&nbsp;Mahmood A Rashid,&nbsp;Duc Nghia Pham,&nbsp;Abdul Sattar","doi":"10.1155/2014/867179","DOIUrl":"https://doi.org/10.1155/2014/867179","url":null,"abstract":"<p><p>Protein structure prediction (PSP) has been one of the most challenging problems in computational biology for several decades. The challenge is largely due to the complexity of the all-atomic details and the unknown nature of the energy function. Researchers have therefore used simplified energy models that consider interaction potentials only between the amino acid monomers in contact on discrete lattices. The restricted nature of the lattices and the energy models poses a twofold concern regarding the assessment of the models. Can a native or a very close structure be obtained when structures are mapped to lattices? Can the contact based energy models on discrete lattices guide the search towards the native structures? In this paper, we use the protein chain lattice fitting (PCLF) problem to address the first concern; we developed a constraint-based local search algorithm for the PCLF problem for cubic and face-centered cubic lattices and found very close lattice fits for the native structures. For the second concern, we use a number of techniques to sample the conformation space and find correlations between energy functions and root mean square deviation (RMSD) distance of the lattice-based structures with the native structures. Our analysis reveals weakness of several contact based energy models used that are popular in PSP. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"867179"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/867179","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32380993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Objective and comprehensive evaluation of bisulfite short read mapping tools. 亚硫酸氢盐短读测工具的客观综合评价。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-04-15 DOI: 10.1155/2014/472045
Hong Tran, Jacob Porter, Ming-An Sun, Hehuang Xie, Liqing Zhang

Background. Large-scale bisulfite treatment and short reads sequencing technology allow comprehensive estimation of methylation states of Cs in the genomes of different tissues, cell types, and developmental stages. Accurate characterization of DNA methylation is essential for understanding genotype phenotype association, gene and environment interaction, diseases, and cancer. Aligning bisulfite short reads to a reference genome has been a challenging task. We compared five bisulfite short read mapping tools, BSMAP, Bismark, BS-Seeker, BiSS, and BRAT-BW, representing two classes of mapping algorithms (hash table and suffix/prefix tries). We examined their mapping efficiency (i.e., the percentage of reads that can be mapped to the genomes), usability, running time, and effects of changing default parameter settings using both real and simulated reads. We also investigated how preprocessing data might affect mapping efficiency. Conclusion. Among the five programs compared, in terms of mapping efficiency, Bismark performs the best on the real data, followed by BiSS, BSMAP, and finally BRAT-BW and BS-Seeker with very similar performance. If CPU time is not a constraint, Bismark is a good choice of program for mapping bisulfite treated short reads. Data quality impacts a great deal mapping efficiency. Although increasing the number of mismatches allowed can increase mapping efficiency, it not only significantly slows down the program, but also runs the risk of having increased false positives. Therefore, users should carefully set the related parameters depending on the quality of their sequencing data.

背景。大规模亚硫酸氢盐处理和短reads测序技术可以全面估计不同组织、细胞类型和发育阶段基因组中Cs的甲基化状态。DNA甲基化的准确表征对于理解基因型表型关联、基因与环境相互作用、疾病和癌症至关重要。亚硫酸氢盐短序列与参考基因组的比对一直是一项具有挑战性的任务。我们比较了五种亚硫酸盐短读映射工具,BSMAP, Bismark, BS-Seeker, bis和BRAT-BW,代表了两类映射算法(哈希表和后缀/前缀尝试)。我们检查了它们的映射效率(即,可以映射到基因组的读取的百分比)、可用性、运行时间,以及使用真实和模拟读取更改默认参数设置的效果。我们还研究了预处理数据如何影响映射效率。结论。在比较的五个程序中,在真实数据的映射效率方面,Bismark表现最好,其次是bis、BSMAP,最后是BRAT-BW和BS-Seeker,两者的性能非常接近。如果CPU时间不受限制,Bismark是一个很好的选择,用于绘制亚硫酸氢盐处理的短读取。数据质量对映射效率有很大影响。尽管增加允许的不匹配数量可以提高映射效率,但它不仅会显著降低程序的速度,而且还会增加误报的风险。因此,用户应根据其测序数据的质量仔细设置相关参数。
{"title":"Objective and comprehensive evaluation of bisulfite short read mapping tools.","authors":"Hong Tran,&nbsp;Jacob Porter,&nbsp;Ming-An Sun,&nbsp;Hehuang Xie,&nbsp;Liqing Zhang","doi":"10.1155/2014/472045","DOIUrl":"https://doi.org/10.1155/2014/472045","url":null,"abstract":"<p><p>Background. Large-scale bisulfite treatment and short reads sequencing technology allow comprehensive estimation of methylation states of Cs in the genomes of different tissues, cell types, and developmental stages. Accurate characterization of DNA methylation is essential for understanding genotype phenotype association, gene and environment interaction, diseases, and cancer. Aligning bisulfite short reads to a reference genome has been a challenging task. We compared five bisulfite short read mapping tools, BSMAP, Bismark, BS-Seeker, BiSS, and BRAT-BW, representing two classes of mapping algorithms (hash table and suffix/prefix tries). We examined their mapping efficiency (i.e., the percentage of reads that can be mapped to the genomes), usability, running time, and effects of changing default parameter settings using both real and simulated reads. We also investigated how preprocessing data might affect mapping efficiency. Conclusion. Among the five programs compared, in terms of mapping efficiency, Bismark performs the best on the real data, followed by BiSS, BSMAP, and finally BRAT-BW and BS-Seeker with very similar performance. If CPU time is not a constraint, Bismark is a good choice of program for mapping bisulfite treated short reads. Data quality impacts a great deal mapping efficiency. Although increasing the number of mismatches allowed can increase mapping efficiency, it not only significantly slows down the program, but also runs the risk of having increased false positives. Therefore, users should carefully set the related parameters depending on the quality of their sequencing data. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"472045"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/472045","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32350526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 47
Prediction of B-Cell Epitopes in Listeriolysin O, a Cholesterol Dependent Cytolysin Secreted by Listeria monocytogenes. 李斯特菌分泌的胆固醇依赖性细胞溶解素O的b细胞表位预测。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-01-02 DOI: 10.1155/2014/871676
Morris S Jones, J Mark Carter

Listeria monocytogenes is a gram-positive, foodborne bacterium responsible for disease in humans and animals. Listeriolysin O (LLO) is a required virulence factor for the pathogenic effects of L. monocytogenes. Bioinformatics revealed conserved putative epitopes of LLO that could be used to develop monoclonal antibodies against LLO. Continuous and discontinuous epitopes were located by using four different B-cell prediction algorithms. Three-dimensional molecular models were generated to more precisely characterize the predicted antigenicity of LLO. Domain 4 was predicted to contain five of eleven continuous epitopes. A large portion of domain 4 was also predicted to comprise discontinuous immunogenic epitopes. Domain 4 of LLO may serve as an immunogen for eliciting monoclonal antibodies that can be used to study the pathogenesis of L. monocytogenes as well as develop an inexpensive assay.

单核细胞增生李斯特菌是一种革兰氏阳性食源性细菌,可引起人类和动物疾病。李斯特菌溶素O (LLO)是单核细胞增生乳杆菌致病所必需的毒力因子。生物信息学揭示了LLO保守的推定表位,可用于开发抗LLO的单克隆抗体。使用四种不同的b细胞预测算法定位连续和不连续的表位。生成三维分子模型以更精确地表征预测的LLO抗原性。预测结构域4包含11个连续表位中的5个。很大一部分结构域4也被预测包含不连续的免疫原性表位。LLO结构域4可以作为一种免疫原,用于诱导单克隆抗体,用于研究单核增生乳杆菌的发病机制,并开发一种廉价的检测方法。
{"title":"Prediction of B-Cell Epitopes in Listeriolysin O, a Cholesterol Dependent Cytolysin Secreted by Listeria monocytogenes.","authors":"Morris S Jones,&nbsp;J Mark Carter","doi":"10.1155/2014/871676","DOIUrl":"https://doi.org/10.1155/2014/871676","url":null,"abstract":"<p><p>Listeria monocytogenes is a gram-positive, foodborne bacterium responsible for disease in humans and animals. Listeriolysin O (LLO) is a required virulence factor for the pathogenic effects of L. monocytogenes. Bioinformatics revealed conserved putative epitopes of LLO that could be used to develop monoclonal antibodies against LLO. Continuous and discontinuous epitopes were located by using four different B-cell prediction algorithms. Three-dimensional molecular models were generated to more precisely characterize the predicted antigenicity of LLO. Domain 4 was predicted to contain five of eleven continuous epitopes. A large portion of domain 4 was also predicted to comprise discontinuous immunogenic epitopes. Domain 4 of LLO may serve as an immunogen for eliciting monoclonal antibodies that can be used to study the pathogenesis of L. monocytogenes as well as develop an inexpensive assay. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"871676"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/871676","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32112617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Artificial neural network application in the diagnosis of disease conditions with liver ultrasound images. 人工神经网络在肝脏超声图像疾病诊断中的应用。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-09-16 DOI: 10.1155/2014/708279
Karthik Kalyan, Binal Jakhia, Ramachandra Dattatraya Lele, Mukund Joshi, Abhay Chowdhary

The preliminary study presented within this paper shows a comparative study of various texture features extracted from liver ultrasonic images by employing Multilayer Perceptron (MLP), a type of artificial neural network, to study the presence of disease conditions. An ultrasound (US) image shows echo-texture patterns, which defines the organ characteristics. Ultrasound images of liver disease conditions such as "fatty liver," "cirrhosis," and "hepatomegaly" produce distinctive echo patterns. However, various ultrasound imaging artifacts and speckle noise make these echo-texture patterns difficult to identify and often hard to distinguish visually. Here, based on the extracted features from the ultrasonic images, we employed an artificial neural network for the diagnosis of disease conditions in liver and finding of the best classifier that distinguishes between abnormal and normal conditions of the liver. Comparison of the overall performance of all the feature classifiers concluded that "mixed feature set" is the best feature set. It showed an excellent rate of accuracy for the training data set. The gray level run length matrix (GLRLM) feature shows better results when the network was tested against unknown data.

本文的初步研究是利用一种人工神经网络——多层感知器(Multilayer Perceptron, MLP)对肝脏超声图像中提取的各种纹理特征进行对比研究,以研究疾病状况的存在。超声(US)图像显示回声纹理模式,确定器官特征。肝脏疾病如“脂肪肝”、“肝硬化”和“肝肿大”的超声图像产生独特的回声模式。然而,各种超声成像伪影和斑点噪声使这些回声纹理模式难以识别,通常难以从视觉上区分。在这里,基于从超声图像中提取的特征,我们使用人工神经网络来诊断肝脏的疾病状况,并找到区分肝脏异常和正常状况的最佳分类器。比较所有特征分类器的总体性能得出“混合特征集”是最好的特征集。它显示了训练数据集的极好准确率。灰度运行长度矩阵(GLRLM)特征在对未知数据进行测试时显示出较好的效果。
{"title":"Artificial neural network application in the diagnosis of disease conditions with liver ultrasound images.","authors":"Karthik Kalyan,&nbsp;Binal Jakhia,&nbsp;Ramachandra Dattatraya Lele,&nbsp;Mukund Joshi,&nbsp;Abhay Chowdhary","doi":"10.1155/2014/708279","DOIUrl":"https://doi.org/10.1155/2014/708279","url":null,"abstract":"<p><p>The preliminary study presented within this paper shows a comparative study of various texture features extracted from liver ultrasonic images by employing Multilayer Perceptron (MLP), a type of artificial neural network, to study the presence of disease conditions. An ultrasound (US) image shows echo-texture patterns, which defines the organ characteristics. Ultrasound images of liver disease conditions such as \"fatty liver,\" \"cirrhosis,\" and \"hepatomegaly\" produce distinctive echo patterns. However, various ultrasound imaging artifacts and speckle noise make these echo-texture patterns difficult to identify and often hard to distinguish visually. Here, based on the extracted features from the ultrasonic images, we employed an artificial neural network for the diagnosis of disease conditions in liver and finding of the best classifier that distinguishes between abnormal and normal conditions of the liver. Comparison of the overall performance of all the feature classifiers concluded that \"mixed feature set\" is the best feature set. It showed an excellent rate of accuracy for the training data set. The gray level run length matrix (GLRLM) feature shows better results when the network was tested against unknown data. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"708279"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/708279","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32708360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 88
Ligand Based Pharmacophore Modeling and Virtual Screening Studies to Design Novel HDAC2 Inhibitors. 基于配体的药效团建模和虚拟筛选研究设计新型HDAC2抑制剂。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-11-26 DOI: 10.1155/2014/812148
Naresh Kandakatla, Geetha Ramakrishnan

Histone deacetylases 2 (HDAC2), Class I histone deacetylase (HDAC) family, emerged as an important therapeutic target for the treatment of various cancers. A total of 48 inhibitors of two different chemotypes were used to generate pharmacophore model using 3D QSAR pharmacophore generation (HypoGen algorithm) module in Discovery Studio. The best HypoGen model consists of four pharmacophore features namely, one hydrogen bond acceptor (HBA), and one hydrogen donor (HBD), one hydrophobic (HYP) and one aromatic centres, (RA). This model was validated against 20 test set compounds and this model was utilized as a 3D query for virtual screening to validate against NCI and Maybridge database and the hits further screened by Lipinski's rule of 5, and a total of 382 hit compounds from NCI and 243 hit compounds from Maybridge were found and were subjected to molecular docking in the active site of HDAC2 (PDB: 3MAX). Finally eight hit compounds, NSC108392, NSC127064, NSC110782, and NSC748337 from NCI database and MFCD01935795, MFCD00830779, MFCD00661790, and MFCD00124221 from Maybridge database, were considered as novel potential HDAC2 inhibitors.

组蛋白去乙酰化酶2 (HDAC2)是一类组蛋白去乙酰化酶(HDAC)家族,是治疗多种癌症的重要靶点。使用Discovery Studio中的3D QSAR药效团生成(HypoGen算法)模块,共选取两种不同化学型的48种抑制剂生成药效团模型。最佳的HypoGen模型由四个药效团特征组成,即一个氢键受体(HBA)、一个氢供体(HBD)、一个疏水中心(HYP)和一个芳香中心(RA)。利用该模型对20个测试集化合物进行验证,并利用该模型作为3D查询进行虚拟筛选,对NCI和Maybridge数据库进行验证,并根据Lipinski的5法则进一步筛选命中,共发现NCI命中化合物382个,Maybridge命中化合物243个,并在HDAC2活性位点进行分子对接(PDB: 3MAX)。最终,NCI数据库中的NSC108392、NSC127064、NSC110782和NSC748337以及Maybridge数据库中的MFCD01935795、MFCD00830779、MFCD00661790和MFCD00124221被认为是新的潜在HDAC2抑制剂。
{"title":"Ligand Based Pharmacophore Modeling and Virtual Screening Studies to Design Novel HDAC2 Inhibitors.","authors":"Naresh Kandakatla,&nbsp;Geetha Ramakrishnan","doi":"10.1155/2014/812148","DOIUrl":"https://doi.org/10.1155/2014/812148","url":null,"abstract":"<p><p>Histone deacetylases 2 (HDAC2), Class I histone deacetylase (HDAC) family, emerged as an important therapeutic target for the treatment of various cancers. A total of 48 inhibitors of two different chemotypes were used to generate pharmacophore model using 3D QSAR pharmacophore generation (HypoGen algorithm) module in Discovery Studio. The best HypoGen model consists of four pharmacophore features namely, one hydrogen bond acceptor (HBA), and one hydrogen donor (HBD), one hydrophobic (HYP) and one aromatic centres, (RA). This model was validated against 20 test set compounds and this model was utilized as a 3D query for virtual screening to validate against NCI and Maybridge database and the hits further screened by Lipinski's rule of 5, and a total of 382 hit compounds from NCI and 243 hit compounds from Maybridge were found and were subjected to molecular docking in the active site of HDAC2 (PDB: 3MAX). Finally eight hit compounds, NSC108392, NSC127064, NSC110782, and NSC748337 from NCI database and MFCD01935795, MFCD00830779, MFCD00661790, and MFCD00124221 from Maybridge database, were considered as novel potential HDAC2 inhibitors. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"812148"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/812148","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32923140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 67
A framework for prediction of response to HCV therapy using different data mining techniques. 使用不同数据挖掘技术预测HCV治疗反应的框架。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-12-11 DOI: 10.1155/2014/181056
Enas M F El Houby

Hepatitis C which is a widely spread disease all over the world is a fatal liver disease caused by Hepatitis C Virus (HCV). The only approved therapy is interferon plus ribavirin. The number of responders to this treatment is low, while its cost is high and side effects are undesirable. Treatment response prediction will help in reducing the patients who suffer from the side effects and high costs without achieving recovery. The aim of this research is to develop a framework which can select the best model to predict HCV patients' response to the treatment of HCV from clinical information. The framework contains three phases which are preprocessing phase to prepare the data for applying Data Mining (DM) techniques, DM phase to apply different DM techniques, and evaluation phase to evaluate and compare the performance of the built models and select the best model as the recommended one. Different DM techniques had been applied which are associative classification, artificial neural network, and decision tree to evaluate the framework. The experimental results showed the effectiveness of the framework in selecting the best model which is the model built by associative classification using histology activity index, fibrosis stage, and alanine amino transferase.

丙型肝炎是由丙型肝炎病毒(HCV)引起的一种致命性肝脏疾病,在世界范围内广泛传播。唯一被批准的治疗方法是干扰素加利巴韦林。对这种治疗有反应的人数很少,而费用高,副作用也不理想。治疗反应预测将有助于减少患者遭受的副作用和高费用而无法实现康复。本研究的目的是建立一个框架,可以从临床信息中选择最佳模型来预测HCV患者对HCV治疗的反应。该框架包含三个阶段:预处理阶段,为应用数据挖掘技术准备数据;数据挖掘阶段,应用不同的数据挖掘技术;评估阶段,评估和比较所构建模型的性能,选择最佳模型作为推荐模型。采用了关联分类、人工神经网络和决策树等不同的决策分析技术对框架进行评价。实验结果表明,该框架能够有效地选择出基于组织活性指数、纤维化分期和丙氨酸氨基转移酶的关联分类模型。
{"title":"A framework for prediction of response to HCV therapy using different data mining techniques.","authors":"Enas M F El Houby","doi":"10.1155/2014/181056","DOIUrl":"https://doi.org/10.1155/2014/181056","url":null,"abstract":"<p><p>Hepatitis C which is a widely spread disease all over the world is a fatal liver disease caused by Hepatitis C Virus (HCV). The only approved therapy is interferon plus ribavirin. The number of responders to this treatment is low, while its cost is high and side effects are undesirable. Treatment response prediction will help in reducing the patients who suffer from the side effects and high costs without achieving recovery. The aim of this research is to develop a framework which can select the best model to predict HCV patients' response to the treatment of HCV from clinical information. The framework contains three phases which are preprocessing phase to prepare the data for applying Data Mining (DM) techniques, DM phase to apply different DM techniques, and evaluation phase to evaluate and compare the performance of the built models and select the best model as the recommended one. Different DM techniques had been applied which are associative classification, artificial neural network, and decision tree to evaluate the framework. The experimental results showed the effectiveness of the framework in selecting the best model which is the model built by associative classification using histology activity index, fibrosis stage, and alanine amino transferase. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"181056"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/181056","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32967370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Alternate Phosphorylation/O-GlcNAc Modification on Human Insulin IRSs: A Road towards Impaired Insulin Signaling in Alzheimer and Diabetes. 人胰岛素IRSs的交替磷酸化/O-GlcNAc修饰:阿尔茨海默病和糖尿病中胰岛素信号受损的途径
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-12-17 DOI: 10.1155/2014/324753
Zainab Jahangir, Waqar Ahmad, Khadija Shabbiri

Impaired insulin signaling has been thought of as important step in both Alzheimer's disease (AD) and type 2 diabetes mellitus (T2DM). Posttranslational modifications (PTMs) regulate functions and interaction of insulin with insulin receptors substrates (IRSs) and activate insulin signaling downstream pathways via autophosphorylation on several tyrosine (TYR) residues on IRSs. Two important insulin receptor substrates 1 and 2 are widely expressed in human, and alternative phosphorylation on their serine (Ser) and threonine (Thr) residues has been known to block the Tyr phosphorylation of IRSs, thus inhibiting insulin signaling and promoting insulin resistance. Like phosphorylation, O-glycosylation modification is important PTM and inhibits phosphorylation on same or neighboring Ser/Thr residues, often called Yin Yang sites. Both IRS-1 and IRS-2 have been shown to be O-glycosylated; however exact sites are not determined yet. In this study, by using neuronal network based prediction methods, we found more than 50 Ser/Thr residues that have potential to be O-glycosylated and may act as possible sites as well. Moreover, alternative phosphorylation and O-glycosylation on IRS-1 Ser-312, 984, 1037, and 1101 may act as possible therapeutic targets to minimize the risk of AD and T2DM.

胰岛素信号受损被认为是阿尔茨海默病(AD)和2型糖尿病(T2DM)的重要步骤。翻译后修饰(PTMs)调节胰岛素与胰岛素受体底物(IRSs)的功能和相互作用,并通过IRSs上几个酪氨酸(TYR)残基的自磷酸化激活胰岛素信号传导下游途径。两种重要的胰岛素受体底物1和2在人体中广泛表达,已知其丝氨酸(Ser)和苏氨酸(Thr)残基的选择性磷酸化可阻断irs的Tyr磷酸化,从而抑制胰岛素信号传导并促进胰岛素抵抗。与磷酸化一样,o -糖基化修饰是重要的PTM,可抑制相同或邻近的丝氨酸/苏氨酸残基(通常称为阴阳位点)上的磷酸化。IRS-1和IRS-2都被证明是o糖基化的;然而,具体地点尚未确定。在这项研究中,我们利用基于神经网络的预测方法,发现了超过50个丝氨酸/苏氨酸残基有可能被o糖基化,并且可能作为可能的位点。此外,IRS-1 Ser-312、984、1037和1101的替代磷酸化和o -糖基化可能是降低AD和T2DM风险的可能治疗靶点。
{"title":"Alternate Phosphorylation/O-GlcNAc Modification on Human Insulin IRSs: A Road towards Impaired Insulin Signaling in Alzheimer and Diabetes.","authors":"Zainab Jahangir,&nbsp;Waqar Ahmad,&nbsp;Khadija Shabbiri","doi":"10.1155/2014/324753","DOIUrl":"https://doi.org/10.1155/2014/324753","url":null,"abstract":"<p><p>Impaired insulin signaling has been thought of as important step in both Alzheimer's disease (AD) and type 2 diabetes mellitus (T2DM). Posttranslational modifications (PTMs) regulate functions and interaction of insulin with insulin receptors substrates (IRSs) and activate insulin signaling downstream pathways via autophosphorylation on several tyrosine (TYR) residues on IRSs. Two important insulin receptor substrates 1 and 2 are widely expressed in human, and alternative phosphorylation on their serine (Ser) and threonine (Thr) residues has been known to block the Tyr phosphorylation of IRSs, thus inhibiting insulin signaling and promoting insulin resistance. Like phosphorylation, O-glycosylation modification is important PTM and inhibits phosphorylation on same or neighboring Ser/Thr residues, often called Yin Yang sites. Both IRS-1 and IRS-2 have been shown to be O-glycosylated; however exact sites are not determined yet. In this study, by using neuronal network based prediction methods, we found more than 50 Ser/Thr residues that have potential to be O-glycosylated and may act as possible sites as well. Moreover, alternative phosphorylation and O-glycosylation on IRS-1 Ser-312, 984, 1037, and 1101 may act as possible therapeutic targets to minimize the risk of AD and T2DM. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"324753"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/324753","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32967371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
A Parallel Framework for Multipoint Spiral Search in ab Initio Protein Structure Prediction. 从头算蛋白质结构预测中多点螺旋搜索的并行框架。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-03-16 DOI: 10.1155/2014/985968
Mahmood A Rashid, Swakkhar Shatabda, M A Hakim Newton, Md Tamjidul Hoque, Abdul Sattar

Protein structure prediction is computationally a very challenging problem. A large number of existing search algorithms attempt to solve the problem by exploring possible structures and finding the one with the minimum free energy. However, these algorithms perform poorly on large sized proteins due to an astronomically wide search space. In this paper, we present a multipoint spiral search framework that uses parallel processing techniques to expedite exploration by starting from different points. In our approach, a set of random initial solutions are generated and distributed to different threads. We allow each thread to run for a predefined period of time. The improved solutions are stored threadwise. When the threads finish, the solutions are merged together and the duplicates are removed. A selected distinct set of solutions are then split to different threads again. In our ab initio protein structure prediction method, we use the three-dimensional face-centred-cubic lattice for structure-backbone mapping. We use both the low resolution hydrophobic-polar energy model and the high-resolution 20 × 20 energy model for search guiding. The experimental results show that our new parallel framework significantly improves the results obtained by the state-of-the-art single-point search approaches for both energy models on three-dimensional face-centred-cubic lattice. We also experimentally show the effectiveness of mixing energy models within parallel threads.

蛋白质结构预测在计算上是一个非常具有挑战性的问题。现有的大量搜索算法试图通过探索可能的结构并找到具有最小自由能的结构来解决问题。然而,由于搜索空间太大,这些算法在大尺寸蛋白质上表现不佳。在本文中,我们提出了一个多点螺旋搜索框架,该框架使用并行处理技术,通过从不同的点开始加速探索。在我们的方法中,生成一组随机初始解并将其分发给不同的线程。我们允许每个线程运行一段预定义的时间。改进的解决方案是按线程存储的。当线程结束时,解决方案合并在一起,并删除重复项。然后将一组不同的解决方案再次拆分到不同的线程中。在我们的从头计算蛋白质结构预测方法中,我们使用三维面心立方晶格进行结构-骨架映射。我们使用低分辨率的疏水极性能量模型和高分辨率的20 × 20能量模型进行搜索指导。实验结果表明,在三维面心立方晶格上,我们的并行框架显著改善了单点搜索方法对两种能量模型的搜索结果。我们还通过实验证明了在并行线程中混合能量模型的有效性。
{"title":"A Parallel Framework for Multipoint Spiral Search in ab Initio Protein Structure Prediction.","authors":"Mahmood A Rashid,&nbsp;Swakkhar Shatabda,&nbsp;M A Hakim Newton,&nbsp;Md Tamjidul Hoque,&nbsp;Abdul Sattar","doi":"10.1155/2014/985968","DOIUrl":"https://doi.org/10.1155/2014/985968","url":null,"abstract":"<p><p>Protein structure prediction is computationally a very challenging problem. A large number of existing search algorithms attempt to solve the problem by exploring possible structures and finding the one with the minimum free energy. However, these algorithms perform poorly on large sized proteins due to an astronomically wide search space. In this paper, we present a multipoint spiral search framework that uses parallel processing techniques to expedite exploration by starting from different points. In our approach, a set of random initial solutions are generated and distributed to different threads. We allow each thread to run for a predefined period of time. The improved solutions are stored threadwise. When the threads finish, the solutions are merged together and the duplicates are removed. A selected distinct set of solutions are then split to different threads again. In our ab initio protein structure prediction method, we use the three-dimensional face-centred-cubic lattice for structure-backbone mapping. We use both the low resolution hydrophobic-polar energy model and the high-resolution 20 × 20 energy model for search guiding. The experimental results show that our new parallel framework significantly improves the results obtained by the state-of-the-art single-point search approaches for both energy models on three-dimensional face-centred-cubic lattice. We also experimentally show the effectiveness of mixing energy models within parallel threads. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"985968"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/985968","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32271915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
AUTO-MUTE 2.0: A Portable Framework with Enhanced Capabilities for Predicting Protein Functional Consequences upon Mutation. AUTO-MUTE 2.0:一个便携式框架,增强了预测突变后蛋白质功能后果的能力。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-08-17 DOI: 10.1155/2014/278385
Majid Masso, Iosif I Vaisman

The AUTO-MUTE 2.0 stand-alone software package includes a collection of programs for predicting functional changes to proteins upon single residue substitutions, developed by combining structure-based features with trained statistical learning models. Three of the predictors evaluate changes to protein stability upon mutation, each complementing a distinct experimental approach. Two additional classifiers are available, one for predicting activity changes due to residue replacements and the other for determining the disease potential of mutations associated with nonsynonymous single nucleotide polymorphisms (nsSNPs) in human proteins. These five command-line driven tools, as well as all the supporting programs, complement those that run our AUTO-MUTE web-based server. Nevertheless, all the codes have been rewritten and substantially altered for the new portable software, and they incorporate several new features based on user feedback. Included among these upgrades is the ability to perform three highly requested tasks: to run "big data" batch jobs; to generate predictions using modified protein data bank (PDB) structures, and unpublished personal models prepared using standard PDB file formatting; and to utilize NMR structure files that contain multiple models.

AUTO-MUTE 2.0独立软件包包括一系列程序,用于预测单个残基替换时蛋白质的功能变化,这些程序是通过结合基于结构的特征和训练过的统计学习模型开发的。其中三种预测因子评估突变后蛋白质稳定性的变化,每种预测因子都补充了一种不同的实验方法。另外两种分类器可用,一种用于预测残基替换引起的活性变化,另一种用于确定与人类蛋白质中非同义单核苷酸多态性(nssnp)相关的突变的疾病潜力。这五个命令行驱动的工具,以及所有的支持程序,补充了那些运行AUTO-MUTE基于web的服务器的工具。然而,所有的代码都被重写了,并为新的可移植软件进行了实质性的修改,并且根据用户的反馈,它们包含了一些新的特性。这些升级包括执行三个高要求任务的能力:运行“大数据”批处理作业;使用修改后的蛋白质数据库(PDB)结构和使用标准PDB文件格式准备的未发表的个人模型生成预测;并利用包含多个模型的NMR结构文件。
{"title":"AUTO-MUTE 2.0: A Portable Framework with Enhanced Capabilities for Predicting Protein Functional Consequences upon Mutation.","authors":"Majid Masso,&nbsp;Iosif I Vaisman","doi":"10.1155/2014/278385","DOIUrl":"https://doi.org/10.1155/2014/278385","url":null,"abstract":"<p><p>The AUTO-MUTE 2.0 stand-alone software package includes a collection of programs for predicting functional changes to proteins upon single residue substitutions, developed by combining structure-based features with trained statistical learning models. Three of the predictors evaluate changes to protein stability upon mutation, each complementing a distinct experimental approach. Two additional classifiers are available, one for predicting activity changes due to residue replacements and the other for determining the disease potential of mutations associated with nonsynonymous single nucleotide polymorphisms (nsSNPs) in human proteins. These five command-line driven tools, as well as all the supporting programs, complement those that run our AUTO-MUTE web-based server. Nevertheless, all the codes have been rewritten and substantially altered for the new portable software, and they incorporate several new features based on user feedback. Included among these upgrades is the ability to perform three highly requested tasks: to run \"big data\" batch jobs; to generate predictions using modified protein data bank (PDB) structures, and unpublished personal models prepared using standard PDB file formatting; and to utilize NMR structure files that contain multiple models. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"278385"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/278385","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32647735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 52
Elementary Flux Mode Analysis of Acetyl-CoA Pathway in Carboxydothermus hydrogenoformans Z-2901. 甲酸热菌Z-2901乙酰-辅酶a途径的基本通量模式分析
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2014-01-01 Epub Date: 2014-04-16 DOI: 10.1155/2014/928038
Rajadurai Chinnasamy Perumal, Ashok Selvaraj, Gopal Ramesh Kumar

Carboxydothermus hydrogenoformans is a carboxydotrophic hydrogenogenic bacterium species that produces hydrogen molecule by utilizing carbon monoxide (CO) or pyruvate as a carbon source. To investigate the underlying biochemical mechanism of hydrogen production, an elementary mode analysis of acetyl-CoA pathway was performed to determine the intermediate fluxes by combining linear programming (LP) method available in CellNetAnalyzer software. We hypothesized that addition of enzymes necessary for carbon monoxide fixation and pyruvate dissimilation would enhance the theoretical yield of hydrogen. An in silico gene knockout of pyk, pykC, and mdh genes of modeled acetyl-CoA pathway allows the maximum theoretical hydrogen yield of 47.62 mmol/gCDW/h for 1 mole of carbon monoxide (CO) uptake. The obtained hydrogen yield is comparatively two times greater than the previous experimental data. Therefore, it could be concluded that this elementary flux mode analysis is a crucial way to achieve efficient hydrogen production through acetyl-CoA pathway and act as a model for strain improvement.

羧酸热菌(Carboxydothermus hydrogenformans)是一种利用一氧化碳或丙酮酸作为碳源产生氢分子的羧营养产氢细菌。为了研究产氢的潜在生化机制,利用CellNetAnalyzer软件中的线性规划(LP)方法,对乙酰辅酶a途径进行了初等模式分析,以确定中间通量。我们假设,添加一氧化碳固定和丙酮酸异化所必需的酶将提高氢的理论产率。对模拟的乙酰辅酶a途径的pyk、pykC和mdh基因进行硅基因敲除后,1摩尔一氧化碳(CO)的最大理论产氢量为47.62 mmol/gCDW/h。所得的氢气产率比以前的实验数据高出两倍。因此,可以得出结论,这种初等通量模式分析是通过乙酰辅酶a途径实现高效产氢的重要途径,并可作为菌株改良的模型。
{"title":"Elementary Flux Mode Analysis of Acetyl-CoA Pathway in Carboxydothermus hydrogenoformans Z-2901.","authors":"Rajadurai Chinnasamy Perumal,&nbsp;Ashok Selvaraj,&nbsp;Gopal Ramesh Kumar","doi":"10.1155/2014/928038","DOIUrl":"https://doi.org/10.1155/2014/928038","url":null,"abstract":"<p><p>Carboxydothermus hydrogenoformans is a carboxydotrophic hydrogenogenic bacterium species that produces hydrogen molecule by utilizing carbon monoxide (CO) or pyruvate as a carbon source. To investigate the underlying biochemical mechanism of hydrogen production, an elementary mode analysis of acetyl-CoA pathway was performed to determine the intermediate fluxes by combining linear programming (LP) method available in CellNetAnalyzer software. We hypothesized that addition of enzymes necessary for carbon monoxide fixation and pyruvate dissimilation would enhance the theoretical yield of hydrogen. An in silico gene knockout of pyk, pykC, and mdh genes of modeled acetyl-CoA pathway allows the maximum theoretical hydrogen yield of 47.62 mmol/gCDW/h for 1 mole of carbon monoxide (CO) uptake. The obtained hydrogen yield is comparatively two times greater than the previous experimental data. Therefore, it could be concluded that this elementary flux mode analysis is a crucial way to achieve efficient hydrogen production through acetyl-CoA pathway and act as a model for strain improvement. </p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2014 ","pages":"928038"},"PeriodicalIF":0.0,"publicationDate":"2014-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2014/928038","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"32338574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
Advances in Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1