首页 > 最新文献

Advances in Bioinformatics最新文献

英文 中文
Structural Prediction and Mutational Analysis of Rv3906c Gene of Mycobacterium tuberculosis H37Rv to Determine Its Essentiality in Survival. 对结核分枝杆菌 H37Rv 的 Rv3906c 基因进行结构预测和突变分析,以确定其在存活中的重要性。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-08-15 eCollection Date: 2018-01-01 DOI: 10.1155/2018/6152014
Md Amjad Beg, Shivangi, Sonu Chand Thakur, Laxman S Meena

The emergence of tuberculosis is at the peak; therefore to station it at its lower level we hereby try bioinformatics approach against Mycobacterium tuberculosis [M. tuberculosis] pathogenesis. Rv3906c is a conserved hypothetical gene of M. tuberculosis and contains many GTP binding protein motif DXXG which demonstrate that this gene might be processed in a GTP binding or in GTP hydrolyzing manner. This gene shows interaction with its adjacent genes as well as pcnA which is a polymerase and localized in the extracellular region and found to be a soluble protein. Rv3906c has binding pockets for calcium atom at various positions which prove that calcium might have some role during the process of this gene. GTP binding protein motif DXXG is present in various positions and calcium binds at this site with a C-score of 0.25. Mutational analysis on this motif shows the large decrease of stability after mutation of aspartate residue with glycine. Stress conditions like pH and temperature also change stability of the protein. A decrease in stability at this position might play a role in inhibition of survival of the pathogen. These computational studies of this gene might be a successful step towards drug development against tuberculosis.

结核病的出现正处于高峰期,因此,为了将其控制在较低水平,我们在此尝试针对结核分枝杆菌(M. tuberculosis)发病机制的生物信息学方法。Rv3906c 是结核分枝杆菌的一个保守假定基因,含有许多 GTP 结合蛋白基序 DXXG,这表明该基因可能以 GTP 结合或 GTP 水解的方式进行加工。该基因与相邻基因以及 pcnA(一种聚合酶)相互作用,pcnA 定位于细胞外区域,是一种可溶性蛋白。Rv3906c 在不同位置有与钙原子结合的口袋,这证明钙可能在该基因的作用过程中发挥了某种作用。GTP 结合蛋白图案 DXXG 存在于不同位置,钙与该位置结合的 C 值为 0.25。对该基团的突变分析表明,天冬氨酸残基突变为甘氨酸后,其稳定性大大降低。pH 值和温度等压力条件也会改变蛋白质的稳定性。该位置稳定性的降低可能会抑制病原体的生存。对该基因进行的这些计算研究可能会为开发抗结核药物迈出成功的一步。
{"title":"Structural Prediction and Mutational Analysis of Rv3906c Gene of <i>Mycobacterium tuberculosis</i> H<sub>37</sub>Rv to Determine Its Essentiality in Survival.","authors":"Md Amjad Beg, Shivangi, Sonu Chand Thakur, Laxman S Meena","doi":"10.1155/2018/6152014","DOIUrl":"10.1155/2018/6152014","url":null,"abstract":"<p><p>The emergence of tuberculosis is at the peak; therefore to station it at its lower level we hereby try bioinformatics approach against <i>Mycobacterium tuberculosis</i> [<i>M. tuberculosis</i>] pathogenesis. Rv3906c is a conserved hypothetical gene of <i>M. tuberculosis</i> and contains many GTP binding protein motif DXXG which demonstrate that this gene might be processed in a GTP binding or in GTP hydrolyzing manner. This gene shows interaction with its adjacent genes as well as <i>pcnA</i> which is a polymerase and localized in the extracellular region and found to be a soluble protein. Rv3906c has binding pockets for calcium atom at various positions which prove that calcium might have some role during the process of this gene. GTP binding protein motif DXXG is present in various positions and calcium binds at this site with a C-score of 0.25. Mutational analysis on this motif shows the large decrease of stability after mutation of aspartate residue with glycine. Stress conditions like pH and temperature also change stability of the protein. A decrease in stability at this position might play a role in inhibition of survival of the pathogen. These computational studies of this gene might be a successful step towards drug development against tuberculosis.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2018 ","pages":"6152014"},"PeriodicalIF":0.0,"publicationDate":"2018-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6114228/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36466013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gaussian Fuzzy Number for STR-DNA Similarity Calculation Involving Familial and Tribal Relationships. 涉及家族和部落关系的STR-DNA相似性计算的高斯模糊数。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-07-29 eCollection Date: 2018-01-01 DOI: 10.1155/2018/8602513
Maria Susan Anggreainy, M Rahmat Widyanto, Belawati H Widjaja, Nurtami Soedarsono

We performed locus similarity calculation by measuring fuzzy intersection between individual locus and reference locus and then performed CODIS STR-DNA similarity calculation. The fuzzy intersection calculation enables a more robust CODIS STR-DNA similarity calculation due to imprecision caused by noise produced by PCR machine. We also proposed shifted convoluted Gaussian fuzzy number (SCGFN) and Gaussian fuzzy number (GFN) to represent each locus value as improvement of triangular fuzzy number (TFN) as used in previous research. Compared to triangular fuzzy number (TFN), GFN is more realistic to represent uncertainty of locus information because the distribution is assumed to be Gaussian. Then, the original Gaussian fuzzy number (GFN) is convoluted with distribution of certain ethnic locus information to produce the new SCGFN which more represents ethnic information compared to original GFN. Experiments were done for the following cases: people with family relationships, people of the same tribe, and certain tribal populations. The statistical test with analysis of variance (ANOVA) shows the difference in similarity between SCGFN, GFN, and TFN with a significant level of 95%. The Tukey method in ANOVA shows that SCGFN yields a higher similarity which means being better than the GFN and TFN methods. The proposed method enables CODIS STR-DNA similarity calculation which is more robust to noise and performed better CODIS similarity calculation involving familial and tribal relationships.

通过测量个体基因座与参考基因座之间的模糊交集进行基因座相似度计算,然后进行CODIS STR-DNA相似度计算。模糊交集计算使CODIS STR-DNA相似性计算更加稳健,避免了PCR机产生的噪声带来的不精确性。我们还提出了移位卷积高斯模糊数(SCGFN)和高斯模糊数(GFN)来表示每个位点值,作为对三角模糊数(TFN)的改进。与三角模糊数(TFN)相比,GFN假设轨迹信息的分布为高斯分布,因此更能真实地表达轨迹信息的不确定性。然后,将原高斯模糊数(GFN)与某些民族基因座信息的分布进行卷积,得到比原高斯模糊数更能代表民族信息的新高斯模糊数。实验针对以下情况进行:有家庭关系的人,同一部落的人,以及特定的部落人口。方差分析(ANOVA)的统计检验显示,SCGFN、GFN和TFN之间的相似性差异显著水平为95%。方差分析中的Tukey方法表明,SCGFN产生了更高的相似性,这意味着比GFN和TFN方法更好。该方法使CODIS STR-DNA相似度计算对噪声的鲁棒性更强,能够更好地计算涉及家族和部落关系的CODIS相似度。
{"title":"Gaussian Fuzzy Number for STR-DNA Similarity Calculation Involving Familial and Tribal Relationships.","authors":"Maria Susan Anggreainy,&nbsp;M Rahmat Widyanto,&nbsp;Belawati H Widjaja,&nbsp;Nurtami Soedarsono","doi":"10.1155/2018/8602513","DOIUrl":"https://doi.org/10.1155/2018/8602513","url":null,"abstract":"<p><p>We performed locus similarity calculation by measuring fuzzy intersection between individual locus and reference locus and then performed CODIS STR-DNA similarity calculation. The fuzzy intersection calculation enables a more robust CODIS STR-DNA similarity calculation due to imprecision caused by noise produced by PCR machine. We also proposed shifted convoluted Gaussian fuzzy number (SCGFN) and Gaussian fuzzy number (GFN) to represent each locus value as improvement of triangular fuzzy number (TFN) as used in previous research. Compared to triangular fuzzy number (TFN), GFN is more realistic to represent uncertainty of locus information because the distribution is assumed to be Gaussian. Then, the original Gaussian fuzzy number (GFN) is convoluted with distribution of certain ethnic locus information to produce the new SCGFN which more represents ethnic information compared to original GFN. Experiments were done for the following cases: people with family relationships, people of the same tribe, and certain tribal populations. The statistical test with analysis of variance (ANOVA) shows the difference in similarity between SCGFN, GFN, and TFN with a significant level of 95%. The Tukey method in ANOVA shows that SCGFN yields a higher similarity which means being better than the GFN and TFN methods. The proposed method enables CODIS STR-DNA similarity calculation which is more robust to noise and performed better CODIS similarity calculation involving familial and tribal relationships.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2018 ","pages":"8602513"},"PeriodicalIF":0.0,"publicationDate":"2018-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2018/8602513","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36434449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Creation of Individual Scientific Concept-Centered Semantic Maps Based on Automated Text-Mining Analysis of PubMed. 基于PubMed自动文本挖掘分析的个体科学概念中心语义图的创建。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-07-26 eCollection Date: 2018-01-01 DOI: 10.1155/2018/4625394
Ekaterina Ilgisonis, Andrey Lisitsa, Valerya Kudryavtseva, Elena Ponomarenko

Concept-centered semantic maps were created based on a text-mining analysis of PubMed using the BiblioEngine_v2018 software. The objects ("concepts") of a semantic map can be MeSH-terms or other terms (names of proteins, diseases, chemical compounds, etc.) structured in the form of controlled vocabularies. The edges between the two objects were automatically calculated based on the index of semantic similarity, which is proportional to the number of publications related to both objects simultaneously. On the one hand, an individual semantic map created based on the already published papers allows us to trace scientific inquiry. On the other hand, a prospective analysis based on the study of PubMed search history enables us to determine the possible directions for future research.

基于PubMed的文本挖掘分析,使用BiblioEngine_v2018软件创建了以概念为中心的语义图。语义图的对象(“概念”)可以是MeSH术语或以受控词汇表形式构建的其他术语(蛋白质、疾病、化合物等的名称)。基于语义相似性指数自动计算两个对象之间的边缘,该指数与同时与两个对象相关的出版物数量成比例。一方面,基于已经发表的论文创建的个体语义图使我们能够追踪科学探究。另一方面,基于PubMed搜索历史研究的前瞻性分析使我们能够确定未来研究的可能方向。
{"title":"Creation of Individual Scientific Concept-Centered Semantic Maps Based on Automated Text-Mining Analysis of PubMed.","authors":"Ekaterina Ilgisonis, Andrey Lisitsa, Valerya Kudryavtseva, Elena Ponomarenko","doi":"10.1155/2018/4625394","DOIUrl":"10.1155/2018/4625394","url":null,"abstract":"<p><p>Concept-centered semantic maps were created based on a text-mining analysis of PubMed using the BiblioEngine_v2018 software. The objects (\"concepts\") of a semantic map can be MeSH-terms or other terms (names of proteins, diseases, chemical compounds, etc.) structured in the form of controlled vocabularies. The edges between the two objects were automatically calculated based on the index of semantic similarity, which is proportional to the number of publications related to both objects simultaneously. On the one hand, an individual semantic map created based on the already published papers allows us to trace scientific inquiry. On the other hand, a prospective analysis based on the study of PubMed search history enables us to determine the possible directions for future research.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2018 ","pages":"4625394"},"PeriodicalIF":0.0,"publicationDate":"2018-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6083525/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36427797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Big Data Management for Healthcare Systems: Architecture, Requirements, and Implementation. 医疗保健系统的大数据管理:架构、需求和实现。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-06-21 eCollection Date: 2018-01-01 DOI: 10.1155/2018/4059018
Naoual El Aboudi, Laila Benhlima

The growing amount of data in healthcare industry has made inevitable the adoption of big data techniques in order to improve the quality of healthcare delivery. Despite the integration of big data processing approaches and platforms in existing data management architectures for healthcare systems, these architectures face difficulties in preventing emergency cases. The main contribution of this paper is proposing an extensible big data architecture based on both stream computing and batch computing in order to enhance further the reliability of healthcare systems by generating real-time alerts and making accurate predictions on patient health condition. Based on the proposed architecture, a prototype implementation has been built for healthcare systems in order to generate real-time alerts. The suggested prototype is based on spark and MongoDB tools.

医疗保健行业数据量的不断增长使得采用大数据技术以提高医疗保健服务质量成为必然。尽管在现有的医疗保健系统数据管理架构中集成了大数据处理方法和平台,但这些架构在预防紧急情况方面面临困难。本文的主要贡献是提出了一种基于流计算和批处理计算的可扩展大数据架构,通过生成实时警报和对患者健康状况做出准确预测,进一步提高医疗保健系统的可靠性。基于所提出的体系结构,为医疗保健系统构建了一个原型实现,以生成实时警报。建议的原型是基于spark和MongoDB工具的。
{"title":"Big Data Management for Healthcare Systems: Architecture, Requirements, and Implementation.","authors":"Naoual El Aboudi,&nbsp;Laila Benhlima","doi":"10.1155/2018/4059018","DOIUrl":"https://doi.org/10.1155/2018/4059018","url":null,"abstract":"<p><p>The growing amount of data in healthcare industry has made inevitable the adoption of big data techniques in order to improve the quality of healthcare delivery. Despite the integration of big data processing approaches and platforms in existing data management architectures for healthcare systems, these architectures face difficulties in preventing emergency cases. The main contribution of this paper is proposing an extensible big data architecture based on both stream computing and batch computing in order to enhance further the reliability of healthcare systems by generating real-time alerts and making accurate predictions on patient health condition. Based on the proposed architecture, a prototype implementation has been built for healthcare systems in order to generate real-time alerts. The suggested prototype is based on spark and MongoDB tools.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2018 ","pages":"4059018"},"PeriodicalIF":0.0,"publicationDate":"2018-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2018/4059018","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36336300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 87
A Novel Framework for Ab Initio Coarse Protein Structure Prediction. Ab Initio粗蛋白结构预测的新框架。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-06-20 DOI: 10.1155/2018/7607384
Sandhya Parasnath Dubey, S Balaji, N Gopalakrishna Kini, M Sathish Kumar

Hydrophobic-Polar model is a simplified representation of Protein Structure Prediction (PSP) problem. However, even with the HP model, the PSP problem remains NP-complete. This work proposes a systematic and problem specific design for operators of the evolutionary program which hybrids with local search hill climbing, to efficiently explore the search space of PSP and thereby obtain an optimum conformation. The proposed algorithm achieves this by incorporating the following novel features: (i) new initialization method which generates only valid individuals with (rather than random) better fitness values; (ii) use of probability-based selection operators that limit the local convergence; (iii) use of secondary structure based mutation operator that makes the structure more closely to the laboratory determined structure; and (iv) incorporating all the above-mentioned features developed a complete two-tier framework. The developed framework builds the protein conformation on the square and triangular lattice. The test has been performed using benchmark sequences, and a comparative evaluation is done with various state-of-the-art algorithms. Moreover, in addition to hypothetical test sequences, we have tested protein sequences deposited in protein database repository. It has been observed that the proposed framework has shown superior performance regarding accuracy (fitness value) and speed (number of generations needed to attain the final conformation). The concepts used to enhance the performance are generic and can be used with any other population-based search algorithm such as genetic algorithm, ant colony optimization, and immune algorithm.

疏水极性模型是蛋白质结构预测(PSP)问题的简化表示。然而,即使使用HP模型,PSP问题仍然是NP完全问题。本文提出了一种系统的、针对特定问题的进化程序算子设计,将其与局部搜索爬山相结合,以有效地探索PSP的搜索空间,从而获得最优构象。所提出的算法通过结合以下新特征来实现这一点:(i)新的初始化方法,该方法只生成具有(而不是随机的)更好适应度值的有效个体;(ii)使用限制局部收敛的基于概率的选择算子;(iii)使用基于二级结构的突变算子,其使结构更接近实验室确定的结构;以及(iv)结合所有上述特征,开发了一个完整的双层框架。所开发的框架在正方形和三角形晶格上构建蛋白质构象。使用基准序列进行了测试,并使用各种最先进的算法进行了比较评估。此外,除了假设的测试序列外,我们还测试了存储在蛋白质数据库存储库中的蛋白质序列。已经观察到,所提出的框架在准确性(适应度值)和速度(获得最终构象所需的代数)方面显示出优异的性能。用于增强性能的概念是通用的,可以与任何其他基于群体的搜索算法一起使用,如遗传算法、蚁群优化和免疫算法。
{"title":"A Novel Framework for <i>Ab Initio</i> Coarse Protein Structure Prediction.","authors":"Sandhya Parasnath Dubey,&nbsp;S Balaji,&nbsp;N Gopalakrishna Kini,&nbsp;M Sathish Kumar","doi":"10.1155/2018/7607384","DOIUrl":"10.1155/2018/7607384","url":null,"abstract":"<p><p>Hydrophobic-Polar model is a simplified representation of Protein Structure Prediction (PSP) problem. However, even with the HP model, the PSP problem remains NP-complete. This work proposes a systematic and problem specific design for operators of the evolutionary program which hybrids with local search hill climbing, to efficiently explore the search space of PSP and thereby obtain an optimum conformation. The proposed algorithm achieves this by incorporating the following novel features: (i) new initialization method which generates only valid individuals with (rather than random) better fitness values; (ii) use of probability-based selection operators that limit the local convergence; (iii) use of secondary structure based mutation operator that makes the structure more closely to the laboratory determined structure; and (iv) incorporating all the above-mentioned features developed a complete two-tier framework. The developed framework builds the protein conformation on the square and triangular lattice. The test has been performed using benchmark sequences, and a comparative evaluation is done with various state-of-the-art algorithms. Moreover, in addition to hypothetical test sequences, we have tested protein sequences deposited in protein database repository. It has been observed that the proposed framework has shown superior performance regarding accuracy (fitness value) and speed (number of generations needed to attain the final conformation). The concepts used to enhance the performance are generic and can be used with any other population-based search algorithm such as genetic algorithm, ant colony optimization, and immune algorithm.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2018 ","pages":"7607384"},"PeriodicalIF":0.0,"publicationDate":"2018-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2018/7607384","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36328364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
In Silico Characterization and Structural Modeling of Dermacentor andersoni p36 Immunosuppressive Protein. 安德氏真皮真皮免疫抑制蛋白的硅表征和结构建模。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-04-08 eCollection Date: 2018-01-01 DOI: 10.1155/2018/7963401
Martin Omulindi Oyugi, Johnson Kangethe Kinyua, Esther Nkirote Magiri, Milcah Wagio Kigoni, Evenilton Pessoa Costa, Naftaly Wang'ombe Githaka

Ticks cause approximately $17-19 billion economic losses to the livestock industry globally. Development of recombinant antitick vaccine is greatly hindered by insufficient knowledge and understanding of proteins expressed by ticks. Ticks secrete immunosuppressant proteins that modulate the host's immune system during blood feeding; these molecules could be a target for antivector vaccine development. Recombinant p36, a 36 kDa immunosuppressor from the saliva of female Dermacentor andersoni, suppresses T-lymphocytes proliferation in vitro. To identify potential unique structural and dynamic properties responsible for the immunosuppressive function of p36 proteins, this study utilized bioinformatic tool to characterize and model structure of D. andersoni p36 protein. Evaluation of p36 protein family as suitable vaccine antigens predicted a p36 homolog in Rhipicephalus appendiculatus, the tick vector of East Coast fever, with an antigenicity score of 0.7701 that compares well with that of Bm86 (0.7681), the protein antigen that constitute commercial tick vaccine Tickgard™. Ab initio modeling of the D. andersoni p36 protein yielded a 3D structure that predicted conserved antigenic region, which has potential of binding immunomodulating ligands including glycerol and lactose, found located within exposed loop, suggesting a likely role in immunosuppressive function of tick p36 proteins. Laboratory confirmation of these preliminary results is necessary in future studies.

蜱虫给全球畜牧业造成了约170亿至190亿美元的经济损失。由于对蜱表达的蛋白质缺乏认识和了解,重组抗蜱疫苗的开发受到很大阻碍。蜱虫分泌免疫抑制蛋白,在吸血过程中调节宿主的免疫系统;这些分子可能成为抗载体疫苗开发的目标。重组蛋白p36是一种36 kDa的免疫抑制因子,来自雌性安德氏真皮单胞虫唾液,体外抑制t淋巴细胞增殖。为了确定p36蛋白免疫抑制功能的潜在独特结构和动态特性,本研究利用生物信息学工具对安德氏d.a andersoni p36蛋白的结构进行了表征和建模。对p36蛋白家族作为合适的疫苗抗原的评估预测,在东海岸热的蜱病媒介阑尾棘头蜱(Rhipicephalus appendiculatus)中存在p36同源物,其抗原性评分为0.7701,与构成商品蜱病疫苗Tickgard™的蛋白抗原Bm86(0.7681)的抗原性评分相当。对d.m andersoni p36蛋白进行从头计算建模,得到了一个3D结构,预测了保守的抗原区域,该区域具有结合免疫调节配体的潜力,包括甘油和乳糖,发现位于暴露环内,这表明蜱虫p36蛋白可能在免疫抑制功能中起作用。在未来的研究中,有必要对这些初步结果进行实验室确认。
{"title":"In Silico Characterization and Structural Modeling of <i>Dermacentor andersoni</i> p36 Immunosuppressive Protein.","authors":"Martin Omulindi Oyugi,&nbsp;Johnson Kangethe Kinyua,&nbsp;Esther Nkirote Magiri,&nbsp;Milcah Wagio Kigoni,&nbsp;Evenilton Pessoa Costa,&nbsp;Naftaly Wang'ombe Githaka","doi":"10.1155/2018/7963401","DOIUrl":"https://doi.org/10.1155/2018/7963401","url":null,"abstract":"<p><p>Ticks cause approximately $17-19 billion economic losses to the livestock industry globally. Development of recombinant antitick vaccine is greatly hindered by insufficient knowledge and understanding of proteins expressed by ticks. Ticks secrete immunosuppressant proteins that modulate the host's immune system during blood feeding; these molecules could be a target for antivector vaccine development. Recombinant p36, a 36 kDa immunosuppressor from the saliva of female <i>Dermacentor andersoni</i>, suppresses T-lymphocytes proliferation <i>in vitro.</i> To identify potential unique structural and dynamic properties responsible for the immunosuppressive function of p36 proteins, this study utilized bioinformatic tool to characterize and model structure of <i>D. andersoni</i> p36 protein. Evaluation of p36 protein family as suitable vaccine antigens predicted a p36 homolog in <i>Rhipicephalus appendiculatus</i>, the tick vector of East Coast fever, with an antigenicity score of 0.7701 that compares well with that of Bm86 (0.7681), the protein antigen that constitute commercial tick vaccine Tickgard™. Ab initio modeling of the <i>D. andersoni</i> p36 protein yielded a 3D structure that predicted conserved antigenic region, which has potential of binding immunomodulating ligands including glycerol and lactose, found located within exposed loop, suggesting a likely role in immunosuppressive function of tick p36 proteins. Laboratory confirmation of these preliminary results is necessary in future studies.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2018 ","pages":"7963401"},"PeriodicalIF":0.0,"publicationDate":"2018-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2018/7963401","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36178249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Framework for Parallel Preprocessing of Microarray Data Using Hadoop. 基于Hadoop的微阵列数据并行预处理框架。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2018-03-29 eCollection Date: 2018-01-01 DOI: 10.1155/2018/9391635
Amirhossein Sahlabadi, Ravie Chandren Muniyandi, Mahdi Sahlabadi, Hossein Golshanbafghy

Nowadays, microarray technology has become one of the popular ways to study gene expression and diagnosis of disease. National Center for Biology Information (NCBI) hosts public databases containing large volumes of biological data required to be preprocessed, since they carry high levels of noise and bias. Robust Multiarray Average (RMA) is one of the standard and popular methods that is utilized to preprocess the data and remove the noises. Most of the preprocessing algorithms are time-consuming and not able to handle a large number of datasets with thousands of experiments. Parallel processing can be used to address the above-mentioned issues. Hadoop is a well-known and ideal distributed file system framework that provides a parallel environment to run the experiment. In this research, for the first time, the capability of Hadoop and statistical power of R have been leveraged to parallelize the available preprocessing algorithm called RMA to efficiently process microarray data. The experiment has been run on cluster containing 5 nodes, while each node has 16 cores and 16 GB memory. It compares efficiency and the performance of parallelized RMA using Hadoop with parallelized RMA using affyPara package as well as sequential RMA. The result shows the speed-up rate of the proposed approach outperforms the sequential approach and affyPara approach.

目前,微阵列技术已成为研究基因表达和疾病诊断的热门方法之一。国家生物信息中心(NCBI)拥有包含大量需要预处理的生物数据的公共数据库,因为它们带有高水平的噪音和偏见。鲁棒多阵列平均(RMA)是一种常用的数据预处理和去噪方法。大多数的预处理算法耗时长,不能处理大量的数据集和数千个实验。并行处理可用于解决上述问题。Hadoop是一个著名的、理想的分布式文件系统框架,它提供了一个并行环境来运行实验。在本研究中,首次利用Hadoop的能力和R的统计能力来并行化可用的预处理算法RMA,以有效地处理微阵列数据。实验在包含5个节点的集群上运行,每个节点有16个内核和16gb内存。它比较了使用Hadoop的并行RMA与使用affyPara包的并行RMA以及顺序RMA的效率和性能。结果表明,该方法的加速速度优于顺序方法和affyPara方法。
{"title":"Framework for Parallel Preprocessing of Microarray Data Using Hadoop.","authors":"Amirhossein Sahlabadi,&nbsp;Ravie Chandren Muniyandi,&nbsp;Mahdi Sahlabadi,&nbsp;Hossein Golshanbafghy","doi":"10.1155/2018/9391635","DOIUrl":"https://doi.org/10.1155/2018/9391635","url":null,"abstract":"<p><p>Nowadays, microarray technology has become one of the popular ways to study gene expression and diagnosis of disease. National Center for Biology Information (NCBI) hosts public databases containing large volumes of biological data required to be preprocessed, since they carry high levels of noise and bias. Robust Multiarray Average (RMA) is one of the standard and popular methods that is utilized to preprocess the data and remove the noises. Most of the preprocessing algorithms are time-consuming and not able to handle a large number of datasets with thousands of experiments. Parallel processing can be used to address the above-mentioned issues. Hadoop is a well-known and ideal distributed file system framework that provides a parallel environment to run the experiment. In this research, for the first time, the capability of Hadoop and statistical power of R have been leveraged to parallelize the available preprocessing algorithm called RMA to efficiently process microarray data. The experiment has been run on cluster containing 5 nodes, while each node has 16 cores and 16 GB memory. It compares efficiency and the performance of parallelized RMA using Hadoop with parallelized RMA using affyPara package as well as sequential RMA. The result shows the speed-up rate of the proposed approach outperforms the sequential approach and affyPara approach.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2018 ","pages":"9391635"},"PeriodicalIF":0.0,"publicationDate":"2018-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2018/9391635","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36126845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
An Efficient Approach in Analysis of DNA Base Calling Using Neural Fuzzy Model. 一种基于神经模糊模型的DNA碱基调用分析方法。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-01-01 Epub Date: 2017-01-31 DOI: 10.1155/2017/3686025
Safa A Hameed, Raed I Hamed

This paper presented the issues of true representation and a reliable measure for analyzing the DNA base calling is provided. The method implemented dealt with the data set quality in analyzing DNA sequencing, it is investigating solution of the problem of using Neurofuzzy techniques for predicting the confidence value for each base in DNA base calling regarding collecting the data for each base in DNA, and the simulation model of designing the ANFIS contains three subsystems and main system; obtain the three features from the subsystems and in the main system and use the three features to predict the confidence value for each base. This is achieving effective results with high performance in employment.

本文提出了DNA碱基召唤的真实表示问题,并提供了一种可靠的分析方法。所实现的方法处理了DNA测序分析中的数据集质量问题,研究了利用神经模糊技术预测DNA碱基调用中每个碱基的置信度的问题,并设计了ANFIS的仿真模型,该模型包含三个子系统和主系统;从子系统和主系统中获取三个特征,并使用这三个特征来预测每个基的置信度值。这是在就业中取得高绩效的有效结果。
{"title":"An Efficient Approach in Analysis of DNA Base Calling Using Neural Fuzzy Model.","authors":"Safa A Hameed,&nbsp;Raed I Hamed","doi":"10.1155/2017/3686025","DOIUrl":"https://doi.org/10.1155/2017/3686025","url":null,"abstract":"<p><p>This paper presented the issues of true representation and a reliable measure for analyzing the DNA base calling is provided. The method implemented dealt with the data set quality in analyzing DNA sequencing, it is investigating solution of the problem of using Neurofuzzy techniques for predicting the confidence value for each base in DNA base calling regarding collecting the data for each base in DNA, and the simulation model of designing the ANFIS contains three subsystems and main system; obtain the three features from the subsystems and in the main system and use the three features to predict the confidence value for each base. This is achieving effective results with high performance in employment.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2017 ","pages":"3686025"},"PeriodicalIF":0.0,"publicationDate":"2017-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2017/3686025","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"34784171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Empirical Comparison of Visualization Tools for Larger-Scale Network Analysis. 大规模网络分析可视化工具的实证比较。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-01-01 Epub Date: 2017-07-18 DOI: 10.1155/2017/1278932
Georgios A Pavlopoulos, David Paez-Espino, Nikos C Kyrpides, Ioannis Iliopoulos

Gene expression, signal transduction, protein/chemical interactions, biomedical literature cooccurrences, and other concepts are often captured in biological network representations where nodes represent a certain bioentity and edges the connections between them. While many tools to manipulate, visualize, and interactively explore such networks already exist, only few of them can scale up and follow today's indisputable information growth. In this review, we shortly list a catalog of available network visualization tools and, from a user-experience point of view, we identify four candidate tools suitable for larger-scale network analysis, visualization, and exploration. We comment on their strengths and their weaknesses and empirically discuss their scalability, user friendliness, and postvisualization capabilities.

基因表达、信号转导、蛋白质/化学相互作用、生物医学文献共发生和其他概念经常被捕获在生物网络表示中,其中节点代表某个生物实体,并将它们之间的连接边缘。虽然已经存在许多工具来操纵、可视化和交互式地探索这些网络,但只有少数工具可以扩大规模,并跟上当今无可争议的信息增长。在这篇综述中,我们简要列出了可用的网络可视化工具的目录,从用户体验的角度来看,我们确定了四种适合大规模网络分析、可视化和探索的候选工具。我们评论了它们的优点和缺点,并经验地讨论了它们的可伸缩性、用户友好性和后可视化功能。
{"title":"Empirical Comparison of Visualization Tools for Larger-Scale Network Analysis.","authors":"Georgios A Pavlopoulos,&nbsp;David Paez-Espino,&nbsp;Nikos C Kyrpides,&nbsp;Ioannis Iliopoulos","doi":"10.1155/2017/1278932","DOIUrl":"https://doi.org/10.1155/2017/1278932","url":null,"abstract":"<p><p>Gene expression, signal transduction, protein/chemical interactions, biomedical literature cooccurrences, and other concepts are often captured in biological network representations where nodes represent a certain bioentity and edges the connections between them. While many tools to manipulate, visualize, and interactively explore such networks already exist, only few of them can scale up and follow today's indisputable information growth. In this review, we shortly list a catalog of available network visualization tools and, from a user-experience point of view, we identify four candidate tools suitable for larger-scale network analysis, visualization, and exploration. We comment on their strengths and their weaknesses and empirically discuss their scalability, user friendliness, and postvisualization capabilities.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2017 ","pages":"1278932"},"PeriodicalIF":0.0,"publicationDate":"2017-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2017/1278932","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35264464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 50
Computational Analysis of Gynura bicolor Bioactive Compounds as Dipeptidyl Peptidase-IV Inhibitor. Gynura双色生物活性化合物作为二肽基肽酶- iv抑制剂的计算分析。
Q1 Biochemistry, Genetics and Molecular Biology Pub Date : 2017-01-01 Epub Date: 2017-08-08 DOI: 10.1155/2017/5124165
Lina Rozano, Muhammad Redha Abdullah Zawawi, Muhamad Aizuddin Ahmad, Indu Bala Jaganath

The inhibition of dipeptidyl peptidase-IV (DPPIV) is a popular route for the treatment of type-2 diabetes. Commercially available gliptin-based drugs such as sitagliptin, anagliptin, linagliptin, saxagliptin, and alogliptin were specifically developed as DPPIV inhibitors for diabetic patients. The use of Gynura bicolor in treating diabetes had been reported in various in vitro experiments. However, an understanding of the inhibitory actions of G. bicolor bioactive compounds on DPPIV is still lacking and this may provide crucial information for the development of more potent and natural sources of DPPIV inhibitors. Evaluation of G. bicolor bioactive compounds for potent DPPIV inhibitors was computationally conducted using Lead IT and iGEMDOCK software, and the best free-binding energy scores for G. bicolor bioactive compounds were evaluated in comparison with the commercial DPPIV inhibitors, sitagliptin, anagliptin, linagliptin, saxagliptin, and alogliptin. Drug-likeness and absorption, distribution, metabolism, and excretion (ADME) analysis were also performed. Based on molecular docking analysis, four of the identified bioactive compounds in G. bicolor, 3-caffeoylquinic acid, 5-O-caffeoylquinic acid, 3,4-dicaffeoylquinic acid, and trans-5-p-coumaroylquinic acid, resulted in lower free-binding energy scores when compared with two of the commercially available gliptin inhibitors. The results revealed that bioactive compounds in G. bicolor are potential natural inhibitors of DPPIV.

抑制二肽基肽酶- iv (DPPIV)是治疗2型糖尿病的常用途径。市售的以格列汀为基础的药物,如西格列汀、安格列汀、利格列汀、沙格列汀和阿格列汀,是专门为糖尿病患者开发的DPPIV抑制剂。在各种体外实验中,已经报道了使用双色菊治疗糖尿病。然而,对双色蓝生物活性化合物对DPPIV的抑制作用的了解仍然缺乏,这可能为开发更有效的天然DPPIV抑制剂提供重要信息。利用Lead IT和iGEMDOCK软件对双色g生物活性化合物的有效DPPIV抑制剂进行了计算评估,并与商业DPPIV抑制剂西格列汀、阿格列汀、利格列汀、沙格列汀和阿格列汀进行了比较,评估了双色g生物活性化合物的最佳自由结合能得分。同时进行药物相似及吸收、分布、代谢和排泄(ADME)分析。基于分子对接分析,与两种市售的格列汀抑制剂相比,G. bicolor中鉴定的4种生物活性化合物(3-咖啡酰基奎宁酸、5- o -咖啡酰基奎宁酸、3,4-二咖啡酰基奎宁酸和反式5-对香豆酰奎宁酸)的自由结合能得分较低。结果表明,双色莲中的活性化合物是潜在的天然DPPIV抑制剂。
{"title":"Computational Analysis of <i>Gynura bicolor</i> Bioactive Compounds as Dipeptidyl Peptidase-IV Inhibitor.","authors":"Lina Rozano,&nbsp;Muhammad Redha Abdullah Zawawi,&nbsp;Muhamad Aizuddin Ahmad,&nbsp;Indu Bala Jaganath","doi":"10.1155/2017/5124165","DOIUrl":"https://doi.org/10.1155/2017/5124165","url":null,"abstract":"<p><p>The inhibition of dipeptidyl peptidase-IV (DPPIV) is a popular route for the treatment of type-2 diabetes. Commercially available gliptin-based drugs such as sitagliptin, anagliptin, linagliptin, saxagliptin, and alogliptin were specifically developed as DPPIV inhibitors for diabetic patients. The use of <i>Gynura bicolor</i> in treating diabetes had been reported in various in vitro experiments. However, an understanding of the inhibitory actions of <i>G. bicolor</i> bioactive compounds on DPPIV is still lacking and this may provide crucial information for the development of more potent and natural sources of DPPIV inhibitors. Evaluation of <i>G. bicolor</i> bioactive compounds for potent DPPIV inhibitors was computationally conducted using Lead IT and iGEMDOCK software, and the best free-binding energy scores for <i>G. bicolor</i> bioactive compounds were evaluated in comparison with the commercial DPPIV inhibitors, sitagliptin, anagliptin, linagliptin, saxagliptin, and alogliptin. Drug-likeness and absorption, distribution, metabolism, and excretion (ADME) analysis were also performed. Based on molecular docking analysis, four of the identified bioactive compounds in <i>G. bicolor</i>, 3-caffeoylquinic acid, 5-O-caffeoylquinic acid, 3,4-dicaffeoylquinic acid, and <i>trans</i>-5-<i>p</i>-coumaroylquinic acid, resulted in lower free-binding energy scores when compared with two of the commercially available gliptin inhibitors. The results revealed that bioactive compounds in <i>G. bicolor</i> are potential natural inhibitors of DPPIV.</p>","PeriodicalId":39059,"journal":{"name":"Advances in Bioinformatics","volume":"2017 ","pages":"5124165"},"PeriodicalIF":0.0,"publicationDate":"2017-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1155/2017/5124165","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35373105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
期刊
Advances in Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1