首页 > 最新文献

Journal of Integrative Bioinformatics最新文献

英文 中文
Special Issue of the 1st International Applied Bioinformatics Conference (iABC'21). 第一届国际应用生物信息学会议(iABC'21)特刊。
IF 1.9 Q1 Medicine Pub Date : 2021-12-16 DOI: 10.1515/jib-2021-0042
Jens Allmer, Mourad Elloumi, Matteo Comin, Ralf Hofestädt
Diseases can be tied to changes at the molecular level within affected cells. This can be concerning transcription, translation, or any other mechanism involved in gene expression, such as post-transcriptional regulation. Instrumentation for the measurement of such molecular changes is readily available and produces large amounts of data. For example, DNA and RNA sequencing, as well as protein quantitation, and sequencing can be achieved via next-generation sequencing andmass spectrometry, respectively. One current challenge is the analysis and integration of the resulting heterogeneous and large datasets. Bioinformatics is the field of study which produces algorithms and integrative approaches to attempt suchdata analyses. The primary aim in algorithmic bioinformatics is, however, the development of algorithms and not their application. Typically, novel algorithms are introduced with a proof of principle, and they are applied to some data for that purpose, but usually not comprehensively. Their data might slightly differ from the proof of principle, inducing further data analysis challenges. Additionally, applying such algorithms to their data may be involved for researchers from the biomedical domain. The 1st International Applied Bioinformatics Conference was conceived to bring together representatives from all research fields involved to increase knowledge transfer. First planned for 2020 and then deferred to 2021 due to the pandemic caused by the Coronavirus [1], the conference was held online. Despite the virtual nature of the conference, attentionwas great.We receivedmany goodmanuscripts and invited a few to submit their full versions to this special issue. The range of topics was extensive, but many submissions concerned the interface of bioinformatics and its application. The selected papers for this special issue also discuss various topics such as sequence alignment and gene network reconstruction. The first paper in this special issue concerns a challenging issue in bioinformatics, the usage of pangenomes instead of single reference genomes and offers a fast variation-aware read mapping algorithm [2]. Mapping is also vital to investigate gene expression, which is essential for the secondmanuscript. It discusses how microRNA and mRNA expression profiles can be investigated [3]. From this, modular networks are inferred, describing post-transcriptional regulatory networks. Such networks are challenging to visualize, which is the focus of the third paper [4]. The work summarizes the state-of-the-art in bicluster visualization and is also based on gene expression data. Next, we move from transcriptomics to metabolomics. A disparity filter was applied to perform network analysis for colorectal cancer as a proof of principle [5]. The final two manuscripts focus more on practical application in cancer. First, the prostate, ovary, testes, and embryo
{"title":"Special Issue of the 1st International Applied Bioinformatics Conference (iABC'21).","authors":"Jens Allmer, Mourad Elloumi, Matteo Comin, Ralf Hofestädt","doi":"10.1515/jib-2021-0042","DOIUrl":"https://doi.org/10.1515/jib-2021-0042","url":null,"abstract":"Diseases can be tied to changes at the molecular level within affected cells. This can be concerning transcription, translation, or any other mechanism involved in gene expression, such as post-transcriptional regulation. Instrumentation for the measurement of such molecular changes is readily available and produces large amounts of data. For example, DNA and RNA sequencing, as well as protein quantitation, and sequencing can be achieved via next-generation sequencing andmass spectrometry, respectively. One current challenge is the analysis and integration of the resulting heterogeneous and large datasets. Bioinformatics is the field of study which produces algorithms and integrative approaches to attempt suchdata analyses. The primary aim in algorithmic bioinformatics is, however, the development of algorithms and not their application. Typically, novel algorithms are introduced with a proof of principle, and they are applied to some data for that purpose, but usually not comprehensively. Their data might slightly differ from the proof of principle, inducing further data analysis challenges. Additionally, applying such algorithms to their data may be involved for researchers from the biomedical domain. The 1st International Applied Bioinformatics Conference was conceived to bring together representatives from all research fields involved to increase knowledge transfer. First planned for 2020 and then deferred to 2021 due to the pandemic caused by the Coronavirus [1], the conference was held online. Despite the virtual nature of the conference, attentionwas great.We receivedmany goodmanuscripts and invited a few to submit their full versions to this special issue. The range of topics was extensive, but many submissions concerned the interface of bioinformatics and its application. The selected papers for this special issue also discuss various topics such as sequence alignment and gene network reconstruction. The first paper in this special issue concerns a challenging issue in bioinformatics, the usage of pangenomes instead of single reference genomes and offers a fast variation-aware read mapping algorithm [2]. Mapping is also vital to investigate gene expression, which is essential for the secondmanuscript. It discusses how microRNA and mRNA expression profiles can be investigated [3]. From this, modular networks are inferred, describing post-transcriptional regulatory networks. Such networks are challenging to visualize, which is the focus of the third paper [4]. The work summarizes the state-of-the-art in bicluster visualization and is also based on gene expression data. Next, we move from transcriptomics to metabolomics. A disparity filter was applied to perform network analysis for colorectal cancer as a proof of principle [5]. The final two manuscripts focus more on practical application in cancer. First, the prostate, ovary, testes, and embryo","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709735/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39729371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Predicting the possible effect of miR-203a-3p and miR-29a-3p on DNMT3B and GAS7 genes expression. 预测miR-203a-3p和miR-29a-3p对DNMT3B和GAS7基因表达的可能影响。
IF 1.9 Q1 Medicine Pub Date : 2021-12-16 DOI: 10.1515/jib-2021-0016
Afgar Ali, Sattarzadeh Bardsiri Mahla, Vahidi Reza, Farsinejad Alireza

Aberrant expression of genes involved in methylation, including DNA methyltransferase 3 Beta (DNMT3B), can cause hypermethylation of various tumor suppressor genes. In this regard, various molecular factors such as microRNAs can play a critical role in regulating these methyltransferase enzymes and eventually downstream genes such as growth arrest specific 7 (GAS7). Accordingly, in the present study we aimed to predict regulatory effect of miRNAs on DNMT3B and GAS7 genes expression in melanoma cell line. hsa-miR-203a-3p and hsa-miR-29a-3p were predicted and selected using bioinformatics software. The Real-time PCR technique was performed to investigate the regulatory effect of these molecules on the DNMT3B and GAS7 genes expression. Expression analysis of DNMT3B gene in A375 cell line showed that there was a significant increase compared to control (p value = 0.0015). Analysis of hsa-miR-203a-3p and hsa-miR-29a-3p indicated the insignificant decreased expression in melanoma cell line compared to control (p value < 0.05). Compared to control, the expression of GAS7 gene in melanoma cells showed a significant decrease (p value = 0.0323). Finally, our findings showed that the decreased expression of hsa-miR-203a-3p and hsa-miR-29a-3p can hypothesize that their aberrant expression caused DNMT3B dysfunction, possible methylation of the GAS7 gene, and ultimately decreased its expression. However, complementary studies are necessary to definite comment.

包括DNA甲基转移酶3 β (DNMT3B)在内的参与甲基化的基因的异常表达可导致多种肿瘤抑制基因的超甲基化。在这方面,各种分子因子如microrna可以在调节这些甲基转移酶以及最终下游基因如生长停滞特异性7 (GAS7)中发挥关键作用。因此,在本研究中,我们旨在预测miRNAs对黑色素瘤细胞系DNMT3B和GAS7基因表达的调控作用。利用生物信息学软件对hsa-miR-203a-3p和hsa-miR-29a-3p进行预测和选择。采用Real-time PCR技术研究这些分子对DNMT3B和GAS7基因表达的调控作用。DNMT3B基因在A375细胞系中的表达分析显示,与对照组相比,DNMT3B基因的表达显著增加(p值= 0.0015)。分析hsa-miR-203a-3p和hsa-miR-29a-3p在黑色素瘤细胞系中的表达与对照组相比,差异不显著(p值< 0.05)。与对照组相比,GAS7基因在黑色素瘤细胞中的表达显著降低(p值= 0.0323)。最后,我们的研究结果表明,hsa-miR-203a-3p和hsa-miR-29a-3p的表达减少可以假设它们的异常表达导致DNMT3B功能障碍,可能导致GAS7基因甲基化,最终导致其表达减少。然而,补充研究是明确评论的必要条件。
{"title":"Predicting the possible effect of miR-203a-3p and miR-29a-3p on <i>DNMT3B</i> and <i>GAS7</i> genes expression.","authors":"Afgar Ali,&nbsp;Sattarzadeh Bardsiri Mahla,&nbsp;Vahidi Reza,&nbsp;Farsinejad Alireza","doi":"10.1515/jib-2021-0016","DOIUrl":"https://doi.org/10.1515/jib-2021-0016","url":null,"abstract":"<p><p>Aberrant expression of genes involved in methylation, including DNA methyltransferase 3 Beta (<i>DNMT3B</i>), can cause hypermethylation of various tumor suppressor genes. In this regard, various molecular factors such as microRNAs can play a critical role in regulating these methyltransferase enzymes and eventually downstream genes such as growth arrest specific 7 (<i>GAS7</i>). Accordingly, in the present study we aimed to predict regulatory effect of miRNAs on <i>DNMT3B</i> and <i>GAS7</i> genes expression in melanoma cell line. hsa-miR-203a-3p and hsa-miR-29a-3p were predicted and selected using bioinformatics software. The Real-time PCR technique was performed to investigate the regulatory effect of these molecules on the <i>DNMT3B</i> and <i>GAS7</i> genes expression. Expression analysis of <i>DNMT3B</i> gene in A375 cell line showed that there was a significant increase compared to control (<i>p</i> value = 0.0015). Analysis of hsa-miR-203a-3p and hsa-miR-29a-3p indicated the insignificant decreased expression in melanoma cell line compared to control (<i>p</i> value < 0.05). Compared to control, the expression of GAS7 gene in melanoma cells showed a significant decrease (<i>p</i> value = 0.0323). Finally, our findings showed that the decreased expression of hsa-miR-203a-3p and hsa-miR-29a-3p can hypothesize that their aberrant expression caused <i>DNMT3B</i> dysfunction, possible methylation of the <i>GAS7</i> gene, and ultimately decreased its expression. However, complementary studies are necessary to definite comment.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069646/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39731426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Modular network inference between miRNA-mRNA expression profiles using weighted co-expression network analysis. 使用加权共表达网络分析的miRNA-mRNA表达谱之间的模块化网络推断。
IF 1.9 Q1 Medicine Pub Date : 2021-11-22 DOI: 10.1515/jib-2021-0029
Nisar Wani, Debmalya Barh, Khalid Raza

Connecting transcriptional and post-transcriptional regulatory networks solves an important puzzle in the elucidation of gene regulatory mechanisms. To decipher the complexity of these connections, we build co-expression network modules for mRNA as well as miRNA expression profiles of breast cancer data. We construct gene and miRNA co-expression modules using the weighted gene co-expression network analysis (WGCNA) method and establish the significance of these modules (Genes/miRNAs) for cancer phenotype. This work also infers an interaction network between the genes of the turquoise module from mRNA expression data and hubs of the turquoise module from miRNA expression data. A pathway enrichment analysis using a miRsystem web tool for miRNA hubs and some of their targets, reveal their enrichment in several important pathways associated with the progression of cancer.

连接转录和转录后调控网络解决了阐明基因调控机制的一个重要难题。为了破译这些连接的复杂性,我们为乳腺癌数据的mRNA和miRNA表达谱构建了共表达网络模块。我们使用加权基因共表达网络分析(WGCNA)方法构建基因和miRNA共表达模块,并建立这些模块(基因/miRNA)对癌症表型的意义。这项工作还从mRNA表达数据推断出绿松石模块基因之间的相互作用网络,从miRNA表达数据推断出绿松石模块的枢纽。利用miRsystem网络工具对miRNA枢纽及其部分靶点进行通路富集分析,揭示了它们在与癌症进展相关的几个重要通路中的富集。
{"title":"Modular network inference between miRNA-mRNA expression profiles using weighted co-expression network analysis.","authors":"Nisar Wani,&nbsp;Debmalya Barh,&nbsp;Khalid Raza","doi":"10.1515/jib-2021-0029","DOIUrl":"https://doi.org/10.1515/jib-2021-0029","url":null,"abstract":"<p><p>Connecting transcriptional and post-transcriptional regulatory networks solves an important puzzle in the elucidation of gene regulatory mechanisms. To decipher the complexity of these connections, we build co-expression network modules for mRNA as well as miRNA expression profiles of breast cancer data. We construct gene and miRNA co-expression modules using the weighted gene co-expression network analysis (WGCNA) method and establish the significance of these modules (Genes/miRNAs) for cancer phenotype. This work also infers an interaction network between the genes of the turquoise module from mRNA expression data and hubs of the turquoise module from miRNA expression data. A pathway enrichment analysis using a miRsystem web tool for miRNA hubs and some of their targets, reveal their enrichment in several important pathways associated with the progression of cancer.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709739/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39640284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Disparity-filtered differential correlation network analysis: a case study on CRC metabolomics. 差异过滤的差异相关网络分析:CRC代谢组学的案例研究。
IF 1.9 Q1 Medicine Pub Date : 2021-11-19 DOI: 10.1515/jib-2021-0030
Silvia Sabatini, Amalia Gastaldelli

Differential network analysis has become a widely used technique to investigate changes of interactions among different conditions. Although the relationship between observed interactions and biochemical mechanisms is hard to establish, differential network analysis can provide useful insights about dysregulated pathways and candidate biomarkers. The available methods to detect differential interactions are heterogeneous and often rely on assumptions that are unrealistic in many applications. To address these issues, we develop a novel method for differential network analysis, using the so-called disparity filter as network reduction technique. In addition, we propose a classification model based on the inferred network interactions. The main novelty of this work lies in its ability to preserve connections that are statistically significant with respect to a null model without favouring any resolution scale, as a hard threshold would do, and without Gaussian assumptions. The method was tested using a published metabolomic dataset on colorectal cancer (CRC). Detected hub metabolites were consistent with recent literature and the classifier was able to distinguish CRC from polyp and healthy subjects with great accuracy. In conclusion, the proposed method provides a new simple and effective framework for the identification of differential interaction patterns and improves the biological interpretation of metabolomics data.

差分网络分析已成为研究不同条件间相互作用变化的一种广泛应用的技术。虽然观察到的相互作用和生化机制之间的关系很难建立,但差异网络分析可以为失调途径和候选生物标志物提供有用的见解。检测差异相互作用的可用方法是异构的,并且通常依赖于在许多应用中不现实的假设。为了解决这些问题,我们开发了一种新的差分网络分析方法,使用所谓的视差滤波器作为网络缩减技术。此外,我们提出了一个基于推断网络交互的分类模型。这项工作的主要新颖之处在于它能够保留相对于零模型具有统计意义的连接,而不支持任何分辨率尺度,如硬阈值所做的那样,并且没有高斯假设。该方法使用已发表的结直肠癌(CRC)代谢组学数据集进行了测试。检测到的中枢代谢物与最近的文献一致,分类器能够非常准确地将CRC与息肉和健康受试者区分开来。总之,所提出的方法为鉴别差异相互作用模式提供了一个新的简单有效的框架,并提高了代谢组学数据的生物学解释。
{"title":"Disparity-filtered differential correlation network analysis: a case study on CRC metabolomics.","authors":"Silvia Sabatini,&nbsp;Amalia Gastaldelli","doi":"10.1515/jib-2021-0030","DOIUrl":"https://doi.org/10.1515/jib-2021-0030","url":null,"abstract":"<p><p>Differential network analysis has become a widely used technique to investigate changes of interactions among different conditions. Although the relationship between observed interactions and biochemical mechanisms is hard to establish, differential network analysis can provide useful insights about dysregulated pathways and candidate biomarkers. The available methods to detect differential interactions are heterogeneous and often rely on assumptions that are unrealistic in many applications. To address these issues, we develop a novel method for differential network analysis, using the so-called disparity filter as network reduction technique. In addition, we propose a classification model based on the inferred network interactions. The main novelty of this work lies in its ability to preserve connections that are statistically significant with respect to a null model without favouring any resolution scale, as a hard threshold would do, and without Gaussian assumptions. The method was tested using a published metabolomic dataset on colorectal cancer (CRC). Detected hub metabolites were consistent with recent literature and the classifier was able to distinguish CRC from polyp and healthy subjects with great accuracy. In conclusion, the proposed method provides a new simple and effective framework for the identification of differential interaction patterns and improves the biological interpretation of metabolomics data.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709737/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39635704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
In silico approach to understand epigenetics of POTEE in ovarian cancer. 用计算机方法了解卵巢癌中POTEE的表观遗传学。
IF 1.9 Q1 Medicine Pub Date : 2021-11-18 DOI: 10.1515/jib-2021-0028
Sahar Qazi, Khalid Raza

Ovarian cancer is the third leading cause of cancer-related deaths in India. Epigenetics mechanisms seemingly plays an important role in ovarian cancer. This paper highlights the crucial epigenetic changes that occur in POTEE that get hypomethylated in ovarian cancer. We utilized the POTEE paralog mRNA sequence to identify major motifs and also performed its enrichment analysis. We identified 6 motifs of varying lengths, out of which only three motifs, including CTTCCAGCAGATGTGGATCA, GGAACTGCC, and CGCCACATGCAGGC were most likely to be present in the nucleotide sequence of POTEE. By enrichment and occurrences identification analyses, we rectified the best match motif as CTTCCAGCAGATGT. Since there is no experimentally verified structure of POTEE paralog, thus, we predicted the POTEE structure using an automated workflow for template-based modeling using the power of a deep neural network. Additionally, to validate our predicted model we used AlphaFold predicted POTEE structure and observed that the residual stretch starting from 237-958 had a very high confidence per residue. Furthermore, POTEE predicted model stability was evaluated using replica exchange molecular dynamic simulation for 50 ns. Our network-based epigenetic analysis discerns only 10 highly significant, direct, and physical associators of POTEE. Our finding aims to provide new insights about the POTEE paralog.

卵巢癌是印度癌症相关死亡的第三大原因。表观遗传学机制似乎在卵巢癌中起重要作用。本文强调了在卵巢癌中发生低甲基化的POTEE发生的关键表观遗传变化。我们利用POTEE平行mRNA序列来鉴定主要基序,并对其进行富集分析。我们确定了6个长度不同的基序,其中只有3个基序最可能存在于POTEE的核苷酸序列中,包括CTTCCAGCAGATGTGGATCA、GGAACTGCC和CGCCACATGCAGGC。通过富集和事件识别分析,确定了最佳匹配基序为CTTCCAGCAGATGT。由于没有实验验证的POTEE平行结构,因此,我们使用基于模板的自动化工作流来预测POTEE结构,并利用深度神经网络的力量进行建模。此外,为了验证我们的预测模型,我们使用AlphaFold预测POTEE结构,并观察到从237-958开始的残差拉伸对每个残差具有非常高的置信度。此外,在50 ns的复制交换分子动力学模拟中,评估了POTEE预测模型的稳定性。我们基于网络的表观遗传分析只发现了10个高度显著的、直接的和物理的POTEE关联。我们的发现旨在提供关于POTEE平行的新见解。
{"title":"<i>In silico</i> approach to understand epigenetics of POTEE in ovarian cancer.","authors":"Sahar Qazi,&nbsp;Khalid Raza","doi":"10.1515/jib-2021-0028","DOIUrl":"https://doi.org/10.1515/jib-2021-0028","url":null,"abstract":"<p><p>Ovarian cancer is the third leading cause of cancer-related deaths in India. Epigenetics mechanisms seemingly plays an important role in ovarian cancer. This paper highlights the crucial epigenetic changes that occur in POTEE that get hypomethylated in ovarian cancer. We utilized the POTEE paralog mRNA sequence to identify major motifs and also performed its enrichment analysis. We identified 6 motifs of varying lengths, out of which only three motifs, including CTTCCAGCAGATGTGGATCA, GGAACTGCC, and CGCCACATGCAGGC were most likely to be present in the nucleotide sequence of POTEE. By enrichment and occurrences identification analyses, we rectified the best match motif as CTTCCAGCAGATGT. Since there is no experimentally verified structure of POTEE paralog, thus, we predicted the POTEE structure using an automated workflow for template-based modeling using the power of a deep neural network. Additionally, to validate our predicted model we used AlphaFold predicted POTEE structure and observed that the residual stretch starting from 237-958 had a very high confidence per residue. Furthermore, POTEE predicted model stability was evaluated using replica exchange molecular dynamic simulation for 50 ns. Our network-based epigenetic analysis discerns only 10 highly significant, direct, and physical associators of POTEE. Our finding aims to provide new insights about the POTEE paralog.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709732/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39631904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Fast alignment of reads to a variation graph with application to SNP detection. 快速比对读取到变异图与应用于SNP检测。
IF 1.9 Q1 Medicine Pub Date : 2021-11-16 DOI: 10.1515/jib-2021-0032
Maurilio Monsu, Matteo Comin

Sequencing technologies has provided the basis of most modern genome sequencing studies due to its high base-level accuracy and relatively low cost. One of the most demanding step is mapping reads to the human reference genome. The reliance on a single reference human genome could introduce substantial biases in downstream analyses. Pangenomic graph reference representations offer an attractive approach for storing genetic variations. Moreover, it is possible to include known variants in the reference in order to make read mapping, variant calling, and genotyping variant-aware. Only recently a framework for variation graphs, vg [Garrison E, Adam MN, Siren J, et al. Variation graph toolkit improves read mapping by representing genetic variation in the reference. Nat Biotechnol 2018;36:875-9], have improved variation-aware alignment and variant calling in general. The major bottleneck of vg is its high cost of reads mapping to a variation graph. In this paper we study the problem of SNP calling on a variation graph and we present a fast reads alignment tool, named VG SNP-Aware. VG SNP-Aware is able align reads exactly to a variation graph and detect SNPs based on these aligned reads. The results show that VG SNP-Aware can efficiently map reads to a variation graph with a speedup of 40× with respect to vg and similar accuracy on SNPs detection.

测序技术由于其较高的基础精度和相对较低的成本,为大多数现代基因组测序研究提供了基础。其中要求最高的一步是绘制人类参考基因组的图谱。对单一参考人类基因组的依赖可能会在下游分析中引入实质性的偏差。泛基因组图参考表示为存储遗传变异提供了一种有吸引力的方法。此外,可以在参考文献中包括已知的变体,以便进行读取映射,变体调用和基因分型变体感知。直到最近才有了一个变化图的框架[Garrison E, Adam MN, Siren J,等]。变异图工具包通过表示参考文献中的遗传变异来改进读映射。生物技术学报,2018;36:875-9],改进了变异感知校准和变异调用。vg的主要瓶颈是读取映射到变化图的高成本。本文研究了变异图上的SNP调用问题,提出了一种快速读取比对工具——VG SNP- aware。VG SNP-Aware能够将读取精确地对齐到变异图上,并基于这些对齐的读取检测snp。结果表明,VG SNP-Aware可以有效地将读取映射到变化图上,相对于VG的速度提高了40倍,并且在snp检测上具有相似的准确性。
{"title":"Fast alignment of reads to a variation graph with application to SNP detection.","authors":"Maurilio Monsu,&nbsp;Matteo Comin","doi":"10.1515/jib-2021-0032","DOIUrl":"https://doi.org/10.1515/jib-2021-0032","url":null,"abstract":"<p><p>Sequencing technologies has provided the basis of most modern genome sequencing studies due to its high base-level accuracy and relatively low cost. One of the most demanding step is mapping reads to the human reference genome. The reliance on a single reference human genome could introduce substantial biases in downstream analyses. Pangenomic graph reference representations offer an attractive approach for storing genetic variations. Moreover, it is possible to include known variants in the reference in order to make read mapping, variant calling, and genotyping variant-aware. Only recently a framework for variation graphs, <i>vg</i> [Garrison E, Adam MN, Siren J, et al. Variation graph toolkit improves read mapping by representing genetic variation in the reference. Nat Biotechnol 2018;36:875-9], have improved variation-aware alignment and variant calling in general. The major bottleneck of <i>vg</i> is its high cost of reads mapping to a variation graph. In this paper we study the problem of SNP calling on a variation graph and we present a fast reads alignment tool, named VG SNP-Aware. VG SNP-Aware is able align reads exactly to a variation graph and detect SNPs based on these aligned reads. The results show that VG SNP-Aware can efficiently map reads to a variation graph with a speedup of 40× with respect to <i>vg</i> and similar accuracy on SNPs detection.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709736/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39895114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Glioblastoma gene network reconstruction and ontology analysis by online bioinformatics tools. 利用在线生物信息学工具重建胶质母细胞瘤基因网络及本体分析。
IF 1.9 Q1 Medicine Pub Date : 2021-11-16 DOI: 10.1515/jib-2021-0031
Natalya V Gubanova, Nina G Orlova, Arthur I Dergilev, Nina Y Oparina, Yuriy L Orlov

Glioblastoma is the most aggressive type of brain tumors resistant to a number of antitumor drugs. The problem of therapy and drug treatment course is complicated by extremely high heterogeneity in the benign cell populations, the random arrangement of tumor cells, and polymorphism of their nuclei. The pathogenesis of gliomas needs to be studied using modern cellular technologies, genome- and transcriptome-wide technologies of high-throughput sequencing, analysis of gene expression on microarrays, and methods of modern bioinformatics to find new therapy targets. Functional annotation of genes related to the disease could be retrieved based on genetic databases and cross-validated by integrating complementary experimental data. Gene network reconstruction for a set of genes (proteins) proved to be effective approach to study mechanisms underlying disease progression. We used online bioinformatics tools for annotation of gene list for glioma, reconstruction of gene network and comparative analysis of gene ontology categories. The available tools and the databases for glioblastoma gene analysis are discussed together with the recent progress in this field.

胶质母细胞瘤是最具侵袭性的脑肿瘤类型,对许多抗肿瘤药物具有耐药性。良性细胞群异质性极高,肿瘤细胞排列随机,细胞核多态,使治疗和药物疗程问题复杂化。胶质瘤的发病机制需要利用现代细胞技术、全基因组和转录组高通量测序技术、微阵列基因表达分析以及现代生物信息学方法来研究,以寻找新的治疗靶点。基于遗传数据库检索疾病相关基因的功能注释,并通过整合互补实验数据进行交叉验证。一组基因(蛋白质)的基因网络重构被证明是研究疾病进展机制的有效方法。利用在线生物信息学工具对胶质瘤基因表进行标注、基因网络重构和基因本体分类的比较分析。本文讨论了胶质母细胞瘤基因分析的现有工具和数据库,以及该领域的最新进展。
{"title":"Glioblastoma gene network reconstruction and ontology analysis by online bioinformatics tools.","authors":"Natalya V Gubanova,&nbsp;Nina G Orlova,&nbsp;Arthur I Dergilev,&nbsp;Nina Y Oparina,&nbsp;Yuriy L Orlov","doi":"10.1515/jib-2021-0031","DOIUrl":"https://doi.org/10.1515/jib-2021-0031","url":null,"abstract":"<p><p>Glioblastoma is the most aggressive type of brain tumors resistant to a number of antitumor drugs. The problem of therapy and drug treatment course is complicated by extremely high heterogeneity in the benign cell populations, the random arrangement of tumor cells, and polymorphism of their nuclei. The pathogenesis of gliomas needs to be studied using modern cellular technologies, genome- and transcriptome-wide technologies of high-throughput sequencing, analysis of gene expression on microarrays, and methods of modern bioinformatics to find new therapy targets. Functional annotation of genes related to the disease could be retrieved based on genetic databases and cross-validated by integrating complementary experimental data. Gene network reconstruction for a set of genes (proteins) proved to be effective approach to study mechanisms underlying disease progression. We used online bioinformatics tools for annotation of gene list for glioma, reconstruction of gene network and comparative analysis of gene ontology categories. The available tools and the databases for glioblastoma gene analysis are discussed together with the recent progress in this field.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709738/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39895113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
An evaluation study of biclusters visualization techniques of gene expression data. 基因表达数据双聚类可视化技术的评价研究。
IF 1.9 Q1 Medicine Pub Date : 2021-10-27 DOI: 10.1515/jib-2021-0019
Haithem Aouabed, Mourad Elloumi, Rodrigo Santamaría

Biclustering is a non-supervised data mining technique used to analyze gene expression data, it consists to classify subgroups of genes that have similar behavior under subgroups of conditions. The classified genes can have independent behavior under other subgroups of conditions. Discovering such co-expressed genes, called biclusters, can be helpful to find specific biological features such as gene interactions under different circumstances. Compared to clustering, biclustering has two main characteristics: bi-dimensionality which means grouping both genes and conditions simultaneously and overlapping which means allowing genes to be in more than one bicluster at the same time. Biclustering algorithms, which continue to be developed at a constant pace, give as output a large number of overlapping biclusters. Visualizing groups of biclusters is still a non-trivial task due to their overlapping. In this paper, we present the most interesting techniques to visualize groups of biclusters and evaluate them.

双聚类是一种用于分析基因表达数据的非监督数据挖掘技术,它包括对在子组条件下具有相似行为的基因进行分类。分类的基因在其他亚组条件下可以有独立的行为。发现这种被称为双聚类的共表达基因有助于发现特定的生物学特征,如不同环境下的基因相互作用。与聚类相比,双聚类有两个主要特点:双维性,即同时对基因和条件进行分组;重叠性,即允许基因同时在多个双聚类中。双聚类算法,继续以恒定的速度发展,输出大量重叠的双聚类。由于它们的重叠,可视化双簇组仍然是一项重要的任务。在本文中,我们提出了最有趣的技术来可视化双聚类群并评估它们。
{"title":"An evaluation study of biclusters visualization techniques of gene expression data.","authors":"Haithem Aouabed,&nbsp;Mourad Elloumi,&nbsp;Rodrigo Santamaría","doi":"10.1515/jib-2021-0019","DOIUrl":"https://doi.org/10.1515/jib-2021-0019","url":null,"abstract":"<p><p><i>Biclustering</i> is a non-supervised data mining technique used to analyze gene expression data, it consists to classify subgroups of genes that have similar behavior under subgroups of conditions. The classified genes can have independent behavior under other subgroups of conditions. Discovering such co-expressed genes, called <i>biclusters</i>, can be helpful to find specific biological features such as gene interactions under different circumstances. Compared to clustering, biclustering has two main characteristics: <i>bi-dimensionality</i> which means grouping both genes and conditions simultaneously and <i>overlapping</i> which means allowing genes to be in more than one bicluster at the same time. Biclustering algorithms, which continue to be developed at a constant pace, give as output a large number of overlapping biclusters. Visualizing groups of biclusters is still a non-trivial task due to their overlapping. In this paper, we present the most interesting techniques to visualize groups of biclusters and evaluate them.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8709740/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39558960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Specifications of standards in systems and synthetic biology: status and developments in 2021. 系统和合成生物学标准规范:2021 年的现状和发展。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-10-22 DOI: 10.1515/jib-2021-0026
Falk Schreiber, Padraig Gleeson, Martin Golebiewski, Thomas E Gorochowski, Michael Hucka, Sarah M Keating, Matthias König, Chris J Myers, David P Nickerson, Björn Sommer, Dagmar Waltemath

This special issue of the Journal of Integrative Bioinformatics contains updated specifications of COMBINE standards in systems and synthetic biology. The 2021 special issue presents four updates of standards: Synthetic Biology Open Language Visual Version 2.3, Synthetic Biology Open Language Visual Version 3.0, Simulation Experiment Description Markup Language Level 1 Version 4, and OMEX Metadata specification Version 1.2. This document can also be consulted to identify the latest specifications of all COMBINE standards.

本期《整合生物信息学杂志》特刊包含 COMBINE 标准在系统生物学和合成生物学方面的最新规范。2021 年特刊介绍了四种更新的标准:合成生物学开放语言可视化 2.3 版》、《合成生物学开放语言可视化 3.0 版》、《模拟实验描述标记语言 1 级 4 版》和《OMEX 元数据规范 1.2 版》。您还可以查阅本文件,了解所有 COMBINE 标准的最新规范。
{"title":"Specifications of standards in systems and synthetic biology: status and developments in 2021.","authors":"Falk Schreiber, Padraig Gleeson, Martin Golebiewski, Thomas E Gorochowski, Michael Hucka, Sarah M Keating, Matthias König, Chris J Myers, David P Nickerson, Björn Sommer, Dagmar Waltemath","doi":"10.1515/jib-2021-0026","DOIUrl":"10.1515/jib-2021-0026","url":null,"abstract":"<p><p>This special issue of the <i>Journal of Integrative Bioinformatics</i> contains updated specifications of COMBINE standards in systems and synthetic biology. The 2021 special issue presents four updates of standards: Synthetic Biology Open Language Visual Version 2.3, Synthetic Biology Open Language Visual Version 3.0, Simulation Experiment Description Markup Language Level 1 Version 4, and OMEX Metadata specification Version 1.2. This document can also be consulted to identify the latest specifications of all COMBINE standards.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2021-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8573232/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39540482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synthetic biology open language visual (SBOL visual) version 3.0. 合成生物学开放语言可视化(SBOL可视化)3.0版。
IF 1.9 Q1 Medicine Pub Date : 2021-10-20 DOI: 10.1515/jib-2021-0013
Hasan Baig, Pedro Fontanarossa, James McLaughlin, James Scott-Brown, Prashant Vaidyanathan, Thomas Gorochowski, Goksel Misirli, Jacob Beal, Chris Myers

People who engineer biological organisms often find it useful to draw diagrams in order to communicate both the structure of the nucleic acid sequences that they are engineering and the functional relationships between sequence features and other molecular species. Some typical practices and conventions have begun to emerge for such diagrams. SBOL Visual aims to organize and systematize such conventions in order to produce a coherent language for expressing the structure and function of genetic designs. This document details version 3.0 of SBOL Visual, a new major revision of the standard. The major difference between SBOL Visual 3 and SBOL Visual 2 is that diagrams and glyphs are defined with respect to the SBOL 3 data model rather than the SBOL 2 data model. A byproduct of this change is that the use of dashed undirected lines for subsystem mappings has been removed, pending future determination on how to represent general SBOL 3 constraints; in the interim, this annotation can still be used as an annotation. Finally, deprecated material has been removed from collection of glyphs: the deprecated "insulator" glyph and "macromolecule" alternative glyphs have been removed, as have the deprecated BioPAX alternatives to SBO terms.

设计生物有机体的人经常发现,绘制图表是很有用的,这样既可以说明他们正在设计的核酸序列的结构,也可以说明序列特征与其他分子物种之间的功能关系。对于这样的图,一些典型的实践和惯例已经开始出现。SBOL Visual旨在组织和系统化这些约定,以便产生一种连贯的语言来表达遗传设计的结构和功能。本文档详细介绍了SBOL Visual 3.0版本,这是该标准的一个新的主要修订。SBOL Visual 3和SBOL Visual 2之间的主要区别在于,图表和符号是根据SBOL 3数据模型而不是SBOL 2数据模型定义的。这一变化的一个副产品是取消了对子系统映射的虚线无向线的使用,这有待于未来如何表示通用SBOL 3约束的决定;在此期间,该注释仍然可以作为注释使用。最后,已从字形集合中删除了已弃用的材料:已弃用的“绝缘体”字形和“大分子”替代字形已被删除,以及已弃用的BioPAX替代SBO术语。
{"title":"Synthetic biology open language visual (SBOL visual) version 3.0.","authors":"Hasan Baig,&nbsp;Pedro Fontanarossa,&nbsp;James McLaughlin,&nbsp;James Scott-Brown,&nbsp;Prashant Vaidyanathan,&nbsp;Thomas Gorochowski,&nbsp;Goksel Misirli,&nbsp;Jacob Beal,&nbsp;Chris Myers","doi":"10.1515/jib-2021-0013","DOIUrl":"https://doi.org/10.1515/jib-2021-0013","url":null,"abstract":"<p><p>People who engineer biological organisms often find it useful to draw diagrams in order to communicate both the structure of the nucleic acid sequences that they are engineering and the functional relationships between sequence features and other molecular species. Some typical practices and conventions have begun to emerge for such diagrams. SBOL Visual aims to organize and systematize such conventions in order to produce a coherent language for expressing the structure and function of genetic designs. This document details version 3.0 of SBOL Visual, a new major revision of the standard. The major difference between SBOL Visual 3 and SBOL Visual 2 is that diagrams and glyphs are defined with respect to the SBOL 3 data model rather than the SBOL 2 data model. A byproduct of this change is that the use of dashed undirected lines for subsystem mappings has been removed, pending future determination on how to represent general SBOL 3 constraints; in the interim, this annotation can still be used as an annotation. Finally, deprecated material has been removed from collection of glyphs: the deprecated \"insulator\" glyph and \"macromolecule\" alternative glyphs have been removed, as have the deprecated BioPAX alternatives to SBO terms.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2021-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8560346/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39532713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
Journal of Integrative Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1