首页 > 最新文献

Journal of Integrative Bioinformatics最新文献

英文 中文
Integrative illustration of a JCVI-syn3A minimal cell. JCVI-syn3A 最小细胞的综合图示。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-06-27 DOI: 10.1515/jib-2022-0013
David S Goodsell

Data from genomics, proteomics, structural biology and cryo-electron microscopy are integrated into a structural illustration of a cross section through an entire JCVI-syn3.0 minimal cell. The illustration is designed with several goals: to inspire excitement in science, to depict the underlying scientific results accurately, and to be feasible in traditional media. Design choices to achieve these goals include reduction of visual complexity with simplified representations, use of orthographic projection to retain scale relationships, and an approach to color that highlights functional compartments of the cell. Given that this simple cell provides an attractive laboratory for exploring the central processes needed for life, several functional narratives are included in the illustration, including division of the cell and the first depiction of an entire cellular proteome. The illustration lays the foundation for 3D molecular modeling of this cell.

来自基因组学、蛋白质组学、结构生物学和低温电子显微镜的数据被整合到整个 JCVI-syn3.0 最小细胞横截面的结构插图中。插图的设计有几个目标:激发对科学的兴趣,准确描述基本的科学成果,并在传统媒体中可行。为实现这些目标,我们在设计上作了如下选择:用简化的表现手法降低视觉复杂性,使用正投影法保留比例关系,以及用色彩突出细胞的功能区。鉴于这个简单的细胞为探索生命所需的核心过程提供了一个极具吸引力的实验室,插图中包含了几个功能性叙述,包括细胞的分裂和对整个细胞蛋白质组的首次描述。该插图为该细胞的三维分子建模奠定了基础。
{"title":"Integrative illustration of a JCVI-syn3A minimal cell.","authors":"David S Goodsell","doi":"10.1515/jib-2022-0013","DOIUrl":"10.1515/jib-2022-0013","url":null,"abstract":"<p><p>Data from genomics, proteomics, structural biology and cryo-electron microscopy are integrated into a structural illustration of a cross section through an entire JCVI-syn3.0 minimal cell. The illustration is designed with several goals: to inspire excitement in science, to depict the underlying scientific results accurately, and to be feasible in traditional media. Design choices to achieve these goals include reduction of visual complexity with simplified representations, use of orthographic projection to retain scale relationships, and an approach to color that highlights functional compartments of the cell. Given that this simple cell provides an attractive laboratory for exploring the central processes needed for life, several functional narratives are included in the illustration, including division of the cell and the first depiction of an entire cellular proteome. The illustration lays the foundation for 3D molecular modeling of this cell.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2022-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9377704/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40395611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Considering best practices in color palettes for molecular visualizations. 考虑分子可视化调色板的最佳实践。
IF 1.9 Q1 Medicine Pub Date : 2022-06-22 DOI: 10.1515/jib-2022-0016
Laura Garrison, Stefan Bruckner

Biomedical illustration and visualization techniques provide a window into complex molecular worlds that are difficult to capture through experimental means alone. Biomedical illustrators frequently employ color to help tell a molecular story, e.g., to identify key molecules in a signaling pathway. Currently, color use for molecules is largely arbitrary and often chosen based on the client, cultural factors, or personal taste. The study of molecular dynamics is relatively young, and some stakeholders argue that color use guidelines would throttle the growth of the field. Instead, content authors have ample creative freedom to choose an aesthetic that, e.g., supports the story they want to tell. However, such creative freedom comes at a price. The color design process is challenging, particularly for those without a background in color theory. The result is a semantically inconsistent color space that reduces the interpretability and effectiveness of molecular visualizations as a whole. Our contribution in this paper is threefold. We first discuss some of the factors that contribute to this array of color palettes. Second, we provide a brief sampling of color palettes used in both industry and research sectors. Lastly, we suggest considerations for developing best practices around color palettes applied to molecular visualization.

生物医学插图和可视化技术提供了一个窗口,进入复杂的分子世界,很难通过单独的实验手段捕获。生物医学插画家经常使用颜色来帮助讲述分子故事,例如,识别信号通路中的关键分子。目前,分子的颜色使用在很大程度上是任意的,通常根据客户、文化因素或个人品味来选择。分子动力学的研究相对年轻,一些利益相关者认为颜色使用指南会阻碍该领域的发展。相反,内容作者有足够的创作自由来选择美学,例如,支持他们想要讲述的故事。然而,这种创作自由是有代价的。色彩设计过程是具有挑战性的,特别是对于那些没有色彩理论背景的人。其结果是一个语义上不一致的色彩空间,降低了整体分子可视化的可解释性和有效性。我们在这篇论文中的贡献有三个方面。我们首先讨论一些促成这种调色板阵列的因素。其次,我们提供了在工业和研究部门使用的调色板的简要抽样。最后,我们建议围绕调色板开发应用于分子可视化的最佳实践。
{"title":"Considering best practices in color palettes for molecular visualizations.","authors":"Laura Garrison,&nbsp;Stefan Bruckner","doi":"10.1515/jib-2022-0016","DOIUrl":"https://doi.org/10.1515/jib-2022-0016","url":null,"abstract":"<p><p>Biomedical illustration and visualization techniques provide a window into complex molecular worlds that are difficult to capture through experimental means alone. Biomedical illustrators frequently employ color to help tell a molecular story, e.g., to identify key molecules in a signaling pathway. Currently, color use for molecules is largely arbitrary and often chosen based on the client, cultural factors, or personal taste. The study of molecular dynamics is relatively young, and some stakeholders argue that color use guidelines would throttle the growth of the field. Instead, content authors have ample creative freedom to choose an aesthetic that, e.g., supports the story they want to tell. However, such creative freedom comes at a price. The color design process is challenging, particularly for those without a background in color theory. The result is a semantically inconsistent color space that reduces the interpretability and effectiveness of molecular visualizations as a whole. Our contribution in this paper is threefold. We first discuss some of the factors that contribute to this array of color palettes. Second, we provide a brief sampling of color palettes used in both industry and research sectors. Lastly, we suggest considerations for developing best practices around color palettes applied to molecular visualization.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2022-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9377702/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40192929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Integrative analysis of chemical properties and functions of drugs for adverse drug reaction prediction based on multi-label deep neural network 基于多标签深度神经网络的药物化学性质和功能综合分析用于药物不良反应预测
IF 1.9 Q1 Medicine Pub Date : 2022-05-19 DOI: 10.1515/jib-2022-0007
Pranab Das, Yogita, V. Pal
Abstract The prediction of adverse drug reactions (ADR) is an important step of drug discovery and design process. Different drug properties have been employed for ADR prediction but the prediction capability of drug properties and drug functions in integrated manner is yet to be explored. In the present work, a multi-label deep neural network and MLSMOTE based methodology has been proposed for ADR prediction. The proposed methodology has been applied on SMILES Strings data of drugs, 17 molecular descriptors data of drugs and drug functions data individually and in integrated manner for ADR prediction. The experimental results shows that the SMILES Strings + drug functions has outperformed other types of data with regards to ADR prediction capability.
摘要药物不良反应(ADR)的预测是药物发现和设计过程中的重要步骤。不同的药物性质已被用于ADR预测,但药物性质和药物功能的综合预测能力尚待探索。在本工作中,提出了一种基于多标签深度神经网络和MLSMOTE的ADR预测方法。该方法已分别应用于药物的SMILES字符串数据、药物的17个分子描述符数据和药物功能数据,并以集成的方式进行ADR预测。实验结果表明,SMILES Strings+药物功能在ADR预测能力方面优于其他类型的数据。
{"title":"Integrative analysis of chemical properties and functions of drugs for adverse drug reaction prediction based on multi-label deep neural network","authors":"Pranab Das, Yogita, V. Pal","doi":"10.1515/jib-2022-0007","DOIUrl":"https://doi.org/10.1515/jib-2022-0007","url":null,"abstract":"Abstract The prediction of adverse drug reactions (ADR) is an important step of drug discovery and design process. Different drug properties have been employed for ADR prediction but the prediction capability of drug properties and drug functions in integrated manner is yet to be explored. In the present work, a multi-label deep neural network and MLSMOTE based methodology has been proposed for ADR prediction. The proposed methodology has been applied on SMILES Strings data of drugs, 17 molecular descriptors data of drugs and drug functions data individually and in integrated manner for ADR prediction. The experimental results shows that the SMILES Strings + drug functions has outperformed other types of data with regards to ADR prediction capability.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2022-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49603676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Genome-wide expression reveals potential biomarkers in breast cancer bone metastasis 全基因组表达揭示了乳腺癌症骨转移的潜在生物标志物
IF 1.9 Q1 Medicine Pub Date : 2022-04-08 DOI: 10.1515/jib-2021-0041
Yashbir Singh, N. Subbarao, Abhinav Jaimini, Quincy A. Hathaway, Amina Kunovac, Bradley J. Erickson, V. Swarup, H. Singh
Abstract Breast cancer metastases are most commonly found in bone, an indication of poor prognosis. Pathway-based biomarkers identification may help elucidate the cellular signature of breast cancer metastasis in bone, further characterizing the etiology and promoting new therapeutic approaches. We extracted gene expression profiles from mouse macrophages from the GEO dataset, GSE152795 using the GEO2R webtool. The differentially expressed genes (DEGs) were filtered by log2 fold-change with threshold 1.5 (FDR < 0.05). STRING database and Enrichr were used for GO-term analysis, miRNA and TF analysis associated with DEGs. Autodock Vienna was exploited to investigate interaction of anti-cancer drugs, Actinomycin-D and Adriamycin. Sensitivity and specificity of DEGs was assessed using receiver operating characteristic (ROC) analyses. A total of 61 DEGs, included 27 down-regulated and 34 up-regulated, were found to be significant in breast cancer bone metastasis. Major DEGs were associated with lipid metabolism and immunological response of tumor tissue. Crucial DEGs, Bcl3, ADGRG7, FABP4, VCAN, and IRF4 were regulated by miRNAs, miR-497, miR-574, miR-138 and TFs, CCDN1, STAT6, IRF8. Docking analysis showed that these genes possessed strong binding with the drugs. ROC analysis demonstrated Bcl3 is specific to metastasis. DEGs Bcl3, ADGRG7, FABP4, IRF4, their regulating miRNAs and TFs have strong impact on proliferation and metastasis of breast cancer in bone tissues. In conclusion, present study revealed that DEGs are directly involved in of breast tumor metastasis in bone tissues. Identified genes, miRNAs, and TFs can be possible drug targets that may be used for the therapeutics. However, further experimental validation is necessary.
摘要癌症转移最常见于骨,预示预后不良。基于途径的生物标志物鉴定可能有助于阐明癌症骨转移的细胞特征,进一步表征病因并促进新的治疗方法。我们使用GEO2R网络工具从GEO数据集GSE152795中提取了小鼠巨噬细胞的基因表达谱。通过阈值1.5的log2倍变化过滤差异表达基因(DEGs)(FDR<0.05)。STRING数据库和Enrichr用于GO术语分析、miRNA和与DEGs相关的TF分析。维也纳自动仓库被用来研究抗癌药物放线菌素-D和阿霉素的相互作用。DEG的敏感性和特异性使用受试者操作特征(ROC)分析进行评估。共发现61个DEG,包括27个下调和34个上调,在乳腺癌症骨转移中具有显著意义。主要DEG与肿瘤组织的脂质代谢和免疫反应有关。关键DEG、Bcl3、ADGRG7、FABP4、VCAN和IRF4受miRNA、miR-497、miR-574、miR-138和TF、CCDN1、STAT6、IRF8的调节。对接分析表明,这些基因与药物具有较强的结合力。ROC分析显示Bcl3对转移具有特异性。DEGs-Bcl3、ADGRG7、FABP4、IRF4及其调控的miRNA和TF对乳腺癌症骨组织的增殖和转移具有强烈的影响。总之,本研究表明DEGs直接参与了乳腺肿瘤在骨组织中的转移。已鉴定的基因、miRNA和TF可能是可用于治疗的药物靶点。然而,进一步的实验验证是必要的。
{"title":"Genome-wide expression reveals potential biomarkers in breast cancer bone metastasis","authors":"Yashbir Singh, N. Subbarao, Abhinav Jaimini, Quincy A. Hathaway, Amina Kunovac, Bradley J. Erickson, V. Swarup, H. Singh","doi":"10.1515/jib-2021-0041","DOIUrl":"https://doi.org/10.1515/jib-2021-0041","url":null,"abstract":"Abstract Breast cancer metastases are most commonly found in bone, an indication of poor prognosis. Pathway-based biomarkers identification may help elucidate the cellular signature of breast cancer metastasis in bone, further characterizing the etiology and promoting new therapeutic approaches. We extracted gene expression profiles from mouse macrophages from the GEO dataset, GSE152795 using the GEO2R webtool. The differentially expressed genes (DEGs) were filtered by log2 fold-change with threshold 1.5 (FDR < 0.05). STRING database and Enrichr were used for GO-term analysis, miRNA and TF analysis associated with DEGs. Autodock Vienna was exploited to investigate interaction of anti-cancer drugs, Actinomycin-D and Adriamycin. Sensitivity and specificity of DEGs was assessed using receiver operating characteristic (ROC) analyses. A total of 61 DEGs, included 27 down-regulated and 34 up-regulated, were found to be significant in breast cancer bone metastasis. Major DEGs were associated with lipid metabolism and immunological response of tumor tissue. Crucial DEGs, Bcl3, ADGRG7, FABP4, VCAN, and IRF4 were regulated by miRNAs, miR-497, miR-574, miR-138 and TFs, CCDN1, STAT6, IRF8. Docking analysis showed that these genes possessed strong binding with the drugs. ROC analysis demonstrated Bcl3 is specific to metastasis. DEGs Bcl3, ADGRG7, FABP4, IRF4, their regulating miRNAs and TFs have strong impact on proliferation and metastasis of breast cancer in bone tissues. In conclusion, present study revealed that DEGs are directly involved in of breast tumor metastasis in bone tissues. Identified genes, miRNAs, and TFs can be possible drug targets that may be used for the therapeutics. However, further experimental validation is necessary.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2022-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46059130","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review on methods for predicting miRNA–mRNA regulatory modules miRNA-mRNA调控模块预测方法综述
IF 1.9 Q1 Medicine Pub Date : 2022-04-01 DOI: 10.1515/jib-2020-0048
Madhumita Madhumita, Sushmita Paul
Abstract Identification of complex interactions between miRNAs and mRNAs in a regulatory network helps better understand the underlying biological processes. Previously, identification of these interactions was based on sequence-based predicted target binding information. With the advancement in high-throughput omics technologies, miRNA and mRNA expression for the same set of samples are available. This helps develop more efficient and flexible approaches that work by integrating miRNA and mRNA expression profiles with target binding information. Since these integrative approaches of miRNA–mRNA regulatory modules (MRMs) detection is sufficiently able to capture the minute biological details, 26 such algorithms/methods/tools for MRMs identification are comprehensively reviewed in this article. The study covers the significant features underlying every method. Therefore, the methods are classified into eight groups based on mathematical approaches to understand their working and suitability for one’s study. An algorithm could be selected based on the available information with the users and the biological question under investigation.
摘要在调控网络中识别miRNA和mRNA之间的复杂相互作用有助于更好地理解潜在的生物学过程。以前,这些相互作用的鉴定是基于基于序列的预测靶标结合信息。随着高通量组学技术的进步,同一组样品的miRNA和mRNA表达是可用的。这有助于开发更有效和灵活的方法,通过整合miRNA和mRNA表达谱与靶结合信息来发挥作用。由于这些miRNA-mRNA调节模块(MRM)检测的综合方法足以捕捉微小的生物学细节,本文对26种用于MRM识别的算法/方法/工具进行了全面综述。这项研究涵盖了每种方法背后的重要特征。因此,根据数学方法将这些方法分为八组,以了解它们的有效性和适用性。可以根据用户的可用信息和正在调查的生物学问题来选择算法。
{"title":"A review on methods for predicting miRNA–mRNA regulatory modules","authors":"Madhumita Madhumita, Sushmita Paul","doi":"10.1515/jib-2020-0048","DOIUrl":"https://doi.org/10.1515/jib-2020-0048","url":null,"abstract":"Abstract Identification of complex interactions between miRNAs and mRNAs in a regulatory network helps better understand the underlying biological processes. Previously, identification of these interactions was based on sequence-based predicted target binding information. With the advancement in high-throughput omics technologies, miRNA and mRNA expression for the same set of samples are available. This helps develop more efficient and flexible approaches that work by integrating miRNA and mRNA expression profiles with target binding information. Since these integrative approaches of miRNA–mRNA regulatory modules (MRMs) detection is sufficiently able to capture the minute biological details, 26 such algorithms/methods/tools for MRMs identification are comprehensively reviewed in this article. The study covers the significant features underlying every method. Therefore, the methods are classified into eight groups based on mathematical approaches to understand their working and suitability for one’s study. An algorithm could be selected based on the available information with the users and the biological question under investigation.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2022-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46670434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
ExceS-A: an exon-centric split aligner ExceS-A:一种外显子中心分裂比对器
IF 1.9 Q1 Medicine Pub Date : 2022-03-01 DOI: 10.1515/jib-2021-0040
Franziska Reinhardt, P. Stadler
Abstract Spliced alignments are a key step in the construction of high-quality homology-based annotations of protein sequences. The exon/intron structure, which is computed as part of spliced alignment procedures, often conveys important information for the distinguishing paralogous members of gene families. Here we present an exon-centric pipeline for spliced alignment that is intended in particular for applications that involve exon-by-exon comparisons of coding sequences. We show that the simple, blat-based approach has advantages over established tools in particular for genes with very large introns and applications to fragmented genome assemblies.
剪接比对是构建高质量的基于同源性的蛋白质序列注释的关键步骤。外显子/内含子结构是拼接比对过程的一部分,通常为区分基因家族的同源成员传递重要信息。在这里,我们提出了一个以外显子为中心的拼接比对管道,特别适用于涉及编码序列的外显子对外显子比较的应用。我们表明,简单的,基于blat的方法比现有的工具具有优势,特别是对于具有非常大的内含子的基因和片段化基因组组装的应用。
{"title":"ExceS-A: an exon-centric split aligner","authors":"Franziska Reinhardt, P. Stadler","doi":"10.1515/jib-2021-0040","DOIUrl":"https://doi.org/10.1515/jib-2021-0040","url":null,"abstract":"Abstract Spliced alignments are a key step in the construction of high-quality homology-based annotations of protein sequences. The exon/intron structure, which is computed as part of spliced alignment procedures, often conveys important information for the distinguishing paralogous members of gene families. Here we present an exon-centric pipeline for spliced alignment that is intended in particular for applications that involve exon-by-exon comparisons of coding sequences. We show that the simple, blat-based approach has advantages over established tools in particular for genes with very large introns and applications to fragmented genome assemblies.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48295319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A systematic study of motif pairs that may facilitate enhancer-promoter interactions. 对促进增强子-启动子相互作用的基序对的系统研究。
IF 1.9 Q1 Medicine Pub Date : 2022-02-07 DOI: 10.1515/jib-2021-0038
Saidi Wang, Haiyan Hu, Xiaoman Li

Pairs of interacting transcription factors (TFs) have previously been shown to bind to enhancers and promoters and contribute to their physical interactions. However, to date, we have limited knowledge about such TF pairs. To fill this void, we systematically studied the co-occurrence of TF-binding motifs in interacting enhancer-promoter (EP) pairs in seven human cell lines. We discovered 423 motif pairs that significantly co-occur in enhancers and promoters of interacting EP pairs. We demonstrated that these motif pairs are biologically meaningful and significantly enriched with motif pairs of known interacting TF pairs. We also showed that the identified motif pairs facilitated the discovery of the interacting EP pairs. The developed pipeline, EPmotifPair, together with the predicted motifs and motif pairs, is available at https://doi.org/10.6084/m9.figshare.14192000. Our study provides a comprehensive list of motif pairs that may contribute to EP physical interactions, which facilitate generating meaningful hypotheses for experimental validation.

相互作用转录因子(TFs)对先前已被证明与增强子和启动子结合,并有助于它们的物理相互作用。然而,到目前为止,我们对这种TF对的了解有限。为了填补这一空白,我们系统地研究了7种人类细胞系中相互作用的增强子-启动子(EP)对中tf结合基序的共现性。我们发现423个基序对在相互作用的EP对的增强子和启动子中显著共存。我们证明了这些基序对具有生物学意义,并且与已知相互作用的TF对的基序对显著丰富。我们还发现,已识别的基序对有助于发现相互作用的EP对。开发的管道EPmotifPair以及预测的基序和基序对可在https://doi.org/10.6084/m9.figshare.14192000上获得。我们的研究提供了一个可能有助于EP物理相互作用的基序对的全面列表,这有助于为实验验证产生有意义的假设。
{"title":"A systematic study of motif pairs that may facilitate enhancer-promoter interactions.","authors":"Saidi Wang,&nbsp;Haiyan Hu,&nbsp;Xiaoman Li","doi":"10.1515/jib-2021-0038","DOIUrl":"https://doi.org/10.1515/jib-2021-0038","url":null,"abstract":"<p><p>Pairs of interacting transcription factors (TFs) have previously been shown to bind to enhancers and promoters and contribute to their physical interactions. However, to date, we have limited knowledge about such TF pairs. To fill this void, we systematically studied the co-occurrence of TF-binding motifs in interacting enhancer-promoter (EP) pairs in seven human cell lines. We discovered 423 motif pairs that significantly co-occur in enhancers and promoters of interacting EP pairs. We demonstrated that these motif pairs are biologically meaningful and significantly enriched with motif pairs of known interacting TF pairs. We also showed that the identified motif pairs facilitated the discovery of the interacting EP pairs. The developed pipeline, EPmotifPair, together with the predicted motifs and motif pairs, is available at https://doi.org/10.6084/m9.figshare.14192000. Our study provides a comprehensive list of motif pairs that may contribute to EP physical interactions, which facilitate generating meaningful hypotheses for experimental validation.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2022-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069648/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39897140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
SCARF: a biomedical association rule finding webserver. 一个生物医学关联规则查找web服务器。
IF 1.9 Q1 Medicine Pub Date : 2022-02-04 DOI: 10.1515/jib-2021-0035
Balázs Szalkai, Vince Grolmusz

The analysis of enormous datasets with missing data entries is a standard task in biological and medical data processing. Large-scale, multi-institution clinical studies are the typical examples of such datasets. These sets make possible the search for multi-parametric relations since from the plenty of the data one is likely to find a satisfying number of subjects with the required parameter ensembles. Specifically, finding combinatorial biomarkers for some given condition also needs a very large dataset to analyze. For fast and automatic multi-parametric relation discovery association-rule finding tools are used for more than two decades in the data-mining community. Here we present the SCARF webserver for generalized association rule mining. Association rules are of the form: a AND b AND … AND xy, meaning that the presence of properties a AND b AND … AND x implies property y; our algorithm finds generalized association rules, since it also finds logical disjunctions (i.e., ORs) at the left-hand side, allowing the discovery of more complex rules in a more compressed form in the database. This feature also helps reducing the typically very large result-tables of such studies, since allowing ORs in the left-hand side of a single rule could include dozens of classical rules. The capabilities of the SCARF algorithm were demonstrated in mining the Alzheimer's database of the Coalition Against Major Diseases (CAMD) in our recent publication (Archives of Gerontology and Geriatrics Vol. 73, pp. 300-307, 2017). Here we describe the webserver implementation of the algorithm.

在生物和医学数据处理中,分析大量缺少数据条目的数据集是一项标准任务。大规模、多机构临床研究是此类数据集的典型例子。这些集合使得搜索多参数关系成为可能,因为从大量的数据中,人们很可能找到数量令人满意的具有所需参数集合的主题。具体来说,为某些特定疾病寻找组合生物标志物也需要一个非常大的数据集来分析。为了快速、自动地发现多参数关系,关联规则查找工具在数据挖掘领域已经使用了20多年。本文提出了用于广义关联规则挖掘的SCARF web服务器。关联规则的形式是:a AND b AND…AND x→y,这意味着属性a AND b AND…AND x的存在意味着属性y;我们的算法发现了广义关联规则,因为它也在左侧发现了逻辑析取(即or),从而允许在数据库中以更压缩的形式发现更复杂的规则。这个特性还有助于减少此类研究中通常非常大的结果表,因为在单个规则的左侧允许or可能包含数十个经典规则。在我们最近发表的一篇文章(《老年学和老年病学档案》第73卷,第300-307页,2017年)中,我们在挖掘抗重大疾病联盟(CAMD)的阿尔茨海默病数据库中展示了SCARF算法的能力。这里我们描述了该算法的web服务器实现。
{"title":"SCARF: a biomedical association rule finding webserver.","authors":"Balázs Szalkai,&nbsp;Vince Grolmusz","doi":"10.1515/jib-2021-0035","DOIUrl":"https://doi.org/10.1515/jib-2021-0035","url":null,"abstract":"<p><p>The analysis of enormous datasets with missing data entries is a standard task in biological and medical data processing. Large-scale, multi-institution clinical studies are the typical examples of such datasets. These sets make possible the search for multi-parametric relations since from the plenty of the data one is likely to find a satisfying number of subjects with the required parameter ensembles. Specifically, finding combinatorial biomarkers for some given condition also needs a very large dataset to analyze. For fast and automatic multi-parametric relation discovery association-rule finding tools are used for more than two decades in the data-mining community. Here we present the SCARF webserver for <i>generalized</i> association rule mining. Association rules are of the form: <i>a</i> AND <i>b</i> AND … AND <i>x</i> → <i>y</i>, meaning that the presence of properties <i>a</i> AND <i>b</i> AND … AND <i>x</i> implies property <i>y</i>; our algorithm finds generalized association rules, since it also finds logical disjunctions (i.e., ORs) at the left-hand side, allowing the discovery of more complex rules in a more compressed form in the database. This feature also helps reducing the typically very large result-tables of such studies, since allowing ORs in the left-hand side of a single rule could include dozens of classical rules. The capabilities of the SCARF algorithm were demonstrated in mining the Alzheimer's database of the Coalition Against Major Diseases (CAMD) in our recent publication (Archives of Gerontology and Geriatrics Vol. 73, pp. 300-307, 2017). Here we describe the webserver implementation of the algorithm.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2022-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9135138/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39888837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrative Bioinformatics: History and Future 综合生物信息学:历史与未来
IF 1.9 Q1 Medicine Pub Date : 2022-01-01 DOI: 10.1007/978-981-16-6795-4
{"title":"Integrative Bioinformatics: History and Future","authors":"","doi":"10.1007/978-981-16-6795-4","DOIUrl":"https://doi.org/10.1007/978-981-16-6795-4","url":null,"abstract":"","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.9,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83634855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical estimates of multiple transcription factors binding in the model plant genomes based on ChIP-seq data. 基于 ChIP-seq 数据对模式植物基因组中多个转录因子结合情况的统计估算。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2021-12-21 DOI: 10.1515/jib-2020-0036
Arthur I Dergilev, Nina G Orlova, Oxana B Dobrovolskaya, Yuriy L Orlov

The development of high-throughput genomic sequencing coupled with chromatin immunoprecipitation technologies allows studying the binding sites of the protein transcription factors (TF) in the genome scale. The growth of data volume on the experimentally determined binding sites raises qualitatively new problems for the analysis of gene expression regulation, prediction of transcription factors target genes, and regulatory gene networks reconstruction. Genome regulation remains an insufficiently studied though plants have complex molecular regulatory mechanisms of gene expression and response to environmental stresses. It is important to develop new software tools for the analysis of the TF binding sites location and their clustering in the plant genomes, visualization, and the following statistical estimates. This study presents application of the analysis of multiple TF binding profiles in three evolutionarily distant model plant organisms. The construction and analysis of non-random ChIP-seq binding clusters of the different TFs in mammalian embryonic stem cells were discussed earlier using similar bioinformatics approaches. Such clusters of TF binding sites may indicate the gene regulatory regions, enhancers and gene transcription regulatory hubs. It can be used for analysis of the gene promoters as well as a background for transcription networks reconstruction. We discuss the statistical estimates of the TF binding sites clusters in the model plant genomes. The distributions of the number of different TFs per binding cluster follow same power law distribution for all the genomes studied. The binding clusters in Arabidopsis thaliana genome were discussed here in detail.

高通量基因组测序技术和染色质免疫沉淀技术的发展,使得研究蛋白质转录因子(TF)的结合位点成为可能。实验测定的结合位点数据量的增长为基因表达调控分析、转录因子靶基因预测和调控基因网络重建带来了新的问题。尽管植物的基因表达和对环境胁迫的响应具有复杂的分子调控机制,但基因组调控仍然是一个研究不足的领域。开发新的软件工具来分析植物基因组中转录因子结合位点的位置及其聚类、可视化以及后续的统计估算非常重要。本研究介绍了在三种进化距离较远的模式植物中分析多个 TF 结合图谱的应用。早些时候曾讨论过利用类似的生物信息学方法构建和分析哺乳动物胚胎干细胞中不同TF的非随机ChIP-seq结合簇。这些TF结合位点群可指示基因调控区域、增强子和基因转录调控中心。它可用于分析基因启动子,也可作为转录网络重建的背景。我们讨论了对模型植物基因组中 TF 结合位点群的统计估计。在所研究的所有基因组中,每个结合簇的不同 TF 数量的分布都遵循相同的幂律分布。这里详细讨论了拟南芥基因组中的结合簇。
{"title":"Statistical estimates of multiple transcription factors binding in the model plant genomes based on ChIP-seq data.","authors":"Arthur I Dergilev, Nina G Orlova, Oxana B Dobrovolskaya, Yuriy L Orlov","doi":"10.1515/jib-2020-0036","DOIUrl":"10.1515/jib-2020-0036","url":null,"abstract":"<p><p>The development of high-throughput genomic sequencing coupled with chromatin immunoprecipitation technologies allows studying the binding sites of the protein transcription factors (TF) in the genome scale. The growth of data volume on the experimentally determined binding sites raises qualitatively new problems for the analysis of gene expression regulation, prediction of transcription factors target genes, and regulatory gene networks reconstruction. Genome regulation remains an insufficiently studied though plants have complex molecular regulatory mechanisms of gene expression and response to environmental stresses. It is important to develop new software tools for the analysis of the TF binding sites location and their clustering in the plant genomes, visualization, and the following statistical estimates. This study presents application of the analysis of multiple TF binding profiles in three evolutionarily distant model plant organisms. The construction and analysis of non-random ChIP-seq binding clusters of the different TFs in mammalian embryonic stem cells were discussed earlier using similar bioinformatics approaches. Such clusters of TF binding sites may indicate the gene regulatory regions, enhancers and gene transcription regulatory hubs. It can be used for analysis of the gene promoters as well as a background for transcription networks reconstruction. We discuss the statistical estimates of the TF binding sites clusters in the model plant genomes. The distributions of the number of different TFs per binding cluster follow same power law distribution for all the genomes studied. The binding clusters in <i>Arabidopsis thaliana</i> genome were discussed here in detail.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":null,"pages":null},"PeriodicalIF":1.5,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9069649/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39761184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Integrative Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1