首页 > 最新文献

Briefings in Functional Genomics最新文献

英文 中文
A systematic analyses of different bioinformatics pipelines for genomic data and its impact on deep learning models for chromatin loop prediction. 系统分析基因组数据的不同生物信息学管道及其对染色质环路预测深度学习模型的影响。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-09-27 DOI: 10.1093/bfgp/elae009
Anup Kumar Halder, Abhishek Agarwal, Karolina Jodkowska, Dariusz Plewczynski

Genomic data analysis has witnessed a surge in complexity and volume, primarily driven by the advent of high-throughput technologies. In particular, studying chromatin loops and structures has become pivotal in understanding gene regulation and genome organization. This systematic investigation explores the realm of specialized bioinformatics pipelines designed specifically for the analysis of chromatin loops and structures. Our investigation incorporates two protein (CTCF and Cohesin) factor-specific loop interaction datasets from six distinct pipelines, amassing a comprehensive collection of 36 diverse datasets. Through a meticulous review of existing literature, we offer a holistic perspective on the methodologies, tools and algorithms underpinning the analysis of this multifaceted genomic feature. We illuminate the vast array of approaches deployed, encompassing pivotal aspects such as data preparation pipeline, preprocessing, statistical features and modelling techniques. Beyond this, we rigorously assess the strengths and limitations inherent in these bioinformatics pipelines, shedding light on the interplay between data quality and the performance of deep learning models, ultimately advancing our comprehension of genomic intricacies.

在高通量技术的推动下,基因组数据分析的复杂性和数量激增。特别是,研究染色质环路和结构已成为了解基因调控和基因组组织的关键。这项系统性研究探索了专为分析染色质环路和结构而设计的专业生物信息学管道领域。我们的研究结合了来自六个不同管道的两个蛋白质(CTCF 和 Cohesin)因子特异性环路相互作用数据集,收集了 36 个不同数据集的综合数据集。通过对现有文献的细致回顾,我们从整体的角度探讨了分析这一多方面基因组特征的方法、工具和算法。我们阐明了所采用的大量方法,包括数据准备管道、预处理、统计特征和建模技术等关键方面。除此之外,我们还严格评估了这些生物信息学管道固有的优势和局限性,揭示了数据质量与深度学习模型性能之间的相互作用,最终推动了我们对基因组复杂性的理解。
{"title":"A systematic analyses of different bioinformatics pipelines for genomic data and its impact on deep learning models for chromatin loop prediction.","authors":"Anup Kumar Halder, Abhishek Agarwal, Karolina Jodkowska, Dariusz Plewczynski","doi":"10.1093/bfgp/elae009","DOIUrl":"10.1093/bfgp/elae009","url":null,"abstract":"<p><p>Genomic data analysis has witnessed a surge in complexity and volume, primarily driven by the advent of high-throughput technologies. In particular, studying chromatin loops and structures has become pivotal in understanding gene regulation and genome organization. This systematic investigation explores the realm of specialized bioinformatics pipelines designed specifically for the analysis of chromatin loops and structures. Our investigation incorporates two protein (CTCF and Cohesin) factor-specific loop interaction datasets from six distinct pipelines, amassing a comprehensive collection of 36 diverse datasets. Through a meticulous review of existing literature, we offer a holistic perspective on the methodologies, tools and algorithms underpinning the analysis of this multifaceted genomic feature. We illuminate the vast array of approaches deployed, encompassing pivotal aspects such as data preparation pipeline, preprocessing, statistical features and modelling techniques. Beyond this, we rigorously assess the strengths and limitations inherent in these bioinformatics pipelines, shedding light on the interplay between data quality and the performance of deep learning models, ultimately advancing our comprehension of genomic intricacies.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"538-548"},"PeriodicalIF":2.5,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140330336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancements in genetic techniques and functional genomics for enhancing crop traits and agricultural sustainability. 基因技术和功能基因组学在提高作物性状和农业可持续性方面的进步。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-09-27 DOI: 10.1093/bfgp/elae017
Surender Kumar, Anupama Singh, Chander Mohan Singh Bist, Munish Sharma

Genetic variability is essential for the development of new crop varieties with economically beneficial traits. The traits can be inherited from wild relatives or induced through mutagenesis. Novel genetic elements can then be identified and new gene functions can be predicted. In this study, forward and reverse genetics approaches were described, in addition to their applications in modern crop improvement programs and functional genomics. By using heritable phenotypes and linked genetic markers, forward genetics searches for genes by using traditional genetic mapping and allele frequency estimation. Despite recent advances in sequencing technology, omics and computation, genetic redundancy remains a major challenge in forward genetics. By analyzing close-related genes, we will be able to dissect their functional redundancy and predict possible traits and gene activity patterns. In addition to these predictions, sophisticated reverse gene editing tools can be used to verify them, including TILLING, targeted insertional mutagenesis, gene silencing, gene targeting and genome editing. By using gene knock-down, knock-up and knock-out strategies, these tools are able to detect genetic changes in cells. In addition, epigenome analysis and editing enable the development of novel traits in existing crop cultivars without affecting their genetic makeup by increasing epiallelic variants. Our understanding of gene functions and molecular dynamics of various biological phenomena has been revised by all of these findings. The study also identifies novel genetic targets in crop species to improve yields and stress tolerances through conventional and non-conventional methods. In this article, genetic techniques and functional genomics are specifically discussed and assessed for their potential in crop improvement.

遗传变异对于培育具有经济效益性状的作物新品种至关重要。这些性状可以从野生近缘植物中遗传,也可以通过诱变诱导。这样就可以确定新的遗传元素,预测新的基因功能。本研究介绍了正向遗传学和反向遗传学方法,以及它们在现代作物改良计划和功能基因组学中的应用。正向遗传学利用可遗传的表型和相关遗传标记,通过传统的遗传图谱和等位基因频率估计来寻找基因。尽管最近在测序技术、omics 和计算方面取得了进步,但基因冗余仍然是正向遗传学面临的一大挑战。通过分析密切相关的基因,我们将能够剖析其功能冗余,并预测可能的性状和基因活动模式。除了这些预测之外,还可以使用复杂的反向基因编辑工具来验证这些预测,包括TILLING、定向插入诱变、基因沉默、基因打靶和基因组编辑。通过使用基因敲除、敲上和敲除策略,这些工具能够检测细胞中的基因变化。此外,通过表观基因组分析和编辑,可以在现有作物栽培品种中开发新的性状,而不会因增加外显子变异而影响其基因构成。所有这些发现修正了我们对基因功能和各种生物现象的分子动力学的理解。这项研究还确定了作物物种的新基因靶标,以通过常规和非常规方法提高产量和抗逆性。本文特别讨论了遗传技术和功能基因组学,并评估了它们在作物改良方面的潜力。
{"title":"Advancements in genetic techniques and functional genomics for enhancing crop traits and agricultural sustainability.","authors":"Surender Kumar, Anupama Singh, Chander Mohan Singh Bist, Munish Sharma","doi":"10.1093/bfgp/elae017","DOIUrl":"10.1093/bfgp/elae017","url":null,"abstract":"<p><p>Genetic variability is essential for the development of new crop varieties with economically beneficial traits. The traits can be inherited from wild relatives or induced through mutagenesis. Novel genetic elements can then be identified and new gene functions can be predicted. In this study, forward and reverse genetics approaches were described, in addition to their applications in modern crop improvement programs and functional genomics. By using heritable phenotypes and linked genetic markers, forward genetics searches for genes by using traditional genetic mapping and allele frequency estimation. Despite recent advances in sequencing technology, omics and computation, genetic redundancy remains a major challenge in forward genetics. By analyzing close-related genes, we will be able to dissect their functional redundancy and predict possible traits and gene activity patterns. In addition to these predictions, sophisticated reverse gene editing tools can be used to verify them, including TILLING, targeted insertional mutagenesis, gene silencing, gene targeting and genome editing. By using gene knock-down, knock-up and knock-out strategies, these tools are able to detect genetic changes in cells. In addition, epigenome analysis and editing enable the development of novel traits in existing crop cultivars without affecting their genetic makeup by increasing epiallelic variants. Our understanding of gene functions and molecular dynamics of various biological phenomena has been revised by all of these findings. The study also identifies novel genetic targets in crop species to improve yields and stress tolerances through conventional and non-conventional methods. In this article, genetic techniques and functional genomics are specifically discussed and assessed for their potential in crop improvement.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"607-623"},"PeriodicalIF":2.5,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140873722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding large scale sequencing datasets through changes to protein folding. 通过蛋白质折叠的变化理解大规模测序数据集。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-09-27 DOI: 10.1093/bfgp/elae007
David Shorthouse, Harris Lister, Gemma S Freeman, Benjamin A Hall

The expansion of high-quality, low-cost sequencing has created an enormous opportunity to understand how genetic variants alter cellular behaviour in disease. The high diversity of mutations observed has however drawn a spotlight onto the need for predictive modelling of mutational effects on phenotype from variants of uncertain significance. This is particularly important in the clinic due to the potential value in guiding clinical diagnosis and patient treatment. Recent computational modelling has highlighted the importance of mutation induced protein misfolding as a common mechanism for loss of protein or domain function, aided by developments in methods that make large computational screens tractable. Here we review recent applications of this approach to different genes, and how they have enabled and supported subsequent studies. We further discuss developments in the approach and the role for the approach in light of increasingly high throughput experimental approaches.

高质量、低成本测序技术的发展为了解基因变异如何改变疾病中的细胞行为创造了巨大的机会。然而,观察到的变异的高度多样性使人们注意到,需要对意义不确定的变异对表型的突变影响进行预测建模。这在临床上尤为重要,因为它具有指导临床诊断和患者治疗的潜在价值。最近的计算建模突显了突变诱导的蛋白质错误折叠作为蛋白质或结构域功能丧失的常见机制的重要性,这得益于使大型计算筛选变得可行的方法的发展。在此,我们回顾了这种方法最近在不同基因上的应用,以及它们如何促进和支持了后续研究。我们将进一步讨论该方法的发展,以及该方法在越来越多的高通量实验方法中的作用。
{"title":"Understanding large scale sequencing datasets through changes to protein folding.","authors":"David Shorthouse, Harris Lister, Gemma S Freeman, Benjamin A Hall","doi":"10.1093/bfgp/elae007","DOIUrl":"10.1093/bfgp/elae007","url":null,"abstract":"<p><p>The expansion of high-quality, low-cost sequencing has created an enormous opportunity to understand how genetic variants alter cellular behaviour in disease. The high diversity of mutations observed has however drawn a spotlight onto the need for predictive modelling of mutational effects on phenotype from variants of uncertain significance. This is particularly important in the clinic due to the potential value in guiding clinical diagnosis and patient treatment. Recent computational modelling has highlighted the importance of mutation induced protein misfolding as a common mechanism for loss of protein or domain function, aided by developments in methods that make large computational screens tractable. Here we review recent applications of this approach to different genes, and how they have enabled and supported subsequent studies. We further discuss developments in the approach and the role for the approach in light of increasingly high throughput experimental approaches.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"517-524"},"PeriodicalIF":2.5,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11428155/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140195143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computational drug repurposing for viral infectious diseases: a case study on monkeypox. 病毒性传染病的计算药物再利用:猴痘案例研究。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-09-27 DOI: 10.1093/bfgp/elad058
Sovan Saha, Piyali Chatterjee, Mita Nasipuri, Subhadip Basu, Tapabrata Chakraborti

The traditional method of drug reuse or repurposing has significantly contributed to the identification of new antiviral compounds and therapeutic targets, enabling rapid response to developing infectious illnesses. This article presents an overview of how modern computational methods are used in drug repurposing for the treatment of viral infectious diseases. These methods utilize data sets that include reviewed information on the host's response to pathogens and drugs, as well as various connections such as gene expression patterns and protein-protein interaction networks. We assess the potential benefits and limitations of these methods by examining monkeypox as a specific example, but the knowledge acquired can be applied to other comparable disease scenarios.

传统的药物重复使用或再利用方法极大地促进了新的抗病毒化合物和治疗靶点的确定,使人们能够对发展中的传染性疾病做出快速反应。本文概述了现代计算方法如何用于治疗病毒性传染病的药物再利用。这些方法利用的数据集包括宿主对病原体和药物反应的回顾信息,以及基因表达模式和蛋白质相互作用网络等各种联系。我们以猴痘为具体实例,评估了这些方法的潜在优势和局限性,但所获得的知识也可应用于其他类似的疾病情况。
{"title":"Computational drug repurposing for viral infectious diseases: a case study on monkeypox.","authors":"Sovan Saha, Piyali Chatterjee, Mita Nasipuri, Subhadip Basu, Tapabrata Chakraborti","doi":"10.1093/bfgp/elad058","DOIUrl":"10.1093/bfgp/elad058","url":null,"abstract":"<p><p>The traditional method of drug reuse or repurposing has significantly contributed to the identification of new antiviral compounds and therapeutic targets, enabling rapid response to developing infectious illnesses. This article presents an overview of how modern computational methods are used in drug repurposing for the treatment of viral infectious diseases. These methods utilize data sets that include reviewed information on the host's response to pathogens and drugs, as well as various connections such as gene expression patterns and protein-protein interaction networks. We assess the potential benefits and limitations of these methods by examining monkeypox as a specific example, but the knowledge acquired can be applied to other comparable disease scenarios.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"570-578"},"PeriodicalIF":2.5,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139106946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to: Omics-based deep learning approaches for lung cancer decision-making and therapeutics development. 更正为基于 Omics 的深度学习方法用于肺癌决策和疗法开发。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-09-27 DOI: 10.1093/bfgp/elad046
{"title":"Correction to: Omics-based deep learning approaches for lung cancer decision-making and therapeutics development.","authors":"","doi":"10.1093/bfgp/elad046","DOIUrl":"10.1093/bfgp/elad046","url":null,"abstract":"","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"680-681"},"PeriodicalIF":2.5,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10652953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial for BFG special issue: Computational genomics for precision medicine and personalized healthcare. BFG 特刊编辑:精准医学和个性化医疗的计算基因组学。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-09-27 DOI: 10.1093/bfgp/elae021
Tapabrata Chakraborti, Subhadip Basu
{"title":"Editorial for BFG special issue: Computational genomics for precision medicine and personalized healthcare.","authors":"Tapabrata Chakraborti, Subhadip Basu","doi":"10.1093/bfgp/elae021","DOIUrl":"10.1093/bfgp/elae021","url":null,"abstract":"","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"507-508"},"PeriodicalIF":2.5,"publicationDate":"2024-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141302141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From bench to bedside: potential of translational research in COVID-19 and beyond. 从实验室到床边:2019冠状病毒病及其他领域转化研究的潜力
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-07-19 DOI: 10.1093/bfgp/elad051
Nityendra Shukla, Uzma Shamim, Preeti Agarwal, Rajesh Pandey, Jitendra Narayan

The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and coronavirus disease 2019 (COVID-19) have been around for more than 3 years now. However, due to constant viral evolution, novel variants are emerging, leaving old treatment protocols redundant. As treatment options dwindle, infection rates continue to rise and seasonal infection surges become progressively common across the world, rapid solutions are required. With genomic and proteomic methods generating enormous amounts of data to expand our understanding of SARS-CoV-2 biology, there is an urgent requirement for the development of novel therapeutic methods that can allow translational research to flourish. In this review, we highlight the current state of COVID-19 in the world and the effects of post-infection sequelae. We present the contribution of translational research in COVID-19, with various current and novel therapeutic approaches, including antivirals, monoclonal antibodies and vaccines, as well as alternate treatment methods such as immunomodulators, currently being studied and reiterate the importance of translational research in the development of various strategies to contain COVID-19.

严重急性呼吸综合征冠状病毒2 (SARS-CoV-2)和冠状病毒病2019 (COVID-19)已经存在3年多了。然而,由于病毒不断进化,新的变异不断出现,使旧的治疗方案变得多余。随着治疗选择减少,感染率继续上升,季节性感染激增在世界各地日益普遍,需要快速解决办法。随着基因组学和蛋白质组学方法产生了大量数据,以扩大我们对SARS-CoV-2生物学的理解,迫切需要开发新的治疗方法,使转化研究得以蓬勃发展。在这篇综述中,我们重点介绍了COVID-19在世界上的现状以及感染后后遗症的影响。我们介绍了COVID-19转化研究的贡献,包括目前正在研究的各种现有和新型治疗方法,包括抗病毒药物、单克隆抗体和疫苗,以及免疫调节剂等替代治疗方法,并重申了转化研究在制定各种遏制COVID-19策略中的重要性。
{"title":"From bench to bedside: potential of translational research in COVID-19 and beyond.","authors":"Nityendra Shukla, Uzma Shamim, Preeti Agarwal, Rajesh Pandey, Jitendra Narayan","doi":"10.1093/bfgp/elad051","DOIUrl":"10.1093/bfgp/elad051","url":null,"abstract":"<p><p>The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and coronavirus disease 2019 (COVID-19) have been around for more than 3 years now. However, due to constant viral evolution, novel variants are emerging, leaving old treatment protocols redundant. As treatment options dwindle, infection rates continue to rise and seasonal infection surges become progressively common across the world, rapid solutions are required. With genomic and proteomic methods generating enormous amounts of data to expand our understanding of SARS-CoV-2 biology, there is an urgent requirement for the development of novel therapeutic methods that can allow translational research to flourish. In this review, we highlight the current state of COVID-19 in the world and the effects of post-infection sequelae. We present the contribution of translational research in COVID-19, with various current and novel therapeutic approaches, including antivirals, monoclonal antibodies and vaccines, as well as alternate treatment methods such as immunomodulators, currently being studied and reiterate the importance of translational research in the development of various strategies to contain COVID-19.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"349-362"},"PeriodicalIF":2.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138178078","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DBPMod: a supervised learning model for computational recognition of DNA-binding proteins in model organisms. DBPMod:用于计算识别模式生物中 DNA 结合蛋白的监督学习模型。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-07-19 DOI: 10.1093/bfgp/elad039
Upendra K Pradhan, Prabina K Meher, Sanchita Naha, Nitesh K Sharma, Aarushi Agarwal, Ajit Gupta, Rajender Parsad

DNA-binding proteins (DBPs) play critical roles in many biological processes, including gene expression, DNA replication, recombination and repair. Understanding the molecular mechanisms underlying these processes depends on the precise identification of DBPs. In recent times, several computational methods have been developed to identify DBPs. However, because of the generic nature of the models, these models are unable to identify species-specific DBPs with higher accuracy. Therefore, a species-specific computational model is needed to predict species-specific DBPs. In this paper, we introduce the computational DBPMod method, which makes use of a machine learning approach to identify species-specific DBPs. For prediction, both shallow learning algorithms and deep learning models were used, with shallow learning models achieving higher accuracy. Additionally, the evolutionary features outperformed sequence-derived features in terms of accuracy. Five model organisms, including Caenorhabditis elegans, Drosophila melanogaster, Escherichia coli, Homo sapiens and Mus musculus, were used to assess the performance of DBPMod. Five-fold cross-validation and independent test set analyses were used to evaluate the prediction accuracy in terms of area under receiver operating characteristic curve (auROC) and area under precision-recall curve (auPRC), which was found to be ~89-92% and ~89-95%, respectively. The comparative results demonstrate that the DBPMod outperforms 12 current state-of-the-art computational approaches in identifying the DBPs for all five model organisms. We further developed the web server of DBPMod to make it easier for researchers to detect DBPs and is publicly available at https://iasri-sg.icar.gov.in/dbpmod/. DBPMod is expected to be an invaluable tool for discovering DBPs, supplementing the current experimental and computational methods.

DNA 结合蛋白(DBPs)在基因表达、DNA 复制、重组和修复等许多生物过程中发挥着关键作用。对这些过程的分子机制的了解取决于对 DBPs 的精确鉴定。近来,人们开发了多种计算方法来识别 DBPs。然而,由于模型的通用性,这些模型无法更准确地识别物种特异性 DBPs。因此,需要一种针对特定物种的计算模型来预测特定物种的 DBPs。本文介绍了计算 DBPMod 方法,该方法利用机器学习方法来识别物种特异性 DBPs。在预测方面,我们使用了浅层学习算法和深度学习模型,其中浅层学习模型的准确率更高。此外,在准确性方面,进化特征优于序列衍生特征。五种模式生物,包括秀丽隐杆线虫、黑腹果蝇、大肠杆菌、智人和麝,被用来评估DBPMod的性能。通过五倍交叉验证和独立测试集分析,以接收者操作特征曲线下面积(auROC)和精确度-召回曲线下面积(auPRC)来评估预测精确度,结果发现预测精确度分别为~89-92%和~89-95%。比较结果表明,DBPMod 在识别所有五种模式生物的 DBPs 方面优于目前最先进的 12 种计算方法。我们进一步开发了 DBPMod 的网络服务器,使研究人员更容易检测 DBPs,该服务器已在 https://iasri-sg.icar.gov.in/dbpmod/ 上公开发布。DBPMod 预计将成为发现 DBPs 的宝贵工具,补充当前的实验和计算方法。
{"title":"DBPMod: a supervised learning model for computational recognition of DNA-binding proteins in model organisms.","authors":"Upendra K Pradhan, Prabina K Meher, Sanchita Naha, Nitesh K Sharma, Aarushi Agarwal, Ajit Gupta, Rajender Parsad","doi":"10.1093/bfgp/elad039","DOIUrl":"10.1093/bfgp/elad039","url":null,"abstract":"<p><p>DNA-binding proteins (DBPs) play critical roles in many biological processes, including gene expression, DNA replication, recombination and repair. Understanding the molecular mechanisms underlying these processes depends on the precise identification of DBPs. In recent times, several computational methods have been developed to identify DBPs. However, because of the generic nature of the models, these models are unable to identify species-specific DBPs with higher accuracy. Therefore, a species-specific computational model is needed to predict species-specific DBPs. In this paper, we introduce the computational DBPMod method, which makes use of a machine learning approach to identify species-specific DBPs. For prediction, both shallow learning algorithms and deep learning models were used, with shallow learning models achieving higher accuracy. Additionally, the evolutionary features outperformed sequence-derived features in terms of accuracy. Five model organisms, including Caenorhabditis elegans, Drosophila melanogaster, Escherichia coli, Homo sapiens and Mus musculus, were used to assess the performance of DBPMod. Five-fold cross-validation and independent test set analyses were used to evaluate the prediction accuracy in terms of area under receiver operating characteristic curve (auROC) and area under precision-recall curve (auPRC), which was found to be ~89-92% and ~89-95%, respectively. The comparative results demonstrate that the DBPMod outperforms 12 current state-of-the-art computational approaches in identifying the DBPs for all five model organisms. We further developed the web server of DBPMod to make it easier for researchers to detect DBPs and is publicly available at https://iasri-sg.icar.gov.in/dbpmod/. DBPMod is expected to be an invaluable tool for discovering DBPs, supplementing the current experimental and computational methods.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"363-372"},"PeriodicalIF":2.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10483304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving cell type identification with Gaussian noise-augmented single-cell RNA-seq contrastive learning. 利用高斯噪声增强单细胞 RNA-seq 对比学习改进细胞类型识别。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-07-19 DOI: 10.1093/bfgp/elad059
Ibrahim Alsaggaf, Daniel Buchan, Cen Wan

Cell type identification is an important task for single-cell RNA-sequencing (scRNA-seq) data analysis. Many prediction methods have recently been proposed, but the predictive accuracy of difficult cell type identification tasks is still low. In this work, we proposed a novel Gaussian noise augmentation-based scRNA-seq contrastive learning method (GsRCL) to learn a type of discriminative feature representations for cell type identification tasks. A large-scale computational evaluation suggests that GsRCL successfully outperformed other state-of-the-art predictive methods on difficult cell type identification tasks, while the conventional random genes masking augmentation-based contrastive learning method also improved the accuracy of easy cell type identification tasks in general.

细胞类型鉴定是单细胞 RNA 序列(scRNA-seq)数据分析的一项重要任务。最近提出了许多预测方法,但对困难的细胞类型鉴定任务的预测准确率仍然很低。在这项工作中,我们提出了一种新颖的基于高斯噪声增强的 scRNA-seq 对比学习方法(GsRCL),以学习一种用于细胞类型鉴定任务的判别特征表征。大规模计算评估表明,GsRCL 在高难度细胞类型鉴定任务中的表现成功地超越了其他最先进的预测方法,而传统的基于随机基因掩蔽增强的对比学习方法也普遍提高了简单细胞类型鉴定任务的准确率。
{"title":"Improving cell type identification with Gaussian noise-augmented single-cell RNA-seq contrastive learning.","authors":"Ibrahim Alsaggaf, Daniel Buchan, Cen Wan","doi":"10.1093/bfgp/elad059","DOIUrl":"10.1093/bfgp/elad059","url":null,"abstract":"<p><p>Cell type identification is an important task for single-cell RNA-sequencing (scRNA-seq) data analysis. Many prediction methods have recently been proposed, but the predictive accuracy of difficult cell type identification tasks is still low. In this work, we proposed a novel Gaussian noise augmentation-based scRNA-seq contrastive learning method (GsRCL) to learn a type of discriminative feature representations for cell type identification tasks. A large-scale computational evaluation suggests that GsRCL successfully outperformed other state-of-the-art predictive methods on difficult cell type identification tasks, while the conventional random genes masking augmentation-based contrastive learning method also improved the accuracy of easy cell type identification tasks in general.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"441-451"},"PeriodicalIF":2.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139503121","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Genomics in Clinical trials for Breast Cancer. 乳腺癌临床试验中的基因组学。
IF 2.5 3区 生物学 Q3 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Pub Date : 2024-07-19 DOI: 10.1093/bfgp/elad054
David Enoma

Breast cancer (B.C.) still has increasing incidences and mortality rates globally. It is known that B.C. and other cancers have a very high rate of genetic heterogeneity and genomic mutations. Traditional oncology approaches have not been able to provide a lasting solution. Targeted therapeutics have been instrumental in handling the complexity and resistance associated with B.C. However, the progress of genomic technology has transformed our understanding of the genetic landscape of breast cancer, opening new avenues for improved anti-cancer therapeutics. Genomics is critical in developing tailored therapeutics and identifying patients most benefit from these treatments. The next generation of breast cancer clinical trials has incorporated next-generation sequencing technologies into the process, and we have seen benefits. These innovations have led to the approval of better-targeted therapies for patients with breast cancer. Genomics has a role to play in clinical trials, including genomic tests that have been approved, patient selection and prediction of therapeutic response. Multiple clinical trials in breast cancer have been done and are still ongoing, which have applied genomics technology. Precision medicine can be achieved in breast cancer therapy with increased efforts and advanced genomic studies in this domain. Genomics studies assist with patient outcomes improvement and oncology advancement by providing a deeper understanding of the biology behind breast cancer. This article will examine the present state of genomics in breast cancer clinical trials.

在全球范围内,乳腺癌(B.C.)的发病率和死亡率仍在不断上升。众所周知,乳腺癌和其他癌症具有很高的遗传异质性和基因组突变率。传统的肿瘤学方法无法提供持久的解决方案。然而,基因组学技术的进步改变了我们对乳腺癌基因状况的认识,为改进抗癌疗法开辟了新途径。基因组学对于开发有针对性的疗法和确定最受益于这些疗法的患者至关重要。下一代乳腺癌临床试验已将新一代测序技术纳入其中,我们已从中获益。这些创新已使针对乳腺癌患者的更佳疗法获得批准。基因组学可在临床试验中发挥作用,包括已获批准的基因组检测、患者选择和治疗反应预测。应用基因组学技术的多项乳腺癌临床试验已经完成并仍在进行中。随着在乳腺癌治疗领域加大努力和开展先进的基因组研究,可以实现精准医疗。基因组学研究通过深入了解乳腺癌背后的生物学原理,有助于改善患者的治疗效果,推动肿瘤学的发展。本文将探讨基因组学在乳腺癌临床试验中的应用现状。
{"title":"Genomics in Clinical trials for Breast Cancer.","authors":"David Enoma","doi":"10.1093/bfgp/elad054","DOIUrl":"10.1093/bfgp/elad054","url":null,"abstract":"<p><p>Breast cancer (B.C.) still has increasing incidences and mortality rates globally. It is known that B.C. and other cancers have a very high rate of genetic heterogeneity and genomic mutations. Traditional oncology approaches have not been able to provide a lasting solution. Targeted therapeutics have been instrumental in handling the complexity and resistance associated with B.C. However, the progress of genomic technology has transformed our understanding of the genetic landscape of breast cancer, opening new avenues for improved anti-cancer therapeutics. Genomics is critical in developing tailored therapeutics and identifying patients most benefit from these treatments. The next generation of breast cancer clinical trials has incorporated next-generation sequencing technologies into the process, and we have seen benefits. These innovations have led to the approval of better-targeted therapies for patients with breast cancer. Genomics has a role to play in clinical trials, including genomic tests that have been approved, patient selection and prediction of therapeutic response. Multiple clinical trials in breast cancer have been done and are still ongoing, which have applied genomics technology. Precision medicine can be achieved in breast cancer therapy with increased efforts and advanced genomic studies in this domain. Genomics studies assist with patient outcomes improvement and oncology advancement by providing a deeper understanding of the biology behind breast cancer. This article will examine the present state of genomics in breast cancer clinical trials.</p>","PeriodicalId":55323,"journal":{"name":"Briefings in Functional Genomics","volume":" ","pages":"325-334"},"PeriodicalIF":2.5,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139038236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Briefings in Functional Genomics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1