Approaches to increase the validity of gene family identification using manual homology search tools.

IF 1.3 4区 生物学 Q4 GENETICS & HEREDITY Genetica Pub Date : 2023-12-01 Epub Date: 2023-10-10 DOI:10.1007/s10709-023-00196-8
Benjamin J Nestor, Philipp E Bayer, Cassandria G Tay Fernandez, David Edwards, Patrick M Finnegan
{"title":"Approaches to increase the validity of gene family identification using manual homology search tools.","authors":"Benjamin J Nestor, Philipp E Bayer, Cassandria G Tay Fernandez, David Edwards, Patrick M Finnegan","doi":"10.1007/s10709-023-00196-8","DOIUrl":null,"url":null,"abstract":"<p><p>Identifying homologs is an important process in the analysis of genetic patterns underlying traits and evolutionary relationships among species. Analysis of gene families is often used to form and support hypotheses on genetic patterns such as gene presence, absence, or functional divergence which underlie traits examined in functional studies. These analyses often require precise identification of all members in a targeted gene family. Manual pipelines where homology search and orthology assignment tools are used separately are the most common approach for identifying small gene families where accurate identification of all members is important. The ability to curate sequences between steps in manual pipelines allows for simple and precise identification of all possible gene family members. However, the validity of such manual pipeline analyses is often decreased by inappropriate approaches to homology searches including too relaxed or stringent statistical thresholds, inappropriate query sequences, homology classification based on sequence similarity alone, and low-quality proteome or genome sequences. In this article, we propose several approaches to mitigate these issues and allow for precise identification of gene family members and support for hypotheses linking genetic patterns to functional traits.</p>","PeriodicalId":55121,"journal":{"name":"Genetica","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10692271/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genetica","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1007/s10709-023-00196-8","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/10/10 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Identifying homologs is an important process in the analysis of genetic patterns underlying traits and evolutionary relationships among species. Analysis of gene families is often used to form and support hypotheses on genetic patterns such as gene presence, absence, or functional divergence which underlie traits examined in functional studies. These analyses often require precise identification of all members in a targeted gene family. Manual pipelines where homology search and orthology assignment tools are used separately are the most common approach for identifying small gene families where accurate identification of all members is important. The ability to curate sequences between steps in manual pipelines allows for simple and precise identification of all possible gene family members. However, the validity of such manual pipeline analyses is often decreased by inappropriate approaches to homology searches including too relaxed or stringent statistical thresholds, inappropriate query sequences, homology classification based on sequence similarity alone, and low-quality proteome or genome sequences. In this article, we propose several approaches to mitigate these issues and allow for precise identification of gene family members and support for hypotheses linking genetic patterns to functional traits.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用手动同源性搜索工具提高基因家族鉴定有效性的方法。
识别同源物是分析物种间特征和进化关系的遗传模式的一个重要过程。基因家族的分析通常用于形成和支持遗传模式的假设,如基因存在、缺失或功能分化,这些是功能研究中检查的性状的基础。这些分析通常需要精确识别目标基因家族中的所有成员。分别使用同源性搜索和同源性分配工具的手动管道是识别小基因家族的最常见方法,在小基因家族中,准确识别所有成员很重要。在手动管道中的步骤之间策划序列的能力允许对所有可能的基因家族成员进行简单而精确的鉴定。然而,这种手动管道分析的有效性通常会因同源性搜索的不适当方法而降低,包括过于宽松或严格的统计阈值、不适当的查询序列、仅基于序列相似性的同源性分类以及低质量的蛋白质组或基因组序列。在这篇文章中,我们提出了几种方法来缓解这些问题,并允许精确识别基因家族成员,并支持将遗传模式与功能性状联系起来的假设。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Genetica
Genetica 生物-遗传学
CiteScore
2.70
自引率
0.00%
发文量
32
审稿时长
>12 weeks
期刊介绍: Genetica publishes papers dealing with genetics, genomics, and evolution. Our journal covers novel advances in the fields of genomics, conservation genetics, genotype-phenotype interactions, evo-devo, population and quantitative genetics, and biodiversity. Genetica publishes original research articles addressing novel conceptual, experimental, and theoretical issues in these areas, whatever the taxon considered. Biomedical papers and papers on breeding animal and plant genetics are not within the scope of Genetica, unless framed in an evolutionary context. Recent advances in genetics, genomics and evolution are also published in thematic issues and synthesis papers published by experts in the field.
期刊最新文献
Genome-wide identification and data mining reveals major-latex protein (MLP) from the PR-10 protein family played defense-related roles against phytopathogenic challenges in cassava (Manihot esculenta Crantz). Comparative genomic analysis reveals expansion of the DnaJ gene family in Lagerstroemia indica and its members response to salt stress. Identification and expression analysis of XIP gene family members in rice. Genome-wide identification and expression analysis of the universal stress protein (USP) gene family in Arabidopsis thaliana, Zea mays, and Oryza sativa. A multi-tissue de novo transcriptome assembly and relative gene expression of the vulnerable freshwater salmonid Thymallus ligericus.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1