Jukes-Cantor Correction for Phylogenetic Tree Reconstruction
Friday Gabriel Emunefe, Ifeanyichukwu Jeff Ugbene
bioRxiv - Systems Biology, 2024-07-30
DOI: 10.1101/2024.07.30.605767 (https://doi.org/10.1101/2024.07.30.605767)
Citations: 0
Abstract
Phylogenetic tree reconstruction relies on accurate estimates of the evolutionary distances between sequences. The observed Hamming distance between sequences, however, can be misleading under saturation, where multiple substitutions at the same site obscure the true evolutionary history. The Jukes-Cantor correction addresses this by accounting for multiple substitutions, giving a more accurate representation of evolutionary distance. This study applies the Jukes-Cantor correction to the Hamming distances of genetic sequences in a case study and highlights its impact on phylogenetic tree reconstruction. Our results demonstrate that the correction significantly improves the accuracy of phylogenetic inference, particularly for sequences with substantial evolutionary divergence. However, the model relies on simplifying assumptions, such as equal substitution rates and no base composition bias, which restrict its applicability to sequences with moderate levels of divergence. This study provides a foundation for further research into more complex models that can account for violations of these assumptions and give more accurate estimates of evolutionary distance for highly divergent sequences.
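
The abstract does not state the correction formula, so the following is only a minimal sketch using the standard Jukes-Cantor (JC69) expression d = -(3/4) ln(1 - (4/3) p), where p is the proportion of differing sites (the Hamming distance divided by the alignment length). The function names and the two example sequences are hypothetical and are not taken from the study's data.

```python
import math


def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites: Hamming distance / alignment length."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be aligned, non-empty, and of equal length")
    mismatches = sum(a != b for a, b in zip(seq_a, seq_b))
    return mismatches / len(seq_a)


def jukes_cantor_distance(p: float) -> float:
    """Standard JC69 correction: expected substitutions per site.

    Assumes equal substitution rates and equal base frequencies.
    The logarithm is undefined once p reaches 0.75 (full saturation).
    """
    if not 0.0 <= p < 0.75:
        raise ValueError("JC69 distance is only defined for 0 <= p < 0.75")
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)


# Hypothetical aligned sequences, for illustration only.
seq_1 = "ACGTACGTAC"
seq_2 = "ACGTTCGTAA"

p = p_distance(seq_1, seq_2)   # 2 mismatches out of 10 sites
d = jukes_cantor_distance(p)   # corrected distance, larger than p
print(f"p-distance = {p:.3f}, JC69 distance = {d:.3f}")
```

The corrected distance always exceeds the observed proportion for p > 0, and the gap widens as p approaches 0.75. That behaviour is consistent with the abstract's point that the correction matters most for divergent sequences while becoming unreliable near saturation.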