Qi Chen , Min Deng , Xuan Dai , Wei Wang , Xing Wang , Liu-Sheng Chen , Guo-Hua Huang
{"title":"Phylogenomic data exploration with increased sampling provides new insights into the higher-level relationships of butterflies and moths (Lepidoptera)","authors":"Qi Chen , Min Deng , Xuan Dai , Wei Wang , Xing Wang , Liu-Sheng Chen , Guo-Hua Huang","doi":"10.1016/j.ympev.2024.108113","DOIUrl":null,"url":null,"abstract":"<div><p>A robust and stable phylogenetic framework is a fundamental goal of evolutionary biology. As the third largest insect order in the world following Coleoptera and Diptera, Lepidoptera (butterflies and moths) play a central role in almost every terrestrial ecosystem as indicators of environmental change and serve as important models for biologists exploring questions related to ecology and evolutionary biology. However, for such a charismatic insect group, the higher-level phylogenetic relationships among its superfamilies are still poorly resolved. Compared to earlier phylogenomic studies, we increased taxon sampling among Lepidoptera (37 superfamilies and 68 families containing 263 taxa) and acquired a series of large amino-acid datasets from 69,680 to 400,330 for phylogenomic reconstructions. Using these datasets, we explored the effect of different taxon sampling with significant increases in the number of included genes on tree topology by considering a series of systematic errors using maximum-likelihood (ML) and Bayesian inference (BI) methods. Moreover, we also tested the effectiveness in topology robustness among the three ML-based models. The results showed that taxon sampling is an important determinant in tree robustness of accurate lepidopteran phylogenetic estimation. Long-branch attraction (LBA) caused by site-wise heterogeneity is a significant source of bias giving rise to unstable positions of ditrysian groups in phylogenomic reconstruction. Phylogenetic inference showed the most comprehensive framework to reveal the relationships among lepidopteran superfamilies, and presented some newly relationships with strong supports (Papilionoidea was sister to Gelechioidea and Immoidea was sister to Galacticoidea, respectively), but limited by taxon sampling, the relationships within the species-rich and relatively rapid radiation Ditrysia and especially Apoditrysia remain poorly resolved, which need to increase taxon sampling for further phylogenomic reconstruction. The present study demonstrates that taxon sampling is an important determinant for an accurate lepidopteran tree of life and provides some essential insights for future lepidopteran phylogenomic studies.</p></div>","PeriodicalId":56109,"journal":{"name":"Molecular Phylogenetics and Evolution","volume":null,"pages":null},"PeriodicalIF":3.6000,"publicationDate":"2024-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Phylogenetics and Evolution","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1055790324001052","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
A robust and stable phylogenetic framework is a fundamental goal of evolutionary biology. As the third largest insect order in the world following Coleoptera and Diptera, Lepidoptera (butterflies and moths) play a central role in almost every terrestrial ecosystem as indicators of environmental change and serve as important models for biologists exploring questions related to ecology and evolutionary biology. However, for such a charismatic insect group, the higher-level phylogenetic relationships among its superfamilies are still poorly resolved. Compared to earlier phylogenomic studies, we increased taxon sampling among Lepidoptera (37 superfamilies and 68 families containing 263 taxa) and acquired a series of large amino-acid datasets from 69,680 to 400,330 for phylogenomic reconstructions. Using these datasets, we explored the effect of different taxon sampling with significant increases in the number of included genes on tree topology by considering a series of systematic errors using maximum-likelihood (ML) and Bayesian inference (BI) methods. Moreover, we also tested the effectiveness in topology robustness among the three ML-based models. The results showed that taxon sampling is an important determinant in tree robustness of accurate lepidopteran phylogenetic estimation. Long-branch attraction (LBA) caused by site-wise heterogeneity is a significant source of bias giving rise to unstable positions of ditrysian groups in phylogenomic reconstruction. Phylogenetic inference showed the most comprehensive framework to reveal the relationships among lepidopteran superfamilies, and presented some newly relationships with strong supports (Papilionoidea was sister to Gelechioidea and Immoidea was sister to Galacticoidea, respectively), but limited by taxon sampling, the relationships within the species-rich and relatively rapid radiation Ditrysia and especially Apoditrysia remain poorly resolved, which need to increase taxon sampling for further phylogenomic reconstruction. The present study demonstrates that taxon sampling is an important determinant for an accurate lepidopteran tree of life and provides some essential insights for future lepidopteran phylogenomic studies.
期刊介绍:
Molecular Phylogenetics and Evolution is dedicated to bringing Darwin''s dream within grasp - to "have fairly true genealogical trees of each great kingdom of Nature." The journal provides a forum for molecular studies that advance our understanding of phylogeny and evolution, further the development of phylogenetically more accurate taxonomic classifications, and ultimately bring a unified classification for all the ramifying lines of life. Phylogeographic studies will be considered for publication if they offer EXCEPTIONAL theoretical or empirical advances.