Realistic Scenarios of Phenotypic Variation and Errors in High-Throughput Phenotyping Experiments Minimally Impact the Results of QTL Mapping Analysis.
{"title":"Realistic Scenarios of Phenotypic Variation and Errors in High-Throughput Phenotyping Experiments Minimally Impact the Results of QTL Mapping Analysis.","authors":"Aliyah Brewer, Anna Underhill, Surya Sapkota, Chin-Feng Hwang, Summaira Riaz, Madeline Oravec, Lance Cadle-Davidson","doi":"10.1094/PHYTO-01-25-0007-FI","DOIUrl":null,"url":null,"abstract":"<p><p>High-throughput phenotyping technologies increase the efficiency of breeding programs, but with larger data sets, errors can accumulate. Plant breeders often conduct quantitative trait locus (QTL) mapping, where large sample size and accurate quantitative response estimates are important for detecting small effect QTL. This study examined how phenotype error, inconsistency, and replication changed QTL magnitude and location. Three real sets of phenotype data were used from microscopy robot analysis of grapevine powdery mildew (<i>Erysiphe necator</i>) severity, which previously resulted in discovery of large (R<sup>2</sup> = 85%), intermediate (R<sup>2</sup> = 45%), and small (R<sup>2</sup> = 9%) effect QTL. Custom R scripts were written to induce several realistic sources of error, inconsistency, and varied replication. The results were remarkably robust to these changes. Swapping or shifting 2% of samples or changing disease severity by 50% on one replicate had negligible impact on QTL. Unreplicated simulations produced the largest LOD score range (5.55 to 8.27) and mean LOD score deviation (-1.72 to -3.22; Cohen's D = 1.48 to 2.12). The large effect size QTL (<i>REN12</i>) was always detected. The intermediate effect size QTL (<i>REN13</i>) was detected except when three of the eight replicates were analyzed individually. Even for the small effect size locus (<i>NYVPLG9</i>), error scenarios rarely (2 of 9000 cases) eliminated significant QTL detection, versus no replication (9 of 10). Thus, the benefits of data volume associated with high-throughput phenotyping technologies outweigh the cost of the increased errors tested here. Instead, focus should be spent on examining how each experimental replicate contributes to the result of the QTL mapping analysis.</p>","PeriodicalId":20410,"journal":{"name":"Phytopathology","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Phytopathology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1094/PHYTO-01-25-0007-FI","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
High-throughput phenotyping technologies increase the efficiency of breeding programs, but with larger data sets, errors can accumulate. Plant breeders often conduct quantitative trait locus (QTL) mapping, where large sample size and accurate quantitative response estimates are important for detecting small effect QTL. This study examined how phenotype error, inconsistency, and replication changed QTL magnitude and location. Three real sets of phenotype data were used from microscopy robot analysis of grapevine powdery mildew (Erysiphe necator) severity, which previously resulted in discovery of large (R2 = 85%), intermediate (R2 = 45%), and small (R2 = 9%) effect QTL. Custom R scripts were written to induce several realistic sources of error, inconsistency, and varied replication. The results were remarkably robust to these changes. Swapping or shifting 2% of samples or changing disease severity by 50% on one replicate had negligible impact on QTL. Unreplicated simulations produced the largest LOD score range (5.55 to 8.27) and mean LOD score deviation (-1.72 to -3.22; Cohen's D = 1.48 to 2.12). The large effect size QTL (REN12) was always detected. The intermediate effect size QTL (REN13) was detected except when three of the eight replicates were analyzed individually. Even for the small effect size locus (NYVPLG9), error scenarios rarely (2 of 9000 cases) eliminated significant QTL detection, versus no replication (9 of 10). Thus, the benefits of data volume associated with high-throughput phenotyping technologies outweigh the cost of the increased errors tested here. Instead, focus should be spent on examining how each experimental replicate contributes to the result of the QTL mapping analysis.
期刊介绍:
Phytopathology publishes articles on fundamental research that advances understanding of the nature of plant diseases, the agents that cause them, their spread, the losses they cause, and measures that can be used to control them. Phytopathology considers manuscripts covering all aspects of plant diseases including bacteriology, host-parasite biochemistry and cell biology, biological control, disease control and pest management, description of new pathogen species description of new pathogen species, ecology and population biology, epidemiology, disease etiology, host genetics and resistance, mycology, nematology, plant stress and abiotic disorders, postharvest pathology and mycotoxins, and virology. Papers dealing mainly with taxonomy, such as descriptions of new plant pathogen taxa are acceptable if they include plant disease research results such as pathogenicity, host range, etc. Taxonomic papers that focus on classification, identification, and nomenclature below the subspecies level may also be submitted to Phytopathology.