转基因作物基因拷贝数估算技术的比较评估:Southern 印迹、qPCR、dPCR 和 NGS 的启示

IF 10.1 1区 生物学 Q1 BIOTECHNOLOGY & APPLIED MICROBIOLOGY Plant Biotechnology Journal Pub Date : 2024-09-12 DOI:10.1111/pbi.14466
Wenting Xu, Jingang Liang, Fan Wang, Litao Yang
{"title":"转基因作物基因拷贝数估算技术的比较评估:Southern 印迹、qPCR、dPCR 和 NGS 的启示","authors":"Wenting Xu, Jingang Liang, Fan Wang, Litao Yang","doi":"10.1111/pbi.14466","DOIUrl":null,"url":null,"abstract":"<p>Gene copy number is crucial for understanding genomic architecture and its implications in plant and animal genetics (Alonge <i>et al</i>., <span>2020</span>; Castagnone-Sereno <i>et al</i>., <span>2019</span>). In agriculture, variations in gene copy number (CNVs) are vital as they affect yield, stress resistance and metabolic capabilities (Yuan <i>et al</i>., <span>2021</span>). Transgenesis, involving the introduction of foreign DNA into plant genomes, has revolutionized agriculture by creating genetically modified (GM) plants with desirable traits. Assessing gene copy numbers in GMOs ensures stability and expression of introduced traits and is crucial for regulatory compliance and biosafety assessments (Liang <i>et al</i>., <span>2022</span>). Evaluating gene copy numbers in transgenic plants is technically challenging due to variability in transgene integration events (Faure, <span>2021</span>). Various techniques like Southern blotting (SB), quantitative real-time PCR (qPCR), digital PCR (dPCR) and paired-end whole-genome sequencing (PE-WGS) have been reported for gene copy number determination (Cusenza <i>et al</i>., <span>2021</span>). However, no systematic comparison of these four methods has been reported, especially concerning PE-WGS.</p>\n<p>Here, we performed a comparative benchmarking of gene copy number assessment techniques, including SB, qPCR, dPCR and PE-WGS, employing 4 GM crop events (FG72 soybean, 12-5 maize, G6H1 and G281 rice) as examples (Figure 1a; Data S1).</p>\n<figure><picture>\n<source media=\"(min-width: 1650px)\" srcset=\"/cms/asset/800ca9d8-87ce-48cd-b7cf-1f99deca6de1/pbi14466-fig-0001-m.jpg\"/><img alt=\"Details are in the caption following the image\" data-lg-src=\"/cms/asset/800ca9d8-87ce-48cd-b7cf-1f99deca6de1/pbi14466-fig-0001-m.jpg\" loading=\"lazy\" src=\"/cms/asset/dec6a469-300e-43d3-a4bf-08ed5e20ab2a/pbi14466-fig-0001-m.png\" title=\"Details are in the caption following the image\"/></picture><figcaption>\n<div><strong>Figure 1<span style=\"font-weight:normal\"></span></strong><div>Open in figure viewer<i aria-hidden=\"true\"></i><span>PowerPoint</span></div>\n</div>\n<div>(a) The workflow of the benchmarking study of gene copy number estimation, the diagrams of exogenous gene cassettes of the tested GM events (FG72, G281, G6H1 and 12-5), and the position of hybrid probes in Southern blotting analysis. (b) Southern blotting analysis of four events with various restriction enzymes. GM, GM event; M, DNA marker; P, positive control; WT, the corresponding recipient line of GM event. (c) The constructed standard curves of qPCR assays employing corresponding plasmid calibrators of exogenous and endogenous genes. (d) Summarizes the copy numbers of transgenes determined from Southern blotting, qPCR, ddPCR and PE-WGS analysis. (e) Advantages and disadvantages of the four methods in transgene copy number evaluation. dPCR, digital PCR; GM, genetically modified; PCR, polymerase chain reaction; PE-WGS, paired-end whole-genome sequencing; qPCR, quantitative PCR.</div>\n</figcaption>\n</figure>\n<p>In SB analysis, we used various restriction endonucleases for genomic DNA digestion. For the event G6H1, BamHI, SacI, KpnI and StuI revealed one copy of <i>cry1Ab/vip3H</i> and <i>G6epsps</i>. The G281 event showed a single copy of <i>hLF</i> but uncertain <i>G6epsps</i> copy numbers (one or two). FG72 displayed inconsistent band patterns for <i>2mepsps</i> and <i>hppdPfW336</i>, suggesting copy numbers of one or two. Maize 12-5, analysed with KpnI and XbaI, indicated a single copy of <i>G10epsps</i> and <i>cry1Ab/cry2Aj</i> (Figure 1b, Table S1).</p>\n<p>We employed absolute quantification via qPCR, using endogenous reference genes SPS, Lectin and zSSIIb for rice, soybean and maize, respectively. All assays were validated for high efficiency and precision (Figure 1c, Table S2). Quantitative real-time PCR results showed G6H1's <i>G6epsps</i> and <i>cry1Ab/vip3H</i> had values of 0.98 and 0.96; G281's <i>G6epsps</i> and <i>hLF</i> were 1.68 and 1.54; FG72's <i>2mepsps</i> and <i>hppdPfW336</i> were 1.72 and 1.67; maize 12-5's <i>G10epsps</i> and <i>cry1Ab/cry2Aj</i> were 0.81 and 0.83 (Table S3). These values suggest a single T-DNA fragment integration in G6H1 and 12-5 and two fragments in G281 and FG72.</p>\n<p>Digital PCR provides absolute quantification by comparing target DNA to a reference gene. Results showed G6H1 had 0.94 and 0.97 copies for <i>G6epsps</i> and <i>cry1Ab/vip3H</i>. G281's <i>G6epsps</i> and <i>hLF</i> were 1.85 and 1.93. FG72's <i>2mepsps</i> and <i>hppdPfW336</i> were 1.69 and 1.68. 12-5's <i>G10epsps</i> and cry1Ab/cry2A were 0.57 and 0.59 (Table S4). These indicate single exogenous gene copies in G6H1 and 12-5 and dual copies in G281 and FG72.</p>\n<p>In PE-WGS analysis, the sequencing depths for G6H1, G281, FG72 and 12-5 were 28.81, 28.91, 48.70 and 23.94, respectively (Table S5). Read counts for target genes were used to calculate copy numbers: G6H1 had 1.08 copies of <i>G6epsps</i> and 0.83 copies of <i>cry1Ab/vip3H</i>; G281 had 2.01 copies of <i>G6epsps</i> and 1.91 copies of <i>hLF</i>; FG72 had 1.80 copies of <i>2mepsps</i> and 2.00 copies of <i>hppdPfW336</i>; 12-5 had 0.58 copies of <i>G10epsps</i> and 0.61 copies of <i>cry1Ab/cry2Aj</i> (Table S6).</p>\n<p>Our systematic measurements of four GM events showed that all four techniques are suitable for this purpose to varying degrees. All methods accurately quantified single-copy genes; however, discrepancies emerged for multi-copy genes (Figure 1d). The strengths and limitations of each method concerning various aspects were summarized in Figure 1e.</p>\n<p>Southern blotting is less accurate and sensitive for multi-copy genes. Southern blotting often underestimates multi-copy genes due to complex arrangements like tandem repeats and can overestimate due to incomplete digestion and cross-hybridization. qPCR, while more accurate than SB, struggles with high-copy genes due to resolution limits (around two-fold variation). Proper primer design and reaction optimization can still yield relatively accurate results. Digital PCR excels with high accuracy for multi-copy genes due to its partitioning capability, allowing it to detect minor changes in copy number, such as a 1.2-fold change from 5 to 6 copies (Whale <i>et al</i>., <span>2012</span>). Paired-end whole-genome sequencing also provides precise quantification for multi-copy genes through adequate coverage and sophisticated data analysis tools, especially useful for genes with complex genomic rearrangements (Hehir-Kwa <i>et al</i>., <span>2018</span>). Furthermore, PE-WGS has demonstrated high performance in elucidating the comprehensive molecular characterization of transgenic plants and animals. This includes identifying transgene insertion sites, flanking sequences, entire T-DNA integration structures and plasmid backbone presence, among other features.</p>\n<p>Digital PCR and PE-WGS are most effective for distinguishing heterozygotes from homozygotes. Digital PCR offers absolute quantification without needing a standard curve, providing precise measurements. Paired-end whole-genome sequencing, through high-resolution mapping of paired-end reads, can differentiate between heterozygotes and homozygotes by analysing read depth. It is challenging to distinguish between homozygotes and heterozygotes with SB due to similar patterns from sequence homology. Quantitative PCR can distinguish heterozygotes from homozygotes based on Ct values, but it requires careful calibration and control and is influenced by PCR efficiency.</p>\n<p>Southern blotting and PE-WGS require substantial amounts of DNA, whereas PCR-based methods need significantly less. Quantitative PCR necessitates high-quality DNA free from degradation and PCR inhibitors. Digital PCR is more tolerant of DNA degradation and inhibitors, delivering accurate quantification even with crude DNA extracts (Whale <i>et al</i>., <span>2012</span>).</p>\n<p>Quantitative PCR and dPCR are generally easier to set up and perform than SB and PE-WGS. SB is labor-intensive, involving several complex steps, including DNA digestion, transformation, hybridization and autoradiography. Paired-end whole-genome sequencing requires strict protocols, thorough DNA extraction, library preparation, machine sequencing and data analysis, necessitating skilled personnel in molecular biology and bioinformatics. Quantitative PCR involves optimizing conditions, constructing positive plasmid DNA, creating standard curves and testing, requiring normal molecular laboratory skills. Digital PCR entails sample partitioning using instruments that streamline the process into one platform.</p>\n<p>Quantitative PCR requires the least technical expertise, primarily understanding primer design and qPCR setup. Digital PCR needs moderate expertise, especially for sample emulsification or droplet/well creation. Southern blotting demands high technical skills, particularly for DNA digestion, gel electrophoresis, transfer and hybridization. Paired-end whole-genome sequencing requires significant expertise in data analytics and bioinformatics, besides library preparation and sequencing skills. The experimental duration of qPCR and dPCR is faster than SB and PE-WGS, typically concluding within a day. Southern blotting and PE-WGS require at least 3 days, respectively. Digital PCR is quicker than qPCR because it does not require a standard curve.</p>\n<p>Costwise, SB is relatively cheap owing to lower reagent costs and basic equipment needs. Quantitative PCR has a medium cost, with moderately expensive reagents and higher throughput, which reduces per-sample costs. Digital PCR is more costly due to expensive equipment and lower throughput but provides absolute quantification without standard curves. Paired-end whole-genome sequencing is the most expensive, justified by its comprehensive genomic characterization capabilities beyond single gene copy estimation.</p>\n<p>We propose prioritizing dPCR and PE-WGS for precise gene copy number analysis. Paired-end whole-genome sequencing is especially suited for assessing multiple gene copies within a sample, while dPCR is optimal for smaller quantities per sample, offering robust tools for genomic research and biotechnological applications.</p>","PeriodicalId":221,"journal":{"name":"Plant Biotechnology Journal","volume":"105 1","pages":""},"PeriodicalIF":10.1000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative evaluation of gene copy number estimation techniques in genetically modified crops: insights from Southern blotting, qPCR, dPCR and NGS\",\"authors\":\"Wenting Xu, Jingang Liang, Fan Wang, Litao Yang\",\"doi\":\"10.1111/pbi.14466\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Gene copy number is crucial for understanding genomic architecture and its implications in plant and animal genetics (Alonge <i>et al</i>., <span>2020</span>; Castagnone-Sereno <i>et al</i>., <span>2019</span>). In agriculture, variations in gene copy number (CNVs) are vital as they affect yield, stress resistance and metabolic capabilities (Yuan <i>et al</i>., <span>2021</span>). Transgenesis, involving the introduction of foreign DNA into plant genomes, has revolutionized agriculture by creating genetically modified (GM) plants with desirable traits. Assessing gene copy numbers in GMOs ensures stability and expression of introduced traits and is crucial for regulatory compliance and biosafety assessments (Liang <i>et al</i>., <span>2022</span>). Evaluating gene copy numbers in transgenic plants is technically challenging due to variability in transgene integration events (Faure, <span>2021</span>). Various techniques like Southern blotting (SB), quantitative real-time PCR (qPCR), digital PCR (dPCR) and paired-end whole-genome sequencing (PE-WGS) have been reported for gene copy number determination (Cusenza <i>et al</i>., <span>2021</span>). However, no systematic comparison of these four methods has been reported, especially concerning PE-WGS.</p>\\n<p>Here, we performed a comparative benchmarking of gene copy number assessment techniques, including SB, qPCR, dPCR and PE-WGS, employing 4 GM crop events (FG72 soybean, 12-5 maize, G6H1 and G281 rice) as examples (Figure 1a; Data S1).</p>\\n<figure><picture>\\n<source media=\\\"(min-width: 1650px)\\\" srcset=\\\"/cms/asset/800ca9d8-87ce-48cd-b7cf-1f99deca6de1/pbi14466-fig-0001-m.jpg\\\"/><img alt=\\\"Details are in the caption following the image\\\" data-lg-src=\\\"/cms/asset/800ca9d8-87ce-48cd-b7cf-1f99deca6de1/pbi14466-fig-0001-m.jpg\\\" loading=\\\"lazy\\\" src=\\\"/cms/asset/dec6a469-300e-43d3-a4bf-08ed5e20ab2a/pbi14466-fig-0001-m.png\\\" title=\\\"Details are in the caption following the image\\\"/></picture><figcaption>\\n<div><strong>Figure 1<span style=\\\"font-weight:normal\\\"></span></strong><div>Open in figure viewer<i aria-hidden=\\\"true\\\"></i><span>PowerPoint</span></div>\\n</div>\\n<div>(a) The workflow of the benchmarking study of gene copy number estimation, the diagrams of exogenous gene cassettes of the tested GM events (FG72, G281, G6H1 and 12-5), and the position of hybrid probes in Southern blotting analysis. (b) Southern blotting analysis of four events with various restriction enzymes. GM, GM event; M, DNA marker; P, positive control; WT, the corresponding recipient line of GM event. (c) The constructed standard curves of qPCR assays employing corresponding plasmid calibrators of exogenous and endogenous genes. (d) Summarizes the copy numbers of transgenes determined from Southern blotting, qPCR, ddPCR and PE-WGS analysis. (e) Advantages and disadvantages of the four methods in transgene copy number evaluation. dPCR, digital PCR; GM, genetically modified; PCR, polymerase chain reaction; PE-WGS, paired-end whole-genome sequencing; qPCR, quantitative PCR.</div>\\n</figcaption>\\n</figure>\\n<p>In SB analysis, we used various restriction endonucleases for genomic DNA digestion. For the event G6H1, BamHI, SacI, KpnI and StuI revealed one copy of <i>cry1Ab/vip3H</i> and <i>G6epsps</i>. The G281 event showed a single copy of <i>hLF</i> but uncertain <i>G6epsps</i> copy numbers (one or two). FG72 displayed inconsistent band patterns for <i>2mepsps</i> and <i>hppdPfW336</i>, suggesting copy numbers of one or two. Maize 12-5, analysed with KpnI and XbaI, indicated a single copy of <i>G10epsps</i> and <i>cry1Ab/cry2Aj</i> (Figure 1b, Table S1).</p>\\n<p>We employed absolute quantification via qPCR, using endogenous reference genes SPS, Lectin and zSSIIb for rice, soybean and maize, respectively. All assays were validated for high efficiency and precision (Figure 1c, Table S2). Quantitative real-time PCR results showed G6H1's <i>G6epsps</i> and <i>cry1Ab/vip3H</i> had values of 0.98 and 0.96; G281's <i>G6epsps</i> and <i>hLF</i> were 1.68 and 1.54; FG72's <i>2mepsps</i> and <i>hppdPfW336</i> were 1.72 and 1.67; maize 12-5's <i>G10epsps</i> and <i>cry1Ab/cry2Aj</i> were 0.81 and 0.83 (Table S3). These values suggest a single T-DNA fragment integration in G6H1 and 12-5 and two fragments in G281 and FG72.</p>\\n<p>Digital PCR provides absolute quantification by comparing target DNA to a reference gene. Results showed G6H1 had 0.94 and 0.97 copies for <i>G6epsps</i> and <i>cry1Ab/vip3H</i>. G281's <i>G6epsps</i> and <i>hLF</i> were 1.85 and 1.93. FG72's <i>2mepsps</i> and <i>hppdPfW336</i> were 1.69 and 1.68. 12-5's <i>G10epsps</i> and cry1Ab/cry2A were 0.57 and 0.59 (Table S4). These indicate single exogenous gene copies in G6H1 and 12-5 and dual copies in G281 and FG72.</p>\\n<p>In PE-WGS analysis, the sequencing depths for G6H1, G281, FG72 and 12-5 were 28.81, 28.91, 48.70 and 23.94, respectively (Table S5). Read counts for target genes were used to calculate copy numbers: G6H1 had 1.08 copies of <i>G6epsps</i> and 0.83 copies of <i>cry1Ab/vip3H</i>; G281 had 2.01 copies of <i>G6epsps</i> and 1.91 copies of <i>hLF</i>; FG72 had 1.80 copies of <i>2mepsps</i> and 2.00 copies of <i>hppdPfW336</i>; 12-5 had 0.58 copies of <i>G10epsps</i> and 0.61 copies of <i>cry1Ab/cry2Aj</i> (Table S6).</p>\\n<p>Our systematic measurements of four GM events showed that all four techniques are suitable for this purpose to varying degrees. All methods accurately quantified single-copy genes; however, discrepancies emerged for multi-copy genes (Figure 1d). The strengths and limitations of each method concerning various aspects were summarized in Figure 1e.</p>\\n<p>Southern blotting is less accurate and sensitive for multi-copy genes. Southern blotting often underestimates multi-copy genes due to complex arrangements like tandem repeats and can overestimate due to incomplete digestion and cross-hybridization. qPCR, while more accurate than SB, struggles with high-copy genes due to resolution limits (around two-fold variation). Proper primer design and reaction optimization can still yield relatively accurate results. Digital PCR excels with high accuracy for multi-copy genes due to its partitioning capability, allowing it to detect minor changes in copy number, such as a 1.2-fold change from 5 to 6 copies (Whale <i>et al</i>., <span>2012</span>). Paired-end whole-genome sequencing also provides precise quantification for multi-copy genes through adequate coverage and sophisticated data analysis tools, especially useful for genes with complex genomic rearrangements (Hehir-Kwa <i>et al</i>., <span>2018</span>). Furthermore, PE-WGS has demonstrated high performance in elucidating the comprehensive molecular characterization of transgenic plants and animals. This includes identifying transgene insertion sites, flanking sequences, entire T-DNA integration structures and plasmid backbone presence, among other features.</p>\\n<p>Digital PCR and PE-WGS are most effective for distinguishing heterozygotes from homozygotes. Digital PCR offers absolute quantification without needing a standard curve, providing precise measurements. Paired-end whole-genome sequencing, through high-resolution mapping of paired-end reads, can differentiate between heterozygotes and homozygotes by analysing read depth. It is challenging to distinguish between homozygotes and heterozygotes with SB due to similar patterns from sequence homology. Quantitative PCR can distinguish heterozygotes from homozygotes based on Ct values, but it requires careful calibration and control and is influenced by PCR efficiency.</p>\\n<p>Southern blotting and PE-WGS require substantial amounts of DNA, whereas PCR-based methods need significantly less. Quantitative PCR necessitates high-quality DNA free from degradation and PCR inhibitors. Digital PCR is more tolerant of DNA degradation and inhibitors, delivering accurate quantification even with crude DNA extracts (Whale <i>et al</i>., <span>2012</span>).</p>\\n<p>Quantitative PCR and dPCR are generally easier to set up and perform than SB and PE-WGS. SB is labor-intensive, involving several complex steps, including DNA digestion, transformation, hybridization and autoradiography. Paired-end whole-genome sequencing requires strict protocols, thorough DNA extraction, library preparation, machine sequencing and data analysis, necessitating skilled personnel in molecular biology and bioinformatics. Quantitative PCR involves optimizing conditions, constructing positive plasmid DNA, creating standard curves and testing, requiring normal molecular laboratory skills. Digital PCR entails sample partitioning using instruments that streamline the process into one platform.</p>\\n<p>Quantitative PCR requires the least technical expertise, primarily understanding primer design and qPCR setup. Digital PCR needs moderate expertise, especially for sample emulsification or droplet/well creation. Southern blotting demands high technical skills, particularly for DNA digestion, gel electrophoresis, transfer and hybridization. Paired-end whole-genome sequencing requires significant expertise in data analytics and bioinformatics, besides library preparation and sequencing skills. The experimental duration of qPCR and dPCR is faster than SB and PE-WGS, typically concluding within a day. Southern blotting and PE-WGS require at least 3 days, respectively. Digital PCR is quicker than qPCR because it does not require a standard curve.</p>\\n<p>Costwise, SB is relatively cheap owing to lower reagent costs and basic equipment needs. Quantitative PCR has a medium cost, with moderately expensive reagents and higher throughput, which reduces per-sample costs. Digital PCR is more costly due to expensive equipment and lower throughput but provides absolute quantification without standard curves. Paired-end whole-genome sequencing is the most expensive, justified by its comprehensive genomic characterization capabilities beyond single gene copy estimation.</p>\\n<p>We propose prioritizing dPCR and PE-WGS for precise gene copy number analysis. Paired-end whole-genome sequencing is especially suited for assessing multiple gene copies within a sample, while dPCR is optimal for smaller quantities per sample, offering robust tools for genomic research and biotechnological applications.</p>\",\"PeriodicalId\":221,\"journal\":{\"name\":\"Plant Biotechnology Journal\",\"volume\":\"105 1\",\"pages\":\"\"},\"PeriodicalIF\":10.1000,\"publicationDate\":\"2024-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Plant Biotechnology Journal\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1111/pbi.14466\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOTECHNOLOGY & APPLIED MICROBIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Biotechnology Journal","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1111/pbi.14466","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

基因拷贝数对于了解基因组结构及其对动植物遗传学的影响至关重要(Alonge 等人,2020 年;Castagnone-Sereno 等人,2019 年)。在农业领域,基因拷贝数(CNV)的变化至关重要,因为它们会影响产量、抗逆性和代谢能力(Yuan 等人,2021 年)。转基因涉及将外来 DNA 导入植物基因组,通过创造具有理想性状的转基因植物,使农业发生了革命性变化。评估转基因生物的基因拷贝数可确保所引入性状的稳定性和表达,对于监管合规和生物安全评估至关重要(Liang 等人,2022 年)。由于转基因整合事件的变异性,评估转基因植物的基因拷贝数在技术上具有挑战性(Faure,2021 年)。据报道,有多种技术可用于基因拷贝数测定,如 Southern 印迹(SB)、定量实时 PCR(qPCR)、数字 PCR(dPCR)和成对全基因组测序(PE-WGS)(Cusenza 等,2021 年)。在此,我们以 4 个转基因作物事件(FG72 大豆、12-5 玉米、G6H1 和 G281 水稻)为例,对基因拷贝数评估技术(包括 SB、qPCR、dPCR 和 PE-WGS)进行了比较基准测试(图 1a;数据 S1)。图 1在图形浏览器中打开PowerPoint(a)基因拷贝数估算基准研究的工作流程、被测转基因事件(FG72、G281、G6H1 和 12-5)的外源基因盒图以及 Southern 印迹分析中杂交探针的位置。(b) 用各种限制性酶对四个基因事件进行 Southern 印迹分析。GM,GM 事件;M,DNA 标记;P,阳性对照;WT,GM 事件的相应受体系。(c) 利用外源基因和内源基因的相应质粒校准物构建的 qPCR 检测标准曲线。(d) 通过 Southern 印迹、qPCR、ddPCR 和 PE-WGS 分析确定的转基因拷贝数。(dPCR, digital PCR; GM, genetically modified; PCR, polymerase chain reaction; PE-WGS, paired-end whole-genome sequencing; qPCR, quantitative PCR.在 SB 分析中,我们使用了多种限制性内切酶进行基因组 DNA 消化。在 G6H1 事件中,BamHI、SacI、KpnI 和 StuI 发现了 cry1Ab/vip3H 和 G6epsps 的一个拷贝。G281 事件显示了一个 hLF 拷贝,但 G6epsps 拷贝数不确定(一个或两个)。FG72 显示了 2mepsps 和 hppdPfW336 不一致的条带模式,表明拷贝数为一个或两个。用 KpnI 和 XbaI 分析的玉米 12-5 表明 G10epsps 和 cry1Ab/cry2Aj 只有一个拷贝(图 1b,表 S1)。所有检测方法都经过了高效和精确的验证(图 1c,表 S2)。实时定量 PCR 结果显示,G6H1 的 G6epsps 和 cry1Ab/vip3H 的值分别为 0.98 和 0.96;G281 的 G6epsps 和 hLF 的值分别为 1.68 和 1.54;FG72 的 2mepsps 和 hppdPfW336 的值分别为 1.72 和 1.67;玉米 12-5 的 G10epsps 和 cry1Ab/cry2Aj 的值分别为 0.81 和 0.83(表 S3)。这些数值表明 G6H1 和 12-5 有一个 T-DNA 片段整合,而 G281 和 FG72 有两个片段整合。结果显示,G6H1 的 G6epsps 和 cry1Ab/vip3H 的拷贝数分别为 0.94 和 0.97。G281 的 G6epsps 和 hLF 分别为 1.85 和 1.93。FG72 的 2mepsps 和 hppdPfW336 分别为 1.69 和 1.68。12-5 的 G10epsps 和 cry1Ab/cry2A 分别为 0.57 和 0.59(表 S4)。在 PE-WGS 分析中,G6H1、G281、FG72 和 12-5 的测序深度分别为 28.81、28.91、48.70 和 23.94(表 S5)。目标基因的读数被用来计算拷贝数:G6H1 有 1.08 个 G6epsps 拷贝和 0.83 个 cry1Ab/vip3H 拷贝;G281 有 2.01 个 G6epsps 拷贝和 1.91 个 hLF 拷贝;FG72 有 1.80 个 2mepsps 拷贝和 2.00 个 hppdPfW336 拷贝;12-5 有 0.我们对四种转基因事件的系统测量表明,所有四种技术在不同程度上都适用于这一目的。所有方法都能准确量化单拷贝基因,但在多拷贝基因方面出现了差异(图 1d)。图 1e 总结了每种方法在各方面的优势和局限性。Southern 印迹法往往会因串联重复等复杂排列而低估多拷贝基因,也会因消化不完全和交叉杂交而高估多拷贝基因。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Comparative evaluation of gene copy number estimation techniques in genetically modified crops: insights from Southern blotting, qPCR, dPCR and NGS

Gene copy number is crucial for understanding genomic architecture and its implications in plant and animal genetics (Alonge et al., 2020; Castagnone-Sereno et al., 2019). In agriculture, variations in gene copy number (CNVs) are vital as they affect yield, stress resistance and metabolic capabilities (Yuan et al., 2021). Transgenesis, involving the introduction of foreign DNA into plant genomes, has revolutionized agriculture by creating genetically modified (GM) plants with desirable traits. Assessing gene copy numbers in GMOs ensures stability and expression of introduced traits and is crucial for regulatory compliance and biosafety assessments (Liang et al., 2022). Evaluating gene copy numbers in transgenic plants is technically challenging due to variability in transgene integration events (Faure, 2021). Various techniques like Southern blotting (SB), quantitative real-time PCR (qPCR), digital PCR (dPCR) and paired-end whole-genome sequencing (PE-WGS) have been reported for gene copy number determination (Cusenza et al., 2021). However, no systematic comparison of these four methods has been reported, especially concerning PE-WGS.

Here, we performed a comparative benchmarking of gene copy number assessment techniques, including SB, qPCR, dPCR and PE-WGS, employing 4 GM crop events (FG72 soybean, 12-5 maize, G6H1 and G281 rice) as examples (Figure 1a; Data S1).

Details are in the caption following the image
Figure 1
Open in figure viewerPowerPoint
(a) The workflow of the benchmarking study of gene copy number estimation, the diagrams of exogenous gene cassettes of the tested GM events (FG72, G281, G6H1 and 12-5), and the position of hybrid probes in Southern blotting analysis. (b) Southern blotting analysis of four events with various restriction enzymes. GM, GM event; M, DNA marker; P, positive control; WT, the corresponding recipient line of GM event. (c) The constructed standard curves of qPCR assays employing corresponding plasmid calibrators of exogenous and endogenous genes. (d) Summarizes the copy numbers of transgenes determined from Southern blotting, qPCR, ddPCR and PE-WGS analysis. (e) Advantages and disadvantages of the four methods in transgene copy number evaluation. dPCR, digital PCR; GM, genetically modified; PCR, polymerase chain reaction; PE-WGS, paired-end whole-genome sequencing; qPCR, quantitative PCR.

In SB analysis, we used various restriction endonucleases for genomic DNA digestion. For the event G6H1, BamHI, SacI, KpnI and StuI revealed one copy of cry1Ab/vip3H and G6epsps. The G281 event showed a single copy of hLF but uncertain G6epsps copy numbers (one or two). FG72 displayed inconsistent band patterns for 2mepsps and hppdPfW336, suggesting copy numbers of one or two. Maize 12-5, analysed with KpnI and XbaI, indicated a single copy of G10epsps and cry1Ab/cry2Aj (Figure 1b, Table S1).

We employed absolute quantification via qPCR, using endogenous reference genes SPS, Lectin and zSSIIb for rice, soybean and maize, respectively. All assays were validated for high efficiency and precision (Figure 1c, Table S2). Quantitative real-time PCR results showed G6H1's G6epsps and cry1Ab/vip3H had values of 0.98 and 0.96; G281's G6epsps and hLF were 1.68 and 1.54; FG72's 2mepsps and hppdPfW336 were 1.72 and 1.67; maize 12-5's G10epsps and cry1Ab/cry2Aj were 0.81 and 0.83 (Table S3). These values suggest a single T-DNA fragment integration in G6H1 and 12-5 and two fragments in G281 and FG72.

Digital PCR provides absolute quantification by comparing target DNA to a reference gene. Results showed G6H1 had 0.94 and 0.97 copies for G6epsps and cry1Ab/vip3H. G281's G6epsps and hLF were 1.85 and 1.93. FG72's 2mepsps and hppdPfW336 were 1.69 and 1.68. 12-5's G10epsps and cry1Ab/cry2A were 0.57 and 0.59 (Table S4). These indicate single exogenous gene copies in G6H1 and 12-5 and dual copies in G281 and FG72.

In PE-WGS analysis, the sequencing depths for G6H1, G281, FG72 and 12-5 were 28.81, 28.91, 48.70 and 23.94, respectively (Table S5). Read counts for target genes were used to calculate copy numbers: G6H1 had 1.08 copies of G6epsps and 0.83 copies of cry1Ab/vip3H; G281 had 2.01 copies of G6epsps and 1.91 copies of hLF; FG72 had 1.80 copies of 2mepsps and 2.00 copies of hppdPfW336; 12-5 had 0.58 copies of G10epsps and 0.61 copies of cry1Ab/cry2Aj (Table S6).

Our systematic measurements of four GM events showed that all four techniques are suitable for this purpose to varying degrees. All methods accurately quantified single-copy genes; however, discrepancies emerged for multi-copy genes (Figure 1d). The strengths and limitations of each method concerning various aspects were summarized in Figure 1e.

Southern blotting is less accurate and sensitive for multi-copy genes. Southern blotting often underestimates multi-copy genes due to complex arrangements like tandem repeats and can overestimate due to incomplete digestion and cross-hybridization. qPCR, while more accurate than SB, struggles with high-copy genes due to resolution limits (around two-fold variation). Proper primer design and reaction optimization can still yield relatively accurate results. Digital PCR excels with high accuracy for multi-copy genes due to its partitioning capability, allowing it to detect minor changes in copy number, such as a 1.2-fold change from 5 to 6 copies (Whale et al., 2012). Paired-end whole-genome sequencing also provides precise quantification for multi-copy genes through adequate coverage and sophisticated data analysis tools, especially useful for genes with complex genomic rearrangements (Hehir-Kwa et al., 2018). Furthermore, PE-WGS has demonstrated high performance in elucidating the comprehensive molecular characterization of transgenic plants and animals. This includes identifying transgene insertion sites, flanking sequences, entire T-DNA integration structures and plasmid backbone presence, among other features.

Digital PCR and PE-WGS are most effective for distinguishing heterozygotes from homozygotes. Digital PCR offers absolute quantification without needing a standard curve, providing precise measurements. Paired-end whole-genome sequencing, through high-resolution mapping of paired-end reads, can differentiate between heterozygotes and homozygotes by analysing read depth. It is challenging to distinguish between homozygotes and heterozygotes with SB due to similar patterns from sequence homology. Quantitative PCR can distinguish heterozygotes from homozygotes based on Ct values, but it requires careful calibration and control and is influenced by PCR efficiency.

Southern blotting and PE-WGS require substantial amounts of DNA, whereas PCR-based methods need significantly less. Quantitative PCR necessitates high-quality DNA free from degradation and PCR inhibitors. Digital PCR is more tolerant of DNA degradation and inhibitors, delivering accurate quantification even with crude DNA extracts (Whale et al., 2012).

Quantitative PCR and dPCR are generally easier to set up and perform than SB and PE-WGS. SB is labor-intensive, involving several complex steps, including DNA digestion, transformation, hybridization and autoradiography. Paired-end whole-genome sequencing requires strict protocols, thorough DNA extraction, library preparation, machine sequencing and data analysis, necessitating skilled personnel in molecular biology and bioinformatics. Quantitative PCR involves optimizing conditions, constructing positive plasmid DNA, creating standard curves and testing, requiring normal molecular laboratory skills. Digital PCR entails sample partitioning using instruments that streamline the process into one platform.

Quantitative PCR requires the least technical expertise, primarily understanding primer design and qPCR setup. Digital PCR needs moderate expertise, especially for sample emulsification or droplet/well creation. Southern blotting demands high technical skills, particularly for DNA digestion, gel electrophoresis, transfer and hybridization. Paired-end whole-genome sequencing requires significant expertise in data analytics and bioinformatics, besides library preparation and sequencing skills. The experimental duration of qPCR and dPCR is faster than SB and PE-WGS, typically concluding within a day. Southern blotting and PE-WGS require at least 3 days, respectively. Digital PCR is quicker than qPCR because it does not require a standard curve.

Costwise, SB is relatively cheap owing to lower reagent costs and basic equipment needs. Quantitative PCR has a medium cost, with moderately expensive reagents and higher throughput, which reduces per-sample costs. Digital PCR is more costly due to expensive equipment and lower throughput but provides absolute quantification without standard curves. Paired-end whole-genome sequencing is the most expensive, justified by its comprehensive genomic characterization capabilities beyond single gene copy estimation.

We propose prioritizing dPCR and PE-WGS for precise gene copy number analysis. Paired-end whole-genome sequencing is especially suited for assessing multiple gene copies within a sample, while dPCR is optimal for smaller quantities per sample, offering robust tools for genomic research and biotechnological applications.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Plant Biotechnology Journal
Plant Biotechnology Journal 生物-生物工程与应用微生物
CiteScore
20.50
自引率
2.90%
发文量
201
审稿时长
1 months
期刊介绍: Plant Biotechnology Journal aspires to publish original research and insightful reviews of high impact, authored by prominent researchers in applied plant science. The journal places a special emphasis on molecular plant sciences and their practical applications through plant biotechnology. Our goal is to establish a platform for showcasing significant advances in the field, encompassing curiosity-driven studies with potential applications, strategic research in plant biotechnology, scientific analysis of crucial issues for the beneficial utilization of plant sciences, and assessments of the performance of plant biotechnology products in practical applications.
期刊最新文献
Metabolic engineering of narrow-leafed lupin for the production of enantiomerically pure (−)-sparteine Creation of high-resistant starch rice through systematic editing of amylopectin biosynthetic genes in rs4. Plant-produced SARS-CoV-2 antibody engineered towards enhanced potency and in vivo efficacy Pathway elucidation and heterologous reconstitution of the long-chain alkane pentadecane biosynthesis from Pogostemon cablin. Increasing thermostability of the key photorespiratory enzyme glycerate 3-kinase by structure-based recombination
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1