Development and Validation of a Novel Placental DNA Methylation Biomarker of Maternal Smoking during Pregnancy in the ECHO Program.
Lyndsey E Shorey-Kendrick, Brett Davis, Lina Gao, Byung Park, Annette Vu, Cynthia D Morris, Carrie V Breton, Rebecca Fry, Erika Garcia, Rebecca J Schmidt, T Michael O'Shea, Robert S Tepper, Cindy T McEvoy, Eliot R Spindel
Environmental Health Perspectives 132(6): 67005. Published 2024-06-01 (Epub 2024-06-17). DOI: 10.1289/EHP13838. PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11218700/pdf/
Citations: 0
Abstract
Background: Maternal cigarette smoking during pregnancy (MSDP) is associated with numerous adverse health outcomes in infants and children with potential lifelong consequences. Negative effects of MSDP on placental DNA methylation (DNAm), placental structure, and function are well established.
Objective: Our aim was to develop biomarkers of MSDP using DNAm measured in placentas (N = 96), collected as part of the Vitamin C to Decrease the Effects of Smoking in Pregnancy on Infant Lung Function double-blind, placebo-controlled randomized clinical trial conducted between 2012 and 2016. We also aimed to develop a digital polymerase chain reaction (PCR) assay for the top ranking cytosine-guanine dinucleotide (CpG) so that large numbers of samples can be screened for exposure at low cost.
Methods: We compared the ability of four machine learning methods [logistic least absolute shrinkage and selection operator (LASSO) regression, logistic elastic net regression, random forest, and gradient boosting machine] to classify MSDP based on placental DNAm signatures. We developed separate models using the complete EPIC array dataset and on the subset of probes also found on the 450K array so that models exist for both platforms. For comparison, we developed a model using CpGs previously associated with MSDP in placenta. For each final model, we used model coefficients and normalized beta values to calculate placental smoking index (PSI) scores for each sample. Final models were validated in two external datasets: the Extremely Low Gestational Age Newborn observational study, N = 426; and the Rhode Island Children's Health Study, N = 237.
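One way to read the PSI calculation described in the Methods is as a linear score: each CpG retained by the model contributes its normalized beta value weighted by the model coefficient. The following is a minimal sketch under that assumption. The function name `psi_score`, the coefficient values, the intercept, and every CpG identifier other than cg27402634 are hypothetical; the abstract does not state whether the published score includes an intercept or applies any further transformation.

```python
from typing import Mapping


def psi_score(betas: Mapping[str, float],
              coefs: Mapping[str, float],
              intercept: float = 0.0) -> float:
    """Assumed linear score: intercept + sum(coefficient * beta) over model CpGs."""
    # Every CpG used by the model must be present in the sample.
    missing = [cpg for cpg in coefs if cpg not in betas]
    if missing:
        raise KeyError(f"sample is missing CpGs required by the model: {missing}")
    # CpGs measured on the array but not selected by the model are ignored.
    return intercept + sum(coefs[cpg] * betas[cpg] for cpg in coefs)


# Hypothetical coefficients and beta values, for illustration only.
example_coefs = {"cg27402634": -4.0, "cg_hypothetical_2": 2.0}
example_sample = {"cg27402634": 0.25, "cg_hypothetical_2": 0.5, "cg_unused": 0.9}
example_psi = psi_score(example_sample, example_coefs, intercept=1.0)
```

Because a sparse model such as LASSO keeps only a small set of CpGs, a score of this form needs only those probes, which is consistent with the stated goal of low-cost screening.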
Results: Logistic LASSO regression demonstrated the highest performance in cross-validation testing with the lowest number of input CpGs. Accuracy was greatest in external datasets when using models developed for the same platform. PSI scores in smokers only (n = 72) were moderately correlated with maternal plasma cotinine levels. One CpG (cg27402634), with the largest coefficient in two models, was measured accurately by digital PCR compared with measurement by EPIC array (R² = 0.98).
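The agreement between digital PCR and the EPIC array is reported as an R² value. The abstract does not say how it was computed; the sketch below assumes the common choice of the squared Pearson correlation between paired measurements of the same samples. The function name `r_squared` and the paired values are hypothetical.

```python
from typing import Sequence


def r_squared(x: Sequence[float], y: Sequence[float]) -> float:
    """Squared Pearson correlation between two paired measurement series."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must be paired and contain at least two values")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("R^2 is undefined when one series is constant")
    return (sxy * sxy) / (sxx * syy)


# Hypothetical paired methylation fractions (array vs. digital PCR).
array_values = [0.0, 1.0, 2.0, 3.0]
dpcr_values = [0.0, 1.0, 0.0, 1.0]
example_r2 = r_squared(array_values, dpcr_values)
```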
Discussion: To our knowledge, we have developed the first placental DNAm-based biomarkers of MSDP with broad utility to studies of prenatal disease origins. https://doi.org/10.1289/EHP13838.
About the journal:
Environmental Health Perspectives (EHP) is a monthly peer-reviewed journal supported by the National Institute of Environmental Health Sciences, part of the National Institutes of Health under the U.S. Department of Health and Human Services. Its mission is to facilitate discussions on the connections between the environment and human health by publishing top-notch research and news. EHP ranks third in Public, Environmental, and Occupational Health, fourth in Toxicology, and fifth in Environmental Sciences.