Pub Date : 2024-11-01Epub Date: 2024-08-16DOI: 10.1097/EDE.0000000000001785
Ashley I Naimi, Ya-Hui Yu, Lisa M Bodnar
Background: The use of machine learning to estimate exposure effects introduces a dependence between the results of an empirical study and the value of the seed used to fix the pseudo-random number generator.
Methods: We used data from 10,038 pregnant women and a 10% subsample (N = 1004) to examine the extent to which the risk difference for the relation between fruit and vegetable consumption and preeclampsia risk changes under different seed values. We fit an augmented inverse probability weighted estimator with two Super Learner algorithms: a simple algorithm including random forests and single-layer neural networks and a more complex algorithm with a mix of tree-based, regression-based, penalized, and simple algorithms. We evaluated the distributions of risk differences, standard errors, and P values that result from 5000 different seed value selections.
Results: Our findings suggest important variability in the risk difference estimates, as well as an important effect of the stacking algorithm used. The interquartile range width of the risk differences in the full sample with the simple algorithm was 13 per 1000. However, all other interquartile ranges were roughly an order of magnitude lower. The medians of the distributions of risk differences differed according to the sample size and the algorithm used.
Conclusions: Our findings add another dimension of concern regarding the potential for "p-hacking," and further warrant the need to move away from simplistic evidentiary thresholds in empirical research. When empirical results depend on pseudo-random number generator seed values, caution is warranted in interpreting these results.
{"title":"Pseudo-random Number Generator Influences on Average Treatment Effect Estimates Obtained with Machine Learning.","authors":"Ashley I Naimi, Ya-Hui Yu, Lisa M Bodnar","doi":"10.1097/EDE.0000000000001785","DOIUrl":"10.1097/EDE.0000000000001785","url":null,"abstract":"<p><strong>Background: </strong>The use of machine learning to estimate exposure effects introduces a dependence between the results of an empirical study and the value of the seed used to fix the pseudo-random number generator.</p><p><strong>Methods: </strong>We used data from 10,038 pregnant women and a 10% subsample (N = 1004) to examine the extent to which the risk difference for the relation between fruit and vegetable consumption and preeclampsia risk changes under different seed values. We fit an augmented inverse probability weighted estimator with two Super Learner algorithms: a simple algorithm including random forests and single-layer neural networks and a more complex algorithm with a mix of tree-based, regression-based, penalized, and simple algorithms. We evaluated the distributions of risk differences, standard errors, and P values that result from 5000 different seed value selections.</p><p><strong>Results: </strong>Our findings suggest important variability in the risk difference estimates, as well as an important effect of the stacking algorithm used. The interquartile range width of the risk differences in the full sample with the simple algorithm was 13 per 1000. However, all other interquartile ranges were roughly an order of magnitude lower. The medians of the distributions of risk differences differed according to the sample size and the algorithm used.</p><p><strong>Conclusions: </strong>Our findings add another dimension of concern regarding the potential for \"p-hacking,\" and further warrant the need to move away from simplistic evidentiary thresholds in empirical research. When empirical results depend on pseudo-random number generator seed values, caution is warranted in interpreting these results.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"779-786"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11560583/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141992227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-09-30DOI: 10.1097/EDE.0000000000001786
Etsuji Suzuki, Eiji Yamamoto
{"title":"Re: Bias in Calculation of Attributable Fractions Using Relative Risks from Nonsmokers Only.","authors":"Etsuji Suzuki, Eiji Yamamoto","doi":"10.1097/EDE.0000000000001786","DOIUrl":"10.1097/EDE.0000000000001786","url":null,"abstract":"","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"e21-e22"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142003946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-07-23DOI: 10.1097/EDE.0000000000001772
Paige A Bommarito, Sophia M Blaauwendraad, Danielle R Stevens, Michiel A van den Dries, Suzanne Spaan, Anjoeka Pronk, Henning Tiemeier, Romy Gaillard, Leonardo Trasande, Vincent V W Jaddoe, Kelly K Ferguson
Introduction: Prenatal exposure to nonpersistent chemicals, including organophosphate pesticides, phthalates, and bisphenols, is associated with altered fetal and childhood growth. Few studies have examined these associations using longitudinal growth trajectories or considering exposure to chemical mixtures.
Methods: Among 777 participants from the Generation R Study, we used growth mixture models to identify weight and body mass index trajectories using weight and height measures collected from the prenatal period to age 13. We measured exposure biomarkers for organophosphate pesticides, phthalates, and bisphenols in maternal urine at three timepoints during pregnancy. Multinomial logistic regression was used to estimate associations between averaged exposure biomarker concentrations and growth trajectories. We used quantile g-computation to estimate joint associations with growth trajectories.
Results: Phthalic acid (OR = 1.4; 95% CI = 1.01, 1.9) and bisphenol A (OR = 1.5; 95% CI = 1.0, 2.2) were associated with higher odds of a growth trajectory characterized by smaller prenatal and larger childhood weight relative to a referent trajectory of larger prenatal and average childhood weight. Biomarkers of organophosphate pesticides, individually and jointly, were associated with lower odds of a growth trajectory characterized by average prenatal and lower childhood weight.
Conclusions: Exposure to phthalates and bisphenol A was positively associated with a weight trajectory characterized by lower prenatal and higher childhood weight, while exposure to organophosphate pesticides was negatively associated with a trajectory of average prenatal and lower childhood weight. This study is consistent with the hypothesis that nonpersistent chemical exposures disrupt growth trajectories from the prenatal period through childhood.
导言:产前接触非持久性化学物质(包括有机磷农药、邻苯二甲酸盐和双酚)与胎儿和儿童的生长变化有关。很少有研究利用纵向生长轨迹或考虑化学混合物的暴露来研究这些关联:方法:在 R 世代研究的 777 名参与者中,我们使用生长混合物模型,利用从产前到 13 岁期间收集的体重和身高测量数据来确定体重和体重指数(BMI)轨迹。我们在孕期的三个时间点测量了母体尿液中有机磷农药、邻苯二甲酸盐和双酚的暴露生物标志物。我们采用多项式逻辑回归来估计平均暴露生物标记物浓度与生长轨迹之间的关联。我们使用量子 g 计算方法来估计与生长轨迹的联合关联:结果:邻苯二甲酸(OR:1.4,95% CI:1.01,1.9)和双酚 A(BPA;OR:1.5,95% CI:1.0,2.2)与出生前体重较小、儿童期体重较大的生长轨迹相关,而参考轨迹为出生前体重较大、儿童期体重一般。有机磷农药的生物标志物(单独或共同)与以平均产前体重和较低儿童体重为特征的较低生长轨迹几率相关:结论:接触邻苯二甲酸盐和双酚 A 与以较低的产前体重和较高的儿童期体重为特征的体重轨迹呈正相关,而接触有机磷农药与平均的产前体重和较低的儿童期体重轨迹呈负相关。这项研究与非持久性化学物质暴露会扰乱从产前到儿童期的生长轨迹这一假设是一致的。
{"title":"Prenatal Exposure to Nonpersistent Chemicals and Fetal-to-childhood Growth Trajectories.","authors":"Paige A Bommarito, Sophia M Blaauwendraad, Danielle R Stevens, Michiel A van den Dries, Suzanne Spaan, Anjoeka Pronk, Henning Tiemeier, Romy Gaillard, Leonardo Trasande, Vincent V W Jaddoe, Kelly K Ferguson","doi":"10.1097/EDE.0000000000001772","DOIUrl":"10.1097/EDE.0000000000001772","url":null,"abstract":"<p><strong>Introduction: </strong>Prenatal exposure to nonpersistent chemicals, including organophosphate pesticides, phthalates, and bisphenols, is associated with altered fetal and childhood growth. Few studies have examined these associations using longitudinal growth trajectories or considering exposure to chemical mixtures.</p><p><strong>Methods: </strong>Among 777 participants from the Generation R Study, we used growth mixture models to identify weight and body mass index trajectories using weight and height measures collected from the prenatal period to age 13. We measured exposure biomarkers for organophosphate pesticides, phthalates, and bisphenols in maternal urine at three timepoints during pregnancy. Multinomial logistic regression was used to estimate associations between averaged exposure biomarker concentrations and growth trajectories. We used quantile g-computation to estimate joint associations with growth trajectories.</p><p><strong>Results: </strong>Phthalic acid (OR = 1.4; 95% CI = 1.01, 1.9) and bisphenol A (OR = 1.5; 95% CI = 1.0, 2.2) were associated with higher odds of a growth trajectory characterized by smaller prenatal and larger childhood weight relative to a referent trajectory of larger prenatal and average childhood weight. Biomarkers of organophosphate pesticides, individually and jointly, were associated with lower odds of a growth trajectory characterized by average prenatal and lower childhood weight.</p><p><strong>Conclusions: </strong>Exposure to phthalates and bisphenol A was positively associated with a weight trajectory characterized by lower prenatal and higher childhood weight, while exposure to organophosphate pesticides was negatively associated with a trajectory of average prenatal and lower childhood weight. This study is consistent with the hypothesis that nonpersistent chemical exposures disrupt growth trajectories from the prenatal period through childhood.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"874-884"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11444368/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141751374","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-07-26DOI: 10.1097/EDE.0000000000001774
Amelia K Wesselink, Emma L Gause, Keith D Spangler, Perry Hystad, Kipruto Kirwa, Mary D Willis, Gregory A Wellenius, Lauren A Wise
Background: Few epidemiologic studies have examined the association of ambient heat with spontaneous abortion, a common and devastating pregnancy outcome.
Methods: We conducted a case-crossover study nested within Pregnancy Study Online, a preconception cohort study (2013-2022). We included all participants reporting spontaneous abortion (N = 1,524). We defined the case window as the 7 days preceding the event and used time-stratified referent selection to select control windows matched on calendar month and day of week. Within each 7-day case and control window, we measured the mean, maximum, and minimum of daily maximum outdoor air temperatures. We fit splines to examine nonlinear relationships across the entire year and conditional logistic regression to estimate odds ratios (ORs) and 95% confidence interval (CI) of spontaneous abortion with increases in temperature during the warm season (May-September) and decreases during the cool season (November-March).
Results: We found evidence of a U-shaped association between outdoor air temperature and spontaneous abortion risk based on year-round data. When restricting to warm season events (n = 657), the OR for a 10-percentile increase in the mean of lag 0-6 daily maximum temperatures was 1.1 (95% CI: 0.96, 1.2) and, for the maximum, 1.1 (95% CI: 0.99, 1.2). The OR associated with any extreme heat days (>95th county-specific percentile) in the preceding week was 1.2 (95% CI: 0.95, 1.5). Among cool season events (n = 615), there was no appreciable association between lower temperatures and spontaneous abortion risk.
Conclusion: Our study provides evidence of an association between high outdoor temperatures and the incidence of spontaneous abortion.
{"title":"Exposure to Ambient Heat and Risk of Spontaneous Abortion: A Case-Crossover Study.","authors":"Amelia K Wesselink, Emma L Gause, Keith D Spangler, Perry Hystad, Kipruto Kirwa, Mary D Willis, Gregory A Wellenius, Lauren A Wise","doi":"10.1097/EDE.0000000000001774","DOIUrl":"10.1097/EDE.0000000000001774","url":null,"abstract":"<p><strong>Background: </strong>Few epidemiologic studies have examined the association of ambient heat with spontaneous abortion, a common and devastating pregnancy outcome.</p><p><strong>Methods: </strong>We conducted a case-crossover study nested within Pregnancy Study Online, a preconception cohort study (2013-2022). We included all participants reporting spontaneous abortion (N = 1,524). We defined the case window as the 7 days preceding the event and used time-stratified referent selection to select control windows matched on calendar month and day of week. Within each 7-day case and control window, we measured the mean, maximum, and minimum of daily maximum outdoor air temperatures. We fit splines to examine nonlinear relationships across the entire year and conditional logistic regression to estimate odds ratios (ORs) and 95% confidence interval (CI) of spontaneous abortion with increases in temperature during the warm season (May-September) and decreases during the cool season (November-March).</p><p><strong>Results: </strong>We found evidence of a U-shaped association between outdoor air temperature and spontaneous abortion risk based on year-round data. When restricting to warm season events (n = 657), the OR for a 10-percentile increase in the mean of lag 0-6 daily maximum temperatures was 1.1 (95% CI: 0.96, 1.2) and, for the maximum, 1.1 (95% CI: 0.99, 1.2). The OR associated with any extreme heat days (>95th county-specific percentile) in the preceding week was 1.2 (95% CI: 0.95, 1.5). Among cool season events (n = 615), there was no appreciable association between lower temperatures and spontaneous abortion risk.</p><p><strong>Conclusion: </strong>Our study provides evidence of an association between high outdoor temperatures and the incidence of spontaneous abortion.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"864-873"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141765750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-08-14DOI: 10.1097/EDE.0000000000001776
Adway S Wadekar, Jerome P Reiter
Surveys are commonly used to facilitate research in epidemiology, health, and the social and behavioral sciences. Often, these surveys are not simple random samples, and respondents are given weights reflecting their probability of selection into the survey. We show that using survey weights can be beneficial for evaluating the quality of predictive models when splitting data into training and test sets. In particular, we characterize model assessment statistics, such as sensitivity and specificity, as finite population quantities and compute survey-weighted estimates of these quantities with test data comprising a random subset of the original data. Using simulations with data from the National Survey on Drug Use and Health and the National Comorbidity Survey, we show that unweighted metrics estimated with sample test data can misrepresent population performance, but weighted metrics appropriately adjust for the complex sampling design. We also show that this conclusion holds for models trained using upsampling for mitigating class imbalance. The results suggest that weighted metrics should be used when evaluating performance on test data derived from complex surveys.
{"title":"Evaluating Binary Outcome Classifiers Estimated from Survey Data.","authors":"Adway S Wadekar, Jerome P Reiter","doi":"10.1097/EDE.0000000000001776","DOIUrl":"10.1097/EDE.0000000000001776","url":null,"abstract":"<p><p>Surveys are commonly used to facilitate research in epidemiology, health, and the social and behavioral sciences. Often, these surveys are not simple random samples, and respondents are given weights reflecting their probability of selection into the survey. We show that using survey weights can be beneficial for evaluating the quality of predictive models when splitting data into training and test sets. In particular, we characterize model assessment statistics, such as sensitivity and specificity, as finite population quantities and compute survey-weighted estimates of these quantities with test data comprising a random subset of the original data. Using simulations with data from the National Survey on Drug Use and Health and the National Comorbidity Survey, we show that unweighted metrics estimated with sample test data can misrepresent population performance, but weighted metrics appropriately adjust for the complex sampling design. We also show that this conclusion holds for models trained using upsampling for mitigating class imbalance. The results suggest that weighted metrics should be used when evaluating performance on test data derived from complex surveys.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"805-812"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141975420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-08-16DOI: 10.1097/EDE.0000000000001783
Paul N Zivich
{"title":"Commentary: The Seedy Side of Causal Effect Estimation with Machine Learning.","authors":"Paul N Zivich","doi":"10.1097/EDE.0000000000001783","DOIUrl":"10.1097/EDE.0000000000001783","url":null,"abstract":"","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"787-790"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141992226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-08-09DOI: 10.1097/EDE.0000000000001781
Chad W Milando, Yuantong Sun, Yasmin Romitti, Amruta Nori-Sarma, Emma L Gause, Keith R Spangler, Ian Sue Wing, Gregory A Wellenius
Background: Extreme ambient heat is unambiguously associated with a higher risk of illness and death. The Optum Labs Data Warehouse (OLDW), a database of medical claims from US-based patients with commercial or Medicare Advantage health insurance, has been used to quantify heat-related health impacts. Whether results for the insured subpopulation are generalizable to the broader population has, to our knowledge, not been documented. We sought to address this question, for the US population in California from 2012 to 2019.
Methods: We examined changes in daily rates of emergency department encounters and in-patient hospitalization encounters for all-causes, heat-related outcomes, renal disease, mental/behavioral disorders, cardiovascular disease, and respiratory disease. OLDW was the source of health data for insured individuals in California, and health data for the broader population were gathered from the California Department of Health Care Access and Information. We defined extreme heat exposure as any day in a group of 2 or more days with maximum temperatures exceeding the county-specific 97.5th percentile and used a space-time-stratified case-crossover design to assess and compare the impacts of heat on health.
Results: Average incidence rates of medical encounters differed by dataset. However, rate ratios for emergency department encounters were similar across datasets for all causes [ratio of incidence rate ratios (rIRR) = 0.989; 95% confidence interval (CI) = 0.969, 1.009], heat-related causes (rIRR = 1.080; 95% CI = 0.999, 1.168), renal disease (rIRR = 0.963; 95% CI = 0.718, 1.292), and mental health disorders (rIRR = 1.098; 95% CI = 1.004, 1.201). Rate ratios for inpatient encounters were also similar.
Conclusions: This work presents evidence that OLDW can continue to be a resource for estimating the health impacts of extreme heat.
{"title":"Generalizability of Heat-related Health Risk Associations Observed in a Large Healthcare Claims Database of Patients with Commercial Health Insurance.","authors":"Chad W Milando, Yuantong Sun, Yasmin Romitti, Amruta Nori-Sarma, Emma L Gause, Keith R Spangler, Ian Sue Wing, Gregory A Wellenius","doi":"10.1097/EDE.0000000000001781","DOIUrl":"10.1097/EDE.0000000000001781","url":null,"abstract":"<p><strong>Background: </strong>Extreme ambient heat is unambiguously associated with a higher risk of illness and death. The Optum Labs Data Warehouse (OLDW), a database of medical claims from US-based patients with commercial or Medicare Advantage health insurance, has been used to quantify heat-related health impacts. Whether results for the insured subpopulation are generalizable to the broader population has, to our knowledge, not been documented. We sought to address this question, for the US population in California from 2012 to 2019.</p><p><strong>Methods: </strong>We examined changes in daily rates of emergency department encounters and in-patient hospitalization encounters for all-causes, heat-related outcomes, renal disease, mental/behavioral disorders, cardiovascular disease, and respiratory disease. OLDW was the source of health data for insured individuals in California, and health data for the broader population were gathered from the California Department of Health Care Access and Information. We defined extreme heat exposure as any day in a group of 2 or more days with maximum temperatures exceeding the county-specific 97.5th percentile and used a space-time-stratified case-crossover design to assess and compare the impacts of heat on health.</p><p><strong>Results: </strong>Average incidence rates of medical encounters differed by dataset. However, rate ratios for emergency department encounters were similar across datasets for all causes [ratio of incidence rate ratios (rIRR) = 0.989; 95% confidence interval (CI) = 0.969, 1.009], heat-related causes (rIRR = 1.080; 95% CI = 0.999, 1.168), renal disease (rIRR = 0.963; 95% CI = 0.718, 1.292), and mental health disorders (rIRR = 1.098; 95% CI = 1.004, 1.201). Rate ratios for inpatient encounters were also similar.</p><p><strong>Conclusions: </strong>This work presents evidence that OLDW can continue to be a resource for estimating the health impacts of extreme heat.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"844-852"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7616519/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141909829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-09-30DOI: 10.1097/EDE.0000000000001784
Ruta Margelyte, Maria Theresa Redaniel, Scott R Walter, Yvette Pyne, Sam Merriel, John Macleod, Kate Northstone, Kate Tilling
Background: Human papillomavirus (HPV) vaccination has been offered in over a hundred countries worldwide (including the United Kingdom, since September 2008). Controversy around adverse effects persists, with inconsistent evidence from follow-up of randomized controlled trials and confounding by indication limiting the conclusions drawn from larger-scale observational studies. This study aims to estimate the association between receiving a quadrivalent HPV vaccine and the reporting of short-term adverse effects and to demonstrate the utility of regression discontinuity design for examining side effects in routine data.
Methods: We applied a novel regression discontinuity approach to a retrospective population-based cohort using primary care data from the UK Clinical Practice Research Datalink linked to hospital and social deprivation data. We examined the new onset of gastrointestinal, neuromuscular, pain, and headache/migraine symptoms using READ and International Classification of Diseases, tenth revision diagnostic codes. For each year between 2012 and 2017, we compared girls in school year 8 (born July/August) who were eligible to receive the vaccine with girls in year 7 (born September/October) who were not eligible.
Results: Of the 21,853 adolescent girls in the cohort, 10,881 (50%) were eligible for HPV vaccination. There was no evidence of increased new gastrointestinal symptoms (adjusted odds ratio [OR]: 0.99; 95% confidence interval [CI]: 0.85, 1.15), headache/migraine symptoms (OR: 0.84; 95% CI: 0.70, 1.01), or pain symptoms (OR: 1.05; 95% CI: 0.95, 1.16) when comparing those eligible and ineligible for HPV vaccination.
Conclusion: This study found no evidence that HPV vaccination eligibility is associated with reporting short-term adverse effects among adolescent girls.
{"title":"Investigating the Potential Short-term Adverse Effects of the Quadrivalent Human Papillomavirus Vaccine: A Novel Regression Discontinuity Analysis.","authors":"Ruta Margelyte, Maria Theresa Redaniel, Scott R Walter, Yvette Pyne, Sam Merriel, John Macleod, Kate Northstone, Kate Tilling","doi":"10.1097/EDE.0000000000001784","DOIUrl":"10.1097/EDE.0000000000001784","url":null,"abstract":"<p><strong>Background: </strong>Human papillomavirus (HPV) vaccination has been offered in over a hundred countries worldwide (including the United Kingdom, since September 2008). Controversy around adverse effects persists, with inconsistent evidence from follow-up of randomized controlled trials and confounding by indication limiting the conclusions drawn from larger-scale observational studies. This study aims to estimate the association between receiving a quadrivalent HPV vaccine and the reporting of short-term adverse effects and to demonstrate the utility of regression discontinuity design for examining side effects in routine data.</p><p><strong>Methods: </strong>We applied a novel regression discontinuity approach to a retrospective population-based cohort using primary care data from the UK Clinical Practice Research Datalink linked to hospital and social deprivation data. We examined the new onset of gastrointestinal, neuromuscular, pain, and headache/migraine symptoms using READ and International Classification of Diseases, tenth revision diagnostic codes. For each year between 2012 and 2017, we compared girls in school year 8 (born July/August) who were eligible to receive the vaccine with girls in year 7 (born September/October) who were not eligible.</p><p><strong>Results: </strong>Of the 21,853 adolescent girls in the cohort, 10,881 (50%) were eligible for HPV vaccination. There was no evidence of increased new gastrointestinal symptoms (adjusted odds ratio [OR]: 0.99; 95% confidence interval [CI]: 0.85, 1.15), headache/migraine symptoms (OR: 0.84; 95% CI: 0.70, 1.01), or pain symptoms (OR: 1.05; 95% CI: 0.95, 1.16) when comparing those eligible and ineligible for HPV vaccination.</p><p><strong>Conclusion: </strong>This study found no evidence that HPV vaccination eligibility is associated with reporting short-term adverse effects among adolescent girls.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":"35 6","pages":"813-822"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11444347/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142603888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-11-01Epub Date: 2024-08-01DOI: 10.1097/EDE.0000000000001777
Jemar R Bather, Taylor J Robinson, Melody S Goodman
Background: Little attention has been devoted to framing multiple continuous social variables as a "mixture" for social epidemiologic analysis. We propose using the Bayesian kernel machine regression analytic framework that yields univariate, bivariate, and overall exposure mixture effects.
Methods: Using data from the 2023 Survey of Racism and Public Health, we conducted a Bayesian kernel machine regression analysis to study several individual, social, and structural factors as an exposure mixture and their relationships with psychological distress among individuals with at least one police arrest. Factors included racial and economic polarization, neighborhood deprivation, perceived discrimination, police perception, subjective social status, and substance use. We complemented this analysis with a series of unadjusted and adjusted models for each exposure mixture variable.
Results: We found that more self-reported discrimination experiences in the past year (posterior inclusion probability = 1.00) and greater substance use (posterior inclusion probability = 1.00) correlated with higher psychological distress. These associations were consistent with the findings from the unadjusted and adjusted linear regression analyses: past year perceived discrimination (unadjusted b = 2.58, 95% confidence interval [CI]: 1.86, 3.30; adjusted b = 2.20, 95% CI: 1.45, 2.94) and substance use (unadjusted b = 2.92, 95% CI: 2.21, 3.62; adjusted b = 2.59, 95% CI: 1.87, 3.31).
Conclusion: With the rise of big data and the expansion of variables in long-standing cohort and census studies, novel applications of methods from adjacent disciplines are a step forward in identifying exposure mixture associations in social epidemiology and addressing the health needs of socially vulnerable populations.
背景:在社会流行病学分析中,很少有人关注将多个连续社会变量作为 "混合物 "进行分析。我们建议使用贝叶斯核机器回归分析框架,该框架可产生单变量、双变量和总体暴露混合效应:利用 2023 年种族主义与公共健康调查的数据,我们进行了贝叶斯核机器回归分析,以研究作为暴露混合物的若干个人、社会和结构因素及其与至少有一次被警方逮捕的个人的心理困扰之间的关系。这些因素包括种族和经济两极分化、邻里贫困、歧视感知、警察感知、主观社会地位和药物使用。我们针对每个暴露混合变量建立了一系列未调整和调整模型,对上述分析进行了补充:我们发现,过去一年中自我报告的歧视经历越多(后纳入概率 = 1.00),药物使用越多(后纳入概率 = 1.00),心理压力就越大。这些关联与未调整和调整线性回归分析的结果一致:过去一年感知到的歧视(未调整 b = 2.58,95% CI:1.86,3.30;调整 b = 2.20,95% CI:1.45,2.94)和药物使用(未调整 b = 2.92,95% CI:2.21,3.62;调整 b = 2.59,95% CI:1.87,3.31):随着大数据的兴起以及长期队列和普查研究变量的扩大,相邻学科方法的新颖应用在确定社会流行病学中的暴露混合物关联和满足社会弱势群体的健康需求方面向前迈进了一步。
{"title":"Bayesian Kernel Machine Regression for Social Epidemiologic Research.","authors":"Jemar R Bather, Taylor J Robinson, Melody S Goodman","doi":"10.1097/EDE.0000000000001777","DOIUrl":"10.1097/EDE.0000000000001777","url":null,"abstract":"<p><strong>Background: </strong>Little attention has been devoted to framing multiple continuous social variables as a \"mixture\" for social epidemiologic analysis. We propose using the Bayesian kernel machine regression analytic framework that yields univariate, bivariate, and overall exposure mixture effects.</p><p><strong>Methods: </strong>Using data from the 2023 Survey of Racism and Public Health, we conducted a Bayesian kernel machine regression analysis to study several individual, social, and structural factors as an exposure mixture and their relationships with psychological distress among individuals with at least one police arrest. Factors included racial and economic polarization, neighborhood deprivation, perceived discrimination, police perception, subjective social status, and substance use. We complemented this analysis with a series of unadjusted and adjusted models for each exposure mixture variable.</p><p><strong>Results: </strong>We found that more self-reported discrimination experiences in the past year (posterior inclusion probability = 1.00) and greater substance use (posterior inclusion probability = 1.00) correlated with higher psychological distress. These associations were consistent with the findings from the unadjusted and adjusted linear regression analyses: past year perceived discrimination (unadjusted b = 2.58, 95% confidence interval [CI]: 1.86, 3.30; adjusted b = 2.20, 95% CI: 1.45, 2.94) and substance use (unadjusted b = 2.92, 95% CI: 2.21, 3.62; adjusted b = 2.59, 95% CI: 1.87, 3.31).</p><p><strong>Conclusion: </strong>With the rise of big data and the expansion of variables in long-standing cohort and census studies, novel applications of methods from adjacent disciplines are a step forward in identifying exposure mixture associations in social epidemiology and addressing the health needs of socially vulnerable populations.</p>","PeriodicalId":11779,"journal":{"name":"Epidemiology","volume":" ","pages":"735-747"},"PeriodicalIF":4.7,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141859349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}