Connor J McCabe, Jonathan L Helm, Max A Halvorson, Kieran J Blaikie, Christine M Lee, Isaac C Rhew
{"title":"Estimating substance use disparities across intersectional social positions using machine learning: An application of group-lasso interaction network.","authors":"Connor J McCabe, Jonathan L Helm, Max A Halvorson, Kieran J Blaikie, Christine M Lee, Isaac C Rhew","doi":"10.1037/adb0001020","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>An aim of quantitative intersectional research is to model the joint impact of multiple social positions on health risk behaviors. Although moderated multiple regression is frequently used to pursue intersectional research hypotheses, such parametric approaches may produce unreliable effect estimates due to data sparsity and high dimensionality. Machine learning provides viable alternatives, offering greater flexibility in evaluating many candidate interactions amid sparse data conditions, yet remains rarely employed. This study introduces group-lasso interaction network (glinternet), a novel machine learning approach involving hierarchical regularization, to assess intersectional differences in substance use prevalence.</p><p><strong>Method: </strong>Utilizing variable selection and parameter stabilization functionality for main and interaction effects, glinternet was employed to examine two-way interactions between three primary social positions (gender, sexual orientation, and race) predicting heavy episodic drinking, cannabis use, and cigarette use prevalence. Analyses were conducted using the All of Us Research Program (<i>N</i> = 283,403), a national sample with high representation from populations historically underrepresented in biomedical research. Results were replicated using holdout cross-validation and compared against logistic regression estimates.</p><p><strong>Results: </strong>Glinternet prevalence estimates were more stable across discovery and replication samples relative to logistic regression, particularly among sparsely represented groups. Prevalence estimates for cigarette and cannabis use were elevated among sexual minority and White cisgender women compared to heterosexual and non-White women, respectively.</p><p><strong>Conclusions: </strong>Glinternet may improve upon traditional moderated multiple regression methods for pursuing intersectional hypotheses by improving model parsimony and parameter stability, providing novel means for quantifying health disparities among intersectional social positions. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":48325,"journal":{"name":"Psychology of Addictive Behaviors","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychology of Addictive Behaviors","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/adb0001020","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: An aim of quantitative intersectional research is to model the joint impact of multiple social positions on health risk behaviors. Although moderated multiple regression is frequently used to pursue intersectional research hypotheses, such parametric approaches may produce unreliable effect estimates due to data sparsity and high dimensionality. Machine learning provides viable alternatives, offering greater flexibility in evaluating many candidate interactions amid sparse data conditions, yet remains rarely employed. This study introduces group-lasso interaction network (glinternet), a novel machine learning approach involving hierarchical regularization, to assess intersectional differences in substance use prevalence.
Method: Utilizing variable selection and parameter stabilization functionality for main and interaction effects, glinternet was employed to examine two-way interactions between three primary social positions (gender, sexual orientation, and race) predicting heavy episodic drinking, cannabis use, and cigarette use prevalence. Analyses were conducted using the All of Us Research Program (N = 283,403), a national sample with high representation from populations historically underrepresented in biomedical research. Results were replicated using holdout cross-validation and compared against logistic regression estimates.
Results: Glinternet prevalence estimates were more stable across discovery and replication samples relative to logistic regression, particularly among sparsely represented groups. Prevalence estimates for cigarette and cannabis use were elevated among sexual minority and White cisgender women compared to heterosexual and non-White women, respectively.
Conclusions: Glinternet may improve upon traditional moderated multiple regression methods for pursuing intersectional hypotheses by improving model parsimony and parameter stability, providing novel means for quantifying health disparities among intersectional social positions. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
期刊介绍:
Psychology of Addictive Behaviors publishes peer-reviewed original articles related to the psychological aspects of addictive behaviors. The journal includes articles on the following topics: - alcohol and alcoholism - drug use and abuse - eating disorders - smoking and nicotine addiction, and other excessive behaviors (e.g., gambling) Full-length research reports, literature reviews, brief reports, and comments are published.