An overview of modern machine learning methods for effect measure modification analyses in high-dimensional settings

IF 3.1 2区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Ssm-Population Health Pub Date : 2025-02-13 DOI:10.1016/j.ssmph.2025.101764
Michael Cheung, Anna Dimitrova, Tarik Benmarhnia
{"title":"An overview of modern machine learning methods for effect measure modification analyses in high-dimensional settings","authors":"Michael Cheung,&nbsp;Anna Dimitrova,&nbsp;Tarik Benmarhnia","doi":"10.1016/j.ssmph.2025.101764","DOIUrl":null,"url":null,"abstract":"<div><div>A primary concern of public health researchers involves identifying and quantifying heterogeneous exposure effects across population subgroups. Understanding the magnitude and direction of these effects on a given scale provides researchers the ability to recommend policy prescriptions and assess the external validity of findings. Traditional methods for effect measure modification analyses require manual model specification that is often impractical or not feasible to conduct in high-dimensional settings. Recent developments in machine learning aim to solve this issue by utilizing data-driven approaches to estimate heterogeneous exposure effects. However, these methods do not directly identify effect modifiers and estimate corresponding subgroup effects. Consequently, additional analysis techniques are required to use these methods in the context of effect measure modification analyses. While no data-driven method or technique can identify effect modifiers and domain expertise is still required, they may serve an important role in the discovery of vulnerable subgroups when prior knowledge is not available. We summarize and provide the intuition behind these machine learning methods and discuss how they may be employed for effect measure modification analyses to serve as a reference for public health researchers. We discuss their implementation in R with annotated syntax and demonstrate their application by assessing the heterogeneous effects of drought on stunting among children in the Demographic and Health survey data set as a case study.</div></div>","PeriodicalId":47780,"journal":{"name":"Ssm-Population Health","volume":"29 ","pages":"Article 101764"},"PeriodicalIF":3.1000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ssm-Population Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352827325000187","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

A primary concern of public health researchers involves identifying and quantifying heterogeneous exposure effects across population subgroups. Understanding the magnitude and direction of these effects on a given scale provides researchers the ability to recommend policy prescriptions and assess the external validity of findings. Traditional methods for effect measure modification analyses require manual model specification that is often impractical or not feasible to conduct in high-dimensional settings. Recent developments in machine learning aim to solve this issue by utilizing data-driven approaches to estimate heterogeneous exposure effects. However, these methods do not directly identify effect modifiers and estimate corresponding subgroup effects. Consequently, additional analysis techniques are required to use these methods in the context of effect measure modification analyses. While no data-driven method or technique can identify effect modifiers and domain expertise is still required, they may serve an important role in the discovery of vulnerable subgroups when prior knowledge is not available. We summarize and provide the intuition behind these machine learning methods and discuss how they may be employed for effect measure modification analyses to serve as a reference for public health researchers. We discuss their implementation in R with annotated syntax and demonstrate their application by assessing the heterogeneous effects of drought on stunting among children in the Demographic and Health survey data set as a case study.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
现代机器学习方法在高维环境下效果测量修正分析的概述
公共卫生研究人员主要关注的问题是确定和量化跨人群亚群的异质性暴露效应。在一定范围内了解这些影响的大小和方向,使研究人员能够推荐政策处方并评估研究结果的外部有效性。传统的效果度量修改分析方法需要手工模型说明,这在高维环境中通常是不切实际的或不可行的。机器学习的最新发展旨在通过利用数据驱动的方法来估计异质暴露效应来解决这个问题。然而,这些方法不能直接识别效果修饰因子和估计相应的子组效应。因此,在效应测量修正分析的背景下使用这些方法需要额外的分析技术。虽然没有数据驱动的方法或技术可以识别效果修饰符,并且仍然需要领域专业知识,但它们可能在没有先验知识的情况下在发现脆弱亚群方面发挥重要作用。我们总结并提供了这些机器学习方法背后的直觉,并讨论了如何将它们用于效果测量修改分析,以作为公共卫生研究人员的参考。我们用带注释的语法讨论了它们在R中的实现,并以人口与健康调查数据集为例,通过评估干旱对儿童发育迟缓的异质影响,展示了它们的应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Ssm-Population Health
Ssm-Population Health PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH-
CiteScore
6.50
自引率
2.10%
发文量
298
审稿时长
101 days
期刊介绍: SSM - Population Health. The new online only, open access, peer reviewed journal in all areas relating Social Science research to population health. SSM - Population Health shares the same Editors-in Chief and general approach to manuscripts as its sister journal, Social Science & Medicine. The journal takes a broad approach to the field especially welcoming interdisciplinary papers from across the Social Sciences and allied areas. SSM - Population Health offers an alternative outlet for work which might not be considered, or is classed as ''out of scope'' elsewhere, and prioritizes fast peer review and publication to the benefit of authors and readers. The journal welcomes all types of paper from traditional primary research articles, replication studies, short communications, methodological studies, instrument validation, opinion pieces, literature reviews, etc. SSM - Population Health also offers the opportunity to publish special issues or sections to reflect current interest and research in topical or developing areas. The journal fully supports authors wanting to present their research in an innovative fashion though the use of multimedia formats.
期刊最新文献
A case-based complexity approach to health inequality: Understanding and tracing place-based differences to enhance policy calibration. Countering the forgetting of novel health information with 'social boosting'. Housing distress and subsequent health and well-being among older adults: An outcome-wide longitudinal approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1