Advancing Interpretable Regression Analysis for Binary Data: A Novel Distributed Algorithm Approach.

IF 1.8 4区 医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Statistics in Medicine Pub Date : 2024-11-03 DOI:10.1002/sim.10250
Jiayi Tong, Lu Li, Jenna Marie Reps, Vitaly Lorman, Naimin Jing, Mackenzie Edmondson, Xiwei Lou, Ravi Jhaveri, Kelly J Kelleher, Nathan M Pajor, Christopher B Forrest, Jiang Bian, Haitao Chu, Yong Chen
{"title":"Advancing Interpretable Regression Analysis for Binary Data: A Novel Distributed Algorithm Approach.","authors":"Jiayi Tong, Lu Li, Jenna Marie Reps, Vitaly Lorman, Naimin Jing, Mackenzie Edmondson, Xiwei Lou, Ravi Jhaveri, Kelly J Kelleher, Nathan M Pajor, Christopher B Forrest, Jiang Bian, Haitao Chu, Yong Chen","doi":"10.1002/sim.10250","DOIUrl":null,"url":null,"abstract":"<p><p>Sparse data bias, where there is a lack of sufficient cases, is a common problem in data analysis, particularly when studying rare binary outcomes. Although a two-step meta-analysis approach may be used to lessen the bias by combining the summary statistics to increase the number of cases from multiple studies, this method does not completely eliminate bias in effect estimation. In this paper, we propose a one-shot distributed algorithm for estimating relative risk using a modified Poisson regression for binary data, named ODAP-B. We evaluate the performance of our method through both simulation studies and real-world case analyses of postacute sequelae of SARS-CoV-2 infection in children using data from 184 501 children across eight national academic medical centers. Compared with the meta-analysis method, our method provides closer estimates of the relative risk for all outcomes considered including syndromic and systemic outcomes. Our method is communication-efficient and privacy-preserving, requiring only aggregated data to obtain relatively unbiased effect estimates compared with two-step meta-analysis methods. Overall, ODAP-B is an effective distributed learning algorithm for Poisson regression to study rare binary outcomes. The method provides inference on adjusted relative risk with a robust variance estimator.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":" ","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2024-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/sim.10250","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Sparse data bias, where there is a lack of sufficient cases, is a common problem in data analysis, particularly when studying rare binary outcomes. Although a two-step meta-analysis approach may be used to lessen the bias by combining the summary statistics to increase the number of cases from multiple studies, this method does not completely eliminate bias in effect estimation. In this paper, we propose a one-shot distributed algorithm for estimating relative risk using a modified Poisson regression for binary data, named ODAP-B. We evaluate the performance of our method through both simulation studies and real-world case analyses of postacute sequelae of SARS-CoV-2 infection in children using data from 184 501 children across eight national academic medical centers. Compared with the meta-analysis method, our method provides closer estimates of the relative risk for all outcomes considered including syndromic and systemic outcomes. Our method is communication-efficient and privacy-preserving, requiring only aggregated data to obtain relatively unbiased effect estimates compared with two-step meta-analysis methods. Overall, ODAP-B is an effective distributed learning algorithm for Poisson regression to study rare binary outcomes. The method provides inference on adjusted relative risk with a robust variance estimator.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
推进二元数据的可解释回归分析:一种新颖的分布式算法方法
稀疏数据偏差,即缺乏足够的病例,是数据分析中的一个常见问题,尤其是在研究罕见的二元结果时。虽然可以采用两步荟萃分析法,通过合并汇总统计数据来增加多项研究的病例数,从而减少偏倚,但这种方法并不能完全消除效应估计中的偏倚。在本文中,我们提出了一种使用改良泊松回归估计二元数据相对风险的单次分布式算法,命名为 ODAP-B。我们利用八个国家学术医疗中心 184 501 名儿童的数据,通过模拟研究和儿童感染 SARS-CoV-2 后急性后遗症的实际病例分析,评估了我们的方法的性能。与荟萃分析法相比,我们的方法对包括综合征和全身性结果在内的所有结果的相对风险估计更接近。与两步荟萃分析法相比,我们的方法只需要汇总数据,就能获得相对无偏的效应估计值,具有沟通效率高和保护隐私的特点。总体而言,ODAP-B 是一种有效的分布式学习算法,适用于研究罕见二元结局的泊松回归。该方法提供了具有稳健方差估计器的调整相对风险推断。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Statistics in Medicine
Statistics in Medicine 医学-公共卫生、环境卫生与职业卫生
CiteScore
3.40
自引率
10.00%
发文量
334
审稿时长
2-4 weeks
期刊介绍: The journal aims to influence practice in medicine and its associated sciences through the publication of papers on statistical and other quantitative methods. Papers will explain new methods and demonstrate their application, preferably through a substantive, real, motivating example or a comprehensive evaluation based on an illustrative example. Alternatively, papers will report on case-studies where creative use or technical generalizations of established methodology is directed towards a substantive application. Reviews of, and tutorials on, general topics relevant to the application of statistics to medicine will also be published. The main criteria for publication are appropriateness of the statistical methods to a particular medical problem and clarity of exposition. Papers with primarily mathematical content will be excluded. The journal aims to enhance communication between statisticians, clinicians and medical researchers.
期刊最新文献
A Novel Bayesian Spatio-Temporal Surveillance Metric to Predict Emerging Infectious Disease Areas of High Disease Risk. Does Remdesivir Lower COVID-19 Mortality? A Subgroup Analysis of Hospitalized Adults Receiving Supplemental Oxygen. Modeling Chronic Disease Mortality by Methods From Accelerated Life Testing. A Nonparametric Global Win Probability Approach to the Analysis and Sizing of Randomized Controlled Trials With Multiple Endpoints of Different Scales and Missing Data: Beyond O'Brien-Wei-Lachin. Causal Inference for Continuous Multiple Time Point Interventions.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1