Applying machine learning-based multiple imputation methods to nonparametric multiple comparisons in longitudinal clinical studies.

IF 1.2 4区 医学 Q4 PHARMACOLOGY & PHARMACY Journal of Biopharmaceutical Statistics Pub Date : 2024-12-21 DOI:10.1080/10543406.2024.2444243
Tuncay Yanarateş, Erdem Karabulut
{"title":"Applying machine learning-based multiple imputation methods to nonparametric multiple comparisons in longitudinal clinical studies.","authors":"Tuncay Yanarateş, Erdem Karabulut","doi":"10.1080/10543406.2024.2444243","DOIUrl":null,"url":null,"abstract":"<p><p>Dependent samples, in which repeated measurements are made on the same subjects, eliminate potential differences among the subjects. In k-dependent samples, missing data can occur for various reasons. The Skillings-Mack test is used instead of the Friedman test for k-dependent samples with missing observations that are non-normally distributed. If a significant difference exists among groups, nonparametric multiple comparisons need to be performed. In this study, we propose an innovative approach by applying four methods to nonparametric multiple comparisons of incomplete k-dependent samples that are non-normally distributed. The four methods are two nonparametric multiple imputation methods based on machine learning (multiple imputations by chained equations utilizing classification and regression trees (MICE-CART) and random forest (MICE-RF)), one nonparametric imputation method (random hot deck imputation), and the listwise deletion method. We compare the four methods under two missing data mechanisms, four correlation coefficients, two sample sizes, and three percentages of missingness. After implementing different scenarios in a simulation study, the listwise deletion method is inferior to the other methods. MICE-CART and MICE-RF are superior to the other methods for moderate and small sample sizes with well-controlled type 1 error. The two nonparametric multiple imputation methods based on machine learning can be applied to nonparametric multiple comparisons. Therefore, we propose machine learning-based multiple imputation methods for nonparametric multiple comparisons of k-dependent samples with missing observations. The approach was also illustrated with a longitudinal dentistry clinical trial.</p>","PeriodicalId":54870,"journal":{"name":"Journal of Biopharmaceutical Statistics","volume":" ","pages":"1-12"},"PeriodicalIF":1.2000,"publicationDate":"2024-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biopharmaceutical Statistics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/10543406.2024.2444243","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0

Abstract

Dependent samples, in which repeated measurements are made on the same subjects, eliminate potential differences among the subjects. In k-dependent samples, missing data can occur for various reasons. The Skillings-Mack test is used instead of the Friedman test for k-dependent samples with missing observations that are non-normally distributed. If a significant difference exists among groups, nonparametric multiple comparisons need to be performed. In this study, we propose an innovative approach by applying four methods to nonparametric multiple comparisons of incomplete k-dependent samples that are non-normally distributed. The four methods are two nonparametric multiple imputation methods based on machine learning (multiple imputations by chained equations utilizing classification and regression trees (MICE-CART) and random forest (MICE-RF)), one nonparametric imputation method (random hot deck imputation), and the listwise deletion method. We compare the four methods under two missing data mechanisms, four correlation coefficients, two sample sizes, and three percentages of missingness. After implementing different scenarios in a simulation study, the listwise deletion method is inferior to the other methods. MICE-CART and MICE-RF are superior to the other methods for moderate and small sample sizes with well-controlled type 1 error. The two nonparametric multiple imputation methods based on machine learning can be applied to nonparametric multiple comparisons. Therefore, we propose machine learning-based multiple imputation methods for nonparametric multiple comparisons of k-dependent samples with missing observations. The approach was also illustrated with a longitudinal dentistry clinical trial.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
将基于机器学习的多重输入方法应用于纵向临床研究的非参数多重比较。
依赖样本,即对同一受试者进行重复测量,消除受试者之间的潜在差异。在依赖k的样本中,由于各种原因可能会出现数据缺失。对于缺少非正态分布观测值的k相关样本,使用Skillings-Mack检验代替Friedman检验。如果组间存在显著差异,则需要进行非参数多重比较。在本研究中,我们提出了一种创新的方法,将四种方法应用于非正态分布的不完全k依赖样本的非参数多重比较。这四种方法是基于机器学习的两种非参数多重插值方法(利用分类和回归树(MICE-CART)和随机森林(MICE-RF)的链式方程多重插值),一种非参数插值方法(随机热层插值)和列表删除方法。我们在两种缺失数据机制、四种相关系数、两种样本量和三种缺失百分比下比较了四种方法。在模拟研究中实现不同的场景后,列表删除法比其他方法更差。MICE-CART和MICE-RF在中小样本量和1型误差控制良好的情况下优于其他方法。这两种基于机器学习的非参数多重插值方法可以应用于非参数多重比较。因此,我们提出了基于机器学习的多重插值方法,用于缺失观测值的k相关样本的非参数多重比较。该方法也与纵向牙科临床试验说明。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Biopharmaceutical Statistics
Journal of Biopharmaceutical Statistics 医学-统计学与概率论
CiteScore
2.50
自引率
18.20%
发文量
71
审稿时长
6-12 weeks
期刊介绍: The Journal of Biopharmaceutical Statistics, a rapid publication journal, discusses quality applications of statistics in biopharmaceutical research and development. Now publishing six times per year, it includes expositions of statistical methodology with immediate applicability to biopharmaceutical research in the form of full-length and short manuscripts, review articles, selected/invited conference papers, short articles, and letters to the editor. Addressing timely and provocative topics important to the biostatistical profession, the journal covers: Drug, device, and biological research and development; Drug screening and drug design; Assessment of pharmacological activity; Pharmaceutical formulation and scale-up; Preclinical safety assessment; Bioavailability, bioequivalence, and pharmacokinetics; Phase, I, II, and III clinical development including complex innovative designs; Premarket approval assessment of clinical safety; Postmarketing surveillance; Big data and artificial intelligence and applications.
期刊最新文献
Sequential monitoring of cancer immunotherapy trial with random delayed treatment effect. Directed Acyclic Graph Assisted Method For Estimating Average Treatment Effect. Bayesian phase II adaptive randomization by jointly modeling efficacy and toxicity as time-to-event outcomes. Interval estimation of relative risks for combined unilateral and bilateral correlated data. Sample size estimation for recurrent event data using multifrailty and multilevel survival models.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1