Simulating A/B testing versus SMART designs for LLM-driven patient engagement to close preventive care gaps

IF 12.4 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES NPJ Digital Medicine Pub Date : 2024-11-18 DOI:10.1038/s41746-024-01330-2

Sanjay Basu, Dean Schillinger, Sadiq Y. Patel, Joseph Rigdon

{"title":"Simulating A/B testing versus SMART designs for LLM-driven patient engagement to close preventive care gaps","authors":"Sanjay Basu, Dean Schillinger, Sadiq Y. Patel, Joseph Rigdon","doi":"10.1038/s41746-024-01330-2","DOIUrl":null,"url":null,"abstract":"Population health initiatives often rely on cold outreach to close gaps in preventive care, such as overdue screenings or immunizations. Tailoring messages to diverse patient populations remains challenging, as traditional A/B testing requires large sample sizes to test only two alternative messages. With increasing availability of large language models (LLMs), programs can utilize tiered testing among both LLM and manual human agents, presenting the dilemma of identifying which patients need different levels of human support to cost-effectively engage large populations. Using microsimulations, we compared both the statistical power and false positive rates of A/B testing and Sequential Multiple Assignment Randomized Trials (SMART) for developing personalized communications across multiple effect sizes and sample sizes. SMART showed better cost-effectiveness and net benefit across all scenarios, but superior power for detecting heterogeneous treatment effects (HTEs) only in later randomization stages, when populations were more homogeneous and subtle differences drove engagement differences.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":" ","pages":"1-8"},"PeriodicalIF":12.4000,"publicationDate":"2024-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.nature.com/articles/s41746-024-01330-2.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Digital Medicine","FirstCategoryId":"3","ListUrlMain":"https://www.nature.com/articles/s41746-024-01330-2","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

Population health initiatives often rely on cold outreach to close gaps in preventive care, such as overdue screenings or immunizations. Tailoring messages to diverse patient populations remains challenging, as traditional A/B testing requires large sample sizes to test only two alternative messages. With increasing availability of large language models (LLMs), programs can utilize tiered testing among both LLM and manual human agents, presenting the dilemma of identifying which patients need different levels of human support to cost-effectively engage large populations. Using microsimulations, we compared both the statistical power and false positive rates of A/B testing and Sequential Multiple Assignment Randomized Trials (SMART) for developing personalized communications across multiple effect sizes and sample sizes. SMART showed better cost-effectiveness and net benefit across all scenarios, but superior power for detecting heterogeneous treatment effects (HTEs) only in later randomization stages, when populations were more homogeneous and subtle differences drove engagement differences.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

模拟 A/B 测试与 SMART 设计，以 LLM 驱动的患者参与弥补预防性保健差距

全民健康计划通常依靠冷启动来弥补预防性保健方面的不足，如逾期筛查或免疫接种。由于传统的 A/B 测试需要大量样本，只能测试两种备选信息，因此针对不同患者群体定制信息仍具有挑战性。随着大型语言模型（LLM）的日益普及，项目可以在 LLM 和人工代理之间进行分层测试，这就带来了一个难题，即如何确定哪些患者需要不同程度的人工支持，从而以具有成本效益的方式吸引大量人群。通过微观模拟，我们比较了 A/B 测试和序列多重赋值随机试验（SMART）的统计能力和假阳性率，以便在多种效应大小和样本大小下开发个性化通信。在所有情况下，SMART 都显示出更好的成本效益和净收益，但只有在随机化的后期阶段，即人群更加均匀、细微差别导致参与度差异的阶段，SMART 才具有更强的异质性治疗效果 (HTE) 检测能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

NPJ Digital Medicine Multiple-

CiteScore

25.10

自引率

3.30%

发文量

170

审稿时长

15 weeks

期刊介绍： npj Digital Medicine is an online open-access journal that focuses on publishing peer-reviewed research in the field of digital medicine. The journal covers various aspects of digital medicine, including the application and implementation of digital and mobile technologies in clinical settings, virtual healthcare, and the use of artificial intelligence and informatics. The primary goal of the journal is to support innovation and the advancement of healthcare through the integration of new digital and mobile technologies. When determining if a manuscript is suitable for publication, the journal considers four important criteria: novelty, clinical relevance, scientific rigor, and digital innovation.