首页 > 最新文献

Medical Decision Making最新文献

英文 中文
Evidence on Methods for Communicating Health-Related Probabilities: Comparing the Making Numbers Meaningful Systematic Review to the 2021 IPDAS Evidence Paper Recommendations. 与健康相关的概率沟通方法的证据:将Making Numbers有意义的系统评价与2021 IPDAS证据文件建议进行比较。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-01 Epub Date: 2025-07-07 DOI: 10.1177/0272989X251346811
Brian J Zikmund-Fisher, Natalie C Benda, Jessica S Ancker

PurposeTo summarize the degree to which evidence from our recent Making Numbers Meaningful (MNM) systematic review of the effects of data presentation format on communication of health numbers supports recommendations from the 2021 International Patient Decision Aids Standards (IPDAS) Collaboration papers on presenting probabilities.MethodsThe MNM review generated 1,119 distinct findings (derived from 316 papers) related to communication of probabilities to patients or other lay audiences, classifying each finding by its relation to audience task, type of stimulus (data and data presentation format), and up to 10 distinct sets of outcomes: identification and/or recall, contrast, categorization, computation, probability perceptions and/or feelings, effectiveness perceptions and/or feelings, behavioral intentions or behavior, trust, preference, and discrimination. Here, we summarize the findings related to each of the 35 IPDAS paper recommendations.ResultsStrong evidence exists to support several IPDAS recommendations, including those related to the use of part-to-whole graphical formats (e.g., icon arrays) and avoidance of verbal probability terms, 1-in-X formats, and relative risk formats to prevent amplification of probability perceptions, effectiveness perceptions, and/or behavioral intentions as well as the use of consistent denominators to improve computation outcomes. However, the evidence base appears weaker and less complete for other IPDAS recommendations (e.g., recommendations regarding numerical estimates in context and evaluative labels). The IPDAS papers and the MNM review agree that both communication of uncertainty and use of interactive formats need further research.ConclusionsThe idea that no one visual or numerical format is optimal for every probability communication situation is both an IPDAS panel recommendation and foundational to the MNM project's design. Although no MNM evidence contradicts IPDAS recommendations, the evidence base needed to support many common probability communication recommendations remains incomplete.HighlightsThe Making Numbers Meaningful (MNM) systematic review of the literature on communicating health numbers provides mixed support for the recommendations of the 2021 International Patient Decision Aids Standards (IPDAS) evidence papers on presenting probabilities in patient decision aids.Both the IPDAS papers and the MNM project agree that no single visual or numerical format is optimal for every probability communication situation.The MNM review provides strong evidentiary support for IPDAS recommendations in favor of using part-to-whole graphical formats (e.g., icon arrays) and consistent denominators.The MNM review also supports the IPDAS cautions against verbal probability terms and 1-in-X formats as well as its concerns about the potential biasing effects of relative risk formats and framing.MNM evidence is weaker related to IPDAS recommendations about placing numerical estimates in context

目的总结我们最近对数据呈现格式对健康号码传达的影响进行的“使数字有意义”(MNM)系统综述的证据在多大程度上支持2021年国际患者决策辅助标准(IPDAS)合作论文中关于呈现概率的建议。MNM回顾产生了1119个不同的发现(来自316篇论文),这些发现与向患者或其他非专业观众传达概率有关,并根据其与观众任务的关系、刺激类型(数据和数据呈现格式)以及多达10组不同的结果对每个发现进行了分类:识别和/或回忆、对比、分类、计算、概率感知和/或感觉、有效性感知和/或感觉、行为意图或行为、信任、偏好和歧视。在这里,我们总结了与35篇IPDAS论文建议相关的研究结果。结果强有力的证据支持IPDAS的一些建议,包括使用部分到整体的图形格式(例如,图标数组)和避免口头概率术语、1-in-X格式和相对风险格式,以防止放大概率感知、有效性感知和/或行为意图,以及使用一致的分母来改善计算结果。然而,IPDAS的其他建议(例如,关于背景下的数字估计和评价标签的建议)的证据基础似乎较弱和不完整。IPDAS论文和MNM综述一致认为,不确定性的传播和交互式格式的使用都需要进一步研究。结论:没有一种视觉或数字格式对所有概率通信情况都是最佳的,这既是IPDAS小组的建议,也是MNM项目设计的基础。虽然没有MNM证据与IPDAS的建议相矛盾,但支持许多常见概率通信建议所需的证据基础仍然不完整。使数字有意义(MNM)系统回顾了关于传达健康号码的文献,为2021年国际患者决策辅助标准(IPDAS)关于在患者决策辅助中呈现概率的证据文件的建议提供了混合支持。IPDAS论文和MNM项目都认为,没有一种单一的视觉或数字格式对每种概率通信情况都是最佳的。MNM的审查为IPDAS的建议提供了强有力的证据支持,这些建议赞成使用部分到整体的图形格式(例如,图标数组)和一致的分母。MNM审查还支持IPDAS对口头概率术语和1-in-X格式的警告,以及对相对风险格式和框架的潜在偏倚影响的担忧。与IPDAS关于在背景下放置数值估计和使用评估标签的建议相关的MNM证据较弱。
{"title":"Evidence on Methods for Communicating Health-Related Probabilities: Comparing the Making Numbers Meaningful Systematic Review to the 2021 IPDAS Evidence Paper Recommendations.","authors":"Brian J Zikmund-Fisher, Natalie C Benda, Jessica S Ancker","doi":"10.1177/0272989X251346811","DOIUrl":"10.1177/0272989X251346811","url":null,"abstract":"<p><p>PurposeTo summarize the degree to which evidence from our recent Making Numbers Meaningful (MNM) systematic review of the effects of data presentation format on communication of health numbers supports recommendations from the 2021 International Patient Decision Aids Standards (IPDAS) Collaboration papers on presenting probabilities.MethodsThe MNM review generated 1,119 distinct findings (derived from 316 papers) related to communication of probabilities to patients or other lay audiences, classifying each finding by its relation to audience task, type of stimulus (data and data presentation format), and up to 10 distinct sets of outcomes: identification and/or recall, contrast, categorization, computation, probability perceptions and/or feelings, effectiveness perceptions and/or feelings, behavioral intentions or behavior, trust, preference, and discrimination. Here, we summarize the findings related to each of the 35 IPDAS paper recommendations.ResultsStrong evidence exists to support several IPDAS recommendations, including those related to the use of part-to-whole graphical formats (e.g., icon arrays) and avoidance of verbal probability terms, 1-in-X formats, and relative risk formats to prevent amplification of probability perceptions, effectiveness perceptions, and/or behavioral intentions as well as the use of consistent denominators to improve computation outcomes. However, the evidence base appears weaker and less complete for other IPDAS recommendations (e.g., recommendations regarding numerical estimates in context and evaluative labels). The IPDAS papers and the MNM review agree that both communication of uncertainty and use of interactive formats need further research.ConclusionsThe idea that no one visual or numerical format is optimal for every probability communication situation is both an IPDAS panel recommendation and foundational to the MNM project's design. Although no MNM evidence contradicts IPDAS recommendations, the evidence base needed to support many common probability communication recommendations remains incomplete.HighlightsThe Making Numbers Meaningful (MNM) systematic review of the literature on communicating health numbers provides mixed support for the recommendations of the 2021 International Patient Decision Aids Standards (IPDAS) evidence papers on presenting probabilities in patient decision aids.Both the IPDAS papers and the MNM project agree that no single visual or numerical format is optimal for every probability communication situation.The MNM review provides strong evidentiary support for IPDAS recommendations in favor of using part-to-whole graphical formats (e.g., icon arrays) and consistent denominators.The MNM review also supports the IPDAS cautions against verbal probability terms and 1-in-X formats as well as its concerns about the potential biasing effects of relative risk formats and framing.MNM evidence is weaker related to IPDAS recommendations about placing numerical estimates in context","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"794-810"},"PeriodicalIF":3.1,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12236432/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144576775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forewarning Artificial Intelligence about Cognitive Biases. 警告人工智能关于认知偏差。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-10-01 Epub Date: 2025-06-24 DOI: 10.1177/0272989X251346788
Jonathan Wang, Donald A Redelmeier

Artificial intelligence models display human-like cognitive biases when generating medical recommendations. We tested whether an explicit forewarning, "Please keep in mind cognitive biases and other pitfalls of reasoning," might mitigate biases in OpenAI's generative pretrained transformer large language model. We used 10 clinically nuanced cases to test specific biases with and without a forewarning. Responses from the forewarning group were 50% longer and discussed cognitive biases more than 100 times more frequently compared with responses from the control group. Despite these differences, the forewarning decreased overall bias by only 6.9%, and no bias was extinguished completely. These findings highlight the need for clinician vigilance when interpreting generated responses that might appear seemingly thoughtful and deliberate.HighlightsArtificial intelligence models can be warned to avoid racial and gender bias.Forewarning artificial intelligence models to avoid cognitive biases does not adequately mitigate multiple pitfalls of reasoning.Critical reasoning remains an important clinical skill for practicing physicians.

人工智能模型在提供医疗建议时显示出类似人类的认知偏差。我们测试了一个明确的预警,“请记住认知偏差和其他推理陷阱”,是否可以减轻OpenAI的生成式预训练转换大型语言模型中的偏差。我们使用了10个临床细微差别的病例来测试有或没有预警的特定偏差。预警组的回答比对照组长50%,讨论认知偏差的频率是对照组的100多倍。尽管存在这些差异,但预警只减少了6.9%的总体偏倚,并且没有完全消除偏倚。这些发现强调了临床医生在解释看似深思熟虑和深思熟虑的反应时需要保持警惕。可以警告人工智能模型避免种族和性别偏见。预先警告人工智能模型以避免认知偏差并不能充分减轻推理的多重陷阱。批判性推理仍然是执业医师的一项重要临床技能。
{"title":"Forewarning Artificial Intelligence about Cognitive Biases.","authors":"Jonathan Wang, Donald A Redelmeier","doi":"10.1177/0272989X251346788","DOIUrl":"10.1177/0272989X251346788","url":null,"abstract":"<p><p>Artificial intelligence models display human-like cognitive biases when generating medical recommendations. We tested whether an explicit forewarning, \"Please keep in mind cognitive biases and other pitfalls of reasoning,\" might mitigate biases in OpenAI's generative pretrained transformer large language model. We used 10 clinically nuanced cases to test specific biases with and without a forewarning. Responses from the forewarning group were 50% longer and discussed cognitive biases more than 100 times more frequently compared with responses from the control group. Despite these differences, the forewarning decreased overall bias by only 6.9%, and no bias was extinguished completely. These findings highlight the need for clinician vigilance when interpreting generated responses that might appear seemingly thoughtful and deliberate.HighlightsArtificial intelligence models can be warned to avoid racial and gender bias.Forewarning artificial intelligence models to avoid cognitive biases does not adequately mitigate multiple pitfalls of reasoning.Critical reasoning remains an important clinical skill for practicing physicians.</p>","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"913-916"},"PeriodicalIF":3.1,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12413502/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144477583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
So You've Got a High AUC, Now What? An Overview of Important Considerations when Bringing Machine-Learning Models from Computer to Bedside. 所以你有一个高AUC,现在怎么办?将机器学习模型从计算机应用到病床时的重要考虑概述。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-01 Epub Date: 2025-05-29 DOI: 10.1177/0272989X251343082
Jiawen Deng, Mohamed E Elghobashy, Kathleen Zang, Shubh K Patel, Eddie Guo, Kiyan Heybati

Machine-learning (ML) models have the potential to transform health care by enabling more personalized and data-driven clinical decision making. However, their successful implementation in clinical practice requires careful consideration of factors beyond predictive accuracy. We provide an overview of essential considerations for developing clinically applicable ML models, including methods for assessing and improving calibration, selecting appropriate decision thresholds, enhancing model explainability, identifying and mitigating bias, as well as methods for robust validation. We also discuss strategies for improving accessibility to ML models and performing real-world testing.HighlightsThis tutorial provides clinicians with a comprehensive guide to implementing machine-learning classification models in clinical practice.Key areas covered include model calibration, threshold selection, explainability, bias mitigation, validation, and real-world testing, all of which are essential for the clinical deployment of machine-learning models.Following these guidance can help clinicians bridge the gap between machine-learning model development and real-world application and enhance patient care outcomes.

机器学习(ML)模型有可能通过实现更加个性化和数据驱动的临床决策来改变医疗保健。然而,它们在临床实践中的成功实施需要仔细考虑预测准确性之外的因素。我们概述了开发临床应用的ML模型的基本考虑因素,包括评估和改进校准的方法,选择适当的决策阈值,增强模型可解释性,识别和减轻偏差,以及稳健验证的方法。我们还讨论了改进ML模型的可访问性和执行实际测试的策略。本教程为临床医生提供了在临床实践中实现机器学习分类模型的全面指南。涵盖的关键领域包括模型校准、阈值选择、可解释性、偏差缓解、验证和实际测试,所有这些对于机器学习模型的临床部署都是必不可少的。遵循这些指导可以帮助临床医生弥合机器学习模型开发与实际应用之间的差距,并提高患者护理效果。
{"title":"So You've Got a High AUC, Now What? An Overview of Important Considerations when Bringing Machine-Learning Models from Computer to Bedside.","authors":"Jiawen Deng, Mohamed E Elghobashy, Kathleen Zang, Shubh K Patel, Eddie Guo, Kiyan Heybati","doi":"10.1177/0272989X251343082","DOIUrl":"10.1177/0272989X251343082","url":null,"abstract":"<p><p>Machine-learning (ML) models have the potential to transform health care by enabling more personalized and data-driven clinical decision making. However, their successful implementation in clinical practice requires careful consideration of factors beyond predictive accuracy. We provide an overview of essential considerations for developing clinically applicable ML models, including methods for assessing and improving calibration, selecting appropriate decision thresholds, enhancing model explainability, identifying and mitigating bias, as well as methods for robust validation. We also discuss strategies for improving accessibility to ML models and performing real-world testing.HighlightsThis tutorial provides clinicians with a comprehensive guide to implementing machine-learning classification models in clinical practice.Key areas covered include model calibration, threshold selection, explainability, bias mitigation, validation, and real-world testing, all of which are essential for the clinical deployment of machine-learning models.Following these guidance can help clinicians bridge the gap between machine-learning model development and real-world application and enhance patient care outcomes.</p>","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"640-653"},"PeriodicalIF":3.1,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12260203/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144175400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decision Frameworks for Assessing Cost-Effectiveness Given Previous Nonoptimal Decisions. 基于非最优决策评估成本效益的决策框架。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-01 Epub Date: 2025-06-12 DOI: 10.1177/0272989X251340941
Doug Coyle, David Glynn, Jeremy D Goldhaber-Fiebert, Edward C F Wilson

IntroductionEconomic evaluations identify the best course of action by a decision maker with respect to the level of health within the overall population. Traditionally, they identify 1 optimal treatment choice. In many jurisdictions, multiple technologies can be covered for the same heterogeneous patient population, which limits the applicability of this framework for directly determining whether a new technology should be covered. This article explores the impact of different decision frameworks within this context.MethodsThree alternate decision frameworks were considered: the traditional normative framework in which only the optimal technology will be covered (normative); a commonly adopted framework in which the new technology is recommended for reimbursement only if it is optimal, with coverage of other technologies remaining as before (current); and a framework that assesses specifically whether coverage of the new technology is optimal, incorporating previous reimbursement decisions and the market share of current technologies (positivist). The implications of the frameworks were assessed using a simulated probabilistic Markov model for a chronic progressive condition.ResultsResults illustrate how the different frameworks can lead to different reimbursement recommendations. This in turn produces differences in population health effects and the resultant price reductions required for covering the new technology.ConclusionBy covering only the optimal treatment option, decision makers can maximize the level of health across a population. If decision makers are unwilling to defund technologies, however, the second best option of adopting the positivist framework has the greatest relevance with respect to deciding whether a new technology should be covered.HighlightsTraditionally, economic evaluations focus on identifying the optimal treatment choice.This paper considers three alternative decision frameworks, within the context of multiple technologies being covered for the same heterogeneous patient population.This paper highlight that if decision makers are unwilling to defund therapies, current approaches to assessing cost effectiveness may be non-optimal.

经济评价确定决策者在总体人口健康水平方面的最佳行动方针。传统上,他们确定一个最佳治疗选择。在许多司法管辖区,可以为相同的异质患者群体涵盖多种技术,这限制了该框架在直接确定是否应涵盖新技术方面的适用性。本文探讨了在这种情况下不同决策框架的影响。方法考虑了三种可供选择的决策框架:传统的规范框架,其中只涵盖最优技术(规范);一种普遍采用的框架,只有在新技术是最佳的情况下才建议偿还,而其他技术的覆盖范围与以前一样(目前);以及一个框架,具体评估新技术的覆盖范围是否最佳,结合以前的报销决定和当前技术的市场份额(实证主义)。使用慢性进行性疾病的模拟概率马尔可夫模型评估框架的含义。结果说明了不同的框架如何导致不同的报销建议。这反过来又造成了人口健康影响的差异,并因此降低了覆盖新技术所需的价格。结论决策者通过只覆盖最优治疗方案,可以最大限度地提高人群的健康水平。但是,如果决策者不愿意撤资技术,则采用实证主义框架的第二个最佳选择对于决定是否应包括一项新技术具有最大的相关性。
{"title":"Decision Frameworks for Assessing Cost-Effectiveness Given Previous Nonoptimal Decisions.","authors":"Doug Coyle, David Glynn, Jeremy D Goldhaber-Fiebert, Edward C F Wilson","doi":"10.1177/0272989X251340941","DOIUrl":"10.1177/0272989X251340941","url":null,"abstract":"<p><p>IntroductionEconomic evaluations identify the best course of action by a decision maker with respect to the level of health within the overall population. Traditionally, they identify 1 optimal treatment choice. In many jurisdictions, multiple technologies can be covered for the same heterogeneous patient population, which limits the applicability of this framework for directly determining whether a new technology should be covered. This article explores the impact of different decision frameworks within this context.MethodsThree alternate decision frameworks were considered: the traditional normative framework in which only the optimal technology will be covered (normative); a commonly adopted framework in which the new technology is recommended for reimbursement only if it is optimal, with coverage of other technologies remaining as before (current); and a framework that assesses specifically whether coverage of the new technology is optimal, incorporating previous reimbursement decisions and the market share of current technologies (positivist). The implications of the frameworks were assessed using a simulated probabilistic Markov model for a chronic progressive condition.ResultsResults illustrate how the different frameworks can lead to different reimbursement recommendations. This in turn produces differences in population health effects and the resultant price reductions required for covering the new technology.ConclusionBy covering only the optimal treatment option, decision makers can maximize the level of health across a population. If decision makers are unwilling to defund technologies, however, the second best option of adopting the positivist framework has the greatest relevance with respect to deciding whether a new technology should be covered.HighlightsTraditionally, economic evaluations focus on identifying the optimal treatment choice.This paper considers three alternative decision frameworks, within the context of multiple technologies being covered for the same heterogeneous patient population.This paper highlight that if decision makers are unwilling to defund therapies, current approaches to assessing cost effectiveness may be non-optimal.</p>","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"703-713"},"PeriodicalIF":3.1,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12260196/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Impact of Machine Learning Mortality Risk Prediction on Clinician Prognostic Accuracy and Decision Support: A Randomized Vignette Study. 机器学习死亡率风险预测对临床医生预后准确性和决策支持的影响:一项随机研究。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-01 Epub Date: 2025-07-04 DOI: 10.1177/0272989X251349489
Ravi B Parikh, William J Ferrell, Anthony Girard, Jenna White, Sophia Fang, Justin E Bekelman, Marilyn M Schapira
<p><p>BackgroundMachine learning (ML) algorithms may improve the prognosis for serious illnesses such as cancer, identifying patients who may benefit from earlier palliative care (PC) or advance care planning (ACP). We evaluated the impact of various presentation strategies of a hypothetical ML algorithm on clinician prognostic accuracy and decision making.MethodsThis was a randomized clinical vignette survey study among medical oncologists who treat metastatic non-small-cell lung cancer (mNSCLC). Between March and June 2023, clinicians were shown 3 vignettes of patients presenting with mNSCLC. The vignettes varied by prognostic risk, as defined from the Lung Cancer Prognostic Index (LCPI). Clinicians estimated life expectancy in months and made recommendations about PC and ACP. Clinicians were then shown the same vignette with a hypothetical survival estimate from a black-box ML algorithm; clinicians were randomized to receive the ML prediction using absolute and/or reference-dependent prognostic estimates. The primary outcome was prognostic accuracy relative to the LCPI.ResultsAmong 51 clinicians with complete responses, the median years in practice was 7 (interquartile range 3.5-19), 14 (27.5%) were female, 23 (45.1%) practiced in a community oncology setting, and baseline accuracy was 54.9% (95% confidence interval [CI] 47.0-62.8) across all vignettes. ML presentation improved accuracy (mean change relative to baseline 20.9%, 95% CI 13.9-27.9, <i>P</i> < 0.001). ML outputs using an absolute presentation strategy alone (mean change 27.4%, 95% 16.8-38.1, <i>P</i> < 0.001) or with reference dependence (mean change 33.4%, 95% 23.9-42.8, <i>P</i> < 0.001) improved accuracy, but reference dependence alone did not (mean change 2.0% [95% CI -11.1 to 15.0], <i>P</i> = 0.77). ML presentation did not change the rates of recommending ACP nor PC referral (mean change 1.3% and 0.7%, respectively).LimitationsThe singular use case of prognosis in mNSCLC, low initial response rate.ConclusionsML-based assessments may improve prognostic accuracy but not result in changed decision making.ImplicationsML prognostic algorithms prioritizing explainability and absolute prognoses may have greater impact on clinician decision making.Trial Registration: CT.gov: NCT06463977HighlightsWhile machine learning (ML) algorithms may accurately predict mortality, the impact of prognostic ML on clinicians' prognostic accuracy and decision making and optimal presentation strategies for ML outputs are unclear.In this multicenter randomized survey study among vignettes of patients with advanced cancer, prognostic accuracy improved by 20.9% when clinicians reviewed vignettes with a hypothetical ML mortality risk prediction, with absolute risk presentation strategies resulting in greater accuracy gains than reference-dependent presentations alone.However, ML presentation did not change the rates of recommending advance care planning or palliative care referral (1.3% and 0.7%, respectiv
机器学习(ML)算法可以改善癌症等严重疾病的预后,识别可能受益于早期姑息治疗(PC)或提前护理计划(ACP)的患者。我们评估了假设ML算法的各种呈现策略对临床医生预后准确性和决策的影响。方法:这是一项随机临床调查研究,研究对象是治疗转移性非小细胞肺癌(mNSCLC)的内科肿瘤学家。在2023年3月至6月期间,临床医生展示了3个小片段的小细胞肺癌患者。根据肺癌预后指数(LCPI)的定义,不同患者的预后风险不同。临床医生以月为单位估计预期寿命,并对PC和ACP提出建议。然后向临床医生展示相同的小插曲,并根据黑盒ML算法进行假设的生存估计;临床医生随机接受使用绝对和/或参考依赖预后估计的ML预测。主要结果是相对于LCPI的预后准确性。结果在51名完全缓解的临床医生中,实践的中位数为7年(四分位数范围为3.5-19),14名(27.5%)为女性,23名(45.1%)在社区肿瘤学环境中实践,基线准确性为54.9%(95%置信区间[CI] 47.0-62.8)。ML表现提高了准确性(相对于基线的平均变化20.9%,95% CI 13.9-27.9, P P P P = 0.77)。ML表现没有改变ACP和PC推荐率(平均变化分别为1.3%和0.7%)。局限:在小细胞肺癌中预后的单一用例,初始缓解率低。结论基于sml的评估可提高预后准确性,但不会导致决策改变。结论:优先考虑可解释性和绝对预后的sml预测算法可能对临床医生的决策有更大的影响。虽然机器学习(ML)算法可以准确地预测死亡率,但预后ML对临床医生的预后准确性和决策制定以及ML输出的最佳呈现策略的影响尚不清楚。在这项针对晚期癌症患者的多中心随机调查研究中,当临床医生使用假设的ML死亡风险预测来评估小样本时,预后准确性提高了20.9%,绝对风险表现策略比单独参考依赖表现获得更高的准确性。然而,ML表现并没有改变推荐提前护理计划或姑息治疗转诊的比率(分别为1.3%和0.7%)。无解释的基于ml的预后评估可提高预后准确性,但不会改变有关姑息治疗转诊或预先护理计划的决定。
{"title":"The Impact of Machine Learning Mortality Risk Prediction on Clinician Prognostic Accuracy and Decision Support: A Randomized Vignette Study.","authors":"Ravi B Parikh, William J Ferrell, Anthony Girard, Jenna White, Sophia Fang, Justin E Bekelman, Marilyn M Schapira","doi":"10.1177/0272989X251349489","DOIUrl":"10.1177/0272989X251349489","url":null,"abstract":"&lt;p&gt;&lt;p&gt;BackgroundMachine learning (ML) algorithms may improve the prognosis for serious illnesses such as cancer, identifying patients who may benefit from earlier palliative care (PC) or advance care planning (ACP). We evaluated the impact of various presentation strategies of a hypothetical ML algorithm on clinician prognostic accuracy and decision making.MethodsThis was a randomized clinical vignette survey study among medical oncologists who treat metastatic non-small-cell lung cancer (mNSCLC). Between March and June 2023, clinicians were shown 3 vignettes of patients presenting with mNSCLC. The vignettes varied by prognostic risk, as defined from the Lung Cancer Prognostic Index (LCPI). Clinicians estimated life expectancy in months and made recommendations about PC and ACP. Clinicians were then shown the same vignette with a hypothetical survival estimate from a black-box ML algorithm; clinicians were randomized to receive the ML prediction using absolute and/or reference-dependent prognostic estimates. The primary outcome was prognostic accuracy relative to the LCPI.ResultsAmong 51 clinicians with complete responses, the median years in practice was 7 (interquartile range 3.5-19), 14 (27.5%) were female, 23 (45.1%) practiced in a community oncology setting, and baseline accuracy was 54.9% (95% confidence interval [CI] 47.0-62.8) across all vignettes. ML presentation improved accuracy (mean change relative to baseline 20.9%, 95% CI 13.9-27.9, &lt;i&gt;P&lt;/i&gt; &lt; 0.001). ML outputs using an absolute presentation strategy alone (mean change 27.4%, 95% 16.8-38.1, &lt;i&gt;P&lt;/i&gt; &lt; 0.001) or with reference dependence (mean change 33.4%, 95% 23.9-42.8, &lt;i&gt;P&lt;/i&gt; &lt; 0.001) improved accuracy, but reference dependence alone did not (mean change 2.0% [95% CI -11.1 to 15.0], &lt;i&gt;P&lt;/i&gt; = 0.77). ML presentation did not change the rates of recommending ACP nor PC referral (mean change 1.3% and 0.7%, respectively).LimitationsThe singular use case of prognosis in mNSCLC, low initial response rate.ConclusionsML-based assessments may improve prognostic accuracy but not result in changed decision making.ImplicationsML prognostic algorithms prioritizing explainability and absolute prognoses may have greater impact on clinician decision making.Trial Registration: CT.gov: NCT06463977HighlightsWhile machine learning (ML) algorithms may accurately predict mortality, the impact of prognostic ML on clinicians' prognostic accuracy and decision making and optimal presentation strategies for ML outputs are unclear.In this multicenter randomized survey study among vignettes of patients with advanced cancer, prognostic accuracy improved by 20.9% when clinicians reviewed vignettes with a hypothetical ML mortality risk prediction, with absolute risk presentation strategies resulting in greater accuracy gains than reference-dependent presentations alone.However, ML presentation did not change the rates of recommending advance care planning or palliative care referral (1.3% and 0.7%, respectiv","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"690-702"},"PeriodicalIF":3.1,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12233153/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144561782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Facilitators and Barriers of the Use of Prognostic Models for Clinical Decision Making in Acute Neurologic Care: A Systematic Review. 在急性神经系统护理中使用预后模型进行临床决策的促进因素和障碍:一项系统综述。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-01 Epub Date: 2025-06-29 DOI: 10.1177/0272989X251343027
Ellen X Y Hu, Evelien S van Hoorn, Isabel R A Retel Helmrich, Susanne Muehlschlegel, Judith A C Rietjens, Hester F Lingsma

BackgroundPrognostic models are crucial for predicting patient outcomes and aiding clinical decision making. Despite their availability in acute neurologic care, their use in clinical practice is limited, with insufficient reflection on reasons for this scarce implementation.PurposeTo summarize facilitators and barriers among clinicians affecting the use of prognostic models in acute neurologic care.Data SourcesSystematic searches were conducted in Embase, Medline ALL, Web of Science Core Collection, and Cochrane Central Register of Controlled Trials from inception until February 2024.Study SelectionEligible studies included those providing clinicians' perspectives on the use of prognostic models in acute neurologic care.Data ExtractionData were extracted concerning study characteristics, study aim, data collection and analysis, prognostic models, participant characteristics, facilitators, and barriers. Risk of bias was assessed using the Qualsyst tool.Data SynthesisFindings were structured around the Unified Theory of Acceptance and Use of Technology framework. Identified facilitators included improved communication with patients and surrogate decision makers (n = 9), reassurance of clinical judgment (n = 6) perceived improved patient outcomes (n = 4), standardization of care (n = 4), resource optimization (n = 3), and extension of clinical knowledge (n = 3). Barriers included perceived misinterpretation during risk communication (n = 3), mistrust in data (n = 3), perceived reduction of clinicians' autonomy (n = 3), and ethical considerations (n = 2). In total, 15 studies were included, with all but 1 demonstrating good methodological quality. None were excluded due to poor quality ratings.LimitationsThis review identifies limitations, including study heterogeneity, exclusion of gray literature, and the scarcity of evaluations on model implementation.ConclusionsUnderstanding facilitators and barriers may enhance prognostic model development and implementation. Bridging the gap between development and clinical use requires improved collaboration among researchers, clinicians, patients, and surrogate decision makers.HighlightsThis is the first systematic review to summarize published facilitators and barriers affecting the use of prognostic models in acute neurologic care from the clinicians' perspective.Commonly reported barriers and facilitators were consistent with several domains of the Unified Theory of Acceptance and Use of Technology model, including effort expectancy, social influence, and facilitating conditions, with the focus on the performance expectancy domain.Future implementation research including collaboration with researchers from different fields, clinicians, patients, and their surrogate decision makers may be highly valuable for future model development and implementation.

预后模型对于预测患者预后和辅助临床决策至关重要。尽管它们在急性神经系统护理中可用,但它们在临床实践中的使用是有限的,对这种稀缺实施的原因没有充分的反思。目的总结影响临床医生在急性神经内科护理中使用预后模型的因素和障碍。从成立到2024年2月,在Embase、Medline ALL、Web of Science Core Collection和Cochrane Central Register of Controlled Trials中进行了系统检索。研究选择:符合条件的研究包括临床医生对急性神经系统护理中使用预后模型的观点。数据提取提取有关研究特征、研究目的、数据收集和分析、预后模型、参与者特征、促进因素和障碍的数据。使用Qualsyst工具评估偏倚风险。数据综合研究结果是围绕技术接受和使用的统一理论框架构建的。确定的促进因素包括改善与患者和替代决策者的沟通(n = 9),保证临床判断(n = 6),改善患者预后(n = 4),标准化护理(n = 4),资源优化(n = 3)和扩展临床知识(n = 3)。障碍包括风险沟通过程中感知到的误解(n = 3)、对数据的不信任(n = 3)、临床医生自主性的降低(n = 3)和伦理考虑(n = 2)。总共纳入了15项研究,除1项研究外,其余研究均具有良好的方法学质量。没有一例因质量评分差而被排除在外。本综述确定了局限性,包括研究异质性、灰色文献的排除以及模型实施评估的稀缺性。结论了解促进因素和障碍因素可以促进预后模型的开发和实施。弥合开发和临床使用之间的差距需要改善研究人员、临床医生、患者和替代决策者之间的合作。这是第一个系统综述,从临床医生的角度总结了已发表的影响急性神经系统护理中使用预后模型的促进因素和障碍。通常报告的障碍和促进因素与技术接受和使用统一理论模型的几个领域是一致的,包括努力预期、社会影响和促进条件,重点是绩效预期领域。未来的实施研究,包括与来自不同领域的研究人员、临床医生、患者及其代理决策者的合作,可能对未来模型的开发和实施非常有价值。
{"title":"Facilitators and Barriers of the Use of Prognostic Models for Clinical Decision Making in Acute Neurologic Care: A Systematic Review.","authors":"Ellen X Y Hu, Evelien S van Hoorn, Isabel R A Retel Helmrich, Susanne Muehlschlegel, Judith A C Rietjens, Hester F Lingsma","doi":"10.1177/0272989X251343027","DOIUrl":"10.1177/0272989X251343027","url":null,"abstract":"<p><p>BackgroundPrognostic models are crucial for predicting patient outcomes and aiding clinical decision making. Despite their availability in acute neurologic care, their use in clinical practice is limited, with insufficient reflection on reasons for this scarce implementation.PurposeTo summarize facilitators and barriers among clinicians affecting the use of prognostic models in acute neurologic care.Data SourcesSystematic searches were conducted in Embase, Medline ALL, Web of Science Core Collection, and Cochrane Central Register of Controlled Trials from inception until February 2024.Study SelectionEligible studies included those providing clinicians' perspectives on the use of prognostic models in acute neurologic care.Data ExtractionData were extracted concerning study characteristics, study aim, data collection and analysis, prognostic models, participant characteristics, facilitators, and barriers. Risk of bias was assessed using the Qualsyst tool.Data SynthesisFindings were structured around the Unified Theory of Acceptance and Use of Technology framework. Identified facilitators included improved communication with patients and surrogate decision makers (<i>n</i> = 9), reassurance of clinical judgment (<i>n</i> = 6) perceived improved patient outcomes (<i>n</i> = 4), standardization of care (<i>n</i> = 4), resource optimization (<i>n</i> = 3), and extension of clinical knowledge (<i>n</i> = 3). Barriers included perceived misinterpretation during risk communication (<i>n</i> = 3), mistrust in data (<i>n</i> = 3), perceived reduction of clinicians' autonomy (<i>n</i> = 3), and ethical considerations (<i>n</i> = 2). In total, 15 studies were included, with all but 1 demonstrating good methodological quality. None were excluded due to poor quality ratings.LimitationsThis review identifies limitations, including study heterogeneity, exclusion of gray literature, and the scarcity of evaluations on model implementation.ConclusionsUnderstanding facilitators and barriers may enhance prognostic model development and implementation. Bridging the gap between development and clinical use requires improved collaboration among researchers, clinicians, patients, and surrogate decision makers.HighlightsThis is the first systematic review to summarize published facilitators and barriers affecting the use of prognostic models in acute neurologic care from the clinicians' perspective.Commonly reported barriers and facilitators were consistent with several domains of the Unified Theory of Acceptance and Use of Technology model, including effort expectancy, social influence, and facilitating conditions, with the focus on the performance expectancy domain.Future implementation research including collaboration with researchers from different fields, clinicians, patients, and their surrogate decision makers may be highly valuable for future model development and implementation.</p>","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"753-770"},"PeriodicalIF":3.1,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12260205/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144530739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Postpartum Sterilization after a Preterm Delivery Is Not Associated with Decision Regret. 早产后的产后绝育与决策后悔无关。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-01 Epub Date: 2025-06-21 DOI: 10.1177/0272989X251341478
Marika Toscano, Sarah J Betstadt, Sara Spielman, Gayathri Guru Murthy, Brooke A Levandowski

BackgroundAlthough sterilization is one of the most effective methods of birth control, some physicians may hesitate to perform postpartum sterilizations on patients after preterm birth, as preterm labor and delivery may preclude adequate counseling.MethodsThis is a cross-sectional study conducted at a single, tertiary care, academic institution of adult pregnant patients who experienced a spontaneous or iatrogenic preterm delivery between March 15, 2011, and May 10, 2014 and underwent postpartum female surgical sterilization within 12 wk of delivery. A validated Decision Regret Scale was administered 7 to 11 y later. Univariate and bivariate analyses were conducted. Unadjusted and multivariate logistic regression analyses identified factors associated with moderate to severe decision regret.ResultsMost participants (75.5%) with a preterm delivery reported no or mild regret associated with their sterilization. Circumstances surrounding the sterilization decision were positive, as 85.7% reported having enough information, 81.6% reported enough emotional support, and 75.5% reported adequate decision time. Adjusting for maternal and gestational age at delivery plus other covariates, only those reporting they had adequate time to make their sterilization decision remained significantly associated with no or mild regret (odds ratio: 0.002, 95% confidence interval: <0.001-0.61).DiscussionStudy results indicated high confidence in the sterilization decision, which was not affected by maternal age at delivery or the fact that the individual had a preterm delivery, emphasizing the importance of individualized counseling and support for patients during the decision-making process.ConclusionProviding adequate time for patients to decide on postpartum surgical sterilization was the most important factor for decreased sterilization regret.ImplicationsThe decision for sterilization should be made using a patient-centered, shared decision-making framework.HighlightsAmong patients with a preterm delivery who underwent postpartum surgical sterilization, maternal age at delivery was not associated with increased decision regret.Providing adequate time for patients to decide on postpartum surgical sterilization was the most important factor for decreased sterilization regret among patients with a preterm delivery.We must trust the patient knows they are making the right decision for themselves in that moment, even if this is at the time of a preterm delivery.

虽然绝育是最有效的节育方法之一,但一些医生可能会犹豫是否对早产后的患者进行产后绝育,因为早产和分娩可能会妨碍充分的咨询。方法:这是一项在单一三级医疗学术机构进行的横断面研究,研究对象为2011年3月15日至2014年5月10日期间发生自发性或医源性早产并在分娩后12周内接受女性手术绝育的成年孕妇。7到11年后进行了一个有效的决策后悔量表。进行了单因素和双因素分析。未调整和多变量逻辑回归分析确定了与中度至重度决策后悔相关的因素。结果大多数早产参与者(75.5%)报告与绝育相关的无后悔或轻微后悔。围绕绝育决定的情况是积极的,85.7%的人表示有足够的信息,81.6%的人表示有足够的情感支持,75.5%的人表示有足够的决策时间。调整分娩时的产妇和胎龄以及其他协变量,只有那些报告自己有足够时间做出绝育决定的人仍然与没有或轻微后悔显著相关(优势比:0.002,95%置信区间:
{"title":"Postpartum Sterilization after a Preterm Delivery Is Not Associated with Decision Regret.","authors":"Marika Toscano, Sarah J Betstadt, Sara Spielman, Gayathri Guru Murthy, Brooke A Levandowski","doi":"10.1177/0272989X251341478","DOIUrl":"10.1177/0272989X251341478","url":null,"abstract":"<p><p>BackgroundAlthough sterilization is one of the most effective methods of birth control, some physicians may hesitate to perform postpartum sterilizations on patients after preterm birth, as preterm labor and delivery may preclude adequate counseling.MethodsThis is a cross-sectional study conducted at a single, tertiary care, academic institution of adult pregnant patients who experienced a spontaneous or iatrogenic preterm delivery between March 15, 2011, and May 10, 2014 and underwent postpartum female surgical sterilization within 12 wk of delivery. A validated Decision Regret Scale was administered 7 to 11 y later. Univariate and bivariate analyses were conducted. Unadjusted and multivariate logistic regression analyses identified factors associated with moderate to severe decision regret.ResultsMost participants (75.5%) with a preterm delivery reported no or mild regret associated with their sterilization. Circumstances surrounding the sterilization decision were positive, as 85.7% reported having enough information, 81.6% reported enough emotional support, and 75.5% reported adequate decision time. Adjusting for maternal and gestational age at delivery plus other covariates, only those reporting they had adequate time to make their sterilization decision remained significantly associated with no or mild regret (odds ratio: 0.002, 95% confidence interval: <0.001-0.61).DiscussionStudy results indicated high confidence in the sterilization decision, which was not affected by maternal age at delivery or the fact that the individual had a preterm delivery, emphasizing the importance of individualized counseling and support for patients during the decision-making process.ConclusionProviding adequate time for patients to decide on postpartum surgical sterilization was the most important factor for decreased sterilization regret.ImplicationsThe decision for sterilization should be made using a patient-centered, shared decision-making framework.HighlightsAmong patients with a preterm delivery who underwent postpartum surgical sterilization, maternal age at delivery was not associated with increased decision regret.Providing adequate time for patients to decide on postpartum surgical sterilization was the most important factor for decreased sterilization regret among patients with a preterm delivery.We must trust the patient knows they are making the right decision for themselves in that moment, even if this is at the time of a preterm delivery.</p>","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"654-664"},"PeriodicalIF":3.1,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12270760/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144340575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Limited Sample Size and Follow-up on Partitioned Survival and Multistate Modeling-Based Health Economic Models: A Simulation Study. 有限样本量和随访对分区生存和基于多状态建模的健康经济模型的影响:模拟研究。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-01 Epub Date: 2025-06-25 DOI: 10.1177/0272989X251342596
Jaclyn M Beca, Kelvin K W Chan, David M J Naimark, Petros Pechlivanoglou

BackgroundEconomic models often require extrapolation of clinical time-to-event data for multiple events. Two modeling approaches in oncology that incorporate time dependency include partitioned survival models (PSM) and semi-Markov decision models estimated using multistate modeling (MSM). The objective of this simulation study was to assess the performance of PSM and MSM across datasets with varying sample size and degrees of censoring.MethodsWe generated disease trajectories of progression and death for multiple hypothetical populations with advanced cancers. These populations served as the sampling pool for simulated trial cohorts with multiple sample sizes and various levels of follow-up. We estimated MSM and PSM by fitting survival models to these simulated datasets with different approaches to incorporating general population mortality (GPM) and selected best-fitting models using statistical criteria. Mean survival was compared with "true" population values to assess error.ResultsWith near complete follow-up, both PSMs and MSMs accurately estimated mean population survival, while smaller samples and shorter follow-up times were associated with a larger error across approaches and clinical scenarios, especially for more distant clinical endpoints. MSMs were slightly more often not estimable when informed by studies with small sample sizes or short follow-up, due to low numbers at risk for the downstream transition. However, when estimable, the MSM models more commonly produced a smaller error in mean survival than the PSMs did.ConclusionsCaution should be taken with all modeling approaches when the underlying data are very limited, particularly PSMs, due to the large errors produced. When estimable and for selections based on statistical criteria, MSMs performed similar to or better than PSMs in estimating mean survival with limited data.HighlightsCaution should be taken with all modeling approaches when underlying data are very limited.Partitioned survival models (PSMs) can lead to significant errors, particularly with limited follow-up. Incorporating general population mortality (GPM) via internal additive hazards improved estimates of mean survival, but the effects were modest.When estimable, decision models based on multistate modeling (MSM) produced similar or smaller error in mean survival compared with PSM, but small samples or limited deaths after progression produce additional challenges for fitting MSMs; more research is needed to improve estimation of MSMs and similar state transition-based modeling methods with limited data.Future studies are needed to assess the applicability of these findings to comparative analyses estimating incremental survival benefits.

经济模型通常需要对多个事件的临床事件时间数据进行外推。肿瘤学中包含时间依赖性的两种建模方法包括分割生存模型(PSM)和使用多状态建模(MSM)估计的半马尔可夫决策模型。本模拟研究的目的是评估PSM和MSM在不同样本量和审查程度的数据集上的性能。方法:我们为多个假设的晚期癌症人群生成了疾病进展和死亡的轨迹。这些人群作为具有多个样本量和不同随访水平的模拟试验队列的抽样池。我们通过将生存模型与这些模拟数据集进行拟合来估计MSM和PSM,并采用不同的方法纳入一般人口死亡率(GPM),并使用统计标准选择最佳拟合模型。将平均生存率与“真实”人口值进行比较,以评估误差。在接近完全随访的情况下,psm和msm都能准确地估计平均人群生存率,而较小的样本和较短的随访时间与方法和临床情况的较大误差相关,特别是对于较远的临床终点。当样本量较小或随访时间较短的研究提供信息时,由于下游转移的风险较低,MSMs往往无法估计。然而,当可估计时,MSM模型通常比psm模型在平均生存中产生更小的误差。结论:当基础数据非常有限时,所有建模方法都应谨慎,特别是psm,因为会产生很大的误差。当可估计时,对于基于统计标准的选择,在有限数据下估计平均生存时,MSMs的表现与psm相似或更好。当底层数据非常有限时,所有建模方法都应该谨慎。分区生存模型(psm)可能导致严重的错误,特别是在随访有限的情况下。通过内部累加性危险纳入一般人群死亡率(GPM)改善了平均生存的估计,但效果不大。当可估计时,与PSM相比,基于多状态建模(MSM)的决策模型在平均生存方面产生了类似或更小的误差,但小样本或进展后有限的死亡对MSM的拟合产生了额外的挑战;在数据有限的情况下,需要进一步研究改进msm的估计和类似的基于状态转换的建模方法。未来的研究需要评估这些发现的适用性,以比较分析估计增加的生存益处。
{"title":"Impact of Limited Sample Size and Follow-up on Partitioned Survival and Multistate Modeling-Based Health Economic Models: A Simulation Study.","authors":"Jaclyn M Beca, Kelvin K W Chan, David M J Naimark, Petros Pechlivanoglou","doi":"10.1177/0272989X251342596","DOIUrl":"10.1177/0272989X251342596","url":null,"abstract":"<p><p>BackgroundEconomic models often require extrapolation of clinical time-to-event data for multiple events. Two modeling approaches in oncology that incorporate time dependency include partitioned survival models (PSM) and semi-Markov decision models estimated using multistate modeling (MSM). The objective of this simulation study was to assess the performance of PSM and MSM across datasets with varying sample size and degrees of censoring.MethodsWe generated disease trajectories of progression and death for multiple hypothetical populations with advanced cancers. These populations served as the sampling pool for simulated trial cohorts with multiple sample sizes and various levels of follow-up. We estimated MSM and PSM by fitting survival models to these simulated datasets with different approaches to incorporating general population mortality (GPM) and selected best-fitting models using statistical criteria. Mean survival was compared with \"true\" population values to assess error.ResultsWith near complete follow-up, both PSMs and MSMs accurately estimated mean population survival, while smaller samples and shorter follow-up times were associated with a larger error across approaches and clinical scenarios, especially for more distant clinical endpoints. MSMs were slightly more often not estimable when informed by studies with small sample sizes or short follow-up, due to low numbers at risk for the downstream transition. However, when estimable, the MSM models more commonly produced a smaller error in mean survival than the PSMs did.ConclusionsCaution should be taken with all modeling approaches when the underlying data are very limited, particularly PSMs, due to the large errors produced. When estimable and for selections based on statistical criteria, MSMs performed similar to or better than PSMs in estimating mean survival with limited data.HighlightsCaution should be taken with all modeling approaches when underlying data are very limited.Partitioned survival models (PSMs) can lead to significant errors, particularly with limited follow-up. Incorporating general population mortality (GPM) via internal additive hazards improved estimates of mean survival, but the effects were modest.When estimable, decision models based on multistate modeling (MSM) produced similar or smaller error in mean survival compared with PSM, but small samples or limited deaths after progression produce additional challenges for fitting MSMs; more research is needed to improve estimation of MSMs and similar state transition-based modeling methods with limited data.Future studies are needed to assess the applicability of these findings to comparative analyses estimating incremental survival benefits.</p>","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"714-725"},"PeriodicalIF":3.1,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12260197/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144499005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mapping and Linking between the EQ-5D-5L and the PROMIS Domains in the United States. 美国EQ-5D-5L与PROMIS结构域的映射与连接。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-01 Epub Date: 2025-06-13 DOI: 10.1177/0272989X251340990
Xiaodan Tang, Ron D Hays, David Cella, Sarah Acaster, Benjamin David Schalet, Asia Sikora Kessler, Montserrat Vera Llonch, Janel Hanmer

ObjectivesThe EQ-5D-5L and Patient-Reported Outcomes Measurement Information System (PROMIS®) preference score (PROPr) are preference-based measures. This study compares mapping and linking approaches to align the PROPr and the PROMIS domains included in PROPr plus Anxiety with EQ-5D-5L item responses and preference scores.MethodsA general population sample of 983 adults completed the online survey. Regression-based mapping methods and item response theory (IRT) linking methods were used to align scores. Mapping was used to predict EQ-5D-5L item responses or preference scores using PROMIS domain scores. Equating strategies were applied to address regression to the mean. The linking approach estimated item parameters of EQ-5D-5L based on the PROMIS score metric and generated bidirectional crosswalks between EQ-5D-5L item responses and relevant PROMIS domain scores.ResultsEQ-5D-5L item responses were significantly accounted for by PROMIS domains of Anxiety, Depression, Fatigue, Pain Interference, Physical Function, Social Roles, and Sleep Disturbance. EQ-5D-5L preference scores were accounted for by the same PROMIS domains, excluding Anxiety and Fatigue, and by the PROPr preference scores. IRT-linking crosswalks were generated between EQ-5D-5L item responses and PROMIS domains of Physical Function, Pain, and Depression. Small differences were found between observed and predicted scores for all 3 methods. The direct mapping approach (directly predicting EQ-5D-5L scores) with the equipercentile equating strategy proved superior to the linking method due to improved prediction accuracy and comparable score range coverage.ConclusionsThe PROPr and the PROMIS domains included in the PROMIS-29+2 predict EQ-5D-5L preference scores or item responses. Both methods can generate acceptably precise EQ-5D-5L preference scores, with the direct mapping approach using the equating strategy offering better precision. We summarized recommended score conversion tables based on available and desired scores.HighlightsThis study compares mapping (score prediction) and IRT-based linking approaches to align the PROPr and the PROMIS domains with EQ-5D-5L item responses and preference scores.Researchers, clinicians, and stakeholders can use this study's regression formulas and score crosswalks to convert scores between PROMIS and EQ-5D-5L.Mapping can generate more precise scores, while linking offers greater flexibility in score estimation when fewer PROMIS domain scores are collected.

EQ-5D-5L和患者报告结果测量信息系统(PROMIS®)偏好评分(PROPr)是基于偏好的测量方法。本研究比较了将PROPr + Anxiety中包含的PROPr和PROMIS域与EQ-5D-5L项目反应和偏好分数对齐的映射和链接方法。方法对983名成年人进行在线调查。采用基于回归的映射方法和项目反应理论(IRT)链接方法对分数进行对齐。使用映射来预测EQ-5D-5L项目的反应或使用PROMIS域得分的偏好得分。采用相等策略来解决回归均值问题。链接法基于PROMIS得分指标估计EQ-5D-5L的项目参数,生成EQ-5D-5L项目反应与相关PROMIS领域得分之间的双向交叉曲线。结果tq - 5d - 5l项目的回答被焦虑、抑郁、疲劳、疼痛干扰、身体功能、社会角色和睡眠障碍的PROMIS域显著地解释。EQ-5D-5L偏好得分由相同的PROMIS域(不包括焦虑和疲劳)和PROPr偏好得分来解释。在EQ-5D-5L项目反应与身体功能、疼痛和抑郁的PROMIS域之间产生了irt连接的交叉通道。所有3种方法的观察得分和预测得分之间存在微小差异。采用等百分位相等策略的直接映射法(直接预测EQ-5D-5L分数)由于预测精度和可比较分数范围覆盖范围的提高而优于链接法。结论promise -29+2中包含的PROPr和PROMIS结构域预测EQ-5D-5L偏好得分或项目反应。这两种方法都可以产生可接受的精确EQ-5D-5L偏好分数,使用等同策略的直接映射方法提供更好的精度。我们总结了基于可用分数和期望分数的推荐分数转换表。本研究比较了映射(分数预测)和基于ird的链接方法,以将PROPr和PROMIS域与EQ-5D-5L项目反应和偏好分数对齐。研究人员、临床医生和利益相关者可以使用本研究的回归公式和人行横道评分来转换PROMIS和EQ-5D-5L之间的得分。映射可以生成更精确的分数,而链接在收集较少的PROMIS域分数时提供更大的分数估计灵活性。
{"title":"Mapping and Linking between the EQ-5D-5L and the PROMIS Domains in the United States.","authors":"Xiaodan Tang, Ron D Hays, David Cella, Sarah Acaster, Benjamin David Schalet, Asia Sikora Kessler, Montserrat Vera Llonch, Janel Hanmer","doi":"10.1177/0272989X251340990","DOIUrl":"10.1177/0272989X251340990","url":null,"abstract":"<p><p>ObjectivesThe EQ-5D-5L and Patient-Reported Outcomes Measurement Information System (PROMIS®) preference score (PROPr) are preference-based measures. This study compares mapping and linking approaches to align the PROPr and the PROMIS domains included in PROPr plus Anxiety with EQ-5D-5L item responses and preference scores.MethodsA general population sample of 983 adults completed the online survey. Regression-based mapping methods and item response theory (IRT) linking methods were used to align scores. Mapping was used to predict EQ-5D-5L item responses or preference scores using PROMIS domain scores. Equating strategies were applied to address regression to the mean. The linking approach estimated item parameters of EQ-5D-5L based on the PROMIS score metric and generated bidirectional crosswalks between EQ-5D-5L item responses and relevant PROMIS domain scores.ResultsEQ-5D-5L item responses were significantly accounted for by PROMIS domains of Anxiety, Depression, Fatigue, Pain Interference, Physical Function, Social Roles, and Sleep Disturbance. EQ-5D-5L preference scores were accounted for by the same PROMIS domains, excluding Anxiety and Fatigue, and by the PROPr preference scores. IRT-linking crosswalks were generated between EQ-5D-5L item responses and PROMIS domains of Physical Function, Pain, and Depression. Small differences were found between observed and predicted scores for all 3 methods. The direct mapping approach (directly predicting EQ-5D-5L scores) with the equipercentile equating strategy proved superior to the linking method due to improved prediction accuracy and comparable score range coverage.ConclusionsThe PROPr and the PROMIS domains included in the PROMIS-29+2 predict EQ-5D-5L preference scores or item responses. Both methods can generate acceptably precise EQ-5D-5L preference scores, with the direct mapping approach using the equating strategy offering better precision. We summarized recommended score conversion tables based on available and desired scores.HighlightsThis study compares mapping (score prediction) and IRT-based linking approaches to align the PROPr and the PROMIS domains with EQ-5D-5L item responses and preference scores.Researchers, clinicians, and stakeholders can use this study's regression formulas and score crosswalks to convert scores between PROMIS and EQ-5D-5L.Mapping can generate more precise scores, while linking offers greater flexibility in score estimation when fewer PROMIS domain scores are collected.</p>","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"740-752"},"PeriodicalIF":3.1,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12260195/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144286947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing Potential Contributors of Health-Related Quality of Life and Mortality Among US Older Adults. 比较美国老年人健康相关生活质量和死亡率的潜在影响因素。
IF 3.1 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-01 Epub Date: 2025-06-12 DOI: 10.1177/0272989X251340709
Haomiao Jia, Erica I Lubetkin

BackgroundMany contributing factors can influence individuals' health, and these factors may not affect health outcomes equally. This study compared the importance of 38 predictors of health-related quality of life (HRQOL) and 2-y mortality for US older adults.MethodsData were from the Medicare Health Outcome Survey Cohort 23 (baseline 2020, follow-up 2022). This study included participants ≥65 y (N = 142,551). HRQOL measures included physically unhealthy days (PUD), mentally unhealthy days (MUD), and activity limitation days (ALD) from the Healthy Days questions and 3 measures from the Veterans RAND 12-Item Health Survey (VR-12). A variable's importance was measured as the average gain in R2 after adding the variable in all submodels.ResultsFor physical health (PUD), pain interfered with daily activities was the most important predictor with an importance score (I) of 8.4, indicating that this variable contributed 8.4% variance of PUD. Other leading predictors included pain interfered with socializing (I = 7.3) and pain rating (I = 6.7). For mental health (MUD), depression (I = 11.6) was far more important than any of the other predictors, contributing 38% of the total importance. For perceived disability (ALD), pain interfered with socializing was the most important predictor (I = 8.3), followed by difficulty doing errands (I = 6.1) and pain interfered with activities (I = 6.0). Of note, this general pattern was consistent for VR-12 HRQOL measures. Variables' importance scores for 2-y morality were very different from that for HRQOL. Age (I = 2.8) and difficulty doing errands (I = 2.6) were the most important variables.ConclusionsThis study demonstrated a large discrepancy in the variables' importance for HRQOL and 2-y mortality. Functional limitations/disabilities and geriatric syndromes were more important for the prediction of HRQOL than were chronic conditions and other factors combined.HighlightsFor older adults, large differences were found in variable importance for explaining health-related quality of life (HRQOL) and 2-y mortality among 38 explanatory variables, including functional limitations, geriatric syndromes, chronic conditions, and other factors.Pain and pain interference, difficulty doing errands, difficulty concentrating, memory problems, problems with walking/balance, and depression were the most important predictors of HRQOL.Age, marital status, education, difficulty doing errands, congestive heart failure, chronic obstructive pulmonary disease, and any cancer were more important for 2-y mortality than HRQOL.Health care providers and policy makers should focus on the impact of multimorbidity and the interaction between often multifactorial conditions, as opposed to focusing only on individual diseases.

许多因素可以影响个人的健康,而这些因素对健康结果的影响可能并不平等。本研究比较了美国老年人健康相关生活质量(HRQOL)和2岁死亡率的38个预测因素的重要性。方法数据来自医疗保险健康结局调查第23队列(基线2020年,随访2022年)。本研究纳入年龄≥65岁的参与者(N = 142,551)。HRQOL测量包括来自健康日问题的身体不健康日(PUD)、精神不健康日(MUD)和活动限制日(ALD),以及来自退伍军人RAND 12项健康调查(VR-12)的3项测量。在所有子模型中加入变量后,以R2中的平均增益来衡量变量的重要性。结果对于身体健康(PUD),疼痛干扰日常活动是最重要的预测因素,重要性评分(I)为8.4,表明该变量对PUD的方差贡献了8.4%。其他主要预测因素包括疼痛干扰社交(I = 7.3)和疼痛评分(I = 6.7)。对于心理健康(MUD),抑郁(I = 11.6)比任何其他预测因素都重要得多,占总重要性的38%。对于感知残疾(ALD),疼痛干扰社交是最重要的预测因子(I = 8.3),其次是办事困难(I = 6.1)和疼痛干扰活动(I = 6.0)。值得注意的是,这种一般模式与VR-12 HRQOL测量一致。2-y道德的变量重要性得分与HRQOL有很大差异。年龄(I = 2.8)和办事困难(I = 2.6)是最重要的变量。结论各变量对HRQOL和2年死亡率的重要性存在较大差异。功能限制/残疾和老年综合征对HRQOL的预测比慢性病和其他因素的综合更重要。对于老年人,在解释与健康相关的生活质量(HRQOL)和2年死亡率的38个解释变量(包括功能限制、老年综合征、慢性病和其他因素)中,发现变量重要性存在很大差异。疼痛和疼痛干扰、做事困难、注意力难以集中、记忆问题、行走/平衡问题和抑郁是HRQOL最重要的预测因子。年龄、婚姻状况、教育程度、办事困难、充血性心力衰竭、慢性阻塞性肺病和任何癌症对2岁死亡率的影响都大于HRQOL。卫生保健提供者和政策制定者应注重多发病的影响以及往往是多因素疾病之间的相互作用,而不是只注重个别疾病。
{"title":"Comparing Potential Contributors of Health-Related Quality of Life and Mortality Among US Older Adults.","authors":"Haomiao Jia, Erica I Lubetkin","doi":"10.1177/0272989X251340709","DOIUrl":"10.1177/0272989X251340709","url":null,"abstract":"<p><p>BackgroundMany contributing factors can influence individuals' health, and these factors may not affect health outcomes equally. This study compared the importance of 38 predictors of health-related quality of life (HRQOL) and 2-y mortality for US older adults.MethodsData were from the Medicare Health Outcome Survey Cohort 23 (baseline 2020, follow-up 2022). This study included participants ≥65 y (<i>N</i> = 142,551). HRQOL measures included physically unhealthy days (PUD), mentally unhealthy days (MUD), and activity limitation days (ALD) from the Healthy Days questions and 3 measures from the Veterans RAND 12-Item Health Survey (VR-12). A variable's importance was measured as the average gain in <i>R</i><sup>2</sup> after adding the variable in all submodels.ResultsFor physical health (PUD), pain interfered with daily activities was the most important predictor with an importance score (I) of 8.4, indicating that this variable contributed 8.4% variance of PUD. Other leading predictors included pain interfered with socializing (I = 7.3) and pain rating (I = 6.7). For mental health (MUD), depression (I = 11.6) was far more important than any of the other predictors, contributing 38% of the total importance. For perceived disability (ALD), pain interfered with socializing was the most important predictor (I = 8.3), followed by difficulty doing errands (I = 6.1) and pain interfered with activities (I = 6.0). Of note, this general pattern was consistent for VR-12 HRQOL measures. Variables' importance scores for 2-y morality were very different from that for HRQOL. Age (I = 2.8) and difficulty doing errands (I = 2.6) were the most important variables.ConclusionsThis study demonstrated a large discrepancy in the variables' importance for HRQOL and 2-y mortality. Functional limitations/disabilities and geriatric syndromes were more important for the prediction of HRQOL than were chronic conditions and other factors combined.HighlightsFor older adults, large differences were found in variable importance for explaining health-related quality of life (HRQOL) and 2-y mortality among 38 explanatory variables, including functional limitations, geriatric syndromes, chronic conditions, and other factors.Pain and pain interference, difficulty doing errands, difficulty concentrating, memory problems, problems with walking/balance, and depression were the most important predictors of HRQOL.Age, marital status, education, difficulty doing errands, congestive heart failure, chronic obstructive pulmonary disease, and any cancer were more important for 2-y mortality than HRQOL.Health care providers and policy makers should focus on the impact of multimorbidity and the interaction between often multifactorial conditions, as opposed to focusing only on individual diseases.</p>","PeriodicalId":49839,"journal":{"name":"Medical Decision Making","volume":" ","pages":"675-689"},"PeriodicalIF":3.1,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Medical Decision Making
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1