首页 > 最新文献

Journal of Evidence‐Based Medicine最新文献

英文 中文
Strategies for the Analysis and Elimination of Hallucinations in Artificial Intelligence Generated Medical Knowledge 人工智能生成医学知识中的幻觉分析与消除策略
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-09-22 DOI: 10.1111/jebm.70075
Fengxian Chen, Yan Li, Yaolong Chen, Zhaoxiang Bian, La Duo, Qingguo Zhou, Lu Zhang, ADVANCED Working Group

The application of artificial intelligence (AI) in healthcare has become increasingly widespread, showing significant potential in assisting with diagnosis and treatment. However, generative AI (GAI) models often produce “hallucinations”—plausible but factually incorrect or unsubstantiated outputs—that threaten clinical decision-making and patient safety. This article systematically analyzes the causes of hallucinations across data, training, and inference dimensions and proposes multi-dimensional strategies to mitigate them. Our findings reveal three critical conclusions: The technical optimization through knowledge graphs and multi-stage training significantly reduces hallucinations, while clinical integration through expert feedback loops and multidisciplinary workflows enhances output reliability. Additionally, implementing robust evaluation systems that combine adversarial testing and real-world validation substantially improves factual accuracy in clinical settings. These integrated strategies underscore the importance of harmonizing technical advancements with clinical governance to develop trustworthy, patient-centric AI systems.

人工智能(AI)在医疗保健中的应用日益广泛,在辅助诊断和治疗方面显示出巨大的潜力。然而,生成式人工智能(GAI)模型经常产生“幻觉”——看似合理但实际上不正确或未经证实的输出——这威胁到临床决策和患者安全。本文系统地分析了数据、训练和推理维度上产生幻觉的原因,并提出了减轻幻觉的多维策略。我们的研究结果揭示了三个关键结论:通过知识图谱和多阶段培训的技术优化显著减少了幻觉,而通过专家反馈回路和多学科工作流程的临床整合提高了输出的可靠性。此外,实施强大的评估系统,结合对抗性测试和现实世界验证,大大提高了临床环境中的事实准确性。这些综合战略强调了协调技术进步与临床治理以开发可信赖的、以患者为中心的人工智能系统的重要性。
{"title":"Strategies for the Analysis and Elimination of Hallucinations in Artificial Intelligence Generated Medical Knowledge","authors":"Fengxian Chen,&nbsp;Yan Li,&nbsp;Yaolong Chen,&nbsp;Zhaoxiang Bian,&nbsp;La Duo,&nbsp;Qingguo Zhou,&nbsp;Lu Zhang,&nbsp;ADVANCED Working Group","doi":"10.1111/jebm.70075","DOIUrl":"10.1111/jebm.70075","url":null,"abstract":"<div>\u0000 \u0000 <p>The application of artificial intelligence (AI) in healthcare has become increasingly widespread, showing significant potential in assisting with diagnosis and treatment. However, generative AI (GAI) models often produce “hallucinations”—plausible but factually incorrect or unsubstantiated outputs—that threaten clinical decision-making and patient safety. This article systematically analyzes the causes of hallucinations across data, training, and inference dimensions and proposes multi-dimensional strategies to mitigate them. Our findings reveal three critical conclusions: The technical optimization through knowledge graphs and multi-stage training significantly reduces hallucinations, while clinical integration through expert feedback loops and multidisciplinary workflows enhances output reliability. Additionally, implementing robust evaluation systems that combine adversarial testing and real-world validation substantially improves factual accuracy in clinical settings. These integrated strategies underscore the importance of harmonizing technical advancements with clinical governance to develop trustworthy, patient-centric AI systems.</p>\u0000 </div>","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145124918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disease Burden of Gastrointestinal Tumors in China From 1990 to 2021, an Analysis for the Global Burden of Disease Study 2021 1990 - 2021年中国胃肠道肿瘤疾病负担——2021年全球疾病负担研究分析
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-09-20 DOI: 10.1111/jebm.70072
Lanwei Guo, Jiani Yuan, Lin Cai, Chenxin Zhu, Yan Zheng, Haiyan Yang, Yanyan Liu

Objective

China faces a significant burden of gastrointestinal tumors driven by socioeconomic, environmental, and lifestyle factors. Using GBD2021 data, this study analyses epidemiological trends and disease burden for six major gastrointestinal tumor cancers (esophagus, gastric, colorectum, liver, pancreas, gallbladder and biliary tract) in China (1990–2021).

Methods

The GBD 2021 was used to extract the incidence, mortality, and disability-adjusted life years (DALYs) data of gastrointestinal tumors in China. Age-standardized rates (ASRs) and 95% uncertainty intervals (UIs) were calculated. Temporal trends were assessed by joinpoint regression analysis, and average annual percent change (AAPC) and annual percentage change (APC) were calculated and analyzed stratified by gender and age group.

Results

In 2021, China recorded 1.96 million new gastrointestinal cancer cases, with 1.35 million deaths and 33.07 million DALYs. Gastric cancer led in mortality, and colorectal cancer demonstrated the most rapid incidence growth (AAPC = 1.68). Significant reductions were observed in gastric cancer age-standardized mortality rates (ASMR) (AAPC = –2.44) and esophageal cancer age-standardized disability-adjusted life year rates (ASDR) (AAPC = –2.31). Gender disparities were particularly pronounced in esophageal cancer, with the male-to-female mortality ratio (M/F) escalating from 2.50 (1990) to 4.12 (2021). The age group with the highest mortality burden was 70–74, while the age group with the most significant loss of DALYs was 65–69.

Conclusion

China has significantly reduced gastrointestinal cancer burden, but gender and age disparities persist, necessitating targeted interventions. Future efforts should focus on tertiary prevention for high-risk groups, especially males and the elderly, while enhancing molecular subtyping and regional data stratification for precision cancer control.

目的:受社会经济、环境和生活方式等因素的影响,中国面临着显著的胃肠道肿瘤负担。本研究采用GBD2021数据,分析1990-2021年中国六大胃肠道肿瘤(食道、胃、结直肠、肝、胰腺、胆囊和胆道)的流行趋势和疾病负担。方法:采用GBD 2021提取中国胃肠道肿瘤的发病率、死亡率和伤残调整生命年(DALYs)数据。计算年龄标准化率(ASRs)和95%不确定区间(UIs)。采用连接点回归分析评估时间趋势,按性别和年龄分层计算平均年变化百分比(AAPC)和年变化百分比(APC)。结果:2021年,中国新增胃肠道肿瘤病例196万例,死亡135万例,DALYs 3307万例。胃癌死亡率最高,结直肠癌发病率增长最快(AAPC = 1.68)。胃癌年龄标准化死亡率(ASMR) (AAPC = -2.44)和食管癌年龄标准化残疾调整生命年率(ASDR) (AAPC = -2.31)均有显著降低。性别差异在食管癌中尤为明显,男女死亡率(M/F)从2.50(1990年)上升到4.12(2021年)。死亡负担最高的年龄组为70-74岁,而残疾调整生命年损失最显著的年龄组为65-69岁。结论:中国胃肠道肿瘤负担显著减轻,但性别和年龄差异仍然存在,需要有针对性的干预措施。未来的工作应侧重于高危人群的三级预防,特别是男性和老年人,同时加强分子分型和区域数据分层,以实现精确的癌症控制。
{"title":"Disease Burden of Gastrointestinal Tumors in China From 1990 to 2021, an Analysis for the Global Burden of Disease Study 2021","authors":"Lanwei Guo,&nbsp;Jiani Yuan,&nbsp;Lin Cai,&nbsp;Chenxin Zhu,&nbsp;Yan Zheng,&nbsp;Haiyan Yang,&nbsp;Yanyan Liu","doi":"10.1111/jebm.70072","DOIUrl":"10.1111/jebm.70072","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <h3> Objective</h3>\u0000 \u0000 <p>China faces a significant burden of gastrointestinal tumors driven by socioeconomic, environmental, and lifestyle factors. Using GBD2021 data, this study analyses epidemiological trends and disease burden for six major gastrointestinal tumor cancers (esophagus, gastric, colorectum, liver, pancreas, gallbladder and biliary tract) in China (1990–2021).</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Methods</h3>\u0000 \u0000 <p>The GBD 2021 was used to extract the incidence, mortality, and disability-adjusted life years (DALYs) data of gastrointestinal tumors in China. Age-standardized rates (ASRs) and 95% uncertainty intervals (UIs) were calculated. Temporal trends were assessed by joinpoint regression analysis, and average annual percent change (AAPC) and annual percentage change (APC) were calculated and analyzed stratified by gender and age group.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Results</h3>\u0000 \u0000 <p>In 2021, China recorded 1.96 million new gastrointestinal cancer cases, with 1.35 million deaths and 33.07 million DALYs. Gastric cancer led in mortality, and colorectal cancer demonstrated the most rapid incidence growth (AAPC = 1.68). Significant reductions were observed in gastric cancer age-standardized mortality rates (ASMR) (AAPC = –2.44) and esophageal cancer age-standardized disability-adjusted life year rates (ASDR) (AAPC = –2.31). Gender disparities were particularly pronounced in esophageal cancer, with the male-to-female mortality ratio (M/F) escalating from 2.50 (1990) to 4.12 (2021). The age group with the highest mortality burden was 70–74, while the age group with the most significant loss of DALYs was 65–69.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Conclusion</h3>\u0000 \u0000 <p>China has significantly reduced gastrointestinal cancer burden, but gender and age disparities persist, necessitating targeted interventions. Future efforts should focus on tertiary prevention for high-risk groups, especially males and the elderly, while enhancing molecular subtyping and regional data stratification for precision cancer control.</p>\u0000 </section>\u0000 </div>","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jebm.70072","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145092174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficacy and Safety of Xiaoer Chiqiao Qingre Syrup in the Treatment of Pedo-Anemopyretic Cold With Stagnant Syndrome in Children: A Multicenter, Randomized, Double-Blind, Double-Dummy, Parallel, Positive-Controlled, Noninferiority Trial 小儿赤翘清热糖浆治疗小儿风热寒瘀证的疗效及安全性:多中心、随机、双盲、双虚拟、平行、正对照、非劣效性试验
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-09-18 DOI: 10.1111/jebm.70077
Chengliang Zhong, Shengxuan Guo, Yimin Yang, Aizhen Wang, Zheng Xue, Mengqing Wang, Guihua Song, Kun Yang, Hai Wang, Wei Zhong, Ya Gao, Zhigang Liu, Minghui Wang, Yuyan Chen, Xinmin Li, Siyuan Hu
{"title":"Efficacy and Safety of Xiaoer Chiqiao Qingre Syrup in the Treatment of Pedo-Anemopyretic Cold With Stagnant Syndrome in Children: A Multicenter, Randomized, Double-Blind, Double-Dummy, Parallel, Positive-Controlled, Noninferiority Trial","authors":"Chengliang Zhong,&nbsp;Shengxuan Guo,&nbsp;Yimin Yang,&nbsp;Aizhen Wang,&nbsp;Zheng Xue,&nbsp;Mengqing Wang,&nbsp;Guihua Song,&nbsp;Kun Yang,&nbsp;Hai Wang,&nbsp;Wei Zhong,&nbsp;Ya Gao,&nbsp;Zhigang Liu,&nbsp;Minghui Wang,&nbsp;Yuyan Chen,&nbsp;Xinmin Li,&nbsp;Siyuan Hu","doi":"10.1111/jebm.70077","DOIUrl":"10.1111/jebm.70077","url":null,"abstract":"","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145085956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cardiac Autonomic Dysfunction in Obstructive Sleep Apnea: The Hidden Role of Vitamin D Deficiency 阻塞性睡眠呼吸暂停的心脏自主神经功能障碍:维生素D缺乏的隐藏作用。
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-09-17 DOI: 10.1111/jebm.70071
Huai Heng Loh, Siow Phing Tay, Ai Jiun Koa, Mei Ching Yong, Asri Said, Chee Shee Chai, Natasya Marliana Abdul Malik, Anselm Ting Su, Bonnie Bao Chee Tang, Florence Hui Sieng Tan, Norlela Sukor
<p>Obstructive sleep apnea (OSA) is a common sleep-breathing disorder, affecting an estimated 9–38% of adults worldwide, with prevalence increasing in parallel with rising obesity rates and aging population [<span>1</span>]. While men are still more likely to be diagnosed with OSA, this condition is becoming more common in women [<span>2</span>]. Coronary artery disease (CAD), a common complication of OSA, is driven by a combination of intermittent hypoxia, metabolic dysfunction, and systemic inflammation, which together accelerate atherosclerosis [<span>3</span>]. This underscores the importance of identifying modifiable factors to mitigate cardiovascular risks in these patients. A key mechanism linking OSA to heightened cardiovascular risks in CAD is cardiac autonomic dysfunction, characterized by increased sympathetic activity and reduced parasympathetic tone. These alterations can be assessed through heart rate variability (HRV), a well-established predictor of adverse cardiovascular outcomes.</p><p>Patients with OSA often exhibit lower serum 25-hydoxyvitamin D [25(OH)D] levels, with a high prevalence of vitamin D deficiency (VDD) compared to those without OSA [<span>4</span>]. OSA and VDD share overlapping risk factors, with interplay between obesity, hypoxia, reduced vitamin D absorption, inflammation, and effects on upper airway muscle function [<span>4</span>]. VDD is linked to higher cardiovascular risk and mortality and may be associated with poorer HRV [<span>5, 6</span>], although evidence remains limited.</p><p>This study aimed to investigate the role of vitamin D in cardiovascular autonomic function among OSA patients, addressing a critical gap in understanding the interplay between VDD, OSA, and cardiovascular risks. Given the contribution of systemic inflammation and endothelial dysfunction to cardiovascular risks in OSA, we also assessed high-sensitivity C-reactive protein (hsCRP) and brachial artery flow-mediated dilatation (BAFMD) as secondary endpoints, providing complementary insights alongside cardiac autonomic function. The study was conducted in accordance with the principles outlined in the Declaration of Helsinki. Ethical approval was obtained from the Medical Research Ethical Committee of Malaysia (NMRR-21-1472-60812) and UNIMAS Medical Ethics Committee (UNIMAS/TNC(PI)/09-65/01). All patients who fulfilled study criteria were recruited after informed consent obtained. This study was part of a broader research effort, <i>Cardiovascular Impacts of RAAS and Vitamin D in Obstructive Sleep Apnea (CARD-OSA)</i>, which sought to explore the complex roles of renin-angiotensin-aldosterone system and vitamin D in OSA and their broader implications for patient health.</p><p>Designed as a cross-sectional study, it was conducted at a tertiary sleep center from June 2022 to May 2024. Briefly, patients aged ≥18 years with a body mass index (BMI) ≥27.5 kg/m<sup>2</sup>, and confirmed OSA were recruited. The BMI cut-off used, following
阻塞性睡眠呼吸暂停(OSA)是一种常见的睡眠呼吸障碍,全球约有9-38%的成年人受其影响,其患病率与肥胖率上升和人口老龄化同步上升。虽然男性仍然更容易被诊断为阻塞性睡眠呼吸暂停,但这种情况在女性中越来越普遍。冠状动脉疾病(CAD)是OSA的常见并发症,由间歇性缺氧、代谢功能障碍和全身性炎症共同驱动,这些因素共同加速动脉粥样硬化的发生。这强调了确定可改变因素以减轻这些患者心血管风险的重要性。将OSA与冠心病心血管风险增加联系起来的一个关键机制是心脏自主神经功能障碍,其特征是交感神经活动增加和副交感神经张力降低。这些改变可以通过心率变异性(HRV)来评估,HRV是一种公认的心血管不良结局的预测因子。OSA患者通常表现出较低的血清25-羟基维生素D [25(OH)D]水平,与无OSA患者相比,维生素D缺乏症(VDD)的患病率较高。OSA和VDD具有重叠的危险因素,肥胖、缺氧、维生素D吸收减少、炎症和上呼吸道肌肉功能影响[4]之间存在相互作用。VDD与较高的心血管风险和死亡率有关,并可能与较差的HRV有关[5,6],尽管证据仍然有限。本研究旨在探讨维生素D在OSA患者心血管自主功能中的作用,解决VDD、OSA和心血管风险之间相互作用的关键空白。考虑到全身性炎症和内皮功能障碍对OSA患者心血管风险的影响,我们还评估了高敏c反应蛋白(hsCRP)和肱动脉血流介导的扩张(BAFMD)作为次要终点,提供了与心脏自主神经功能互补的见解。这项研究是按照《赫尔辛基宣言》所概述的原则进行的。获得了马来西亚医学研究伦理委员会(NMRR-21-1472-60812)和UNIMAS医学伦理委员会(UNIMAS/TNC(PI)/09-65/01)的伦理批准。所有符合研究标准的患者在获得知情同意后被招募。这项研究是一项更广泛的研究工作的一部分,RAAS和维生素D对阻塞性睡眠呼吸暂停(CARD-OSA)的心血管影响,该研究旨在探索肾素-血管紧张素-醛固酮系统和维生素D在OSA中的复杂作用及其对患者健康的更广泛影响。该研究是一项横断面研究,于2022年6月至2024年5月在一家三级睡眠中心进行。简而言之,招募年龄≥18岁、体重指数(BMI)≥27.5 kg/m2且确诊为OSA的患者。根据世卫组织亚太地区肥胖标准,BMI临界值的使用旨在关注肥胖个体,这些个体包括阻塞性睡眠呼吸暂停和室性阻塞性睡眠呼吸暂停的高风险亚组。那些服用维生素D和钙补充剂、患有慢性肾病、恶性肿瘤、心律异常、充血性心力衰竭、甲状腺或甲状旁腺疾病不受控制以及怀孕的人被排除在研究之外。虽然研究参与者没有正式筛查冠心病,但医疗记录中的合并症数据被记录下来。所有参与者都接受了人体测量、血清25(OH)D、代谢参数、hsCRP和HRV的评估。使用Polar H10腕带传感器评估心脏自主功能,短期HRV数据超过3分钟,长期HRV分析使用24小时动态心电图监测。根据方案[7]对BAFMD的超声进行评估。回归分析的潜在混杂因素是根据先前文献中HRV和维生素D状态的确定联系及其在OSA中的临床相关性来选择的。用于分析的HRV参数及其正常值和范围分别显示在表S1和表S2中。OSA严重程度分为轻度(呼吸暂停-低通气指数,AHI 5 ~ 15/h)、中度(AHI 15 ~ 30/h)和重度(AHI≥30/h)。根据血清25(OH)D水平对维生素D进行分类,即≤20 ng/mL为VDD,≥20 ng/mL为非VDD (nVDD)。BAFMD评估的内皮功能正常定义为≥7.1%,这是区分心血管风险个体的最佳临界值。共筛查797例疑似OSA患者,共招募204例(平均年龄43.4岁,49%为男性)(图S1)。大多数患者有严重的OSA, AHI中位数为39.0/h(范围为30.1-125.9 /h)。研究参与者的人口统计数据、合并症、代谢谱、心血管危险指标和HRV参数显示在表S3中。与nVDD组相比,VDD患者更年轻,主要由马来族和女性参与者组成,血脂异常患病率较低,但BMI较高。 除此之外,其他合并症和代谢参数没有显著差异。与nVDD组相比,VDD患者的hsCRP较高(表S3), HRV较差(表1)。然而,在BAFMD方面没有差异。Pearson相关分析显示血清25(OH)D水平与交感神经系统(SNS)指数(r = -0.185, p = -0.009)和应激指数(r = -0.145, p = 0.041)呈负相关(表S4)。如多元线性回归分析(表S5)所示,即使在调整了混杂因素后,这种关联仍然显著。选择调整R2最高的最佳拟合模型,导致两种模型之间的变量存在差异。我们的研究揭示了两个关键发现:首先,与nVDD患者相比,OSA合并VDD患者表现出较低的HRV和较高的hsCRP;其次,较低的维生素D浓度与较差的HRV相关,特别是在SNS激活增强方面,SNS和应激指数显著升高。这些发现强调了OSA患者维生素D状态与心脏自主神经功能之间复杂的相互作用。OSA和VDD之间的关联已经得到了很好的确立,在之前的文献[4]中详细描述了这种双向关系。尽管维生素D缺乏症是否直接导致不良心血管结果的证据仍不一致,但已经提出了几种潜在的途径。这些机制包括氧化应激和炎症,它们促进血管损伤,反映在VDD组观察到较高的hsCRP水平。内皮功能障碍(以一氧化氮可用性和血管反应性受损为特征)在我们的队列中并不明显,而心脏自主控制的失调是明显的,VDD患者的HRV指数较差,表明交感神经占优,副交感神经活动减少。维生素D通过钙代谢和VDR活性影响心脏组织并调节心脏收缩力。低维生素D水平也与结构和离子通道重塑有关,导致复极间隔延长和副交感神经活动异常[10]。这与我们的研究结果一致,即使在调整混杂因素后,血清25(OH)D水平与HRV参数,特别是SNS和应激指数显着相关,尽管调整后的低R2表明该模型解释了HRV方差的适度比例。这些结果表明,维生素D缺乏症和OSA严重程度可能通过增加SNS活动和破坏自主神经平衡,在一定程度上导致OSA患者心血管风险升高。HRV是一种对心脏自主神经功能的无创测量,反映了SNS和PNS活动的波动。HRV异常与心血管疾病和死亡率增加有关。HRV降低,表明SNS活动增强或PNS活动减弱,与不良结局有关,包括恶性心律失常和心源性猝死[11]。与24小时记录相比,短期HRV评估,如在我们的研究中进行的评估,因其预测死亡率的能力和人工制品处理的便利性而受到欢迎,而24小时记录更容易受到来自日常活动的个体间差异的影响。在我们的研究中使用Polar H10设备确保了这些指标的精确测量,进一步加强了我们研究结果的可靠性。尽管我们的研究具有稳健性,但仍应注意到一些局限性。首先,我们研究的横断面设计排除了确定OSA患者VDD与心血管风险之间的因果关系。其次,由于资源和预算的限制,没有研究与维生素D相关的遗传多态性,例如涉及维生素D结合蛋白和维生素D受体的遗传多态性。这些遗传因素可能进一步阐明维生素D状态与心脏自主神经功能之间关系的个体差异。第三,由于我们的队列包括较高比例的女性(51%)和中年人,研究结果可能无法推广到更广泛的OSA人群。这可能反映了当地妇女和中年人的转诊模式和求医行为[13,14]。此外,在我们的研究中
{"title":"Cardiac Autonomic Dysfunction in Obstructive Sleep Apnea: The Hidden Role of Vitamin D Deficiency","authors":"Huai Heng Loh,&nbsp;Siow Phing Tay,&nbsp;Ai Jiun Koa,&nbsp;Mei Ching Yong,&nbsp;Asri Said,&nbsp;Chee Shee Chai,&nbsp;Natasya Marliana Abdul Malik,&nbsp;Anselm Ting Su,&nbsp;Bonnie Bao Chee Tang,&nbsp;Florence Hui Sieng Tan,&nbsp;Norlela Sukor","doi":"10.1111/jebm.70071","DOIUrl":"10.1111/jebm.70071","url":null,"abstract":"&lt;p&gt;Obstructive sleep apnea (OSA) is a common sleep-breathing disorder, affecting an estimated 9–38% of adults worldwide, with prevalence increasing in parallel with rising obesity rates and aging population [&lt;span&gt;1&lt;/span&gt;]. While men are still more likely to be diagnosed with OSA, this condition is becoming more common in women [&lt;span&gt;2&lt;/span&gt;]. Coronary artery disease (CAD), a common complication of OSA, is driven by a combination of intermittent hypoxia, metabolic dysfunction, and systemic inflammation, which together accelerate atherosclerosis [&lt;span&gt;3&lt;/span&gt;]. This underscores the importance of identifying modifiable factors to mitigate cardiovascular risks in these patients. A key mechanism linking OSA to heightened cardiovascular risks in CAD is cardiac autonomic dysfunction, characterized by increased sympathetic activity and reduced parasympathetic tone. These alterations can be assessed through heart rate variability (HRV), a well-established predictor of adverse cardiovascular outcomes.&lt;/p&gt;&lt;p&gt;Patients with OSA often exhibit lower serum 25-hydoxyvitamin D [25(OH)D] levels, with a high prevalence of vitamin D deficiency (VDD) compared to those without OSA [&lt;span&gt;4&lt;/span&gt;]. OSA and VDD share overlapping risk factors, with interplay between obesity, hypoxia, reduced vitamin D absorption, inflammation, and effects on upper airway muscle function [&lt;span&gt;4&lt;/span&gt;]. VDD is linked to higher cardiovascular risk and mortality and may be associated with poorer HRV [&lt;span&gt;5, 6&lt;/span&gt;], although evidence remains limited.&lt;/p&gt;&lt;p&gt;This study aimed to investigate the role of vitamin D in cardiovascular autonomic function among OSA patients, addressing a critical gap in understanding the interplay between VDD, OSA, and cardiovascular risks. Given the contribution of systemic inflammation and endothelial dysfunction to cardiovascular risks in OSA, we also assessed high-sensitivity C-reactive protein (hsCRP) and brachial artery flow-mediated dilatation (BAFMD) as secondary endpoints, providing complementary insights alongside cardiac autonomic function. The study was conducted in accordance with the principles outlined in the Declaration of Helsinki. Ethical approval was obtained from the Medical Research Ethical Committee of Malaysia (NMRR-21-1472-60812) and UNIMAS Medical Ethics Committee (UNIMAS/TNC(PI)/09-65/01). All patients who fulfilled study criteria were recruited after informed consent obtained. This study was part of a broader research effort, &lt;i&gt;Cardiovascular Impacts of RAAS and Vitamin D in Obstructive Sleep Apnea (CARD-OSA)&lt;/i&gt;, which sought to explore the complex roles of renin-angiotensin-aldosterone system and vitamin D in OSA and their broader implications for patient health.&lt;/p&gt;&lt;p&gt;Designed as a cross-sectional study, it was conducted at a tertiary sleep center from June 2022 to May 2024. Briefly, patients aged ≥18 years with a body mass index (BMI) ≥27.5 kg/m&lt;sup&gt;2&lt;/sup&gt;, and confirmed OSA were recruited. The BMI cut-off used, following ","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jebm.70071","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145080912","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating Evidence With AI: ChatGPT's Performance on the PEDro Scale in Physical Therapy Research 用AI评估证据:ChatGPT在物理治疗研究中PEDro量表的表现。
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-09-17 DOI: 10.1111/jebm.70074
Jie Hao, Zixuan Yao, Andréas Remis, Xin Yu
{"title":"Evaluating Evidence With AI: ChatGPT's Performance on the PEDro Scale in Physical Therapy Research","authors":"Jie Hao,&nbsp;Zixuan Yao,&nbsp;Andréas Remis,&nbsp;Xin Yu","doi":"10.1111/jebm.70074","DOIUrl":"10.1111/jebm.70074","url":null,"abstract":"","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145080958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chinese Guideline for the Rehabilitation Treatment of Osteoporosis (2024 Edition) 中国骨质疏松症康复治疗指南(2024年版)
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-09-13 DOI: 10.1111/jebm.70056
Haiming Wang, Yi Li, Chengqi He

Osteoporosis (OP) is one of the most serious health problems, causing a huge economic burden to patients, families, and society. OP rehabilitation treatment plays an important role in pain-relieving, reducing the risk of fracture, improving the ability of daily activities, and promoting the healing of OP fractures, and has been increasingly valued and recommended by domestic and foreign guidelines. This guideline was updated based on the Chinese Expert Consensus on Rehabilitation Intervention for Primary OP 2019 editions and was initiated by the Chinese Society of Physical Medicine and rehabilitation and the West China Hospital of Sichuan University. This guideline development was guided by domestic and international guideline development methods and principles and selected through clinical issue screening and deconstruction and two rounds of Delphi questionnaire consultation. The International Classification of Functioning, Disability and Health (ICF) was used as the theoretical framework, and the Grading of Recommendations Assessment, Development and Evaluation (GRADE) was used based on the best available evidence. The guideline was developed based on the best available evidence, used the GRADE method to grade the quality of evidence and recommendations, and reported according to the Reporting Items for Practice Guidelines in Healthcare. Taking into account patients' preferences and values and the needs of Chinese clinical practice, it puts forward 22 recommendations covering seven aspects (rehabilitation assessment, therapeutic modalities, occupational therapy, assistive devices, cognitive behavior and psychological therapy, traditional Chinese medicine therapy, and health education) to systematically standardize OP rehabilitation.

骨质疏松症(Osteoporosis, OP)是最严重的健康问题之一,给患者、家庭和社会造成了巨大的经济负担。OP康复治疗在缓解疼痛、降低骨折风险、提高日常活动能力、促进OP骨折愈合等方面具有重要作用,越来越受到国内外指南的重视和推荐。本指南由中国物理医学与康复学会和四川大学华西医院联合发起,根据《中国初级OP康复干预专家共识2019》进行更新。本指南制定以国内外指南制定方法和原则为指导,通过临床问题筛选解构和两轮德尔菲问卷咨询进行选择。使用国际功能、残疾和健康分类(ICF)作为理论框架,并根据现有的最佳证据使用建议评估、发展和评价分级(GRADE)。该指南是根据现有的最佳证据制定的,使用GRADE方法对证据和建议的质量进行分级,并根据《医疗保健实践指南报告项目》进行报告。结合患者偏好和价值观,结合我国临床实践需要,提出康复评估、治疗方式、职业治疗、辅助器具、认知行为与心理治疗、中医治疗、健康教育等7个方面的22条建议,系统规范OP康复。
{"title":"Chinese Guideline for the Rehabilitation Treatment of Osteoporosis (2024 Edition)","authors":"Haiming Wang,&nbsp;Yi Li,&nbsp;Chengqi He","doi":"10.1111/jebm.70056","DOIUrl":"10.1111/jebm.70056","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <p>Osteoporosis (OP) is one of the most serious health problems, causing a huge economic burden to patients, families, and society. OP rehabilitation treatment plays an important role in pain-relieving, reducing the risk of fracture, improving the ability of daily activities, and promoting the healing of OP fractures, and has been increasingly valued and recommended by domestic and foreign guidelines. This guideline was updated based on the Chinese Expert Consensus on Rehabilitation Intervention for Primary OP 2019 editions and was initiated by the Chinese Society of Physical Medicine and rehabilitation and the West China Hospital of Sichuan University. This guideline development was guided by domestic and international guideline development methods and principles and selected through clinical issue screening and deconstruction and two rounds of Delphi questionnaire consultation. The International Classification of Functioning, Disability and Health (ICF) was used as the theoretical framework, and the Grading of Recommendations Assessment, Development and Evaluation (GRADE) was used based on the best available evidence. The guideline was developed based on the best available evidence, used the GRADE method to grade the quality of evidence and recommendations, and reported according to the Reporting Items for Practice Guidelines in Healthcare. Taking into account patients' preferences and values and the needs of Chinese clinical practice, it puts forward 22 recommendations covering seven aspects (rehabilitation assessment, therapeutic modalities, occupational therapy, assistive devices, cognitive behavior and psychological therapy, traditional Chinese medicine therapy, and health education) to systematically standardize OP rehabilitation.</p>\u0000 </section>\u0000 </div>","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145054173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Evidence to Recommendations With Large Language Models: A Feasibility Study 从证据到建议与大型语言模型:可行性研究。
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-09-11 DOI: 10.1111/jebm.70067
Weilong Zhao, Danni Xia, Ziying Ye, Honghao Lai, Mingyao Sun, Jiajie Huang, Jiayi Liu, Jianing Liu, Long Ge

Background

Formulating evidene-based recommendations for practice guidelines is a complex process that requires substantial expertise. Artificial intelligence (AI) is promising in accelerating the guideline development process. This study evaluates the feasibility of leveraging five large language models (LLMs)—ChatGPT-3.5, Claude-3 sonnet, Bard, ChatGLM-4, Kimi chat—to generate recommendations based on structured evidence, assesses their concordance, and explores the potential for AI.

Methods

The general and specific prompts were drafted and validated. We searched PubMed to include evidence-based guidelines related to health and lifestyle. We randomly selected one recommendation from every included guideline as the sample and extracted the evidence base supporting the selected recommendations. The prompts and evidence were fed into five LLMs to generate structured recommendations.

Results

ChatGPT-3.5 demonstrated the highest proficiency in comprehensively extracting and synthesizing evidence to formulate novel insights. Bard consistently adhered to existing guideline principles, aligning its algorithm with these tenets. Claude generated fewer topical recommendations, focusing instead on evidence analysis and mitigating irrelevant information. ChatGLM-4 exhibited a balanced approach, combining evidence extraction with adherence to guideline principles. Kimi showed potential in generating concise and targeted recommendations. Among the six generated recommendations, average consistency ranged from 50% to 91.7%.

Conclusion

The findings of this study suggest that LLMs hold immense potential in accelerating the formulation of evidence-based recommendations. LLMs can rapidly and comprehensively extract and synthesize relevant information from structured evidence, generating recommendations that align with the available evidence.

背景:为实践指南制定基于证据的建议是一个复杂的过程,需要大量的专业知识。人工智能(AI)有望加快指南的制定过程。本研究评估了利用五种大型语言模型(llm)——chatggt -3.5、Claude-3十四行诗、Bard、ChatGLM-4、Kimi聊天——基于结构化证据生成推荐的可行性,评估了它们的一致性,并探索了人工智能的潜力。方法:编制通用提示和特定提示并进行验证。我们在PubMed上搜索了与健康和生活方式相关的循证指南。我们从每个纳入的指南中随机选择一个建议作为样本,并提取支持所选建议的证据基础。这些提示和证据被输入5个法学硕士,以生成结构化的建议。结果:ChatGPT-3.5在综合提取和合成证据以形成新见解方面表现出最高的熟练程度。巴德始终坚持现有的指导原则,使其算法与这些原则保持一致。克劳德提出的专题建议较少,而是侧重于证据分析和减轻不相关的信息。ChatGLM-4展示了一种平衡的方法,将证据提取与遵守指导原则相结合。Kimi展示了在生成简洁而有针对性的推荐方面的潜力。在生成的六项建议中,平均一致性从50%到91.7%不等。结论:本研究的结果表明,法学硕士在加速循证建议的制定方面具有巨大的潜力。法学硕士可以快速、全面地从结构化证据中提取和综合相关信息,生成与现有证据一致的建议。
{"title":"From Evidence to Recommendations With Large Language Models: A Feasibility Study","authors":"Weilong Zhao,&nbsp;Danni Xia,&nbsp;Ziying Ye,&nbsp;Honghao Lai,&nbsp;Mingyao Sun,&nbsp;Jiajie Huang,&nbsp;Jiayi Liu,&nbsp;Jianing Liu,&nbsp;Long Ge","doi":"10.1111/jebm.70067","DOIUrl":"10.1111/jebm.70067","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <h3> Background</h3>\u0000 \u0000 <p>Formulating evidene-based recommendations for practice guidelines is a complex process that requires substantial expertise. Artificial intelligence (AI) is promising in accelerating the guideline development process. This study evaluates the feasibility of leveraging five large language models (LLMs)—ChatGPT-3.5, Claude-3 sonnet, Bard, ChatGLM-4, Kimi chat—to generate recommendations based on structured evidence, assesses their concordance, and explores the potential for AI.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Methods</h3>\u0000 \u0000 <p>The general and specific prompts were drafted and validated. We searched PubMed to include evidence-based guidelines related to health and lifestyle. We randomly selected one recommendation from every included guideline as the sample and extracted the evidence base supporting the selected recommendations. The prompts and evidence were fed into five LLMs to generate structured recommendations.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Results</h3>\u0000 \u0000 <p>ChatGPT-3.5 demonstrated the highest proficiency in comprehensively extracting and synthesizing evidence to formulate novel insights. Bard consistently adhered to existing guideline principles, aligning its algorithm with these tenets. Claude generated fewer topical recommendations, focusing instead on evidence analysis and mitigating irrelevant information. ChatGLM-4 exhibited a balanced approach, combining evidence extraction with adherence to guideline principles. Kimi showed potential in generating concise and targeted recommendations. Among the six generated recommendations, average consistency ranged from 50% to 91.7%.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Conclusion</h3>\u0000 \u0000 <p>The findings of this study suggest that LLMs hold immense potential in accelerating the formulation of evidence-based recommendations. LLMs can rapidly and comprehensively extract and synthesize relevant information from structured evidence, generating recommendations that align with the available evidence.</p>\u0000 </section>\u0000 </div>","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145040250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Developing and Externally Validating the Multivariable Prediction Model for White-Coat Hypertension 白大褂高血压多变量预测模型的建立与外部验证。
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-09-09 DOI: 10.1111/jebm.70064
Shali Hao, Xiaomei Zhang, Lingxiao Li, Libin Mo, Yangguang Liu, Jiahuan Li, Wenli Wang, Jiandi Wu, Yuli Huang

Aim

The white-coat hypertension (WCH) detection by monitoring the out-of-office blood pressure (BP) consumes resources and time. This study aimed at developing the prediction model based on patients’ characteristics obtained from clinical data.

Methods

Individuals who participated in two large hospitals health check-up examination were screened. Participants with twice readings of elevated office blood pressure in different visits, while no history of hypertension were included. Combination with home blood pressure monitoring, participants were defined as having WCH or sustained hypertension (SH), respectively. Independent predictors were found by employing multivariate logistic regression on training set. A nomogram was built using independent predictors.

Results

In total, 383 outpatients with elevated office blood pressure were enrolled. Two hundred and thirty-three of them from one hospital were included for the development of the prediction model (training sets), and 150 patients from another independent study site were included for external validation (external validation sets). We identified six predictors including office systolic blood pressure, body mass index, sex, total cholesterol, homocysteine, and heart rate being linked to WCH diagnosis. Area under receiver operating characteristic curve (AUC) for the model was 0.792 and 0.692 regarding training and external validation sets, respectively. The calibration curve and decision curve analyses further demonstrated that the model had good performance for distinguishing WCH from SH.

Conclusions

This prediction model can help clinicians to identify WCH individuals from those with SH, providing an effective tool for guiding personalized recommendations of abnormal blood pressure management.

目的:监测办公室外血压(BP)检测白大衣高血压(WCH)消耗资源和时间。本研究旨在建立基于临床资料中患者特征的预测模型。方法:对参加两家大型医院健康体检的个体进行筛查。参与者在不同的就诊中有两次血压升高的读数,但没有高血压史。结合家庭血压监测,参与者分别被定义为WCH或持续性高血压(SH)。对训练集进行多元逻辑回归,发现独立的预测因子。使用独立预测因子建立了nomogram。结果:共纳入383例办公室血压升高的门诊患者。来自一家医院的233名患者被纳入预测模型(训练集),来自另一个独立研究地点的150名患者被纳入外部验证(外部验证集)。我们确定了六个预测因素,包括办公室收缩压、体重指数、性别、总胆固醇、同型半胱氨酸和心率与WCH诊断有关。对于训练集和外部验证集,模型的接收者工作特征曲线下面积(AUC)分别为0.792和0.692。校正曲线和决策曲线分析进一步表明,该预测模型具有较好的区分WCH和SH的效果。结论:该预测模型可以帮助临床医生区分WCH和SH患者,为指导个性化的异常血压管理建议提供有效工具。
{"title":"Developing and Externally Validating the Multivariable Prediction Model for White-Coat Hypertension","authors":"Shali Hao,&nbsp;Xiaomei Zhang,&nbsp;Lingxiao Li,&nbsp;Libin Mo,&nbsp;Yangguang Liu,&nbsp;Jiahuan Li,&nbsp;Wenli Wang,&nbsp;Jiandi Wu,&nbsp;Yuli Huang","doi":"10.1111/jebm.70064","DOIUrl":"10.1111/jebm.70064","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <h3> Aim</h3>\u0000 \u0000 <p>The white-coat hypertension (WCH) detection by monitoring the out-of-office blood pressure (BP) consumes resources and time. This study aimed at developing the prediction model based on patients’ characteristics obtained from clinical data.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Methods</h3>\u0000 \u0000 <p>Individuals who participated in two large hospitals health check-up examination were screened. Participants with twice readings of elevated office blood pressure in different visits, while no history of hypertension were included. Combination with home blood pressure monitoring, participants were defined as having WCH or sustained hypertension (SH), respectively. Independent predictors were found by employing multivariate logistic regression on training set. A nomogram was built using independent predictors.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Results</h3>\u0000 \u0000 <p>In total, 383 outpatients with elevated office blood pressure were enrolled. Two hundred and thirty-three of them from one hospital were included for the development of the prediction model (training sets), and 150 patients from another independent study site were included for external validation (external validation sets). We identified six predictors including office systolic blood pressure, body mass index, sex, total cholesterol, homocysteine, and heart rate being linked to WCH diagnosis. Area under receiver operating characteristic curve (AUC) for the model was 0.792 and 0.692 regarding training and external validation sets, respectively. The calibration curve and decision curve analyses further demonstrated that the model had good performance for distinguishing WCH from SH.</p>\u0000 </section>\u0000 \u0000 <section>\u0000 \u0000 <h3> Conclusions</h3>\u0000 \u0000 <p>This prediction model can help clinicians to identify WCH individuals from those with SH, providing an effective tool for guiding personalized recommendations of abnormal blood pressure management.</p>\u0000 </section>\u0000 </div>","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145029802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Burden of Myelodysplastic Syndromes: A Literature Review of Epidemiological and Humanistic Aspects 骨髓增生异常综合征的负担:流行病学和人文方面的文献综述。
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-09-09 DOI: 10.1111/jebm.70065
Shitong Xie, Jiajun Yan, Preston Tse, Brittany Humphries, Feng Xie
<p>Myelodysplastic syndromes (MDS) are a group of clonal neoplastic bone marrow disorders characterized by ineffective hematopoiesis and cytopenia [<span>1</span>]. Given the increasing incidence of MDS in aging populations and its chronic disease burden, understanding its epidemiological trends and humanistic burden is crucial. Despite several reviews that have examined epidemiological or humanistic burden of MD, existing literature is often limited in scope, focusing on specific subgroups, for example, higher-risk MDS, or single geographic regions. Furthermore, heterogeneous study methodologies have led to inconsistent estimates of MDS incidence, prevalence, and mortality, complicating comparisons across populations. The aim of this study was to conduct a scoping literature review to summarize both the epidemiological and humanistic burden of MDS.</p><p>A systematic search was conducted to understand the epidemiological, humanistic, and economic burden of MDS. The search was undertaken from inception to July 13, 2021 in Embase and Medline. A detailed search strategy is presented in Table S1. Bibliographic searches of systematic reviews identified during the screening were conducted to validate the search strategy and identify any additional publications that were not retrieved through the original search. Studies were included if they (1) included patients with MDS and assessed epidemiologic (i.e., prevalence, incidence or mortality) or humanistic (i.e., patient-reported outcome using either a generic or disease-specific instrument) outcomes; (2) had a sample of more than 100, and (3) were published in English. Systematic reviews were excluded, but if relevant, their reference lists were cross-referenced. Study selection involved two phases: an initial screening based on title and abstract followed by a full-text review. Both phases of screening were performed by two reviewers independently and in duplicate. Any discrepancies were solved by discussion between the two reviewers or consulting a third reviewer when necessary. A study mapping process was conducted to match publications reporting on the same study. For included studies, information on study and patient characteristics, and outcomes of interest were extracted by two reviewers independently using a standardized data extraction form. Discrepancies were reconciled by discussion between the two reviewers or in consultation with a third reviewer when necessary.</p><p>The study employed descriptive analyses to summarize the epidemiological and HRQoL outcomes in patients with MDS. The descriptive statistics included age-standardized incidence rate (ASIR), prevalence, and survival rates. Data were organized in Excel and visualized with R using distribution plots and trend graphs.</p><p>A total of 63 papers (56 studies) reported on the epidemiological burden of MDS and 57 papers (46 studies) on the HRQoL burden of MDS. The selection process is summarized in Figure S1. Details on studies repor
骨髓增生异常综合征(MDS)是一组以造血功能低下和细胞减少为特征的克隆性肿瘤骨髓疾病。鉴于MDS在老年人群中发病率的增加及其慢性疾病负担,了解其流行病学趋势和人文负担至关重要。尽管有几篇综述研究了MD的流行病学或人文负担,但现有文献的范围往往有限,主要集中在特定的亚群,例如高危MDS或单一地理区域。此外,异质性研究方法导致MDS发病率、患病率和死亡率的估计不一致,使人群间的比较复杂化。本研究的目的是进行大范围的文献综述,总结MDS的流行病学和人文负担。为了了解MDS的流行病学、人文和经济负担,进行了系统的检索。搜索从开始到2021年7月13日在Embase和Medline进行。表S1给出了详细的搜索策略。对筛选过程中确定的系统综述进行书目检索,以验证检索策略,并确定未通过原始检索检索到的任何其他出版物。纳入以下研究:(1)纳入MDS患者并评估流行病学(即患病率、发病率或死亡率)或人文病学(即使用通用或疾病特异性工具的患者报告结果)结果;(2)样本超过100人,(3)以英文出版。系统评价被排除在外,但如果相关,他们的参考文献列表被交叉引用。研究选择包括两个阶段:基于标题和摘要的初步筛选,然后是全文审查。两个阶段的筛选都是由两个独立的审稿人进行的,并且是重复的。任何差异均由两位审稿人讨论解决,必要时可咨询第三方审稿人。对同一研究的出版物进行了研究映射过程的匹配。对于纳入的研究,有关研究和患者特征的信息以及感兴趣的结果由两名评论者使用标准化数据提取表独立提取。通过两位审稿人之间的讨论或必要时与第三位审稿人协商来协调差异。本研究采用描述性分析对MDS患者的流行病学和HRQoL结果进行总结。描述性统计包括年龄标准化发病率(ASIR)、患病率和生存率。数据在Excel中组织,并使用R使用分布图和趋势图进行可视化。共有63篇论文(56项研究)报道MDS的流行病学负担,57篇论文(46项研究)报道MDS的HRQoL负担。选择过程如图S1所示。报告流行病学和HRQoL结果的研究详情分别见表S2和表S3。如表1所示,56项报告流行病学结果的研究中,数据库分析40项(71.4%),回顾性队列研究11项(19.6%),前瞻性队列研究5项(8.9%)。研究在不同地区进行:欧洲30项(53%)研究,北美13项(23%)研究,亚洲7项(13%)研究,其余在大洋洲、南美洲和非洲进行。诊断为MDS的患者中位年龄为62岁至79岁。12个国家的16项研究报告了成人ASIR,从韩国的0.98到阿根廷的11.5 / 100000不等(图S2a)。男性的ASIR(1.0-7.9 / 100000)高于女性(0.5-6.2 / 100000)(图S2b)。四项研究显示ASIR随年龄增长而上升,从50岁以下患者的0.3-0.9 / 10万增加到80岁以上患者的28.5-84.4 / 10万(图S2c)。法国和英国报告的儿科患者asir分别为0.03 / 100,000和0.04 / 100,000。粗略患病率从登记数据中的每10万人6.2-12.0人到索赔数据中的每10万人155人不等(表S2)。1年生存率为66%-69%,3年和5年生存率分别为42% - 45%和27% - 46%。3年和5年生存率均随年龄增长而下降。年龄标准化死亡率为1.14-1.63 / 10万,粗死亡率从极低风险MDS的0.15到极高风险疾病的6.44。在报告HRQoL结果的46项研究中(表S3), 22项(48%)研究在欧洲和北美进行,9项(20%)研究在多个国家或地区进行,2项研究在土耳其和以色列进行。22项研究(48%)在没有风险分层的MDS患者中进行,19项(41%)研究专门针对低至中度MDS风险患者,5项(11%)研究针对高风险MDS患者。 24项研究(52%)关注MDS患者的HRQoL, 22项研究(48%)专门评估治疗相关的HRQoL。最常用的癌症特异性HRQoL工具是欧洲癌症研究和治疗组织生活质量问卷- core 30 (EORTC QLQ-C30)和癌症治疗-贫血功能评估(FACT-AN),共占纳入研究的55%(图S3)。另一种常用的通用HRQoL仪器是EQ-5D-3L,在19%的纳入研究中使用。13项研究报告了EORTC QLQ-C30的整体评分。如图S4a所示,基线时的总体评分范围为50.6 - 66.7。对于低风险MDS患者,评分范围为52.2 - 62.3,对于高风险MDS患者,评分范围为50.6 - 59.7。有11项研究报告了EORTC QLQ-C30维度得分(图S4b)。身体功能和角色功能被认为是受影响最大的领域,认知功能和社会功能受到的影响最小。疲劳、疼痛和食欲减退是最严重的症状,恶心/呕吐是最不严重的。在五项独特的研究中评估了与MDS相关的FACT-AN总分(图S5)。FACT-AN总分在111 - 132之间,表明MDS患者的生活质量处于中等水平。在六项研究中测量了健康效用,在所有六项研究中使用EQ-5D-3L,在一项研究中使用EORTC生活质量效用核心10维度(qu - c10d)(图S6)[1-7]。EQ-5D-3L测量的健康效用值范围为0.73 ~ 0.83,EORTC qu - c10d测量的健康效用值范围为0.73 ~ 0.77。本研究总结了MDS的流行病学和人文负担的证据格局。随着世界各国人口平均年龄的不断上升,MDS的负担预计会增加。MDS不仅死亡率高,还会对患者的人文负担产生重大影响,这受到疾病和治疗特异性因素的影响。然而,证据基础,特别是与流行病学有关的证据基础是异质性的,并且缺乏评估人文负担的疾病特异性措施。确定MDS的发病率和患病率对于了解疾病负担和优化医疗资源配置至关重要[10]。我们发现ASIR在每10万人中0.98到11.5之间,老年人和男性的比例更高。在先前使用监测、流行病学和最终结果数据(SEER)数据的综述中,Zeidan等人类似地报告了ASIR随年龄和男性优势而上升(男性为4.7-7.7 / 10万,女性为2.5-3.4 / 10万)。Zeidan等人还报道,与其他种族相比,亚洲人的ASIR最低,尤其是白人。然而,按种族分层和来自非西方国家的数据仍然很少。这是一个重要的差距,因为种族化群体在MDS的药物试验中仍然代表性不足。总体而言,我们的研究结果与之前的一项综述一致,该综述报道了MDS的全球粗患病率,在年龄组、性别和种族中为0.22至13.2 / 100,000。研究设计和人群特征的变化使得这些估计在具有挑战性的研究中进行比较。MDS的生存率仍然很低,5年生存率从澳大利亚的27%到韩国的46%不等,因患者特征(如年龄、性别)和疾病风险(如IPSS-R分类)而异。EORTC QLQ-C30和FACT-AN是MDS患者最常用的HRQoL测量仪器[12,13]。这些完善的、经过验证的癌症患者仪器有时与一般的HRQoL测量方法(如EQ-5D)一起使用。鉴于疾病和治疗对患者HRQoL的相关影响,mds特异性仪器可能更适合于捕捉该患者群体所经历的人文负担。正在努力开发这样的工具,包括MDS/AML的MD安德森症状量表(MDASI),骨髓增生异常生活质量量表(QUALMS)和29项心理测量生活质量E
{"title":"Burden of Myelodysplastic Syndromes: A Literature Review of Epidemiological and Humanistic Aspects","authors":"Shitong Xie,&nbsp;Jiajun Yan,&nbsp;Preston Tse,&nbsp;Brittany Humphries,&nbsp;Feng Xie","doi":"10.1111/jebm.70065","DOIUrl":"10.1111/jebm.70065","url":null,"abstract":"&lt;p&gt;Myelodysplastic syndromes (MDS) are a group of clonal neoplastic bone marrow disorders characterized by ineffective hematopoiesis and cytopenia [&lt;span&gt;1&lt;/span&gt;]. Given the increasing incidence of MDS in aging populations and its chronic disease burden, understanding its epidemiological trends and humanistic burden is crucial. Despite several reviews that have examined epidemiological or humanistic burden of MD, existing literature is often limited in scope, focusing on specific subgroups, for example, higher-risk MDS, or single geographic regions. Furthermore, heterogeneous study methodologies have led to inconsistent estimates of MDS incidence, prevalence, and mortality, complicating comparisons across populations. The aim of this study was to conduct a scoping literature review to summarize both the epidemiological and humanistic burden of MDS.&lt;/p&gt;&lt;p&gt;A systematic search was conducted to understand the epidemiological, humanistic, and economic burden of MDS. The search was undertaken from inception to July 13, 2021 in Embase and Medline. A detailed search strategy is presented in Table S1. Bibliographic searches of systematic reviews identified during the screening were conducted to validate the search strategy and identify any additional publications that were not retrieved through the original search. Studies were included if they (1) included patients with MDS and assessed epidemiologic (i.e., prevalence, incidence or mortality) or humanistic (i.e., patient-reported outcome using either a generic or disease-specific instrument) outcomes; (2) had a sample of more than 100, and (3) were published in English. Systematic reviews were excluded, but if relevant, their reference lists were cross-referenced. Study selection involved two phases: an initial screening based on title and abstract followed by a full-text review. Both phases of screening were performed by two reviewers independently and in duplicate. Any discrepancies were solved by discussion between the two reviewers or consulting a third reviewer when necessary. A study mapping process was conducted to match publications reporting on the same study. For included studies, information on study and patient characteristics, and outcomes of interest were extracted by two reviewers independently using a standardized data extraction form. Discrepancies were reconciled by discussion between the two reviewers or in consultation with a third reviewer when necessary.&lt;/p&gt;&lt;p&gt;The study employed descriptive analyses to summarize the epidemiological and HRQoL outcomes in patients with MDS. The descriptive statistics included age-standardized incidence rate (ASIR), prevalence, and survival rates. Data were organized in Excel and visualized with R using distribution plots and trend graphs.&lt;/p&gt;&lt;p&gt;A total of 63 papers (56 studies) reported on the epidemiological burden of MDS and 57 papers (46 studies) on the HRQoL burden of MDS. The selection process is summarized in Figure S1. Details on studies repor","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jebm.70065","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145029841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Examining and Interpreting Doi Plot Asymmetry in Meta-Analyses of Randomized Controlled Trials 随机对照试验meta分析中Doi图不对称的检验与解释。
IF 3.5 2区 医学 Q1 MEDICINE, GENERAL & INTERNAL Pub Date : 2025-08-29 DOI: 10.1111/jebm.70063
Luis Furuya-Kanamori, Xanthoula Rousou, Polychronis Kostoulas, Suhail A. R. Doi

Systematic reviews and meta-analyses are considered the highest level of evidence, but their reliability can be undermined by publication bias. Traditional methods for assessing publication bias, such as funnel plots and p-value-based tests (e.g., Egger test), have notable limitations, including reliance on subjective interpretation and dependence on the number of studies included in a meta-analysis (k). The Doi plot and LFK index offer promising alternatives, providing improved visualization and quantification of plot asymmetry. This study revisits the application of the Doi plot and LFK index for detecting publication bias, addresses recent criticisms, and evaluates their performance compared to p-value-based methods using simulation study. Simulations included scenarios with varying study numbers (k = 5, 10, 20, 50), study sample sizes (small, large), and simulated bias level (ρ = 0, –0.3, –0.5, –0.9) generated using the Copas selection model. Diagnostic performance metrics (i.e., sensitivity and specificity) were estimated and compared for the LFK index and Egger test. The LFK index exhibited consistent higher sensitivity across varying k and simulated bias levels. In contrast, the Egger test was highly dependent on k, with sensitivity declining sharply in small meta-analyses (k < 20). Specificity of the LFK index adjusted with random error, while Egger test specificity remained fixed at ∼90%. The Doi plot and LFK index effectively address the limitations of traditional methods, offering robust k-independent performance and more reliable detection of publication bias. These findings support a transition to the Doi plot and LFK index for publication bias assessment in meta-analyses.

系统评价和荟萃分析被认为是最高级别的证据,但它们的可靠性可能会受到发表偏倚的影响。评估发表偏倚的传统方法,如漏斗图和基于p值的检验(如Egger检验),有明显的局限性,包括依赖于主观解释和依赖于meta分析中纳入的研究数量(k)。Doi图和LFK指数提供了有希望的替代方案,提供了更好的可视化和量化图的不对称性。本研究回顾了Doi图和LFK指数在检测发表偏倚方面的应用,解决了最近的批评,并通过模拟研究评估了它们与基于p值的方法相比的性能。模拟包括使用Copas选择模型生成的不同研究数量(k = 5、10、20、50)、研究样本量(小、大)和模拟偏差水平(ρ = 0、-0.3、-0.5、-0.9)的场景。估计并比较LFK指数和Egger检验的诊断性能指标(即敏感性和特异性)。LFK指数在不同的k和模拟偏差水平上表现出一致的更高灵敏度。相反,Egger检验高度依赖于k,在小型荟萃分析中灵敏度急剧下降(k < 20)。LFK指标的特异性随随机误差进行调整,而Egger试验特异性保持固定在~ 90%。Doi图和LFK指数有效地解决了传统方法的局限性,提供了稳健的k无关性能和更可靠的发表偏倚检测。这些发现支持在荟萃分析中过渡到Doi图和LFK指数进行发表偏倚评估。
{"title":"Examining and Interpreting Doi Plot Asymmetry in Meta-Analyses of Randomized Controlled Trials","authors":"Luis Furuya-Kanamori,&nbsp;Xanthoula Rousou,&nbsp;Polychronis Kostoulas,&nbsp;Suhail A. R. Doi","doi":"10.1111/jebm.70063","DOIUrl":"10.1111/jebm.70063","url":null,"abstract":"<div>\u0000 \u0000 \u0000 <section>\u0000 \u0000 <p>Systematic reviews and meta-analyses are considered the highest level of evidence, but their reliability can be undermined by publication bias. Traditional methods for assessing publication bias, such as funnel plots and <i>p</i>-value-based tests (e.g., Egger test), have notable limitations, including reliance on subjective interpretation and dependence on the number of studies included in a meta-analysis (<i>k</i>). The Doi plot and LFK index offer promising alternatives, providing improved visualization and quantification of plot asymmetry. This study revisits the application of the Doi plot and LFK index for detecting publication bias, addresses recent criticisms, and evaluates their performance compared to <i>p</i>-value-based methods using simulation study. Simulations included scenarios with varying study numbers (<i>k</i> = 5, 10, 20, 50), study sample sizes (small, large), and simulated bias level (<i>ρ</i> = 0, –0.3, –0.5, –0.9) generated using the Copas selection model. Diagnostic performance metrics (i.e., sensitivity and specificity) were estimated and compared for the LFK index and Egger test. The LFK index exhibited consistent higher sensitivity across varying <i>k</i> and simulated bias levels. In contrast, the Egger test was highly dependent on <i>k</i>, with sensitivity declining sharply in small meta-analyses (<i>k</i> &lt; 20). Specificity of the LFK index adjusted with random error, while Egger test specificity remained fixed at ∼90%. The Doi plot and LFK index effectively address the limitations of traditional methods, offering robust <i>k</i>-independent performance and more reliable detection of publication bias. These findings support a transition to the Doi plot and LFK index for publication bias assessment in meta-analyses.</p>\u0000 </section>\u0000 </div>","PeriodicalId":16090,"journal":{"name":"Journal of Evidence‐Based Medicine","volume":"18 3","pages":""},"PeriodicalIF":3.5,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/jebm.70063","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144957108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Evidence‐Based Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1