首页 > 最新文献

PLOS digital health最新文献

英文 中文
Meeting people where they are: Crowdsourcing goal-specific personalized wellness practices. 满足人们的需求:众包特定目标的个性化健康实践。
Pub Date : 2024-11-19 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000650
Johanna E Hidalgo, Julia Kim, Jordan Llorin, Kathryn Stanton, Josh Cherian, Laura Bloomfield, Mikaela Fudolig, Matthew Price, Jennifer Ha, Natalie Noble, Christopher M Danforth, Peter Sheridan Dodds, Jason Fanning, Ryan S McGinnis, Ellen W McGinnis

Objectives: Despite the development of efficacious wellness interventions, sustainable wellness behavior change remains challenging. To optimize engagement, initiating small behaviors that build upon existing practices congruent with individuals' lifestyles may promote sustainable wellness behavior change. In this study, we crowd-sourced helpful, flexible, and engaging wellness practices to identify a list of those commonly used for improving sleep, productivity, and physical, emotional, and social wellness from participants who felt they had been successful in these dimensions.

Method: We recruited a representative sample of 992 U.S. residents to survey the wellness dimensions in which they had achieved success and their specific wellness practices.

Results: Responses were aggregated across demographic, health, lifestyle factors, and wellness dimension. Exploration of these data revealed that there was little overlap in preferred practices across wellness dimensions. Within wellness dimensions, preferred practices were similar across demographic factors, especially within the top 3-4 most selected practices. Interestingly, daily wellness practices differ from those typically recommended as efficacious by research studies and seem to be impacted by health status (e.g., depression, cardiovascular disease). Additionally, we developed and provide for public use a web dashboard that visualizes and enables exploration of the study results.

Conclusions: Findings identify personalized, sustainable wellness practices targeted at specific wellness dimensions. Future studies could leverage tailored practices as recommendations for optimizing the development of healthier behaviors.

目标:尽管开发了有效的健康干预措施,但可持续的健康行为改变仍然具有挑战性。为了优化参与度,在符合个人生活方式的现有实践基础上发起一些小行为可能会促进可持续的健康行为改变。在这项研究中,我们从认为自己在这些方面取得了成功的参与者那里收集了一些有用、灵活、有吸引力的保健实践,以确定一份常用于改善睡眠、提高工作效率、改善身体、情感和社交健康的实践清单:我们招募了具有代表性的992名美国居民样本,调查他们在哪些健康方面取得了成功以及他们的具体健康实践:结果:我们根据人口统计学、健康、生活方式因素和健康维度对回答进行了汇总。对这些数据的分析表明,不同健康维度的首选做法几乎没有重叠。在健康维度中,不同人口统计因素的首选做法相似,尤其是在选择最多的前 3-4 种做法中。有趣的是,日常保健实践与研究报告通常推荐的具有疗效的实践不同,而且似乎会受到健康状况(如抑郁症、心血管疾病)的影响。此外,我们还开发并提供了一个网络仪表板供公众使用,该仪表板可将研究结果可视化并进行探索:研究结果确定了针对特定健康维度的个性化、可持续的健康实践。未来的研究可以利用量身定制的实践作为优化健康行为发展的建议。
{"title":"Meeting people where they are: Crowdsourcing goal-specific personalized wellness practices.","authors":"Johanna E Hidalgo, Julia Kim, Jordan Llorin, Kathryn Stanton, Josh Cherian, Laura Bloomfield, Mikaela Fudolig, Matthew Price, Jennifer Ha, Natalie Noble, Christopher M Danforth, Peter Sheridan Dodds, Jason Fanning, Ryan S McGinnis, Ellen W McGinnis","doi":"10.1371/journal.pdig.0000650","DOIUrl":"10.1371/journal.pdig.0000650","url":null,"abstract":"<p><strong>Objectives: </strong>Despite the development of efficacious wellness interventions, sustainable wellness behavior change remains challenging. To optimize engagement, initiating small behaviors that build upon existing practices congruent with individuals' lifestyles may promote sustainable wellness behavior change. In this study, we crowd-sourced helpful, flexible, and engaging wellness practices to identify a list of those commonly used for improving sleep, productivity, and physical, emotional, and social wellness from participants who felt they had been successful in these dimensions.</p><p><strong>Method: </strong>We recruited a representative sample of 992 U.S. residents to survey the wellness dimensions in which they had achieved success and their specific wellness practices.</p><p><strong>Results: </strong>Responses were aggregated across demographic, health, lifestyle factors, and wellness dimension. Exploration of these data revealed that there was little overlap in preferred practices across wellness dimensions. Within wellness dimensions, preferred practices were similar across demographic factors, especially within the top 3-4 most selected practices. Interestingly, daily wellness practices differ from those typically recommended as efficacious by research studies and seem to be impacted by health status (e.g., depression, cardiovascular disease). Additionally, we developed and provide for public use a web dashboard that visualizes and enables exploration of the study results.</p><p><strong>Conclusions: </strong>Findings identify personalized, sustainable wellness practices targeted at specific wellness dimensions. Future studies could leverage tailored practices as recommendations for optimizing the development of healthier behaviors.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000650"},"PeriodicalIF":0.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11575832/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142677517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large language models in medicine: A review of current clinical trials across healthcare applications. 医学中的大型语言模型:当前医疗应用领域临床试验回顾。
Pub Date : 2024-11-19 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000662
Mahmud Omar, Girish N Nadkarni, Eyal Klang, Benjamin S Glicksberg

This review analyzes current clinical trials investigating large language models' (LLMs) applications in healthcare. We identified 27 trials (5 published and 22 ongoing) across 4 main clinical applications: patient care, data handling, decision support, and research assistance. Our analysis reveals diverse LLM uses, from clinical documentation to medical decision-making. Published trials show promise but highlight accuracy concerns. Ongoing studies explore novel applications like patient education and informed consent. Most trials occur in the United States of America and China. We discuss the challenges of evaluating rapidly evolving LLMs through clinical trials and identify gaps in current research. This review aims to inform future studies and guide the integration of LLMs into clinical practice.

本综述分析了目前研究大型语言模型(LLMs)在医疗保健领域应用的临床试验。我们确定了 27 项试验(5 项已发表,22 项正在进行中),涉及 4 个主要临床应用领域:患者护理、数据处理、决策支持和研究辅助。我们的分析揭示了 LLM 的多种用途,从临床文档到医疗决策。已发表的试验显示了前景,但也凸显了准确性方面的问题。正在进行的研究探索了病人教育和知情同意等新型应用。大多数试验发生在美国和中国。我们讨论了通过临床试验评估快速发展的 LLM 所面临的挑战,并找出了当前研究中存在的差距。本综述旨在为未来的研究提供信息,并指导将 LLMs 纳入临床实践。
{"title":"Large language models in medicine: A review of current clinical trials across healthcare applications.","authors":"Mahmud Omar, Girish N Nadkarni, Eyal Klang, Benjamin S Glicksberg","doi":"10.1371/journal.pdig.0000662","DOIUrl":"10.1371/journal.pdig.0000662","url":null,"abstract":"<p><p>This review analyzes current clinical trials investigating large language models' (LLMs) applications in healthcare. We identified 27 trials (5 published and 22 ongoing) across 4 main clinical applications: patient care, data handling, decision support, and research assistance. Our analysis reveals diverse LLM uses, from clinical documentation to medical decision-making. Published trials show promise but highlight accuracy concerns. Ongoing studies explore novel applications like patient education and informed consent. Most trials occur in the United States of America and China. We discuss the challenges of evaluating rapidly evolving LLMs through clinical trials and identify gaps in current research. This review aims to inform future studies and guide the integration of LLMs into clinical practice.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000662"},"PeriodicalIF":0.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11575759/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142677480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A transparent and standardized performance measurement platform is needed for on-prescription digital health apps to enable ongoing performance monitoring. 处方数字医疗应用程序需要一个透明、标准化的性能测量平台,以实现持续的性能监测。
Pub Date : 2024-11-15 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000656
Cindy Welzel, Stefanie Brückner, Celia Brightwell, Matthew Fenech, Stephen Gilbert
{"title":"A transparent and standardized performance measurement platform is needed for on-prescription digital health apps to enable ongoing performance monitoring.","authors":"Cindy Welzel, Stefanie Brückner, Celia Brightwell, Matthew Fenech, Stephen Gilbert","doi":"10.1371/journal.pdig.0000656","DOIUrl":"10.1371/journal.pdig.0000656","url":null,"abstract":"","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000656"},"PeriodicalIF":0.0,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11567564/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142640227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging explainable artificial intelligence for early prediction of bloodstream infections using historical electronic health records. 利用可解释人工智能,利用历史电子健康记录及早预测血流感染。
Pub Date : 2024-11-14 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000506
Rajeev Bopche, Lise Tuset Gustad, Jan Egil Afset, Birgitta Ehrnström, Jan Kristian Damås, Øystein Nytrø

Bloodstream infections (BSIs) are a severe public health threat due to their rapid progression into critical conditions like sepsis. This study presents a novel eXplainable Artificial Intelligence (XAI) framework to predict BSIs using historical electronic health records (EHRs). Leveraging a dataset from St. Olavs Hospital in Trondheim, Norway, encompassing 35,591 patients, the framework integrates demographic, laboratory, and comprehensive medical history data to classify patients into high-risk and low-risk BSI groups. By avoiding reliance on real-time clinical data, our model allows for enhanced scalability across various healthcare settings, including resource-limited environments. The XAI framework significantly outperformed traditional models, particularly with tree-based algorithms, demonstrating superior specificity and sensitivity in BSI prediction. This approach promises to optimize resource allocation and potentially reduce healthcare costs while providing interpretability for clinical decision-making, making it a valuable tool in hospital systems for early intervention and improved patient outcomes.

血流感染(BSI)是一种严重的公共卫生威胁,因为它们会迅速发展为败血症等危重症。本研究提出了一种新颖的可解释人工智能(XAI)框架,利用历史电子健康记录(EHR)预测 BSI。该框架利用挪威特隆赫姆圣奥拉夫斯医院(St. Olavs Hospital)包含 35,591 名患者的数据集,整合了人口统计学、实验室和综合病史数据,将患者分为高风险和低风险 BSI 组别。通过避免对实时临床数据的依赖,我们的模型提高了在各种医疗环境(包括资源有限的环境)中的可扩展性。XAI 框架的性能明显优于传统模型,尤其是基于树的算法,在 BSI 预测方面表现出卓越的特异性和灵敏度。这种方法有望优化资源分配,降低医疗成本,同时为临床决策提供可解释性,使其成为医院系统早期干预和改善患者预后的重要工具。
{"title":"Leveraging explainable artificial intelligence for early prediction of bloodstream infections using historical electronic health records.","authors":"Rajeev Bopche, Lise Tuset Gustad, Jan Egil Afset, Birgitta Ehrnström, Jan Kristian Damås, Øystein Nytrø","doi":"10.1371/journal.pdig.0000506","DOIUrl":"10.1371/journal.pdig.0000506","url":null,"abstract":"<p><p>Bloodstream infections (BSIs) are a severe public health threat due to their rapid progression into critical conditions like sepsis. This study presents a novel eXplainable Artificial Intelligence (XAI) framework to predict BSIs using historical electronic health records (EHRs). Leveraging a dataset from St. Olavs Hospital in Trondheim, Norway, encompassing 35,591 patients, the framework integrates demographic, laboratory, and comprehensive medical history data to classify patients into high-risk and low-risk BSI groups. By avoiding reliance on real-time clinical data, our model allows for enhanced scalability across various healthcare settings, including resource-limited environments. The XAI framework significantly outperformed traditional models, particularly with tree-based algorithms, demonstrating superior specificity and sensitivity in BSI prediction. This approach promises to optimize resource allocation and potentially reduce healthcare costs while providing interpretability for clinical decision-making, making it a valuable tool in hospital systems for early intervention and improved patient outcomes.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000506"},"PeriodicalIF":0.0,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11563427/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142634322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How can digital citizen science approaches improve ethical smartphone use surveillance among youth: Traditional surveys versus ecological momentary assessments. 数字公民科学方法如何改进对青少年使用智能手机的道德监督:传统调查与生态瞬间评估。
Pub Date : 2024-11-11 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000448
Sarah Al-Akshar, Sheriff Tolulope Ibrahim, Tarun Reddy Katapally

Ubiquitous use of smartphones among youth poses significant challenges related to non-communicable diseases, including poor mental health. Although traditional survey measures can be used to assess smartphone use among youth, they are subject to recall bias. This study aims to compare self-reported smartphone use via retrospective modified traditional recall survey and prospective Ecological Momentary Assessments (EMAs) among youth. This study uses data from the Smart Platform, which engages with youth as citizen scientists. Youth (N = 77) aged 13-21 years in two urban jurisdictions in Canada (Regina and Saskatoon) engaged with our research team using a custom-built application via their own smartphones to report on a range of behaviours and outcomes on eight consecutive days. Youth reported smartphone use utilizing a traditional validated measure, which was modified to capture retrospective smartphone use on both weekdays and weekend days. In addition, daily EMAs were also time-triggered over a period of eight days to capture prospective smartphone use. Demographic, behavioural, and contextual factors were also collected. Data analyses included t-test and linear regression using Python statistical software. There was a significant difference between weekdays, weekends and overall smartphone use reported retrospectively and prospectively (p-value = <0.001), with youth reporting less smartphone use via EMAs. Overall retrospective smartphone use was significantly associated with not having a part-time job (β = 139.64, 95% confidence interval [CI] = 34.759, 244.519, p-value = 0.010) and having more than two friends who are physically active (β = -114.72, 95%[CI] = -208.872, -20.569, p-value = 0.018). However, prospective smartphone use reported via EMAs was not associated with any behavioural and contextual factors. The findings of this study have implications for appropriately understanding and monitoring smartphone use in the digital age among youth. EMAs can potentially minimize recall bias of smartphone use among youth, and other behaviours such as physical activity. More importantly, digital citizen science approaches that engage large populations of youth using their own smartphones can transform how we ethically monitor and mitigate the impact of excessive smartphone use.

青少年普遍使用智能手机,这给非传染性疾病(包括不良心理健康)带来了重大挑战。虽然传统的调查方法可用于评估青少年使用智能手机的情况,但它们会受到回忆偏差的影响。本研究旨在比较青少年通过回顾性改良传统回忆调查和前瞻性生态瞬间评估(EMA)自我报告的智能手机使用情况。本研究使用了智能平台(Smart Platform)的数据,该平台让青少年作为公民科学家参与其中。加拿大两个城市辖区(里贾纳和萨斯卡通)13-21 岁的青少年(77 人)通过自己的智能手机与我们的研究团队一起使用定制的应用程序,连续八天报告一系列行为和结果。青少年使用传统的有效测量方法报告智能手机的使用情况,该方法经过修改,可以捕捉平日和周末智能手机使用情况的回顾。此外,每天的 EMA 也会在八天内进行时间触发,以捕捉前瞻性的智能手机使用情况。此外,还收集了人口、行为和环境因素。数据分析包括使用 Python 统计软件进行 t 检验和线性回归。回顾性和前瞻性报告的工作日、周末和智能手机总体使用情况之间存在明显差异(p 值 = 0.05)。
{"title":"How can digital citizen science approaches improve ethical smartphone use surveillance among youth: Traditional surveys versus ecological momentary assessments.","authors":"Sarah Al-Akshar, Sheriff Tolulope Ibrahim, Tarun Reddy Katapally","doi":"10.1371/journal.pdig.0000448","DOIUrl":"10.1371/journal.pdig.0000448","url":null,"abstract":"<p><p>Ubiquitous use of smartphones among youth poses significant challenges related to non-communicable diseases, including poor mental health. Although traditional survey measures can be used to assess smartphone use among youth, they are subject to recall bias. This study aims to compare self-reported smartphone use via retrospective modified traditional recall survey and prospective Ecological Momentary Assessments (EMAs) among youth. This study uses data from the Smart Platform, which engages with youth as citizen scientists. Youth (N = 77) aged 13-21 years in two urban jurisdictions in Canada (Regina and Saskatoon) engaged with our research team using a custom-built application via their own smartphones to report on a range of behaviours and outcomes on eight consecutive days. Youth reported smartphone use utilizing a traditional validated measure, which was modified to capture retrospective smartphone use on both weekdays and weekend days. In addition, daily EMAs were also time-triggered over a period of eight days to capture prospective smartphone use. Demographic, behavioural, and contextual factors were also collected. Data analyses included t-test and linear regression using Python statistical software. There was a significant difference between weekdays, weekends and overall smartphone use reported retrospectively and prospectively (p-value = <0.001), with youth reporting less smartphone use via EMAs. Overall retrospective smartphone use was significantly associated with not having a part-time job (β = 139.64, 95% confidence interval [CI] = 34.759, 244.519, p-value = 0.010) and having more than two friends who are physically active (β = -114.72, 95%[CI] = -208.872, -20.569, p-value = 0.018). However, prospective smartphone use reported via EMAs was not associated with any behavioural and contextual factors. The findings of this study have implications for appropriately understanding and monitoring smartphone use in the digital age among youth. EMAs can potentially minimize recall bias of smartphone use among youth, and other behaviours such as physical activity. More importantly, digital citizen science approaches that engage large populations of youth using their own smartphones can transform how we ethically monitor and mitigate the impact of excessive smartphone use.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000448"},"PeriodicalIF":0.0,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11554190/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142634320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New colleague or gimmick hurdle? A user-centric scoping review of the barriers and facilitators of robots in hospitals. 新同事还是噱头障碍?以用户为中心,对医院使用机器人的障碍和促进因素进行范围界定。
Pub Date : 2024-11-11 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000660
Mathias Kofoed Rasmussen, Anna Schneider-Kamp, Tobias Hyrup, Alessandro Godono

Healthcare systems are confronted with a multitude of challenges, including the imperative to enhance accessibility, efficiency, cost-effectiveness, and the quality of healthcare delivery. These challenges are exacerbated by current healthcare personnel shortages, prospects of future shortfalls, insufficient recruitment efforts, increasing prevalence of chronic diseases, global viral concerns, and ageing populations. To address this escalating demand for healthcare services, healthcare systems are increasingly adopting robotic technology and artificial intelligence (AI), which promise to optimise costs, improve working conditions, and increase the quality of care. This article focuses on deepening our understanding of the barriers and facilitators associated with integrating robotic technologies in hospital environments. To this end, we conducted a scoping literature review to consolidate emerging themes pertaining to the experiences, viewpoints perspectives, and behaviours of hospital employees as professional users of robots in hospitals. Through screening 501 original research articles from Web-of-Science, we identified and reviewed in full-text 40 pertinent user-centric studies of the integration of robots into hospitals. Our review revealed and analysed 14 themes in-depth, of which we identified seven as barriers and seven as facilitators. Through a structuring of the barriers and facilitators, we reveal a notable misalignment between these barriers and facilitators: Finding that organisational aspects are at the core of most barriers, we suggest that future research should investigate the dynamics between hospital employees as professional users and the procedures and workflows of the hospitals as institutions, as well as the ambivalent role of anthropomorphisation of hospital robots, and emerging issues of privacy and confidentiality raised by increasingly communicative robots. Ultimately, this perspective on the integration of robots in hospitals transcends debates on the capabilities and limits of the robotic technology itself, shedding light on the complexity of integrating new technologies into hospital environments and contributing to an understanding of possible futures in healthcare innovation.

医疗保健系统面临着诸多挑战,包括必须提高医疗保健服务的可及性、效率、成本效益和质量。目前的医疗保健人员短缺、未来可能出现的人员短缺、招聘力度不够、慢性病发病率不断上升、全球病毒问题以及人口老龄化等问题加剧了这些挑战。为了应对不断增长的医疗服务需求,医疗系统正越来越多地采用机器人技术和人工智能(AI),它们有望优化成本、改善工作条件并提高医疗质量。本文的重点是加深我们对在医院环境中整合机器人技术的相关障碍和促进因素的理解。为此,我们进行了一次范围广泛的文献综述,以整合与医院员工作为医院机器人专业用户的经验、观点和行为有关的新兴主题。通过筛选 Web-of-Science 上的 501 篇原创研究文章,我们确定并全文审阅了 40 篇以用户为中心的有关将机器人集成到医院中的相关研究。我们的综述揭示并深入分析了 14 个主题,其中 7 个是障碍,7 个是促进因素。通过对障碍和促进因素的结构化分析,我们发现这些障碍和促进因素之间存在明显的错位:我们发现组织方面是大多数障碍的核心,因此建议未来的研究应调查作为专业用户的医院员工与作为机构的医院的程序和工作流程之间的动态关系,以及医院机器人拟人化的矛盾作用,以及交流性越来越强的机器人所引发的新出现的隐私和保密问题。归根结底,关于医院机器人整合的这一视角超越了对机器人技术本身的能力和局限性的争论,揭示了将新技术融入医院环境的复杂性,有助于理解医疗保健创新的可能未来。
{"title":"New colleague or gimmick hurdle? A user-centric scoping review of the barriers and facilitators of robots in hospitals.","authors":"Mathias Kofoed Rasmussen, Anna Schneider-Kamp, Tobias Hyrup, Alessandro Godono","doi":"10.1371/journal.pdig.0000660","DOIUrl":"10.1371/journal.pdig.0000660","url":null,"abstract":"<p><p>Healthcare systems are confronted with a multitude of challenges, including the imperative to enhance accessibility, efficiency, cost-effectiveness, and the quality of healthcare delivery. These challenges are exacerbated by current healthcare personnel shortages, prospects of future shortfalls, insufficient recruitment efforts, increasing prevalence of chronic diseases, global viral concerns, and ageing populations. To address this escalating demand for healthcare services, healthcare systems are increasingly adopting robotic technology and artificial intelligence (AI), which promise to optimise costs, improve working conditions, and increase the quality of care. This article focuses on deepening our understanding of the barriers and facilitators associated with integrating robotic technologies in hospital environments. To this end, we conducted a scoping literature review to consolidate emerging themes pertaining to the experiences, viewpoints perspectives, and behaviours of hospital employees as professional users of robots in hospitals. Through screening 501 original research articles from Web-of-Science, we identified and reviewed in full-text 40 pertinent user-centric studies of the integration of robots into hospitals. Our review revealed and analysed 14 themes in-depth, of which we identified seven as barriers and seven as facilitators. Through a structuring of the barriers and facilitators, we reveal a notable misalignment between these barriers and facilitators: Finding that organisational aspects are at the core of most barriers, we suggest that future research should investigate the dynamics between hospital employees as professional users and the procedures and workflows of the hospitals as institutions, as well as the ambivalent role of anthropomorphisation of hospital robots, and emerging issues of privacy and confidentiality raised by increasingly communicative robots. Ultimately, this perspective on the integration of robots in hospitals transcends debates on the capabilities and limits of the robotic technology itself, shedding light on the complexity of integrating new technologies into hospital environments and contributing to an understanding of possible futures in healthcare innovation.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000660"},"PeriodicalIF":0.0,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11554139/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142634324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diagnostic accuracy of a smartphone-based device (VistaView) for detection of diabetic retinopathy: A prospective study. 基于智能手机的设备(VistaView)检测糖尿病视网膜病变的诊断准确性:前瞻性研究。
Pub Date : 2024-11-08 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000649
Rida Shahzad, Arshad Mehmood, Danish Shabbir, M A Rehman Siddiqui

Background: Diabetic retinopathy (DR) is a leading cause of blindness globally. The gold standard for DR screening is stereoscopic colour fundus photography with tabletop cameras. VistaView is a novel smartphone-based retinal camera which offers mydriatic retinal imaging. This study compares the diagnostic accuracy of the smartphone-based VistaView camera compared to a traditional desk mounted fundus camera (Triton Topcon). We also compare the agreement between graders for DR screening between VistaView images and Topcon images.

Methodology: This prospective study took place between December 2021 and June 2022 in Pakistan. Consecutive diabetic patients were imaged following mydriasis using both VistaView and Topcon cameras at the same sitting. All images were graded independently by two graders based on the International Classification of Diabetic Retinopathy (ICDR) criteria. Individual grades were assigned for severity of DR and maculopathy in each image. Diagnostic accuracy was calculated using the Topcon camera as the gold standard. Agreement between graders for each device was calculated as intraclass correlation coefficient (ICC) (95% CI) and Cohen's weighted kappa (k).

Principal findings: A total of 1428 images were available from 371 patients with both cameras. After excluding ungradable images, a total of 1231 images were graded. The sensitivity of VistaView for any DR was 69.9% (95% CI 62.2-76.6%) while the specificity was 92.9% (95% CI 89.9-95.1%), and PPV and NPV were 80.5% (95% CI 73-86.4%) and 88.1% (95% CI 84.5-90.9) respectively. The sensitivity of VistaView for RDR was 69.7% (95% CI 61.7-76.8%) while the specificity was 94.2% (95% CI 91.3-96.1%), and PPV and NPV were 81.5% (95% CI 73.6-87.6%) and 89.4% (95% CI 86-92%) respectively. The sensitivity for detecting maculopathy in VistaView was 71.2% (95% CI 62.8-78.4%), while the specificity was 86.4% (82.6-89.4%). The PPV and NPV of detecting maculopathy were 63% (95% CI 54.9-70.5%) and 90.1% (95% CI 86.8-92.9%) respectively. For VistaView, the ICC of DR grades was 78% (95% CI, 75-82%) between the two graders and that of maculopathy grades was 66% (95% CI, 59-71%). The Cohen's kappa for retinopathy grades of VistaView images was 0.61 (95% CI, 0.55-0.67, p<0.001), while that for maculopathy grades was 0.49 (95% CI 0.42-0.57, p<0.001). For images from the Topcon desktop camera, the ICC of DR grades was 85% (95% CI, 83-87%), while that of maculopathy grades was 79% (95% CI, 75-82%). The Cohen's kappa for retinopathy grades of Topcon images was 0.68 (95% CI, 0.63-0.74, p<0.001), while that for maculopathy grades was 0.65 (95% CI, 0.58-0.72, p<0.001).

Conclusion: The VistaView offers moderate diagnostic accuracy for DR screening and may be used as a screening tool in LMIC.

背景:糖尿病视网膜病变(DR糖尿病视网膜病变(DR)是全球致盲的主要原因。糖尿病视网膜病变筛查的黄金标准是使用台式照相机进行立体彩色眼底摄影。VistaView 是一种基于智能手机的新型视网膜相机,可提供眼底视网膜成像。本研究比较了基于智能手机的 VistaView 相机与传统台式眼底相机(Triton Topcon)的诊断准确性。我们还比较了 VistaView 图像和 Topcon 图像在 DR 筛查中分级人员之间的一致性:这项前瞻性研究于 2021 年 12 月至 2022 年 6 月在巴基斯坦进行。连续的糖尿病患者在同一坐姿下,使用 VistaView 和 Topcon 相机在瞳孔散大后进行成像。根据国际糖尿病视网膜病变分类(ICDR)标准,由两名分级人员对所有图像进行独立分级。根据每张图像中 DR 和黄斑病变的严重程度划分等级。诊断准确性以 Topcon 相机作为金标准进行计算。每种设备的分级者之间的一致性按类内相关系数(ICC)(95% CI)和科恩加权卡帕(k)计算:共有 371 名患者的 1428 张图像使用了这两种相机。排除无法分级的图像后,共有 1231 张图像进行了分级。VistaView 对任何 DR 的敏感性为 69.9%(95% CI 62.2-76.6%),特异性为 92.9%(95% CI 89.9-95.1%),PPV 和 NPV 分别为 80.5%(95% CI 73-86.4%)和 88.1%(95% CI 84.5-90.9)。VistaView 检测 RDR 的灵敏度为 69.7% (95% CI 61.7-76.8%),特异度为 94.2% (95% CI 91.3-96.1%),PPV 和 NPV 分别为 81.5% (95% CI 73.6-87.6%)和 89.4% (95% CI 86-92%)。VistaView 检测黄斑病变的灵敏度为 71.2% (95% CI 62.8-78.4%),特异度为 86.4% (82.6-89.4%)。检测黄斑病变的 PPV 和 NPV 分别为 63% (95% CI 54.9-70.5%) 和 90.1% (95% CI 86.8-92.9%)。在 VistaView 中,两个分级者之间 DR 分级的 ICC 为 78% (95% CI, 75-82%),黄斑病变分级的 ICC 为 66% (95% CI, 59-71%)。VistaView 图像视网膜病变等级的 Cohen's kappa 为 0.61 (95% CI, 0.55-0.67, pConclusion):VistaView对DR筛查的诊断准确性适中,可用作低收入国家的筛查工具。
{"title":"Diagnostic accuracy of a smartphone-based device (VistaView) for detection of diabetic retinopathy: A prospective study.","authors":"Rida Shahzad, Arshad Mehmood, Danish Shabbir, M A Rehman Siddiqui","doi":"10.1371/journal.pdig.0000649","DOIUrl":"10.1371/journal.pdig.0000649","url":null,"abstract":"<p><strong>Background: </strong>Diabetic retinopathy (DR) is a leading cause of blindness globally. The gold standard for DR screening is stereoscopic colour fundus photography with tabletop cameras. VistaView is a novel smartphone-based retinal camera which offers mydriatic retinal imaging. This study compares the diagnostic accuracy of the smartphone-based VistaView camera compared to a traditional desk mounted fundus camera (Triton Topcon). We also compare the agreement between graders for DR screening between VistaView images and Topcon images.</p><p><strong>Methodology: </strong>This prospective study took place between December 2021 and June 2022 in Pakistan. Consecutive diabetic patients were imaged following mydriasis using both VistaView and Topcon cameras at the same sitting. All images were graded independently by two graders based on the International Classification of Diabetic Retinopathy (ICDR) criteria. Individual grades were assigned for severity of DR and maculopathy in each image. Diagnostic accuracy was calculated using the Topcon camera as the gold standard. Agreement between graders for each device was calculated as intraclass correlation coefficient (ICC) (95% CI) and Cohen's weighted kappa (k).</p><p><strong>Principal findings: </strong>A total of 1428 images were available from 371 patients with both cameras. After excluding ungradable images, a total of 1231 images were graded. The sensitivity of VistaView for any DR was 69.9% (95% CI 62.2-76.6%) while the specificity was 92.9% (95% CI 89.9-95.1%), and PPV and NPV were 80.5% (95% CI 73-86.4%) and 88.1% (95% CI 84.5-90.9) respectively. The sensitivity of VistaView for RDR was 69.7% (95% CI 61.7-76.8%) while the specificity was 94.2% (95% CI 91.3-96.1%), and PPV and NPV were 81.5% (95% CI 73.6-87.6%) and 89.4% (95% CI 86-92%) respectively. The sensitivity for detecting maculopathy in VistaView was 71.2% (95% CI 62.8-78.4%), while the specificity was 86.4% (82.6-89.4%). The PPV and NPV of detecting maculopathy were 63% (95% CI 54.9-70.5%) and 90.1% (95% CI 86.8-92.9%) respectively. For VistaView, the ICC of DR grades was 78% (95% CI, 75-82%) between the two graders and that of maculopathy grades was 66% (95% CI, 59-71%). The Cohen's kappa for retinopathy grades of VistaView images was 0.61 (95% CI, 0.55-0.67, p<0.001), while that for maculopathy grades was 0.49 (95% CI 0.42-0.57, p<0.001). For images from the Topcon desktop camera, the ICC of DR grades was 85% (95% CI, 83-87%), while that of maculopathy grades was 79% (95% CI, 75-82%). The Cohen's kappa for retinopathy grades of Topcon images was 0.68 (95% CI, 0.63-0.74, p<0.001), while that for maculopathy grades was 0.65 (95% CI, 0.58-0.72, p<0.001).</p><p><strong>Conclusion: </strong>The VistaView offers moderate diagnostic accuracy for DR screening and may be used as a screening tool in LMIC.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000649"},"PeriodicalIF":0.0,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11548746/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142607220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Early identification of children with Attention-Deficit/Hyperactivity Disorder (ADHD). 早期识别患有注意力缺陷/多动症(ADHD)的儿童。
Pub Date : 2024-11-07 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000620
Yang S Liu, Fernanda Talarico, Dan Metes, Yipeng Song, Mengzhe Wang, Lawrence Kiyang, Dori Wearmouth, Shelly Vik, Yifeng Wei, Yanbo Zhang, Jake Hayward, Ghalib Ahmed, Ashley Gaskin, Russell Greiner, Andrew Greenshaw, Alex Alexander, Magdalena Janus, Bo Cao

Signs and symptoms of Attention-Deficit/Hyperactivity Disorder (ADHD) are present at preschool ages and often not identified for early intervention. We aimed to use machine learning to detect ADHD early among kindergarten-aged children using population-level administrative health data and a childhood developmental vulnerability surveillance tool: Early Development Instrument (EDI). The study cohort consists of 23,494 children born in Alberta, Canada, who attended kindergarten in 2016 without a diagnosis of ADHD. In a four-year follow-up period, 1,680 children were later identified with ADHD using case definition. We trained and tested machine learning models to predict ADHD prospectively. The best-performing model using administrative and EDI data could reliably predict ADHD and achieved an Area Under the Curve (AUC) of 0.811 during cross-validation. Key predictive factors included EDI subdomain scores, sex, and socioeconomic status. Our findings suggest that machine learning algorithms that use population-level surveillance data could be a valuable tool for early identification of ADHD.

注意力缺陷/多动障碍(ADHD)的体征和症状在学龄前就已出现,但往往无法识别,无法进行早期干预。我们的目标是利用人口一级的行政健康数据和儿童发育脆弱性监测工具,使用机器学习来早期检测幼儿园学龄儿童的多动症:早期发展工具(EDI)。研究队列由 23,494 名出生于加拿大艾伯塔省的儿童组成,这些儿童于 2016 年进入幼儿园,但未被诊断出患有多动症。在为期四年的随访中,有 1680 名儿童后来通过病例定义被确定患有多动症。我们对机器学习模型进行了训练和测试,以便对多动症进行前瞻性预测。使用管理数据和 EDI 数据的最佳模型可以可靠地预测多动症,在交叉验证中的曲线下面积 (AUC) 达到了 0.811。主要预测因素包括 EDI 子域得分、性别和社会经济地位。我们的研究结果表明,使用人群监测数据的机器学习算法可以成为早期识别多动症的重要工具。
{"title":"Early identification of children with Attention-Deficit/Hyperactivity Disorder (ADHD).","authors":"Yang S Liu, Fernanda Talarico, Dan Metes, Yipeng Song, Mengzhe Wang, Lawrence Kiyang, Dori Wearmouth, Shelly Vik, Yifeng Wei, Yanbo Zhang, Jake Hayward, Ghalib Ahmed, Ashley Gaskin, Russell Greiner, Andrew Greenshaw, Alex Alexander, Magdalena Janus, Bo Cao","doi":"10.1371/journal.pdig.0000620","DOIUrl":"10.1371/journal.pdig.0000620","url":null,"abstract":"<p><p>Signs and symptoms of Attention-Deficit/Hyperactivity Disorder (ADHD) are present at preschool ages and often not identified for early intervention. We aimed to use machine learning to detect ADHD early among kindergarten-aged children using population-level administrative health data and a childhood developmental vulnerability surveillance tool: Early Development Instrument (EDI). The study cohort consists of 23,494 children born in Alberta, Canada, who attended kindergarten in 2016 without a diagnosis of ADHD. In a four-year follow-up period, 1,680 children were later identified with ADHD using case definition. We trained and tested machine learning models to predict ADHD prospectively. The best-performing model using administrative and EDI data could reliably predict ADHD and achieved an Area Under the Curve (AUC) of 0.811 during cross-validation. Key predictive factors included EDI subdomain scores, sex, and socioeconomic status. Our findings suggest that machine learning algorithms that use population-level surveillance data could be a valuable tool for early identification of ADHD.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000620"},"PeriodicalIF":0.0,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11542831/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142607297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining OpenStreetMap mapping and route optimization algorithms to inform the delivery of community health interventions at the last mile. 结合 OpenStreetMap 地图绘制和路线优化算法,为最后一英里的社区卫生干预提供信息。
Pub Date : 2024-11-07 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000621
Mauricianot Randriamihaja, Felana Angella Ihantamalala, Feno H Rafenoarimalala, Karen E Finnegan, Luc Rakotonirina, Benedicte Razafinjato, Matthew H Bonds, Michelle V Evans, Andres Garchitorena

Community health programs are gaining relevance within national health systems and becoming inherently more complex. To ensure that community health programs lead to equitable geographic access to care, the WHO recommends adapting the target population and workload of community health workers (CHWs) according to the local geographic context and population size of the communities they serve. Geographic optimization could be particularly beneficial for those activities that require CHWs to visit households door-to-door for last mile delivery of care. The goal of this study was to demonstrate how geographic optimization can be applied to inform community health programs in rural areas of the developing world. We developed a decision-making tool based on OpenStreetMap mapping and route optimization algorithms in order to inform the micro-planning and implementation of two kinds of community health interventions requiring door-to-door delivery: mass distribution campaigns and proactive community case management (proCCM) programs. We applied the Vehicle Routing Problem with Time Windows (VRPTW) algorithm to optimize the on-foot routes that CHWs take to visit households in their catchment, using a geographic dataset obtained from mapping on OpenStreetMap comprising over 100,000 buildings and 20,000 km of footpaths in the rural district of Ifanadiana, Madagascar. We found that personnel-day requirements ranged from less than 15 to over 60 per CHW catchment for mass distribution campaigns, and from less than 5 to over 20 for proCCM programs, assuming 1 visit per month. To illustrate how these VRPTW algorithms can be used by operational teams, we developed an "e-health" platform to visualize resource requirements, CHW optimal schedules and itineraries according to customizable intervention designs and hypotheses. Further development and scale-up of these tools could help optimize community health programs and other last mile delivery activities, in line with WHO recommendations, linking a new era of big data analytics with the most basic forms of frontline care in resource poor areas.

社区卫生计划在国家卫生系统中的相关性越来越大,其本身也变得越来越复杂。为确保社区卫生计划能在公平的地域范围内提供医疗服务,世卫组织建议根据当地的地理环境和所服务社区的人口规模,调整社区卫生工作人员(CHWs)的目标人群和工作量。对于那些需要社区保健员挨家挨户提供最后一英里医疗服务的活动来说,地理优化尤其有益。本研究的目标是展示如何将地理优化应用于发展中国家农村地区的社区卫生计划。我们开发了一种基于 OpenStreetMap 地图和路线优化算法的决策工具,以便为两种需要上门服务的社区卫生干预项目的微观规划和实施提供信息:大规模分发活动和主动社区病例管理(proCCM)项目。我们利用从 OpenStreetMap 地图中获取的地理数据集,包括马达加斯加 Ifanadiana 农村地区的 10 万多栋建筑和 2 万公里人行道,采用带时间窗口的车辆路由问题 (VRPTW) 算法来优化社区保健工作者走访其集水区住户的步行路线。我们发现,在大规模分发活动中,每个社区保健工作者集聚区所需的人日从不到 15 天到超过 60 天不等;而在促进儿童疾病防治计划中,假设每月访问一次,所需的人日从不到 5 天到超过 20 天不等。为了说明操作团队如何使用这些 VRPTW 算法,我们开发了一个 "电子健康 "平台,以便根据可定制的干预设计和假设,直观显示资源需求、卫生保健工作者的最佳时间表和行程。根据世界卫生组织的建议,进一步开发和推广这些工具有助于优化社区卫生计划和其他最后一英里交付活动,将新时代的大数据分析与资源贫乏地区最基本的一线护理联系起来。
{"title":"Combining OpenStreetMap mapping and route optimization algorithms to inform the delivery of community health interventions at the last mile.","authors":"Mauricianot Randriamihaja, Felana Angella Ihantamalala, Feno H Rafenoarimalala, Karen E Finnegan, Luc Rakotonirina, Benedicte Razafinjato, Matthew H Bonds, Michelle V Evans, Andres Garchitorena","doi":"10.1371/journal.pdig.0000621","DOIUrl":"10.1371/journal.pdig.0000621","url":null,"abstract":"<p><p>Community health programs are gaining relevance within national health systems and becoming inherently more complex. To ensure that community health programs lead to equitable geographic access to care, the WHO recommends adapting the target population and workload of community health workers (CHWs) according to the local geographic context and population size of the communities they serve. Geographic optimization could be particularly beneficial for those activities that require CHWs to visit households door-to-door for last mile delivery of care. The goal of this study was to demonstrate how geographic optimization can be applied to inform community health programs in rural areas of the developing world. We developed a decision-making tool based on OpenStreetMap mapping and route optimization algorithms in order to inform the micro-planning and implementation of two kinds of community health interventions requiring door-to-door delivery: mass distribution campaigns and proactive community case management (proCCM) programs. We applied the Vehicle Routing Problem with Time Windows (VRPTW) algorithm to optimize the on-foot routes that CHWs take to visit households in their catchment, using a geographic dataset obtained from mapping on OpenStreetMap comprising over 100,000 buildings and 20,000 km of footpaths in the rural district of Ifanadiana, Madagascar. We found that personnel-day requirements ranged from less than 15 to over 60 per CHW catchment for mass distribution campaigns, and from less than 5 to over 20 for proCCM programs, assuming 1 visit per month. To illustrate how these VRPTW algorithms can be used by operational teams, we developed an \"e-health\" platform to visualize resource requirements, CHW optimal schedules and itineraries according to customizable intervention designs and hypotheses. Further development and scale-up of these tools could help optimize community health programs and other last mile delivery activities, in line with WHO recommendations, linking a new era of big data analytics with the most basic forms of frontline care in resource poor areas.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000621"},"PeriodicalIF":0.0,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11542841/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142607214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bias in medical AI: Implications for clinical decision-making. 医学人工智能中的偏见:对临床决策的影响。
Pub Date : 2024-11-07 eCollection Date: 2024-11-01 DOI: 10.1371/journal.pdig.0000651
James L Cross, Michael A Choma, John A Onofrey

Biases in medical artificial intelligence (AI) arise and compound throughout the AI lifecycle. These biases can have significant clinical consequences, especially in applications that involve clinical decision-making. Left unaddressed, biased medical AI can lead to substandard clinical decisions and the perpetuation and exacerbation of longstanding healthcare disparities. We discuss potential biases that can arise at different stages in the AI development pipeline and how they can affect AI algorithms and clinical decision-making. Bias can occur in data features and labels, model development and evaluation, deployment, and publication. Insufficient sample sizes for certain patient groups can result in suboptimal performance, algorithm underestimation, and clinically unmeaningful predictions. Missing patient findings can also produce biased model behavior, including capturable but nonrandomly missing data, such as diagnosis codes, and data that is not usually or not easily captured, such as social determinants of health. Expertly annotated labels used to train supervised learning models may reflect implicit cognitive biases or substandard care practices. Overreliance on performance metrics during model development may obscure bias and diminish a model's clinical utility. When applied to data outside the training cohort, model performance can deteriorate from previous validation and can do so differentially across subgroups. How end users interact with deployed solutions can introduce bias. Finally, where models are developed and published, and by whom, impacts the trajectories and priorities of future medical AI development. Solutions to mitigate bias must be implemented with care, which include the collection of large and diverse data sets, statistical debiasing methods, thorough model evaluation, emphasis on model interpretability, and standardized bias reporting and transparency requirements. Prior to real-world implementation in clinical settings, rigorous validation through clinical trials is critical to demonstrate unbiased application. Addressing biases across model development stages is crucial for ensuring all patients benefit equitably from the future of medical AI.

医学人工智能(AI)中的偏差会在整个人工智能生命周期中出现并不断加剧。这些偏差会对临床产生重大影响,尤其是在涉及临床决策的应用中。如果不加以解决,带有偏见的医疗人工智能可能会导致不合格的临床决策,并使长期存在的医疗差距永久化和加剧。我们将讨论在人工智能开发管道的不同阶段可能出现的潜在偏差,以及这些偏差会如何影响人工智能算法和临床决策。偏见可能出现在数据特征和标签、模型开发和评估、部署和发布中。某些患者群体的样本量不足会导致性能不达标、算法估计不足以及临床预测无意义。缺失的患者研究结果也会导致模型行为出现偏差,包括诊断代码等可捕获但非随机缺失的数据,以及健康的社会决定因素等通常不会或不易捕获的数据。用于训练监督学习模型的专家注释标签可能会反映出隐含的认知偏差或不合标准的护理实践。在模型开发过程中过度依赖性能指标可能会掩盖偏差,降低模型的临床实用性。当应用到训练队列以外的数据时,模型的性能可能会比之前的验证结果更差,而且在不同的亚组中会有不同的表现。最终用户如何与已部署的解决方案互动也会产生偏差。最后,模型在哪里开发和发布,由谁开发和发布,都会影响未来医疗人工智能的发展轨迹和优先顺序。减轻偏倚的解决方案必须谨慎实施,其中包括收集大量多样的数据集、统计去伪存真方法、全面的模型评估、强调模型的可解释性以及标准化的偏倚报告和透明度要求。在临床环境中实际应用之前,通过临床试验进行严格验证对证明无偏见应用至关重要。要确保所有患者都能公平地受益于未来的医疗人工智能,解决模型开发阶段的偏差问题至关重要。
{"title":"Bias in medical AI: Implications for clinical decision-making.","authors":"James L Cross, Michael A Choma, John A Onofrey","doi":"10.1371/journal.pdig.0000651","DOIUrl":"10.1371/journal.pdig.0000651","url":null,"abstract":"<p><p>Biases in medical artificial intelligence (AI) arise and compound throughout the AI lifecycle. These biases can have significant clinical consequences, especially in applications that involve clinical decision-making. Left unaddressed, biased medical AI can lead to substandard clinical decisions and the perpetuation and exacerbation of longstanding healthcare disparities. We discuss potential biases that can arise at different stages in the AI development pipeline and how they can affect AI algorithms and clinical decision-making. Bias can occur in data features and labels, model development and evaluation, deployment, and publication. Insufficient sample sizes for certain patient groups can result in suboptimal performance, algorithm underestimation, and clinically unmeaningful predictions. Missing patient findings can also produce biased model behavior, including capturable but nonrandomly missing data, such as diagnosis codes, and data that is not usually or not easily captured, such as social determinants of health. Expertly annotated labels used to train supervised learning models may reflect implicit cognitive biases or substandard care practices. Overreliance on performance metrics during model development may obscure bias and diminish a model's clinical utility. When applied to data outside the training cohort, model performance can deteriorate from previous validation and can do so differentially across subgroups. How end users interact with deployed solutions can introduce bias. Finally, where models are developed and published, and by whom, impacts the trajectories and priorities of future medical AI development. Solutions to mitigate bias must be implemented with care, which include the collection of large and diverse data sets, statistical debiasing methods, thorough model evaluation, emphasis on model interpretability, and standardized bias reporting and transparency requirements. Prior to real-world implementation in clinical settings, rigorous validation through clinical trials is critical to demonstrate unbiased application. Addressing biases across model development stages is crucial for ensuring all patients benefit equitably from the future of medical AI.</p>","PeriodicalId":74465,"journal":{"name":"PLOS digital health","volume":"3 11","pages":"e0000651"},"PeriodicalIF":0.0,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11542778/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142607208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
PLOS digital health
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1