首页 > 最新文献

JMIR AI最新文献

英文 中文
Advancing Privacy-Preserving Health Care Analytics and Implementation of the Personal Health Train: Federated Deep Learning Study.
Pub Date : 2025-02-06 DOI: 10.2196/60847
Ananya Choudhury, Leroy Volmer, Frank Martin, Rianne Fijten, Leonard Wee, Andre Dekker, Johan van Soest
<p><strong>Background: </strong>The rapid advancement of deep learning in health care presents significant opportunities for automating complex medical tasks and improving clinical workflows. However, widespread adoption is impeded by data privacy concerns and the necessity for large, diverse datasets across multiple institutions. Federated learning (FL) has emerged as a viable solution, enabling collaborative artificial intelligence model development without sharing individual patient data. To effectively implement FL in health care, robust and secure infrastructures are essential. Developing such federated deep learning frameworks is crucial to harnessing the full potential of artificial intelligence while ensuring patient data privacy and regulatory compliance.</p><p><strong>Objective: </strong>The objective is to introduce an innovative FL infrastructure called the Personal Health Train (PHT) that includes the procedural, technical, and governance components needed to implement FL on real-world health care data, including training deep learning neural networks. The study aims to apply this federated deep learning infrastructure to the use case of gross tumor volume segmentation on chest computed tomography images of patients with lung cancer and present the results from a proof-of-concept experiment.</p><p><strong>Methods: </strong>The PHT framework addresses the challenges of data privacy when sharing data, by keeping data close to the source and instead bringing the analysis to the data. Technologically, PHT requires 3 interdependent components: "tracks" (protected communication channels), "trains" (containerized software apps), and "stations" (institutional data repositories), which are supported by the open source "Vantage6" software. The study applies this federated deep learning infrastructure to the use case of gross tumor volume segmentation on chest computed tomography images of patients with lung cancer, with the introduction of an additional component called the secure aggregation server, where the model averaging is done in a trusted and inaccessible environment.</p><p><strong>Results: </strong>We demonstrated the feasibility of executing deep learning algorithms in a federated manner using PHT and presented the results from a proof-of-concept study. The infrastructure linked 12 hospitals across 8 nations, covering 4 continents, demonstrating the scalability and global reach of the proposed approach. During the execution and training of the deep learning algorithm, no data were shared outside the hospital.</p><p><strong>Conclusions: </strong>The findings of the proof-of-concept study, as well as the implications and limitations of the infrastructure and the results, are discussed. The application of federated deep learning to unstructured medical imaging data, facilitated by the PHT framework and Vantage6 platform, represents a significant advancement in the field. The proposed infrastructure addresses the challenges of data priva
{"title":"Advancing Privacy-Preserving Health Care Analytics and Implementation of the Personal Health Train: Federated Deep Learning Study.","authors":"Ananya Choudhury, Leroy Volmer, Frank Martin, Rianne Fijten, Leonard Wee, Andre Dekker, Johan van Soest","doi":"10.2196/60847","DOIUrl":"https://doi.org/10.2196/60847","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;The rapid advancement of deep learning in health care presents significant opportunities for automating complex medical tasks and improving clinical workflows. However, widespread adoption is impeded by data privacy concerns and the necessity for large, diverse datasets across multiple institutions. Federated learning (FL) has emerged as a viable solution, enabling collaborative artificial intelligence model development without sharing individual patient data. To effectively implement FL in health care, robust and secure infrastructures are essential. Developing such federated deep learning frameworks is crucial to harnessing the full potential of artificial intelligence while ensuring patient data privacy and regulatory compliance.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;The objective is to introduce an innovative FL infrastructure called the Personal Health Train (PHT) that includes the procedural, technical, and governance components needed to implement FL on real-world health care data, including training deep learning neural networks. The study aims to apply this federated deep learning infrastructure to the use case of gross tumor volume segmentation on chest computed tomography images of patients with lung cancer and present the results from a proof-of-concept experiment.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;The PHT framework addresses the challenges of data privacy when sharing data, by keeping data close to the source and instead bringing the analysis to the data. Technologically, PHT requires 3 interdependent components: \"tracks\" (protected communication channels), \"trains\" (containerized software apps), and \"stations\" (institutional data repositories), which are supported by the open source \"Vantage6\" software. The study applies this federated deep learning infrastructure to the use case of gross tumor volume segmentation on chest computed tomography images of patients with lung cancer, with the introduction of an additional component called the secure aggregation server, where the model averaging is done in a trusted and inaccessible environment.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;We demonstrated the feasibility of executing deep learning algorithms in a federated manner using PHT and presented the results from a proof-of-concept study. The infrastructure linked 12 hospitals across 8 nations, covering 4 continents, demonstrating the scalability and global reach of the proposed approach. During the execution and training of the deep learning algorithm, no data were shared outside the hospital.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;The findings of the proof-of-concept study, as well as the implications and limitations of the infrastructure and the results, are discussed. The application of federated deep learning to unstructured medical imaging data, facilitated by the PHT framework and Vantage6 platform, represents a significant advancement in the field. The proposed infrastructure addresses the challenges of data priva","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"4 ","pages":"e60847"},"PeriodicalIF":0.0,"publicationDate":"2025-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143257561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Urgency Prediction for Medical Laboratory Tests Through Optimal Sparse Decision Tree: Case Study With Echocardiograms.
Pub Date : 2025-01-29 DOI: 10.2196/64188
Yiqun Jiang, Qing Li, Yu-Li Huang, Wenli Zhang

Background: In the contemporary realm of health care, laboratory tests stand as cornerstone components, driving the advancement of precision medicine. These tests offer intricate insights into a variety of medical conditions, thereby facilitating diagnosis, prognosis, and treatments. However, the accessibility of certain tests is hindered by factors such as high costs, a shortage of specialized personnel, or geographic disparities, posing obstacles to achieving equitable health care. For example, an echocardiogram is a type of laboratory test that is extremely important and not easily accessible. The increasing demand for echocardiograms underscores the imperative for more efficient scheduling protocols. Despite this pressing need, limited research has been conducted in this area.

Objective: The study aims to develop an interpretable machine learning model for determining the urgency of patients requiring echocardiograms, thereby aiding in the prioritization of scheduling procedures. Furthermore, this study aims to glean insights into the pivotal attributes influencing the prioritization of echocardiogram appointments, leveraging the high interpretability of the machine learning model.

Methods: Empirical and predictive analyses have been conducted to assess the urgency of patients based on a large real-world echocardiogram appointment dataset (ie, 34,293 appointments) sourced from electronic health records encompassing administrative information, referral diagnosis, and underlying patient conditions. We used a state-of-the-art interpretable machine learning algorithm, the optimal sparse decision tree (OSDT), renowned for its high accuracy and interpretability, to investigate the attributes pertinent to echocardiogram appointments.

Results: The method demonstrated satisfactory performance (F1-score=36.18% with an improvement of 1.7% and F2-score=28.18% with an improvement of 0.79% by the best-performing baseline model) in comparison to the best-performing baseline model. Moreover, due to its high interpretability, the results provide valuable medical insights regarding the identification of urgent patients for tests through the extraction of decision rules from the OSDT model.

Conclusions: The method demonstrated state-of-the-art predictive performance, affirming its effectiveness. Furthermore, we validate the decision rules derived from the OSDT model by comparing them with established medical knowledge. These interpretable results (eg, attribute importance and decision rules from the OSDT model) underscore the potential of our approach in prioritizing patient urgency for echocardiogram appointments and can be extended to prioritize other laboratory test appointments using electronic health record data.

{"title":"Urgency Prediction for Medical Laboratory Tests Through Optimal Sparse Decision Tree: Case Study With Echocardiograms.","authors":"Yiqun Jiang, Qing Li, Yu-Li Huang, Wenli Zhang","doi":"10.2196/64188","DOIUrl":"10.2196/64188","url":null,"abstract":"<p><strong>Background: </strong>In the contemporary realm of health care, laboratory tests stand as cornerstone components, driving the advancement of precision medicine. These tests offer intricate insights into a variety of medical conditions, thereby facilitating diagnosis, prognosis, and treatments. However, the accessibility of certain tests is hindered by factors such as high costs, a shortage of specialized personnel, or geographic disparities, posing obstacles to achieving equitable health care. For example, an echocardiogram is a type of laboratory test that is extremely important and not easily accessible. The increasing demand for echocardiograms underscores the imperative for more efficient scheduling protocols. Despite this pressing need, limited research has been conducted in this area.</p><p><strong>Objective: </strong>The study aims to develop an interpretable machine learning model for determining the urgency of patients requiring echocardiograms, thereby aiding in the prioritization of scheduling procedures. Furthermore, this study aims to glean insights into the pivotal attributes influencing the prioritization of echocardiogram appointments, leveraging the high interpretability of the machine learning model.</p><p><strong>Methods: </strong>Empirical and predictive analyses have been conducted to assess the urgency of patients based on a large real-world echocardiogram appointment dataset (ie, 34,293 appointments) sourced from electronic health records encompassing administrative information, referral diagnosis, and underlying patient conditions. We used a state-of-the-art interpretable machine learning algorithm, the optimal sparse decision tree (OSDT), renowned for its high accuracy and interpretability, to investigate the attributes pertinent to echocardiogram appointments.</p><p><strong>Results: </strong>The method demonstrated satisfactory performance (F<sub>1</sub>-score=36.18% with an improvement of 1.7% and F<sub>2</sub>-score=28.18% with an improvement of 0.79% by the best-performing baseline model) in comparison to the best-performing baseline model. Moreover, due to its high interpretability, the results provide valuable medical insights regarding the identification of urgent patients for tests through the extraction of decision rules from the OSDT model.</p><p><strong>Conclusions: </strong>The method demonstrated state-of-the-art predictive performance, affirming its effectiveness. Furthermore, we validate the decision rules derived from the OSDT model by comparing them with established medical knowledge. These interpretable results (eg, attribute importance and decision rules from the OSDT model) underscore the potential of our approach in prioritizing patient urgency for echocardiogram appointments and can be extended to prioritize other laboratory test appointments using electronic health record data.</p>","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"4 ","pages":"e64188"},"PeriodicalIF":0.0,"publicationDate":"2025-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143061551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of Use Cases, Target Groups, and Motivations Around Adopting Smart Speakers for Health Care and Social Care Settings: Scoping Review. 识别在医疗保健和社会护理环境中采用智能扬声器的用例、目标群体和动机:范围审查。
Pub Date : 2025-01-13 DOI: 10.2196/55673
Sebastian Merkel, Sabrina Schorr

Background: Conversational agents (CAs) are finding increasing application in health and social care, not least due to their growing use in the home. Recent developments in artificial intelligence, machine learning, and natural language processing have enabled a variety of new uses for CAs. One type of CA that has received increasing attention recently is smart speakers.

Objective: The aim of our study was to identify the use cases, user groups, and settings of smart speakers in health and social care. We also wanted to identify the key motivations for developers and designers to use this particular type of technology.

Methods: We conducted a scoping review to provide an overview of the literature on smart speakers in health and social care. The literature search was conducted between February 2023 and March 2023 and included 3 databases (PubMed, Scopus, and Sociological Abstracts), supplemented by Google Scholar. Several keywords were used, including technology (eg, voice assistant), product name (eg, Amazon Alexa), and setting (health care or social care). Publications were included if they met the predefined inclusion criteria: (1) published after 2015 and (2) used a smart speaker in a health care or social care setting. Publications were excluded if they met one of the following criteria: (1) did not report on the specific devices used, (2) did not focus specifically on smart speakers, (3) were systematic reviews and other forms of literature-based publications, and (4) were not published in English. Two reviewers collected, reviewed, abstracted, and analyzed the data using qualitative content analysis.

Results: A total of 27 articles were included in the final review. These articles covered a wide range of use cases in different settings, such as private homes, hospitals, long-term care facilities, and outpatient services. The main target group was patients, especially older users, followed by doctors and other medical staff members.

Conclusions: The results show that smart speakers have diverse applications in health and social care, addressing different contexts and audiences. Their affordability and easy-to-use interfaces make them attractive to various stakeholders. It seems likely that, due to technical advances in artificial intelligence and the market power of the companies behind the devices, there will be more use cases for smart speakers in the near future.

背景:对话式代理(CA)在医疗和社会护理领域的应用越来越广泛,尤其是由于其在家庭中的应用越来越多。人工智能、机器学习和自然语言处理领域的最新发展为会话代理提供了多种新用途。智能扬声器就是近来受到越来越多关注的一种 CA:我们的研究旨在确定智能扬声器在医疗和社会护理领域的使用案例、用户群体和使用环境。我们还希望确定开发人员和设计人员使用这种特殊技术的主要动机:我们进行了一次范围界定审查,以提供有关医疗和社会护理领域智能扬声器的文献概览。文献检索在 2023 年 2 月至 2023 年 3 月期间进行,包括 3 个数据库(PubMed、Scopus 和 Sociological Abstracts),并以 Google Scholar 作为补充。使用了多个关键词,包括技术(如语音助手)、产品名称(如亚马逊 Alexa)和环境(医疗或社会医疗)。符合预定义纳入标准的出版物均被纳入:(1) 2015 年之后发表;(2) 在医疗保健或社会护理环境中使用智能扬声器。符合以下标准之一的文献将被排除在外:(1)未报告所使用的具体设备,(2)未特别关注智能扬声器,(3)为系统性综述或其他形式的文献类出版物,(4)非英文发表。两名审稿人采用定性内容分析法收集、审查、摘录和分析数据:结果:共有 27 篇文章被纳入最终评审。这些文章涵盖了不同环境下的各种使用案例,如私人住宅、医院、长期护理机构和门诊服务。主要目标群体是病人,尤其是老年用户,其次是医生和其他医务人员:研究结果表明,智能扬声器在医疗和社会护理领域有多种应用,可满足不同环境和受众的需求。智能扬声器价格低廉、界面简单易用,因此对各利益相关方都具有吸引力。由于人工智能技术的进步和设备背后公司的市场力量,智能扬声器在不久的将来可能会有更多的使用案例。
{"title":"Identification of Use Cases, Target Groups, and Motivations Around Adopting Smart Speakers for Health Care and Social Care Settings: Scoping Review.","authors":"Sebastian Merkel, Sabrina Schorr","doi":"10.2196/55673","DOIUrl":"10.2196/55673","url":null,"abstract":"<p><strong>Background: </strong>Conversational agents (CAs) are finding increasing application in health and social care, not least due to their growing use in the home. Recent developments in artificial intelligence, machine learning, and natural language processing have enabled a variety of new uses for CAs. One type of CA that has received increasing attention recently is smart speakers.</p><p><strong>Objective: </strong>The aim of our study was to identify the use cases, user groups, and settings of smart speakers in health and social care. We also wanted to identify the key motivations for developers and designers to use this particular type of technology.</p><p><strong>Methods: </strong>We conducted a scoping review to provide an overview of the literature on smart speakers in health and social care. The literature search was conducted between February 2023 and March 2023 and included 3 databases (PubMed, Scopus, and Sociological Abstracts), supplemented by Google Scholar. Several keywords were used, including technology (eg, voice assistant), product name (eg, Amazon Alexa), and setting (health care or social care). Publications were included if they met the predefined inclusion criteria: (1) published after 2015 and (2) used a smart speaker in a health care or social care setting. Publications were excluded if they met one of the following criteria: (1) did not report on the specific devices used, (2) did not focus specifically on smart speakers, (3) were systematic reviews and other forms of literature-based publications, and (4) were not published in English. Two reviewers collected, reviewed, abstracted, and analyzed the data using qualitative content analysis.</p><p><strong>Results: </strong>A total of 27 articles were included in the final review. These articles covered a wide range of use cases in different settings, such as private homes, hospitals, long-term care facilities, and outpatient services. The main target group was patients, especially older users, followed by doctors and other medical staff members.</p><p><strong>Conclusions: </strong>The results show that smart speakers have diverse applications in health and social care, addressing different contexts and audiences. Their affordability and easy-to-use interfaces make them attractive to various stakeholders. It seems likely that, due to technical advances in artificial intelligence and the market power of the companies behind the devices, there will be more use cases for smart speakers in the near future.</p>","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"4 ","pages":"e55673"},"PeriodicalIF":0.0,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11773277/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142973707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating ChatGPT's Efficacy in Pediatric Pneumonia Detection From Chest X-Rays: Comparative Analysis of Specialized AI Models. 评估ChatGPT在儿童胸部x线肺炎检测中的疗效:专业人工智能模型的比较分析。
Pub Date : 2025-01-10 DOI: 10.2196/67621
Nitin Chetla, Mihir Tandon, Joseph Chang, Kunal Sukhija, Romil Patel, Ramon Sanchez
{"title":"Evaluating ChatGPT's Efficacy in Pediatric Pneumonia Detection From Chest X-Rays: Comparative Analysis of Specialized AI Models.","authors":"Nitin Chetla, Mihir Tandon, Joseph Chang, Kunal Sukhija, Romil Patel, Ramon Sanchez","doi":"10.2196/67621","DOIUrl":"10.2196/67621","url":null,"abstract":"","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"4 ","pages":"e67621"},"PeriodicalIF":0.0,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11759907/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142962602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Interpretable, Transparent, and Unobtrusive Detection of Acute Marijuana Intoxication in Natural Environments: Harnessing Smart Devices and Explainable AI to Empower Just-In-Time Adaptive Interventions: Longitudinal Observational Study. 在自然环境中加强对急性大麻中毒的可解释、透明和不显眼的检测:利用智能设备和可解释的人工智能来增强即时适应性干预:纵向观察研究。
Pub Date : 2025-01-02 DOI: 10.2196/52270
Sang Won Bae, Tammy Chung, Tongze Zhang, Anind K Dey, Rahul Islam

Background: Acute marijuana intoxication can impair motor skills and cognitive functions such as attention and information processing. However, traditional tests, like blood, urine, and saliva, fail to accurately detect acute marijuana intoxication in real time.

Objective: This study aims to explore whether integrating smartphone-based sensors with readily accessible wearable activity trackers, like Fitbit, can enhance the detection of acute marijuana intoxication in naturalistic settings. No previous research has investigated the effectiveness of passive sensing technologies for enhancing algorithm accuracy or enhancing the interpretability of digital phenotyping through explainable artificial intelligence in real-life scenarios. This approach aims to provide insights into how individuals interact with digital devices during algorithmic decision-making, particularly for detecting moderate to intensive marijuana intoxication in real-world contexts.

Methods: Sensor data from smartphones and Fitbits, along with self-reported marijuana use, were collected from 33 young adults over a 30-day period using the experience sampling method. Participants rated their level of intoxication on a scale from 1 to 10 within 15 minutes of consuming marijuana and during 3 daily semirandom prompts. The ratings were categorized as not intoxicated (0), low (1-3), and moderate to intense intoxication (4-10). The study analyzed the performance of models using mobile phone data only, Fitbit data only, and a combination of both (MobiFit) in detecting acute marijuana intoxication.

Results: The eXtreme Gradient Boosting Machine classifier showed that the MobiFit model, which combines mobile phone and wearable device data, achieved 99% accuracy (area under the curve=0.99; F1-score=0.85) in detecting acute marijuana intoxication in natural environments. The F1-score indicated significant improvements in sensitivity and specificity for the combined MobiFit model compared to using mobile or Fitbit data alone. Explainable artificial intelligence revealed that moderate to intense self-reported marijuana intoxication was associated with specific smartphone and Fitbit metrics, including elevated minimum heart rate, reduced macromovement, and increased noise energy around participants.

Conclusions: This study demonstrates the potential of using smartphone sensors and wearable devices for interpretable, transparent, and unobtrusive monitoring of acute marijuana intoxication in daily life. Advanced algorithmic decision-making provides valuable insight into behavioral, physiological, and environmental factors that could support timely interventions to reduce marijuana-related harm. Future real-world applications of these algorithms should be evaluated in collaboration with clinical experts to enhance their practicality and effectiveness.

背景:急性大麻中毒可损害运动技能和认知功能,如注意力和信息处理。然而,传统的检测方法,如血液、尿液和唾液,无法实时准确地检测出急性大麻中毒。目的:本研究旨在探索将基于智能手机的传感器与Fitbit等可穿戴活动追踪器相结合,是否可以增强对自然环境下急性大麻中毒的检测。之前没有研究调查过被动传感技术在现实生活场景中通过可解释的人工智能提高算法准确性或增强数字表型可解释性的有效性。该方法旨在深入了解个人在算法决策过程中如何与数字设备互动,特别是在现实世界中检测中度到重度大麻中毒。方法:采用经验抽样法,在30天的时间里收集了33名年轻人的智能手机和fitbit传感器数据,以及他们自己报告的大麻使用情况。参与者在吸食大麻的15分钟内和每天三次半随机提示期间,将他们的中毒程度从1到10打分。评分分为未中毒(0)、低中毒(1-3)和中度至重度中毒(4-10)。该研究分析了仅使用手机数据、仅使用Fitbit数据以及两者结合(MobiFit)检测急性大麻中毒的模型的性能。结果:eXtreme Gradient Boosting Machine分类器显示,结合手机和可穿戴设备数据的MobiFit模型准确率达到99%(曲线下面积=0.99;F1-score=0.85)对自然环境下急性大麻中毒的检测效果。f1评分表明,与单独使用移动或Fitbit数据相比,联合使用MobiFit模型在敏感性和特异性方面有显著提高。可解释的人工智能显示,中度至重度自我报告的大麻中毒与特定的智能手机和Fitbit指标有关,包括最低心率升高、宏观运动减少和参与者周围噪音能量增加。结论:本研究证明了在日常生活中使用智能手机传感器和可穿戴设备对急性大麻中毒进行可解释、透明和不显眼的监测的潜力。先进的算法决策提供了对行为、生理和环境因素的宝贵见解,可以支持及时干预以减少大麻相关危害。未来这些算法的实际应用应与临床专家合作评估,以提高其实用性和有效性。
{"title":"Enhancing Interpretable, Transparent, and Unobtrusive Detection of Acute Marijuana Intoxication in Natural Environments: Harnessing Smart Devices and Explainable AI to Empower Just-In-Time Adaptive Interventions: Longitudinal Observational Study.","authors":"Sang Won Bae, Tammy Chung, Tongze Zhang, Anind K Dey, Rahul Islam","doi":"10.2196/52270","DOIUrl":"10.2196/52270","url":null,"abstract":"<p><strong>Background: </strong>Acute marijuana intoxication can impair motor skills and cognitive functions such as attention and information processing. However, traditional tests, like blood, urine, and saliva, fail to accurately detect acute marijuana intoxication in real time.</p><p><strong>Objective: </strong>This study aims to explore whether integrating smartphone-based sensors with readily accessible wearable activity trackers, like Fitbit, can enhance the detection of acute marijuana intoxication in naturalistic settings. No previous research has investigated the effectiveness of passive sensing technologies for enhancing algorithm accuracy or enhancing the interpretability of digital phenotyping through explainable artificial intelligence in real-life scenarios. This approach aims to provide insights into how individuals interact with digital devices during algorithmic decision-making, particularly for detecting moderate to intensive marijuana intoxication in real-world contexts.</p><p><strong>Methods: </strong>Sensor data from smartphones and Fitbits, along with self-reported marijuana use, were collected from 33 young adults over a 30-day period using the experience sampling method. Participants rated their level of intoxication on a scale from 1 to 10 within 15 minutes of consuming marijuana and during 3 daily semirandom prompts. The ratings were categorized as not intoxicated (0), low (1-3), and moderate to intense intoxication (4-10). The study analyzed the performance of models using mobile phone data only, Fitbit data only, and a combination of both (MobiFit) in detecting acute marijuana intoxication.</p><p><strong>Results: </strong>The eXtreme Gradient Boosting Machine classifier showed that the MobiFit model, which combines mobile phone and wearable device data, achieved 99% accuracy (area under the curve=0.99; F<sub>1</sub>-score=0.85) in detecting acute marijuana intoxication in natural environments. The F<sub>1</sub>-score indicated significant improvements in sensitivity and specificity for the combined MobiFit model compared to using mobile or Fitbit data alone. Explainable artificial intelligence revealed that moderate to intense self-reported marijuana intoxication was associated with specific smartphone and Fitbit metrics, including elevated minimum heart rate, reduced macromovement, and increased noise energy around participants.</p><p><strong>Conclusions: </strong>This study demonstrates the potential of using smartphone sensors and wearable devices for interpretable, transparent, and unobtrusive monitoring of acute marijuana intoxication in daily life. Advanced algorithmic decision-making provides valuable insight into behavioral, physiological, and environmental factors that could support timely interventions to reduce marijuana-related harm. Future real-world applications of these algorithms should be evaluated in collaboration with clinical experts to enhance their practicality and effectiveness.</p>","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"4 ","pages":"e52270"},"PeriodicalIF":0.0,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11739728/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142923993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Geospatial Modeling of Deep Neural Visual Features for Predicting Obesity Prevalence in Missouri: Quantitative Study. 预测密苏里州肥胖流行的深度神经视觉特征的地理空间建模:定量研究。
Pub Date : 2024-12-17 DOI: 10.2196/64362
Butros M Dahu, Solaiman Khan, Imad Eddine Toubal, Mariam Alshehri, Carlos I Martinez-Villar, Olabode B Ogundele, Lincoln R Sheets, Grant J Scott

Background: The global obesity epidemic demands innovative approaches to understand its complex environmental and social determinants. Spatial technologies, such as geographic information systems, remote sensing, and spatial machine learning, offer new insights into this health issue. This study uses deep learning and spatial modeling to predict obesity rates for census tracts in Missouri.

Objective: This study aims to develop a scalable method for predicting obesity prevalence using deep convolutional neural networks applied to satellite imagery and geospatial analysis, focusing on 1052 census tracts in Missouri.

Methods: Our analysis followed 3 steps. First, Sentinel-2 satellite images were processed using the Residual Network-50 model to extract environmental features from 63,592 image chips (224×224 pixels). Second, these features were merged with obesity rate data from the Centers for Disease Control and Prevention for Missouri census tracts. Third, a spatial lag model was used to predict obesity rates and analyze the association between deep neural visual features and obesity prevalence. Spatial autocorrelation was used to identify clusters of obesity rates.

Results: Substantial spatial clustering of obesity rates was found across Missouri, with a Moran I value of 0.68, indicating similar obesity rates among neighboring census tracts. The spatial lag model demonstrated strong predictive performance, with an R2 of 0.93 and a spatial pseudo R2 of 0.92, explaining 93% of the variation in obesity rates. Local indicators from a spatial association analysis revealed regions with distinct high and low clusters of obesity, which were visualized through choropleth maps.

Conclusions: This study highlights the effectiveness of integrating deep convolutional neural networks and spatial modeling to predict obesity prevalence based on environmental features from satellite imagery. The model's high accuracy and ability to capture spatial patterns offer valuable insights for public health interventions. Future work should expand the geographical scope and include socioeconomic data to further refine the model for broader applications in obesity research.

背景:全球肥胖症的流行需要创新的方法来了解其复杂的环境和社会决定因素。地理信息系统、遥感和空间机器学习等空间技术为了解这一健康问题提供了新的视角。本研究利用深度学习和空间建模来预测密苏里州人口普查区的肥胖率:本研究旨在开发一种可扩展的方法,利用应用于卫星图像和地理空间分析的深度卷积神经网络预测肥胖患病率,重点关注密苏里州的 1052 个人口普查区:我们的分析分为三个步骤。首先,使用残差网络-50 模型处理哨兵-2 卫星图像,从 63,592 个图像片(224×224 像素)中提取环境特征。其次,将这些特征与美国疾病控制和预防中心提供的密苏里州人口普查区肥胖率数据合并。第三,使用空间滞后模型预测肥胖率,并分析深度神经视觉特征与肥胖率之间的关联。利用空间自相关性确定肥胖率集群:结果:在密苏里州各地发现了大量肥胖率空间集群,莫兰 I 值为 0.68,表明相邻人口普查区的肥胖率相似。空间滞后模型显示出很强的预测能力,R2 为 0.93,空间伪 R2 为 0.92,解释了肥胖率变化的 93%。通过空间关联分析得出的本地指标显示,肥胖率较高和较低的地区有明显的集群,这些集群可通过choropleth地图直观地显示出来:本研究强调了深度卷积神经网络与空间建模相结合,根据卫星图像的环境特征预测肥胖患病率的有效性。该模型的高准确性和捕捉空间模式的能力为公共卫生干预提供了宝贵的见解。未来的工作应扩大地理范围并纳入社会经济数据,以进一步完善该模型,使其在肥胖研究中得到更广泛的应用。
{"title":"Geospatial Modeling of Deep Neural Visual Features for Predicting Obesity Prevalence in Missouri: Quantitative Study.","authors":"Butros M Dahu, Solaiman Khan, Imad Eddine Toubal, Mariam Alshehri, Carlos I Martinez-Villar, Olabode B Ogundele, Lincoln R Sheets, Grant J Scott","doi":"10.2196/64362","DOIUrl":"10.2196/64362","url":null,"abstract":"<p><strong>Background: </strong>The global obesity epidemic demands innovative approaches to understand its complex environmental and social determinants. Spatial technologies, such as geographic information systems, remote sensing, and spatial machine learning, offer new insights into this health issue. This study uses deep learning and spatial modeling to predict obesity rates for census tracts in Missouri.</p><p><strong>Objective: </strong>This study aims to develop a scalable method for predicting obesity prevalence using deep convolutional neural networks applied to satellite imagery and geospatial analysis, focusing on 1052 census tracts in Missouri.</p><p><strong>Methods: </strong>Our analysis followed 3 steps. First, Sentinel-2 satellite images were processed using the Residual Network-50 model to extract environmental features from 63,592 image chips (224×224 pixels). Second, these features were merged with obesity rate data from the Centers for Disease Control and Prevention for Missouri census tracts. Third, a spatial lag model was used to predict obesity rates and analyze the association between deep neural visual features and obesity prevalence. Spatial autocorrelation was used to identify clusters of obesity rates.</p><p><strong>Results: </strong>Substantial spatial clustering of obesity rates was found across Missouri, with a Moran I value of 0.68, indicating similar obesity rates among neighboring census tracts. The spatial lag model demonstrated strong predictive performance, with an R<sup>2</sup> of 0.93 and a spatial pseudo R<sup>2</sup> of 0.92, explaining 93% of the variation in obesity rates. Local indicators from a spatial association analysis revealed regions with distinct high and low clusters of obesity, which were visualized through choropleth maps.</p><p><strong>Conclusions: </strong>This study highlights the effectiveness of integrating deep convolutional neural networks and spatial modeling to predict obesity prevalence based on environmental features from satellite imagery. The model's high accuracy and ability to capture spatial patterns offer valuable insights for public health interventions. Future work should expand the geographical scope and include socioeconomic data to further refine the model for broader applications in obesity research.</p>","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"3 ","pages":"e64362"},"PeriodicalIF":0.0,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11688583/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142840476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Current State of Community-Driven Radiological AI Deployment in Medical Imaging. 医学影像领域社区驱动的放射人工智能部署现状。
Pub Date : 2024-12-09 DOI: 10.2196/55833
Vikash Gupta, Barbaros Erdal, Carolina Ramirez, Ralf Floca, Bradley Genereaux, Sidney Bryson, Christopher Bridge, Jens Kleesiek, Felix Nensa, Rickmer Braren, Khaled Younis, Tobias Penzkofer, Andreas Michael Bucher, Ming Melvin Qin, Gigon Bae, Hyeonhoon Lee, M Jorge Cardoso, Sebastien Ourselin, Eric Kerfoot, Rahul Choudhury, Richard D White, Tessa Cook, David Bericat, Matthew Lungren, Risto Haukioja, Haris Shuaib

Artificial intelligence (AI) has become commonplace in solving routine everyday tasks. Because of the exponential growth in medical imaging data volume and complexity, the workload on radiologists is steadily increasing. AI has been shown to improve efficiency in medical image generation, processing, and interpretation, and various such AI models have been developed across research laboratories worldwide. However, very few of these, if any, find their way into routine clinical use, a discrepancy that reflects the divide between AI research and successful AI translation. The goal of this paper is to give an overview of the intersection of AI and medical imaging landscapes. We also want to inform the readers about the importance of using standards in their radiology workflow and the challenges associated with deploying AI models in the clinical workflow. The main focus of this paper is to examine the existing condition of radiology workflow and identify the challenges hindering the implementation of AI in hospital settings. This report reflects extensive weekly discussions and practical problem-solving expertise accumulated over multiple years by industry experts, imaging informatics professionals, research scientists, and clinicians. To gain a deeper understanding of the requirements for deploying AI models, we introduce a taxonomy of AI use cases, supplemented by real-world instances of AI model integration within hospitals. We will also explain how the need for AI integration in radiology can be addressed using the Medical Open Network for AI (MONAI). MONAI is an open-source consortium for providing reproducible deep learning solutions and integration tools for radiology practice in hospitals.

人工智能(AI)在解决日常事务方面已经变得司空见惯。由于医学影像数据量和复杂性的指数级增长,放射科医生的工作量正在稳步增加。人工智能已被证明可以提高医学图像生成、处理和解释的效率,世界各地的研究实验室已经开发了各种人工智能模型。然而,其中很少有(如果有的话)能够进入常规临床应用,这一差异反映了人工智能研究与成功的人工智能翻译之间的鸿沟。本文的目的是概述人工智能和医学成像景观的交叉。我们还希望告知读者在放射学工作流程中使用标准的重要性,以及在临床工作流程中部署人工智能模型所面临的挑战。本文的主要重点是研究放射学工作流程的现有状况,并确定阻碍在医院环境中实施人工智能的挑战。该报告反映了行业专家、成像信息学专业人员、研究科学家和临床医生多年来积累的广泛的每周讨论和实际解决问题的专业知识。为了更深入地了解部署人工智能模型的需求,我们引入了人工智能用例的分类,并辅以医院内人工智能模型集成的实际实例。我们还将解释如何使用人工智能医疗开放网络(MONAI)来解决放射学中人工智能集成的需求。MONAI是一个开源联盟,为医院放射学实践提供可重复的深度学习解决方案和集成工具。
{"title":"Current State of Community-Driven Radiological AI Deployment in Medical Imaging.","authors":"Vikash Gupta, Barbaros Erdal, Carolina Ramirez, Ralf Floca, Bradley Genereaux, Sidney Bryson, Christopher Bridge, Jens Kleesiek, Felix Nensa, Rickmer Braren, Khaled Younis, Tobias Penzkofer, Andreas Michael Bucher, Ming Melvin Qin, Gigon Bae, Hyeonhoon Lee, M Jorge Cardoso, Sebastien Ourselin, Eric Kerfoot, Rahul Choudhury, Richard D White, Tessa Cook, David Bericat, Matthew Lungren, Risto Haukioja, Haris Shuaib","doi":"10.2196/55833","DOIUrl":"10.2196/55833","url":null,"abstract":"<p><p>Artificial intelligence (AI) has become commonplace in solving routine everyday tasks. Because of the exponential growth in medical imaging data volume and complexity, the workload on radiologists is steadily increasing. AI has been shown to improve efficiency in medical image generation, processing, and interpretation, and various such AI models have been developed across research laboratories worldwide. However, very few of these, if any, find their way into routine clinical use, a discrepancy that reflects the divide between AI research and successful AI translation. The goal of this paper is to give an overview of the intersection of AI and medical imaging landscapes. We also want to inform the readers about the importance of using standards in their radiology workflow and the challenges associated with deploying AI models in the clinical workflow. The main focus of this paper is to examine the existing condition of radiology workflow and identify the challenges hindering the implementation of AI in hospital settings. This report reflects extensive weekly discussions and practical problem-solving expertise accumulated over multiple years by industry experts, imaging informatics professionals, research scientists, and clinicians. To gain a deeper understanding of the requirements for deploying AI models, we introduce a taxonomy of AI use cases, supplemented by real-world instances of AI model integration within hospitals. We will also explain how the need for AI integration in radiology can be addressed using the Medical Open Network for AI (MONAI). MONAI is an open-source consortium for providing reproducible deep learning solutions and integration tools for radiology practice in hospitals.</p>","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"3 ","pages":"e55833"},"PeriodicalIF":0.0,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11667144/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142803843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensuring Appropriate Representation in Artificial Intelligence-Generated Medical Imagery: Protocol for a Methodological Approach to Address Skin Tone Bias. 确保人工智能生成的医学图像具有适当的代表性:解决肤色偏差的方法协议》。
Pub Date : 2024-11-27 DOI: 10.2196/58275
Andrew O'Malley, Miriam Veenhuizen, Ayla Ahmed

Background: In medical education, particularly in anatomy and dermatology, generative artificial intelligence (AI) can be used to create customized illustrations. However, the underrepresentation of darker skin tones in medical textbooks and elsewhere, which serve as training data for AI, poses a significant challenge in ensuring diverse and inclusive educational materials.

Objective: This study aims to evaluate the extent of skin tone diversity in AI-generated medical images and to test whether the representation of skin tones can be improved by modifying AI prompts to better reflect the demographic makeup of the US population.

Methods: In total, 2 standard AI models (Dall-E [OpenAI] and Midjourney [Midjourney Inc]) each generated 100 images of people with psoriasis. In addition, a custom model was developed that incorporated a prompt injection aimed at "forcing" the AI (Dall-E 3) to reflect the skin tone distribution of the US population according to the 2012 American National Election Survey. This custom model generated another set of 100 images. The skin tones in these images were assessed by 3 researchers using the New Immigrant Survey skin tone scale, with the median value representing each image. A chi-square goodness of fit analysis compared the skin tone distributions from each set of images to that of the US population.

Results: The standard AI models (Dalle-3 and Midjourney) demonstrated a significant difference between the expected skin tones of the US population and the observed tones in the generated images (P<.001). Both standard AI models overrepresented lighter skin. Conversely, the custom model with the modified prompt yielded a distribution of skin tones that closely matched the expected demographic representation, showing no significant difference (P=.04).

Conclusions: This study reveals a notable bias in AI-generated medical images, predominantly underrepresenting darker skin tones. This bias can be effectively addressed by modifying AI prompts to incorporate real-life demographic distributions. The findings emphasize the need for conscious efforts in AI development to ensure diverse and representative outputs, particularly in educational and medical contexts. Users of generative AI tools should be aware that these biases exist, and that similar tendencies may also exist in other types of generative AI (eg, large language models) and in other characteristics (eg, sex, gender, culture, and ethnicity). Injecting demographic data into AI prompts may effectively counteract these biases, ensuring a more accurate representation of the general population.

背景:在医学教育中,尤其是在解剖学和皮肤病学中,生成式人工智能(AI)可用于创建定制插图。然而,在作为人工智能训练数据的医学教科书和其他书籍中,深肤色的代表性不足,这对确保教育材料的多样性和包容性构成了巨大挑战:本研究旨在评估人工智能生成的医学图像中肤色多样性的程度,并测试是否可以通过修改人工智能提示来改善肤色的代表性,从而更好地反映美国人口的构成:总共有两个标准人工智能模型(Dall-E [OpenAI] 和 Midjourney [Midjourney Inc])各生成了 100 张银屑病患者的图像。此外,我们还开发了一个自定义模型,该模型包含一个提示注射,旨在 "强制 "人工智能(Dall-E 3)根据 2012 年美国全国选举调查反映美国人口的肤色分布。该自定义模型生成了另外 100 张图像。这些图像中的肤色由 3 名研究人员使用新移民调查肤色量表进行评估,中值代表每张图像。通过卡方拟合优度分析,将每组图像的肤色分布与美国人口的肤色分布进行了比较:结果:标准人工智能模型(Dalle-3 和 Midjourney)显示,美国人口的预期肤色与生成图像中观察到的肤色之间存在显著差异(PC 结论:这项研究揭示了人工智能生成的医学图像存在明显偏差,主要是对深肤色代表不足。通过修改人工智能提示,将现实生活中的人口分布纳入其中,可以有效解决这一偏差。研究结果强调,在开发人工智能时需要有意识地确保输出结果的多样性和代表性,尤其是在教育和医疗领域。生成式人工智能工具的用户应该意识到这些偏见的存在,而且其他类型的生成式人工智能(如大型语言模型)和其他特征(如性、性别、文化和种族)也可能存在类似的倾向。在人工智能提示中注入人口统计数据可以有效抵消这些偏见,确保更准确地代表普通人群。
{"title":"Ensuring Appropriate Representation in Artificial Intelligence-Generated Medical Imagery: Protocol for a Methodological Approach to Address Skin Tone Bias.","authors":"Andrew O'Malley, Miriam Veenhuizen, Ayla Ahmed","doi":"10.2196/58275","DOIUrl":"10.2196/58275","url":null,"abstract":"<p><strong>Background: </strong>In medical education, particularly in anatomy and dermatology, generative artificial intelligence (AI) can be used to create customized illustrations. However, the underrepresentation of darker skin tones in medical textbooks and elsewhere, which serve as training data for AI, poses a significant challenge in ensuring diverse and inclusive educational materials.</p><p><strong>Objective: </strong>This study aims to evaluate the extent of skin tone diversity in AI-generated medical images and to test whether the representation of skin tones can be improved by modifying AI prompts to better reflect the demographic makeup of the US population.</p><p><strong>Methods: </strong>In total, 2 standard AI models (Dall-E [OpenAI] and Midjourney [Midjourney Inc]) each generated 100 images of people with psoriasis. In addition, a custom model was developed that incorporated a prompt injection aimed at \"forcing\" the AI (Dall-E 3) to reflect the skin tone distribution of the US population according to the 2012 American National Election Survey. This custom model generated another set of 100 images. The skin tones in these images were assessed by 3 researchers using the New Immigrant Survey skin tone scale, with the median value representing each image. A chi-square goodness of fit analysis compared the skin tone distributions from each set of images to that of the US population.</p><p><strong>Results: </strong>The standard AI models (Dalle-3 and Midjourney) demonstrated a significant difference between the expected skin tones of the US population and the observed tones in the generated images (P<.001). Both standard AI models overrepresented lighter skin. Conversely, the custom model with the modified prompt yielded a distribution of skin tones that closely matched the expected demographic representation, showing no significant difference (P=.04).</p><p><strong>Conclusions: </strong>This study reveals a notable bias in AI-generated medical images, predominantly underrepresenting darker skin tones. This bias can be effectively addressed by modifying AI prompts to incorporate real-life demographic distributions. The findings emphasize the need for conscious efforts in AI development to ensure diverse and representative outputs, particularly in educational and medical contexts. Users of generative AI tools should be aware that these biases exist, and that similar tendencies may also exist in other types of generative AI (eg, large language models) and in other characteristics (eg, sex, gender, culture, and ethnicity). Injecting demographic data into AI prompts may effectively counteract these biases, ensuring a more accurate representation of the general population.</p>","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"3 ","pages":"e58275"},"PeriodicalIF":0.0,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11635324/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142735197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How Explainable Artificial Intelligence Can Increase or Decrease Clinicians' Trust in AI Applications in Health Care: Systematic Review. 可解释的人工智能如何增加或减少临床医生对医疗领域人工智能应用的信任?系统回顾。
Pub Date : 2024-10-30 DOI: 10.2196/53207
Rikard Rosenbacke, Åsa Melhus, Martin McKee, David Stuckler
<p><strong>Background: </strong>Artificial intelligence (AI) has significant potential in clinical practice. However, its "black box" nature can lead clinicians to question its value. The challenge is to create sufficient trust for clinicians to feel comfortable using AI, but not so much that they defer to it even when it produces results that conflict with their clinical judgment in ways that lead to incorrect decisions. Explainable AI (XAI) aims to address this by providing explanations of how AI algorithms reach their conclusions. However, it remains unclear whether such explanations foster an appropriate degree of trust to ensure the optimal use of AI in clinical practice.</p><p><strong>Objective: </strong>This study aims to systematically review and synthesize empirical evidence on the impact of XAI on clinicians' trust in AI-driven clinical decision-making.</p><p><strong>Methods: </strong>A systematic review was conducted in accordance with PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, searching PubMed and Web of Science databases. Studies were included if they empirically measured the impact of XAI on clinicians' trust using cognition- or affect-based measures. Out of 778 articles screened, 10 met the inclusion criteria. We assessed the risk of bias using standard tools appropriate to the methodology of each paper.</p><p><strong>Results: </strong>The risk of bias in all papers was moderate or moderate to high. All included studies operationalized trust primarily through cognitive-based definitions, with 2 also incorporating affect-based measures. Out of these, 5 studies reported that XAI increased clinicians' trust compared with standard AI, particularly when the explanations were clear, concise, and relevant to clinical practice. In addition, 3 studies found no significant effect of XAI on trust, and the presence of explanations does not automatically improve trust. Notably, 2 studies highlighted that XAI could either enhance or diminish trust, depending on the complexity and coherence of the provided explanations. The majority of studies suggest that XAI has the potential to enhance clinicians' trust in recommendations generated by AI. However, complex or contradictory explanations can undermine this trust. More critically, trust in AI is not inherently beneficial, as AI recommendations are not infallible. These findings underscore the nuanced role of explanation quality and suggest that trust can be modulated through the careful design of XAI systems.</p><p><strong>Conclusions: </strong>Excessive trust in incorrect advice generated by AI can adversely impact clinical accuracy, just as can happen when correct advice is distrusted. Future research should focus on refining both cognitive and affect-based measures of trust and on developing strategies to achieve an appropriate balance in terms of trust, preventing both blind trust and undue skepticism. Optimizing trust in AI systems is essential for
背景:人工智能(AI)在临床实践中具有巨大潜力。然而,人工智能的 "黑箱 "特性会让临床医生质疑其价值。我们面临的挑战是如何建立足够的信任,让临床医生能够放心使用人工智能,但又不能过度依赖人工智能,即使人工智能得出的结果与他们的临床判断相冲突,从而导致错误的决策。可解释的人工智能(XAI)旨在通过解释人工智能算法如何得出结论来解决这一问题。然而,这种解释是否能促进适当程度的信任,以确保在临床实践中优化使用人工智能,目前仍不清楚:本研究旨在系统回顾和综合 XAI 对临床医生信任人工智能驱动的临床决策的影响的实证证据:方法:根据PRISMA(系统综述和Meta分析的首选报告项目)指南,搜索PubMed和Web of Science数据库,进行系统综述。如果研究使用基于认知或情感的测量方法实证测量了 XAI 对临床医生信任度的影响,则被纳入研究。在筛选出的 778 篇文章中,有 10 篇符合纳入标准。我们使用适合每篇论文方法的标准工具对偏倚风险进行了评估:所有论文的偏倚风险均为中度或中度至高度。所有纳入的研究都主要通过基于认知的定义对信任进行操作,其中两篇研究还采用了基于情感的测量方法。其中,5 项研究报告称,与标准人工智能相比,XAI 增加了临床医生的信任度,尤其是在解释清晰、简明且与临床实践相关的情况下。此外,3 项研究发现 XAI 对信任度没有显著影响,而且解释的存在并不会自动提高信任度。值得注意的是,有 2 项研究强调,根据所提供解释的复杂性和连贯性,XAI 可以增强或削弱信任。大多数研究表明,XAI 有可能提高临床医生对人工智能生成的建议的信任度。然而,复杂或自相矛盾的解释可能会破坏这种信任。更关键的是,对人工智能的信任并非天生有益,因为人工智能的建议并非无懈可击。这些发现强调了解释质量的微妙作用,并表明可以通过精心设计 XAI 系统来调节信任度:结论:过度信任人工智能生成的错误建议会对临床准确性产生不利影响,正如不信任正确建议一样。未来的研究应侧重于完善基于认知和情感的信任测量方法,并制定策略以实现信任方面的适当平衡,防止盲目信任和过度怀疑。优化对人工智能系统的信任对其有效融入临床实践至关重要。
{"title":"How Explainable Artificial Intelligence Can Increase or Decrease Clinicians' Trust in AI Applications in Health Care: Systematic Review.","authors":"Rikard Rosenbacke, Åsa Melhus, Martin McKee, David Stuckler","doi":"10.2196/53207","DOIUrl":"10.2196/53207","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Artificial intelligence (AI) has significant potential in clinical practice. However, its \"black box\" nature can lead clinicians to question its value. The challenge is to create sufficient trust for clinicians to feel comfortable using AI, but not so much that they defer to it even when it produces results that conflict with their clinical judgment in ways that lead to incorrect decisions. Explainable AI (XAI) aims to address this by providing explanations of how AI algorithms reach their conclusions. However, it remains unclear whether such explanations foster an appropriate degree of trust to ensure the optimal use of AI in clinical practice.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This study aims to systematically review and synthesize empirical evidence on the impact of XAI on clinicians' trust in AI-driven clinical decision-making.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;A systematic review was conducted in accordance with PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines, searching PubMed and Web of Science databases. Studies were included if they empirically measured the impact of XAI on clinicians' trust using cognition- or affect-based measures. Out of 778 articles screened, 10 met the inclusion criteria. We assessed the risk of bias using standard tools appropriate to the methodology of each paper.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;The risk of bias in all papers was moderate or moderate to high. All included studies operationalized trust primarily through cognitive-based definitions, with 2 also incorporating affect-based measures. Out of these, 5 studies reported that XAI increased clinicians' trust compared with standard AI, particularly when the explanations were clear, concise, and relevant to clinical practice. In addition, 3 studies found no significant effect of XAI on trust, and the presence of explanations does not automatically improve trust. Notably, 2 studies highlighted that XAI could either enhance or diminish trust, depending on the complexity and coherence of the provided explanations. The majority of studies suggest that XAI has the potential to enhance clinicians' trust in recommendations generated by AI. However, complex or contradictory explanations can undermine this trust. More critically, trust in AI is not inherently beneficial, as AI recommendations are not infallible. These findings underscore the nuanced role of explanation quality and suggest that trust can be modulated through the careful design of XAI systems.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;Excessive trust in incorrect advice generated by AI can adversely impact clinical accuracy, just as can happen when correct advice is distrusted. Future research should focus on refining both cognitive and affect-based measures of trust and on developing strategies to achieve an appropriate balance in terms of trust, preventing both blind trust and undue skepticism. Optimizing trust in AI systems is essential for","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"3 ","pages":"e53207"},"PeriodicalIF":0.0,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561425/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142549351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding AI's Role in Endometriosis Patient Education and Evaluating Its Information and Accuracy: Systematic Review. 了解人工智能在子宫内膜异位症患者教育中的作用并评估其信息和准确性:系统综述。
Pub Date : 2024-10-30 DOI: 10.2196/64593
Juliana Almeida Oliveira, Karine Eskandar, Emre Kar, Flávia Ribeiro de Oliveira, Agnaldo Lopes da Silva Filho
<p><strong>Background: </strong>Endometriosis is a chronic gynecological condition that affects a significant portion of women of reproductive age, leading to debilitating symptoms such as chronic pelvic pain and infertility. Despite advancements in diagnosis and management, patient education remains a critical challenge. With the rapid growth of digital platforms, artificial intelligence (AI) has emerged as a potential tool to enhance patient education and access to information.</p><p><strong>Objective: </strong>This systematic review aims to explore the role of AI in facilitating education and improving information accessibility for individuals with endometriosis.</p><p><strong>Methods: </strong>This review followed the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines to ensure rigorous and transparent reporting. We conducted a comprehensive search of PubMed; Embase; the Regional Online Information System for Scientific Journals of Latin America, the Caribbean, Spain and Portugal (LATINDEX); Latin American and Caribbean Literature in Health Sciences (LILACS); Institute of Electrical and Electronics Engineers (IEEE) Xplore, and the Cochrane Central Register of Controlled Trials using the terms "endometriosis" and "artificial intelligence." Studies were selected based on their focus on AI applications in patient education or information dissemination regarding endometriosis. We included studies that evaluated AI-driven tools for assessing patient knowledge and addressed frequently asked questions related to endometriosis. Data extraction and quality assessment were conducted independently by 2 authors, with discrepancies resolved through consensus.</p><p><strong>Results: </strong>Out of 400 initial search results, 11 studies met the inclusion criteria and were fully reviewed. We ultimately included 3 studies, 1 of which was an abstract. The studies examined the use of AI models, such as ChatGPT (OpenAI), machine learning, and natural language processing, in providing educational resources and answering common questions about endometriosis. The findings indicated that AI tools, particularly large language models, offer accurate responses to frequently asked questions with varying degrees of sufficiency across different categories. AI's integration with social media platforms also highlights its potential to identify patients' needs and enhance information dissemination.</p><p><strong>Conclusions: </strong>AI holds promise in advancing patient education and information access for endometriosis, providing accurate and comprehensive answers to common queries, and facilitating a better understanding of the condition. However, challenges remain in ensuring ethical use, equitable access, and maintaining accuracy across diverse patient populations. Future research should focus on developing standardized approaches for evaluating AI's impact on patient education and exploring its integration into clinical practice to
背景:子宫内膜异位症是一种慢性妇科疾病,影响着相当一部分育龄妇女,导致慢性盆腔疼痛和不孕症等使人衰弱的症状。尽管在诊断和管理方面取得了进步,但患者教育仍是一项严峻的挑战。随着数字平台的快速发展,人工智能(AI)已成为加强患者教育和信息获取的潜在工具:本系统综述旨在探讨人工智能在促进子宫内膜异位症患者教育和提高信息可及性方面的作用:本综述遵循系统综述和荟萃分析首选报告项目(PRISMA)指南,以确保报告的严谨性和透明度。我们使用 "子宫内膜异位症 "和 "人工智能 "这两个词对 PubMed、Embase、拉丁美洲、加勒比海、西班牙和葡萄牙科学期刊区域在线信息系统(LATINDEX)、拉丁美洲和加勒比海健康科学文献(LILACS)、电气和电子工程师协会(IEEE)Xplore 以及 Cochrane 对照试验中央登记册进行了全面检索。我们根据人工智能在子宫内膜异位症患者教育或信息传播中的应用重点来选择研究。我们纳入的研究评估了用于评估患者知识的人工智能驱动工具,并解决了与子宫内膜异位症相关的常见问题。数据提取和质量评估由两位作者独立完成,不一致之处通过共识解决:在 400 项初步搜索结果中,有 11 项研究符合纳入标准,并进行了全面审查。我们最终纳入了 3 项研究,其中 1 项为摘要。这些研究考察了人工智能模型(如 ChatGPT (OpenAI))、机器学习和自然语言处理在提供教育资源和回答子宫内膜异位症常见问题方面的应用。研究结果表明,人工智能工具,尤其是大型语言模型,可以准确回答常见问题,但不同类别的问题回答的充分程度各不相同。人工智能与社交媒体平台的整合也凸显了它在确定患者需求和加强信息传播方面的潜力:结论:人工智能有望推动子宫内膜异位症的患者教育和信息获取,为常见问题提供准确而全面的答案,并促进人们更好地了解这种疾病。然而,在确保道德使用、公平获取以及在不同患者群体中保持准确性方面仍存在挑战。未来的研究应侧重于开发标准化方法,以评估人工智能对患者教育的影响,并探索将其融入临床实践,以加强对子宫内膜异位症患者的支持。
{"title":"Understanding AI's Role in Endometriosis Patient Education and Evaluating Its Information and Accuracy: Systematic Review.","authors":"Juliana Almeida Oliveira, Karine Eskandar, Emre Kar, Flávia Ribeiro de Oliveira, Agnaldo Lopes da Silva Filho","doi":"10.2196/64593","DOIUrl":"10.2196/64593","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Endometriosis is a chronic gynecological condition that affects a significant portion of women of reproductive age, leading to debilitating symptoms such as chronic pelvic pain and infertility. Despite advancements in diagnosis and management, patient education remains a critical challenge. With the rapid growth of digital platforms, artificial intelligence (AI) has emerged as a potential tool to enhance patient education and access to information.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Objective: &lt;/strong&gt;This systematic review aims to explore the role of AI in facilitating education and improving information accessibility for individuals with endometriosis.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;This review followed the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) guidelines to ensure rigorous and transparent reporting. We conducted a comprehensive search of PubMed; Embase; the Regional Online Information System for Scientific Journals of Latin America, the Caribbean, Spain and Portugal (LATINDEX); Latin American and Caribbean Literature in Health Sciences (LILACS); Institute of Electrical and Electronics Engineers (IEEE) Xplore, and the Cochrane Central Register of Controlled Trials using the terms \"endometriosis\" and \"artificial intelligence.\" Studies were selected based on their focus on AI applications in patient education or information dissemination regarding endometriosis. We included studies that evaluated AI-driven tools for assessing patient knowledge and addressed frequently asked questions related to endometriosis. Data extraction and quality assessment were conducted independently by 2 authors, with discrepancies resolved through consensus.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;Out of 400 initial search results, 11 studies met the inclusion criteria and were fully reviewed. We ultimately included 3 studies, 1 of which was an abstract. The studies examined the use of AI models, such as ChatGPT (OpenAI), machine learning, and natural language processing, in providing educational resources and answering common questions about endometriosis. The findings indicated that AI tools, particularly large language models, offer accurate responses to frequently asked questions with varying degrees of sufficiency across different categories. AI's integration with social media platforms also highlights its potential to identify patients' needs and enhance information dissemination.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;AI holds promise in advancing patient education and information access for endometriosis, providing accurate and comprehensive answers to common queries, and facilitating a better understanding of the condition. However, challenges remain in ensuring ethical use, equitable access, and maintaining accuracy across diverse patient populations. Future research should focus on developing standardized approaches for evaluating AI's impact on patient education and exploring its integration into clinical practice to ","PeriodicalId":73551,"journal":{"name":"JMIR AI","volume":"3 ","pages":"e64593"},"PeriodicalIF":0.0,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11561426/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142549353","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JMIR AI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1