首页 > 最新文献

medRxiv - Health Informatics最新文献

英文 中文
Dynamic Graph Transformer for Brain Disorder Diagnosis 用于脑部疾病诊断的动态图变换器
Pub Date : 2024-09-06 DOI: 10.1101/2024.09.05.24313048
Ahsan Shehzad, Dongyu Zhang, Shuo Yu, Shagufta Abid, Feng Xia
Dynamic brain networks are crucial for diagnosing brain disorders, as they reveal changes in brain activity and connectivity over time. Previous methods exploit the sliding window approach on fMRI data to construct these networks. However, this approach encounters two major issues: fixed temporal length, which inadequately captures the temporal dynamics of brain activity, and global spatial scope, which introduces noise and reduces sensitivity to localized dysfunctions when applied across the entire brain. These issues can lead to inaccurate brain network representations, potentially resulting in misdiagnosis. To overcome these challenges, we propose BrainDGT, a dynamic Graph Transformer model that adaptively captures and analyzes modular brain activities for improved diagnosis of brain disorders. BrainDGT addresses the fixed temporal length issue by estimating adaptive brain states through deconvolution of the Hemodynamic Response Function (HRF), avoiding the constraints of fixed-size windows. It also addresses the global spatial scope issue by segmenting fMRI scans into functional modules based on established brain networks for detailed, module-specific analysis. The model employs a dual attention mechanism: graph-attention extracts structural features from dynamic brain network snapshots, while self-attention identifies significant temporal dependencies. These spatio-temporal features are adaptively fused into a unified representation for disorder classification. BrainDGT’s effectiveness is validated through classification experiments on three real fMRI datasets ADNI, PPMI, and ABIDE demonstrating superior performance compared to state-of-the-art methods. BrainDGT improves brain disorder diagnosis by offering adaptive, localized analysis of dynamic brain networks, advancing neuroimaging and enabling more precise treatments in biomedical research.
动态大脑网络对于诊断脑部疾病至关重要,因为它们揭示了大脑活动和连接性随时间的变化。以往的方法利用 fMRI 数据的滑动窗口方法来构建这些网络。然而,这种方法存在两个主要问题:一是固定的时间长度,无法充分捕捉大脑活动的时间动态;二是全局空间范围,在应用于整个大脑时会引入噪音,降低对局部功能障碍的敏感性。这些问题会导致大脑网络表征不准确,从而可能造成误诊。为了克服这些挑战,我们提出了 BrainDGT 模型,它是一种动态图形变换器模型,能自适应地捕捉和分析模块化的大脑活动,从而改进对大脑疾病的诊断。BrainDGT 通过对血液动力学响应函数(HRF)的解卷积来估计自适应的大脑状态,避免了固定大小窗口的限制,从而解决了固定时间长度的问题。它还根据已建立的大脑网络将 fMRI 扫描分割成功能模块,以进行详细的特定模块分析,从而解决了全局空间范围问题。该模型采用了双重注意机制:图注意从动态脑网络快照中提取结构特征,而自我注意则识别重要的时间依赖关系。这些时空特征被自适应地融合到一个统一的表征中,用于失调分类。通过对三个真实的 fMRI 数据集 ADNI、PPMI 和 ABIDE 的分类实验,BrainDGT 的有效性得到了验证,与最先进的方法相比,BrainDGT 的性能更加卓越。BrainDGT 通过对动态脑网络进行自适应的局部分析,改进了脑部疾病诊断,推动了神经成像技术的发展,使生物医学研究中的治疗更加精确。
{"title":"Dynamic Graph Transformer for Brain Disorder Diagnosis","authors":"Ahsan Shehzad, Dongyu Zhang, Shuo Yu, Shagufta Abid, Feng Xia","doi":"10.1101/2024.09.05.24313048","DOIUrl":"https://doi.org/10.1101/2024.09.05.24313048","url":null,"abstract":"Dynamic brain networks are crucial for diagnosing brain disorders, as they reveal changes in brain activity and connectivity over time. Previous methods exploit the sliding window approach on fMRI data to construct these networks. However, this approach encounters two major issues: fixed temporal length, which inadequately captures the temporal dynamics of brain activity, and global spatial scope, which introduces noise and reduces sensitivity to localized dysfunctions when applied across the entire brain. These issues can lead to inaccurate brain network representations, potentially resulting in misdiagnosis. To overcome these challenges, we propose BrainDGT, a dynamic Graph Transformer model that adaptively captures and analyzes modular brain activities for improved diagnosis of brain disorders. BrainDGT addresses the fixed temporal length issue by estimating adaptive brain states through deconvolution of the Hemodynamic Response Function (HRF), avoiding the constraints of fixed-size windows. It also addresses the global spatial scope issue by segmenting fMRI scans into functional modules based on established brain networks for detailed, module-specific analysis. The model employs a dual attention mechanism: graph-attention extracts structural features from dynamic brain network snapshots, while self-attention identifies significant temporal dependencies. These spatio-temporal features are adaptively fused into a unified representation for disorder classification. BrainDGT’s effectiveness is validated through classification experiments on three real fMRI datasets ADNI, PPMI, and ABIDE demonstrating superior performance compared to state-of-the-art methods. BrainDGT improves brain disorder diagnosis by offering adaptive, localized analysis of dynamic brain networks, advancing neuroimaging and enabling more precise treatments in biomedical research.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and initial evaluation of a conversational agent for Alzheimer’s disease 开发和初步评估阿尔茨海默病对话代理程序
Pub Date : 2024-09-06 DOI: 10.1101/2024.09.04.24312955
Natalia Castano-Villegas, Isabella Llano, Maria Camila Villa, Julian Martinez, Jose Zea, Tatiana Urrea, Alejandra Maria Bañol, Carlos Bohorquez, Nelson Martinez
Background Conversational Agents have attracted attention for personal and professional use. Their specialisation in the medical field is being explored. Conversational Agents (CA) have accomplished passing-level performance in medical school examinations and shown empathy when responding to patient questions. Alzheimer’s disease is characterized by the progression of cognitive and somatic decline. As the leading cause of dementia in the elderly, it is the subject of continuous investigations, which result in a constant stream of new information. Physicians are expected to keep up with the latest clinical guidelines; however, they aren’t always able to do so due to the large amount of information and their busy schedules.
背景 对话式人工智能在个人和专业领域的应用备受关注。人们正在探索它们在医疗领域的专业应用。会话代理(CA)在医学院考试中取得了及格水平的成绩,并在回答病人问题时表现出同情心。阿尔茨海默病的特点是认知能力和躯体机能不断衰退。作为老年人痴呆症的主要病因,它是持续调查的主题,从而产生了源源不断的新信息。医生应该跟上最新的临床指南,但由于信息量大,工作繁忙,他们并非总能做到这一点。
{"title":"Development and initial evaluation of a conversational agent for Alzheimer’s disease","authors":"Natalia Castano-Villegas, Isabella Llano, Maria Camila Villa, Julian Martinez, Jose Zea, Tatiana Urrea, Alejandra Maria Bañol, Carlos Bohorquez, Nelson Martinez","doi":"10.1101/2024.09.04.24312955","DOIUrl":"https://doi.org/10.1101/2024.09.04.24312955","url":null,"abstract":"<strong>Background</strong> Conversational Agents have attracted attention for personal and professional use. Their specialisation in the medical field is being explored. Conversational Agents (CA) have accomplished passing-level performance in medical school examinations and shown empathy when responding to patient questions. Alzheimer’s disease is characterized by the progression of cognitive and somatic decline. As the leading cause of dementia in the elderly, it is the subject of continuous investigations, which result in a constant stream of new information. Physicians are expected to keep up with the latest clinical guidelines; however, they aren’t always able to do so due to the large amount of information and their busy schedules.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"46 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Common Longitudinal Intensive Care Unit data Format (CLIF) to enable multi-institutional federated critical illness research 通用纵向重症监护室数据格式(CLIF),实现多机构联合重症研究
Pub Date : 2024-09-04 DOI: 10.1101/2024.09.04.24313058
Juan C. Rojas, Patrick G. Lyons, Kaveri Chhikara, Vaishvik Chaudhari, Sivasubramanium V. Bhavani, Muna Nour, Kevin G. Buell, Kevin D. Smith, Catherine A. Gao, Saki Amagai, Chengsheng Mao, Yuan Luo, Anna K Barker, Mark Nuppnau, Haley Beck, Rachel Baccile, Michael Hermsen, Zewei Liao, Brenna Park-Egan, Kyle A Carey, XuanHan, Chad H Hochberg, Nicholas E Ingraham, William F Parker
Background Critical illness, or acute organ failure requiring life support, threatens over five million American lives annually. Electronic health record (EHR) data are a source of granular information that could generate crucial insights into the nature and optimal treatment of critical illness. However, data management, security, and standardization are barriers to large-scale critical illness EHR studies.
背景 危重病或需要生命支持的急性器官衰竭每年威胁着 500 多万美国人的生命。电子健康记录(EHR)数据是细粒度信息的来源,可为了解危重病的性质和最佳治疗方法提供重要依据。然而,数据管理、安全性和标准化是大规模危重病电子病历研究的障碍。
{"title":"A Common Longitudinal Intensive Care Unit data Format (CLIF) to enable multi-institutional federated critical illness research","authors":"Juan C. Rojas, Patrick G. Lyons, Kaveri Chhikara, Vaishvik Chaudhari, Sivasubramanium V. Bhavani, Muna Nour, Kevin G. Buell, Kevin D. Smith, Catherine A. Gao, Saki Amagai, Chengsheng Mao, Yuan Luo, Anna K Barker, Mark Nuppnau, Haley Beck, Rachel Baccile, Michael Hermsen, Zewei Liao, Brenna Park-Egan, Kyle A Carey, XuanHan, Chad H Hochberg, Nicholas E Ingraham, William F Parker","doi":"10.1101/2024.09.04.24313058","DOIUrl":"https://doi.org/10.1101/2024.09.04.24313058","url":null,"abstract":"<strong>Background</strong> Critical illness, or acute organ failure requiring life support, threatens over five million American lives annually. Electronic health record (EHR) data are a source of granular information that could generate crucial insights into the nature and optimal treatment of critical illness. However, data management, security, and standardization are barriers to large-scale critical illness EHR studies.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"40 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Similar performance of 8 machine learning models on 71 censored medical datasets: a case for simplicity 8 种机器学习模型在 71 个删减医学数据集上的相似表现:简洁性案例
Pub Date : 2024-09-04 DOI: 10.1101/2024.09.03.24312994
Louis Rebaud, Nicolò Capobianco, Nicolas Captier, Thibault Escobar, Bruce Spottiswoode, Irène Buvat
In the analysis of medical data with censored outcomes, identifying the optimal machine learning pipeline is a challenging task, often requiring extensive preprocessing, feature selection, model testing, and tuning. To investigate the impact of the choice of pipeline on prediction performance, we evaluated 9 machine learning models on 71 medical datasets with censored targets. Only the decision tree model was consistently underperforming, while the other 8 models performed similarly across datasets, with little to no improvement from preprocessing optimization and hyperparameter tuning. Interestingly, more complex models did not outperform simpler ones, and reciprocally. ICARE, a straightforward model univariately learning only the sign of each feature instead of a weight, demonstrated similar performance to other models across most datasets while exhibiting lower overfitting, particularly in high-dimensional datasets. These findings suggest that using the ICARE model to build signatures between centers could improve reproducibility. Our findings also challenge the traditional approach of extensive model testing and tuning to improve performance.
在分析有删减结果的医疗数据时,确定最佳的机器学习管道是一项具有挑战性的任务,通常需要进行大量的预处理、特征选择、模型测试和调整。为了研究选择管道对预测性能的影响,我们在 71 个有删减目标的医疗数据集上评估了 9 种机器学习模型。只有决策树模型一直表现不佳,而其他 8 个模型在不同数据集上的表现类似,预处理优化和超参数调整几乎没有改善。有趣的是,更复杂的模型并没有优于更简单的模型,而且是相反。ICARE 是一种只学习每个特征的符号而不是权重的简单模型,它在大多数数据集上的表现与其他模型相似,但过拟合程度较低,尤其是在高维数据集上。这些研究结果表明,使用 ICARE 模型建立中心间的特征可以提高可重复性。我们的研究结果还挑战了通过大量模型测试和调整来提高性能的传统方法。
{"title":"Similar performance of 8 machine learning models on 71 censored medical datasets: a case for simplicity","authors":"Louis Rebaud, Nicolò Capobianco, Nicolas Captier, Thibault Escobar, Bruce Spottiswoode, Irène Buvat","doi":"10.1101/2024.09.03.24312994","DOIUrl":"https://doi.org/10.1101/2024.09.03.24312994","url":null,"abstract":"In the analysis of medical data with censored outcomes, identifying the optimal machine learning pipeline is a challenging task, often requiring extensive preprocessing, feature selection, model testing, and tuning. To investigate the impact of the choice of pipeline on prediction performance, we evaluated 9 machine learning models on 71 medical datasets with censored targets. Only the decision tree model was consistently underperforming, while the other 8 models performed similarly across datasets, with little to no improvement from preprocessing optimization and hyperparameter tuning. Interestingly, more complex models did not outperform simpler ones, and reciprocally. ICARE, a straightforward model univariately learning only the sign of each feature instead of a weight, demonstrated similar performance to other models across most datasets while exhibiting lower overfitting, particularly in high-dimensional datasets. These findings suggest that using the ICARE model to build signatures between centers could improve reproducibility. Our findings also challenge the traditional approach of extensive model testing and tuning to improve performance.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"58 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designing a computer-assisted diagnosis system for cardiomegaly detection and radiology report generation 设计用于心脏肿大检测和放射报告生成的计算机辅助诊断系统
Pub Date : 2024-09-04 DOI: 10.1101/2024.09.02.24311997
Tianhao Zhu, Kexin Xu, Wonchan Son, Kristofer Linton-Reid, Marc Boubnovski-Martell, Matt Grech-Sollars, Antoine D. Lain, Joram M. Posma
Chest X-ray (CXR) is a conventional diagnostic tool for cardiothoracic assessment, boasting a high degree of costeffectiveness and versatility. However, with an increasing number of scans to be evaluated by radiologists, they can suffer from fatigue which might impede diagnostic accuracy and slow down report generation. We describe a prototype computer-assisted diagnosis (CAD) pipeline employing computer vision (CV) and Natural Language Processing (NLP) trained on the publicly available MIMICCXR dataset. We perform image quality assessment, view labelling, segmentation-based cardiomegaly severity classification, and use the output of the severity classification for large language model-based report generation. Four certified radiologists assessed the output accuracy of the CAD pipeline. Across the dataset composed of 377,100 CXR images and 227,827 free-text radiology reports, our system identified 0.18% of cases with mixedsex mentions, 0.02% of poor quality images (F1=0.81), and 0.28% of wrongly labelled views (accuracy 99.4%), furthermore it assigned views for 4.18% of images which have unlabelled views. For binary cardiomegaly classification, we achieve state-of-the-art performance of 95.2% accuracy. The inter-radiologist agreement on evaluating the report’s semantics and correctness for radiologistMIMIC is 0.62 (strict agreement) and 0.85 (relaxed agreement) similar to the radiologist-CAD agreement of 0.55 (strict) and 0.93 (relaxed). Our work found and corrected several incorrect or missing metadata annotations for the MIMIC-CXR dataset, and the performance of our CAD system suggests performance on par with human radiologists. Future improvements revolve around improved text generation and the development of CV tools for other diseases.
胸部 X 光(CXR)是心胸评估的传统诊断工具,具有成本效益高、用途广泛等优点。然而,随着放射科医生需要评估的扫描数量越来越多,他们可能会感到疲劳,这可能会影响诊断的准确性,并减慢报告生成的速度。我们介绍了一种计算机辅助诊断 (CAD) 管道原型,它采用计算机视觉 (CV) 和自然语言处理 (NLP),在公开可用的 MIMICCXR 数据集上进行训练。我们进行图像质量评估、视图标记、基于分割的心脏肿大严重程度分类,并将严重程度分类的输出用于基于大语言模型的报告生成。四位经过认证的放射科专家对 CAD 管道的输出准确性进行了评估。在由 377,100 张 CXR 图像和 227,827 份自由文本放射学报告组成的数据集中,我们的系统识别出了 0.18% 的混合性提及病例、0.02% 的劣质图像(F1=0.81)和 0.28% 的错误标注视图(准确率为 99.4%),此外还为 4.18% 的未标注视图的图像分配了视图。对于二元心肌肥大分类,我们的准确率达到了 95.2% 的一流水平。放射科医师间对放射科医师 MIMIC 报告语义和正确性评估的一致性为 0.62(严格一致)和 0.85(宽松一致),与放射科医师与计算机断层扫描(CAD)的一致性 0.55(严格一致)和 0.93(宽松一致)相似。我们的工作发现并纠正了 MIMIC-CXR 数据集中几个错误或缺失的元数据注释,我们的 CAD 系统的性能表明与人类放射科医生的性能相当。未来的改进主要围绕改进文本生成和开发其他疾病的 CV 工具。
{"title":"Designing a computer-assisted diagnosis system for cardiomegaly detection and radiology report generation","authors":"Tianhao Zhu, Kexin Xu, Wonchan Son, Kristofer Linton-Reid, Marc Boubnovski-Martell, Matt Grech-Sollars, Antoine D. Lain, Joram M. Posma","doi":"10.1101/2024.09.02.24311997","DOIUrl":"https://doi.org/10.1101/2024.09.02.24311997","url":null,"abstract":"Chest X-ray (CXR) is a conventional diagnostic tool for cardiothoracic assessment, boasting a high degree of costeffectiveness and versatility. However, with an increasing number of scans to be evaluated by radiologists, they can suffer from fatigue which might impede diagnostic accuracy and slow down report generation. We describe a prototype computer-assisted diagnosis (CAD) pipeline employing computer vision (CV) and Natural Language Processing (NLP) trained on the publicly available MIMICCXR dataset. We perform image quality assessment, view labelling, segmentation-based cardiomegaly severity classification, and use the output of the severity classification for large language model-based report generation. Four certified radiologists assessed the output accuracy of the CAD pipeline. Across the dataset composed of 377,100 CXR images and 227,827 free-text radiology reports, our system identified 0.18% of cases with mixedsex mentions, 0.02% of poor quality images (F1=0.81), and 0.28% of wrongly labelled views (accuracy 99.4%), furthermore it assigned views for 4.18% of images which have unlabelled views. For binary cardiomegaly classification, we achieve state-of-the-art performance of 95.2% accuracy. The inter-radiologist agreement on evaluating the report’s semantics and correctness for radiologistMIMIC is 0.62 (strict agreement) and 0.85 (relaxed agreement) similar to the radiologist-CAD agreement of 0.55 (strict) and 0.93 (relaxed). Our work found and corrected several incorrect or missing metadata annotations for the MIMIC-CXR dataset, and the performance of our CAD system suggests performance on par with human radiologists. Future improvements revolve around improved text generation and the development of CV tools for other diseases.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"58 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visual-Textual Integration in LLMs for Medical Diagnosis: A Quantitative Analysis 用于医学诊断的 LLM 中的视觉-文本整合:定量分析
Pub Date : 2024-09-03 DOI: 10.1101/2024.08.31.24312878
Reem Agbareia, Mahmud Omar, Shelly Soffer, Benjamin S Glicksberg, Girish N Nadkarni, Eyal Klang
Background and Aim Visual data from images is essential for many medical diagnoses. This study evaluates the performance of multimodal Large Language Models (LLMs) in integrating textual and visual information for diagnostic purposes.
背景和目的 来自图像的视觉数据对许多医疗诊断至关重要。本研究评估了多模态大语言模型(LLM)在整合文本和视觉信息用于诊断方面的性能。
{"title":"Visual-Textual Integration in LLMs for Medical Diagnosis: A Quantitative Analysis","authors":"Reem Agbareia, Mahmud Omar, Shelly Soffer, Benjamin S Glicksberg, Girish N Nadkarni, Eyal Klang","doi":"10.1101/2024.08.31.24312878","DOIUrl":"https://doi.org/10.1101/2024.08.31.24312878","url":null,"abstract":"<strong>Background and Aim</strong> Visual data from images is essential for many medical diagnoses. This study evaluates the performance of multimodal Large Language Models (LLMs) in integrating textual and visual information for diagnostic purposes.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"119 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dengue nowcasting in Brazil by combining official surveillance data and Google Trends information 结合官方监测数据和谷歌趋势信息对巴西登革热进行预测
Pub Date : 2024-09-03 DOI: 10.1101/2024.09.02.24312934
Yang Xiao, Guilherme Soares, Leonardo Bastos, Rafael Izbicki, Paula Moraga
Dengue is a mosquito-borne viral disease that poses significant public health challenges in tropical and sub-tropical regions worldwide. Surveillance systems are essential for dengue prevention and control. However, traditional systems often rely on delayed data, limiting their effectiveness. To address this, nowcasting methods are needed to estimate underreported cases, enabling more timely decision-making. This study evaluates the value of using Google Trends indices of dengue-related keywords to complement official dengue data for nowcasting dengue in Brazil, a country frequently affected by this disease. We compare various nowcasting approaches that incorporate autoregressive features from official dengue cases, Google Trends data, and a combination of both, using a naive approach as a baseline. The performance of these methods is evaluated by nowcasting weekly dengue cases from March to June 2024 across Brazilian states. Error measures and 95% coverage probabilities reveal that models incorporating Google Trends data enhance the accuracy of weekly nowcasts across states and offer valuable insights into dengue activity levels. To support real-time decision-making, we also present Dengue Tracker, a website that displays weekly dengue nowcasts and trends to inform both decision-makers and the public, improving situational awareness of dengue activity. In conclusion, the study demonstrates the value of digital data sources in enhancing dengue nowcasting, and emphasizes the value of integrating alternative data streams into traditional surveillance systems for better-informed decision-making.
登革热是一种由蚊子传播的病毒性疾病,对全球热带和亚热带地区的公共卫生构成重大挑战。监测系统对登革热的预防和控制至关重要。然而,传统系统往往依赖于延迟数据,从而限制了其有效性。为解决这一问题,需要采用预报方法来估计漏报病例,以便更及时地做出决策。巴西是登革热疫情高发国家,本研究评估了使用谷歌登革热相关关键词趋势指数来补充官方登革热数据的价值。我们比较了纳入官方登革热病例自回归特征、Google Trends 数据以及两者结合的各种即时预测方法,并以一种简单的方法作为基线。通过对巴西各州 2024 年 3 月至 6 月的每周登革热病例进行预测,对这些方法的性能进行了评估。误差测量和 95% 的覆盖概率显示,包含谷歌趋势数据的模型提高了各州每周即时预测的准确性,并为登革热活动水平提供了有价值的见解。为了支持实时决策,我们还推出了登革热追踪网站,该网站显示每周登革热预报和趋势,为决策者和公众提供信息,提高对登革热活动的态势感知。总之,这项研究证明了数字数据源在加强登革热即时预报方面的价值,并强调了将其他数据流整合到传统监测系统中以做出更明智决策的价值。
{"title":"Dengue nowcasting in Brazil by combining official surveillance data and Google Trends information","authors":"Yang Xiao, Guilherme Soares, Leonardo Bastos, Rafael Izbicki, Paula Moraga","doi":"10.1101/2024.09.02.24312934","DOIUrl":"https://doi.org/10.1101/2024.09.02.24312934","url":null,"abstract":"Dengue is a mosquito-borne viral disease that poses significant public health challenges in tropical and sub-tropical regions worldwide. Surveillance systems are essential for dengue prevention and control. However, traditional systems often rely on delayed data, limiting their effectiveness. To address this, nowcasting methods are needed to estimate underreported cases, enabling more timely decision-making. This study evaluates the value of using Google Trends indices of dengue-related keywords to complement official dengue data for nowcasting dengue in Brazil, a country frequently affected by this disease. We compare various nowcasting approaches that incorporate autoregressive features from official dengue cases, Google Trends data, and a combination of both, using a naive approach as a baseline. The performance of these methods is evaluated by nowcasting weekly dengue cases from March to June 2024 across Brazilian states. Error measures and 95% coverage probabilities reveal that models incorporating Google Trends data enhance the accuracy of weekly nowcasts across states and offer valuable insights into dengue activity levels. To support real-time decision-making, we also present Dengue Tracker, a website that displays weekly dengue nowcasts and trends to inform both decision-makers and the public, improving situational awareness of dengue activity. In conclusion, the study demonstrates the value of digital data sources in enhancing dengue nowcasting, and emphasizes the value of integrating alternative data streams into traditional surveillance systems for better-informed decision-making.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding technology-related prescribing errors for system optimisation: the Technology-Related Error Mechanism (TREM) classification 了解与技术相关的处方错误以优化系统:与技术相关的错误机制(TREM)分类
Pub Date : 2024-09-03 DOI: 10.1101/2024.09.02.24312874
Magdalena Z. Raban, Alison Merchant, Erin Fitzpatrick, Melissa T. Baysari, Ling Li, Peter J. Gates, Johanna I. Westbrook
Objectives Technology-related prescribing errors curtail the positive impacts of computerised provider order entry (CPOE) on medication safety. Understanding how technology-related errors occur can inform CPOE optimisation. Previously, we developed a classification of the underlying mechanisms of technology-related errors using prescribing error data from two adult hospitals. Our objective was to update the classification using paediatric prescribing error data, and to assess the reliability with which reviewers could independently apply the classification.
目标 与技术相关的处方错误会削弱计算机化医嘱输入系统(CPOE)对用药安全的积极影响。了解与技术相关的错误是如何发生的,可以为 CPOE 的优化提供依据。此前,我们利用两家成人医院的处方错误数据,对技术相关错误的基本机制进行了分类。我们的目标是利用儿科处方错误数据更新该分类,并评估审查员独立应用该分类的可靠性。
{"title":"Understanding technology-related prescribing errors for system optimisation: the Technology-Related Error Mechanism (TREM) classification","authors":"Magdalena Z. Raban, Alison Merchant, Erin Fitzpatrick, Melissa T. Baysari, Ling Li, Peter J. Gates, Johanna I. Westbrook","doi":"10.1101/2024.09.02.24312874","DOIUrl":"https://doi.org/10.1101/2024.09.02.24312874","url":null,"abstract":"<strong>Objectives</strong> Technology-related prescribing errors curtail the positive impacts of computerised provider order entry (CPOE) on medication safety. Understanding how technology-related errors occur can inform CPOE optimisation. Previously, we developed a classification of the underlying mechanisms of technology-related errors using prescribing error data from two adult hospitals. Our objective was to update the classification using paediatric prescribing error data, and to assess the reliability with which reviewers could independently apply the classification.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"79 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LLM-AIx: An open source pipeline for Information Extraction from unstructured medical text based on privacy preserving Large Language Models LLM-AIx:基于隐私保护大语言模型的非结构化医学文本信息提取开源管道
Pub Date : 2024-09-03 DOI: 10.1101/2024.09.02.24312917
Isabella Catharina Wiest, Fabian Wolf, Marie-Elisabeth Leßmann, Marko van Treeck, Dyke Ferber, Jiefu Zhu, Heiko Boehme, Keno K. Bressem, Hannes Ulrich, Matthias P. Ebert, Jakob Nikolas Kather
In clinical science and practice, text data, such as clinical letters or procedure reports, is stored in an unstructured way. This type of data is not a quantifiable resource for any kind of quantitative investigations and any manual review or structured information retrieval is time-consuming and costly. The capabilities of Large Language Models (LLMs) mark a paradigm shift in natural language processing and offer new possibilities for structured Information Extraction (IE) from medical free text. This protocol describes a workflow for LLM based information extraction (LLM-AIx), enabling extraction of predefined entities from unstructured text using privacy preserving LLMs. By converting unstructured clinical text into structured data, LLM-AIx addresses a critical barrier in clinical research and practice, where the efficient extraction of information is essential for improving clinical decision-making, enhancing patient outcomes, and facilitating large-scale data analysis.
在临床科学和实践中,临床信件或手术报告等文本数据是以非结构化方式存储的。这类数据对于任何类型的定量研究来说都不是可量化的资源,而且任何人工审核或结构化信息检索都非常耗时和昂贵。大语言模型(LLM)的功能标志着自然语言处理的范式转变,为从医学自由文本中进行结构化信息提取(IE)提供了新的可能性。本协议描述了基于 LLM 的信息提取(LLM-AIx)工作流程,利用保护隐私的 LLM 从非结构化文本中提取预定义实体。通过将非结构化临床文本转换为结构化数据,LLM-AIx 解决了临床研究和实践中的一个关键障碍,即有效提取信息对于改善临床决策、提高患者疗效和促进大规模数据分析至关重要。
{"title":"LLM-AIx: An open source pipeline for Information Extraction from unstructured medical text based on privacy preserving Large Language Models","authors":"Isabella Catharina Wiest, Fabian Wolf, Marie-Elisabeth Leßmann, Marko van Treeck, Dyke Ferber, Jiefu Zhu, Heiko Boehme, Keno K. Bressem, Hannes Ulrich, Matthias P. Ebert, Jakob Nikolas Kather","doi":"10.1101/2024.09.02.24312917","DOIUrl":"https://doi.org/10.1101/2024.09.02.24312917","url":null,"abstract":"In clinical science and practice, text data, such as clinical letters or procedure reports, is stored in an unstructured way. This type of data is not a quantifiable resource for any kind of quantitative investigations and any manual review or structured information retrieval is time-consuming and costly. The capabilities of Large Language Models (LLMs) mark a paradigm shift in natural language processing and offer new possibilities for structured Information Extraction (IE) from medical free text. This protocol describes a workflow for LLM based information extraction (LLM-AIx), enabling extraction of predefined entities from unstructured text using privacy preserving LLMs. By converting unstructured clinical text into structured data, LLM-AIx addresses a critical barrier in clinical research and practice, where the efficient extraction of information is essential for improving clinical decision-making, enhancing patient outcomes, and facilitating large-scale data analysis.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"256 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multinational attitudes towards AI in healthcare and diagnostics among hospital patients 多国医院患者对医疗和诊断领域人工智能的态度
Pub Date : 2024-09-02 DOI: 10.1101/2024.09.01.24312016
Felix Busch, Lena Hoffmann, Lina Xu, Longjiang Zhang, Bin Hu, Ignacio García-Juárez, Liz N Toapanta-Yanchapaxi, Natalia Gorelik, Valérie Gorelik, Gaston A Rodriguez-Granillo, Carlos Ferrarotti, Nguyen N Cuong, Chau AP Thi, Murat Tuncel, Gürsan Kaya, Sergio M Solis-Barquero, Maria C Mendez Avila, Nevena G Ivanova, Felipe C Kitamura, Karina YI Hayama, Monserrat L Puntunet Bates, Pedro Iturralde Torres, Esteban Ortiz-Prado, Juan S Izquierdo-Condoy, Gilbert M Schwarz, Jochen G Hofstaetter, Michihiro Hide, Konagi Takeda, Barbara Perić, Gašper Pilko, Hans O Thulesius, Thomas A Lindow, Israel K Kolawole, Samuel Adegboyega Olatoke, Andrzej Grzybowski, Alexandru Corlateanu, Oana-Simina Iaconi, Ting Li, Izabela Domitrz, Katarzyna Kępczyńska, Matúš Mihalčin, Lenka Fašaneková, Tomasz Zatoński, Katarzyna Fułek, András Molnár, Stefani Maihoub, Zenewton A da Silva Gama, Luca Saba, Petros Sountoulides, Marcus R Makowski, Hugo JWL Aerts, Lisa C Adams, Keno K Bressem, COMFORT consortium
The successful implementation of artificial intelligence (AI) in healthcare is dependent upon the acceptance of this technology by key stakeholders, particularly patients, who are the primary beneficiaries of AI-driven outcomes. This international, multicenter, cross-sectional study assessed the attitudes of hospital patients towards AI in healthcare across 43 countries. A total of 13806 patients at 74 hospitals were surveyed between February and November 2023, with 64.8% from the Global North and 35.2% from the Global South. The findings indicate a predominantly favorable general view of AI in healthcare, with 57.6% of respondents expressing a positive attitude. However, attitudes exhibited notable variation based on demographic characteristics, health status, and technological literacy. Female respondents and those with poorer health status exhibited fewer positive attitudes towards AI use in medicine. Conversely, higher levels of AI knowledge and frequent use of technology devices were associated with more positive attitudes. It is noteworthy that less than half of the participants expressed positive attitudes regarding all items pertaining to trust in AI. The lowest level of trust was observed for the accuracy of AI in providing information regarding treatment responses. Patients exhibited a strong preference for explainable AI and physician-led decision-making, even if it meant slightly compromised accuracy. This large-scale, multinational study provides a comprehensive perspective on patient attitudes towards AI in healthcare across six continents. Findings suggest a need for tailored AI implementation strategies that consider patient demographics, health status, and preferences for explainable AI and physician oversight. All study data has been made publicly available to encourage replication and further investigation.
人工智能(AI)在医疗保健领域的成功应用取决于主要利益相关者对该技术的接受程度,尤其是患者,他们是人工智能成果的主要受益者。这项国际多中心横断面研究评估了 43 个国家的医院患者对医疗保健领域人工智能的态度。在 2023 年 2 月至 11 月期间,共有 74 家医院的 13806 名患者接受了调查,其中 64.8% 来自全球北方地区,35.2% 来自全球南方地区。调查结果表明,人们对医疗保健领域的人工智能普遍持赞成态度,57.6%的受访者表达了积极的态度。然而,根据人口特征、健康状况和技术素养的不同,人们的态度也表现出明显的差异。女性受访者和健康状况较差的受访者对人工智能在医疗中的应用表现出的积极态度较少。相反,人工智能知识水平较高和经常使用技术设备的受访者则表现出更积极的态度。值得注意的是,只有不到一半的参与者对有关人工智能信任度的所有项目都表示了积极的态度。在人工智能提供治疗反应信息的准确性方面,信任度最低。患者对可解释的人工智能和医生主导的决策表现出强烈的偏好,即使这意味着准确性会略微打折扣。这项大规模的跨国研究全面透视了六大洲患者对医疗保健领域人工智能的态度。研究结果表明,有必要制定量身定制的人工智能实施策略,考虑患者的人口统计、健康状况以及对可解释的人工智能和医生监督的偏好。所有研究数据均已公开,以鼓励复制和进一步调查。
{"title":"Multinational attitudes towards AI in healthcare and diagnostics among hospital patients","authors":"Felix Busch, Lena Hoffmann, Lina Xu, Longjiang Zhang, Bin Hu, Ignacio García-Juárez, Liz N Toapanta-Yanchapaxi, Natalia Gorelik, Valérie Gorelik, Gaston A Rodriguez-Granillo, Carlos Ferrarotti, Nguyen N Cuong, Chau AP Thi, Murat Tuncel, Gürsan Kaya, Sergio M Solis-Barquero, Maria C Mendez Avila, Nevena G Ivanova, Felipe C Kitamura, Karina YI Hayama, Monserrat L Puntunet Bates, Pedro Iturralde Torres, Esteban Ortiz-Prado, Juan S Izquierdo-Condoy, Gilbert M Schwarz, Jochen G Hofstaetter, Michihiro Hide, Konagi Takeda, Barbara Perić, Gašper Pilko, Hans O Thulesius, Thomas A Lindow, Israel K Kolawole, Samuel Adegboyega Olatoke, Andrzej Grzybowski, Alexandru Corlateanu, Oana-Simina Iaconi, Ting Li, Izabela Domitrz, Katarzyna Kępczyńska, Matúš Mihalčin, Lenka Fašaneková, Tomasz Zatoński, Katarzyna Fułek, András Molnár, Stefani Maihoub, Zenewton A da Silva Gama, Luca Saba, Petros Sountoulides, Marcus R Makowski, Hugo JWL Aerts, Lisa C Adams, Keno K Bressem, COMFORT consortium","doi":"10.1101/2024.09.01.24312016","DOIUrl":"https://doi.org/10.1101/2024.09.01.24312016","url":null,"abstract":"The successful implementation of artificial intelligence (AI) in healthcare is dependent upon the acceptance of this technology by key stakeholders, particularly patients, who are the primary beneficiaries of AI-driven outcomes. This international, multicenter, cross-sectional study assessed the attitudes of hospital patients towards AI in healthcare across 43 countries. A total of 13806 patients at 74 hospitals were surveyed between February and November 2023, with 64.8% from the Global North and 35.2% from the Global South. The findings indicate a predominantly favorable general view of AI in healthcare, with 57.6% of respondents expressing a positive attitude. However, attitudes exhibited notable variation based on demographic characteristics, health status, and technological literacy. Female respondents and those with poorer health status exhibited fewer positive attitudes towards AI use in medicine. Conversely, higher levels of AI knowledge and frequent use of technology devices were associated with more positive attitudes. It is noteworthy that less than half of the participants expressed positive attitudes regarding all items pertaining to trust in AI. The lowest level of trust was observed for the accuracy of AI in providing information regarding treatment responses. Patients exhibited a strong preference for explainable AI and physician-led decision-making, even if it meant slightly compromised accuracy. This large-scale, multinational study provides a comprehensive perspective on patient attitudes towards AI in healthcare across six continents. Findings suggest a need for tailored AI implementation strategies that consider patient demographics, health status, and preferences for explainable AI and physician oversight. All study data has been made publicly available to encourage replication and further investigation.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
medRxiv - Health Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1