首页 > 最新文献

medRxiv - Health Informatics最新文献

英文 中文
The Use of Conversational Agents in Self-Management: A Retrospective Analysis 在自我管理中使用对话代理:回顾性分析
Pub Date : 2024-09-02 DOI: 10.1101/2024.09.01.24312881
Selahattin Colakoglu, Mustafa Durmus, Zeynep Pelin Polat, Asli Yildiz, Emre Sezgin
Background Understanding user engagement with conversational agents (CAs) in mobile health apps is crucial for improving sustained usage. We analyzed CA interactions in a mobile health app to identify usage patterns and potential barriers.
背景 了解用户与移动医疗应用程序中的对话代理(CA)的互动对于提高持续使用率至关重要。我们分析了一款移动健康应用程序中的 CA 互动,以确定使用模式和潜在障碍。
{"title":"The Use of Conversational Agents in Self-Management: A Retrospective Analysis","authors":"Selahattin Colakoglu, Mustafa Durmus, Zeynep Pelin Polat, Asli Yildiz, Emre Sezgin","doi":"10.1101/2024.09.01.24312881","DOIUrl":"https://doi.org/10.1101/2024.09.01.24312881","url":null,"abstract":"<strong>Background</strong> Understanding user engagement with conversational agents (CAs) in mobile health apps is crucial for improving sustained usage. We analyzed CA interactions in a mobile health app to identify usage patterns and potential barriers.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"18 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What Constitutes High Risk for Venous Thromboembolism? Comparing Approaches to Determining an Appropriate Threshold 什么是静脉血栓栓塞症高危人群?比较确定适当阈值的方法
Pub Date : 2024-09-01 DOI: 10.1101/2024.08.30.24312871
Benjamin G Mittman, Bo Hu, Rebecca Schulte, Phuc Le, Matthew A Pappas, Aaron Hamilton, Michael B Rothberg
Background Guidelines recommend pharmacological venous thromboembolism (VTE) prophylaxis only for high-risk patients, but the probability of VTE considered “high-risk” is not specified. Our objective was to define an appropriate probability threshold (or range) for VTE risk stratification and corresponding prophylaxis in medical inpatients.
背景指南建议仅对高危患者进行静脉血栓栓塞(VTE)药物预防,但并未明确规定 "高危 "VTE 的概率。我们的目标是为内科住院患者的 VTE 风险分层和相应的预防措施确定一个合适的概率阈值(或范围)。
{"title":"What Constitutes High Risk for Venous Thromboembolism? Comparing Approaches to Determining an Appropriate Threshold","authors":"Benjamin G Mittman, Bo Hu, Rebecca Schulte, Phuc Le, Matthew A Pappas, Aaron Hamilton, Michael B Rothberg","doi":"10.1101/2024.08.30.24312871","DOIUrl":"https://doi.org/10.1101/2024.08.30.24312871","url":null,"abstract":"<strong>Background</strong> Guidelines recommend pharmacological venous thromboembolism (VTE) prophylaxis only for high-risk patients, but the probability of VTE considered “high-risk” is not specified. Our objective was to define an appropriate probability threshold (or range) for VTE risk stratification and corresponding prophylaxis in medical inpatients.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"38 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ubie Symptom Checker: A Clinical Vignette Simulation Study Ubie 症状检查器:临床小故事模拟研究
Pub Date : 2024-08-31 DOI: 10.1101/2024.08.29.24312810
N. Kenji Taylor, Takashi Nishibayashi
Background AI-driven symptom checkers (SC) are increasingly adopted in healthcare for their potential to provide users with accessible and immediate preliminary health education. These tools, powered by advanced artificial intelligence algorithms, assist patients in quickly assessing their symptoms. Previous studies using clinical vignette approaches have evaluated SC accuracy, highlighting both strengths and areas for improvement.
背景人工智能驱动的症状检查器(SC)越来越多地应用于医疗保健领域,因为它们可以为用户提供方便、即时的初步健康教育。这些工具由先进的人工智能算法驱动,可帮助患者快速评估自己的症状。以往使用临床小故事方法进行的研究对症状检查器的准确性进行了评估,强调了其优点和有待改进的地方。
{"title":"Ubie Symptom Checker: A Clinical Vignette Simulation Study","authors":"N. Kenji Taylor, Takashi Nishibayashi","doi":"10.1101/2024.08.29.24312810","DOIUrl":"https://doi.org/10.1101/2024.08.29.24312810","url":null,"abstract":"<strong>Background</strong> AI-driven symptom checkers (SC) are increasingly adopted in healthcare for their potential to provide users with accessible and immediate preliminary health education. These tools, powered by advanced artificial intelligence algorithms, assist patients in quickly assessing their symptoms. Previous studies using clinical vignette approaches have evaluated SC accuracy, highlighting both strengths and areas for improvement.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"53 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BrainGT: Multifunctional Brain Graph Transformer for Brain Disorder Diagnosis BrainGT:用于脑部疾病诊断的多功能脑图转换器
Pub Date : 2024-08-31 DOI: 10.1101/2024.08.30.24312819
Ahsan Shehzad, Shuo Yu, Dongyu Zhang, Shagufta Abid, Xinrui Cheng, Jingjing Zhou, Feng Xia
Brain networks play a crucial role in the diagnosis of brain disorders by enabling the identification of abnormal patterns and connections in brain activities. Previous studies exploit the Pearson’s correlation coefficient to construct functional brain networks from fMRI data and use graph learning to diagnose brain diseases. However, correlation-based brain networks are overly dense (often fully connected), which obscures meaningful connections and complicates subsequent analyses. This dense connectivity poses substantial performance challenges to traditional graph transformers, which are primarily designed for sparse graphs. Consequently, this results in a notable reduction in diagnostic accuracy. To address this challenging issue, we propose a multifunctional brain graph transformer model for brain disorders diagnosis, namely BrainGT, which is capable of constructing multifunctional brain networks rather than a dense brain network from fMRI data. It utilizes the fusion of self-attention and cross-attention mechanisms to learn important features within and across multiple functional brain networks. Classification (diagnosis) experiments conducted on three real fMRI datasets (i.e., ADNI, PPMI, and ABIDE) demonstrate the superiority of the proposed BrainGT over state-of-the-art methods.
脑网络能够识别大脑活动中的异常模式和连接,在诊断脑部疾病方面发挥着至关重要的作用。以往的研究利用皮尔逊相关系数从 fMRI 数据中构建脑功能网络,并利用图学习诊断脑部疾病。然而,基于相关性的大脑网络过于密集(通常是全连接),这掩盖了有意义的连接,并使后续分析复杂化。这种密集连接对传统图转换器的性能提出了巨大挑战,因为传统图转换器主要是针对稀疏图设计的。因此,这导致诊断准确性明显降低。为了解决这个具有挑战性的问题,我们提出了一种用于脑部疾病诊断的多功能脑图转换器模型,即 BrainGT,它能够从 fMRI 数据中构建多功能脑网络,而不是密集的脑网络。它利用自我注意和交叉注意机制的融合,学习多个功能脑网络内部和之间的重要特征。在三个真实的 fMRI 数据集(即 ADNI、PPMI 和 ABIDE)上进行的分类(诊断)实验证明了所提出的 BrainGT 优于最先进的方法。
{"title":"BrainGT: Multifunctional Brain Graph Transformer for Brain Disorder Diagnosis","authors":"Ahsan Shehzad, Shuo Yu, Dongyu Zhang, Shagufta Abid, Xinrui Cheng, Jingjing Zhou, Feng Xia","doi":"10.1101/2024.08.30.24312819","DOIUrl":"https://doi.org/10.1101/2024.08.30.24312819","url":null,"abstract":"Brain networks play a crucial role in the diagnosis of brain disorders by enabling the identification of abnormal patterns and connections in brain activities. Previous studies exploit the Pearson’s correlation coefficient to construct functional brain networks from fMRI data and use graph learning to diagnose brain diseases. However, correlation-based brain networks are overly dense (often fully connected), which obscures meaningful connections and complicates subsequent analyses. This dense connectivity poses substantial performance challenges to traditional graph transformers, which are primarily designed for sparse graphs. Consequently, this results in a notable reduction in diagnostic accuracy. To address this challenging issue, we propose a multifunctional brain graph transformer model for brain disorders diagnosis, namely BrainGT, which is capable of constructing multifunctional brain networks rather than a dense brain network from fMRI data. It utilizes the fusion of self-attention and cross-attention mechanisms to learn important features within and across multiple functional brain networks. Classification (diagnosis) experiments conducted on three real fMRI datasets (i.e., ADNI, PPMI, and ABIDE) demonstrate the superiority of the proposed BrainGT over state-of-the-art methods.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"58 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211504","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Studying Veteran food insecurity longitudinally using electronic health record data and natural language processing 利用电子健康记录数据和自然语言处理技术纵向研究退伍军人的粮食不安全问题
Pub Date : 2024-08-31 DOI: 10.1101/2024.08.30.24312861
Alec B. Chapman, Talia Panadero, Rachel Dalrymple, Alicia Cohen, Nipa Kamdar, Farhana Pethani, Andrea Kalvesmaki, Richard E. Nelson, Jorie Butler
Food insecurity is an important social risk factor that is directly linked to patient health and well-being. The Department of Veterans Affairs (VA) aims to identify and resolve food insecurity through social and clinical interventions. However, evaluating the impact of such interventions is made challenging by the lack of follow-up data on Veteran food insecurity status. One potential solution is to leverage documentation of food insecurity in electronic health records (EHRs). In this paper, we developed and validated a natural language processing system to identify food insecurity status from clinical notes and applied it to study longitudinal trajectories of food insecurity among a large cohort of food insecure Veterans. Our analyses provide insight into the timing and persistence of Veteran food insecurity; in the future, our methods will be used to evaluate food insecurity interventions and evaluate VA policy.
粮食不安全是一个重要的社会风险因素,与病人的健康和福祉直接相关。退伍军人事务部(VA)旨在通过社会和临床干预措施来识别和解决粮食不安全问题。然而,由于缺乏退伍军人粮食不安全状况的后续数据,评估此类干预措施的影响变得十分困难。一个潜在的解决方案是利用电子健康记录 (EHR) 中的食物不安全记录。在本文中,我们开发并验证了一种自然语言处理系统,用于从临床记录中识别粮食不安全状况,并将其应用于研究一大批粮食不安全退伍军人的粮食不安全纵向轨迹。我们的分析深入揭示了退伍军人食物不安全的时间和持续性;将来,我们的方法将用于评估食物不安全干预措施和评估退伍军人事务部的政策。
{"title":"Studying Veteran food insecurity longitudinally using electronic health record data and natural language processing","authors":"Alec B. Chapman, Talia Panadero, Rachel Dalrymple, Alicia Cohen, Nipa Kamdar, Farhana Pethani, Andrea Kalvesmaki, Richard E. Nelson, Jorie Butler","doi":"10.1101/2024.08.30.24312861","DOIUrl":"https://doi.org/10.1101/2024.08.30.24312861","url":null,"abstract":"Food insecurity is an important social risk factor that is directly linked to patient health and well-being. The Department of Veterans Affairs (VA) aims to identify and resolve food insecurity through social and clinical interventions. However, evaluating the impact of such interventions is made challenging by the lack of follow-up data on Veteran food insecurity status. One potential solution is to leverage documentation of food insecurity in electronic health records (EHRs). In this paper, we developed and validated a natural language processing system to identify food insecurity status from clinical notes and applied it to study longitudinal trajectories of food insecurity among a large cohort of food insecure Veterans. Our analyses provide insight into the timing and persistence of Veteran food insecurity; in the future, our methods will be used to evaluate food insecurity interventions and evaluate VA policy.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"63 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Open Access Data Repository and Common Data Model for Pulse Oximeter Performance Data 脉搏氧饱和度仪性能数据的开放存取数据存储库和通用数据模型
Pub Date : 2024-08-31 DOI: 10.1101/2024.08.30.24312744
Nicholas Fong, Michael S. Lipnick, Ella Behnke, Yu Chou, Seif Elmankabadi, Lily Ortiz, Christopher S. Almond, Isabella Auchus, Garrett W. Burnett, Ronald Bisegerwa, Desireé R Conrad, Carolyn M. Hendrickson, Shubhada Hooli, Robert Kopotic, Gregory Leeb, Daniel Martin, Eric D. McCollum, Ellis P. Monk, Kelvin L. Moore, Leonid Shmuylovich, J. Brady Scott, An-Kwok Ian Wong, Tianyue Zhou, Romain Pirracchio, Philip E. Bickler, John Feiner, Tyler Law
The OpenOximetry Repository is a structured database storing clinical and lab pulse oximetry data, serving as a centralized repository and data model for pulse oximetry initiatives. It supports measurements of arterial oxygen saturation (SaO2) by arterial blood gas co-oximetry and pulse oximetry (SpO2), alongside processed and unprocessed photoplethysmography (PPG) data and other metadata. This includes skin color measurements, finger diameter, vital signs (e.g., arterial blood pressure, end-tidal carbon dioxide), and arterial blood gas parameters (e.g., acid-base balance, hemoglobin concentration).
OpenOximetry Repository 是一个存储临床和实验室脉搏血氧仪数据的结构化数据库,是脉搏血氧仪计划的集中存储库和数据模型。它支持通过动脉血气协同氧饱和度和脉搏氧饱和度 (SpO2) 测量动脉血氧饱和度 (SaO2),以及已处理和未处理的光电血氧饱和度 (PPG) 数据和其他元数据。这包括皮肤颜色测量、手指直径、生命体征(如动脉血压、潮气末二氧化碳)和动脉血气参数(如酸碱平衡、血红蛋白浓度)。
{"title":"Open Access Data Repository and Common Data Model for Pulse Oximeter Performance Data","authors":"Nicholas Fong, Michael S. Lipnick, Ella Behnke, Yu Chou, Seif Elmankabadi, Lily Ortiz, Christopher S. Almond, Isabella Auchus, Garrett W. Burnett, Ronald Bisegerwa, Desireé R Conrad, Carolyn M. Hendrickson, Shubhada Hooli, Robert Kopotic, Gregory Leeb, Daniel Martin, Eric D. McCollum, Ellis P. Monk, Kelvin L. Moore, Leonid Shmuylovich, J. Brady Scott, An-Kwok Ian Wong, Tianyue Zhou, Romain Pirracchio, Philip E. Bickler, John Feiner, Tyler Law","doi":"10.1101/2024.08.30.24312744","DOIUrl":"https://doi.org/10.1101/2024.08.30.24312744","url":null,"abstract":"The OpenOximetry Repository is a structured database storing clinical and lab pulse oximetry data, serving as a centralized repository and data model for pulse oximetry initiatives. It supports measurements of arterial oxygen saturation (SaO2) by arterial blood gas co-oximetry and pulse oximetry (SpO2), alongside processed and unprocessed photoplethysmography (PPG) data and other metadata. This includes skin color measurements, finger diameter, vital signs (e.g., arterial blood pressure, end-tidal carbon dioxide), and arterial blood gas parameters (e.g., acid-base balance, hemoglobin concentration).","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"127 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211538","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Natural language processing to evaluate texting conversations between patients and healthcare providers during COVID-19 Home-Based Care in Rwanda at scale 利用自然语言处理技术评估卢旺达 COVID-19 家庭护理期间患者与医疗服务提供者之间的大规模短信对话
Pub Date : 2024-08-31 DOI: 10.1101/2024.08.30.24312636
Richard T Lester, Matthew Manson, Muhammed Semakula, Hyeju Jang, Hassan Mugabo, Ali Magzari, Junhong Ma Blackmer, Fanan Fattah, Simon Pierre Niyonsenga, Edson Rwagasore, Charles Ruranga, Eric Remera, Jean Claude S. Ngabonziza, Giuseppe Carenini, Sabin Nsanzimana
Isolation of patients with communicable infectious diseases limits spread of pathogens but can be difficult to manage outside hospitals. Rwanda deployed a digital health service nationally to assist public health clinicians to remotely monitor and support SARS-CoV-2 cases via their mobile phones using daily interactive short message service (SMS) check-ins. We aimed to assess the texting patterns and communicated topics to understand patient experiences. We extracted data on all COVID-19 cases and exposed contacts who were enrolled in the WelTel text messaging program between March 18, 2020, and March 31, 2022, and linked demographic and clinical data from the national COVID-19 registry. A sample of the text conversation corpus was English-translated and labeled with topics of interest defined by medical experts. Multiple natural language processing (NLP) topic classification models were trained and compared using F1 scores. Best performing models were applied to classify unlabeled conversations. Total 33,081 isolated patients (mean age 33·9, range 0-100), 44% female, including 30,398 cases and 2,683 contacts) were registered in WelTel. Registered patients generated 12,119 interactive text conversations in Kinyarwanda (n=8,183, 67%), English (n=3,069, 25%) and other languages. Sufficiently trained large language models (LLMs) were unavailable for Kinyarwanda. Traditional machine learning (ML) models outperformed fine-tuned transformer architecture language models on the native untranslated language corpus, however, the reverse was observed of models trained on English-only data. The most frequently identified topics discussed included symptoms (69%), diagnostics (38%), social issues (19%), prevention (18%), healthcare logistics (16%), and treatment (8·5%). Education, advice, and triage on these topics were provided to patients. Interactive text messaging can be used to remotely support isolated patients in pandemics at scale. NLP can help evaluate the medical and social factors that affect isolated patients which could ultimately inform precision public health responses to future pandemics.
隔离传染性疾病患者可以限制病原体的传播,但在医院外却很难管理。卢旺达在全国范围内部署了一项数字医疗服务,以协助公共卫生临床医生通过手机使用每日互动短信服务(SMS)签到对 SARS-CoV-2 病例进行远程监控和支持。我们旨在评估短信模式和交流主题,以了解患者的经历。我们提取了 2020 年 3 月 18 日至 2022 年 3 月 31 日期间加入 WelTel 短信项目的所有 COVID-19 病例和接触者的数据,并将全国 COVID-19 登记处的人口统计学和临床数据联系起来。文本对话语料库的样本经过英语翻译,并标注了医学专家定义的相关主题。对多个自然语言处理(NLP)主题分类模型进行了训练,并使用 F1 分数进行比较。表现最好的模型被用于对未标记的对话进行分类。WelTel 共登记了 33,081 名孤立患者(平均年龄 33-9,范围 0-100),其中 44% 为女性,包括 30,398 个病例和 2,683 个联系人。已登记的患者以基尼亚卢旺达语(8183 人,占 67%)、英语(3069 人,占 25%)和其他语言进行了 12119 次互动文本对话。基尼亚卢旺达语没有经过充分训练的大型语言模型(LLM)。在本地未翻译语言语料库中,传统机器学习(ML)模型的表现优于微调转换器架构语言模型,但在纯英语数据中训练的模型则相反。最常见的讨论主题包括症状(69%)、诊断(38%)、社会问题(19%)、预防(18%)、医疗物流(16%)和治疗(8-5%)。就这些主题向患者提供了教育、建议和分流服务。互动短信可用于大规模远程支持大流行病中与世隔绝的患者。NLP 可以帮助评估影响被隔离患者的医疗和社会因素,最终为未来大流行病的精确公共卫生应对措施提供信息。
{"title":"Natural language processing to evaluate texting conversations between patients and healthcare providers during COVID-19 Home-Based Care in Rwanda at scale","authors":"Richard T Lester, Matthew Manson, Muhammed Semakula, Hyeju Jang, Hassan Mugabo, Ali Magzari, Junhong Ma Blackmer, Fanan Fattah, Simon Pierre Niyonsenga, Edson Rwagasore, Charles Ruranga, Eric Remera, Jean Claude S. Ngabonziza, Giuseppe Carenini, Sabin Nsanzimana","doi":"10.1101/2024.08.30.24312636","DOIUrl":"https://doi.org/10.1101/2024.08.30.24312636","url":null,"abstract":"Isolation of patients with communicable infectious diseases limits spread of pathogens but can be difficult to manage outside hospitals. Rwanda deployed a digital health service nationally to assist public health clinicians to remotely monitor and support SARS-CoV-2 cases via their mobile phones using daily interactive short message service (SMS) check-ins. We aimed to assess the texting patterns and communicated topics to understand patient experiences. We extracted data on all COVID-19 cases and exposed contacts who were enrolled in the WelTel text messaging program between March 18, 2020, and March 31, 2022, and linked demographic and clinical data from the national COVID-19 registry. A sample of the text conversation corpus was English-translated and labeled with topics of interest defined by medical experts. Multiple natural language processing (NLP) topic classification models were trained and compared using F1 scores. Best performing models were applied to classify unlabeled conversations. Total 33,081 isolated patients (mean age 33·9, range 0-100), 44% female, including 30,398 cases and 2,683 contacts) were registered in WelTel. Registered patients generated 12,119 interactive text conversations in Kinyarwanda (n=8,183, 67%), English (n=3,069, 25%) and other languages. Sufficiently trained large language models (LLMs) were unavailable for Kinyarwanda. Traditional machine learning (ML) models outperformed fine-tuned transformer architecture language models on the native untranslated language corpus, however, the reverse was observed of models trained on English-only data. The most frequently identified topics discussed included symptoms (69%), diagnostics (38%), social issues (19%), prevention (18%), healthcare logistics (16%), and treatment (8·5%). Education, advice, and triage on these topics were provided to patients. Interactive text messaging can be used to remotely support isolated patients in pandemics at scale. NLP can help evaluate the medical and social factors that affect isolated patients which could ultimately inform precision public health responses to future pandemics.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"26 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LSD600: the first corpus of biomedical abstracts annotated with lifestyle–disease relations LSD600:首个注释了生活方式与疾病关系的生物医学摘要语料库
Pub Date : 2024-08-31 DOI: 10.1101/2024.08.30.24312862
Esmaeil Nourani, Evangelia-Mantelena Makri, Xiqing Mao, Sampo Pyysalo, Søren Brunak, Katerina Nastou, Lars Juhl Jensen
Lifestyle factors (LSFs) are increasingly recognized as instrumental in both the development and control of diseases. Despite their importance, there is a lack of methods to extract relations between LSFs and diseases from the literature, a step necessary to consolidate the currently available knowledge into a structured form. As simple co-occurrence-based relation extraction (RE) approaches are unable to distinguish between the different types of LSF-disease relations, context-aware transformer-based models are required to extract and classify these relations into specific relation types. No comprehensive LSF–disease RE system existed, primarily due to the lack of a suitable corpus for developing it. We present LSD600, the first corpus specifically designed for LSF-disease RE, comprising 600 abstracts with 1900 relations of eight distinct types between 5,027 diseases and 6,930 LSF entities. We evaluated LSD600’s quality by training a RoBERTa model on the corpus, achieving an F-score of 68.5% for the multi-label RE task on the held-out test set. We further validated LSD600 by using the trained model on the two Nutrition-Disease and FoodDisease datasets, where it achieved F-scores of 70.7% and 80.7%, respectively. Building on these performance results, LSD600 and the RE system trained on it can be valuable resources to fill the existing gap in this area and pave the way for downstream applications.
人们日益认识到,生活方式因素(LSFs)在疾病的发生和控制中起着重要作用。尽管生活方式因素非常重要,但目前还缺乏从文献中提取生活方式因素与疾病之间关系的方法,而这是将现有知识整合成结构化形式的必要步骤。由于简单的基于共现的关系提取(RE)方法无法区分 LSF-疾病关系的不同类型,因此需要基于上下文感知转换器的模型来提取这些关系并将其分类为特定的关系类型。目前还没有全面的 LSF-疾病 RE 系统,主要原因是缺乏合适的语料库来开发该系统。我们提出了 LSD600,这是第一个专门为 LSF-疾病 RE 设计的语料库,由 600 个摘要组成,包含 5,027 种疾病和 6,930 个 LSF 实体之间八种不同类型的 1900 种关系。我们在该语料库上训练了一个 RoBERTa 模型,对 LSD600 的质量进行了评估,在测试集上的多标签 RE 任务中取得了 68.5% 的 F-score。我们还在营养疾病和食品疾病两个数据集上使用训练好的模型进一步验证了 LSD600,其 F 分数分别达到了 70.7% 和 80.7%。在这些性能结果的基础上,LSD600 及其训练的 RE 系统可以成为填补该领域现有空白的宝贵资源,并为下游应用铺平道路。
{"title":"LSD600: the first corpus of biomedical abstracts annotated with lifestyle–disease relations","authors":"Esmaeil Nourani, Evangelia-Mantelena Makri, Xiqing Mao, Sampo Pyysalo, Søren Brunak, Katerina Nastou, Lars Juhl Jensen","doi":"10.1101/2024.08.30.24312862","DOIUrl":"https://doi.org/10.1101/2024.08.30.24312862","url":null,"abstract":"Lifestyle factors (LSFs) are increasingly recognized as instrumental in both the development and control of diseases. Despite their importance, there is a lack of methods to extract relations between LSFs and diseases from the literature, a step necessary to consolidate the currently available knowledge into a structured form. As simple co-occurrence-based relation extraction (RE) approaches are unable to distinguish between the different types of LSF-disease relations, context-aware transformer-based models are required to extract and classify these relations into specific relation types. No comprehensive LSF–disease RE system existed, primarily due to the lack of a suitable corpus for developing it. We present LSD600, the first corpus specifically designed for LSF-disease RE, comprising 600 abstracts with 1900 relations of eight distinct types between 5,027 diseases and 6,930 LSF entities. We evaluated LSD600’s quality by training a RoBERTa model on the corpus, achieving an F-score of 68.5% for the multi-label RE task on the held-out test set. We further validated LSD600 by using the trained model on the two Nutrition-Disease and FoodDisease datasets, where it achieved F-scores of 70.7% and 80.7%, respectively. Building on these performance results, LSD600 and the RE system trained on it can be valuable resources to fill the existing gap in this area and pave the way for downstream applications.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"20 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical refinement of case vignettes for digital health research 对数字健康研究案例进行统计改进
Pub Date : 2024-08-30 DOI: 10.1101/2024.08.30.24312824
Marvin Kopka, Markus A. Feufel
Digital health research often relies on case vignettes (descriptions of fictitious or real patients) to navigate ethical and practical challenges. Despite their utility, the quality and lack of standardization of these vignettes has often been criticized, especially in studies on symptom-assessment applications (SAAs) and triage decision-making. To address this, our paper introduces a method to refine an existing set of vignettes, drawing on principles from classical test theory. First, we removed any vignette with an item difficulty of zero and an item-total correlation below zero. Second, we stratified the remaining vignettes to reflect the natural base rates of symptoms that SAAs are typically approached with, selecting those vignettes with the highest item-total correlation in each quota. Although this two-step procedure reduced the size of the original vignette set by 40%, comparing triage performance on the reduced and the original vignette sets, we found a strong correlation (r = 0.747 to r = 0.997, p < .001). This indicates that using our refinement method helps identifying vignettes with high predictive power of an agent’s triage performance while simultaneously increasing cost-efficiency of vignette-based evaluation studies. This might ultimately lead to higher research quality and more reliable results.
数字健康研究通常依赖病例小故事(对虚构或真实患者的描述)来应对伦理和实际挑战。尽管这些小故事很有用,但其质量和缺乏标准化的问题经常受到批评,尤其是在症状评估应用(SAA)和分诊决策研究中。为了解决这个问题,我们的论文借鉴了经典测试理论的原则,介绍了一种完善现有小故事集的方法。首先,我们删除了所有项目难度为零、项目总相关性低于零的小测验。其次,我们对剩余的小题进行分层,以反映自闭症患者通常会出现的症状的自然基数,并在每个配额中选择项目-总相关性最高的小题。尽管这两步程序将原始小节集的规模缩小了 40%,但比较缩小后的小节集和原始小节集的分流效果,我们发现两者之间存在很强的相关性(r = 0.747 到 r = 0.997,p <.001)。这表明,使用我们的细化方法有助于识别对代理的分流性能具有较高预测能力的小插图,同时提高基于小插图的评估研究的成本效益。这最终可能会带来更高的研究质量和更可靠的结果。
{"title":"Statistical refinement of case vignettes for digital health research","authors":"Marvin Kopka, Markus A. Feufel","doi":"10.1101/2024.08.30.24312824","DOIUrl":"https://doi.org/10.1101/2024.08.30.24312824","url":null,"abstract":"Digital health research often relies on case vignettes (descriptions of fictitious or real patients) to navigate ethical and practical challenges. Despite their utility, the quality and lack of standardization of these vignettes has often been criticized, especially in studies on symptom-assessment applications (SAAs) and triage decision-making. To address this, our paper introduces a method to refine an existing set of vignettes, drawing on principles from classical test theory. First, we removed any vignette with an item difficulty of zero and an item-total correlation below zero. Second, we stratified the remaining vignettes to reflect the natural base rates of symptoms that SAAs are typically approached with, selecting those vignettes with the highest item-total correlation in each quota. Although this two-step procedure reduced the size of the original vignette set by 40%, comparing triage performance on the reduced and the original vignette sets, we found a strong correlation (r = 0.747 to r = 0.997, p &lt; .001). This indicates that using our refinement method helps identifying vignettes with high predictive power of an agent’s triage performance while simultaneously increasing cost-efficiency of vignette-based evaluation studies. This might ultimately lead to higher research quality and more reliable results.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Perceptions about the Use of Virtual Assistants for Seeking Health Information among Caregivers of Young Childhood Cancer Survivors 儿童癌症年轻幸存者的照顾者对使用虚拟助手寻求健康信息的看法
Pub Date : 2024-08-29 DOI: 10.1101/2024.08.28.24312737
Emre Sezgin, Daniel I. Jackson, Kate Kaufman, Micah Skeens, Cynthia A. Gerhardt, Emily L. Moscato
Purpose This study examined the perceptions of caregivers of young childhood cancer survivors (YCCS) regarding the use of virtual assistant (VA) technology for health information seeking and care management. The study aim was to understand how VAs can support caregivers, especially those from underserved communities, in navigating health information related to cancer survivorship.
目的 本研究调查了儿童癌症年轻幸存者(YCCS)的照顾者对使用虚拟助手(VA)技术寻求健康信息和进行护理管理的看法。研究目的是了解虚拟助理如何支持护理人员,尤其是那些来自服务不足社区的护理人员,浏览与癌症幸存者相关的健康信息。
{"title":"Perceptions about the Use of Virtual Assistants for Seeking Health Information among Caregivers of Young Childhood Cancer Survivors","authors":"Emre Sezgin, Daniel I. Jackson, Kate Kaufman, Micah Skeens, Cynthia A. Gerhardt, Emily L. Moscato","doi":"10.1101/2024.08.28.24312737","DOIUrl":"https://doi.org/10.1101/2024.08.28.24312737","url":null,"abstract":"<strong>Purpose</strong> This study examined the perceptions of caregivers of young childhood cancer survivors (YCCS) regarding the use of virtual assistant (VA) technology for health information seeking and care management. The study aim was to understand how VAs can support caregivers, especially those from underserved communities, in navigating health information related to cancer survivorship.","PeriodicalId":501454,"journal":{"name":"medRxiv - Health Informatics","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142211510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
medRxiv - Health Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1