首页 > 最新文献

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science最新文献

英文 中文
Suicide Phenotyping from Clinical Notes in Safety-Net Psychiatric Hospital Using Multi-Label Classification with Pre-Trained Language Models. 基于多标签分类和预训练语言模型的安全网精神病院临床记录自杀表型分析。
Zehan Li, Yan Hu, Scott Lane, Salih Selek, Lokesh Shahani, Rodrigo Machado-Vieira, Jair Soares, Hua Xu, Hongfang Liu, Ming Huang

Accurate identification and categorization of suicidal events can yield better suicide precautions, reducing operational burden, and improving care quality in high-acuity psychiatric settings. Pre-trained language models offer promise for identifying suicidality from unstructured clinical narratives. We evaluated the performance of four BERT-based models using two fine-tuning strategies (multiple single-label and single multi-label) for detecting coexisting suicidal events from 500 annotated psychiatric evaluation notes. The notes were labeled for suicidal ideation (SI), suicide attempts (SA), exposure to suicide (ES), and non-suicidal self-injury (NSSI). RoBERTa outperformed other models using binary relevance (acc=0.86, F1=0.78). MentalBERT (F1=0.74) also exceeded BioClinicalBERT (F1=0.72). RoBERTa fine-tuned with a single multi-label classifier further improved performance (acc=0.88, F1=0.81), highlighting that models pre-trained on domain-relevant data and the single multi-label classification strategy enhance efficiency and performance.

准确识别和分类自杀事件可以更好地预防自杀,减轻操作负担,提高高急性精神病学机构的护理质量。预先训练的语言模型有望从非结构化的临床叙述中识别自杀倾向。我们使用两种微调策略(多个单标签和单个多标签)评估了四种基于bert的模型的性能,用于从500个注释的精神病评估笔记中检测共存的自杀事件。这些笔记被标记为自杀意念(SI)、自杀企图(SA)、自杀暴露(ES)和非自杀自伤(NSSI)。RoBERTa优于其他使用二元相关性的模型(acc=0.86, F1=0.78)。MentalBERT (F1=0.74)也超过BioClinicalBERT (F1=0.72)。RoBERTa对单个多标签分类器进行了微调,进一步提高了性能(acc=0.88, F1=0.81),突出表明在领域相关数据上预训练的模型和单个多标签分类策略提高了效率和性能。
{"title":"Suicide Phenotyping from Clinical Notes in Safety-Net Psychiatric Hospital Using Multi-Label Classification with Pre-Trained Language Models.","authors":"Zehan Li, Yan Hu, Scott Lane, Salih Selek, Lokesh Shahani, Rodrigo Machado-Vieira, Jair Soares, Hua Xu, Hongfang Liu, Ming Huang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Accurate identification and categorization of suicidal events can yield better suicide precautions, reducing operational burden, and improving care quality in high-acuity psychiatric settings. Pre-trained language models offer promise for identifying suicidality from unstructured clinical narratives. We evaluated the performance of four BERT-based models using two fine-tuning strategies (multiple single-label and single multi-label) for detecting coexisting suicidal events from 500 annotated psychiatric evaluation notes. The notes were labeled for suicidal ideation (SI), suicide attempts (SA), exposure to suicide (ES), and non-suicidal self-injury (NSSI). RoBERTa outperformed other models using binary relevance (acc=0.86, F1=0.78). MentalBERT (F1=0.74) also exceeded BioClinicalBERT (F1=0.72). RoBERTa fine-tuned with a single multi-label classifier further improved performance (acc=0.88, F1=0.81), highlighting that models pre-trained on domain-relevant data and the single multi-label classification strategy enhance efficiency and performance.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"260-269"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150747/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Positionally Encoded Transformer for Monitoring Health Contexts of Hajj Pilgrims from Wearable Sensor Data. 利用可穿戴传感器数据监测朝觐朝圣者健康状况的位置编码变压器。
Nazim A Belabbaci, Raphael Anaadumba, Mohammad Arif Ul Alam

Monitoring the health of individuals during physically demanding tasks, such as the Hajj pilgrimage, requires robust methods for real-time detection of health-relevant contexts, including physical tiredness, emotional mood, and activity type. This paper introduces a positionally encoded Transformer model designed to detect these contexts from time-series data collected via wearable sensors. The model leverages Long Short-Term Memory (LSTM) for feature extraction and Transformer layers for context classification, utilizing positional encoding to capture the sequential dependencies within the sensor data. Our experiments, using data from 19 participants, show that the proposed model achieves high classification accuracy across multiple health-relevant contexts, significantly improving real-time health monitoring.

在完成对体力要求很高的任务(如朝觐)期间监测个人的健康状况,需要强有力的方法来实时检测与健康相关的情况,包括身体疲劳、情绪情绪和活动类型。本文介绍了一种位置编码的Transformer模型,用于从可穿戴传感器收集的时间序列数据中检测这些上下文。该模型利用长短期记忆(LSTM)进行特征提取,利用Transformer层进行上下文分类,利用位置编码捕获传感器数据中的顺序依赖项。我们使用来自19个参与者的数据进行的实验表明,所提出的模型在多种健康相关背景下实现了很高的分类精度,显著提高了实时健康监测。
{"title":"A Positionally Encoded Transformer for Monitoring Health Contexts of Hajj Pilgrims from Wearable Sensor Data.","authors":"Nazim A Belabbaci, Raphael Anaadumba, Mohammad Arif Ul Alam","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Monitoring the health of individuals during physically demanding tasks, such as the Hajj pilgrimage, requires robust methods for real-time detection of health-relevant contexts, including physical tiredness, emotional mood, and activity type. This paper introduces a positionally encoded Transformer model designed to detect these contexts from time-series data collected via wearable sensors. The model leverages Long Short-Term Memory (LSTM) for feature extraction and Transformer layers for context classification, utilizing positional encoding to capture the sequential dependencies within the sensor data. Our experiments, using data from 19 participants, show that the proposed model achieves high classification accuracy across multiple health-relevant contexts, significantly improving real-time health monitoring.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"84-94"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150694/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing Geographic Diversity in Systematic Reviews: A 3D Interactive Approach Using Cochrane SRs in IPF. 评估系统评价中的地理多样性:在IPF中使用Cochrane SRs的3D互动方法。
Hui Li, Jinlian Wang, Hongfang Liu

The Systematic reviews (SRs) for Idiopathic Pulmonary Fibrosis (IPF) play a crucial role in guiding evidence-based healthcare by synthesizing data across multiple studies. A key factor in ensuring the reliability and applicability of these reviews is the geographic diversity of the authors involved, as this can significantly influence the generalizability of findings. Traditional 2D maps used to visualize author locations often fall short in capturing the depth and regional disparities effectively, as overlapping points or dense clusters can obscure critical details, resulting in an incomplete view of geographic distribution. To address these limitations, this study introduces a novel approach that combines a 3D geographic map and a Temporal-Spatial Graph Attention Network (TS-GAT) to assess and visualize the geographic diversity of authors in SRs on IPF. The 3D visualization provides an enhanced, layered representation of author locations, revealing hidden regional disparities and biases. The TS-GAT captures both temporal and spatial relationships in the author collaboration network, allowing for deeper insights into the evolution of geographic representation over time. This integrated approach aims to uncover potential biases in global representation, offering a comprehensive understanding of the geographic spread and temporal trends in authorship within SRs, ultimately contributing to more balanced and inclusive evidence synthesis.

特发性肺纤维化(IPF)的系统评价(SRs)通过综合多个研究的数据,在指导循证医疗保健方面发挥着至关重要的作用。确保这些综述的可靠性和适用性的一个关键因素是所涉及作者的地理多样性,因为这可以显著影响研究结果的普遍性。用于可视化作者位置的传统2D地图往往无法有效地捕捉深度和区域差异,因为重叠的点或密集的集群会掩盖关键细节,导致地理分布的不完整视图。为了解决这些限制,本研究引入了一种结合三维地理地图和时空图注意网络(TS-GAT)的新方法,以评估和可视化IPF上SRs作者的地理多样性。3D可视化提供了作者位置的增强分层表示,揭示了隐藏的区域差异和偏见。TS-GAT捕获了作者协作网络中的时间和空间关系,允许更深入地了解地理代表性随时间的演变。这种综合方法旨在揭示全球代表性的潜在偏见,全面了解SRs中作者身份的地理分布和时间趋势,最终有助于更平衡和包容的证据合成。
{"title":"Assessing Geographic Diversity in Systematic Reviews: A 3D Interactive Approach Using Cochrane SRs in IPF.","authors":"Hui Li, Jinlian Wang, Hongfang Liu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The Systematic reviews (SRs) for Idiopathic Pulmonary Fibrosis (IPF) play a crucial role in guiding evidence-based healthcare by synthesizing data across multiple studies. A key factor in ensuring the reliability and applicability of these reviews is the geographic diversity of the authors involved, as this can significantly influence the generalizability of findings. Traditional 2D maps used to visualize author locations often fall short in capturing the depth and regional disparities effectively, as overlapping points or dense clusters can obscure critical details, resulting in an incomplete view of geographic distribution. To address these limitations, this study introduces a novel approach that combines a 3D geographic map and a Temporal-Spatial Graph Attention Network (TS-GAT) to assess and visualize the geographic diversity of authors in SRs on IPF. The 3D visualization provides an enhanced, layered representation of author locations, revealing hidden regional disparities and biases. The TS-GAT captures both temporal and spatial relationships in the author collaboration network, allowing for deeper insights into the evolution of geographic representation over time. This integrated approach aims to uncover potential biases in global representation, offering a comprehensive understanding of the geographic spread and temporal trends in authorship within SRs, ultimately contributing to more balanced and inclusive evidence synthesis.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"290-299"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150703/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automating and Evaluating Large Language Models for Accurate Text Summarization Under Zero-Shot Conditions. 自动化和评估大型语言模型在零射击条件下准确的文本摘要。
Maria Priebe Mendes Rocha, Hilda B Klasky

Automated text summarization (ATS) is crucial for collecting specialized, domain-specific information. Zero-shot learning (ZSL) allows large language models (LLMs) to respond to prompts on information not included in their training, playing a vital role in this process. This study evaluates LLMs' effectiveness in generating accurate summaries under ZSL conditions and explores using retrieval augmented generation (RAG) and prompt engineering to enhance factual accuracy and understanding. We combined LLMs with summarization modeling, prompt engineering, and RAG, evaluating the summaries using the METEOR metric and keyword frequencies through word clouds. Results indicate that LLMs are generally well-suited for ATS tasks, demonstrating an ability to handle specialized information under ZSL conditions with RAG. However, web scraping limitations hinder a single generalized retrieval mechanism. While LLMs show promise for ATS under ZSL conditions with RAG, challenges like goal misgeneralization and web scraping limitations need addressing. Future research should focus on solutions to these issues.

自动文本摘要(ATS)对于收集专门化的、特定于领域的信息至关重要。Zero-shot learning (ZSL)允许大型语言模型(llm)对训练中未包含的信息做出响应,在此过程中起着至关重要的作用。本研究评估了llm在ZSL条件下生成准确摘要的有效性,并探索了使用检索增强生成(RAG)和提示工程来提高事实准确性和理解。我们将法学硕士与摘要建模、提示工程和RAG结合起来,使用METEOR度量和通过词云的关键字频率来评估摘要。结果表明llm通常非常适合ATS任务,展示了在ZSL条件下使用RAG处理专门信息的能力。然而,网页抓取的局限性阻碍了单一的通用检索机制。虽然llm显示了在ZSL条件下使用RAG进行ATS的希望,但需要解决目标错误概括和网络抓取限制等挑战。未来的研究应侧重于解决这些问题。
{"title":"Automating and Evaluating Large Language Models for Accurate Text Summarization Under Zero-Shot Conditions.","authors":"Maria Priebe Mendes Rocha, Hilda B Klasky","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Automated text summarization (ATS) is crucial for collecting specialized, domain-specific information. Zero-shot learning (ZSL) allows large language models (LLMs) to respond to prompts on information not included in their training, playing a vital role in this process. This study evaluates LLMs' effectiveness in generating accurate summaries under ZSL conditions and explores using retrieval augmented generation (RAG) and prompt engineering to enhance factual accuracy and understanding. We combined LLMs with summarization modeling, prompt engineering, and RAG, evaluating the summaries using the METEOR metric and keyword frequencies through word clouds. Results indicate that LLMs are generally well-suited for ATS tasks, demonstrating an ability to handle specialized information under ZSL conditions with RAG. However, web scraping limitations hinder a single generalized retrieval mechanism. While LLMs show promise for ATS under ZSL conditions with RAG, challenges like goal misgeneralization and web scraping limitations need addressing. Future research should focus on solutions to these issues.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"461-470"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150706/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning. 生物生物学- nlu:通过指令调谐实现更一般化的医学语言理解。
Yujuan Velvin Fu, Giridhar Kaushik Ramachandran, Namu Park, Kevin Lybarger, Fei Xia, Ozlem Uzuner, Meliha Yetisgen

Large language models (LLMs) such as ChatGPT are fine-tuned on large and diverse instruction-following corpora, and can generalize to new tasks. However, those instruction-tuned LLMs often perform poorly in specialized medical natural language understanding (NLU) tasks that require domain knowledge, granular text comprehension, and structured data extraction. To bridge the gap, we: (1) propose a unified prompting format for 7 important NLU tasks, (2) curate an instruction-tuning dataset, MNLU-Instruct, utilizing diverse existing open-source medical NLU corpora, and (3) develop BioMistral-NLU, a generalizable medical NLU model, through fine-tuning BioMistral on MNLU-Instruct. We evaluate BioMistral-NLU in a zero-shot setting, across 6 important NLU tasks, from two widely adopted medical NLU benchmarks: BLUE and BLURB. Our experiments show that our BioMistral-NLU outperforms the original BioMistral, as well as the proprietary LLMs - ChatGPT and GPT-4. Our dataset-agnostic prompting strategy and instruction tuning step over diverse NLU tasks enhance LLMs' generalizability across diverse medical NLU tasks. Our ablation experiments show that instruction-tuning on a wider variety of tasks, even when the total number of training instances remains constant, enhances downstream zero-shot generalization.

像ChatGPT这样的大型语言模型(llm)在大型和多样化的指令遵循语料库上进行了微调,并且可以推广到新的任务。然而,那些指令调优的法学硕士通常在需要领域知识、细粒度文本理解和结构化数据提取的专业医学自然语言理解(NLU)任务中表现不佳。为了弥补这一差距,我们:(1)提出了7个重要NLU任务的统一提示格式;(2)利用现有的多种开源医学NLU语料库构建了一个指令调优数据集mnlu - directive;(3)通过对mnlu - directive上的BioMistral进行微调,开发了一个可推广的医学NLU模型BioMistral-NLU。我们从两个广泛采用的医学NLU基准:BLUE和BLURB,在零射击设置中评估BioMistral-NLU,跨越6个重要的NLU任务。我们的实验表明,我们的BioMistral- nlu优于原始的BioMistral,以及专有的LLMs - ChatGPT和GPT-4。我们在不同NLU任务上的数据集不可知提示策略和指令调整步骤增强了llm在不同医学NLU任务中的泛化性。我们的消融实验表明,在更广泛的任务上进行指令调优,即使在训练实例总数保持不变的情况下,也能增强下游的零射击泛化。
{"title":"BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning.","authors":"Yujuan Velvin Fu, Giridhar Kaushik Ramachandran, Namu Park, Kevin Lybarger, Fei Xia, Ozlem Uzuner, Meliha Yetisgen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Large language models (LLMs) such as ChatGPT are fine-tuned on large and diverse instruction-following corpora, and can generalize to new tasks. However, those instruction-tuned LLMs often perform poorly in specialized medical natural language understanding (NLU) tasks that require domain knowledge, granular text comprehension, and structured data extraction. To bridge the gap, we: (1) propose a unified prompting format for 7 important NLU tasks, (2) curate an instruction-tuning dataset, MNLU-Instruct, utilizing diverse existing open-source medical NLU corpora, and (3) develop BioMistral-NLU, a generalizable medical NLU model, through fine-tuning BioMistral on MNLU-Instruct. We evaluate BioMistral-NLU in a zero-shot setting, across 6 important NLU tasks, from two widely adopted medical NLU benchmarks: BLUE and BLURB. Our experiments show that our BioMistral-NLU outperforms the original BioMistral, as well as the proprietary LLMs - ChatGPT and GPT-4. Our dataset-agnostic prompting strategy and instruction tuning step over diverse NLU tasks enhance LLMs' generalizability across diverse medical NLU tasks. Our ablation experiments show that instruction-tuning on a wider variety of tasks, even when the total number of training instances remains constant, enhances downstream zero-shot generalization.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"149-158"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150714/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building Trust in Clinical AI: A Web-Based Explainable Decision Support System for Chronic Kidney Disease. 在临床人工智能中建立信任:一个基于网络的可解释的慢性肾脏疾病决策支持系统。
Krishna Mridha, Ming Wang, Lijun Zhang

Chronic Kidney Disease (CKD) is a significant global public health issue, affecting over 10% of the population. Timely diagnosis is crucial for effective management. Leveraging machine learning within healthcare offers promising advancements in predictive diagnostics. We developed a Web-Based Clinical Decision Support System (CDSS) for CKD, incorporating advanced Explainable AI (XAI) methods, specifically SHAP (Shapley Additive Explanations) and LIME (Local Interpretable Model-agnostic Explanations). The model employs and evaluates multiple classifiers: KNN, Random Forest, AdaBoost, XGBoost, CatBoost, and Extra Trees, to predict CKD. The effectiveness of the models is assessed by measuring their accuracy, analyzing confusion matrix statistics, and the AUC. AdaBoost achieved a 100% accuracy rate. Except for KNN, all classifiers consistently reached perfect precision and sensitivity. Additionally, we present a real-time web-based application to operationalize the model, enhancing trust and accessibility for healthcare practitioners and stakeholder.

慢性肾脏疾病(CKD)是一个重大的全球公共卫生问题,影响着超过10%的人口。及时诊断是有效治疗的关键。在医疗保健领域利用机器学习为预测性诊断提供了有希望的进步。我们为CKD开发了一个基于网络的临床决策支持系统(CDSS),结合了先进的可解释人工智能(XAI)方法,特别是Shapley加性解释(Shapley Additive explanation)和LIME (Local Interpretable Model-agnostic explanation)。该模型采用并评估多个分类器:KNN、Random Forest、AdaBoost、XGBoost、CatBoost和Extra Trees来预测CKD。通过测量模型的准确性、分析混淆矩阵统计和AUC来评估模型的有效性。AdaBoost达到了100%的准确率。除KNN外,所有分类器的精度和灵敏度都达到了很好的水平。此外,我们提出了一个实时的基于web的应用程序来操作模型,增强了医疗保健从业者和利益相关者的信任和可访问性。
{"title":"Building Trust in Clinical AI: A Web-Based Explainable Decision Support System for Chronic Kidney Disease.","authors":"Krishna Mridha, Ming Wang, Lijun Zhang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Chronic Kidney Disease (CKD) is a significant global public health issue, affecting over 10% of the population. Timely diagnosis is crucial for effective management. Leveraging machine learning within healthcare offers promising advancements in predictive diagnostics. We developed a Web-Based Clinical Decision Support System (CDSS) for CKD, incorporating advanced Explainable AI (XAI) methods, specifically SHAP (Shapley Additive Explanations) and LIME (Local Interpretable Model-agnostic Explanations). The model employs and evaluates multiple classifiers: KNN, Random Forest, AdaBoost, XGBoost, CatBoost, and Extra Trees, to predict CKD. The effectiveness of the models is assessed by measuring their accuracy, analyzing confusion matrix statistics, and the AUC. AdaBoost achieved a 100% accuracy rate. Except for KNN, all classifiers consistently reached perfect precision and sensitivity. Additionally, we present a real-time web-based application to operationalize the model, enhancing trust and accessibility for healthcare practitioners and stakeholder.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"375-384"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150721/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the Impact of Social Determinants of Health on Diagnostic Delays and Access to Antifibrotic Treatment in Idiopathic Pulmonary Fibrosis. 调查健康的社会决定因素对特发性肺纤维化诊断延迟和获得抗纤维化治疗的影响。
Rui Li, Qiuhao Lu, Andrew Wen, Jinlian Wang, Sunyang Fu, Xiaoyang Ruan, Liwei Wang, Hongfang Liu

Idiopathic pulmonary fibrosis (IPF) is a rare disease that is challenging to diagnose. Patients with IPF often spend years awaiting a diagnosis after the onset of initial respiratory symptoms, and only a small percentage receive antifibrotic treatment. In this study, we examine the associations between social determinants of health (SDoH) and two critical factors: time to IPF diagnosis following the onset of initial respiratory symptoms, and whether the patient receives antifibrotic treatment. To approximate individual SDoH characteristics, we extract demographic-specific averages from zip code-level data using the American Community Survey (via the U.S. Census Bureau API). Two classification models are constructed, including logistic regression and XGBoost classification. The results indicate that for time-to-diagnosis, the top three SDoH factors are education, gender, and insurance coverage. Patients with higher education levels and better insurance are more likely to receive a quicker diagnosis, with males having an advantage over females. For antifibrotic treatment, the top three SDoH factors are insurance, gender, and race. Patients with better insurance coverage are more likely to receive antifibrotic treatment, with males and White patients having an advantage over females and patients of other ethnicities. This research may help address disparities in the diagnosis and treatment of IPF related to socioeconomic status.

特发性肺纤维化(IPF)是一种罕见的疾病,具有挑战性的诊断。IPF患者在出现最初的呼吸道症状后,往往要等待数年才能得到诊断,而且只有一小部分患者接受抗纤维化治疗。在这项研究中,我们研究了健康的社会决定因素(SDoH)与两个关键因素之间的关系:初始呼吸道症状发作后到IPF诊断的时间,以及患者是否接受抗纤维化治疗。为了近似个人SDoH特征,我们使用美国社区调查(通过美国人口普查局API)从邮政编码级别的数据中提取特定于人口统计学的平均值。构建了逻辑回归和XGBoost分类两种分类模型。结果表明,对于诊断时间,最重要的三个SDoH因素是教育程度、性别和保险覆盖率。受教育程度较高、保险较好的患者更有可能得到更快的诊断,男性比女性有优势。对于抗纤维化治疗,前三个SDoH因素是保险、性别和种族。有较好保险覆盖的患者更有可能接受抗纤维化治疗,男性和白人患者比女性和其他种族的患者有优势。本研究可能有助于解决与社会经济地位相关的IPF诊断和治疗差异。
{"title":"Investigating the Impact of Social Determinants of Health on Diagnostic Delays and Access to Antifibrotic Treatment in Idiopathic Pulmonary Fibrosis.","authors":"Rui Li, Qiuhao Lu, Andrew Wen, Jinlian Wang, Sunyang Fu, Xiaoyang Ruan, Liwei Wang, Hongfang Liu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Idiopathic pulmonary fibrosis (IPF) is a rare disease that is challenging to diagnose. Patients with IPF often spend years awaiting a diagnosis after the onset of initial respiratory symptoms, and only a small percentage receive antifibrotic treatment. In this study, we examine the associations between social determinants of health (SDoH) and two critical factors: time to IPF diagnosis following the onset of initial respiratory symptoms, and whether the patient receives antifibrotic treatment. To approximate individual SDoH characteristics, we extract demographic-specific averages from zip code-level data using the American Community Survey (via the U.S. Census Bureau API). Two classification models are constructed, including logistic regression and XGBoost classification. The results indicate that for time-to-diagnosis, the top three SDoH factors are education, gender, and insurance coverage. Patients with higher education levels and better insurance are more likely to receive a quicker diagnosis, with males having an advantage over females. For antifibrotic treatment, the top three SDoH factors are insurance, gender, and race. Patients with better insurance coverage are more likely to receive antifibrotic treatment, with males and White patients having an advantage over females and patients of other ethnicities. This research may help address disparities in the diagnosis and treatment of IPF related to socioeconomic status.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"280-289"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150710/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276854","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging GPT-4o for Automated Extraction of Neural Projections from Scientific Literature. 利用gpt - 40从科学文献中自动提取神经投影。
Rashmie Abeysinghe, Gorbachev Jowah, Licong Cui, Samden D Lhatoo, Guo-Qiang Zhang

Sudden Unexpected Death in Epilepsy (SUDEP) is a major cause of death for epilepsy patients having uncontrolled seizures. Understanding the complex neural circuits within the central nervous system is crucial for understanding the mechanisms underlying cardiorespiratory regulation, particularly in the context of SUDEP. This study explores the potential of GPT-4o, a cutting-edge language model, to automate the extraction of neural projections from scientific literature. We developed prompts to extract neuroscientific structures, extract projections, and perform synonym harmonization. Applying the approach to four neuroscientific articles, the method extracted 205 projections. A random sample of 100 projections identified was handed over to a domain expert for review where 95 were found to be correct. Therefore, GPT-4o was determined to be accurate in parsing complex scientific texts in extracting neural projections. Future work will involve extracting additional entities like techniques and species information for the projections identified.

癫痫猝死(SUDEP)是癫痫发作不受控制的癫痫患者死亡的主要原因。了解中枢神经系统内复杂的神经回路对于理解心肺调节机制至关重要,特别是在猝死的背景下。本研究探索了gpt - 40这一前沿语言模型在自动提取科学文献中的神经投影方面的潜力。我们开发了提示提取神经科学结构,提取投影,并执行同义词协调。将该方法应用于四篇神经科学论文,该方法提取了205个投影。从100个预测中随机抽取样本,交给领域专家进行审查,其中95个被发现是正确的。因此,在提取神经投影时,gpt - 40在解析复杂科学文本时是准确的。今后的工作将涉及为已确定的预测提取技术和物种信息等其他实体。
{"title":"Leveraging GPT-4o for Automated Extraction of Neural Projections from Scientific Literature.","authors":"Rashmie Abeysinghe, Gorbachev Jowah, Licong Cui, Samden D Lhatoo, Guo-Qiang Zhang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Sudden Unexpected Death in Epilepsy (SUDEP) is a major cause of death for epilepsy patients having uncontrolled seizures. Understanding the complex neural circuits within the central nervous system is crucial for understanding the mechanisms underlying cardiorespiratory regulation, particularly in the context of SUDEP. This study explores the potential of GPT-4o, a cutting-edge language model, to automate the extraction of neural projections from scientific literature. We developed prompts to extract neuroscientific structures, extract projections, and perform synonym harmonization. Applying the approach to four neuroscientific articles, the method extracted 205 projections. A random sample of 100 projections identified was handed over to a domain expert for review where 95 were found to be correct. Therefore, GPT-4o was determined to be accurate in parsing complex scientific texts in extracting neural projections. Future work will involve extracting additional entities like techniques and species information for the projections identified.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"32-41"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150692/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276856","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Social Determinants of Health in Alzheimer's Research Using LLM-Augmented Literature Mining and Knowledge Graphs. 利用法学硕士增强文献挖掘和知识图谱在阿尔茨海默病研究中利用健康的社会决定因素。
Tianqi Shang, Shu Yang, Weiqing He, Tianhua Zhai, Dawei Li, Bojian Hou, Tianlong Chen, Jason H Moore, Marylyn D Ritchie, Li Shen

Growing evidence suggests that social determinants of health (SDoH), a set of nonmedical factors, affect individuals' risks of developing Alzheimer's disease (AD) and related dementias. Nevertheless, the etiological mechanisms underlying such relationships remain largely unclear, mainly due to difficulties in collecting relevant information. This study presents a novel, automated framework that leverages recent advancements of large language model (LLM) and natural language processing techniques to mine SDoH knowledge from extensive literature and integrate it with AD-related biological entities extracted from the general-purpose knowledge graph PrimeKG. Utilizing graph neural networks, we performed link prediction tasks to evaluate the resultant SDoH-augmented knowledge graph. Our framework shows promise for enhancing knowledge discovery in AD and can be generalized to other SDoH-related research areas, offering a new tool for exploring the impact of social determinants on health outcomes. Our code is available at: https://github.com/hwq0726/SDoHenPKG.

越来越多的证据表明,健康的社会决定因素(SDoH),一组非医学因素,影响个人患阿尔茨海默病(AD)和相关痴呆的风险。然而,这些关系背后的病因机制仍然不清楚,主要是由于收集相关信息的困难。本研究提出了一个新颖的自动化框架,该框架利用大型语言模型(LLM)和自然语言处理技术的最新进展,从广泛的文献中挖掘SDoH知识,并将其与从通用知识图PrimeKG中提取的ad相关生物实体相集成。利用图神经网络,我们执行链接预测任务来评估得到的sdoh增强知识图。我们的框架有望加强阿尔茨海默病的知识发现,并可推广到其他与阿尔茨海默病相关的研究领域,为探索社会决定因素对健康结果的影响提供了一种新工具。我们的代码可在:https://github.com/hwq0726/SDoHenPKG。
{"title":"Leveraging Social Determinants of Health in Alzheimer's Research Using LLM-Augmented Literature Mining and Knowledge Graphs.","authors":"Tianqi Shang, Shu Yang, Weiqing He, Tianhua Zhai, Dawei Li, Bojian Hou, Tianlong Chen, Jason H Moore, Marylyn D Ritchie, Li Shen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p><i>Growing evidence suggests that social determinants of health (SDoH), a set of nonmedical factors, affect individuals' risks of developing Alzheimer's disease (AD) and related dementias. Nevertheless, the etiological mechanisms underlying such relationships remain largely unclear, mainly due to difficulties in collecting relevant information. This study presents a novel, automated framework that leverages recent advancements of large language model (LLM) and natural language processing techniques to mine SDoH knowledge from extensive literature and integrate it with AD-related biological entities extracted from the general-purpose knowledge graph PrimeKG. Utilizing graph neural networks, we performed link prediction tasks to evaluate the resultant SDoH-augmented knowledge graph. Our framework shows promise for enhancing knowledge discovery in AD and can be generalized to other SDoH-related research areas, offering a new tool for exploring the impact of social determinants on health outcomes. Our code is available at:</i> https://github.com/hwq0726/SDoHenPKG.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"491-500"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150755/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable Diagnosis Prediction through Neuro-Symbolic Integration. 神经符号整合的可解释诊断预测。
Qiuhao Lu, Rui Li, Elham Sagheb, Andrew Wen, Jinlian Wang, Liwei Wang, Jungwei W Fan, Hongfang Liu

Diagnosis prediction is a critical task in healthcare, where timely and accurate identification of medical conditions can significantly impact patient outcomes. Traditional machine learning and deep learning models have achieved notable success in this domain but often lack interpretability which is a crucial requirement in clinical settings. In this study, we explore the use of neuro-symbolic methods, specifically Logical Neural Networks (LNNs), to develop explainable models for diagnosis prediction. Essentially, we design and implement LNN-based models that integrate domain-specific knowledge through logical rules with learnable weights and thresholds. Our models, particularly Mmulti-pathway and Mcomprehensive, demonstrate superior performance over traditional models such as Logistic Regression, SVM, and Random Forest, achieving higher accuracy (up to 80.52%) and AUROC scores (up to 0.8457) in the case study of diabetes prediction. The learned weights and thresholds within the LNN models provide direct insights into feature contributions, enhancing interpretability without compromising predictive power. These findings highlight the potential of neuro-symbolic approaches in bridging the gap between accuracy and explainability in healthcare AI applications. By offering transparent and adaptable diagnostic models, our work contributes to the advancement ofprecision medicine and supports the development of equitable healthcare solutions. Future research will focus on extending these methods to larger and more diverse datasets to further validate their applicability across different medical conditions and populations.

诊断预测是医疗保健中的一项关键任务,及时准确地识别医疗状况可以显著影响患者的治疗结果。传统的机器学习和深度学习模型在这一领域取得了显著的成功,但往往缺乏可解释性,这是临床环境的关键要求。在这项研究中,我们探索使用神经符号方法,特别是逻辑神经网络(LNNs),来开发诊断预测的可解释模型。本质上,我们设计并实现了基于lnn的模型,该模型通过具有可学习权值和阈值的逻辑规则集成了特定领域的知识。我们的模型,特别是Mmulti-pathway和Mcomprehensive,在糖尿病预测的案例研究中表现出优于传统模型(如Logistic回归、SVM和Random Forest)的性能,达到更高的准确率(高达80.52%)和AUROC分数(高达0.8457)。LNN模型中的学习权值和阈值提供了对特征贡献的直接洞察,在不影响预测能力的情况下增强了可解释性。这些发现突出了神经符号方法在弥合医疗人工智能应用中准确性和可解释性之间差距方面的潜力。通过提供透明和适应性强的诊断模型,我们的工作有助于推进精准医疗,并支持公平医疗解决方案的发展。未来的研究将侧重于将这些方法扩展到更大、更多样化的数据集,以进一步验证它们在不同医疗条件和人群中的适用性。
{"title":"Explainable Diagnosis Prediction through Neuro-Symbolic Integration.","authors":"Qiuhao Lu, Rui Li, Elham Sagheb, Andrew Wen, Jinlian Wang, Liwei Wang, Jungwei W Fan, Hongfang Liu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Diagnosis prediction is a critical task in healthcare, where timely and accurate identification of medical conditions can significantly impact patient outcomes. Traditional machine learning and deep learning models have achieved notable success in this domain but often lack interpretability which is a crucial requirement in clinical settings. In this study, we explore the use of neuro-symbolic methods, specifically Logical Neural Networks (LNNs), to develop explainable models for diagnosis prediction. Essentially, we design and implement LNN-based models that integrate domain-specific knowledge through logical rules with learnable weights and thresholds. Our models, particularly Mmulti-pathway and Mcomprehensive, demonstrate superior performance over traditional models such as Logistic Regression, SVM, and Random Forest, achieving higher accuracy (up to 80.52%) and AUROC scores (up to 0.8457) in the case study of diabetes prediction. The learned weights and thresholds within the LNN models provide direct insights into feature contributions, enhancing interpretability without compromising predictive power. These findings highlight the potential of neuro-symbolic approaches in bridging the gap between accuracy and explainability in healthcare AI applications. By offering transparent and adaptable diagnostic models, our work contributes to the advancement ofprecision medicine and supports the development of equitable healthcare solutions. Future research will focus on extending these methods to larger and more diverse datasets to further validate their applicability across different medical conditions and populations.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"332-341"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150699/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1