首页 > 最新文献

IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics最新文献

英文 中文
Do LLMs Surpass Encoders for Biomedical NER? llm是否超越了生物医学NER编码器?
Pub Date : 2025-06-01 Epub Date: 2025-07-22 DOI: 10.1109/ICHI64645.2025.00048
Motasem S Obeidat, Md Sultan Al Nahian, Ramakanth Kavuluru

Recognizing spans of biomedical concepts and their types (e.g., drug or gene) in free text, often called biomedical named entity recognition (NER), is a basic component of information extraction (IE) pipelines. Without a strong NER component, other applications, such as knowledge discovery and information retrieval, are not practical. State-of-the-art in NER shifted from traditional ML models to deep neural networks with transformer-based encoder models (e.g., BERT) emerging as the current standard. However, decoder models (also called large language models or LLMs) are gaining traction in IE. But LLM-driven NER often ignores positional information due to the generative nature of decoder models. Furthermore, they are computationally very expensive (both in inference time and hardware needs). Hence, it is worth exploring if they actually excel at biomedical NER and assess any associated trade-offs (performance vs efficiency). This is exactly what we do in this effort employing the same BIO entity tagging scheme (that retains positional information) using five different datasets with varying proportions of longer entities. Our results show that the LLMs chosen (Mistral and Llama: 8B range) often outperform best encoder models (BERT-(un)cased, BiomedBERT, and DeBERTav3: 300M range) by 2-8% in F-scores except for one dataset, where they equal encoder performance. This gain is more prominent among longer entities of length ≥ 3 tokens. However, LLMs are one to two orders of magnitude more expensive at inference time and may need cost prohibitive hardware. Thus, when performance differences are small or real time user feedback is needed, encoder models might still be more suitable than LLMs.

在自由文本中识别生物医学概念及其类型(如药物或基因)的范围,通常称为生物医学命名实体识别(NER),是信息提取(IE)管道的基本组成部分。如果没有一个强大的NER组件,其他应用,如知识发现和信息检索,是不实用的。NER的最新技术从传统的ML模型转向深度神经网络,基于变压器的编码器模型(例如BERT)成为当前的标准。然而,解码器模型(也称为大型语言模型或llm)在IE中越来越受欢迎。但是由于解码器模型的生成特性,llm驱动的NER常常忽略位置信息。此外,它们在计算上非常昂贵(在推理时间和硬件需求方面)。因此,值得探索的是,他们是否真的擅长生物医学NER,并评估任何相关的权衡(性能与效率)。这正是我们在这项工作中所做的,使用相同的BIO实体标记方案(保留位置信息),使用五个不同的数据集,这些数据集具有不同比例的较长的实体。我们的研究结果表明,除了一个数据集之外,所选择的llm (Mistral和Llama: 8B范围)在f分数上通常比最佳编码器模型(BERT-(un) cases, BiomedBERT和DeBERTav3: 300M范围)高出2-8%,其中一个数据集的编码器性能相同。这种增益在长度≥3个令牌的较长实体中更为突出。然而,llm在推理时的成本要高出一到两个数量级,并且可能需要成本高昂的硬件。因此,当性能差异很小或需要实时用户反馈时,编码器模型可能仍然比llm更合适。
{"title":"Do LLMs Surpass Encoders for Biomedical NER?","authors":"Motasem S Obeidat, Md Sultan Al Nahian, Ramakanth Kavuluru","doi":"10.1109/ICHI64645.2025.00048","DOIUrl":"10.1109/ICHI64645.2025.00048","url":null,"abstract":"<p><p>Recognizing spans of biomedical concepts and their types (e.g., drug or gene) in free text, often called biomedical named entity recognition (NER), is a basic component of information extraction (IE) pipelines. Without a strong NER component, other applications, such as knowledge discovery and information retrieval, are not practical. State-of-the-art in NER shifted from traditional ML models to deep neural networks with transformer-based encoder models (e.g., BERT) emerging as the current standard. However, decoder models (also called large language models or LLMs) are gaining traction in IE. But LLM-driven NER often ignores positional information due to the generative nature of decoder models. Furthermore, they are computationally very expensive (both in inference time and hardware needs). Hence, it is worth exploring if they actually excel at biomedical NER and assess any associated trade-offs (performance vs efficiency). This is exactly what we do in this effort employing the same BIO entity tagging scheme (that retains positional information) using five different datasets with varying proportions of longer entities. Our results show that the LLMs chosen (Mistral and Llama: 8B range) often outperform best encoder models (BERT-(un)cased, BiomedBERT, and DeBERTav3: 300M range) by 2-8% in F-scores except for one dataset, where they equal encoder performance. This gain is more prominent among longer entities of length ≥ 3 tokens. However, LLMs are one to two orders of magnitude more expensive at inference time and may need cost prohibitive hardware. Thus, when performance differences are small or real time user feedback is needed, encoder models might still be more suitable than LLMs.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2025 ","pages":"352-358"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12335919/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144818434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
What to consider when developing a new molecular HIV surveillance tool: Perspectives of key stakeholders working in HIV prevention and treatment. 在开发一种新的HIV分子监测工具时需要考虑什么:从事HIV预防和治疗工作的关键利益相关者的观点。
Pub Date : 2025-06-01 Epub Date: 2025-07-22 DOI: 10.1109/ichi64645.2025.00072
Shantrel S Canidate, Hannah R Gracy, Sean McIntosh, Yiyang Liu, Rebecca Fisk-Hoffman, Shannon Rich, Carla Mavian, Robert L Cook, Mattia Prosperi, Marco Salemi

Developing and validating novel molecular HIV surveillance (MHS) tools capable of predicting the growth and trajectory of localized outbreaks driven by specific transmission clusters is key to the Ending the HIV Epidemic in the United States initiative. This study explored stakeholders' perspectives on HIV prevention and treatment regarding a developing deep-learning framework, DeepDynaForecast, and its ability to predict HIV transmission cluster trajectories and inform decision-making on HIV prevention and treatment scale-up approaches in Florida. We conducted five virtual focus group discussions with 16 clinical health professionals and state and local public health personnel. Focus group discussions were audio-recorded, transcribed using Zoom transcription, and manually coded using a reflexive thematic analysis approach. Overall, participants reported a high level of acceptability for using MHS tools. However, when exploring their perspectives on using the DeepDynaForecast tool,participants discussed their acceptance criteria, including key features that the DeepDynaForecast tool should have and the need to determine the data types the tool should generate to meet their needs and be deemed acceptable. Before implementation, participants felt the tool should undergo extensive software testing, followed by end-users receiving comprehensive training and the developers determining how the DeepDynaForecast tool could integrate with existing MHS tools. Likewise, participants discussed using the data generated by DeepDynaForecast to increase HIV prevention, education, outreach activities, and mobilization efforts in communities where the most HIV diagnoses occur, as well as increase behavioral change communication efforts. Participants also expressed concerns about HIV-related stigma, a potentially dangerous unintended consequence of using existing and new MHS tools. Current MHS tools have helped inform and evaluate HIV prevention and treatment efforts in the US. A novel MHS tool such as DeepDynaForecast may be critical to achieving the Ending the HIV Epidemic (EHE) goals and curbing the spread of HIV in Florida and in the US.

开发和验证新型艾滋病毒分子监测(MHS)工具,这些工具能够预测由特定传播集群驱动的局部暴发的增长和轨迹,这是“结束美国艾滋病毒流行”倡议的关键。本研究探讨了利益相关者对艾滋病预防和治疗的看法,涉及一个正在开发的深度学习框架,deepdynafforecast,及其预测艾滋病毒传播集群轨迹的能力,并为佛罗里达州艾滋病毒预防和治疗扩大方法的决策提供信息。我们与16名临床卫生专业人员以及州和地方公共卫生人员进行了5次虚拟焦点小组讨论。对焦点小组讨论进行录音,使用Zoom转录,并使用反思性主题分析方法进行手动编码。总体而言,参与者报告了使用MHS工具的高水平可接受性。然而,当探讨他们对使用DeepDynaForecast工具的看法时,参与者讨论了他们的接受标准,包括DeepDynaForecast工具应该具有的关键功能,以及确定工具应该生成的数据类型以满足他们的需求并被认为是可接受的。在实施之前,与会者认为该工具应该进行广泛的软件测试,然后由最终用户接受全面的培训,开发人员确定如何将deepdynafforecast工具与现有的MHS工具集成。同样,与会者讨论了如何利用deepdynafforecast生成的数据,在艾滋病毒诊断最多的社区加强艾滋病毒预防、教育、推广活动和动员工作,以及加强行为改变沟通工作。与会者还表达了对艾滋病毒相关污名的担忧,这是使用现有和新的妇幼保健工具的潜在危险的意外后果。目前MHS工具已经帮助告知和评估美国的艾滋病毒预防和治疗工作。一种新的MHS工具,如deepdynafforecast,可能对实现终结艾滋病毒流行(EHE)的目标和遏制艾滋病毒在佛罗里达州和美国的传播至关重要。
{"title":"What to consider when developing a new molecular HIV surveillance tool: Perspectives of key stakeholders working in HIV prevention and treatment.","authors":"Shantrel S Canidate, Hannah R Gracy, Sean McIntosh, Yiyang Liu, Rebecca Fisk-Hoffman, Shannon Rich, Carla Mavian, Robert L Cook, Mattia Prosperi, Marco Salemi","doi":"10.1109/ichi64645.2025.00072","DOIUrl":"10.1109/ichi64645.2025.00072","url":null,"abstract":"<p><p>Developing and validating novel molecular HIV surveillance (MHS) tools capable of predicting the growth and trajectory of localized outbreaks driven by specific transmission clusters is key to the <i>Ending the HIV Epidemic in the United States initiative</i>. This study explored stakeholders' perspectives on HIV prevention and treatment regarding a developing deep-learning framework, <i>DeepDynaForecast,</i> and its ability to predict HIV transmission cluster trajectories and inform decision-making on HIV prevention and treatment scale-up approaches in Florida. We conducted five virtual focus group discussions with 16 clinical health professionals and state and local public health personnel. Focus group discussions were audio-recorded, transcribed using Zoom transcription, and manually coded using a reflexive thematic analysis approach. Overall, participants reported a high level of acceptability for using MHS tools. However, when exploring their perspectives on using the DeepDynaForecast tool,participants discussed their acceptance criteria, including key features that the DeepDynaForecast tool should have and the need to determine the data types the tool should generate to meet their needs and be deemed acceptable. Before implementation, participants felt the tool should undergo extensive software testing, followed by end-users receiving comprehensive training and the developers determining how the DeepDynaForecast tool could integrate with existing MHS tools. Likewise, participants discussed using the data generated by DeepDynaForecast to increase HIV prevention, education, outreach activities, and mobilization efforts in communities where the most HIV diagnoses occur, as well as increase behavioral change communication efforts. Participants also expressed concerns about HIV-related stigma, a potentially dangerous unintended consequence of using existing and new MHS tools. Current MHS tools have helped inform and evaluate HIV prevention and treatment efforts in the US. A novel MHS tool such as DeepDynaForecast may be critical to achieving the Ending the HIV Epidemic (EHE) goals and curbing the spread of HIV in Florida and in the US.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2025 ","pages":"588-597"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12345360/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144857199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lab-AI: Using Retrieval Augmentation to Enhance Language Models for Personalized Lab Test Interpretation in Clinical Medicine. 实验室人工智能:使用检索增强来增强临床医学个性化实验室测试解释的语言模型。
Xiaoyu Wang, Haoyong Ouyang, Balu Bhasuran, Xiao Luo, Karim Hanna, Mia Liza A Lustria, Zhe He

Accurate interpretation of lab results is crucial in clinical medicine, yet most patient portals use universal normal ranges, ignoring conditional factors like age and gender. This study introduces Lab-AI, an interactive system that offers personalized normal ranges using retrieval-augmented generation (RAG) from credible health sources. Lab-AI has two modules: factor retrieval and normal range retrieval. We tested these on 122 lab tests: 40 with conditional factors and 82 without. For tests with factors, normal ranges depend on patient-specific information. Our results show GPT-4-turbo with RAG achieved a 0.948 F1 score for factor retrieval and 0.995 accuracy for normal range retrieval. GPT-4-turbo with RAG outperformed the best non-RAG system by 33.5% in factor retrieval and showed 132% and 100% improvements in question-level and lab-level performance, respectively, for normal range retrieval. These findings highlight Lab-AI's potential to enhance patient understanding of lab results.

对实验室结果的准确解释在临床医学中至关重要,但大多数患者门户使用普遍的正常范围,忽略了年龄和性别等条件因素。本研究介绍了Lab-AI,这是一个交互式系统,使用来自可靠卫生来源的检索增强生成(RAG)提供个性化的正常范围。Lab-AI有两个模块:因子检索和正常范围检索。我们在122项实验室测试中对这些进行了测试:40项有条件因素,82项没有。对于带有因子的测试,正常范围取决于患者特定的信息。结果表明,GPT-4-turbo在因子检索方面的F1得分为0.948,在正常范围检索方面的准确率为0.995。使用RAG的GPT-4-turbo在因子检索方面的表现比最佳的非RAG系统高出33.5%,在正常范围检索方面,问题水平和实验室水平的性能分别提高了132%和100%。这些发现突出了lab - ai在增强患者对实验室结果理解方面的潜力。
{"title":"Lab-AI: Using Retrieval Augmentation to Enhance Language Models for Personalized Lab Test Interpretation in Clinical Medicine.","authors":"Xiaoyu Wang, Haoyong Ouyang, Balu Bhasuran, Xiao Luo, Karim Hanna, Mia Liza A Lustria, Zhe He","doi":"10.1109/ICHI64645.2025.00025","DOIUrl":"10.1109/ICHI64645.2025.00025","url":null,"abstract":"<p><p>Accurate interpretation of lab results is crucial in clinical medicine, yet most patient portals use universal normal ranges, ignoring conditional factors like age and gender. This study introduces Lab-AI, an interactive system that offers personalized normal ranges using retrieval-augmented generation (RAG) from credible health sources. Lab-AI has two modules: factor retrieval and normal range retrieval. We tested these on 122 lab tests: 40 with conditional factors and 82 without. For tests with factors, normal ranges depend on patient-specific information. Our results show GPT-4-turbo with RAG achieved a 0.948 F1 score for factor retrieval and 0.995 accuracy for normal range retrieval. GPT-4-turbo with RAG outperformed the best non-RAG system by 33.5% in factor retrieval and showed 132% and 100% improvements in question-level and lab-level performance, respectively, for normal range retrieval. These findings highlight Lab-AI's potential to enhance patient understanding of lab results.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2025 ","pages":"359-368"},"PeriodicalIF":0.0,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12707499/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145776375","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Attention-based Imputation of Missing Values in Electronic Health Records Tabular Data. 基于注意力的电子健康记录表格数据缺失值估算。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00030
Ibna Kowsar, Shourav B Rabbani, Manar D Samad

The imputation of missing values (IMV) in electronic health records tabular data is crucial to enable machine learning for patient-specific predictive modeling. While IMV methods are developed in biostatistics and recently in machine learning, deep learning-based solutions have shown limited success in learning tabular data. This paper proposes a novel attention-based missing value imputation framework that learns to reconstruct data with missing values leveraging between-feature (self-attention) or between-sample attentions. We adopt data manipulation methods used in contrastive learning to improve the generalization of the trained imputation model. The proposed self-attention imputation method outperforms state-of-the-art statistical and machine learning-based (decision-tree) imputation methods, reducing the normalized root mean squared error by 18.4% to 74.7% on five tabular data sets and 52.6% to 82.6% on two electronic health records data sets. The proposed attention-based missing value imputation method shows superior performance across a wide range of missingness (10% to 50%) when the values are missing completely at random.

电子健康记录表格数据中缺失值的估算(IMV)对于机器学习进行特定患者预测建模至关重要。虽然生物统计学和最近的机器学习领域都开发了缺失值估算方法,但基于深度学习的解决方案在学习表格数据方面的成功率有限。本文提出了一种新颖的基于注意力的缺失值估算框架,它能利用特征间(自我注意力)或样本间注意力学习重建缺失值数据。我们采用了对比学习中使用的数据处理方法,以提高训练有素的估算模型的泛化能力。所提出的自我注意力估算方法优于最先进的统计和基于机器学习(决策树)的估算方法,在五个表格数据集上将归一化均方根误差降低了 18.4% 到 74.7%,在两个电子健康记录数据集上将归一化均方根误差降低了 52.6% 到 82.6%。当数值完全随机缺失时,所提出的基于注意力的缺失值估算方法在很大的缺失率范围(10% 到 50%)内都表现出了卓越的性能。
{"title":"Attention-based Imputation of Missing Values in Electronic Health Records Tabular Data.","authors":"Ibna Kowsar, Shourav B Rabbani, Manar D Samad","doi":"10.1109/ichi61247.2024.00030","DOIUrl":"10.1109/ichi61247.2024.00030","url":null,"abstract":"<p><p>The imputation of missing values (IMV) in electronic health records tabular data is crucial to enable machine learning for patient-specific predictive modeling. While IMV methods are developed in biostatistics and recently in machine learning, deep learning-based solutions have shown limited success in learning tabular data. This paper proposes a novel attention-based missing value imputation framework that learns to reconstruct data with missing values leveraging between-feature (self-attention) or between-sample attentions. We adopt data manipulation methods used in contrastive learning to improve the generalization of the trained imputation model. The proposed self-attention imputation method outperforms state-of-the-art statistical and machine learning-based (decision-tree) imputation methods, reducing the normalized root mean squared error by 18.4% to 74.7% on five tabular data sets and 52.6% to 82.6% on two electronic health records data sets. The proposed attention-based missing value imputation method shows superior performance across a wide range of missingness (10% to 50%) when the values are missing completely at random.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"177-182"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11463999/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142395730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Online Transfer Learning for RSV Case Detection. RSV病例检测的在线迁移学习。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00074
Yiming Sun, Yuhe Gao, Runxue Bao, Gregory F Cooper, Jessi Espino, Harry Hochheiser, Marian G Michaels, John M Aronis, Chenxi Song, Ye Ye

Transfer learning has become a pivotal technique in machine learning and has proven to be effective in various real-world applications. However, utilizing this technique for classification tasks with sequential data often faces challenges, primarily attributed to the scarcity of class labels. To address this challenge, we introduce Multi-Source Adaptive Weighting (MSAW), an online multi-source transfer learning method. MSAW integrates a dynamic weighting mechanism into an ensemble framework, enabling automatic adjustment of weights based on the relevance and contribution of each source (representing historical knowledge) and target model (learning from newly acquired data). We demonstrate the effectiveness of MSAW by applying it to detect Respiratory Syncytial Virus cases within Emergency Department visits, utilizing multiple years of electronic health records from the University of Pittsburgh Medical Center. Our method demonstrates performance improvements over many baselines, including refining pre-trained models with online learning as well as three static weighting approaches, showing MSAW's capacity to integrate historical knowledge with progressively accumulated new data. This study indicates the potential of online transfer learning in healthcare, particularly for developing machine learning models that dynamically adapt to evolving situations where new data is incrementally accumulated.

迁移学习已经成为机器学习中的一项关键技术,并已被证明在各种实际应用中是有效的。然而,将这种技术用于具有顺序数据的分类任务常常面临挑战,这主要归因于类标签的稀缺性。为了解决这一挑战,我们引入了多源自适应加权(MSAW),一种在线多源迁移学习方法。MSAW将动态加权机制集成到集成框架中,可以根据每个源(代表历史知识)和目标模型(从新获取的数据中学习)的相关性和贡献自动调整权重。我们利用匹兹堡大学医学中心多年来的电子健康记录,将MSAW应用于检测急诊科就诊中的呼吸道合胞病毒病例,从而证明了MSAW的有效性。我们的方法证明了在许多基线上的性能改进,包括使用在线学习和三种静态加权方法来改进预训练模型,显示了MSAW将历史知识与逐步积累的新数据整合在一起的能力。这项研究表明了在线迁移学习在医疗保健领域的潜力,特别是在开发动态适应新数据逐渐积累的不断变化的情况的机器学习模型方面。
{"title":"Online Transfer Learning for RSV Case Detection.","authors":"Yiming Sun, Yuhe Gao, Runxue Bao, Gregory F Cooper, Jessi Espino, Harry Hochheiser, Marian G Michaels, John M Aronis, Chenxi Song, Ye Ye","doi":"10.1109/ichi61247.2024.00074","DOIUrl":"10.1109/ichi61247.2024.00074","url":null,"abstract":"<p><p>Transfer learning has become a pivotal technique in machine learning and has proven to be effective in various real-world applications. However, utilizing this technique for classification tasks with sequential data often faces challenges, primarily attributed to the scarcity of class labels. To address this challenge, we introduce <i>Multi-Source Adaptive Weighting</i> (MSAW), an online multi-source transfer learning method. MSAW integrates a dynamic weighting mechanism into an ensemble framework, enabling automatic adjustment of weights based on the relevance and contribution of each source (representing historical knowledge) and target model (learning from newly acquired data). We demonstrate the effectiveness of MSAW by applying it to detect Respiratory Syncytial Virus cases within Emergency Department visits, utilizing multiple years of electronic health records from the University of Pittsburgh Medical Center. Our method demonstrates performance improvements over many baselines, including refining pre-trained models with online learning as well as three static weighting approaches, showing MSAW's capacity to integrate historical knowledge with progressively accumulated new data. This study indicates the potential of online transfer learning in healthcare, particularly for developing machine learning models that dynamically adapt to evolving situations where new data is incrementally accumulated.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"512-521"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12086431/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144103272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Professional Radiologists' Expertise to Enhance LLMs' Evaluation for AI-generated Radiology Reports. 利用专业放射学专家的专业知识,加强法律硕士对人工智能生成的放射学报告的评估。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00058
Qingqing Zhu, Xiuying Chen, Qiao Jin, Benjamin Hou, Tejas Sudharshan Mathai, Pritam Mukherjee, Xin Gao, Ronald M Summers, Zhiyong Lu

In radiology, Artificial Intelligence (AI) has significantly advanced report generation, but automatic evaluation of these AI-produced reports remains challenging. Current metrics, such as Conventional Natural Language Generation (NLG) and Clinical Efficacy (CE), often fall short in capturing the semantic intricacies of clinical contexts or overemphasize clinical details, undermining report clarity. To overcome these issues, our proposed method synergizes the expertise of professional radiologists with Large Language Models (LLMs), like GPT-3.5 and GPT-4. Utilizing In-Context Instruction Learning (ICIL) and Chain of Thought (CoT) reasoning, our approach aligns LLM evaluations with radiologist standards, enabling detailed comparisons between human and AI-generated reports. This is further enhanced by a Regression model that aggregates sentence evaluation scores. Experimental results show that our "Detailed GPT-4 (5-shot)" model achieves a correlation that is 0.48, outperforming the METEOR metric by 0.19, while our "Regressed GPT-4" model shows even greater alignment(0.64) with expert evaluations, exceeding the best existing metric by a 0.35 margin. Moreover, the robustness of our explanations has been validated through a thorough iterative strategy. We plan to publicly release annotations from radiology experts, setting a new standard for accuracy in future assessments. This underscores the potential of our approach in enhancing the quality assessment of AI-driven medical reports.

在放射学中,人工智能(AI)在报告生成方面取得了显著进展,但对这些人工智能生成的报告进行自动评估仍然具有挑战性。目前的指标,如常规自然语言生成(NLG)和临床疗效(CE),往往在捕捉临床上下文的语义复杂性或过分强调临床细节方面不足,破坏了报告的清晰度。为了克服这些问题,我们提出的方法将专业放射科医生的专业知识与大型语言模型(llm)(如GPT-3.5和GPT-4)相结合。利用情境教学(ICIL)和思维链(CoT)推理,我们的方法将LLM评估与放射科医生的标准保持一致,从而可以详细比较人类和人工智能生成的报告。这是进一步增强的回归模型,汇总句子评价分数。实验结果表明,我们的“详细GPT-4 (5-shot)”模型实现了0.48的相关性,比METEOR指标高出0.19,而我们的“回归GPT-4”模型与专家评估的一致性更高(0.64),比现有的最佳指标高出0.35。此外,我们的解释的健壮性已经通过一个彻底的迭代策略得到了验证。我们计划公开发布放射学专家的注释,为未来评估的准确性制定新的标准。这凸显了我们的方法在加强人工智能驱动的医疗报告的质量评估方面的潜力。
{"title":"Leveraging Professional Radiologists' Expertise to Enhance LLMs' Evaluation for AI-generated Radiology Reports.","authors":"Qingqing Zhu, Xiuying Chen, Qiao Jin, Benjamin Hou, Tejas Sudharshan Mathai, Pritam Mukherjee, Xin Gao, Ronald M Summers, Zhiyong Lu","doi":"10.1109/ichi61247.2024.00058","DOIUrl":"10.1109/ichi61247.2024.00058","url":null,"abstract":"<p><p>In radiology, Artificial Intelligence (AI) has significantly advanced report generation, but automatic evaluation of these AI-produced reports remains challenging. Current metrics, such as Conventional Natural Language Generation (NLG) and Clinical Efficacy (CE), often fall short in capturing the semantic intricacies of clinical contexts or overemphasize clinical details, undermining report clarity. To overcome these issues, our proposed method synergizes the expertise of professional radiologists with Large Language Models (LLMs), like GPT-3.5 and GPT-4. Utilizing In-Context Instruction Learning (ICIL) and Chain of Thought (CoT) reasoning, our approach aligns LLM evaluations with radiologist standards, enabling detailed comparisons between human and AI-generated reports. This is further enhanced by a Regression model that aggregates sentence evaluation scores. Experimental results show that our \"Detailed GPT-4 (5-shot)\" model achieves a correlation that is 0.48, outperforming the METEOR metric by 0.19, while our \"Regressed GPT-4\" model shows even greater alignment(0.64) with expert evaluations, exceeding the best existing metric by a 0.35 margin. Moreover, the robustness of our explanations has been validated through a thorough iterative strategy. We plan to publicly release annotations from radiology experts, setting a new standard for accuracy in future assessments. This underscores the potential of our approach in enhancing the quality assessment of AI-driven medical reports.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"402-411"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11651630/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142848618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying Symptoms of Delirium from Clinical Narratives Using Natural Language Processing. 用自然语言处理从临床叙述中识别谵妄症状
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00046
Aokun Chen, Daniel Paredes, Zehao Yu, Xiwei Lou, Roberta Brunson, Jamie N Thomas, Kimberly A Martinez, Robert J Lucero, Tanja Magoc, Laurence M Solberg, Urszula A Snigurska, Sarah E Ser, Mattia Prosperi, Jiang Bian, Ragnhildur I Bjarnadottir, Yonghui Wu

Delirium is an acute decline or fluctuation in attention, awareness, or other cognitive function that can lead to serious adverse outcomes. Despite the severe outcomes, delirium is frequently unrecognized and uncoded in patients' electronic health records (EHRs) due to its transient and diverse nature. Natural language processing (NLP), a key technology that extracts medical concepts from clinical narratives, has shown great potential in studies of delirium outcomes and symptoms. To assist in the diagnosis and phenotyping of delirium, we formed an expert panel to categorize diverse delirium symptoms, composed annotation guidelines, created a delirium corpus with diverse delirium symptoms, and developed NLP methods to extract delirium symptoms from clinical notes. We compared 5 state-of-the-art transformer models including 2 models (BERT and RoBERTa) from the general domain and 3 models (BERT_MIMIC, RoBERTa_MIMIC, and GatorTron) from the clinical domain. GatorTron achieved the best strict and lenient F1 scores of 0.8055 and 0.8759, respectively. We conducted an error analysis to identify challenges in annotating delirium symptoms and developing NLP systems. To the best of our knowledge, this is the first large language model-based delirium symptom extraction system. Our study lays the foundation for the future development of computable phenotypes and diagnosis methods for delirium.

谵妄是注意力、意识或其他认知功能的急性下降或波动,可导致严重的不良后果。尽管谵妄的后果严重,但由于其短暂性和多样性,谵妄在患者的电子健康记录(EHRs)中经常未被识别和未编码。自然语言处理(NLP)是一项从临床叙述中提取医学概念的关键技术,在谵妄结局和症状的研究中显示出巨大的潜力。为了帮助谵妄的诊断和分型,我们组建了专家小组对谵妄的多种症状进行分类,编写了注释指南,创建了包含多种谵妄症状的谵妄语料库,并开发了从临床记录中提取谵妄症状的NLP方法。我们比较了5种最先进的变压器模型,其中2种模型(BERT和RoBERTa)来自一般领域,3种模型(BERT_MIMIC, RoBERTa_MIMIC和GatorTron)来自临床领域。GatorTron在严格F1和宽松F1中得分最高,分别为0.8055和0.8759。我们进行了错误分析,以确定在注释谵妄症状和开发NLP系统方面的挑战。据我们所知,这是第一个基于语言模型的谵妄症状提取系统。本研究为今后谵妄的可计算表型和诊断方法的发展奠定了基础。
{"title":"Identifying Symptoms of Delirium from Clinical Narratives Using Natural Language Processing.","authors":"Aokun Chen, Daniel Paredes, Zehao Yu, Xiwei Lou, Roberta Brunson, Jamie N Thomas, Kimberly A Martinez, Robert J Lucero, Tanja Magoc, Laurence M Solberg, Urszula A Snigurska, Sarah E Ser, Mattia Prosperi, Jiang Bian, Ragnhildur I Bjarnadottir, Yonghui Wu","doi":"10.1109/ichi61247.2024.00046","DOIUrl":"10.1109/ichi61247.2024.00046","url":null,"abstract":"<p><p>Delirium is an acute decline or fluctuation in attention, awareness, or other cognitive function that can lead to serious adverse outcomes. Despite the severe outcomes, delirium is frequently unrecognized and uncoded in patients' electronic health records (EHRs) due to its transient and diverse nature. Natural language processing (NLP), a key technology that extracts medical concepts from clinical narratives, has shown great potential in studies of delirium outcomes and symptoms. To assist in the diagnosis and phenotyping of delirium, we formed an expert panel to categorize diverse delirium symptoms, composed annotation guidelines, created a delirium corpus with diverse delirium symptoms, and developed NLP methods to extract delirium symptoms from clinical notes. We compared 5 state-of-the-art transformer models including 2 models (BERT and RoBERTa) from the general domain and 3 models (BERT_MIMIC, RoBERTa_MIMIC, and GatorTron) from the clinical domain. GatorTron achieved the best strict and lenient F1 scores of 0.8055 and 0.8759, respectively. We conducted an error analysis to identify challenges in annotating delirium symptoms and developing NLP systems. To the best of our knowledge, this is the first large language model-based delirium symptom extraction system. Our study lays the foundation for the future development of computable phenotypes and diagnosis methods for delirium.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"305-311"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11670120/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142900616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learning to Rank Complex Biomedical Hypotheses for Accelerating Scientific Discovery. 学习对复杂生物医学假设进行排序以加速科学发现。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00044
Juncheng Ding, Shailesh Dahal, Bijaya Adhikari, Kishlay Jha

Hypothesis generation (HG) is a fundamental problem in biomedical text mining that uncovers plausible implicit links ( B terms) between two disjoint concepts of interest ( A and C terms). Over the past decade, many HG approaches based on distributional statistics, graph-theoretic measures, and supervised machine learning methods have been proposed. Despite significant advances made, the existing approaches have two major limitations. First, they mainly focus on enumerating hypotheses and often neglect to rank them in a semantically meaningful way. This leads to wasted time and resources as researchers may focus on hypotheses that are ultimately not supported by experimental evidence. Second, the existing approaches are designed to rank hypotheses with only one intermediate or evidence term (referred as simple hypotheses), and thus are unable to handle hypotheses with multiple intermediate terms (referred as complex hypotheses). This is limiting because recent research has shown that the complex hypotheses could be of greater practical value than simple ones, especially in the early stages of scientific discovery. To address these issues, we propose a new HG ranking approach that leverages upon the expressive power of Graph Neural Networks (GNN) coupled with a domain-knowledge guided Noise-Contrastive Estimation (NCE) strategy to effectively rank both simple and complex biomedical hypotheses. Specifically, the message passing capabilities of GNN allows our approach to capture the rich interactions between biomedical entities and succinctly handle the complex hypotheses with variable intermediate terms. Moreover, the proposed domain knowledge-guided NCE strategy enables the ranking of complex hypotheses based on their coherence with the established biomedical knowledge. Extensive experiment results on five recognized biomedical datasets show that the proposed approach consistently outperforms the existing baselines and prioritizes hypotheses worthy of potential clinical trials.

假设生成(HG)是生物医学文本挖掘中的一个基本问题,它揭示了两个不相交的感兴趣概念(a项和C项)之间可能的隐含联系(B项)。在过去的十年中,已经提出了许多基于分布统计、图论度量和监督机器学习方法的HG方法。尽管取得了重大进展,但现有的方法有两个主要局限性。首先,它们主要侧重于列举假设,而往往忽略了以语义有意义的方式对假设进行排序。这导致浪费时间和资源,因为研究人员可能会专注于最终没有实验证据支持的假设。其次,现有的方法被设计为只有一个中间项或证据项的假设排序(称为简单假设),因此无法处理具有多个中间项的假设(称为复杂假设)。这是有限的,因为最近的研究表明,复杂的假设可能比简单的更有实际价值,特别是在科学发现的早期阶段。为了解决这些问题,我们提出了一种新的HG排序方法,该方法利用图神经网络(GNN)的表达能力以及领域知识引导的噪声对比估计(NCE)策略来有效地对简单和复杂的生物医学假设进行排序。具体来说,GNN的消息传递能力使我们的方法能够捕获生物医学实体之间丰富的相互作用,并简洁地处理具有可变中间项的复杂假设。此外,提出的领域知识引导的NCE策略可以根据复杂假设与已建立的生物医学知识的一致性对其进行排名。在五个公认的生物医学数据集上进行的大量实验结果表明,所提出的方法始终优于现有的基线,并优先考虑值得潜在临床试验的假设。
{"title":"Learning to Rank Complex Biomedical Hypotheses for Accelerating Scientific Discovery.","authors":"Juncheng Ding, Shailesh Dahal, Bijaya Adhikari, Kishlay Jha","doi":"10.1109/ichi61247.2024.00044","DOIUrl":"10.1109/ichi61247.2024.00044","url":null,"abstract":"<p><p>Hypothesis generation (HG) is a fundamental problem in biomedical text mining that uncovers plausible implicit links ( <math><mi>B</mi></math> terms) between two disjoint concepts of interest ( <math><mi>A</mi></math> and <math><mi>C</mi></math> terms). Over the past decade, many HG approaches based on distributional statistics, graph-theoretic measures, and supervised machine learning methods have been proposed. Despite significant advances made, the existing approaches have two major limitations. First, they mainly focus on enumerating hypotheses and often neglect to rank them in a semantically meaningful way. This leads to wasted time and resources as researchers may focus on hypotheses that are ultimately not supported by experimental evidence. Second, the existing approaches are designed to rank hypotheses with only one intermediate or evidence term (referred as simple hypotheses), and thus are unable to handle hypotheses with multiple intermediate terms (referred as complex hypotheses). This is limiting because recent research has shown that the complex hypotheses could be of greater practical value than simple ones, especially in the early stages of scientific discovery. To address these issues, we propose a new HG ranking approach that leverages upon the expressive power of Graph Neural Networks (GNN) coupled with a domain-knowledge guided Noise-Contrastive Estimation (NCE) strategy to effectively rank both simple and complex biomedical hypotheses. Specifically, the message passing capabilities of GNN allows our approach to capture the rich interactions between biomedical entities and succinctly handle the complex hypotheses with variable intermediate terms. Moreover, the proposed domain knowledge-guided NCE strategy enables the ranking of complex hypotheses based on their coherence with the established biomedical knowledge. Extensive experiment results on five recognized biomedical datasets show that the proposed approach consistently outperforms the existing baselines and prioritizes hypotheses worthy of potential clinical trials.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"285-293"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11920884/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143665495","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assertion Detection in Clinical Natural Language Processing using Large Language Models. 基于大型语言模型的临床自然语言处理断言检测。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00039
Yuelyu Ji, Zeshui Yu, Yanshan Wang

In this study, we aim to address the task of assertion detection when extracting medical concepts from clinical notes, a key process in clinical natural language processing (NLP). Assertion detection in clinical NLP usually involves identifying assertion types for medical concepts in the clinical text, namely certainty (whether the medical concept is positive, negated, possible, or hypothetical), temporality (whether the medical concept is for present or the past history), and experiencer (whether the medical concept is described for the patient or a family member). These assertion types are essential for healthcare professionals to quickly and clearly understand the context of medical conditions from unstructured clinical texts, directly influencing the quality and outcomes of patient care. Although widely used, traditional methods, particularly rule-based NLP systems and machine learning or deep learning models, demand intensive manual efforts to create patterns and tend to overlook less common assertion types, leading to an incomplete understanding of the context. To address this challenge, our research introduces a novel methodology that utilizes Large Language Models (LLMs) pre-trained on a vast array of medical data for assertion detection. We enhanced the current method with advanced reasoning techniques, including Tree of Thought (ToT), Chain of Thought (CoT), and Self-Consistency (SC), and refine it further with Low-Rank Adaptation (LoRA) fine-tuning. We first evaluated the model on the i2b2 2010 assertion dataset. Our method achieved a micro-averaged F-1 of 0.89, with 0.11 improvements over the previous works. To further assess the generalizability of our approach, we extended our evaluation to a local dataset that focused on sleep concept extraction. Our approach achieved an F-1 of 0.74, which is 0.31 higher than the previous method. The results show that using LLMs is a viable option for assertion detection in clinical NLP and can potentially integrate with other LLM-based concept extraction models for clinical NLP tasks.

在本研究中,我们旨在解决临床自然语言处理(NLP)中的关键过程——从临床笔记中提取医学概念时的断言检测问题。临床NLP中的断言检测通常涉及识别临床文本中医学概念的断言类型,即确定性(医学概念是肯定的、否定的、可能的还是假设的)、时间性(医学概念是针对现在还是过去的历史)和经验者(医学概念是针对患者还是家庭成员描述的)。这些断言类型对于医疗保健专业人员从非结构化临床文本中快速清晰地理解医疗条件的背景至关重要,直接影响患者护理的质量和结果。尽管被广泛使用,传统方法,特别是基于规则的NLP系统和机器学习或深度学习模型,需要大量的手工工作来创建模式,并且往往忽略了不太常见的断言类型,导致对上下文的不完整理解。为了应对这一挑战,我们的研究引入了一种新的方法,该方法利用在大量医疗数据上预训练的大型语言模型(llm)进行断言检测。我们采用先进的推理技术,包括思想树(ToT)、思想链(CoT)和自一致性(SC),对现有方法进行了改进,并进一步采用低秩自适应(LoRA)微调对其进行了改进。我们首先在i2b2 2010断言数据集上评估该模型。我们的方法实现了0.89的微平均F-1,比以前的工作提高了0.11。为了进一步评估我们方法的普遍性,我们将我们的评估扩展到一个专注于睡眠概念提取的本地数据集。我们的方法获得了0.74的F-1,比以前的方法高0.31。结果表明,使用llm是临床NLP中断言检测的可行选择,并且可以与其他基于llm的概念提取模型集成,用于临床NLP任务。
{"title":"Assertion Detection in Clinical Natural Language Processing using Large Language Models.","authors":"Yuelyu Ji, Zeshui Yu, Yanshan Wang","doi":"10.1109/ichi61247.2024.00039","DOIUrl":"10.1109/ichi61247.2024.00039","url":null,"abstract":"<p><p>In this study, we aim to address the task of assertion detection when extracting medical concepts from clinical notes, a key process in clinical natural language processing (NLP). Assertion detection in clinical NLP usually involves identifying assertion types for medical concepts in the clinical text, namely certainty (whether the medical concept is positive, negated, possible, or hypothetical), temporality (whether the medical concept is for present or the past history), and experiencer (whether the medical concept is described for the patient or a family member). These assertion types are essential for healthcare professionals to quickly and clearly understand the context of medical conditions from unstructured clinical texts, directly influencing the quality and outcomes of patient care. Although widely used, traditional methods, particularly rule-based NLP systems and machine learning or deep learning models, demand intensive manual efforts to create patterns and tend to overlook less common assertion types, leading to an incomplete understanding of the context. To address this challenge, our research introduces a novel methodology that utilizes Large Language Models (LLMs) pre-trained on a vast array of medical data for assertion detection. We enhanced the current method with advanced reasoning techniques, including Tree of Thought (ToT), Chain of Thought (CoT), and Self-Consistency (SC), and refine it further with Low-Rank Adaptation (LoRA) fine-tuning. We first evaluated the model on the i2b2 2010 assertion dataset. Our method achieved a micro-averaged F-1 of 0.89, with 0.11 improvements over the previous works. To further assess the generalizability of our approach, we extended our evaluation to a local dataset that focused on sleep concept extraction. Our approach achieved an F-1 of 0.74, which is 0.31 higher than the previous method. The results show that using LLMs is a viable option for assertion detection in clinical NLP and can potentially integrate with other LLM-based concept extraction models for clinical NLP tasks.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"242-247"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11908446/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143652454","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An average-case efficient two-stage algorithm for enumerating all longest common substrings of minimum length k between genome pairs. 一种平均情况下高效的两阶段算法,用于枚举基因组对之间最小长度为 k 的所有最长公共子串。
Pub Date : 2024-06-01 Epub Date: 2024-08-22 DOI: 10.1109/ichi61247.2024.00020
Mattia Prosperi, Simone Marini, Christina Boucher

A problem extension of the longest common substring (LCS) between two texts is the enumeration of all LCSs given a minimum length k (ALCS- k ), along with their positions in each text. In bioinformatics, an efficient solution to the ALCS- k for very long texts -genomes or metagenomes- can provide useful insights to discover genetic signatures responsible for biological mechanisms. The ALCS- k problem has two additional requirements compared to the LCS problem: one is the minimum length k , and the other is that all common strings longer than k must be reported. We present an efficient, two-stage ALCS- k algorithm exploiting the spectrum of text substrings of length k ( k -mers). Our approach yields a worst-case time complexity loglinear in the number of k -mers for the first stage, and an average-case loglinear in the number of common k -mers for the second stage (several orders of magnitudes smaller than the total k -mer spectrum). The space complexity is linear in the first phase (disk-based), and on average linear in the second phase (disk- and memory-based). Tests performed on genomes for different organisms (including viruses, bacteria and animal chromosomes) show that run times are consistent with our theoretical estimates; further, comparisons with MUMmer4 show an asymptotic advantage with divergent genomes.

两个文本之间最长公共子串(LCS)问题的扩展是枚举给定最小长度 k 的所有 LCS(ALCS- k)以及它们在每个文本中的位置。在生物信息学中,针对超长文本--基因组或元基因组--的 ALCS- k 的有效解决方案可以为发现生物机制的遗传特征提供有用的见解。与 LCS 问题相比,ALCS- k 问题有两个额外的要求:一个是最小长度 k,另一个是必须报告所有长于 k 的普通字符串。我们提出了一种高效的两阶段 ALCS- k 算法,该算法利用了长度为 k 的文本子串谱(k -mers)。我们的方法在最坏情况下,第一阶段的时间复杂度与 k -mers 的数量成对数线性关系,在平均情况下,第二阶段的时间复杂度与常见 k -mers 的数量成对数线性关系(比总 k -mers 频谱小几个数量级)。空间复杂度在第一阶段(基于磁盘)是线性的,在第二阶段(基于磁盘和内存)平均是线性的。在不同生物体(包括病毒、细菌和动物染色体)基因组上进行的测试表明,运行时间与我们的理论估计值一致;此外,与 MUMmer4 的比较显示,在不同基因组上具有渐进优势。
{"title":"<ArticleTitle xmlns:ns0=\"http://www.w3.org/1998/Math/MathML\">An average-case efficient two-stage algorithm for enumerating all longest common substrings of minimum length <ns0:math><ns0:mi>k</ns0:mi></ns0:math> between genome pairs.","authors":"Mattia Prosperi, Simone Marini, Christina Boucher","doi":"10.1109/ichi61247.2024.00020","DOIUrl":"10.1109/ichi61247.2024.00020","url":null,"abstract":"<p><p>A problem extension of the longest common substring (LCS) between two texts is the enumeration of all LCSs given a minimum length <math><mi>k</mi></math> (ALCS- <math><mi>k</mi></math> ), along with their positions in each text. In bioinformatics, an efficient solution to the ALCS- <math><mi>k</mi></math> for very long texts -genomes or metagenomes- can provide useful insights to discover genetic signatures responsible for biological mechanisms. The ALCS- <math><mi>k</mi></math> problem has two additional requirements compared to the LCS problem: one is the minimum length <math><mi>k</mi></math> , and the other is that all common strings longer than <math><mi>k</mi></math> must be reported. We present an efficient, two-stage ALCS- <math><mi>k</mi></math> algorithm exploiting the spectrum of text substrings of length <math><mi>k</mi></math> ( <math><mi>k</mi></math> -mers). Our approach yields a worst-case time complexity loglinear in the number of <math><mi>k</mi></math> -mers for the first stage, and an average-case loglinear in the number of common <math><mi>k</mi></math> -mers for the second stage (several orders of magnitudes smaller than the total <math><mi>k</mi></math> -mer spectrum). The space complexity is linear in the first phase (disk-based), and on average linear in the second phase (disk- and memory-based). Tests performed on genomes for different organisms (including viruses, bacteria and animal chromosomes) show that run times are consistent with our theoretical estimates; further, comparisons with MUMmer4 show an asymptotic advantage with divergent genomes.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2024 ","pages":"93-102"},"PeriodicalIF":0.0,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11412151/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142302596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1