首页 > 最新文献

IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics最新文献

英文 中文
Exploring the Effect of Eligibility Criteria on AD Severity and Severe Adverse Event in Eligible Patients. 探讨合格标准对符合条件的患者的注意力缺失严重程度和严重不良事件的影响。
Pub Date : 2023-06-01 Epub Date: 2023-12-11 DOI: 10.1109/ichi57859.2023.00139
Aokun Chen, Qian Li, Elizabeth Shenkman, Yonghui Wu, Yi Guo, Jiang Bian

Clinical trials were vital tools to prove the effectiveness and safety of medications. To maximize generalizability, the study sample should represent the sample population and the target population. However, the clinical trial design tends to favor the evaluation of drug safety and procedure (i.e., internal validity) without clear knowledge of its penalty on trial generalizability (i.e., external validity). Alzheimer's Disease (AD) trials are known to have generalizability issues. Thus, in this study, we explore the effect of eligibility criteria on the AD severity patients and the severe adverse event (SAE) among the eligible patients.

临床试验是证明药物有效性和安全性的重要工具。为了最大限度地提高可推广性,研究样本应代表样本人群和目标人群。然而,临床试验设计往往偏重于药物安全性和程序的评估(即内部效度),而不清楚其对试验可推广性(即外部效度)的影响。众所周知,阿尔茨海默病(AD)试验存在可推广性问题。因此,在本研究中,我们探讨了合格标准对 AD 严重程度患者和合格患者中严重不良事件(SAE)的影响。
{"title":"Exploring the Effect of Eligibility Criteria on AD Severity and Severe Adverse Event in Eligible Patients.","authors":"Aokun Chen, Qian Li, Elizabeth Shenkman, Yonghui Wu, Yi Guo, Jiang Bian","doi":"10.1109/ichi57859.2023.00139","DOIUrl":"10.1109/ichi57859.2023.00139","url":null,"abstract":"<p><p>Clinical trials were vital tools to prove the effectiveness and safety of medications. To maximize generalizability, the study sample should represent the sample population and the target population. However, the clinical trial design tends to favor the evaluation of drug safety and procedure (i.e., internal validity) without clear knowledge of its penalty on trial generalizability (i.e., external validity). Alzheimer's Disease (AD) trials are known to have generalizability issues. Thus, in this study, we explore the effect of eligibility criteria on the AD severity patients and the severe adverse event (SAE) among the eligible patients.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2023 ","pages":"756-759"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11273173/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141790216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of Offensive Language in Social Media Using Prompt Learning. 运用提示学习识别社交媒体中的冒犯性语言。
Pub Date : 2023-06-01 Epub Date: 2023-12-11 DOI: 10.1109/ichi57859.2023.00122
Leilei Su, Yifan Peng, Zezheng Wang, Cong Sun

Offensive language refers to the use of language in a manner that may offend or harm others who are within earshot or view in a public place. Given the importance of identifying such language in social media for promoting emotional well-being, we propose a prompt learning method and compare its performance with fine-tuning on two widely used datasets, HatEval and OffensEval. Experimental results demonstrate that prompt learning can achieve a performance improvement over fine-tuning in a fully supervised setting.

冒犯性语言是指在公共场所使用可能冒犯或伤害他人的语言。鉴于在社交媒体中识别此类语言对于促进情绪健康的重要性,我们提出了一种快速学习方法,并将其性能与两个广泛使用的数据集(HatEval和OffensEval)的微调进行了比较。实验结果表明,在完全监督的情况下,快速学习比微调更能提高性能。
{"title":"Identification of Offensive Language in Social Media Using Prompt Learning.","authors":"Leilei Su, Yifan Peng, Zezheng Wang, Cong Sun","doi":"10.1109/ichi57859.2023.00122","DOIUrl":"10.1109/ichi57859.2023.00122","url":null,"abstract":"<p><p>Offensive language refers to the use of language in a manner that may offend or harm others who are within earshot or view in a public place. Given the importance of identifying such language in social media for promoting emotional well-being, we propose a prompt learning method and compare its performance with fine-tuning on two widely used datasets, HatEval and OffensEval. Experimental results demonstrate that prompt learning can achieve a performance improvement over fine-tuning in a fully supervised setting.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2023 ","pages":"690-691"},"PeriodicalIF":0.0,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11811837/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143400880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mitigating Membership Inference in Deep Learning Applications with High Dimensional Genomic Data. 基于高维基因组数据的深度学习应用中的隶属推理缓解。
Chonghao Zhang, Luca Bonomi

The use of deep learning techniques in medical applications holds great promises for advancing health care. However, there are growing privacy concerns regarding what information about individual data contributors (i.e., patients in the training set) these deep models may reveal when shared with external users. In this work, we first investigate the membership privacy risks in sharing deep learning models for cancer genomics tasks, and then study the applicability of privacy-protecting strategies for mitigating these privacy risks.

深度学习技术在医疗应用中的应用为推进医疗保健带来了巨大的希望。然而,对于这些深度模型在与外部用户共享时可能泄露的个人数据贡献者(即训练集中的患者)的信息,人们越来越关注隐私问题。在这项工作中,我们首先研究了癌症基因组学任务共享深度学习模型中的成员隐私风险,然后研究了隐私保护策略在减轻这些隐私风险方面的适用性。
{"title":"Mitigating Membership Inference in Deep Learning Applications with High Dimensional Genomic Data.","authors":"Chonghao Zhang,&nbsp;Luca Bonomi","doi":"10.1109/ichi54592.2022.00101","DOIUrl":"https://doi.org/10.1109/ichi54592.2022.00101","url":null,"abstract":"<p><p>The use of deep learning techniques in medical applications holds great promises for advancing health care. However, there are growing privacy concerns regarding what information about individual data contributors (i.e., patients in the training set) these deep models may reveal when shared with external users. In this work, we first investigate the membership privacy risks in sharing deep learning models for cancer genomics tasks, and then study the applicability of privacy-protecting strategies for mitigating these privacy risks.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2022 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9473339/pdf/nihms-1815588.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10181248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mining Social Media Data to Predict COVID-19 Case Counts. 挖掘社交媒体数据预测COVID-19病例数
Pub Date : 2022-06-01 Epub Date: 2022-09-08 DOI: 10.1109/ichi54592.2022.00027
Maksims Kazijevs, Furkan A Akyelken, Manar D Samad

The unpredictability and unknowns surrounding the ongoing coronavirus disease (COVID-19) pandemic have led to an unprecedented consequence taking a heavy toll on the lives and economies of all countries. There have been efforts to predict COVID-19 case counts (CCC) using epidemiological data and numerical tokens online, which may allow early preventive measures to slow the spread of the disease. In this paper, we use state-of-the-art natural language processing (NLP) algorithms to numerically encode COVID-19 related tweets originated from eight cities in the United States and predict city-specific CCC up to eight days in the future. A city-embedding is proposed to obtain a time series representation of daily tweets posted from a city, which is then used to predict case counts using a custom long-short term memory (LSTM) model. The universal sentence encoder yields the best normalized root mean squared error (NRMSE) 0.090 (0.039), averaged across all cities in predicting CCC six days in the future. The R 2 scores in predicting CCC are more than 0.70 and often over 0.8, which suggests a strong correlation between the actual and our model predicted CCC values. Our analyses show that the NRMSE and R 2 scores are consistently robust across different cities and different numbers of time steps in time series data. Results show that the LSTM model can learn the mapping between the NLP-encoded tweet semantics and the case counts, which infers that social media text can be directly mined to identify the future course of the pandemic.

正在进行的冠状病毒病(COVID-19)大流行的不可预测性和不确定性导致了前所未有的后果,给所有国家的生命和经济造成了沉重打击。人们一直在努力利用流行病学数据和数字代币在线预测COVID-19病例数(CCC),这可能有助于采取早期预防措施,减缓疾病的传播。在本文中,我们使用最先进的自然语言处理(NLP)算法对来自美国8个城市的COVID-19相关推文进行数字编码,并预测未来8天内特定城市的CCC。提出了一种城市嵌入方法,以获得来自城市的每日tweet的时间序列表示,然后使用自定义的长短期记忆(LSTM)模型来预测案例数。通用句子编码器在预测未来6天的CCC时,在所有城市中产生的最佳标准化均方根误差(NRMSE)为0.090(0.039)。预测CCC的r2得分均在0.70以上,往往在0.8以上,表明实际预测的CCC值与模型预测的CCC值具有较强的相关性。我们的分析表明,在时间序列数据中,NRMSE和r2分数在不同城市和不同时间步长的数据中都具有一致性的稳健性。结果表明,LSTM模型可以学习nlp编码的推文语义与病例数之间的映射,这意味着可以直接挖掘社交媒体文本来识别大流行的未来进程。
{"title":"Mining Social Media Data to Predict COVID-19 Case Counts.","authors":"Maksims Kazijevs,&nbsp;Furkan A Akyelken,&nbsp;Manar D Samad","doi":"10.1109/ichi54592.2022.00027","DOIUrl":"https://doi.org/10.1109/ichi54592.2022.00027","url":null,"abstract":"<p><p>The unpredictability and unknowns surrounding the ongoing coronavirus disease (COVID-19) pandemic have led to an unprecedented consequence taking a heavy toll on the lives and economies of all countries. There have been efforts to predict COVID-19 case counts (CCC) using epidemiological data and numerical tokens online, which may allow early preventive measures to slow the spread of the disease. In this paper, we use state-of-the-art natural language processing (NLP) algorithms to numerically encode COVID-19 related tweets originated from eight cities in the United States and predict city-specific CCC up to eight days in the future. A city-embedding is proposed to obtain a time series representation of daily tweets posted from a city, which is then used to predict case counts using a custom long-short term memory (LSTM) model. The universal sentence encoder yields the best normalized root mean squared error (NRMSE) 0.090 (0.039), averaged across all cities in predicting CCC six days in the future. The <i>R</i> <sup>2</sup> scores in predicting CCC are more than 0.70 and often over 0.8, which suggests a strong correlation between the actual and our model predicted CCC values. Our analyses show that the NRMSE and <i>R</i> <sup>2</sup> scores are consistently robust across different cities and different numbers of time steps in time series data. Results show that the LSTM model can learn the mapping between the NLP-encoded tweet semantics and the case counts, which infers that social media text can be directly mined to identify the future course of the pandemic.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":" ","pages":"104-111"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9490453/pdf/nihms-1836082.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33477762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Sharing Time-to-Event Data with Privacy Protection. 在保护隐私的前提下共享时间到事件数据。
Pub Date : 2022-06-01 Epub Date: 2022-09-08 DOI: 10.1109/ichi54592.2022.00014
Luca Bonomi, Liyue Fan

Sharing time-to-event data is beneficial for enabling collaborative research efforts (e.g., survival studies), facilitating the design of effective interventions, and advancing patient care (e.g., early diagnosis). Despite numerous privacy solutions for sharing time-to-event data, recent research studies have shown that external information may become available (e.g., self-disclosure of study participation on social media) to an adversary, posing new privacy concerns. In this work, we formulate a cohort inference attack for time-to-event data sharing, in which an informed adversary aims at inferring the membership of a target individual in a specific cohort. Our study investigates the privacy risks associated with time-to-event data and evaluates the empirical privacy protection offered by popular privacy-protecting solutions (e.g., binning, differential privacy). Furthermore, we propose a novel approach to privately release individual level time-to-event data with high utility, while providing indistinguishability guarantees for the input value. Our method TE-Sanitizer is shown to provide effective mitigation against the inference attacks and high usefulness in survival analysis. The results and discussion provide domain experts with insights on the privacy and the usefulness of the studied methods.

共享从时间到事件的数据有利于开展合作研究(如生存研究)、促进有效干预措施的设计以及推动患者护理(如早期诊断)。尽管有许多针对共享时间到事件数据的隐私解决方案,但最近的研究表明,外部信息可能会被对手获取(例如,在社交媒体上自我披露参与研究的情况),从而带来新的隐私问题。在这项工作中,我们提出了一种针对时间到事件数据共享的队列推断攻击,在这种攻击中,知情的对手旨在推断目标个体在特定队列中的成员资格。我们的研究调查了与时间到事件数据相关的隐私风险,并评估了流行的隐私保护解决方案(如分档、差分隐私)所提供的经验隐私保护。此外,我们还提出了一种新方法,在为输入值提供不可区分性保证的同时,私下发布具有高效用的个体级时间到事件数据。研究表明,我们的 TE-Sanitizer 方法能有效缓解推理攻击,并在生存分析中具有很高的实用性。研究结果和讨论为领域专家提供了有关所研究方法的隐私性和实用性的见解。
{"title":"Sharing Time-to-Event Data with Privacy Protection.","authors":"Luca Bonomi, Liyue Fan","doi":"10.1109/ichi54592.2022.00014","DOIUrl":"10.1109/ichi54592.2022.00014","url":null,"abstract":"<p><p>Sharing time-to-event data is beneficial for enabling collaborative research efforts (e.g., survival studies), facilitating the design of effective interventions, and advancing patient care (e.g., early diagnosis). Despite numerous privacy solutions for sharing time-to-event data, recent research studies have shown that external information may become available (e.g., self-disclosure of study participation on social media) to an adversary, posing new privacy concerns. In this work, we formulate a cohort inference attack for time-to-event data sharing, in which an informed adversary aims at inferring the membership of a target individual in a specific cohort. Our study investigates the privacy risks associated with time-to-event data and evaluates the empirical privacy protection offered by popular privacy-protecting solutions (e.g., binning, differential privacy). Furthermore, we propose a novel approach to privately release individual level time-to-event data with high utility, while providing indistinguishability guarantees for the input value. Our method TE-Sanitizer is shown to provide effective mitigation against the inference attacks and high usefulness in survival analysis. The results and discussion provide domain experts with insights on the privacy and the usefulness of the studied methods.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2022 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9473343/pdf/nihms-1815589.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10181249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of Dementia Signals from Longitudinal Clinical Visits Using One-Class Classification. 利用单类分类从纵向临床访问中检测痴呆症信号
Pub Date : 2022-06-01 Epub Date: 2022-09-08 DOI: 10.1109/ichi54592.2022.00040
Omar A Ibrahim, Sunyang Fu, Maria Vassilaki, Michelle M Mielke, Jennifer St Sauver, Ronald C Petersen, Sunghwan Sohn

Dementia is one of the major health challenges in aging populations, with 50 million people diagnosed worldwide. However, dementia is often underdiagnosed or delayed resulting in missed opportunities for appropriate care plans. Identifying early signs of dementia is essential for better life quality of aging populations. Monitoring early signs of individual health changes could help clinicians diagnose dementia in its early stages with more effective treatment plans. However, rare data for dementia cases compared to the normal (i.e., imbalance class distribution) make it challenging to develop robust supervised learning models. In order to alleviate this issue, we investigated one-class classification (OCC) techniques, which use only majority class (i.e., normal cases) in model development to detect dementia signals from older adult clinical visits. The OCC models identify abnormality of older adults' longitudinal health conditions to predict incident dementia. The predictive performance of the OCC was compared with a recent streaming clustering-based technique and demonstrated higher predictive power. Our analysis showed that OCC has a promising potential to increase power in predicting dementia.

痴呆症是老龄人口面临的主要健康挑战之一,全世界有 5000 万人被诊断出患有痴呆症。然而,痴呆症往往诊断不足或被延误,导致错失制定适当护理计划的机会。要提高老龄人口的生活质量,识别痴呆症的早期症状至关重要。监测个人健康变化的早期迹象可帮助临床医生在痴呆症的早期阶段进行诊断,并制定更有效的治疗计划。然而,与正常人相比,痴呆症病例的数据非常稀少(即类分布不平衡),这给开发稳健的监督学习模型带来了挑战。为了缓解这一问题,我们研究了单类分类(OCC)技术,该技术在开发模型时只使用多数类(即正常病例),以检测老年人临床就诊中的痴呆信号。OCC 模型能识别老年人纵向健康状况的异常,从而预测痴呆症的发生。我们将 OCC 的预测性能与最新的基于流式聚类的技术进行了比较,结果表明 OCC 具有更高的预测能力。我们的分析表明,OCC 有希望提高痴呆症的预测能力。
{"title":"Detection of Dementia Signals from Longitudinal Clinical Visits Using One-Class Classification.","authors":"Omar A Ibrahim, Sunyang Fu, Maria Vassilaki, Michelle M Mielke, Jennifer St Sauver, Ronald C Petersen, Sunghwan Sohn","doi":"10.1109/ichi54592.2022.00040","DOIUrl":"10.1109/ichi54592.2022.00040","url":null,"abstract":"<p><p>Dementia is one of the major health challenges in aging populations, with 50 million people diagnosed worldwide. However, dementia is often underdiagnosed or delayed resulting in missed opportunities for appropriate care plans. Identifying early signs of dementia is essential for better life quality of aging populations. Monitoring early signs of individual health changes could help clinicians diagnose dementia in its early stages with more effective treatment plans. However, rare data for dementia cases compared to the normal (i.e., imbalance class distribution) make it challenging to develop robust supervised learning models. In order to alleviate this issue, we investigated one-class classification (OCC) techniques, which use only majority class (i.e., normal cases) in model development to detect dementia signals from older adult clinical visits. The OCC models identify abnormality of older adults' longitudinal health conditions to predict incident dementia. The predictive performance of the OCC was compared with a recent streaming clustering-based technique and demonstrated higher predictive power. Our analysis showed that OCC has a promising potential to increase power in predicting dementia.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2022 ","pages":"211-216"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9728104/pdf/nihms-1852693.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9328507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comparison of few-shot and traditional named entity recognition models for medical text. 医学文本少镜头与传统命名实体识别模型的比较。
Yao Ge, Yuting Guo, Yuan-Chi Yang, Mohammed Ali Al-Garadi, Abeed Sarker

Many research problems involving medical texts have limited amounts of annotated data available (e.g., expressions of rare diseases). Traditional supervised machine learning algorithms, particularly those based on deep neural networks, require large volumes of annotated data, and they underperform when only small amounts of labeled data are available. Few-shot learning (FSL) is a category of machine learning models that are designed with the intent of solving problems that have small annotated datasets available. However, there is no current study that compares the performances of FSL models with traditional models (e.g., conditional random fields) for medical text at different training set sizes. In this paper, we attempted to fill this gap in research by comparing multiple FSL models with traditional models for the task of named entity recognition (NER) from medical texts. Using five health-related annotated NER datasets, we benchmarked three traditional NER models based on BERT-BERT-Linear Classifier (BLC), BERT-CRF (BC) and SANER; and three FSL NER models-StructShot & NNShot, Few-Shot Slot Tagging (FS-ST) and ProtoNER. Our benchmarking results show that almost all models, whether traditional or FSL, achieve significantly lower performances compared to the state-of-the-art with small amounts of training data. For the NER experiments we executed, the F1-scores were very low with small training sets, typically below 30%. FSL models that were reported to perform well on non-medical texts significantly underperformed, compared to their reported best, on medical texts. Our experiments also suggest that FSL methods tend to perform worse on data sets from noisy sources of medical texts, such as social media (which includes misspellings and colloquial expressions), compared to less noisy sources such as medical literature. Our experiments demonstrate that the current state-of-the-art FSL systems are not yet suitable for effective NER in medical natural language processing tasks, and further research needs to be carried out to improve their performances. Creation of specialized, standardized datasets replicating real-world scenarios may help to move this category of methods forward.

许多涉及医学文本的研究问题的可用注释数据数量有限(例如,罕见疾病的表达)。传统的监督机器学习算法,特别是那些基于深度神经网络的算法,需要大量的标注数据,当只有少量的标记数据可用时,它们的表现不佳。FSL (Few-shot learning)是一类机器学习模型,其设计目的是解决具有小注释数据集的问题。然而,目前还没有研究将FSL模型与传统模型(如条件随机场)在不同训练集大小下的医学文本性能进行比较。在本文中,我们试图通过比较多个FSL模型与传统模型在医学文本命名实体识别(NER)任务上的差异来填补这一研究空白。利用5个与健康相关的注释NER数据集,我们对基于bert - bert线性分类器(BLC)、BERT-CRF (BC)和SANER的三种传统NER模型进行了基准测试;以及三个FSL NER模型- structshot & NNShot, Few-Shot Slot Tagging (FS-ST)和ProtoNER。我们的基准测试结果表明,与使用少量训练数据的最先进模型相比,几乎所有模型(无论是传统模型还是FSL模型)的性能都要低得多。对于我们执行的NER实验,f1分数在小训练集上非常低,通常低于30%。据报道,在非医学文本上表现良好的FSL模型,与在医学文本上表现最好的模型相比,表现明显不佳。我们的实验还表明,与医学文献等噪音较小的来源相比,FSL方法在嘈杂的医学文本来源(如社交媒体(包括拼写错误和口语化表达))的数据集上的表现往往更差。我们的实验表明,目前最先进的FSL系统还不适合有效的NER医学自然语言处理任务,需要进一步的研究来提高其性能。创建专门的、标准化的复制真实世界场景的数据集可能有助于推动这类方法的发展。
{"title":"A comparison of few-shot and traditional named entity recognition models for medical text.","authors":"Yao Ge,&nbsp;Yuting Guo,&nbsp;Yuan-Chi Yang,&nbsp;Mohammed Ali Al-Garadi,&nbsp;Abeed Sarker","doi":"10.1109/ichi54592.2022.00024","DOIUrl":"https://doi.org/10.1109/ichi54592.2022.00024","url":null,"abstract":"<p><p>Many research problems involving medical texts have limited amounts of annotated data available (<i>e.g</i>., expressions of rare diseases). Traditional supervised machine learning algorithms, particularly those based on deep neural networks, require large volumes of annotated data, and they underperform when only small amounts of labeled data are available. Few-shot learning (FSL) is a category of machine learning models that are designed with the intent of solving problems that have small annotated datasets available. However, there is no current study that compares the performances of FSL models with traditional models (<i>e.g</i>., conditional random fields) for medical text at different training set sizes. In this paper, we attempted to fill this gap in research by comparing multiple FSL models with traditional models for the task of named entity recognition (NER) from medical texts. Using five health-related annotated NER datasets, we benchmarked three traditional NER models based on BERT-BERT-Linear Classifier (BLC), BERT-CRF (BC) and SANER; and three FSL NER models-StructShot & NNShot, Few-Shot Slot Tagging (FS-ST) and ProtoNER. Our benchmarking results show that almost all models, whether traditional or FSL, achieve significantly lower performances compared to the state-of-the-art with small amounts of training data. For the NER experiments we executed, the F<sub>1</sub>-scores were very low with small training sets, typically below 30%. FSL models that were reported to perform well on non-medical texts significantly underperformed, compared to their reported best, on medical texts. Our experiments also suggest that FSL methods tend to perform worse on data sets from noisy sources of medical texts, such as social media (which includes misspellings and colloquial expressions), compared to less noisy sources such as medical literature. Our experiments demonstrate that the current state-of-the-art FSL systems are not yet suitable for effective NER in medical natural language processing tasks, and further research needs to be carried out to improve their performances. Creation of specialized, standardized datasets replicating real-world scenarios may help to move this category of methods forward.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2022 ","pages":"84-89"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462421/pdf/nihms-1926966.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10186790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Classifying Drug Ratings Using User Reviews with Transformer-Based Language Models. 使用基于变压器的语言模型的用户评论对药物评级进行分类。
Akhil Shiju, Zhe He

Drug review websites such as Drugs.com provide users' textual reviews and numeric ratings of drugs. These reviews along with the ratings are used for the consumers for choosing a drug. However, the numeric ratings may not always be consistent with text reviews and purely relying on the rating score for finding positive/negative reviews may not be reliable. Automatic classification of user ratings based on textual review can create a more reliable rating for drugs. In this project, we built classification models to classify drug review ratings using textual reviews with traditional machine learning and deep learning models. Traditional machine learning models including Random Forest and Naive Bayesian classifiers were built using TF-IDF features as input. Also, transformer-based neural network models including BERT, Bio_ClinicalBERT, RoBERTa, XLNet, ELECTRA, and ALBERT were built using the raw text as input. Overall, Bio_ClinicalBERT model outperformed the other models with an overall accuracy of 87%. We further identified concepts of the Unified Medical Language System (UMLS) from the postings and analyzed their semantic types stratified by class types. This research demonstrated that transformer-based models can be used to classify drug reviews based solely on textual reviews.

Drugs.com等药物评论网站提供用户对药物的文字评论和数字评级。这些评论与评级一起用于消费者选择药物。然而,数字评级可能并不总是与文本评论一致,纯粹依靠评级分数来寻找正面/负面评论可能并不可靠。基于文本审查的用户评级自动分类可以为药物创建更可靠的评级。在这个项目中,我们建立了分类模型,使用传统机器学习和深度学习模型的文本评论对药物审评评级进行分类。传统的机器学习模型包括随机森林和朴素贝叶斯分类器,使用TF-IDF特征作为输入。此外,基于变压器的神经网络模型包括BERT, Bio_ClinicalBERT, RoBERTa, XLNet, ELECTRA和ALBERT使用原始文本作为输入。总体而言,Bio_ClinicalBERT模型以87%的总体准确率优于其他模型。我们进一步从帖子中确定了统一医学语言系统(UMLS)的概念,并分析了它们按类类型分层的语义类型。本研究表明,基于变压器的模型可以用于仅基于文本评论的药物评论分类。
{"title":"Classifying Drug Ratings Using User Reviews with Transformer-Based Language Models.","authors":"Akhil Shiju,&nbsp;Zhe He","doi":"10.1109/ichi54592.2022.00035","DOIUrl":"https://doi.org/10.1109/ichi54592.2022.00035","url":null,"abstract":"<p><p>Drug review websites such as Drugs.com provide users' textual reviews and numeric ratings of drugs. These reviews along with the ratings are used for the consumers for choosing a drug. However, the numeric ratings may not always be consistent with text reviews and purely relying on the rating score for finding positive/negative reviews may not be reliable. Automatic classification of user ratings based on textual review can create a more reliable rating for drugs. In this project, we built classification models to classify drug review ratings using textual reviews with traditional machine learning and deep learning models. Traditional machine learning models including Random Forest and Naive Bayesian classifiers were built using TF-IDF features as input. Also, transformer-based neural network models including BERT, Bio_ClinicalBERT, RoBERTa, XLNet, ELECTRA, and ALBERT were built using the raw text as input. Overall, Bio_ClinicalBERT model outperformed the other models with an overall accuracy of 87%. We further identified concepts of the Unified Medical Language System (UMLS) from the postings and analyzed their semantic types stratified by class types. This research demonstrated that transformer-based models can be used to classify drug reviews based solely on textual reviews.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2022 ","pages":"163-169"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744636/pdf/nihms-1855900.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10701370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Annotating Music Therapy, Chiropractic and Aquatic Exercise Using Electronic Health Record. 使用电子健康记录解说音乐治疗、脊椎按摩和水上运动。
Pub Date : 2022-06-01 Epub Date: 2022-09-08 DOI: 10.1109/ichi54592.2022.00121
Huixue Zhou, Greg Silverman, Zhongran Niu, Jenzi Silverman, Roni Evans, Robin Austin, Rui Zhang

Complementary and Integrative Health (CIH) has gained increasing popularity in the past decades. The overall goal of this study is to represent information pertinent to music therapy, chiropractic and aquatic exercise in an EHR system. A total of 300 clinical notes were randomly selected and manually annotated. Annotations were made for status, symptom and frequency of each approach. This set of annotations was used as a gold standard to evaluate performance of NLP systems used in this study (specifically BioMedICUS, MetaMap and cTAKES) for extracting CIH concepts. Three NLP systems achieved an average lenient match F1-score of 0.50 in all three CIH approaches. BioMedICUS achieved the best performance in music therapy with an F1-score of 0.73. This study is a pilot to investigate CIH representation in clinical note and lays a foundation for using EHR for clinical research for CIH approaches.

在过去的几十年里,补充和综合健康(CIH)越来越受欢迎。本研究的总体目标是在EHR系统中呈现与音乐治疗、脊椎按摩和水上运动相关的信息。共有300份临床记录被随机选择并手动注释。对每种方法的状态、症状和频率进行了注释。这组注释被用作评估本研究中用于提取CIH概念的NLP系统(特别是BioMedICUS、MetaMap和cTAKES)性能的金标准。三个NLP系统在所有三种CIH方法中实现了0.50的平均宽松比赛F1分数。BioMedICUS在音乐治疗方面取得了最佳成绩,F1成绩为0.73。本研究是研究临床笔记中CIH表现的试点,为使用EHR进行CIH方法的临床研究奠定了基础。
{"title":"Annotating Music Therapy, Chiropractic and Aquatic Exercise Using Electronic Health Record.","authors":"Huixue Zhou,&nbsp;Greg Silverman,&nbsp;Zhongran Niu,&nbsp;Jenzi Silverman,&nbsp;Roni Evans,&nbsp;Robin Austin,&nbsp;Rui Zhang","doi":"10.1109/ichi54592.2022.00121","DOIUrl":"10.1109/ichi54592.2022.00121","url":null,"abstract":"<p><p>Complementary and Integrative Health (CIH) has gained increasing popularity in the past decades. The overall goal of this study is to represent information pertinent to music therapy, chiropractic and aquatic exercise in an EHR system. A total of 300 clinical notes were randomly selected and manually annotated. Annotations were made for <i>status</i>, <i>symptom</i> and <i>frequency</i> of each approach. This set of annotations was used as a gold standard to evaluate performance of NLP systems used in this study (specifically BioMedICUS, MetaMap and cTAKES) for extracting CIH concepts. Three NLP systems achieved an average lenient match F1-score of 0.50 in all three CIH approaches. BioMedICUS achieved the best performance in music therapy with an F1-score of 0.73. This study is a pilot to investigate CIH representation in clinical note and lays a foundation for using EHR for clinical research for CIH approaches.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":"2022 ","pages":"610-611"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10110363/pdf/nihms-1890434.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9751841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Radiology Text Analysis System (RadText): Architecture and Evaluation. 放射学文本分析系统(RadText):架构和评估。
Pub Date : 2022-06-01 Epub Date: 2022-09-08 DOI: 10.1109/ichi54592.2022.00050
Song Wang, Mingquan Lin, Ying Ding, George Shih, Zhiyong Lu, Yifan Peng

Analyzing radiology reports is a time-consuming and error-prone task, which raises the need for an efficient automated radiology report analysis system to alleviate the workloads of radiologists and encourage precise diagnosis. In this work, we present RadText, a high-performance open-source Python radiology text analysis system. RadText offers an easy-to-use text analysis pipeline, including de-identification, section segmentation, sentence split and word tokenization, named entity recognition, parsing, and negation detection. Superior to existing widely used toolkits, RadText features a hybrid text processing schema, supports raw text processing and local processing, which enables higher accuracy, better usability and improved data privacy. RadText adopts BioC as the unified interface, and also standardizes the output into a structured representation that is compatible with Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), which allows for a more systematic approach to observational research across multiple, disparate data sources. We evaluated RadText on the MIMIC-CXR dataset, with five new disease labels that we annotated for this work. RadText demonstrates highly accurate classification performances, with a 0.91 average precision, 0.94 average recall and 0.92 average F-1 score. We also annotated a test set for the five new disease labels to facilitate future research or applications. We have made our code, documentations, examples and the test set available at https://github.com/bionlplab/radtext.

分析放射学报告是一项耗时且容易出错的任务,因此需要一个高效的自动化放射学报告分析系统,以减轻放射科医生的工作量并鼓励精确诊断。在这项工作中,我们提出了RadText,一个高性能的开源Python放射学文本分析系统。RadText提供了一个易于使用的文本分析管道,包括去识别、部分分割、句子分割和单词标记化、命名实体识别、解析和否定检测。优于现有广泛使用的工具包,RadText具有混合文本处理模式,支持原始文本处理和本地处理,从而实现更高的准确性,更好的可用性和改进的数据隐私。RadText采用BioC作为统一接口,并将输出标准化为与观察性医疗结果合作伙伴关系(OMOP)公共数据模型(CDM)兼容的结构化表示,该模型允许采用更系统的方法跨多个不同数据源进行观察性研究。我们在MIMIC-CXR数据集上评估了RadText,我们为这项工作注释了五个新的疾病标签。RadText显示出高度准确的分类性能,平均精度为0.91,平均召回率为0.94,平均F-1得分为0.92。我们还为五种新的疾病标签标注了一个测试集,以方便未来的研究或应用。我们已经在https://github.com/bionlplab/radtext上提供了代码、文档、示例和测试集。
{"title":"Radiology Text Analysis System (RadText): Architecture and Evaluation.","authors":"Song Wang,&nbsp;Mingquan Lin,&nbsp;Ying Ding,&nbsp;George Shih,&nbsp;Zhiyong Lu,&nbsp;Yifan Peng","doi":"10.1109/ichi54592.2022.00050","DOIUrl":"https://doi.org/10.1109/ichi54592.2022.00050","url":null,"abstract":"<p><p>Analyzing radiology reports is a time-consuming and error-prone task, which raises the need for an efficient automated radiology report analysis system to alleviate the workloads of radiologists and encourage precise diagnosis. In this work, we present RadText, a high-performance open-source Python radiology text analysis system. RadText offers an easy-to-use text analysis pipeline, including de-identification, section segmentation, sentence split and word tokenization, named entity recognition, parsing, and negation detection. Superior to existing widely used toolkits, RadText features a hybrid text processing schema, supports raw text processing and local processing, which enables higher accuracy, better usability and improved data privacy. RadText adopts BioC as the unified interface, and also standardizes the output into a structured representation that is compatible with Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), which allows for a more systematic approach to observational research across multiple, disparate data sources. We evaluated RadText on the MIMIC-CXR dataset, with five new disease labels that we annotated for this work. RadText demonstrates highly accurate classification performances, with a 0.91 average precision, 0.94 average recall and 0.92 average F-1 score. We also annotated a test set for the five new disease labels to facilitate future research or applications. We have made our code, documentations, examples and the test set available at https://github.com/bionlplab/radtext.</p>","PeriodicalId":73284,"journal":{"name":"IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics","volume":" ","pages":"288-296"},"PeriodicalIF":0.0,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9484781/pdf/nihms-1836549.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40373631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1