Proceedings of the ACM Conference on Health, Inference, and Learning最新文献_第5页

Extracting medical entities from social media 从社交媒体中提取医疗实体

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-02 DOI: 10.1145/3368555.3384467

S. Šćepanović, E. Martin-Lopez, D. Quercia, Khan Baykaner

Accurately extracting medical entities from social media is challenging because people use informal language with different expressions for the same concept, and they also make spelling mistakes. Previous work either focused on specific diseases (e.g., depression) or drugs (e.g., opioids) or, if working with a wide-set of medical entities, only tackled individual and small-scale benchmark datasets (e.g., AskaPatient). In this work, we first demonstrated how to accurately extract a wide variety of medical entities such as symptoms, diseases, and drug names on three benchmark datasets from varied social media sources, and then also validated this approach on a large-scale Reddit dataset. We first implemented a deep-learning method using contextual embeddings that upon two existing benchmark datasets, one containing annotated AskaPatient posts (CADEC) and the other containing annotated tweets (Micromed), outperformed existing state-of-the-art methods. Second, we created an additional benchmark dataset by annotating medical entities in 2K Reddit posts (made publicly available under the name of MedRed) and showed that our method also performs well on this new dataset. Finally, to demonstrate that our method accurately extracts a wide variety of medical entities on a large scale, we applied the model pre-trained on MedRed to half a million Reddit posts. The posts came from disease-specific subreddits so we could categorise them into 18 diseases based on the subreddit. We then trained a machine-learning classifier to predict the post's category solely from the extracted medical entities. The average F1 score across categories was .87. These results open up new cost-effective opportunities for modeling, tracking and even predicting health behavior at scale.

准确地从社交媒体中提取医疗实体是一项挑战，因为人们对同一个概念使用不同表达的非正式语言，而且他们也会犯拼写错误。以前的工作要么集中在特定疾病(如抑郁症)或药物(如阿片类药物)上，要么在与广泛的医疗实体合作时，只处理个人和小规模的基准数据集(如askappatient)。在这项工作中，我们首先演示了如何在来自不同社交媒体来源的三个基准数据集上准确地提取各种各样的医疗实体，如症状、疾病和药物名称，然后还在大规模的Reddit数据集上验证了这种方法。我们首先使用上下文嵌入实现了一种深度学习方法，该方法基于两个现有的基准数据集，一个包含带注释的AskaPatient帖子(CADEC)，另一个包含带注释的tweet (Micromed)，优于现有的最先进的方法。其次，我们通过在2K个Reddit帖子(以MedRed的名义公开提供)中注释医疗实体创建了一个额外的基准数据集，并表明我们的方法在这个新数据集上也表现良好。最后，为了证明我们的方法可以大规模地准确提取各种各样的医疗实体，我们将MedRed上预训练的模型应用于50万个Reddit帖子。这些帖子来自特定疾病的子reddit，所以我们可以根据子reddit将它们分类为18种疾病。然后，我们训练了一个机器学习分类器，仅从提取的医疗实体中预测帖子的类别。各类别的F1平均得分为0.87。这些结果为大规模建模、跟踪甚至预测健康行为开辟了新的具有成本效益的机会。

{"title":"Extracting medical entities from social media","authors":"S. Šćepanović, E. Martin-Lopez, D. Quercia, Khan Baykaner","doi":"10.1145/3368555.3384467","DOIUrl":"https://doi.org/10.1145/3368555.3384467","url":null,"abstract":"Accurately extracting medical entities from social media is challenging because people use informal language with different expressions for the same concept, and they also make spelling mistakes. Previous work either focused on specific diseases (e.g., depression) or drugs (e.g., opioids) or, if working with a wide-set of medical entities, only tackled individual and small-scale benchmark datasets (e.g., AskaPatient). In this work, we first demonstrated how to accurately extract a wide variety of medical entities such as symptoms, diseases, and drug names on three benchmark datasets from varied social media sources, and then also validated this approach on a large-scale Reddit dataset. We first implemented a deep-learning method using contextual embeddings that upon two existing benchmark datasets, one containing annotated AskaPatient posts (CADEC) and the other containing annotated tweets (Micromed), outperformed existing state-of-the-art methods. Second, we created an additional benchmark dataset by annotating medical entities in 2K Reddit posts (made publicly available under the name of MedRed) and showed that our method also performs well on this new dataset. Finally, to demonstrate that our method accurately extracts a wide variety of medical entities on a large scale, we applied the model pre-trained on MedRed to half a million Reddit posts. The posts came from disease-specific subreddits so we could categorise them into 18 diseases based on the subreddit. We then trained a machine-learning classifier to predict the post's category solely from the extracted medical entities. The average F1 score across categories was .87. These results open up new cost-effective opportunities for modeling, tracking and even predicting health behavior at scale.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78562336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 26

Interactive hybrid approach to combine machine and human intelligence for personalized rehabilitation assessment 结合机器和人类智能进行个性化康复评估的交互式混合方法

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-02 DOI: 10.1145/3368555.3384452

Min Hun Lee, D. Siewiorek, A. Smailagic, A. Bernardino, S. Badia

Automated assessment of rehabilitation exercises using machine learning has a potential to improve current rehabilitation practices. However, it is challenging to completely replicate therapist's decision making on the assessment of patients with various physical conditions. This paper describes an interactive machine learning approach that iteratively integrates a data-driven model with expert's knowledge to assess the quality of rehabilitation exercises. Among a large set of kinematic features of the exercise motions, our approach identifies the most salient features for assessment using reinforcement learning and generates a user-specific analysis to elicit feature relevance from a therapist for personalized rehabilitation assessment. While accommodating therapist's feedback on feature relevance, our approach can tune a generic assessment model into a personalized model. Specifically, our approach improves performance to predict assessment from 0.8279 to 0.9116 average F1-scores of three upper-limb rehabilitation exercises (p < 0.01). Our work demonstrates that machine learning models with feature selection can generate kinematic feature-based analysis as explanations on predictions of a model to elicit expert's knowledge of assessment, and how machine learning models can augment with expert's knowledge for personalized rehabilitation assessment.

使用机器学习对康复练习进行自动评估有可能改善当前的康复实践。然而，要完全复制治疗师对各种身体状况患者的评估决策是具有挑战性的。本文描述了一种交互式机器学习方法，该方法迭代地将数据驱动模型与专家知识集成在一起，以评估康复练习的质量。在大量运动动作的运动学特征中，我们的方法识别出最显著的特征，使用强化学习进行评估，并生成针对用户的分析，从治疗师那里获得特征相关性，以进行个性化康复评估。在适应治疗师对特征相关性的反馈的同时，我们的方法可以将通用评估模型调整为个性化模型。具体而言，我们的方法提高了成绩，预测三种上肢康复运动的平均f1得分从0.8279到0.9116 (p < 0.01)。我们的工作表明，具有特征选择的机器学习模型可以生成基于运动学特征的分析，作为对模型预测的解释，以引出专家的评估知识，以及机器学习模型如何与专家的知识相结合，以进行个性化康复评估。

{"title":"Interactive hybrid approach to combine machine and human intelligence for personalized rehabilitation assessment","authors":"Min Hun Lee, D. Siewiorek, A. Smailagic, A. Bernardino, S. Badia","doi":"10.1145/3368555.3384452","DOIUrl":"https://doi.org/10.1145/3368555.3384452","url":null,"abstract":"Automated assessment of rehabilitation exercises using machine learning has a potential to improve current rehabilitation practices. However, it is challenging to completely replicate therapist's decision making on the assessment of patients with various physical conditions. This paper describes an interactive machine learning approach that iteratively integrates a data-driven model with expert's knowledge to assess the quality of rehabilitation exercises. Among a large set of kinematic features of the exercise motions, our approach identifies the most salient features for assessment using reinforcement learning and generates a user-specific analysis to elicit feature relevance from a therapist for personalized rehabilitation assessment. While accommodating therapist's feedback on feature relevance, our approach can tune a generic assessment model into a personalized model. Specifically, our approach improves performance to predict assessment from 0.8279 to 0.9116 average F1-scores of three upper-limb rehabilitation exercises (p < 0.01). Our work demonstrates that machine learning models with feature selection can generate kinematic feature-based analysis as explanations on predictions of a model to elicit expert's knowledge of assessment, and how machine learning models can augment with expert's knowledge for personalized rehabilitation assessment.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82401429","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

TASTE: Temporal and Static Tensor Factorization for Phenotyping Electronic Health Records. TASTE：用于电子健康记录表型的时态和静态张量因子化。

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-01 DOI: 10.1145/3368555.3384464

Ardavan Afshar, Ioakeim Perros, Haesun Park, Christopher deFilippi, Xiaowei Yan, Walter Stewart, Joyce Ho, Jimeng Sun

Phenotyping electronic health records (EHR) focuses on defining meaningful patient groups (e.g., heart failure group and diabetes group) and identifying the temporal evolution of patients in those groups. Tensor factorization has been an effective tool for phenotyping. Most of the existing works assume either a static patient representation with aggregate data or only model temporal data. However, real EHR data contain both temporal (e.g., longitudinal clinical visits) and static information (e.g., patient demographics), which are difficult to model simultaneously. In this paper, we propose Temporal And Static TEnsor factorization (TASTE) that jointly models both static and temporal information to extract phenotypes. TASTE combines the PARAFAC2 model with non-negative matrix factorization to model a temporal and a static tensor. To fit the proposed model, we transform the original problem into simpler ones which are optimally solved in an alternating fashion. For each of the sub-problems, our proposed mathematical re-formulations lead to efficient sub-problem solvers. Comprehensive experiments on large EHR data from a heart failure (HF) study confirmed that TASTE is up to 14× faster than several baselines and the resulting phenotypes were confirmed to be clinically meaningful by a cardiologist. Using 60 phenotypes extracted by TASTE, a simple logistic regression can achieve the same level of area under the curve (AUC) for HF prediction compared to a deep learning model using recurrent neural networks (RNN) with 345 features.

电子健康记录（EHR）表型分析的重点是定义有意义的患者组别（如心衰组和糖尿病组），并识别这些组别中患者的时间演变。张量因子化是表型分析的有效工具。现有的大多数研究都假定病人代表是一个静态的集合数据，或者只对时间数据进行建模。然而，真实的电子病历数据既包含时间信息（如纵向临床访问），也包含静态信息（如患者人口统计数据），这两种信息很难同时建模。在本文中，我们提出了时态和静态因式分解法（Temporal And Static TEnsor factorization，TASTE），它能对静态和时态信息进行联合建模，从而提取表型。TASTE 将 PARAFAC2 模型与非负矩阵因式分解相结合，为时间和静态张量建模。为了适应所提出的模型，我们将原始问题转化为更简单的问题，并以交替方式优化解决。对于每一个子问题，我们提出的数学重新表述都能带来高效的子问题求解器。在一项心力衰竭（HF）研究的大量电子病历数据上进行的综合实验证实，TASTE 比几种基线方法快 14 倍，而且得出的表型经心脏病专家确认具有临床意义。使用 TASTE 提取的 60 个表型，与使用具有 345 个特征的递归神经网络 (RNN) 的深度学习模型相比，简单的逻辑回归能达到相同的高频预测曲线下面积 (AUC)。

{"title":"TASTE: Temporal and Static Tensor Factorization for Phenotyping Electronic Health Records.","authors":"Ardavan Afshar, Ioakeim Perros, Haesun Park, Christopher deFilippi, Xiaowei Yan, Walter Stewart, Joyce Ho, Jimeng Sun","doi":"10.1145/3368555.3384464","DOIUrl":"10.1145/3368555.3384464","url":null,"abstract":"Phenotyping electronic health records (EHR) focuses on defining meaningful patient groups (e.g., heart failure group and diabetes group) and identifying the temporal evolution of patients in those groups. Tensor factorization has been an effective tool for phenotyping. Most of the existing works assume either a static patient representation with aggregate data or only model temporal data. However, real EHR data contain both temporal (e.g., longitudinal clinical visits) and static information (e.g., patient demographics), which are difficult to model simultaneously. In this paper, we propose Temporal And Static TEnsor factorization (TASTE) that jointly models both static and temporal information to extract phenotypes. TASTE combines the PARAFAC2 model with non-negative matrix factorization to model a temporal and a static tensor. To fit the proposed model, we transform the original problem into simpler ones which are optimally solved in an alternating fashion. For each of the sub-problems, our proposed mathematical re-formulations lead to efficient sub-problem solvers. Comprehensive experiments on large EHR data from a heart failure (HF) study confirmed that TASTE is up to 14× faster than several baselines and the resulting phenotypes were confirmed to be clinically meaningful by a cardiologist. Using 60 phenotypes extracted by TASTE, a simple logistic regression can achieve the same level of area under the curve (AUC) for HF prediction compared to a deep learning model using recurrent neural networks (RNN) with 345 features.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7924914/pdf/nihms-1587674.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25428959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

CaliForest: Calibrated Random Forest for Health Data. califforest:健康数据校准随机森林。

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-01 Epub Date: 2020-04-02 DOI: 10.1145/3368555.3384461

Yubin Park, Joyce C Ho

Real-world predictive models in healthcare should be evaluated in terms of discrimination, the ability to differentiate between high and low risk events, and calibration, or the accuracy of the risk estimates. Unfortunately, calibration is often neglected and only discrimination is analyzed. Calibration is crucial for personalized medicine as they play an increasing role in the decision making process. Since random forest is a popular model for many healthcare applications, we propose CaliForest, a new calibrated random forest. Unlike existing calibration methodologies, CaliForest utilizes the out-of-bag samples to avoid the explicit construction of a calibration set. We evaluated CaliForest on two risk prediction tasks obtained from the publicly-available MIMIC-III database. Evaluation on these binary prediction tasks demonstrates that CaliForest can achieve the same discriminative power as random forest while obtaining a better-calibrated model evaluated across six different metrics. CaliForest is published on the standard Python software repository and the code is openly available on Github.

应该根据甄别、区分高风险和低风险事件的能力以及校准或风险估计的准确性来评估医疗保健中的实际预测模型。不幸的是，校准常常被忽略，只分析了判别。校准对于个性化医疗至关重要，因为它们在决策过程中发挥着越来越大的作用。由于随机森林是许多医疗保健应用的流行模型，我们提出了califforest，一种新的校准随机森林。与现有的校准方法不同，califforest利用袋外样本来避免明确构建校准集。我们从公开的MIMIC-III数据库中获得了两个风险预测任务，对califforest进行了评估。对这些二元预测任务的评估表明，califforest可以达到与random forest相同的判别能力，同时获得跨六个不同指标评估的更好校准的模型。califforest发布在标准Python软件存储库上，其代码在Github上公开可用。

{"title":"CaliForest: Calibrated Random Forest for Health Data.","authors":"Yubin Park, Joyce C Ho","doi":"10.1145/3368555.3384461","DOIUrl":"https://doi.org/10.1145/3368555.3384461","url":null,"abstract":"Real-world predictive models in healthcare should be evaluated in terms of discrimination, the ability to differentiate between high and low risk events, and calibration, or the accuracy of the risk estimates. Unfortunately, calibration is often neglected and only discrimination is analyzed. Calibration is crucial for personalized medicine as they play an increasing role in the decision making process. Since random forest is a popular model for many healthcare applications, we propose CaliForest, a new calibrated random forest. Unlike existing calibration methodologies, CaliForest utilizes the out-of-bag samples to avoid the explicit construction of a calibration set. We evaluated CaliForest on two risk prediction tasks obtained from the publicly-available MIMIC-III database. Evaluation on these binary prediction tasks demonstrates that CaliForest can achieve the same discriminative power as random forest while obtaining a better-calibrated model evaluated across six different metrics. CaliForest is published on the standard Python software repository and the code is openly available on Github.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3368555.3384461","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39226442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Multiple Instance Learning for Predicting Necrotizing Enterocolitis in Premature Infants Using Microbiome Data. 利用微生物组数据预测早产儿坏死性小肠结肠炎的多实例学习。

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-01 Epub Date: 2020-04-02 DOI: 10.1145/3368555.3384466

Thomas A Hooven, Adam Yun Chao Lin, Ansaf Salleb-Aouissi

Necrotizing enterocolitis (NEC) is a life-threatening intestinal disease that primarily affects preterm infants during their first weeks after birth. Mortality rates associated with NEC are 15-30%, and surviving infants are susceptible to multiple serious, long-term complications. The disease is sporadic and, with currently available tools, unpredictable. We are creating an early warning system that uses stool microbiome features, combined with clinical and demographic information, to identify infants at high risk of developing NEC. Our approach uses a multiple instance learning, neural network-based system that could be used to generate daily or weekly NEC predictions for premature infants. The approach was selected to effectively utilize sparse and weakly annotated datasets characteristic of stool microbiome analysis. Here we describe initial validation of our system, using clinical and microbiome data from a nested case-control study of 161 preterm infants. We show receiver-operator curve areas above 0.9, with 75% of dominant predictive samples for NEC-affected infants identified at least 24 hours prior to disease onset. Our results pave the way for development of a real-time early warning system for NEC using a limited set of basic clinical and demographic details combined with stool microbiome data.

坏死性小肠结肠炎（NEC）是一种危及生命的肠道疾病，主要影响早产儿出生后的头几周。与NEC相关的死亡率为15-30%，存活的婴儿易患多种严重的长期并发症。这种疾病是散发性的，而且用目前可用的工具，是不可预测的。我们正在创建一个早期预警系统，该系统利用粪便微生物组特征，结合临床和人口统计信息，来识别NEC高危婴儿。我们的方法使用了一个基于神经网络的多实例学习系统，该系统可用于生成早产儿的每日或每周NEC预测。选择该方法是为了有效利用粪便微生物组分析的稀疏和弱注释数据集。在这里，我们使用来自161名早产儿的嵌套病例对照研究的临床和微生物组数据来描述我们的系统的初步验证。我们显示了0.9以上的受试者-操作者曲线区域，75%的NEC影响婴儿的主要预测样本在疾病发作前至少24小时确定。我们的研究结果为开发NEC实时预警系统铺平了道路，该系统使用一组有限的基本临床和人口统计细节，并结合粪便微生物组数据。

{"title":"Multiple Instance Learning for Predicting Necrotizing Enterocolitis in Premature Infants Using Microbiome Data.","authors":"Thomas A Hooven, Adam Yun Chao Lin, Ansaf Salleb-Aouissi","doi":"10.1145/3368555.3384466","DOIUrl":"10.1145/3368555.3384466","url":null,"abstract":"Necrotizing enterocolitis (NEC) is a life-threatening intestinal disease that primarily affects preterm infants during their first weeks after birth. Mortality rates associated with NEC are 15-30%, and surviving infants are susceptible to multiple serious, long-term complications. The disease is sporadic and, with currently available tools, unpredictable. We are creating an early warning system that uses stool microbiome features, combined with clinical and demographic information, to identify infants at high risk of developing NEC. Our approach uses a multiple instance learning, neural network-based system that could be used to generate daily or weekly NEC predictions for premature infants. The approach was selected to effectively utilize sparse and weakly annotated datasets characteristic of stool microbiome analysis. Here we describe initial validation of our system, using clinical and microbiome data from a nested case-control study of 161 preterm infants. We show receiver-operator curve areas above 0.9, with 75% of dominant predictive samples for NEC-affected infants identified at least 24 hours prior to disease onset. Our results pave the way for development of a real-time early warning system for NEC using a limited set of basic clinical and demographic details combined with stool microbiome data.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3368555.3384466","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39229696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Adverse Drug Reaction Discovery from Electronic Health Records with Deep Neural Networks. 基于深度神经网络的电子健康记录药物不良反应发现。

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-01 DOI: 10.1145/3368555.3384459

Wei Zhang, Peggy Peissig, Zhaobin Kuang, David Page

Adverse drug reactions (ADRs) are detrimental and unexpected clinical incidents caused by drug intake. The increasing availability of massive quantities of longitudinal event data such as electronic health records (EHRs) has redefined ADR discovery as a big data analytics problem, where data-hungry deep neural networks are especially suitable because of the abundance of the data. To this end, we introduce neural self-controlled case series (NSCCS), a deep learning framework for ADR discovery from EHRs. NSCCS rigorously follows a self-controlled case series design to adjust implicitly and efficiently for individual heterogeneity. In this way, NSCCS is robust to time-invariant confounding issues and thus more capable of identifying associations that reflect the underlying mechanism between various types of drugs and adverse conditions. We apply NSCCS to a large-scale, real-world EHR dataset and empirically demonstrate its superior performance with comprehensive experiments on a benchmark ADR discovery task.

药物不良反应(adr)是由于药物摄入而引起的有害的、意想不到的临床事件。电子健康记录(EHRs)等大量纵向事件数据的可用性不断增加，将ADR发现重新定义为一个大数据分析问题，由于数据丰富，需要大量数据的深度神经网络特别适合。为此，我们引入了神经自控制病例序列(NSCCS)，这是一种用于从电子病历中发现不良反应的深度学习框架。NSCCS严格遵循自我控制的病例系列设计，以隐式和有效地调整个体异质性。通过这种方式，NSCCS对时不变混淆问题具有鲁棒性，因此更有能力识别反映各种药物和不良状况之间潜在机制的关联。我们将NSCCS应用于大规模的真实EHR数据集，并通过对基准ADR发现任务的综合实验实证证明了其优越的性能。

引用次数: 7

MMiDaS-AE: Multi-modal Missing Data aware Stacked Autoencoder for Biomedical Abstract Screening. MMiDaS-AE:用于生物医学摘要筛选的多模态缺失数据感知堆叠自编码器。

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-01 Epub Date: 2020-04-02 DOI: 10.1145/3368555.3384463

Eric W Lee, Byron C Wallace, Karla I Galaviz, Joyce C Ho

Systematic review (SR) is an essential process to identify, evaluate, and summarize the findings of all relevant individual studies concerning health-related questions. However, conducting a SR is labor-intensive, as identifying relevant studies is a daunting process that entails multiple researchers screening thousands of articles for relevance. In this paper, we propose MMiDaS-AE, a Multi-modal Missing Data aware Stacked Autoencoder, for semi-automating screening for SRs. We use a multi-modal view that exploits three representations, of: 1) documents, 2) topics, and 3) citation networks. Documents that contain similar words will be nearby in the document embedding space. Models can also exploit the relationship between documents and the associated SR MeSH terms to capture article relevancy. Finally, related works will likely share the same citations, and thus closely related articles would, intuitively, be trained to be close to each other in the embedding space. However, using all three learned representations as features directly result in an unwieldy number of parameters. Thus, motivated by recent work on multi-modal auto-encoders, we adopt a multi-modal stacked autoencoder that can learn a shared representation encoding all three representations in a compressed space. However, in practice one or more of these modalities may be missing for an article (e.g., if we cannot recover citation information). Therefore, we propose to learn to impute the shared representation even when specific inputs are missing. We find this new model significantly improves performance on a dataset consisting of 15 SRs compared to existing approaches.

系统综述（SR）是识别、评估和总结所有与健康相关问题的相关个体研究结果的重要过程。然而，进行SR是劳动密集型的，因为识别相关研究是一个艰巨的过程，需要多名研究人员筛选数千篇文章的相关性。在本文中，我们提出了MMiDaS AE，一种多模态缺失数据感知堆叠自动编码器，用于SR的半自动筛选。我们使用了一个多模态视图，它利用了以下三种表示：1）文档，2）主题和3）引用网络。包含相似单词的文档将位于文档嵌入空间的附近。模型还可以利用文档和相关联的SR-MeSH术语之间的关系来捕获文章相关性。最后，相关作品可能会共享相同的引文，因此，直观地说，密切相关的文章会被训练成在嵌入空间中彼此接近。然而，使用所有三种学习的表示作为特征直接导致参数数量的笨拙。因此，受最近关于多模态自动编码器的工作的启发，我们采用了一种多模态堆叠自动编码器，它可以学习在压缩空间中对所有三种表示进行编码的共享表示。然而，在实践中，一篇文章可能缺少其中一种或多种模式（例如，如果我们无法恢复引用信息）。因此，我们建议即使在缺少特定输入的情况下，也要学会估算共享表示。我们发现，与现有方法相比，这种新模型显著提高了由15个SR组成的数据集的性能。

{"title":"MMiDaS-AE: Multi-modal Missing Data aware Stacked Autoencoder for Biomedical Abstract Screening.","authors":"Eric W Lee, Byron C Wallace, Karla I Galaviz, Joyce C Ho","doi":"10.1145/3368555.3384463","DOIUrl":"10.1145/3368555.3384463","url":null,"abstract":"Systematic review (SR) is an essential process to identify, evaluate, and summarize the findings of all relevant individual studies concerning health-related questions. However, conducting a SR is labor-intensive, as identifying relevant studies is a daunting process that entails multiple researchers screening thousands of articles for relevance. In this paper, we propose MMiDaS-AE, a Multi-modal Missing Data aware Stacked Autoencoder, for semi-automating screening for SRs. We use a multi-modal view that exploits three representations, of: 1) documents, 2) topics, and 3) citation networks. Documents that contain similar words will be nearby in the document embedding space. Models can also exploit the relationship between documents and the associated SR MeSH terms to capture article relevancy. Finally, related works will likely share the same citations, and thus closely related articles would, intuitively, be trained to be close to each other in the embedding space. However, using all three learned representations as features directly result in an unwieldy number of parameters. Thus, motivated by recent work on multi-modal auto-encoders, we adopt a multi-modal stacked autoencoder that can learn a shared representation encoding all three representations in a compressed space. However, in practice one or more of these modalities may be missing for an article (e.g., if we cannot recover citation information). Therefore, we propose to learn to impute the shared representation even when specific inputs are missing. We find this new model significantly improves performance on a dataset consisting of 15 SRs compared to existing approaches.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3368555.3384463","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39226443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Variational Learning of Individual Survival Distributions. 个体生存分布的变分学习。

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-01 Epub Date: 2020-04-02 DOI: 10.1145/3368555.3384454

Zidi Xiu, Chenyang Tao, Ricardo Henao

The abundance of modern health data provides many opportunities for the use of machine learning techniques to build better statistical models to improve clinical decision making. Predicting time-to-event distributions, also known as survival analysis, plays a key role in many clinical applications. We introduce a variational time-to-event prediction model, named Variational Survival Inference (VSI), which builds upon recent advances in distribution learning techniques and deep neural networks. VSI addresses the challenges of non-parametric distribution estimation by (i) relaxing the restrictive modeling assumptions made in classical models, and (ii) efficiently handling the censored observations, i.e., events that occur outside the observation window, all within the variational framework. To validate the effectiveness of our approach, an extensive set of experiments on both synthetic and real-world datasets is carried out, showing improved performance relative to competing solutions.

丰富的现代健康数据为使用机器学习技术建立更好的统计模型以改进临床决策提供了许多机会。预测时间到事件的分布，也称为生存分析，在许多临床应用中发挥着关键作用。我们介绍了一种变分时间到事件预测模型，称为变分生存推理（VSI），该模型建立在分布学习技术和深度神经网络的最新进展之上。VSI通过（i）放宽经典模型中的限制性建模假设，以及（ii）有效地处理截尾观测，即观测窗口外发生的事件，来解决非参数分布估计的挑战，所有这些都在变分框架内。为了验证我们方法的有效性，在合成和真实世界的数据集上进行了一组广泛的实验，显示出相对于竞争解决方案的性能有所提高。

引用次数: 10

Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging. 隐藏分层导致医学成像机器学习中有临床意义的失败。

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-01 DOI: 10.1145/3368555.3384468

Luke Oakden-Rayner, Jared Dunnmon, Gustavo Carneiro, Christopher Ré

Machine learning models for medical image analysis often suffer from poor performance on important subsets of a population that are not identified during training or testing. For example, overall performance of a cancer detection model may be high, but the model may still consistently miss a rare but aggressive cancer subtype. We refer to this problem as hidden stratification, and observe that it results from incompletely describing the meaningful variation in a dataset. While hidden stratification can substantially reduce the clinical efficacy of machine learning models, its effects remain difficult to measure. In this work, we assess the utility of several possible techniques for measuring hidden stratification effects, and characterize these effects both via synthetic experiments on the CIFAR-100 benchmark dataset and on multiple real-world medical imaging datasets. Using these measurement techniques, we find evidence that hidden stratification can occur in unidentified imaging subsets with low prevalence, low label quality, subtle distinguishing features, or spurious correlates, and that it can result in relative performance differences of over 20% on clinically important subsets. Finally, we discuss the clinical implications of our findings, and suggest that evaluation of hidden stratification should be a critical component of any machine learning deployment in medical imaging.

用于医学图像分析的机器学习模型通常在训练或测试期间未识别的重要子集上表现不佳。例如，癌症检测模型的整体性能可能很高，但该模型仍然可能始终遗漏罕见但具有侵袭性的癌症亚型。我们将这个问题称为隐藏分层，并观察到它是由于不完全描述数据集中有意义的变化而导致的。虽然隐藏分层会大大降低机器学习模型的临床疗效，但其效果仍然难以衡量。在这项工作中，我们评估了几种可能用于测量隐藏分层效应的技术的效用，并通过在CIFAR-100基准数据集和多个现实世界医学成像数据集上的综合实验来表征这些效应。使用这些测量技术，我们发现证据表明，隐藏分层可能发生在低患病率、低标签质量、细微区别特征或虚假相关性的未识别成像子集中，并且它可能导致临床重要子集的相对性能差异超过20%。最后，我们讨论了我们的研究结果的临床意义，并建议隐藏分层的评估应该是医学成像中任何机器学习部署的关键组成部分。

{"title":"Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging.","authors":"Luke Oakden-Rayner, Jared Dunnmon, Gustavo Carneiro, Christopher Ré","doi":"10.1145/3368555.3384468","DOIUrl":"https://doi.org/10.1145/3368555.3384468","url":null,"abstract":"Machine learning models for medical image analysis often suffer from poor performance on important subsets of a population that are not identified during training or testing. For example, overall performance of a cancer detection model may be high, but the model may still consistently miss a rare but aggressive cancer subtype. We refer to this problem as hidden stratification, and observe that it results from incompletely describing the meaningful variation in a dataset. While hidden stratification can substantially reduce the clinical efficacy of machine learning models, its effects remain difficult to measure. In this work, we assess the utility of several possible techniques for measuring hidden stratification effects, and characterize these effects both via synthetic experiments on the CIFAR-100 benchmark dataset and on multiple real-world medical imaging datasets. Using these measurement techniques, we find evidence that hidden stratification can occur in unidentified imaging subsets with low prevalence, low label quality, subtle distinguishing features, or spurious correlates, and that it can result in relative performance differences of over 20% on clinically important subsets. Finally, we discuss the clinical implications of our findings, and suggest that evaluation of hidden stratification should be a critical component of any machine learning deployment in medical imaging.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3368555.3384468","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38608685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 268

Deidentification of free-text medical records using pre-trained bidirectional transformers. 使用预训练的双向变换器消除自由文本医疗记录的身份识别。

Proceedings of the ACM Conference on Health, Inference, and Learning

Pub Date : 2020-04-01 Epub Date: 2020-04-02 DOI: 10.1145/3368555.3384455

Alistair E W Johnson, Lucas Bulgarelli, Tom J Pollard

The ability of caregivers and investigators to share patient data is fundamental to many areas of clinical practice and biomedical research. Prior to sharing, it is often necessary to remove identifiers such as names, contact details, and dates in order to protect patient privacy. Deidentification, the process of removing identifiers, is challenging, however. High-quality annotated data for developing models is scarce; many target identifiers are highly heterogenous (for example, there are uncountable variations of patient names); and in practice anything less than perfect sensitivity may be considered a failure. As a result, patient data is often withheld when sharing would be beneficial, and identifiable patient data is often divulged when a deidentified version would suffice. In recent years, advances in machine learning methods have led to rapid performance improvements in natural language processing tasks, in particular with the advent of large-scale pretrained language models. In this paper we develop and evaluate an approach for deidentification of clinical notes based on a bidirectional transformer model. We propose human interpretable evaluation measures and demonstrate state of the art performance against modern baseline models. Finally, we highlight current challenges in deidentification, including the absence of clear annotation guidelines, lack of portability of models, and paucity of training data. Code to develop our model is open source, allowing for broad reuse.

护理人员和研究人员共享患者数据的能力是临床实践和生物医学研究许多领域的基础。在共享之前，为了保护患者隐私，通常有必要删除姓名、联系方式和日期等标识符。然而，去标识化，即去除标识符的过程，是一项具有挑战性的工作。用于开发模型的高质量注释数据非常稀缺；许多目标标识符具有高度异质性（例如，患者姓名的变化难以计数）；在实践中，任何不够完美的灵敏度都可能被视为失败。因此，当共享病人数据有益时，病人数据却往往被隐瞒；当去标识化版本就足够时，可识别的病人数据却往往被泄露。近年来，机器学习方法的进步使自然语言处理任务的性能迅速提高，特别是随着大规模预训练语言模型的出现。在本文中，我们开发并评估了一种基于双向转换器模型的临床笔记去标识化方法。我们提出了人类可解释的评估指标，并展示了与现代基线模型相比的最新性能。最后，我们强调了当前去标识化面临的挑战，包括缺乏明确的注释指南、模型缺乏可移植性以及训练数据匮乏。开发我们模型的代码是开源的，允许广泛重用。

{"title":"Deidentification of free-text medical records using pre-trained bidirectional transformers.","authors":"Alistair E W Johnson, Lucas Bulgarelli, Tom J Pollard","doi":"10.1145/3368555.3384455","DOIUrl":"10.1145/3368555.3384455","url":null,"abstract":"The ability of caregivers and investigators to share patient data is fundamental to many areas of clinical practice and biomedical research. Prior to sharing, it is often necessary to remove identifiers such as names, contact details, and dates in order to protect patient privacy. Deidentification, the process of removing identifiers, is challenging, however. High-quality annotated data for developing models is scarce; many target identifiers are highly heterogenous (for example, there are uncountable variations of patient names); and in practice anything less than perfect sensitivity may be considered a failure. As a result, patient data is often withheld when sharing would be beneficial, and identifiable patient data is often divulged when a deidentified version would suffice. In recent years, advances in machine learning methods have led to rapid performance improvements in natural language processing tasks, in particular with the advent of large-scale pretrained language models. In this paper we develop and evaluate an approach for deidentification of clinical notes based on a bidirectional transformer model. We propose human interpretable evaluation measures and demonstrate state of the art performance against modern baseline models. Finally, we highlight current challenges in deidentification, including the absence of clear annotation guidelines, lack of portability of models, and paucity of training data. Code to develop our model is open source, allowing for broad reuse.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/e7/a6/nihms-1679580.PMC8330601.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39277422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0