Machine Learning in Health Care最新文献

Latent Temporal Flows for Multivariate Analysis of Wearables Data 可穿戴设备数据多元分析的潜在时间流

Machine Learning in Health Care

Pub Date : 2022-10-14 DOI: 10.48550/arXiv.2210.07475

Magda Amiridi, Gregory Darnell, S. Jewell

Increased use of sensor signals from wearable devices as rich sources of physiological data has sparked growing interest in developing health monitoring systems to identify changes in an individual's health profile. Indeed, machine learning models for sensor signals have enabled a diverse range of healthcare related applications including early detection of abnormalities, fertility tracking, and adverse drug effect prediction. However, these models can fail to account for the dependent high-dimensional nature of the underlying sensor signals. In this paper, we introduce Latent Temporal Flows, a method for multivariate time-series modeling tailored to this setting. We assume that a set of sequences is generated from a multivariate probabilistic model of an unobserved time-varying low-dimensional latent vector. Latent Temporal Flows simultaneously recovers a transformation of the observed sequences into lower-dimensional latent representations via deep autoencoder mappings, and estimates a temporally-conditioned probabilistic model via normalizing flows. Using data from the Apple Heart and Movement Study (AH&MS), we illustrate promising forecasting performance on these challenging signals. Additionally, by analyzing two and three dimensional representations learned by our model, we show that we can identify participants' $text{VO}_2text{max}$, a main indicator and summary of cardio-respiratory fitness, using only lower-level signals. Finally, we show that the proposed method consistently outperforms the state-of-the-art in multi-step forecasting benchmarks (achieving at least a $10%$ performance improvement) on several real-world datasets, while enjoying increased computational efficiency.

越来越多地使用来自可穿戴设备的传感器信号作为丰富的生理数据来源，这激发了人们对开发健康监测系统以识别个人健康状况变化的兴趣。事实上，传感器信号的机器学习模型已经实现了各种与医疗保健相关的应用，包括早期检测异常、生育跟踪和药物不良反应预测。然而，这些模型可能无法解释底层传感器信号的依赖高维性质。在本文中，我们介绍了潜在时间流，这是一种针对这种情况量身定制的多变量时间序列建模方法。我们假设一组序列是由一个未观察到的时变低维潜在向量的多元概率模型生成的。潜在时间流同时通过深度自编码器映射恢复观察序列到低维潜在表示的转换，并通过归一化流估计时间条件概率模型。使用苹果心脏和运动研究(AH&MS)的数据，我们说明了在这些具有挑战性的信号上有希望的预测性能。此外，通过分析我们的模型学习到的二维和三维表征，我们表明我们可以识别参与者的$text{VO}_2text{max}$，这是心肺健康的主要指标和总结，仅使用较低水平的信号。最后，我们表明，在几个真实世界的数据集上，所提出的方法在多步预测基准中始终优于最先进的方法(实现至少10%的性能改进)，同时提高了计算效率。

{"title":"Latent Temporal Flows for Multivariate Analysis of Wearables Data","authors":"Magda Amiridi, Gregory Darnell, S. Jewell","doi":"10.48550/arXiv.2210.07475","DOIUrl":"https://doi.org/10.48550/arXiv.2210.07475","url":null,"abstract":"Increased use of sensor signals from wearable devices as rich sources of physiological data has sparked growing interest in developing health monitoring systems to identify changes in an individual's health profile. Indeed, machine learning models for sensor signals have enabled a diverse range of healthcare related applications including early detection of abnormalities, fertility tracking, and adverse drug effect prediction. However, these models can fail to account for the dependent high-dimensional nature of the underlying sensor signals. In this paper, we introduce Latent Temporal Flows, a method for multivariate time-series modeling tailored to this setting. We assume that a set of sequences is generated from a multivariate probabilistic model of an unobserved time-varying low-dimensional latent vector. Latent Temporal Flows simultaneously recovers a transformation of the observed sequences into lower-dimensional latent representations via deep autoencoder mappings, and estimates a temporally-conditioned probabilistic model via normalizing flows. Using data from the Apple Heart and Movement Study (AH&MS), we illustrate promising forecasting performance on these challenging signals. Additionally, by analyzing two and three dimensional representations learned by our model, we show that we can identify participants' $text{VO}_2text{max}$, a main indicator and summary of cardio-respiratory fitness, using only lower-level signals. Finally, we show that the proposed method consistently outperforms the state-of-the-art in multi-step forecasting benchmarks (achieving at least a $10%$ performance improvement) on several real-world datasets, while enjoying increased computational efficiency.","PeriodicalId":231229,"journal":{"name":"Machine Learning in Health Care","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126884090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Ensembling Neural Networks for Improved Prediction and Privacy in Early Diagnosis of Sepsis 集成神经网络在脓毒症早期诊断中的预测和隐私改进

Machine Learning in Health Care

Pub Date : 2022-09-01 DOI: 10.48550/arXiv.2209.00439

Shigehiko Schamoni, M. Hagmann, S. Riezler

Ensembling neural networks is a long-standing technique for improving the generalization error of neural networks by combining networks with orthogonal properties via a committee decision. We show that this technique is an ideal fit for machine learning on medical data: First, ensembles are amenable to parallel and asynchronous learning, thus enabling efficient training of patient-specific component neural networks. Second, building on the idea of minimizing generalization error by selecting uncorrelated patient-specific networks, we show that one can build an ensemble of a few selected patient-specific models that outperforms a single model trained on much larger pooled datasets. Third, the non-iterative ensemble combination step is an optimal low-dimensional entry point to apply output perturbation to guarantee the privacy of the patient-specific networks. We exemplify our framework of differentially private ensembles on the task of early prediction of sepsis, using real-life intensive care unit data labeled by clinical experts.

集成神经网络是一种长期存在的技术，它通过委员会决策将具有正交特性的神经网络组合在一起，以改善神经网络的泛化误差。我们表明，这种技术非常适合医疗数据上的机器学习:首先，集成可以进行并行和异步学习，从而能够有效地训练特定于患者的组件神经网络。其次，基于通过选择不相关的特定患者网络来最小化泛化误差的想法，我们表明可以构建几个特定患者模型的集合，其性能优于在更大的池数据集上训练的单个模型。第三，非迭代集成组合步骤是应用输出摄动来保证特定患者网络隐私的最佳低维入口点。我们举例说明了我们的框架不同的私人集合对脓毒症的早期预测任务，使用临床专家标记的现实生活重症监护病房数据。

引用次数: 1

Development and Validation of ML-DQA - a Machine Learning Data Quality Assurance Framework for Healthcare ML-DQA的开发和验证——医疗保健机器学习数据质量保证框架

Machine Learning in Health Care

Pub Date : 2022-08-04 DOI: 10.48550/arXiv.2208.02670

M. Sendak, Gaurav Sirdeshmukh, Timothy N. Ochoa, H. Premo, Linda Tang, Kira L. Niederhoffer, Sarah Reed, Kaivalya Deshpande, E. Sterrett, M. Bauer, L. Snyder, Afreen I. Shariff, D. Whellan, J. Riggio, D. Gaieski, Kristin M Corey, Megan Richards, M. Gao, M. Nichols, Bradley Heintze, William S Knechtle, W. Ratliff, S. Balu

The approaches by which the machine learning and clinical research communities utilize real world data (RWD), including data captured in the electronic health record (EHR), vary dramatically. While clinical researchers cautiously use RWD for clinical investigations, ML for healthcare teams consume public datasets with minimal scrutiny to develop new algorithms. This study bridges this gap by developing and validating ML-DQA, a data quality assurance framework grounded in RWD best practices. The ML-DQA framework is applied to five ML projects across two geographies, different medical conditions, and different cohorts. A total of 2,999 quality checks and 24 quality reports were generated on RWD gathered on 247,536 patients across the five projects. Five generalizable practices emerge: all projects used a similar method to group redundant data element representations; all projects used automated utilities to build diagnosis and medication data elements; all projects used a common library of rules-based transformations; all projects used a unified approach to assign data quality checks to data elements; and all projects used a similar approach to clinical adjudication. An average of 5.8 individuals, including clinicians, data scientists, and trainees, were involved in implementing ML-DQA for each project and an average of 23.4 data elements per project were either transformed or removed in response to ML-DQA. This study demonstrates the importance role of ML-DQA in healthcare projects and provides teams a framework to conduct these essential activities.

机器学习和临床研究界利用现实世界数据(RWD)的方法，包括电子健康记录(EHR)中捕获的数据，差异很大。虽然临床研究人员谨慎地使用RWD进行临床调查，但医疗团队的ML使用公共数据集，以最少的审查来开发新算法。本研究通过开发和验证基于RWD最佳实践的数据质量保证框架ML-DQA，弥合了这一差距。ML- dqa框架应用于跨越两个地区、不同医疗条件和不同队列的五个ML项目。五个项目共收集了247,536名患者的RWD，共生成了2,999次质量检查和24份质量报告。出现了五个可推广的实践:所有项目都使用类似的方法对冗余数据元素表示进行分组;所有项目都使用自动化实用程序来构建诊断和药物数据元素;所有项目都使用基于规则的转换的公共库;所有项目都使用统一的方法为数据元素分配数据质量检查;所有的项目都使用了类似的方法来进行临床裁决。平均5.8个人，包括临床医生、数据科学家和学员，参与每个项目的ML-DQA实施，每个项目平均有23.4个数据元素被转换或删除，以响应ML-DQA。本研究证明了ML-DQA在医疗保健项目中的重要作用，并为团队提供了执行这些基本活动的框架。

{"title":"Development and Validation of ML-DQA - a Machine Learning Data Quality Assurance Framework for Healthcare","authors":"M. Sendak, Gaurav Sirdeshmukh, Timothy N. Ochoa, H. Premo, Linda Tang, Kira L. Niederhoffer, Sarah Reed, Kaivalya Deshpande, E. Sterrett, M. Bauer, L. Snyder, Afreen I. Shariff, D. Whellan, J. Riggio, D. Gaieski, Kristin M Corey, Megan Richards, M. Gao, M. Nichols, Bradley Heintze, William S Knechtle, W. Ratliff, S. Balu","doi":"10.48550/arXiv.2208.02670","DOIUrl":"https://doi.org/10.48550/arXiv.2208.02670","url":null,"abstract":"The approaches by which the machine learning and clinical research communities utilize real world data (RWD), including data captured in the electronic health record (EHR), vary dramatically. While clinical researchers cautiously use RWD for clinical investigations, ML for healthcare teams consume public datasets with minimal scrutiny to develop new algorithms. This study bridges this gap by developing and validating ML-DQA, a data quality assurance framework grounded in RWD best practices. The ML-DQA framework is applied to five ML projects across two geographies, different medical conditions, and different cohorts. A total of 2,999 quality checks and 24 quality reports were generated on RWD gathered on 247,536 patients across the five projects. Five generalizable practices emerge: all projects used a similar method to group redundant data element representations; all projects used automated utilities to build diagnosis and medication data elements; all projects used a common library of rules-based transformations; all projects used a unified approach to assign data quality checks to data elements; and all projects used a similar approach to clinical adjudication. An average of 5.8 individuals, including clinicians, data scientists, and trainees, were involved in implementing ML-DQA for each project and an average of 23.4 data elements per project were either transformed or removed in response to ML-DQA. This study demonstrates the importance role of ML-DQA in healthcare projects and provides teams a framework to conduct these essential activities.","PeriodicalId":231229,"journal":{"name":"Machine Learning in Health Care","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114699339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

HiCu: Leveraging Hierarchy for Curriculum Learning in Automated ICD Coding HiCu:在自动化ICD编码中利用层次结构进行课程学习

Machine Learning in Health Care

Pub Date : 2022-08-03 DOI: 10.48550/arXiv.2208.02301

Weiming Ren, Ruijing Zeng, Tong Wu, Tianshu Zhu, R. G. Krishnan

There are several opportunities for automation in healthcare that can improve clinician throughput. One such example is assistive tools to document diagnosis codes when clinicians write notes. We study the automation of medical code prediction using curriculum learning, which is a training strategy for machine learning models that gradually increases the hardness of the learning tasks from easy to difficult. One of the challenges in curriculum learning is the design of curricula -- i.e., in the sequential design of tasks that gradually increase in difficulty. We propose Hierarchical Curriculum Learning (HiCu), an algorithm that uses graph structure in the space of outputs to design curricula for multi-label classification. We create curricula for multi-label classification models that predict ICD diagnosis and procedure codes from natural language descriptions of patients. By leveraging the hierarchy of ICD codes, which groups diagnosis codes based on various organ systems in the human body, we find that our proposed curricula improve the generalization of neural network-based predictive models across recurrent, convolutional, and transformer-based architectures. Our code is available at https://github.com/wren93/HiCu-ICD.

在医疗保健领域有几个实现自动化的机会，可以提高临床医生的吞吐量。其中一个例子是临床医生写笔记时记录诊断代码的辅助工具。我们使用课程学习来研究医疗代码预测的自动化，课程学习是一种机器学习模型的训练策略，它逐渐将学习任务的难度从简单增加到困难。课程学习的挑战之一是课程的设计，即在逐步增加难度的任务的顺序设计中。我们提出了分层课程学习(HiCu)，一种利用输出空间中的图结构来设计多标签分类课程的算法。我们为多标签分类模型创建课程，从患者的自然语言描述中预测ICD诊断和程序代码。通过利用ICD代码的层次结构，将基于人体各种器官系统的诊断代码分组，我们发现我们提出的课程提高了基于神经网络的预测模型在循环、卷积和基于变压器的架构中的泛化。我们的代码可在https://github.com/wren93/HiCu-ICD上获得。

{"title":"HiCu: Leveraging Hierarchy for Curriculum Learning in Automated ICD Coding","authors":"Weiming Ren, Ruijing Zeng, Tong Wu, Tianshu Zhu, R. G. Krishnan","doi":"10.48550/arXiv.2208.02301","DOIUrl":"https://doi.org/10.48550/arXiv.2208.02301","url":null,"abstract":"There are several opportunities for automation in healthcare that can improve clinician throughput. One such example is assistive tools to document diagnosis codes when clinicians write notes. We study the automation of medical code prediction using curriculum learning, which is a training strategy for machine learning models that gradually increases the hardness of the learning tasks from easy to difficult. One of the challenges in curriculum learning is the design of curricula -- i.e., in the sequential design of tasks that gradually increase in difficulty. We propose Hierarchical Curriculum Learning (HiCu), an algorithm that uses graph structure in the space of outputs to design curricula for multi-label classification. We create curricula for multi-label classification models that predict ICD diagnosis and procedure codes from natural language descriptions of patients. By leveraging the hierarchy of ICD codes, which groups diagnosis codes based on various organ systems in the human body, we find that our proposed curricula improve the generalization of neural network-based predictive models across recurrent, convolutional, and transformer-based architectures. Our code is available at https://github.com/wren93/HiCu-ICD.","PeriodicalId":231229,"journal":{"name":"Machine Learning in Health Care","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131566828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Weakly Supervised Deep Instance Nuclei Detection using Points Annotation in 3D Cardiovascular Immunofluorescent Images 基于点注释的三维心血管免疫荧光图像弱监督深度核检测

Machine Learning in Health Care

Pub Date : 2022-07-29 DOI: 10.48550/arXiv.2208.00098

Nazanin Moradinasab, Y. Sharma, Laura S. Shankman, G. Owens, Donald E. Brown

Two major causes of death in the United States and worldwide are stroke and myocardial infarction. The underlying cause of both is thrombi released from ruptured or eroded unstable atherosclerotic plaques that occlude vessels in the heart (myocardial infarction) or the brain (stroke). Clinical studies show that plaque composition plays a more important role than lesion size in plaque rupture or erosion events. To determine the plaque composition, various cell types in 3D cardiovascular immunofluorescent images of plaque lesions are counted. However, counting these cells manually is expensive, time-consuming, and prone to human error. These challenges of manual counting motivate the need for an automated approach to localize and count the cells in images. The purpose of this study is to develop an automatic approach to accurately detect and count cells in 3D immunofluorescent images with minimal annotation effort. In this study, we used a weakly supervised learning approach to train the HoVer-Net segmentation model using point annotations to detect nuclei in fluorescent images. The advantage of using point annotations is that they require less effort as opposed to pixel-wise annotation. To train the HoVer-Net model using point annotations, we adopted a popularly used cluster labeling approach to transform point annotations into accurate binary masks of cell nuclei. Traditionally, these approaches have generated binary masks from point annotations, leaving a region around the object unlabeled (which is typically ignored during model training). However, these areas may contain important information that helps determine the boundary between cells. Therefore, we used the entropy minimization loss function in these areas to encourage the model to output more confident predictions on the unlabeled areas. Our comparison studies indicate that the HoVer-Net model trained using our weakly ...

在美国和世界范围内，导致死亡的两大原因是中风和心肌梗塞。两者的根本原因都是由破裂或侵蚀不稳定的动脉粥样硬化斑块释放的血栓，这些斑块阻塞了心脏(心肌梗死)或大脑(中风)的血管。临床研究表明，在斑块破裂或糜烂事件中，斑块组成比斑块大小起着更重要的作用。为了确定斑块组成，对斑块病变的三维心血管免疫荧光图像中的各种细胞类型进行计数。然而，手动计算这些单元格既昂贵又耗时，而且容易出现人为错误。人工计数的这些挑战促使人们需要一种自动化的方法来定位和计数图像中的细胞。本研究的目的是开发一种自动方法，以最小的注释努力准确地检测和计数3D免疫荧光图像中的细胞。在这项研究中，我们使用弱监督学习方法来训练HoVer-Net分割模型，使用点注释来检测荧光图像中的核。使用点注释的优点是，与逐像素注释相比，它们需要的工作量更少。为了使用点标注训练HoVer-Net模型，我们采用了一种常用的聚类标记方法，将点标注转化为精确的细胞核二值掩模。传统上，这些方法从点注释生成二进制掩码，在对象周围留下未标记的区域(在模型训练期间通常会忽略这一点)。然而，这些区域可能包含有助于确定细胞之间边界的重要信息。因此，我们在这些区域中使用熵最小化损失函数来鼓励模型在未标记区域上输出更有信心的预测。我们的比较研究表明，使用我们的弱…

{"title":"Weakly Supervised Deep Instance Nuclei Detection using Points Annotation in 3D Cardiovascular Immunofluorescent Images","authors":"Nazanin Moradinasab, Y. Sharma, Laura S. Shankman, G. Owens, Donald E. Brown","doi":"10.48550/arXiv.2208.00098","DOIUrl":"https://doi.org/10.48550/arXiv.2208.00098","url":null,"abstract":"Two major causes of death in the United States and worldwide are stroke and myocardial infarction. The underlying cause of both is thrombi released from ruptured or eroded unstable atherosclerotic plaques that occlude vessels in the heart (myocardial infarction) or the brain (stroke). Clinical studies show that plaque composition plays a more important role than lesion size in plaque rupture or erosion events. To determine the plaque composition, various cell types in 3D cardiovascular immunofluorescent images of plaque lesions are counted. However, counting these cells manually is expensive, time-consuming, and prone to human error. These challenges of manual counting motivate the need for an automated approach to localize and count the cells in images. The purpose of this study is to develop an automatic approach to accurately detect and count cells in 3D immunofluorescent images with minimal annotation effort. In this study, we used a weakly supervised learning approach to train the HoVer-Net segmentation model using point annotations to detect nuclei in fluorescent images. The advantage of using point annotations is that they require less effort as opposed to pixel-wise annotation. To train the HoVer-Net model using point annotations, we adopted a popularly used cluster labeling approach to transform point annotations into accurate binary masks of cell nuclei. Traditionally, these approaches have generated binary masks from point annotations, leaving a region around the object unlabeled (which is typically ignored during model training). However, these areas may contain important information that helps determine the boundary between cells. Therefore, we used the entropy minimization loss function in these areas to encourage the model to output more confident predictions on the unlabeled areas. Our comparison studies indicate that the HoVer-Net model trained using our weakly ...","PeriodicalId":231229,"journal":{"name":"Machine Learning in Health Care","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126676141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Debiasing Deep Chest X-Ray Classifiers using Intra- and Post-processing Methods 使用内处理和后处理方法去偏深胸部x线分类器

Machine Learning in Health Care

Pub Date : 2022-07-26 DOI: 10.48550/arXiv.2208.00781

Ricards Marcinkevics, Ece Ozkan, Julia E. Vogt

Deep neural networks for image-based screening and computer-aided diagnosis have achieved expert-level performance on various medical imaging modalities, including chest radiographs. Recently, several works have indicated that these state-of-the-art classifiers can be biased with respect to sensitive patient attributes, such as race or gender, leading to growing concerns about demographic disparities and discrimination resulting from algorithmic and model-based decision-making in healthcare. Fair machine learning has focused on mitigating such biases against disadvantaged or marginalised groups, mainly concentrating on tabular data or natural images. This work presents two novel intra-processing techniques based on fine-tuning and pruning an already-trained neural network. These methods are simple yet effective and can be readily applied post hoc in a setting where the protected attribute is unknown during the model development and test time. In addition, we compare several intra- and post-processing approaches applied to debiasing deep chest X-ray classifiers. To the best of our knowledge, this is one of the first efforts studying debiasing methods on chest radiographs. Our results suggest that the considered approaches successfully mitigate biases in fully connected and convolutional neural networks offering stable performance under various settings. The discussed methods can help achieve group fairness of deep medical image classifiers when deploying them in domains with different fairness considerations and constraints.

用于基于图像的筛查和计算机辅助诊断的深度神经网络在各种医学成像模式(包括胸部x线片)上达到了专家级的性能。最近，一些研究表明，这些最先进的分类器可能会对敏感的患者属性(如种族或性别)产生偏见，从而导致人们越来越关注医疗保健中基于算法和模型的决策所导致的人口差异和歧视。公平的机器学习侧重于减轻对弱势或边缘群体的偏见，主要集中在表格数据或自然图像上。这项工作提出了两种新的基于微调和修剪已经训练好的神经网络的内部处理技术。这些方法简单而有效，并且可以很容易地在模型开发和测试期间被保护属性未知的情况下应用。此外，我们比较了几种用于去除胸部深x线分类器的内处理和后处理方法。据我们所知，这是研究胸部x线片去偏方法的首次努力之一。我们的研究结果表明，所考虑的方法成功地减轻了全连接和卷积神经网络中的偏差，在各种设置下提供稳定的性能。当将深度医学图像分类器部署在具有不同公平性考虑和约束条件的领域时，所讨论的方法有助于实现深度医学图像分类器的分组公平性。

{"title":"Debiasing Deep Chest X-Ray Classifiers using Intra- and Post-processing Methods","authors":"Ricards Marcinkevics, Ece Ozkan, Julia E. Vogt","doi":"10.48550/arXiv.2208.00781","DOIUrl":"https://doi.org/10.48550/arXiv.2208.00781","url":null,"abstract":"Deep neural networks for image-based screening and computer-aided diagnosis have achieved expert-level performance on various medical imaging modalities, including chest radiographs. Recently, several works have indicated that these state-of-the-art classifiers can be biased with respect to sensitive patient attributes, such as race or gender, leading to growing concerns about demographic disparities and discrimination resulting from algorithmic and model-based decision-making in healthcare. Fair machine learning has focused on mitigating such biases against disadvantaged or marginalised groups, mainly concentrating on tabular data or natural images. This work presents two novel intra-processing techniques based on fine-tuning and pruning an already-trained neural network. These methods are simple yet effective and can be readily applied post hoc in a setting where the protected attribute is unknown during the model development and test time. In addition, we compare several intra- and post-processing approaches applied to debiasing deep chest X-ray classifiers. To the best of our knowledge, this is one of the first efforts studying debiasing methods on chest radiographs. Our results suggest that the considered approaches successfully mitigate biases in fully connected and convolutional neural networks offering stable performance under various settings. The discussed methods can help achieve group fairness of deep medical image classifiers when deploying them in domains with different fairness considerations and constraints.","PeriodicalId":231229,"journal":{"name":"Machine Learning in Health Care","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125037440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Density-Aware Personalized Training for Risk Prediction in Imbalanced Medical Data 不平衡医疗数据中风险预测的密度感知个性化训练

Machine Learning in Health Care

Pub Date : 2022-07-23 DOI: 10.48550/arXiv.2207.11382

Zepeng Huo, Xiaoning Qian, Shuai Huang, Zhangyang Wang, Bobak J. Mortazavi

Medical events of interest, such as mortality, often happen at a low rate in electronic medical records, as most admitted patients survive. Training models with this imbalance rate (class density discrepancy) may lead to suboptimal prediction. Traditionally this problem is addressed through ad-hoc methods such as resampling or reweighting but performance in many cases is still limited. We propose a framework for training models for this imbalance issue: 1) we first decouple the feature extraction and classification process, adjusting training batches separately for each component to mitigate bias caused by class density discrepancy; 2) we train the network with both a density-aware loss and a learnable cost matrix for misclassifications. We demonstrate our model's improved performance in real-world medical datasets (TOPCAT and MIMIC-III) to show improved AUC-ROC, AUC-PRC, Brier Skill Score compared with the baselines in the domain.

令人感兴趣的医疗事件(如死亡率)在电子医疗记录中的发生率通常很低，因为大多数住院患者都存活了下来。具有这种不平衡率(类密度差异)的训练模型可能导致次优预测。传统上，这个问题是通过重新采样或重新加权等特殊方法来解决的，但在许多情况下，性能仍然有限。针对这种不平衡问题，我们提出了一个训练模型框架:1)我们首先解耦特征提取和分类过程，分别调整每个组件的训练批次，以减轻类密度差异引起的偏差;2)我们用密度感知损失和可学习的错误分类代价矩阵来训练网络。我们在现实世界的医疗数据集(TOPCAT和MIMIC-III)中展示了我们的模型的改进性能，以显示与领域中的基线相比，AUC-ROC, AUC-PRC, Brier Skill Score得到了改进。

引用次数: 1

ICE-NODE: Integration of Clinical Embeddings with Neural Ordinary Differential Equations ICE-NODE:临床嵌入与神经常微分方程的集成

Machine Learning in Health Care

Pub Date : 2022-07-05 DOI: 10.48550/arXiv.2207.01873

Asem Alaa, Erik Mayer, Mauricio Barahona

Early diagnosis of disease can lead to improved health outcomes, including higher survival rates and lower treatment costs. With the massive amount of information available in electronic health records (EHRs), there is great potential to use machine learning (ML) methods to model disease progression aimed at early prediction of disease onset and other outcomes. In this work, we employ recent innovations in neural ODEs combined with rich semantic embeddings of clinical codes to harness the full temporal information of EHRs. We propose ICE-NODE (Integration of Clinical Embeddings with Neural Ordinary Differential Equations), an architecture that temporally integrates embeddings of clinical codes and neural ODEs to learn and predict patient trajectories in EHRs. We apply our method to the publicly available MIMIC-III and MIMIC-IV datasets, and we find improved prediction results compared to state-of-the-art methods, specifically for clinical codes that are not frequently observed in EHRs. We also show that ICE-NODE is more competent at predicting certain medical conditions, like acute renal failure, pulmonary heart disease and birth-related problems, where the full temporal information could provide important information. Furthermore, ICE-NODE is also able to produce patient risk trajectories over time that can be exploited for further detailed predictions of disease evolution.

疾病的早期诊断可以改善健康结果，包括提高存活率和降低治疗费用。随着电子健康记录(EHRs)中可用的大量信息，使用机器学习(ML)方法模拟疾病进展的潜力很大，旨在早期预测疾病发作和其他结果。在这项工作中，我们采用了神经ode的最新创新，结合临床代码的丰富语义嵌入来利用电子病历的完整时间信息。我们提出了ICE-NODE(集成临床嵌入与神经常微分方程)，这是一个暂时集成临床代码和神经ode嵌入的架构，以学习和预测电子病历中的患者轨迹。我们将我们的方法应用于公开可用的MIMIC-III和MIMIC-IV数据集，我们发现与最先进的方法相比，预测结果有所改善，特别是对于在电子病历中不经常观察到的临床代码。我们还表明，ICE-NODE在预测某些医疗状况方面更有能力，如急性肾功能衰竭、肺源性心脏病和出生相关问题，在这些方面，完整的时间信息可以提供重要的信息。此外，ICE-NODE还能够产生随时间推移的患者风险轨迹，可用于进一步详细预测疾病演变。

{"title":"ICE-NODE: Integration of Clinical Embeddings with Neural Ordinary Differential Equations","authors":"Asem Alaa, Erik Mayer, Mauricio Barahona","doi":"10.48550/arXiv.2207.01873","DOIUrl":"https://doi.org/10.48550/arXiv.2207.01873","url":null,"abstract":"Early diagnosis of disease can lead to improved health outcomes, including higher survival rates and lower treatment costs. With the massive amount of information available in electronic health records (EHRs), there is great potential to use machine learning (ML) methods to model disease progression aimed at early prediction of disease onset and other outcomes. In this work, we employ recent innovations in neural ODEs combined with rich semantic embeddings of clinical codes to harness the full temporal information of EHRs. We propose ICE-NODE (Integration of Clinical Embeddings with Neural Ordinary Differential Equations), an architecture that temporally integrates embeddings of clinical codes and neural ODEs to learn and predict patient trajectories in EHRs. We apply our method to the publicly available MIMIC-III and MIMIC-IV datasets, and we find improved prediction results compared to state-of-the-art methods, specifically for clinical codes that are not frequently observed in EHRs. We also show that ICE-NODE is more competent at predicting certain medical conditions, like acute renal failure, pulmonary heart disease and birth-related problems, where the full temporal information could provide important information. Furthermore, ICE-NODE is also able to produce patient risk trajectories over time that can be exploited for further detailed predictions of disease evolution.","PeriodicalId":231229,"journal":{"name":"Machine Learning in Health Care","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124515693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Classifying Unstructured Clinical Notes via Automatic Weak Supervision 基于自动弱监督的非结构化临床笔记分类

Machine Learning in Health Care

Pub Date : 2022-06-24 DOI: 10.48550/arXiv.2206.12088

Chufan Gao, Mononito Goswami, Jieshi Chen, A. Dubrawski

Healthcare providers usually record detailed notes of the clinical care delivered to each patient for clinical, research, and billing purposes. Due to the unstructured nature of these narratives, providers employ dedicated staff to assign diagnostic codes to patients' diagnoses using the International Classification of Diseases (ICD) coding system. This manual process is not only time-consuming but also costly and error-prone. Prior work demonstrated potential utility of Machine Learning (ML) methodology in automating this process, but it has relied on large quantities of manually labeled data to train the models. Additionally, diagnostic coding systems evolve with time, which makes traditional supervised learning strategies unable to generalize beyond local applications. In this work, we introduce a general weakly-supervised text classification framework that learns from class-label descriptions only, without the need to use any human-labeled documents. It leverages the linguistic domain knowledge stored within pre-trained language models and the data programming framework to assign code labels to individual texts. We demonstrate the efficacy and flexibility of our method by comparing it to state-of-the-art weak text classifiers across four real-world text classification datasets, in addition to assigning ICD codes to medical notes in the publicly available MIMIC-III database.

医疗保健提供者通常为临床、研究和计费目的记录向每位患者提供的临床护理的详细记录。由于这些叙述的非结构化性质，提供者雇用专门的工作人员使用国际疾病分类(ICD)编码系统为患者的诊断分配诊断代码。这个手工过程不仅耗时，而且成本高，而且容易出错。先前的工作证明了机器学习(ML)方法在自动化这一过程中的潜在效用，但它依赖于大量手动标记的数据来训练模型。此外，诊断编码系统随着时间的推移而发展，这使得传统的监督学习策略无法推广到局部应用之外。在这项工作中，我们引入了一个通用的弱监督文本分类框架，它只从类标签描述中学习，而不需要使用任何人工标记的文档。它利用存储在预训练语言模型中的语言领域知识和数据编程框架为单个文本分配代码标签。除了将ICD代码分配给公开可用的MIMIC-III数据库中的医疗记录外，我们还将该方法与四个现实世界文本分类数据集中最先进的弱文本分类器进行比较，从而证明了该方法的有效性和灵活性。

{"title":"Classifying Unstructured Clinical Notes via Automatic Weak Supervision","authors":"Chufan Gao, Mononito Goswami, Jieshi Chen, A. Dubrawski","doi":"10.48550/arXiv.2206.12088","DOIUrl":"https://doi.org/10.48550/arXiv.2206.12088","url":null,"abstract":"Healthcare providers usually record detailed notes of the clinical care delivered to each patient for clinical, research, and billing purposes. Due to the unstructured nature of these narratives, providers employ dedicated staff to assign diagnostic codes to patients' diagnoses using the International Classification of Diseases (ICD) coding system. This manual process is not only time-consuming but also costly and error-prone. Prior work demonstrated potential utility of Machine Learning (ML) methodology in automating this process, but it has relied on large quantities of manually labeled data to train the models. Additionally, diagnostic coding systems evolve with time, which makes traditional supervised learning strategies unable to generalize beyond local applications. In this work, we introduce a general weakly-supervised text classification framework that learns from class-label descriptions only, without the need to use any human-labeled documents. It leverages the linguistic domain knowledge stored within pre-trained language models and the data programming framework to assign code labels to individual texts. We demonstrate the efficacy and flexibility of our method by comparing it to state-of-the-art weak text classifiers across four real-world text classification datasets, in addition to assigning ICD codes to medical notes in the publicly available MIMIC-III database.","PeriodicalId":231229,"journal":{"name":"Machine Learning in Health Care","volume":"109 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123159700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

auton-survival: an Open-Source Package for Regression, Counterfactual Estimation, Evaluation and Phenotyping with Censored Time-to-Event Data 自我生存:一个开源软件包，用于回归，反事实估计，评估和表型与审查时间到事件的数据

Machine Learning in Health Care

Pub Date : 2022-04-15 DOI: 10.48550/arXiv.2204.07276

Chirag Nagpal, Willa Potosnak, A. Dubrawski

Applications of machine learning in healthcare often require working with time-to-event prediction tasks including prognostication of an adverse event, re-hospitalization or death. Such outcomes are typically subject to censoring due to loss of follow up. Standard machine learning methods cannot be applied in a straightforward manner to datasets with censored outcomes. In this paper, we present auton-survival, an open-source repository of tools to streamline working with censored time-to-event or survival data. auton-survival includes tools for survival regression, adjustment in the presence of domain shift, counterfactual estimation, phenotyping for risk stratification, evaluation, as well as estimation of treatment effects. Through real world case studies employing a large subset of the SEER oncology incidence data, we demonstrate the ability of auton-survival to rapidly support data scientists in answering complex health and epidemiological questions.

机器学习在医疗保健中的应用通常需要处理事件时间预测任务，包括不良事件、再次住院或死亡的预测。由于缺乏随访，这些结果通常会受到审查。标准的机器学习方法不能以一种直接的方式应用于具有审查结果的数据集。在本文中，我们提出了自动生存，这是一个开源的工具存储库，用于简化处理审查的时间到事件或生存数据。自我生存包括生存回归、领域转移时的调整、反事实估计、风险分层的表型、评估以及治疗效果估计等工具。通过使用大量SEER肿瘤发病率数据的现实世界案例研究，我们展示了自主生存的能力，可以快速支持数据科学家回答复杂的健康和流行病学问题。

引用次数: 8