首页 > 最新文献

Proceedings of the ACM Conference on Health, Inference, and Learning最新文献

英文 中文
MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering MedMCQA:用于医学领域问答的大规模多主题多选择数据集
Pub Date : 2022-03-27 DOI: 10.48550/arXiv.2203.14371
Ankit Pal, Logesh Kumar Umapathi, Malaikannan Sankarasubbu
This paper introduces MedMCQA, a new large-scale, Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions. More than 194k high-quality AIIMS &NEET PG entrance exam MCQs covering 2.4k healthcare topics and 21 medical subjects are collected with an average token length of 12.77 and high topical diversity. Each sample contains a question, correct answer(s), and other options which requires a deeper language understanding as it tests the 10+ reasoning abilities of a model across a wide range of medical subjects &topics. A detailed explanation of the solution, along with the above information, is provided in this study.
本文介绍了MedMCQA,一个新的大规模选择题答案(MCQA)数据集,旨在解决现实世界的医学入学考试问题。收集了超过194,000个高质量的AIIMS &NEET PG入学考试mcq,涵盖24,000个医疗保健主题和21个医学科目,平均令牌长度为12.77,主题多样性高。每个样本包含一个问题、正确答案和其他选项,这些选项需要更深入的语言理解,因为它测试了模型在广泛的医学科目和主题上的10+推理能力。本研究提供了解决方案的详细解释以及上述信息。
{"title":"MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering","authors":"Ankit Pal, Logesh Kumar Umapathi, Malaikannan Sankarasubbu","doi":"10.48550/arXiv.2203.14371","DOIUrl":"https://doi.org/10.48550/arXiv.2203.14371","url":null,"abstract":"This paper introduces MedMCQA, a new large-scale, Multiple-Choice Question Answering (MCQA) dataset designed to address real-world medical entrance exam questions. More than 194k high-quality AIIMS &NEET PG entrance exam MCQs covering 2.4k healthcare topics and 21 medical subjects are collected with an average token length of 12.77 and high topical diversity. Each sample contains a question, correct answer(s), and other options which requires a deeper language understanding as it tests the 10+ reasoning abilities of a model across a wide range of medical subjects &topics. A detailed explanation of the solution, along with the above information, is provided in this study.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82120766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 70
Improving the Fairness of Chest X-ray Classifiers 提高胸部x线分类器的公平性
Pub Date : 2022-03-23 DOI: 10.48550/arXiv.2203.12609
Haoran Zhang, Natalie Dullerud, Karsten Roth, Lauren Oakden-Rayner, S. Pfohl, M. Ghassemi
Deep learning models have reached or surpassed human-level performance in the field of medical imaging, especially in disease diagnosis using chest x-rays. However, prior work has found that such classifiers can exhibit biases in the form of gaps in predictive performance across protected groups. In this paper, we question whether striving to achieve zero disparities in predictive performance (i.e. group fairness) is the appropriate fairness definition in the clinical setting, over minimax fairness, which focuses on maximizing the performance of the worst-case group. We benchmark the performance of nine methods in improving classifier fairness across these two definitions. We find, consistent with prior work on non-clinical data, that methods which strive to achieve better worst-group performance do not outperform simple data balancing. We also find that methods which achieve group fairness do so by worsening performance for all groups. In light of these results, we discuss the utility of fairness definitions in the clinical setting, advocating for an investigation of the bias-inducing mechanisms in the underlying data generating process whenever possible.
深度学习模型在医学成像领域的表现已经达到或超过了人类水平,特别是在使用胸部x光进行疾病诊断方面。然而,先前的研究发现,这种分类器可能会在受保护的群体中表现出预测性能差距的形式。在本文中,我们质疑在临床环境中,努力实现预测绩效的零差异(即群体公平)是否合适的公平定义,而不是最小化最大公平,其重点是最大化最差情况组的绩效。我们对这两种定义中提高分类器公平性的九种方法的性能进行了基准测试。我们发现,与之前在非临床数据上的工作一致,努力实现更好的最差组性能的方法并不优于简单的数据平衡。我们还发现,实现群体公平的方法是通过降低所有群体的表现来实现的。鉴于这些结果,我们讨论了公平性定义在临床环境中的效用,主张尽可能调查潜在数据生成过程中的偏见诱发机制。
{"title":"Improving the Fairness of Chest X-ray Classifiers","authors":"Haoran Zhang, Natalie Dullerud, Karsten Roth, Lauren Oakden-Rayner, S. Pfohl, M. Ghassemi","doi":"10.48550/arXiv.2203.12609","DOIUrl":"https://doi.org/10.48550/arXiv.2203.12609","url":null,"abstract":"Deep learning models have reached or surpassed human-level performance in the field of medical imaging, especially in disease diagnosis using chest x-rays. However, prior work has found that such classifiers can exhibit biases in the form of gaps in predictive performance across protected groups. In this paper, we question whether striving to achieve zero disparities in predictive performance (i.e. group fairness) is the appropriate fairness definition in the clinical setting, over minimax fairness, which focuses on maximizing the performance of the worst-case group. We benchmark the performance of nine methods in improving classifier fairness across these two definitions. We find, consistent with prior work on non-clinical data, that methods which strive to achieve better worst-group performance do not outperform simple data balancing. We also find that methods which achieve group fairness do so by worsening performance for all groups. In light of these results, we discuss the utility of fairness definitions in the clinical setting, advocating for an investigation of the bias-inducing mechanisms in the underlying data generating process whenever possible.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91037689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
PhysioMTL: Personalizing Physiological Patterns using Optimal Transport Multi-Task Regression PhysioMTL:使用最优传输多任务回归个性化生理模式
Pub Date : 2022-03-19 DOI: 10.48550/arXiv.2203.12595
Jiacheng Zhu, Gregory Darnell, Agni Kumar, Ding Zhao, Bo Li, X. Nguyen, Shirley You Ren
Heart rate variability (HRV) is a practical and noninvasive measure of autonomic nervous system activity, which plays an essential role in cardiovascular health. However, using HRV to assess physiology status is challenging. Even in clinical settings, HRV is sensitive to acute stressors such as physical activity, mental stress, hydration, alcohol, and sleep. Wearable devices provide convenient HRV measurements, but the irregularity of measurements and uncaptured stressors can bias conventional analytical methods. To better interpret HRV measurements for downstream healthcare applications, we learn a personalized diurnal rhythm as an accurate physiological indicator for each individual. We develop Physiological Multitask-Learning (PhysioMTL) by harnessing Optimal Transport theory within a Multitask-learning (MTL) framework. The proposed method learns an individual-specific predictive model from heterogeneous observations, and enables estimation of an optimal transport map that yields a push forward operation onto the demographic features for each task. Our model outperforms competing MTL methodologies on unobserved predictive tasks for synthetic and two real-world datasets. Specifically, our method provides remarkable prediction results on unseen held-out subjects given only $20%$ of the subjects in real-world observational studies. Furthermore, our model enables a counterfactual engine that generates the effect of acute stressors and chronic conditions on HRV rhythms.
心率变异性(HRV)是一种实用且无创的自主神经系统活动测量方法,在心血管健康中起着至关重要的作用。然而,使用心率变异来评估生理状态是具有挑战性的。即使在临床环境中,HRV对急性应激源(如身体活动、精神压力、水合作用、酒精和睡眠)也很敏感。可穿戴设备提供了方便的HRV测量,但测量的不规则性和未捕获的应力源可能会影响传统的分析方法。为了更好地解释下游医疗保健应用的HRV测量,我们学习个性化的昼夜节律作为每个人的准确生理指标。我们通过在多任务学习(MTL)框架内利用最优传输理论开发了生理多任务学习(PhysioMTL)。所提出的方法从异构观测中学习个体特定的预测模型,并能够估计出最优的运输图,从而产生针对每个任务的人口特征的推进操作。我们的模型在合成和两个真实世界数据集的未观察到的预测任务上优于竞争的MTL方法。具体来说,我们的方法对现实世界观察研究中仅占20%的未见过的被试提供了显著的预测结果。此外,我们的模型实现了一个反事实引擎,该引擎产生急性压力源和慢性条件对HRV节律的影响。
{"title":"PhysioMTL: Personalizing Physiological Patterns using Optimal Transport Multi-Task Regression","authors":"Jiacheng Zhu, Gregory Darnell, Agni Kumar, Ding Zhao, Bo Li, X. Nguyen, Shirley You Ren","doi":"10.48550/arXiv.2203.12595","DOIUrl":"https://doi.org/10.48550/arXiv.2203.12595","url":null,"abstract":"Heart rate variability (HRV) is a practical and noninvasive measure of autonomic nervous system activity, which plays an essential role in cardiovascular health. However, using HRV to assess physiology status is challenging. Even in clinical settings, HRV is sensitive to acute stressors such as physical activity, mental stress, hydration, alcohol, and sleep. Wearable devices provide convenient HRV measurements, but the irregularity of measurements and uncaptured stressors can bias conventional analytical methods. To better interpret HRV measurements for downstream healthcare applications, we learn a personalized diurnal rhythm as an accurate physiological indicator for each individual. We develop Physiological Multitask-Learning (PhysioMTL) by harnessing Optimal Transport theory within a Multitask-learning (MTL) framework. The proposed method learns an individual-specific predictive model from heterogeneous observations, and enables estimation of an optimal transport map that yields a push forward operation onto the demographic features for each task. Our model outperforms competing MTL methodologies on unobserved predictive tasks for synthetic and two real-world datasets. Specifically, our method provides remarkable prediction results on unseen held-out subjects given only $20%$ of the subjects in real-world observational studies. Furthermore, our model enables a counterfactual engine that generates the effect of acute stressors and chronic conditions on HRV rhythms.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72861703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Graph-Text Multi-Modal Pre-training for Medical Representation Learning 医学表征学习的图文多模态预训练
Pub Date : 2022-03-18 DOI: 10.48550/arXiv.2203.09994
Sungjin Park, Seongsu Bae, Jiho Kim, Tackeun Kim, E. Choi
As the volume of Electronic Health Records (EHR) sharply grows, there has been emerging interest in learning the representation of EHR for healthcare applications. Representation learning of EHR requires appropriate modeling of the two dominant modalities in EHR: structured data and unstructured text. In this paper, we present MedGTX, a pre-trained model for multi-modal representation learning of the structured and textual EHR data. MedGTX uses a novel graph encoder to exploit the graphical nature of structured EHR data, and a text encoder to handle unstructured text, and a cross-modal encoder to learn a joint representation space. We pre-train our model through four proxy tasks on MIMIC-III, an open-source EHR data, and evaluate our model on two clinical benchmarks and three novel downstream tasks which tackle real-world problems in EHR data. The results consistently show the effectiveness of pre-training the model for joint representation of both structured and unstructured information from EHR. Given the promising performance of MedGTX, we believe this work opens a new door to jointly understanding the two fundamental modalities of EHR data.
随着电子健康记录(EHR)数量的急剧增长,人们对学习医疗保健应用中EHR的表示越来越感兴趣。电子病历的表示学习需要对电子病历中的两种主要模式:结构化数据和非结构化文本进行适当的建模。在本文中,我们提出MedGTX,一个用于结构化和文本电子病历数据的多模态表示学习的预训练模型。MedGTX使用一种新颖的图形编码器来利用结构化电子病历数据的图形特性,使用文本编码器来处理非结构化文本,使用跨模态编码器来学习联合表示空间。我们通过在开源电子病历数据MIMIC-III上的四个代理任务对模型进行预训练,并在两个临床基准和三个解决电子病历数据中现实问题的新下游任务上评估我们的模型。结果一致表明预训练模型对电子病历中结构化和非结构化信息的联合表示是有效的。鉴于MedGTX的良好表现,我们相信这项工作为共同理解电子病历数据的两种基本模式打开了一扇新的大门。
{"title":"Graph-Text Multi-Modal Pre-training for Medical Representation Learning","authors":"Sungjin Park, Seongsu Bae, Jiho Kim, Tackeun Kim, E. Choi","doi":"10.48550/arXiv.2203.09994","DOIUrl":"https://doi.org/10.48550/arXiv.2203.09994","url":null,"abstract":"As the volume of Electronic Health Records (EHR) sharply grows, there has been emerging interest in learning the representation of EHR for healthcare applications. Representation learning of EHR requires appropriate modeling of the two dominant modalities in EHR: structured data and unstructured text. In this paper, we present MedGTX, a pre-trained model for multi-modal representation learning of the structured and textual EHR data. MedGTX uses a novel graph encoder to exploit the graphical nature of structured EHR data, and a text encoder to handle unstructured text, and a cross-modal encoder to learn a joint representation space. We pre-train our model through four proxy tasks on MIMIC-III, an open-source EHR data, and evaluate our model on two clinical benchmarks and three novel downstream tasks which tackle real-world problems in EHR data. The results consistently show the effectiveness of pre-training the model for joint representation of both structured and unstructured information from EHR. Given the promising performance of MedGTX, we believe this work opens a new door to jointly understanding the two fundamental modalities of EHR data.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81706884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Uncertainty-Aware Text-to-Program for Question Answering on Structured Electronic Health Records 结构化电子健康记录问题回答的不确定性感知文本到程序
Pub Date : 2022-03-14 DOI: 10.48550/arXiv.2203.06918
Daeyoung Kim, Seongsu Bae, S. Kim, E. Choi
Question Answering on Electronic Health Records (EHR-QA) has a significant impact on the healthcare domain, and it is being actively studied. Previous research on structured EHR-QA focuses on converting natural language queries into query language such as SQL or SPARQL (NLQ2Query), so the problem scope is limited to pre-defined data types by the specific query language. In order to expand the EHR-QA task beyond this limitation to handle multi-modal medical data and solve complex inference in the future, more primitive systemic language is needed. In this paper, we design the program-based model (NLQ2Program) for EHR-QA as the first step towards the future direction. We tackle MIMICSPARQL*, the graph-based EHR-QA dataset, via a program-based approach in a semi-supervised manner in order to overcome the absence of gold programs. Without the gold program, our proposed model shows comparable performance to the previous state-of-the-art model, which is an NLQ2Query model (0.9% gain). In addition, for a reliable EHR-QA model, we apply the uncertainty decomposition method to measure the ambiguity in the input question. We empirically confirmed data uncertainty is most indicative of the ambiguity in the input question.
电子健康记录问答(EHR-QA)对医疗保健领域产生了重大影响,人们正在积极研究它。以往对结构化EHR-QA的研究侧重于将自然语言查询转换为查询语言,如SQL或SPARQL (NLQ2Query),因此问题范围被特定查询语言限制在预定义的数据类型中。为了在未来将EHR-QA任务扩展到处理多模态医疗数据和解决复杂推理,需要更原始的系统语言。在本文中,我们设计了基于程序的EHR-QA模型(NLQ2Program),作为迈向未来方向的第一步。为了克服黄金程序的缺失,我们以半监督的方式,通过基于程序的方法处理基于图形的EHR-QA数据集MIMICSPARQL*。在没有金牌程序的情况下,我们提出的模型显示出与之前最先进的模型相当的性能,该模型是一个NLQ2Query模型(增益0.9%)。此外,为了建立可靠的EHR-QA模型,我们采用不确定性分解方法来度量输入问题中的模糊度。我们的经验证实,数据的不确定性是输入问题的模糊性的最指示性。
{"title":"Uncertainty-Aware Text-to-Program for Question Answering on Structured Electronic Health Records","authors":"Daeyoung Kim, Seongsu Bae, S. Kim, E. Choi","doi":"10.48550/arXiv.2203.06918","DOIUrl":"https://doi.org/10.48550/arXiv.2203.06918","url":null,"abstract":"Question Answering on Electronic Health Records (EHR-QA) has a significant impact on the healthcare domain, and it is being actively studied. Previous research on structured EHR-QA focuses on converting natural language queries into query language such as SQL or SPARQL (NLQ2Query), so the problem scope is limited to pre-defined data types by the specific query language. In order to expand the EHR-QA task beyond this limitation to handle multi-modal medical data and solve complex inference in the future, more primitive systemic language is needed. In this paper, we design the program-based model (NLQ2Program) for EHR-QA as the first step towards the future direction. We tackle MIMICSPARQL*, the graph-based EHR-QA dataset, via a program-based approach in a semi-supervised manner in order to overcome the absence of gold programs. Without the gold program, our proposed model shows comparable performance to the previous state-of-the-art model, which is an NLQ2Query model (0.9% gain). In addition, for a reliable EHR-QA model, we apply the uncertainty decomposition method to measure the ambiguity in the input question. We empirically confirmed data uncertainty is most indicative of the ambiguity in the input question.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88370855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Affinitention nets: kernel perspective on attention architectures for set classification with applications to medical text and images 关联网络:集中分类关注架构的核视角与医学文本和图像的应用
Pub Date : 2021-04-08 DOI: 10.1145/3450439.3451856
D. Dov, Serge Assaad, Shijing Si, Rui Wang, Hongteng Xu, S. Kovalsky, Jonathan Bell, D. Range, Jonathan Cohen, Ricardo Henao, L. Carin
Set classification is the task of predicting a single label from a set comprising multiple instances. The examples we consider are pathology slides represented by sets of patches and medical text data represented by sets of word embeddings. State-of-the-art methods, such as the transformer network, typically use attention mechanisms to learn representations of set data, by modeling interactions between instances of the set. These methods, however, have complex heuristic architectures comprising multiple heads and layers. The complexity of attention architectures hampers their training when only a small number of labeled sets is available, as is often the case in medical applications. To address this problem, we present a kernel-based representation learning framework that links learning affinity kernels to learning representations from attention architectures. We show that learning a combination of the sum and the product of kernels is equivalent to learning representations from multi-head multi-layer attention architectures. From our framework, we devise a simplified attention architecture which we term affinitention (affinity-attention) nets. We demonstrate the application of affinitention nets to the classification of the Set-Cifar10 dataset, thyroid malignancy prediction from pathology slides, as well as patient text-message triage. We show that affinitention nets provide competitive results compared to heuristic attention architectures and outperform other competing methods.
集合分类是从包含多个实例的集合中预测单个标签的任务。我们考虑的例子是由一组补丁表示的病理切片和由一组词嵌入表示的医学文本数据。最先进的方法,如变压器网络,通常使用注意机制来学习集合数据的表示,通过建模集合实例之间的交互。然而,这些方法具有复杂的启发式架构,包括多个头部和层。当只有少量标记集可用时,注意力架构的复杂性阻碍了它们的训练,这在医疗应用中经常出现。为了解决这个问题,我们提出了一个基于核的表示学习框架,该框架将学习亲和核与来自注意架构的学习表示联系起来。我们表明,学习核的和和乘积的组合相当于学习来自多头多层注意力体系结构的表示。从我们的框架中,我们设计了一个简化的注意力架构,我们称之为亲和力(亲和力-注意力)网络。我们演示了亲和性网络在Set-Cifar10数据集分类、病理切片甲状腺恶性预测以及患者短信分类中的应用。我们表明,与启发式注意力架构相比,亲和性网络提供了竞争结果,并且优于其他竞争方法。
{"title":"Affinitention nets: kernel perspective on attention architectures for set classification with applications to medical text and images","authors":"D. Dov, Serge Assaad, Shijing Si, Rui Wang, Hongteng Xu, S. Kovalsky, Jonathan Bell, D. Range, Jonathan Cohen, Ricardo Henao, L. Carin","doi":"10.1145/3450439.3451856","DOIUrl":"https://doi.org/10.1145/3450439.3451856","url":null,"abstract":"Set classification is the task of predicting a single label from a set comprising multiple instances. The examples we consider are pathology slides represented by sets of patches and medical text data represented by sets of word embeddings. State-of-the-art methods, such as the transformer network, typically use attention mechanisms to learn representations of set data, by modeling interactions between instances of the set. These methods, however, have complex heuristic architectures comprising multiple heads and layers. The complexity of attention architectures hampers their training when only a small number of labeled sets is available, as is often the case in medical applications. To address this problem, we present a kernel-based representation learning framework that links learning affinity kernels to learning representations from attention architectures. We show that learning a combination of the sum and the product of kernels is equivalent to learning representations from multi-head multi-layer attention architectures. From our framework, we devise a simplified attention architecture which we term affinitention (affinity-attention) nets. We demonstrate the application of affinitention nets to the classification of the Set-Cifar10 dataset, thyroid malignancy prediction from pathology slides, as well as patient text-message triage. We show that affinitention nets provide competitive results compared to heuristic attention architectures and outperform other competing methods.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76963313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predictive models for colorectal cancer recurrence using multi-modal healthcare data 基于多模式医疗数据的结直肠癌复发预测模型
Pub Date : 2021-04-08 DOI: 10.1145/3450439.3451868
D. Ho, I. Tan, M. Motani
Colorectal cancer recurrence is a major clinical problem - around 30-40% of patients who are treated with curative intent surgery will experience cancer relapse. Proactive prognostication is critical for early detection and treatment of recurrence. However, the common clinical approach to monitoring recurrence through testing for carcinoembryonic antigen (CEA) does not possess a strong prognostic performance. In our paper, we study a series of machine and deep learning architectures that exploit heterogeneous healthcare data to predict colorectal cancer recurrence. In particular, we demonstrate three different approaches to extract and integrate features from multiple modalities including longitudinal as well as tabular clinical data. Our best model employs a hybrid architecture that takes in multi-modal inputs and comprises: 1) a Transformer model carefully modified to extract high-quality features from time-series data, and 2) a Multi-Layered Perceptron (MLP) that learns tabular data features, followed by feature integration and classification for prediction of recurrence. It achieves an AUROC score of 0.95, as well as precision, sensitivity and specificity scores of 0.83, 0.80 and 0.96 respectively, surpassing the performance of all-known published results based on CEA, as well as most commercially available diagnostic assays. Our results could lead to better post-operative management and follow-up of colorectal cancer patients.
结直肠癌复发是一个主要的临床问题——大约30-40%接受治疗目的手术治疗的患者会经历癌症复发。积极预测是早期发现和治疗复发的关键。然而,通过检测癌胚抗原(CEA)来监测复发的常见临床方法并没有很强的预后表现。在我们的论文中,我们研究了一系列利用异构医疗数据来预测结直肠癌复发的机器和深度学习架构。特别是,我们展示了三种不同的方法来提取和整合多种模式的特征,包括纵向和表格临床数据。我们最好的模型采用了一种混合架构,它接受多模态输入,包括:1)一个经过仔细修改的Transformer模型,从时间序列数据中提取高质量的特征,以及2)一个多层感知器(MLP),它学习表格数据特征,然后进行特征集成和分类,以预测递归。该方法的AUROC评分为0.95,精密度、灵敏度和特异性评分分别为0.83、0.80和0.96,超过了所有已知的基于CEA的已发表结果,以及大多数市售的诊断分析方法。我们的研究结果可以为结直肠癌患者提供更好的术后管理和随访。
{"title":"Predictive models for colorectal cancer recurrence using multi-modal healthcare data","authors":"D. Ho, I. Tan, M. Motani","doi":"10.1145/3450439.3451868","DOIUrl":"https://doi.org/10.1145/3450439.3451868","url":null,"abstract":"Colorectal cancer recurrence is a major clinical problem - around 30-40% of patients who are treated with curative intent surgery will experience cancer relapse. Proactive prognostication is critical for early detection and treatment of recurrence. However, the common clinical approach to monitoring recurrence through testing for carcinoembryonic antigen (CEA) does not possess a strong prognostic performance. In our paper, we study a series of machine and deep learning architectures that exploit heterogeneous healthcare data to predict colorectal cancer recurrence. In particular, we demonstrate three different approaches to extract and integrate features from multiple modalities including longitudinal as well as tabular clinical data. Our best model employs a hybrid architecture that takes in multi-modal inputs and comprises: 1) a Transformer model carefully modified to extract high-quality features from time-series data, and 2) a Multi-Layered Perceptron (MLP) that learns tabular data features, followed by feature integration and classification for prediction of recurrence. It achieves an AUROC score of 0.95, as well as precision, sensitivity and specificity scores of 0.83, 0.80 and 0.96 respectively, surpassing the performance of all-known published results based on CEA, as well as most commercially available diagnostic assays. Our results could lead to better post-operative management and follow-up of colorectal cancer patients.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85405378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
T-DPSOM: an interpretable clustering method for unsupervised learning of patient health states T-DPSOM:一种用于患者健康状态无监督学习的可解释聚类方法
Pub Date : 2021-04-08 DOI: 10.1145/3450439.3451872
Laura Manduchi, Matthias Hüser, M. Faltys, Julia E. Vogt, G. Rätsch, Vincent Fortuin
Generating interpretable visualizations of multivariate time series in the intensive care unit is of great practical importance. Clinicians seek to condense complex clinical observations into intuitively understandable critical illness patterns, like failures of different organ systems. They would greatly benefit from a low-dimensional representation in which the trajectories of the patients' pathology become apparent and relevant health features are highlighted. To this end, we propose to use the latent topological structure of Self-Organizing Maps (SOMs) to achieve an interpretable latent representation of ICU time series and combine it with recent advances in deep clustering. Specifically, we (a) present a novel way to fit SOMs with probabilistic cluster assignments (PSOM), (b) propose a new deep architecture for probabilistic clustering (DPSOM) using a VAE, and (c) extend our architecture to cluster and forecast clinical states in time series (T-DPSOM). We show that our model achieves superior clustering performance compared to state-of-the-art SOM-based clustering methods while maintaining the favorable visualization properties of SOMs. On the eICU data-set, we demonstrate that T-DPSOM provides interpretable visualizations of patient state trajectories and uncertainty estimation. We show that our method rediscovers well-known clinical patient characteristics, such as a dynamic variant of the Acute Physiology And Chronic Health Evaluation (APACHE) score. Moreover, we illustrate how it can disentangle individual organ dysfunctions on disjoint regions of the two-dimensional SOM map.
在重症监护病房中生成可解释的多变量时间序列可视化具有重要的实际意义。临床医生试图将复杂的临床观察结果浓缩成直观易懂的重症模式,比如不同器官系统的衰竭。他们将极大地受益于低维表示,其中患者的病理轨迹变得明显,相关的健康特征被突出显示。为此,我们建议使用自组织映射(SOMs)的潜在拓扑结构来实现ICU时间序列的可解释潜在表示,并将其与深度聚类的最新进展相结合。具体来说,我们(a)提出了一种用概率聚类分配(PSOM)拟合SOMs的新方法,(b)使用VAE提出了一种新的概率聚类(DPSOM)深度架构,(c)将我们的架构扩展到时间序列的聚类和预测临床状态(T-DPSOM)。我们表明,与最先进的基于som的聚类方法相比,我们的模型实现了优越的聚类性能,同时保持了som的良好可视化特性。在eICU数据集上,我们证明T-DPSOM提供了患者状态轨迹和不确定性估计的可解释可视化。我们表明,我们的方法重新发现了众所周知的临床患者特征,例如急性生理和慢性健康评估(APACHE)评分的动态变体。此外,我们说明了它如何在二维SOM图的不相交区域上解开单个器官功能障碍。
{"title":"T-DPSOM: an interpretable clustering method for unsupervised learning of patient health states","authors":"Laura Manduchi, Matthias Hüser, M. Faltys, Julia E. Vogt, G. Rätsch, Vincent Fortuin","doi":"10.1145/3450439.3451872","DOIUrl":"https://doi.org/10.1145/3450439.3451872","url":null,"abstract":"Generating interpretable visualizations of multivariate time series in the intensive care unit is of great practical importance. Clinicians seek to condense complex clinical observations into intuitively understandable critical illness patterns, like failures of different organ systems. They would greatly benefit from a low-dimensional representation in which the trajectories of the patients' pathology become apparent and relevant health features are highlighted. To this end, we propose to use the latent topological structure of Self-Organizing Maps (SOMs) to achieve an interpretable latent representation of ICU time series and combine it with recent advances in deep clustering. Specifically, we (a) present a novel way to fit SOMs with probabilistic cluster assignments (PSOM), (b) propose a new deep architecture for probabilistic clustering (DPSOM) using a VAE, and (c) extend our architecture to cluster and forecast clinical states in time series (T-DPSOM). We show that our model achieves superior clustering performance compared to state-of-the-art SOM-based clustering methods while maintaining the favorable visualization properties of SOMs. On the eICU data-set, we demonstrate that T-DPSOM provides interpretable visualizations of patient state trajectories and uncertainty estimation. We show that our method rediscovers well-known clinical patient characteristics, such as a dynamic variant of the Acute Physiology And Chronic Health Evaluation (APACHE) score. Moreover, we illustrate how it can disentangle individual organ dysfunctions on disjoint regions of the two-dimensional SOM map.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82808989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A comprehensive EHR timeseries pre-training benchmark 一个全面的EHR时间序列预训练基准
Pub Date : 2021-04-08 DOI: 10.1145/3450439.3451877
Matthew B. A. McDermott, Bret A. Nestor, Evan Kim, Wancong Zhang, A. Goldenberg, Peter Szolovits, M. Ghassemi
Pre-training (PT) has been used successfully in many areas of machine learning. One area where PT would be extremely impactful is over electronic health record (EHR) data. Successful PT strategies on this modality could improve model performance in data-scarce contexts such as modeling for rare diseases or allowing smaller hospitals to benefit from data from larger health systems. While many PT strategies have been explored in other domains, much less exploration has occurred for EHR data. One reason for this may be the lack of standardized benchmarks suitable for developing and testing PT algorithms. In this work, we establish a PT benchmark dataset for EHR timeseries data, establishing cohorts, a diverse set of fine-tuning tasks, and PT-focused evaluation regimes across two public EHR datasets: MIMIC-III and eICU. This benchmark fills an essential hole in the field by enabling a robust manner of iterating on PT strategies for this modality. To show the value of this benchmark and provide baselines for further research, we also profile two simple PT algorithms: a self-supervised, masked imputation system and a weakly-supervised, multi-task system. We find that PT strategies (in particular weakly-supervised PT methods) can offer significant gains over traditional learning in few-shot settings, especially on tasks with strong class imbalance. Our full benchmark and code are publicly available at https://github.com/mmcdermott/comprehensive_MTL_EHR
预训练(PT)已经成功地应用于机器学习的许多领域。PT极具影响力的一个领域是电子健康记录(EHR)数据。基于这种模式的成功PT战略可以改善数据稀缺环境下的模型性能,例如为罕见疾病建模或允许小型医院从大型卫生系统的数据中受益。虽然许多PT策略已经在其他领域进行了探索,但对电子病历数据的探索要少得多。造成这种情况的一个原因可能是缺乏适合开发和测试PT算法的标准化基准。在这项工作中,我们为EHR时间序列数据建立了一个PT基准数据集,建立了队列,多种微调任务,并在两个公共EHR数据集(MIMIC-III和eICU)中建立了以PT为重点的评估机制。该基准通过支持对这种模式的PT策略进行迭代的健壮方式,填补了该领域的一个重要空白。为了展示该基准的价值并为进一步研究提供基准,我们还介绍了两种简单的PT算法:自监督、掩码输入系统和弱监督、多任务系统。我们发现PT策略(特别是弱监督PT方法)可以在少数镜头设置中提供比传统学习显著的收益,特别是在具有强烈类别不平衡的任务中。我们完整的基准测试和代码可以在https://github.com/mmcdermott/comprehensive_MTL_EHR上公开获得
{"title":"A comprehensive EHR timeseries pre-training benchmark","authors":"Matthew B. A. McDermott, Bret A. Nestor, Evan Kim, Wancong Zhang, A. Goldenberg, Peter Szolovits, M. Ghassemi","doi":"10.1145/3450439.3451877","DOIUrl":"https://doi.org/10.1145/3450439.3451877","url":null,"abstract":"Pre-training (PT) has been used successfully in many areas of machine learning. One area where PT would be extremely impactful is over electronic health record (EHR) data. Successful PT strategies on this modality could improve model performance in data-scarce contexts such as modeling for rare diseases or allowing smaller hospitals to benefit from data from larger health systems. While many PT strategies have been explored in other domains, much less exploration has occurred for EHR data. One reason for this may be the lack of standardized benchmarks suitable for developing and testing PT algorithms. In this work, we establish a PT benchmark dataset for EHR timeseries data, establishing cohorts, a diverse set of fine-tuning tasks, and PT-focused evaluation regimes across two public EHR datasets: MIMIC-III and eICU. This benchmark fills an essential hole in the field by enabling a robust manner of iterating on PT strategies for this modality. To show the value of this benchmark and provide baselines for further research, we also profile two simple PT algorithms: a self-supervised, masked imputation system and a weakly-supervised, multi-task system. We find that PT strategies (in particular weakly-supervised PT methods) can offer significant gains over traditional learning in few-shot settings, especially on tasks with strong class imbalance. Our full benchmark and code are publicly available at https://github.com/mmcdermott/comprehensive_MTL_EHR","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79435894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Enabling counterfactual survival analysis with balanced representations 通过平衡的表示实现反事实生存分析
Pub Date : 2021-04-08 DOI: 10.1145/3450439.3451875
Paidamoyo Chapfuwa, Serge Assaad, Shuxi Zeng, M. Pencina, L. Carin, Ricardo Henao
Balanced representation learning methods have been applied successfully to counterfactual inference from observational data. However, approaches that account for survival outcomes are relatively limited. Survival data are frequently encountered across diverse medical applications, i.e., drug development, risk profiling, and clinical trials, and such data are also relevant in fields like manufacturing (e.g., for equipment monitoring). When the outcome of interest is a time-to-event, special precautions for handling censored events need to be taken, as ignoring censored outcomes may lead to biased estimates. We propose a theoretically grounded unified framework for counterfactual inference applicable to survival outcomes. Further, we formulate a nonparametric hazard ratio metric for evaluating average and individualized treatment effects. Experimental results on real-world and semi-synthetic datasets, the latter of which we introduce, demonstrate that the proposed approach significantly outperforms competitive alternatives in both survival-outcome prediction and treatment-effect estimation.
平衡表征学习方法已经成功地应用于从观测数据中进行反事实推理。然而,能够解释生存结果的方法相对有限。生存数据经常在各种医疗应用中遇到,即药物开发、风险分析和临床试验,这些数据也与制造等领域相关(例如,用于设备监测)。当感兴趣的结果是到事件的时间时,需要采取特殊的预防措施来处理经过审查的事件,因为忽略经过审查的结果可能导致有偏差的估计。我们提出了一个适用于生存结果的反事实推理的理论基础统一框架。此外,我们制定了一个非参数风险比度量来评估平均和个性化治疗效果。在真实世界和半合成数据集上的实验结果表明,我们提出的方法在生存结果预测和治疗效果估计方面都明显优于其他竞争方法。
{"title":"Enabling counterfactual survival analysis with balanced representations","authors":"Paidamoyo Chapfuwa, Serge Assaad, Shuxi Zeng, M. Pencina, L. Carin, Ricardo Henao","doi":"10.1145/3450439.3451875","DOIUrl":"https://doi.org/10.1145/3450439.3451875","url":null,"abstract":"Balanced representation learning methods have been applied successfully to counterfactual inference from observational data. However, approaches that account for survival outcomes are relatively limited. Survival data are frequently encountered across diverse medical applications, i.e., drug development, risk profiling, and clinical trials, and such data are also relevant in fields like manufacturing (e.g., for equipment monitoring). When the outcome of interest is a time-to-event, special precautions for handling censored events need to be taken, as ignoring censored outcomes may lead to biased estimates. We propose a theoretically grounded unified framework for counterfactual inference applicable to survival outcomes. Further, we formulate a nonparametric hazard ratio metric for evaluating average and individualized treatment effects. Experimental results on real-world and semi-synthetic datasets, the latter of which we introduce, demonstrate that the proposed approach significantly outperforms competitive alternatives in both survival-outcome prediction and treatment-effect estimation.","PeriodicalId":87342,"journal":{"name":"Proceedings of the ACM Conference on Health, Inference, and Learning","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73537292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
Proceedings of the ACM Conference on Health, Inference, and Learning
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1