Proceedings. IEEE International Conference on Data Mining最新文献

Enhancing Personalized Healthcare via Capturing Disease Severity, Interaction, and Progression. 通过捕捉疾病严重程度、互动和进展情况，加强个性化医疗保健。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2023-12-01 Epub Date: 2024-02-05 DOI: 10.1109/icdm58522.2023.00173

Yanchao Tan, Zihao Zhou, Leisheng Yu, Weiming Liu, Chaochao Chen, Guofang Ma, Xiao Hu, Vicki S Hertzberg, Carl Yang

Personalized diagnosis prediction based on electronic health records (EHR) of patients is a promising yet challenging task for AI in healthcare. Existing studies typically ignore the heterogeneity of diseases across different patients. For example, diabetes can have different complications across different patients (e.g., hyperlipidemia and circulatory disorder), which requires personalized diagnoses and treatments. Specifically, existing models fail to consider 1) varying severity of the same diseases for different patients, 2) complex interactions among syndromic diseases, and 3) dynamic progression of chronic diseases. In this work, we propose to perform personalized diagnosis prediction based on EHR data via capturing disease severity, interaction, and progression. In particular, we enable personalized disease representations via severity-driven embeddings at the disease level. Then, at the visit level, we propose to capture higher-order interactions among diseases that can collectively affect patients' health status via hypergraph-based aggregation; at the patient level, we devise a personalized generative model based on neural ordinary differential equations to capture the continuous-time disease progressions underlying discrete and incomplete visits. Extensive experiments on two real-world EHR datasets show significant performance gains brought by our approach, yielding average improvements of 10.70% for diagnosis prediction over state-of-the-art competitors.

基于患者电子健康记录（EHR）的个性化诊断预测是人工智能在医疗保健领域一项前景广阔但又充满挑战的任务。现有的研究通常忽略了不同患者疾病的异质性。例如，糖尿病在不同患者身上会产生不同的并发症（如高脂血症和循环障碍），这就需要个性化的诊断和治疗。具体来说，现有模型未能考虑到：1）不同患者同种疾病的严重程度不同；2）综合征疾病之间复杂的相互作用；3）慢性疾病的动态发展。在这项工作中，我们建议通过捕捉疾病的严重程度、相互作用和进展情况，在电子病历数据的基础上进行个性化诊断预测。特别是，我们在疾病层面通过严重性驱动的嵌入实现了个性化疾病表征。然后，在就诊层面，我们建议通过基于超图的聚合来捕捉疾病间的高阶交互，从而共同影响患者的健康状况；在患者层面，我们设计了一个基于神经常微分方程的个性化生成模型，以捕捉离散和不完整就诊背后的连续时间疾病进展。在两个真实世界的电子病历数据集上进行的广泛实验表明，我们的方法带来了显著的性能提升，在诊断预测方面比最先进的竞争对手平均提高了 10.70%。

{"title":"Enhancing Personalized Healthcare via Capturing Disease Severity, Interaction, and Progression.","authors":"Yanchao Tan, Zihao Zhou, Leisheng Yu, Weiming Liu, Chaochao Chen, Guofang Ma, Xiao Hu, Vicki S Hertzberg, Carl Yang","doi":"10.1109/icdm58522.2023.00173","DOIUrl":"10.1109/icdm58522.2023.00173","url":null,"abstract":"Personalized diagnosis prediction based on electronic health records (EHR) of patients is a promising yet challenging task for AI in healthcare. Existing studies typically ignore the heterogeneity of diseases across different patients. For example, diabetes can have different complications across different patients (e.g., hyperlipidemia and circulatory disorder), which requires personalized diagnoses and treatments. Specifically, existing models fail to consider 1) varying severity of the same diseases for different patients, 2) complex interactions among syndromic diseases, and 3) dynamic progression of chronic diseases. In this work, we propose to perform personalized diagnosis prediction based on EHR data via capturing disease severity, interaction, and progression. In particular, we enable personalized disease representations via severity-driven embeddings at the disease level. Then, at the visit level, we propose to capture higher-order interactions among diseases that can collectively affect patients' health status via hypergraph-based aggregation; at the patient level, we devise a personalized generative model based on neural ordinary differential equations to capture the continuous-time disease progressions underlying discrete and incomplete visits. Extensive experiments on two real-world EHR datasets show significant performance gains brought by our approach, yielding average improvements of 10.70% for diagnosis prediction over state-of-the-art competitors.","PeriodicalId":74565,"journal":{"name":"Proceedings. IEEE International Conference on Data Mining","volume":"2023 ","pages":"1349-1354"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10868667/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139742807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Heterogeneous Treatment Effect Estimation with Subpopulation Identification for Personalized Medicine in Opioid Use Disorder. 针对阿片类药物使用障碍的个性化医疗，通过亚人群识别进行异质性治疗效果估算。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2023-12-01 Epub Date: 2024-02-05 DOI: 10.1109/icdm58522.2023.00127

Seungyeon Lee, Ruoqi Liu, Wenyu Song, Ping Zhang

Deep learning models have demonstrated promising results in estimating treatment effects (TEE). However, most of them overlook the variations in treatment outcomes among subgroups with distinct characteristics. This limitation hinders their ability to provide accurate estimations and treatment recommendations for specific subgroups. In this study, we introduce a novel neural network-based framework, named SubgroupTE, which incorporates subgroup identification and treatment effect estimation. SubgroupTE identifies diverse subgroups and simultaneously estimates treatment effects for each subgroup, improving the treatment effect estimation by considering the heterogeneity of treatment responses. Comparative experiments on synthetic data show that SubgroupTE outperforms existing models in treatment effect estimation. Furthermore, experiments on a real-world dataset related to opioid use disorder (OUD) demonstrate the potential of our approach to enhance personalized treatment recommendations for OUD patients.

深度学习模型在估计治疗效果（TEE）方面取得了可喜的成果。然而，它们大多忽略了具有不同特征的亚组之间治疗效果的差异。这一局限性阻碍了它们为特定亚组提供准确估计和治疗建议的能力。在本研究中，我们引入了一种基于神经网络的新型框架，名为 SubgroupTE，它将亚组识别和治疗效果估计结合在一起。SubgroupTE 可以识别不同的亚组，并同时估计每个亚组的治疗效果，通过考虑治疗反应的异质性来改进治疗效果估计。合成数据的对比实验表明，SubgroupTE 在治疗效果估计方面优于现有模型。此外，在与阿片类药物使用障碍（OUD）相关的真实世界数据集上进行的实验也证明了我们的方法在增强针对 OUD 患者的个性化治疗建议方面的潜力。

{"title":"Heterogeneous Treatment Effect Estimation with Subpopulation Identification for Personalized Medicine in Opioid Use Disorder.","authors":"Seungyeon Lee, Ruoqi Liu, Wenyu Song, Ping Zhang","doi":"10.1109/icdm58522.2023.00127","DOIUrl":"10.1109/icdm58522.2023.00127","url":null,"abstract":"Deep learning models have demonstrated promising results in estimating treatment effects (TEE). However, most of them overlook the variations in treatment outcomes among subgroups with distinct characteristics. This limitation hinders their ability to provide accurate estimations and treatment recommendations for specific subgroups. In this study, we introduce a novel neural network-based framework, named SubgroupTE, which incorporates subgroup identification and treatment effect estimation. SubgroupTE identifies diverse subgroups and simultaneously estimates treatment effects for each subgroup, improving the treatment effect estimation by considering the heterogeneity of treatment responses. Comparative experiments on synthetic data show that SubgroupTE outperforms existing models in treatment effect estimation. Furthermore, experiments on a real-world dataset related to opioid use disorder (OUD) demonstrate the potential of our approach to enhance personalized treatment recommendations for OUD patients.","PeriodicalId":74565,"journal":{"name":"Proceedings. IEEE International Conference on Data Mining","volume":"2023 ","pages":"1079-1084"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10883421/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139934592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging. RoS-KD：一种用于噪声医学成像的稳健随机知识提取方法。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2022-11-01 Epub Date: 2023-02-01 DOI: 10.1109/icdm54844.2022.00118

Ajay Jaiswal, Kumar Ashutosh, Justin F Rousseau, Yifan Peng, Zhangyang Wang, Ying Ding

AI-powered Medical Imaging has recently achieved enormous attention due to its ability to provide fast-paced healthcare diagnoses. However, it usually suffers from a lack of high-quality datasets due to high annotation cost, inter-observer variability, human annotator error, and errors in computer-generated labels. Deep learning models trained on noisy labelled datasets are sensitive to the noise type and lead to less generalization on the unseen samples. To address this challenge, we propose a Robust Stochastic Knowledge Distillation (RoS-KD) framework which mimics the notion of learning a topic from multiple sources to ensure deterrence in learning noisy information. More specifically, RoS-KD learns a smooth, well-informed, and robust student manifold by distilling knowledge from multiple teachers trained on overlapping subsets of training data. Our extensive experiments on popular medical imaging classification tasks (cardiopulmonary disease and lesion classification) using real-world datasets, show the performance benefit of RoS-KD, its ability to distill knowledge from many popular large networks (ResNet-50, DenseNet-121, MobileNet-V2) in a comparatively small network, and its robustness to adversarial attacks (PGD, FSGM). More specifically, RoS-KD achieves > 2% and > 4% improvement on F1-score for lesion classification and cardiopulmonary disease classification tasks, respectively, when the underlying student is ResNet-18 against recent competitive knowledge distillation baseline. Additionally, on cardiopulmonary disease classification task, RoS-KD outperforms most of the SOTA baselines by ~1% gain in AUC score.

人工智能医疗成像最近因其提供快节奏医疗诊断的能力而受到极大关注。然而，由于高注释成本、观察者之间的可变性、人工注释器错误和计算机生成标签中的错误，它通常缺乏高质量的数据集。在有噪声标记的数据集上训练的深度学习模型对噪声类型敏感，并且对看不见的样本的泛化能力较弱。为了应对这一挑战，我们提出了一个鲁棒随机知识提取（RoS-KD）框架，该框架模拟了从多个来源学习主题的概念，以确保在学习噪声信息时具有威慑力。更具体地说，RoS KD通过从接受过重叠训练数据子集训练的多名教师身上提取知识，学习到一个流畅、消息灵通、强健的学生歧管。我们使用真实世界的数据集对流行的医学成像分类任务（心肺疾病和病变分类）进行了广泛的实验，显示了RoS-KD的性能优势，它能够在相对较小的网络中从许多流行的大型网络（ResNet-50、DenseNet-121、MobileNet-V2）中提取知识，以及它对对抗性攻击（PGD、FSGM）的鲁棒性。更具体地说，当基础学生是ResNet-18时，与最近的竞争性知识提取基线相比，RoS KD在病变分类和心肺疾病分类任务的F1得分上分别提高了>2%和>4%。此外，在心肺疾病分类任务中，RoS-KD的AUC得分比大多数SOTA基线高出约1%。

{"title":"RoS-KD: A Robust Stochastic Knowledge Distillation Approach for Noisy Medical Imaging.","authors":"Ajay Jaiswal, Kumar Ashutosh, Justin F Rousseau, Yifan Peng, Zhangyang Wang, Ying Ding","doi":"10.1109/icdm54844.2022.00118","DOIUrl":"10.1109/icdm54844.2022.00118","url":null,"abstract":"AI-powered Medical Imaging has recently achieved enormous attention due to its ability to provide fast-paced healthcare diagnoses. However, it usually suffers from a lack of high-quality datasets due to high annotation cost, inter-observer variability, human annotator error, and errors in computer-generated labels. Deep learning models trained on noisy labelled datasets are sensitive to the noise type and lead to less generalization on the unseen samples. To address this challenge, we propose a Robust Stochastic Knowledge Distillation (RoS-KD) framework which mimics the notion of learning a topic from multiple sources to ensure deterrence in learning noisy information. More specifically, RoS-KD learns a smooth, well-informed, and robust student manifold by distilling knowledge from multiple teachers trained on overlapping subsets of training data. Our extensive experiments on popular medical imaging classification tasks (cardiopulmonary disease and lesion classification) using real-world datasets, show the performance benefit of RoS-KD, its ability to distill knowledge from many popular large networks (ResNet-50, DenseNet-121, MobileNet-V2) in a comparatively small network, and its robustness to adversarial attacks (PGD, FSGM). More specifically, RoS-KD achieves > 2% and > 4% improvement on F1-score for lesion classification and cardiopulmonary disease classification tasks, respectively, when the underlying student is ResNet-18 against recent competitive knowledge distillation baseline. Additionally, on cardiopulmonary disease classification task, RoS-KD outperforms most of the SOTA baselines by ~1% gain in AUC score.","PeriodicalId":74565,"journal":{"name":"Proceedings. IEEE International Conference on Data Mining","volume":"2022 ","pages":"981-986"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10082964/pdf/nihms-1888486.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9294932","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Robust Unsupervised Domain Adaptation from A Corrupted Source. 来自损坏源的鲁棒无监督域自适应。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2022-11-01 Epub Date: 2023-02-01 DOI: 10.1109/icdm54844.2022.00171

Shuyang Yu, Zhuangdi Zhu, Boyang Liu, Anil K Jain, Jiayu Zhou

Unsupervised Domain Adaptation (UDA) provides a promising solution for learning without supervision, which transfers knowledge from relevant source domains with accessible labeled training data. Existing UDA solutions hinge on clean training data with a short-tail distribution from the source domain, which can be fragile when the source domain data is corrupted either inherently or via adversarial attacks. In this work, we propose an effective framework to address the challenges of UDA from corrupted source domains in a principled manner. Specifically, we perform knowledge ensemble from multiple domain-invariant models that are learned on random partitions of training data. To further address the distribution shift from the source to the target domain, we refine each of the learned models via mutual information maximization, which adaptively obtains the predictive information of the target domain with high confidence. Extensive empirical studies demonstrate that the proposed approach is robust against various types of poisoned data attacks while achieving high asymptotic performance on the target domain.

无监督领域自适应（UDA）为无监督学习提供了一种很有前途的解决方案，它通过可访问的标记训练数据从相关源领域转移知识。现有的UDA解决方案依赖于来自源域的具有短尾分布的干净训练数据，当源域数据被固有地或通过对抗性攻击破坏时，这可能是脆弱的。在这项工作中，我们提出了一个有效的框架，以原则的方式应对来自损坏源域的UDA挑战。具体来说，我们从在训练数据的随机分区上学习的多个域不变模型中执行知识集成。为了进一步解决从源域到目标域的分布偏移，我们通过互信息最大化来细化每个学习的模型，该模型自适应地以高置信度获得目标域的预测信息。大量的实证研究表明，所提出的方法对各种类型的中毒数据攻击具有鲁棒性，同时在目标域上实现了较高的渐近性能。

{"title":"Robust Unsupervised Domain Adaptation from A Corrupted Source.","authors":"Shuyang Yu, Zhuangdi Zhu, Boyang Liu, Anil K Jain, Jiayu Zhou","doi":"10.1109/icdm54844.2022.00171","DOIUrl":"10.1109/icdm54844.2022.00171","url":null,"abstract":"Unsupervised Domain Adaptation (UDA) provides a promising solution for learning without supervision, which transfers knowledge from relevant source domains with accessible labeled training data. Existing UDA solutions hinge on clean training data with a short-tail distribution from the source domain, which can be fragile when the source domain data is corrupted either inherently or via adversarial attacks. In this work, we propose an effective framework to address the challenges of UDA from corrupted source domains in a principled manner. Specifically, we perform knowledge ensemble from multiple domain-invariant models that are learned on random partitions of training data. To further address the distribution shift from the source to the target domain, we refine each of the learned models via mutual information maximization, which adaptively obtains the predictive information of the target domain with high confidence. Extensive empirical studies demonstrate that the proposed approach is robust against various types of poisoned data attacks while achieving high asymptotic performance on the target domain.","PeriodicalId":74565,"journal":{"name":"Proceedings. IEEE International Conference on Data Mining","volume":"2022 ","pages":"1299-1304"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10097501/pdf/nihms-1888097.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9664945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Communication Efficient Tensor Factorization for Decentralized Healthcare Networks. 针对分散式医疗保健网络的通信高效张量因式分解。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2021-12-01 Epub Date: 2022-01-24 DOI: 10.1109/icdm51629.2021.00147

Jing Ma, Qiuchen Zhang, Jian Lou, Li Xiong, Sivasubramanium Bhavani, Joyce C Ho

Tensor factorization has been proved as an efficient unsupervised learning approach for health data analysis, especially for computational phenotyping, where the high-dimensional Electronic Health Records (EHRs) with patients history of medical procedures, medications, diagnosis, lab tests, etc., are converted to meaningful and interpretable medical concepts. Federated tensor factorization distributes the tensor computation to multiple workers under the coordination of a central server, which enables jointly learning the phenotypes across multiple hospitals while preserving the privacy of the patient information. However, existing federated tensor factorization algorithms encounter the single-point-failure issue with the involvement of the central server, which is not only easily exposed to external attacks, but also limits the number of clients sharing information with the server under restricted uplink bandwidth. In this paper, we propose CiderTF, a communication-efficient decentralized generalized tensor factorization, which reduces the uplink communication cost by leveraging a four-level communication reduction strategy designed for a generalized tensor factorization, which has the flexibility of modeling different tensor distribution with multiple kinds of loss functions. Experiments on two real-world EHR datasets demonstrate that CiderTF achieves comparable convergence with the communication reduction up to 99.99%.

张量因式分解已被证明是一种高效的无监督学习方法，可用于健康数据分析，特别是计算表型分析。在计算表型分析中，包含患者医疗程序、用药、诊断、化验等病史的高维电子健康记录（EHR）被转换为有意义且可解释的医学概念。联邦张量因子化技术在中央服务器的协调下，将张量计算分配给多个工作人员，从而实现了跨多家医院的表型联合学习，同时保护了患者信息的隐私。然而，现有的联合张量因式分解算法在中央服务器的参与下存在单点故障问题，不仅容易受到外部攻击，而且在上行带宽受限的情况下限制了与服务器共享信息的客户端数量。本文提出了一种通信效率高的去中心化广义张量因式分解（CiderTF），它利用为广义张量因式分解设计的四级通信缩减策略降低了上行链路通信成本，而广义张量因式分解可以灵活地对具有多种损失函数的不同张量分布进行建模。在两个真实世界的电子病历数据集上进行的实验证明，CiderTF 实现了相当的收敛性，通信降低率高达 99.99%。

{"title":"Communication Efficient Tensor Factorization for Decentralized Healthcare Networks.","authors":"Jing Ma, Qiuchen Zhang, Jian Lou, Li Xiong, Sivasubramanium Bhavani, Joyce C Ho","doi":"10.1109/icdm51629.2021.00147","DOIUrl":"10.1109/icdm51629.2021.00147","url":null,"abstract":"Tensor factorization has been proved as an efficient unsupervised learning approach for health data analysis, especially for computational phenotyping, where the high-dimensional Electronic Health Records (EHRs) with patients history of medical procedures, medications, diagnosis, lab tests, etc., are converted to meaningful and interpretable medical concepts. Federated tensor factorization distributes the tensor computation to multiple workers under the coordination of a central server, which enables jointly learning the phenotypes across multiple hospitals while preserving the privacy of the patient information. However, existing federated tensor factorization algorithms encounter the single-point-failure issue with the involvement of the central server, which is not only easily exposed to external attacks, but also limits the number of clients sharing information with the server under restricted uplink bandwidth. In this paper, we propose CiderTF, a communication-efficient decentralized generalized tensor factorization, which reduces the uplink communication cost by leveraging a four-level communication reduction strategy designed for a generalized tensor factorization, which has the flexibility of modeling different tensor distribution with multiple kinds of loss functions. Experiments on two real-world EHR datasets demonstrate that CiderTF achieves comparable convergence with the communication reduction up to 99.99%.","PeriodicalId":74565,"journal":{"name":"Proceedings. IEEE International Conference on Data Mining","volume":"2021 ","pages":"1216-1221"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9652777/pdf/nihms-1846243.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9738773","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SCEHR: Supervised Contrastive Learning for Clinical Risk Prediction using Electronic Health Records. SCEHR:使用电子健康记录进行临床风险预测的监督对比学习。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2021-12-01 DOI: 10.1109/icdm51629.2021.00097

Chengxi Zang, Fei Wang

Contrastive learning has demonstrated promising performance in image and text domains either in a self-supervised or a supervised manner. In this work, we extend the supervised contrastive learning framework to clinical risk prediction problems based on longitudinal electronic health records (EHR). We propose a general supervised contrastive loss $ℒ_{C o n t r a s t i v e C r o s s E n t r o p y} + λ ℒ_{S u p e r v i s e d C o n t r a s t i v e R e g u l a r i z e r}$ for learning both binary classification (e.g. in-hospital mortality prediction) and multi-label classification (e.g. phenotyping) in a unified framework. Our supervised contrastive loss practices the key idea of contrastive learning, namely, pulling similar samples closer and pushing dissimilar ones apart from each other, simultaneously by its two components: $ℒ_{C o n t r a s t i v e C r o s s E n t r o p y}$ tries to contrast samples with learned anchors which represent positive and negative clusters, and $ℒ_{S u p e r v i s e d C o n t r a s t i v e R e g u l a r i z e r}$ tries to contrast samples with each other according to their supervised labels. We propose two versions of the above supervised contrastive loss and our experiments on real-world EHR data demonstrate that our proposed loss functions show benefits in improving the performance of strong baselines and even state-of-the-art models on benchmarking tasks for clinical risk predictions. Our loss functions work well with extremely imbalanced data which are common for clinical risk prediction problems. Our loss functions can be easily used to replace (binary or multi-label

对比学习在图像和文本领域都有良好的表现，无论是自监督学习还是监督学习。在这项工作中，我们将监督对比学习框架扩展到基于纵向电子健康记录(EHR)的临床风险预测问题。我们建议采用一种综合监督对比损失ℒC o n t r s t i v e C r o s s e n t r o p y +λℒs p e r u v i s e d C o n t r s t i v e r e g u l r i z e r学习两个二进制分类(如住院死亡率预测)和多标记分类(例如表现型)在一个统一的框架中。我们的监督对比损失算法实践了对比学习的关键思想，即通过它的两个组成部分，同时把相似的样本拉得更近，把不相似的样本推得更远:ℒC o n t r s t i v e C r o s s e n t r o p y试图对比样本学习锚,它代表的积极的和消极的集群,我ℒs p e r u v s e d C o n t r s t i v e r e g u l r i z e r试图相互对比样本根据他们的监管标签。我们提出了上述监督对比损失的两个版本，我们在现实世界的电子病历数据上的实验表明，我们提出的损失函数在提高强基线甚至最先进的模型在临床风险预测基准任务上的性能方面具有优势。我们的损失函数可以很好地处理临床风险预测问题中常见的极度不平衡的数据。我们的损失函数可以很容易地取代现有临床预测模型中采用的(二值或多标签)交叉熵损失。Pytorch代码在https://github.com/calvin-zcx/SCEHR上发布。

{"title":"SCEHR: Supervised Contrastive Learning for Clinical Risk Prediction using Electronic Health Records.","authors":"Chengxi Zang, Fei Wang","doi":"10.1109/icdm51629.2021.00097","DOIUrl":"https://doi.org/10.1109/icdm51629.2021.00097","url":null,"abstract":"Contrastive learning has demonstrated promising performance in image and text domains either in a self-supervised or a supervised manner. In this work, we extend the supervised contrastive learning framework to clinical risk prediction problems based on longitudinal electronic health records (EHR). We propose a general supervised contrastive loss <math> <mrow><msub><mi>ℒ</mi> <mrow><mi>C</mi> <mi>o</mi> <mi>n</mi> <mi>t</mi> <mi>r</mi> <mi>a</mi> <mi>s</mi> <mi>t</mi> <mi>i</mi> <mi>v</mi> <mi>e</mi> <mspace></mspace> <mi>C</mi> <mi>r</mi> <mi>o</mi> <mi>s</mi> <mi>s</mi> <mspace></mspace> <mi>E</mi> <mi>n</mi> <mi>t</mi> <mi>r</mi> <mi>o</mi> <mi>p</mi> <mi>y</mi></mrow> </msub> <mo>+</mo> <mi>λ</mi> <msub><mi>ℒ</mi> <mrow><mi>S</mi> <mi>u</mi> <mi>p</mi> <mi>e</mi> <mi>r</mi> <mi>v</mi> <mi>i</mi> <mi>s</mi> <mi>e</mi> <mi>d</mi> <mspace></mspace> <mi>C</mi> <mi>o</mi> <mi>n</mi> <mi>t</mi> <mi>r</mi> <mi>a</mi> <mi>s</mi> <mi>t</mi> <mi>i</mi> <mi>v</mi> <mi>e</mi> <mspace></mspace> <mi>R</mi> <mi>e</mi> <mi>g</mi> <mi>u</mi> <mi>l</mi> <mi>a</mi> <mi>r</mi> <mi>i</mi> <mi>z</mi> <mi>e</mi> <mi>r</mi></mrow> </msub> </mrow> </math> for learning both binary classification (e.g. in-hospital mortality prediction) and multi-label classification (e.g. phenotyping) in a unified framework. Our supervised contrastive loss practices the key idea of contrastive learning, namely, pulling similar samples closer and pushing dissimilar ones apart from each other, simultaneously by its two components: <math> <mrow><msub><mi>ℒ</mi> <mrow><mi>C</mi> <mi>o</mi> <mi>n</mi> <mi>t</mi> <mi>r</mi> <mi>a</mi> <mi>s</mi> <mi>t</mi> <mi>i</mi> <mi>v</mi> <mi>e</mi> <mspace></mspace> <mi>C</mi> <mi>r</mi> <mi>o</mi> <mi>s</mi> <mi>s</mi> <mspace></mspace> <mi>E</mi> <mi>n</mi> <mi>t</mi> <mi>r</mi> <mi>o</mi> <mi>p</mi> <mi>y</mi></mrow> </msub> </mrow> </math> tries to contrast samples with learned anchors which represent positive and negative clusters, and <math> <mrow><msub><mi>ℒ</mi> <mrow><mi>S</mi> <mi>u</mi> <mi>p</mi> <mi>e</mi> <mi>r</mi> <mi>v</mi> <mi>i</mi> <mi>s</mi> <mi>e</mi> <mi>d</mi> <mspace></mspace> <mi>C</mi> <mi>o</mi> <mi>n</mi> <mi>t</mi> <mi>r</mi> <mi>a</mi> <mi>s</mi> <mi>t</mi> <mi>i</mi> <mi>v</mi> <mi>e</mi> <mspace></mspace> <mi>R</mi> <mi>e</mi> <mi>g</mi> <mi>u</mi> <mi>l</mi> <mi>a</mi> <mi>r</mi> <mi>i</mi> <mi>z</mi> <mi>e</mi> <mi>r</mi></mrow> </msub> </mrow> </math> tries to contrast samples with each other according to their supervised labels. We propose two versions of the above supervised contrastive loss and our experiments on real-world EHR data demonstrate that our proposed loss functions show benefits in improving the performance of strong baselines and even state-of-the-art models on benchmarking tasks for clinical risk predictions. Our loss functions work well with extremely imbalanced data which are common for clinical risk prediction problems. Our loss functions can be easily used to replace (binary or multi-label","PeriodicalId":74565,"journal":{"name":"Proceedings. IEEE International Conference on Data Mining","volume":" ","pages":"857-866"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9692209/pdf/nihms-1847610.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40486255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Behind Distribution Shift: Mining Driving Forces of Changes and Causal Arrows. 分布转移背后:变化的驱动力和因果箭头的挖掘。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2017-11-01 Epub Date: 2017-12-18 DOI: 10.1109/ICDM.2017.114

Biwei Huang, Kun Zhang, Jiji Zhang, Ruben Sanchez-Romero, Clark Glymour, Bernhard Schölkopf

We address two important issues in causal discovery from nonstationary or heterogeneous data, where parameters associated with a causal structure may change over time or across data sets. First, we investigate how to efficiently estimate the "driving force" of the nonstationarity of a causal mechanism. That is, given a causal mechanism that varies over time or across data sets and whose qualitative structure is known, we aim to extract from data a low-dimensional and interpretable representation of the main components of the changes. For this purpose we develop a novel kernel embedding of nonstationary conditional distributions that does not rely on sliding windows. Second, the embedding also leads to a measure of dependence between the changes of causal modules that can be used to determine the directions of many causal arrows. We demonstrate the power of our methods with experiments on both synthetic and real data.

我们解决了从非平稳或异构数据中发现因果关系的两个重要问题，其中与因果结构相关的参数可能随时间或跨数据集而变化。首先，我们研究了如何有效地估计因果机制的非平稳性的“驱动力”。也就是说，给定随时间或数据集而变化的因果机制，并且其定性结构是已知的，我们的目标是从数据中提取变化的主要组成部分的低维和可解释的表示。为此，我们开发了一种新的不依赖于滑动窗口的非平稳条件分布的核嵌入。其次，嵌入还导致因果模块变化之间的依赖度量，可用于确定许多因果箭头的方向。我们用合成数据和真实数据的实验证明了我们的方法的力量。

引用次数: 27

Binary Classifier Calibration using an Ensemble of Near Isotonic Regression Models. 基于近等压回归模型集合的二值分类器标定。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2016-12-01 DOI: 10.1109/ICDM.2016.0047

Mahdi Pakdaman Naeini, Gregory F Cooper

Learning accurate probabilistic models from data is crucial in many practical tasks in data mining. In this paper we present a new non-parametric calibration method called ensemble of near isotonic regression (ENIR). The method can be considered as an extension of BBQ [20], a recently proposed calibration method, as well as the commonly used calibration method based on isotonic regression (IsoRegC) [27]. ENIR is designed to address the key limitation of IsoRegC which is the monotonicity assumption of the predictions. Similar to BBQ, the method post-processes the output of a binary classifier to obtain calibrated probabilities. Thus it can be used with many existing classification models to generate accurate probabilistic predictions. We demonstrate the performance of ENIR on synthetic and real datasets for commonly applied binary classification models. Experimental results show that the method outperforms several common binary classifier calibration methods. In particular on the real data, ENIR commonly performs statistically significantly better than the other methods, and never worse. It is able to improve the calibration power of classifiers, while retaining their discrimination power. The method is also computationally tractable for large scale datasets, as it is O(N log N) time, where N is the number of samples.

从数据中学习准确的概率模型在数据挖掘的许多实际任务中是至关重要的。本文提出了一种新的非参数定标方法——近等压回归系综(ENIR)。该方法可以看作是最近提出的一种校准方法BBQ[20]的扩展，以及常用的基于等压回归的校准方法(IsoRegC)[27]。ENIR旨在解决IsoRegC的关键限制，即预测的单调性假设。与BBQ类似，该方法对二值分类器的输出进行后处理以获得校准概率。因此，它可以与许多现有的分类模型一起使用，以产生准确的概率预测。对于常用的二分类模型，我们展示了ENIR在合成数据集和真实数据集上的性能。实验结果表明，该方法优于几种常用的二值分类器标定方法。特别是在实际数据上，ENIR通常比其他方法在统计上表现得更好，而且不会更差。在保持分类器识别能力的同时，提高了分类器的校准能力。对于大规模数据集，该方法在计算上也易于处理，因为它是O(N log N)时间，其中N是样本数量。

{"title":"Binary Classifier Calibration using an Ensemble of Near Isotonic Regression Models.","authors":"Mahdi Pakdaman Naeini, Gregory F Cooper","doi":"10.1109/ICDM.2016.0047","DOIUrl":"https://doi.org/10.1109/ICDM.2016.0047","url":null,"abstract":"Learning accurate probabilistic models from data is crucial in many practical tasks in data mining. In this paper we present a new non-parametric calibration method called ensemble of near isotonic regression (ENIR). The method can be considered as an extension of BBQ [20], a recently proposed calibration method, as well as the commonly used calibration method based on isotonic regression (IsoRegC) [27]. ENIR is designed to address the key limitation of IsoRegC which is the monotonicity assumption of the predictions. Similar to BBQ, the method post-processes the output of a binary classifier to obtain calibrated probabilities. Thus it can be used with many existing classification models to generate accurate probabilistic predictions. We demonstrate the performance of ENIR on synthetic and real datasets for commonly applied binary classification models. Experimental results show that the method outperforms several common binary classifier calibration methods. In particular on the real data, ENIR commonly performs statistically significantly better than the other methods, and never worse. It is able to improve the calibration power of classifiers, while retaining their discrimination power. The method is also computationally tractable for large scale datasets, as it is O(N log N) time, where N is the number of samples.","PeriodicalId":74565,"journal":{"name":"Proceedings. IEEE International Conference on Data Mining","volume":"2016 ","pages":"360-369"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICDM.2016.0047","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9889794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 38

Self-Grouping Multi-Network Clustering. 自分组多网络集群。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2016-12-01 Epub Date: 2017-02-02 DOI: 10.1109/ICDM.2016.0146

Jingchao Ni, Wei Cheng, Wei Fan, Xiang Zhang

Joint clustering of multiple networks has been shown to be more accurate than performing clustering on individual networks separately. Many multi-view and multi-domain network clustering methods have been developed for joint multi-network clustering. These methods typically assume there is a common clustering structure shared by all networks, and different networks can provide complementary information on this underlying clustering structure. However, this assumption is too strict to hold in many emerging real-life applications, where multiple networks have diverse data distributions. More popularly, the networks in consideration belong to different underlying groups. Only networks in the same underlying group share similar clustering structures. Better clustering performance can be achieved by considering such groups differently. As a result, an ideal method should be able to automatically detect network groups so that networks in the same group share a common clustering structure. To address this problem, we propose a novel method, ComClus, to simultaneously group and cluster multiple networks. ComClus treats node clusters as features of networks and uses them to differentiate different network groups. Network grouping and clustering are coupled and mutually enhanced during the learning process. Extensive experimental evaluation on a variety of synthetic and real datasets demonstrates the effectiveness of our method.

多个网络的联合聚类已被证明比单独对单个网络进行聚类更准确。为了实现多网络联合聚类，人们开发了多种多视图、多域网络聚类方法。这些方法通常假设存在所有网络共享的公共聚类结构，并且不同的网络可以在这个底层聚类结构上提供补充信息。然而，这种假设过于严格，无法适用于许多新兴的现实应用程序，因为在这些应用程序中，多个网络具有不同的数据分布。更普遍的是，所考虑的网络属于不同的底层群体。只有同一底层组中的网络共享相似的聚类结构。通过以不同的方式考虑这些组，可以获得更好的集群性能。因此，理想的方法应该能够自动检测网络组，以便同一组中的网络共享一个共同的聚类结构。为了解决这个问题，我们提出了一种新的方法ComClus，同时对多个网络进行分组和聚类。comcluus将节点集群视为网络的特征，并利用它们来区分不同的网络组。网络分组和网络聚类在学习过程中是相互耦合、相互促进的。在各种合成和真实数据集上进行了广泛的实验评估，证明了我们的方法的有效性。

{"title":"Self-Grouping Multi-Network Clustering.","authors":"Jingchao Ni, Wei Cheng, Wei Fan, Xiang Zhang","doi":"10.1109/ICDM.2016.0146","DOIUrl":"10.1109/ICDM.2016.0146","url":null,"abstract":"Joint clustering of multiple networks has been shown to be more accurate than performing clustering on individual networks separately. Many multi-view and multi-domain network clustering methods have been developed for joint multi-network clustering. These methods typically assume there is a common clustering structure shared by all networks, and different networks can provide complementary information on this underlying clustering structure. However, this assumption is too strict to hold in many emerging real-life applications, where multiple networks have diverse data distributions. More popularly, the networks in consideration belong to different underlying groups. Only networks in the same underlying group share similar clustering structures. Better clustering performance can be achieved by considering such groups differently. As a result, an ideal method should be able to automatically detect network groups so that networks in the same group share a common clustering structure. To address this problem, we propose a novel method, ComClus, to simultaneously group and cluster multiple networks. ComClus treats node clusters as features of networks and uses them to differentiate different network groups. Network grouping and clustering are coupled and mutually enhanced during the learning process. Extensive experimental evaluation on a variety of synthetic and real datasets demonstrates the effectiveness of our method.","PeriodicalId":74565,"journal":{"name":"Proceedings. IEEE International Conference on Data Mining","volume":" ","pages":"1119-1124"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICDM.2016.0146","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35161055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

New Probabilistic Multi-Graph Decomposition Model to Identify Consistent Human Brain Network Modules. 一种新的概率多图分解模型识别一致性人脑网络模块。

Proceedings. IEEE International Conference on Data Mining

Pub Date : 2016-12-01 Epub Date: 2017-02-02 DOI: 10.1109/ICDM.2016.0041

Dijun Luo, Zhouyuan Huo, Yang Wang, Andrew J Saykin, Li Shen, Heng Huang

Many recent scientific efforts have been devoted to constructing the human connectome using Diffusion Tensor Imaging (DTI) data for understanding large-scale brain networks that underlie higher-level cognition in human. However, suitable network analysis computational tools are still lacking in human brain connectivity research. To address this problem, we propose a novel probabilistic multi-graph decomposition model to identify consistent network modules from the brain connectivity networks of the studied subjects. At first, we propose a new probabilistic graph decomposition model to address the high computational complexity issue in existing stochastic block models. After that, we further extend our new probabilistic graph decomposition model for multiple networks/graphs to identify the shared modules cross multiple brain networks by simultaneously incorporating multiple networks and predicting the hidden block state variables. We also derive an efficient optimization algorithm to solve the proposed objective and estimate the model parameters. We validate our method by analyzing both the weighted fiber connectivity networks constructed from DTI images and the standard human face image clustering benchmark data sets. The promising empirical results demonstrate the superior performance of our proposed method.

近年来，许多科学研究都致力于利用弥散张量成像(Diffusion Tensor Imaging, DTI)数据构建人类连接组，以了解人类高级认知基础上的大规模大脑网络。然而，在人脑连通性的研究中，仍然缺乏合适的网络分析计算工具。为了解决这一问题，我们提出了一种新的概率多图分解模型，从被研究对象的大脑连接网络中识别出一致的网络模块。首先，针对现有随机块模型计算复杂度高的问题，提出了一种新的概率图分解模型。之后，我们进一步扩展了新的多网络/图的概率图分解模型，通过同时合并多个网络和预测隐藏块状态变量来识别跨多个大脑网络的共享模块。我们还推导了一种有效的优化算法来求解所提出的目标和估计模型参数。通过分析由DTI图像构建的加权光纤连接网络和标准人脸图像聚类基准数据集，验证了我们的方法。实证结果表明，本文提出的方法具有良好的性能。

{"title":"New Probabilistic Multi-Graph Decomposition Model to Identify Consistent Human Brain Network Modules.","authors":"Dijun Luo, Zhouyuan Huo, Yang Wang, Andrew J Saykin, Li Shen, Heng Huang","doi":"10.1109/ICDM.2016.0041","DOIUrl":"https://doi.org/10.1109/ICDM.2016.0041","url":null,"abstract":"Many recent scientific efforts have been devoted to constructing the human connectome using Diffusion Tensor Imaging (DTI) data for understanding large-scale brain networks that underlie higher-level cognition in human. However, suitable network analysis computational tools are still lacking in human brain connectivity research. To address this problem, we propose a novel probabilistic multi-graph decomposition model to identify consistent network modules from the brain connectivity networks of the studied subjects. At first, we propose a new probabilistic graph decomposition model to address the high computational complexity issue in existing stochastic block models. After that, we further extend our new probabilistic graph decomposition model for multiple networks/graphs to identify the shared modules cross multiple brain networks by simultaneously incorporating multiple networks and predicting the hidden block state variables. We also derive an efficient optimization algorithm to solve the proposed objective and estimate the model parameters. We validate our method by analyzing both the weighted fiber connectivity networks constructed from DTI images and the standard human face image clustering benchmark data sets. The promising empirical results demonstrate the superior performance of our proposed method.","PeriodicalId":74565,"journal":{"name":"Proceedings. IEEE International Conference on Data Mining","volume":" ","pages":"301-310"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/ICDM.2016.0041","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"36044857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1