首页 > 最新文献

Proceedings of machine learning research最新文献

英文 中文
Multiple Imputation with Neural Network Gaussian Process for High-dimensional Incomplete Data 高维不完全数据的神经网络高斯过程多重脉冲
Pub Date : 2022-11-23 DOI: 10.48550/arXiv.2211.13297
Zongyu Dai, Zhiqi Bu, Q. Long
Missing data are ubiquitous in real world applications and, if not adequately handled, may lead to the loss of information and biased findings in downstream analysis. Particularly, high-dimensional incomplete data with a moderate sample size, such as analysis of multi-omics data, present daunting challenges. Imputation is arguably the most popular method for handling missing data, though existing imputation methods have a number of limitations. Single imputation methods such as matrix completion methods do not adequately account for imputation uncertainty and hence would yield improper statistical inference. In contrast, multiple imputation (MI) methods allow for proper inference but existing methods do not perform well in high-dimensional settings. Our work aims to address these significant methodological gaps, leveraging recent advances in neural network Gaussian process (NNGP) from a Bayesian viewpoint. We propose two NNGP-based MI methods, namely MI-NNGP, that can apply multiple imputations for missing values from a joint (posterior predictive) distribution. The MI-NNGP methods are shown to significantly outperform existing state-of-the-art methods on synthetic and real datasets, in terms of imputation error, statistical inference, robustness to missing rates, and computation costs, under three missing data mechanisms, MCAR, MAR, and MNAR. Code is available in the GitHub repository https://github.com/bestadcarry/MI-NNGP.
在现实世界的应用程序中,丢失的数据无处不在,如果处理不当,可能会导致信息丢失,并在下游分析中导致有偏差的结果。特别是,中等样本量的高维不完整数据,如多组学数据的分析,提出了艰巨的挑战。尽管现有的插入方法有许多局限性,但插入可以说是处理缺失数据的最流行的方法。单一的输入方法,如矩阵补全方法不能充分考虑输入的不确定性,因此会产生不适当的统计推断。相比之下,多重插值(MI)方法允许适当的推理,但现有方法在高维设置中表现不佳。我们的工作旨在解决这些重要的方法差距,从贝叶斯的角度利用神经网络高斯过程(NNGP)的最新进展。我们提出了两种基于nngp的MI方法,即MI- nngp,该方法可以对联合(后验预测)分布中的缺失值进行多次插值。在三种缺失数据机制(MCAR、MAR和MNAR)下,MI-NNGP方法在输入误差、统计推断、对缺失率的鲁棒性和计算成本方面明显优于现有的最先进的合成和真实数据集方法。代码可在GitHub存储库https://github.com/bestadcarry/MI-NNGP中获得。
{"title":"Multiple Imputation with Neural Network Gaussian Process for High-dimensional Incomplete Data","authors":"Zongyu Dai, Zhiqi Bu, Q. Long","doi":"10.48550/arXiv.2211.13297","DOIUrl":"https://doi.org/10.48550/arXiv.2211.13297","url":null,"abstract":"Missing data are ubiquitous in real world applications and, if not adequately handled, may lead to the loss of information and biased findings in downstream analysis. Particularly, high-dimensional incomplete data with a moderate sample size, such as analysis of multi-omics data, present daunting challenges. Imputation is arguably the most popular method for handling missing data, though existing imputation methods have a number of limitations. Single imputation methods such as matrix completion methods do not adequately account for imputation uncertainty and hence would yield improper statistical inference. In contrast, multiple imputation (MI) methods allow for proper inference but existing methods do not perform well in high-dimensional settings. Our work aims to address these significant methodological gaps, leveraging recent advances in neural network Gaussian process (NNGP) from a Bayesian viewpoint. We propose two NNGP-based MI methods, namely MI-NNGP, that can apply multiple imputations for missing values from a joint (posterior predictive) distribution. The MI-NNGP methods are shown to significantly outperform existing state-of-the-art methods on synthetic and real datasets, in terms of imputation error, statistical inference, robustness to missing rates, and computation costs, under three missing data mechanisms, MCAR, MAR, and MNAR. Code is available in the GitHub repository https://github.com/bestadcarry/MI-NNGP.","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"189 1","pages":"265-279"},"PeriodicalIF":0.0,"publicationDate":"2022-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43147782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Predicting Attrition Patterns from Pediatric Weight Management Programs. 预测儿科体重管理计划的减员模式
Hamed Fayyaz, Thao-Ly T Phan, H Timothy Bunnell, Rahmatollah Beheshti

Obesity is a major public health concern. Multidisciplinary pediatric weight management programs are considered standard treatment for children with obesity who are not able to be successfully managed in the primary care setting. Despite their great potential, high dropout rates (referred to as attrition) are a major hurdle in delivering successful interventions. Predicting attrition patterns can help providers reduce the alarmingly high rates of attrition (up to 80%) by engaging in earlier and more personalized interventions. Previous work has mainly focused on finding static predictors of attrition on smaller datasets and has achieved limited success in effective prediction. In this study, we have collected a five-year comprehensive dataset of 4,550 children from diverse backgrounds receiving treatment at four pediatric weight management programs in the US. We then developed a machine learning pipeline to predict (a) the likelihood of attrition, and (b) the change in body-mass index (BMI) percentile of children, at different time points after joining the weight management program. Our pipeline is greatly customized for this problem using advanced machine learning techniques to process longitudinal data, smaller-size data, and interrelated prediction tasks. The proposed method showed strong prediction performance as measured by AUROC scores (average AUROC of 0.77 for predicting attrition, and 0.78 for predicting weight outcomes).

肥胖症是一个重大的公共卫生问题。多学科儿科体重管理计划被认为是针对无法在初级保健环境中成功管理的肥胖症儿童的标准治疗方法。尽管其潜力巨大,但高辍学率(简称减员)是成功实施干预的主要障碍。预测流失模式可以帮助医疗服务提供者更早、更个性化地采取干预措施,从而降低惊人的高流失率(高达 80%)。以往的工作主要侧重于在较小的数据集上寻找流失的静态预测因素,在有效预测方面取得的成功有限。在这项研究中,我们收集了一个为期五年的综合数据集,其中包括在美国四个儿科体重管理项目中接受治疗的 4550 名不同背景的儿童。然后,我们开发了一个机器学习管道,用于预测儿童在加入体重管理项目后不同时间点的(a)减员可能性和(b)体重指数(BMI)百分位数的变化。针对这一问题,我们采用先进的机器学习技术,对纵向数据、较小规模的数据以及相互关联的预测任务进行处理,并对管道进行了大幅定制。从 AUROC 分数来看,所提出的方法显示出很强的预测性能(预测减员的平均 AUROC 为 0.77,预测体重结果的平均 AUROC 为 0.78)。
{"title":"Predicting Attrition Patterns from Pediatric Weight Management Programs.","authors":"Hamed Fayyaz, Thao-Ly T Phan, H Timothy Bunnell, Rahmatollah Beheshti","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Obesity is a major public health concern. Multidisciplinary pediatric weight management programs are considered standard treatment for children with obesity who are not able to be successfully managed in the primary care setting. Despite their great potential, high dropout rates (referred to as attrition) are a major hurdle in delivering successful interventions. Predicting attrition patterns can help providers reduce the alarmingly high rates of attrition (up to 80%) by engaging in earlier and more personalized interventions. Previous work has mainly focused on finding static predictors of attrition on smaller datasets and has achieved limited success in effective prediction. In this study, we have collected a five-year comprehensive dataset of 4,550 children from diverse backgrounds receiving treatment at four pediatric weight management programs in the US. We then developed a machine learning pipeline to predict (a) the likelihood of attrition, and (b) the change in body-mass index (BMI) percentile of children, at different time points after joining the weight management program. Our pipeline is greatly customized for this problem using advanced machine learning techniques to process longitudinal data, smaller-size data, and interrelated prediction tasks. The proposed method showed strong prediction performance as measured by AUROC scores (average AUROC of 0.77 for predicting attrition, and 0.78 for predicting weight outcomes).</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"193 ","pages":"326-342"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9854275/pdf/nihms-1865420.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10604379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enzyme Activity Prediction of Sequence Variants on Novel Substrates using Improved Substrate Encodings and Convolutional Pooling. 基于改进底物编码和卷积池的新底物序列变异酶活性预测。
Zhiqing Xu, Jinghao Wu, Yun S Song, Radhakrishnan Mahadevan

Protein engineering is currently being revolutionized by deep learning applications, especially through natural language processing (NLP) techniques. It has been shown that state-of-the-art self-supervised language models trained on entire protein databases capture hidden contextual and structural information in amino acid sequences and are capable of improving sequence-to-function predictions. Yet, recent studies have reported that current compound-protein modeling approaches perform poorly on learning interactions between enzymes and substrates of interest within one protein family. We attribute this to low-grade substrate encoding methods and over-compressed sequence representations received by downstream predictive models. In this study, we propose a new substrate-encoding based on Extended Connectivity Fingerprints (ECFPs) and a convolutional-pooling of the sequence embeddings. Through testing on an activity profiling dataset of haloalkanoate dehalogenase superfamily that measures activities of 218 phosphatases against 168 substrates, we show substantial improvements in predictive performances of compound-protein interaction modeling. In addition, we also test the workflow on three other datasets from the halogenase, kinase and aminotransferase families and show that our pipeline achieves good performance on these datasets as well. We further demonstrate the utility of this downstream model architecture by showing that it achieves good performance with six different protein embeddings, including ESM-1b (Rives et al., 2021), TAPE (Rao et al., 2019), ProtBert, ProtAlbert, ProtT5, and ProtXLNet (Elnaggar et al., 2021). This study provides a new workflow for activity prediction on novel substrates that can be used to engineer new enzymes for sustainability applications.

蛋白质工程目前正在通过深度学习应用,特别是通过自然语言处理(NLP)技术发生革命性的变化。研究表明,在整个蛋白质数据库上训练的最先进的自监督语言模型可以捕获氨基酸序列中隐藏的上下文和结构信息,并能够改进序列到功能的预测。然而,最近的研究报道,目前的化合物蛋白质建模方法在学习一个蛋白质家族中酶和底物之间的相互作用方面表现不佳。我们将此归因于低级底物编码方法和下游预测模型接收的过度压缩序列表示。在这项研究中,我们提出了一种新的基于扩展连接指纹(ECFPs)和序列嵌入的卷积池的基板编码。通过测试卤代烷酸脱卤酶超家族的活性分析数据集(测量218种磷酸酶对168种底物的活性),我们发现化合物-蛋白质相互作用模型的预测性能有了实质性的改进。此外,我们还在来自卤化酶,激酶和转氨酶家族的其他三个数据集上测试了工作流,并表明我们的管道在这些数据集上也取得了良好的性能。我们进一步证明了这种下游模型架构的有效性,表明它在六种不同的蛋白质嵌入中实现了良好的性能,包括ESM-1b (Rives等人,2021)、TAPE (Rao等人,2019)、ProtBert、ProtAlbert、ProtT5和ProtXLNet (Elnaggar等人,2021)。该研究为新型底物的活性预测提供了新的工作流程,可用于设计可持续性应用的新酶。
{"title":"Enzyme Activity Prediction of Sequence Variants on Novel Substrates using Improved Substrate Encodings and Convolutional Pooling.","authors":"Zhiqing Xu,&nbsp;Jinghao Wu,&nbsp;Yun S Song,&nbsp;Radhakrishnan Mahadevan","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Protein engineering is currently being revolutionized by deep learning applications, especially through natural language processing (NLP) techniques. It has been shown that state-of-the-art self-supervised language models trained on entire protein databases capture hidden contextual and structural information in amino acid sequences and are capable of improving sequence-to-function predictions. Yet, recent studies have reported that current compound-protein modeling approaches perform poorly on learning interactions between enzymes and substrates of interest within one protein family. We attribute this to low-grade substrate encoding methods and over-compressed sequence representations received by downstream predictive models. In this study, we propose a new substrate-encoding based on Extended Connectivity Fingerprints (ECFPs) and a convolutional-pooling of the sequence embeddings. Through testing on an activity profiling dataset of haloalkanoate dehalogenase superfamily that measures activities of 218 phosphatases against 168 substrates, we show substantial improvements in predictive performances of compound-protein interaction modeling. In addition, we also test the workflow on three other datasets from the halogenase, kinase and aminotransferase families and show that our pipeline achieves good performance on these datasets as well. We further demonstrate the utility of this downstream model architecture by showing that it achieves good performance with six different protein embeddings, including ESM-1b (Rives et al., 2021), TAPE (Rao et al., 2019), ProtBert, ProtAlbert, ProtT5, and ProtXLNet (Elnaggar et al., 2021). This study provides a new workflow for activity prediction on novel substrates that can be used to engineer new enzymes for sustainability applications.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"165 ","pages":"78-87"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9759087/pdf/nihms-1842132.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10762656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Selecting deep neural networks that yield consistent attribution-based interpretations for genomics. 为基因组学选择能产生一致归因解释的深度神经网络。
Antonio Majdandzic, Chandana Rajesh, Amber Tang, Shushan Toneyan, Ethan Labelson, Rohit Tripathy, Peter K Koo

Deep neural networks (DNNs) have advanced our ability to take DNA primary sequence as input and predict a myriad of molecular activities measured via high-throughput functional genomic assays. Post hoc attribution analysis has been employed to provide insights into the importance of features learned by DNNs, often revealing patterns such as sequence motifs. However, attribution maps typically harbor spurious importance scores to an extent that varies from model to model, even for DNNs whose predictions generalize well. Thus, the standard approach for model selection, which relies on performance of a held-out validation set, does not guarantee that a high-performing DNN will provide reliable explanations. Here we introduce two approaches that quantify the consistency of important features across a population of attribution maps; consistency reflects a qualitative property of human interpretable attribution maps. We employ the consistency metrics as part of a multivariate model selection framework to identify models that yield high generalization performance and interpretable attribution analysis. We demonstrate the efficacy of this approach across various DNNs quantitatively with synthetic data and qualitatively with chromatin accessibility data.

深度神经网络(DNN)提高了我们将 DNA 原始序列作为输入并预测通过高通量功能基因组测定所测得的大量分子活动的能力。事后归因分析被用来深入了解 DNNs 所学特征的重要性,通常能揭示序列图案等模式。然而,归因图通常包含虚假的重要性得分,其程度因模型而异,即使是预测通用性良好的 DNN 也不例外。因此,标准的模型选择方法依赖于保留验证集的表现,并不能保证表现优异的 DNN 能够提供可靠的解释。在此,我们介绍两种量化归因图群体中重要特征一致性的方法;一致性反映了人类可解释归因图的定性属性。我们将一致性度量作为多元模型选择框架的一部分,以确定能产生高泛化性能和可解释归因分析的模型。我们通过合成数据和染色质可及性数据分别定量和定性地证明了这种方法在各种 DNN 中的有效性。
{"title":"Selecting deep neural networks that yield consistent attribution-based interpretations for genomics.","authors":"Antonio Majdandzic, Chandana Rajesh, Amber Tang, Shushan Toneyan, Ethan Labelson, Rohit Tripathy, Peter K Koo","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Deep neural networks (DNNs) have advanced our ability to take DNA primary sequence as input and predict a myriad of molecular activities measured via high-throughput functional genomic assays. Post hoc attribution analysis has been employed to provide insights into the importance of features learned by DNNs, often revealing patterns such as sequence motifs. However, attribution maps typically harbor spurious importance scores to an extent that varies from model to model, even for DNNs whose predictions generalize well. Thus, the standard approach for model selection, which relies on performance of a held-out validation set, does not guarantee that a high-performing DNN will provide reliable explanations. Here we introduce two approaches that quantify the consistency of important features across a population of attribution maps; consistency reflects a qualitative property of human interpretable attribution maps. We employ the consistency metrics as part of a multivariate model selection framework to identify models that yield high generalization performance and interpretable attribution analysis. We demonstrate the efficacy of this approach across various DNNs quantitatively with synthetic data and qualitatively with chromatin accessibility data.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"200 ","pages":"131-149"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10194041/pdf/nihms-1895253.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9544629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Path Towards Clinical Adaptation of Accelerated MRI. 加速MRI的临床适应之路。
Michael S Yao, Michael S Hansen

Accelerated MRI reconstructs images of clinical anatomies from sparsely sampled signal data to reduce patient scan times. While recent works have leveraged deep learning to accomplish this task, such approaches have often only been explored in simulated environments where there is no signal corruption or resource limitations. In this work, we explore augmentations to neural network MRI image reconstructors to enhance their clinical relevancy. Namely, we propose a ConvNet model for detecting sources of image artifacts that achieves a classifier F 2 score of 79.1%. We also demonstrate that training reconstructors on MR signal data with variable acceleration factors can improve their average performance during a clinical patient scan by up to 2%. We offer a loss function to overcome catastrophic forgetting when models learn to reconstruct MR images of multiple anatomies and orientations. Finally, we propose a method for using simulated phantom data to pre-train reconstructors in situations with limited clinically acquired datasets and compute capabilities. Our results provide a potential path forward for clinical adaptation of accelerated MRI.

加速MRI从稀疏采样信号数据重建临床解剖图像,以减少患者扫描时间。虽然最近的研究已经利用深度学习来完成这项任务,但这种方法通常只在没有信号损坏或资源限制的模拟环境中进行了探索。在这项工作中,我们探索增强神经网络MRI图像重建,以提高其临床相关性。也就是说,我们提出了一个用于检测图像伪影源的卷积神经网络模型,该模型的分类器f2得分为79.1%。我们还证明,在具有可变加速因子的MR信号数据上训练重建器可以在临床患者扫描期间将其平均性能提高2%。我们提供了一个损失函数来克服模型学习重建多个解剖和方向的MR图像时的灾难性遗忘。最后,我们提出了一种方法,在临床上获得的数据集和计算能力有限的情况下,使用模拟幻像数据对重建器进行预训练。我们的研究结果为加速MRI的临床应用提供了一条潜在的途径。
{"title":"A Path Towards Clinical Adaptation of Accelerated MRI.","authors":"Michael S Yao,&nbsp;Michael S Hansen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Accelerated MRI reconstructs images of clinical anatomies from sparsely sampled signal data to reduce patient scan times. While recent works have leveraged deep learning to accomplish this task, such approaches have often only been explored in simulated environments where there is no signal corruption or resource limitations. In this work, we explore augmentations to neural network MRI image reconstructors to enhance their clinical relevancy. Namely, we propose a ConvNet model for detecting sources of image artifacts that achieves a classifier <i>F</i> <sub><i>2</i></sub> score of 79.1%. We also demonstrate that training reconstructors on MR signal data with variable acceleration factors can improve their average performance during a clinical patient scan by up to 2%. We offer a loss function to overcome catastrophic forgetting when models learn to reconstruct MR images of multiple anatomies and orientations. Finally, we propose a method for using simulated phantom data to pre-train reconstructors in situations with limited clinically acquired datasets and compute capabilities. Our results provide a potential path forward for clinical adaptation of accelerated MRI.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"193 ","pages":"489-511"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10061571/pdf/nihms-1846161.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9336136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Counterfactual and Factual Reasoning over Hypergraphs for Interpretable Clinical Predictions on EHR. 在超图上进行反事实和事实推理,在电子病历上进行可解释的临床预测。
Ran Xu, Yue Yu, Chao Zhang, Mohammed K Ali, Joyce C Ho, Carl Yang

Electronic Health Record modeling is crucial for digital medicine. However, existing models ignore higher-order interactions among medical codes and their causal relations towards downstream clinical predictions. To address such limitations, we propose a novel framework CACHE, to provide effective and insightful clinical predictions based on hypergraph representation learning and counterfactual and factual reasoning techniques. Experiments on two real EHR datasets show the superior performance of CACHE. Case studies with a domain expert illustrate a preferred capability of CACHE in generating clinically meaningful interpretations towards the correct predictions.

电子健康记录建模对数字医疗至关重要。然而,现有模型忽略了医疗代码之间的高阶交互作用及其对下游临床预测的因果关系。为了解决这些局限性,我们提出了一个新颖的框架 CACHE,基于超图表示学习以及反事实和事实推理技术,提供有效且有洞察力的临床预测。在两个真实的电子病历数据集上进行的实验显示了 CACHE 的卓越性能。与领域专家进行的案例研究表明,CACHE 在生成对正确预测有临床意义的解释方面具有优越的能力。
{"title":"Counterfactual and Factual Reasoning over Hypergraphs for Interpretable Clinical Predictions on EHR.","authors":"Ran Xu, Yue Yu, Chao Zhang, Mohammed K Ali, Joyce C Ho, Carl Yang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Electronic Health Record modeling is crucial for digital medicine. However, existing models ignore higher-order interactions among medical codes and their causal relations towards downstream clinical predictions. To address such limitations, we propose a novel framework CACHE, to provide <i>effective</i> and <i>insightful</i> clinical predictions based on hypergraph representation learning and counterfactual and factual reasoning techniques. Experiments on two real EHR datasets show the superior performance of CACHE. Case studies with a domain expert illustrate a preferred capability of CACHE in generating clinically meaningful interpretations towards the correct predictions.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"193 ","pages":"259-278"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10227831/pdf/nihms-1901945.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9553346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated intracranial vessel labeling with learning boosted by vessel connectivity, radii and spatial context. 自动颅内血管标记与学习促进血管连接,半径和空间背景。
Jannik Sobisch, Žiga Bizjak, Aichi Chien, Žiga Špiclin

Cerebrovascular diseases are among the world's top causes of death and their screening and diagnosis rely on angiographic imaging. We focused on automated anatomical labeling of cerebral arteries that enables their cross-sectional quantification and inter-subject comparisons and thereby identification of geometric risk factors correlated to the cerebrovascular diseases. We used 152 cerebral TOF-MRA angiograms from three publicly available datasets and manually created reference labeling using Slicer3D. We extracted centerlines from nnU-net based segmentations using VesselVio and labeled them according to the reference labeling. Vessel centerline coordinates, in combination with additional vessel connectivity, radius and spatial context features were used for training seven distinct PointNet++ models. Model trained solely on the vessel centerline coordinates resulted in ACC of 0.93 and across-labels average TPR was 0.88. Including vessel radius significantly improved ACC to 0.95, and average TPR to 0.91. Finally, focusing spatial context to the Circle of Willis are resulted in best ACC of 0.96 and best average TPR of 0.93. Hence, using vessel radius and spatial context greatly improved vessel labeling, with the attained perfomance opening the avenue for clinical applications of intracranial vessel labeling.

脑血管疾病是世界上最主要的死亡原因之一,其筛查和诊断依赖于血管造影成像。我们专注于脑动脉的自动解剖标记,使其横断面量化和受试者间比较成为可能,从而确定与脑血管疾病相关的几何危险因素。我们使用了来自三个公开数据集的152张大脑TOF-MRA血管图,并使用Slicer3D手动创建了参考标签。我们使用VesselVio从基于nnU-net的分割中提取中心线,并根据参考标记对其进行标记。船舶中心线坐标,结合额外的船舶连通性、半径和空间环境特征,用于训练七种不同的PointNet++模型。仅在血管中心线坐标上训练的模型ACC为0.93,跨标签平均TPR为0.88。加入血管半径后,ACC显著提高至0.95,TPR显著提高至0.91。最后,将空间背景聚焦于威利斯圈的最佳ACC值为0.96,最佳平均TPR值为0.93。因此,利用血管半径和空间背景极大地改善了血管标记,所获得的性能为颅内血管标记的临床应用开辟了道路。
{"title":"Automated intracranial vessel labeling with learning boosted by vessel connectivity, radii and spatial context.","authors":"Jannik Sobisch,&nbsp;Žiga Bizjak,&nbsp;Aichi Chien,&nbsp;Žiga Špiclin","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Cerebrovascular diseases are among the world's top causes of death and their screening and diagnosis rely on angiographic imaging. We focused on automated anatomical labeling of cerebral arteries that enables their cross-sectional quantification and inter-subject comparisons and thereby identification of geometric risk factors correlated to the cerebrovascular diseases. We used 152 cerebral TOF-MRA angiograms from three publicly available datasets and manually created reference labeling using Slicer3D. We extracted centerlines from nnU-net based segmentations using VesselVio and labeled them according to the reference labeling. Vessel centerline coordinates, in combination with additional vessel connectivity, radius and spatial context features were used for training seven distinct PointNet++ models. Model trained solely on the vessel centerline coordinates resulted in ACC of 0.93 and across-labels average TPR was 0.88. Including vessel radius significantly improved ACC to 0.95, and average TPR to 0.91. Finally, focusing spatial context to the Circle of Willis are resulted in best ACC of 0.96 and best average TPR of 0.93. Hence, using vessel radius and spatial context greatly improved vessel labeling, with the attained perfomance opening the avenue for clinical applications of intracranial vessel labeling.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"194 ","pages":"34-44"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10112880/pdf/nihms-1889674.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9389427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Extensive Data Processing Pipeline for MIMIC-IV. 用于MIMIC-IV的扩展数据处理管道。
Mehak Gupta, Brennan Gallamoza, Nicolas Cutrona, Pranjal Dhakal, Raphael Poulain, Rahmatollah Beheshti

An increasing amount of research is being devoted to applying machine learning methods to electronic health record (EHR) data for various clinical purposes. This growing area of research has exposed the challenges of the accessibility of EHRs. MIMIC is a popular, public, and free EHR dataset in a raw format that has been used in numerous studies. The absence of standardized preprocessing steps can be, however, a significant barrier to the wider adoption of this rare resource. Additionally, this absence can reduce the reproducibility of the developed tools and limit the ability to compare the results among similar studies. In this work, we provide a greatly customizable pipeline to extract, clean, and preprocess the data available in the fourth version of the MIMIC dataset (MIMIC-IV). The pipeline also presents an end-to-end wizard-like package supporting predictive model creations and evaluations. The pipeline covers a range of clinical prediction tasks which can be broadly classified into four categories - readmission, length of stay, mortality, and phenotype prediction. The tool is publicly available at https://github.com/healthylaife/MIMIC-IV-Data-Pipeline.

越来越多的研究致力于将机器学习方法应用于各种临床目的的电子健康记录(EHR)数据。这一不断发展的研究领域暴露了电子病历可及性的挑战。MIMIC是一个流行的、公开的、免费的电子病历数据集,其原始格式已在许多研究中使用。然而,缺乏标准化的预处理步骤可能是广泛采用这种稀有资源的一个重大障碍。此外,这种缺失会降低开发工具的可重复性,并限制在类似研究中比较结果的能力。在这项工作中,我们提供了一个非常可定制的管道来提取、清理和预处理第四版MIMIC数据集(MIMIC- iv)中的数据。该管道还提供了一个端到端的类似向导的包,支持预测模型的创建和评估。该管道涵盖了一系列临床预测任务,可大致分为四类-再入院,住院时间,死亡率和表型预测。该工具可在https://github.com/healthylaife/MIMIC-IV-Data-Pipeline上公开获取。
{"title":"An Extensive Data Processing Pipeline for MIMIC-IV.","authors":"Mehak Gupta,&nbsp;Brennan Gallamoza,&nbsp;Nicolas Cutrona,&nbsp;Pranjal Dhakal,&nbsp;Raphael Poulain,&nbsp;Rahmatollah Beheshti","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>An increasing amount of research is being devoted to applying machine learning methods to electronic health record (EHR) data for various clinical purposes. This growing area of research has exposed the challenges of the accessibility of EHRs. MIMIC is a popular, public, and free EHR dataset in a raw format that has been used in numerous studies. The absence of standardized preprocessing steps can be, however, a significant barrier to the wider adoption of this rare resource. Additionally, this absence can reduce the reproducibility of the developed tools and limit the ability to compare the results among similar studies. In this work, we provide a greatly customizable pipeline to extract, clean, and preprocess the data available in the fourth version of the MIMIC dataset (MIMIC-IV). The pipeline also presents an end-to-end wizard-like package supporting predictive model creations and evaluations. The pipeline covers a range of clinical prediction tasks which can be broadly classified into four categories - readmission, length of stay, mortality, and phenotype prediction. The tool is publicly available at https://github.com/healthylaife/MIMIC-IV-Data-Pipeline.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"193 ","pages":"311-325"},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9854277/pdf/nihms-1865425.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10604378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrastive Representation Learning for Gaze Estimation 注视估计的对比表征学习
Pub Date : 2022-10-24 DOI: 10.48550/arXiv.2210.13404
Swati Jindal, R. Manduchi
Self-supervised learning (SSL) has become prevalent for learning representations in computer vision. Notably, SSL exploits contrastive learning to encourage visual representations to be invariant under various image transformations. The task of gaze estimation, on the other hand, demands not just invariance to various appearances but also equivariance to the geometric transformations. In this work, we propose a simple contrastive representation learning framework for gaze estimation, named Gaze Contrastive Learning (GazeCLR). GazeCLR exploits multi-view data to promote equivariance and relies on selected data augmentation techniques that do not alter gaze directions for invariance learning. Our experiments demonstrate the effectiveness of GazeCLR for several settings of the gaze estimation task. Particularly, our results show that GazeCLR improves the performance of cross-domain gaze estimation and yields as high as 17.2% relative improvement. Moreover, the GazeCLR framework is competitive with state-of-the-art representation learning methods for few-shot evaluation. The code and pre-trained models are available at https://github.com/jswati31/gazeclr.
自监督学习(Self-supervised learning, SSL)已经成为计算机视觉中学习表征的主流。值得注意的是,SSL利用对比学习来鼓励视觉表示在各种图像转换下保持不变。另一方面,注视估计的任务不仅要求对各种外观的不变性,而且要求对几何变换的等变性。在这项工作中,我们提出了一个简单的凝视估计对比表征学习框架,称为凝视对比学习(GazeCLR)。GazeCLR利用多视图数据来促进等方差,并依赖于不改变凝视方向的选择数据增强技术来进行不变性学习。我们的实验证明了GazeCLR在几种注视估计任务设置下的有效性。特别是,我们的研究结果表明,GazeCLR提高了跨域凝视估计的性能,相对提高了17.2%。此外,GazeCLR框架在少镜头评估方面与最先进的表示学习方法具有竞争力。代码和预训练模型可在https://github.com/jswati31/gazeclr上获得。
{"title":"Contrastive Representation Learning for Gaze Estimation","authors":"Swati Jindal, R. Manduchi","doi":"10.48550/arXiv.2210.13404","DOIUrl":"https://doi.org/10.48550/arXiv.2210.13404","url":null,"abstract":"Self-supervised learning (SSL) has become prevalent for learning representations in computer vision. Notably, SSL exploits contrastive learning to encourage visual representations to be invariant under various image transformations. The task of gaze estimation, on the other hand, demands not just invariance to various appearances but also equivariance to the geometric transformations. In this work, we propose a simple contrastive representation learning framework for gaze estimation, named Gaze Contrastive Learning (GazeCLR). GazeCLR exploits multi-view data to promote equivariance and relies on selected data augmentation techniques that do not alter gaze directions for invariance learning. Our experiments demonstrate the effectiveness of GazeCLR for several settings of the gaze estimation task. Particularly, our results show that GazeCLR improves the performance of cross-domain gaze estimation and yields as high as 17.2% relative improvement. Moreover, the GazeCLR framework is competitive with state-of-the-art representation learning methods for few-shot evaluation. The code and pre-trained models are available at https://github.com/jswati31/gazeclr.","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"210 1","pages":"37-49"},"PeriodicalIF":0.0,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42458334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Path Towards Clinical Adaptation of Accelerated MRI 加速MRI的临床适应之路
Pub Date : 2022-08-26 DOI: 10.48550/arXiv.2208.12835
Michael S. Yao, M. Hansen
Accelerated MRI reconstructs images of clinical anatomies from sparsely sampled signal data to reduce patient scan times. While recent works have leveraged deep learning to accomplish this task, such approaches have often only been explored in simulated environments where there is no signal corruption or resource limitations. In this work, we explore augmentations to neural network MRI image reconstructors to enhance their clinical relevancy. Namely, we propose a ConvNet model for detecting sources of image artifacts that achieves a classifier F 2 score of 79.1%. We also demonstrate that training reconstructors on MR signal data with variable acceleration factors can improve their average performance during a clinical patient scan by up to 2%. We offer a loss function to overcome catastrophic forgetting when models learn to reconstruct MR images of multiple anatomies and orientations. Finally, we propose a method for using simulated phantom data to pre-train reconstructors in situations with limited clinically acquired datasets and compute capabilities. Our results provide a potential path forward for clinical adaptation of accelerated MRI.
加速MRI从稀疏采样信号数据重建临床解剖图像,以减少患者扫描时间。虽然最近的研究已经利用深度学习来完成这项任务,但这种方法通常只在没有信号损坏或资源限制的模拟环境中进行了探索。在这项工作中,我们探索增强神经网络MRI图像重建,以提高其临床相关性。也就是说,我们提出了一个用于检测图像伪影源的卷积神经网络模型,该模型的分类器f2得分为79.1%。我们还证明,在具有可变加速因子的MR信号数据上训练重建器可以在临床患者扫描期间将其平均性能提高2%。我们提供了一个损失函数来克服模型学习重建多个解剖和方向的MR图像时的灾难性遗忘。最后,我们提出了一种方法,在临床上获得的数据集和计算能力有限的情况下,使用模拟幻像数据对重建器进行预训练。我们的研究结果为加速MRI的临床应用提供了一条潜在的途径。
{"title":"A Path Towards Clinical Adaptation of Accelerated MRI","authors":"Michael S. Yao, M. Hansen","doi":"10.48550/arXiv.2208.12835","DOIUrl":"https://doi.org/10.48550/arXiv.2208.12835","url":null,"abstract":"Accelerated MRI reconstructs images of clinical anatomies from sparsely sampled signal data to reduce patient scan times. While recent works have leveraged deep learning to accomplish this task, such approaches have often only been explored in simulated environments where there is no signal corruption or resource limitations. In this work, we explore augmentations to neural network MRI image reconstructors to enhance their clinical relevancy. Namely, we propose a ConvNet model for detecting sources of image artifacts that achieves a classifier F 2 score of 79.1%. We also demonstrate that training reconstructors on MR signal data with variable acceleration factors can improve their average performance during a clinical patient scan by up to 2%. We offer a loss function to overcome catastrophic forgetting when models learn to reconstruct MR images of multiple anatomies and orientations. Finally, we propose a method for using simulated phantom data to pre-train reconstructors in situations with limited clinically acquired datasets and compute capabilities. Our results provide a potential path forward for clinical adaptation of accelerated MRI.","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"193 1","pages":"489-511"},"PeriodicalIF":0.0,"publicationDate":"2022-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42674329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of machine learning research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1