Proceedings of machine learning research最新文献

英文中文

Causal isotonic calibration for heterogeneous treatment effects 异质治疗效果的因果等渗校准

Proceedings of machine learning research

Pub Date : 2023-02-27 DOI: 10.48550/arXiv.2302.14011

L. Laan, Ernesto Ulloa-P'erez, M. Carone, Alexander Luedtke

We propose causal isotonic calibration, a novel nonparametric method for calibrating predictors of heterogeneous treatment effects. In addition, we introduce a novel data-efficient variant of calibration that avoids the need for hold-out calibration sets, which we refer to as cross-calibration. Causal isotonic cross-calibration takes cross-fitted predictors and outputs a single calibrated predictor obtained using all available data. We establish under weak conditions that causal isotonic calibration and cross-calibration both achieve fast doubly-robust calibration rates so long as either the propensity score or outcome regression is estimated well in an appropriate sense. The proposed causal isotonic calibrator can be wrapped around any black-box learning algorithm to provide strong distribution-free calibration guarantees while preserving predictive performance.

我们提出了因果等渗校准，这是一种新的非参数方法，用于校准异质性治疗效果的预测因子。此外，我们引入了一种新的数据高效校准变体，它避免了对保持校准集的需要，我们称之为交叉校准。因果等渗交叉校准采用交叉拟合的预测因子，并输出使用所有可用数据获得的单个校准预测因子。我们在弱条件下确定，只要在适当的意义上很好地估计了倾向得分或结果回归，因果等渗校准和交叉校准都能实现快速的双稳健校准率。所提出的因果等渗校准器可以包裹在任何黑盒学习算法周围，以提供强大的无分布校准保证，同时保持预测性能。

引用次数: 0

Fair admission risk prediction with proportional multicalibration. 比例多重校准的公平入场风险预测。

Proceedings of machine learning research

Pub Date : 2023-01-01

William G La Cava, Elle Lett, Guangya Wan

Fair calibration is a widely desirable fairness criteria in risk prediction contexts. One way to measure and achieve fair calibration is with multicalibration. Multicalibration constrains calibration error among flexibly-defined subpopulations while maintaining overall calibration. However, multicalibrated models can exhibit a higher percent calibration error among groups with lower base rates than groups with higher base rates. As a result, it is possible for a decision-maker to learn to trust or distrust model predictions for specific groups. To alleviate this, we propose proportional multicalibration, a criteria that constrains the percent calibration error among groups and within prediction bins. We prove that satisfying proportional multicalibration bounds a model's multicalibration as well its differential calibration, a fairness criteria that directly measures how closely a model approximates sufficiency. Therefore, proportionally calibrated models limit the ability of decision makers to distinguish between model performance on different patient groups, which may make the models more trustworthy in practice. We provide an efficient algorithm for post-processing risk prediction models for proportional multicalibration and evaluate it empirically. We conduct simulation studies and investigate a real-world application of PMC-postprocessing to prediction of emergency department patient admissions. We observe that proportional multicalibration is a promising criteria for controlling simultaneous measures of calibration fairness of a model over intersectional groups with virtually no cost in terms of classification performance.

在风险预测中，公平校准是一个广泛需要的公平标准。一种测量和实现公平校准的方法是多重校准。在保持整体校准的同时，多重校准约束了灵活定义的子种群之间的校准误差。然而，多校准模型在基率较低的组中比在基率较高的组中显示出更高百分比的校准误差。因此，决策者有可能学会信任或不信任特定群体的模型预测。为了缓解这种情况，我们提出了比例多重校准，这是一种限制组间和预测箱内校准误差百分比的标准。我们证明了满足比例多重校准边界的模型的多重校准以及它的微分校准，这是一个公平的准则，直接衡量一个模型接近的程度。因此，比例校准模型限制了决策者区分模型在不同患者群体上的表现的能力，这可能使模型在实践中更值得信赖。提出了一种有效的比例多重校准后处理风险预测模型算法，并对其进行了实证评价。我们进行了模拟研究，并调查了pmc -后处理在预测急诊科患者入院方面的实际应用。我们观察到，比例多重校准是一个很有前途的标准，用于控制模型在交叉组上的校准公平性的同时测量，在分类性能方面几乎没有成本。

{"title":"Fair admission risk prediction with proportional multicalibration.","authors":"William G La Cava, Elle Lett, Guangya Wan","doi":"","DOIUrl":"","url":null,"abstract":"Fair calibration is a widely desirable fairness criteria in risk prediction contexts. One way to measure and achieve fair calibration is with multicalibration. Multicalibration constrains calibration error among flexibly-defined subpopulations while maintaining overall calibration. However, multicalibrated models can exhibit a higher percent calibration error among groups with lower base rates than groups with higher base rates. As a result, it is possible for a decision-maker to learn to trust or distrust model predictions for specific groups. To alleviate this, we propose proportional multicalibration, a criteria that constrains the percent calibration error among groups and within prediction bins. We prove that satisfying proportional multicalibration bounds a model's multicalibration as well its differential calibration, a fairness criteria that directly measures how closely a model approximates sufficiency. Therefore, proportionally calibrated models limit the ability of decision makers to distinguish between model performance on different patient groups, which may make the models more trustworthy in practice. We provide an efficient algorithm for post-processing risk prediction models for proportional multicalibration and evaluate it empirically. We conduct simulation studies and investigate a real-world application of PMC-postprocessing to prediction of emergency department patient admissions. We observe that proportional multicalibration is a promising criteria for controlling simultaneous measures of calibration fairness of a model over intersectional groups with virtually no cost in terms of classification performance.","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"209 ","pages":"350-378"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10417639/pdf/nihms-1917236.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10008223","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Directed Graphical Models and Causal Discovery for Zero-Inflated Data. 零膨胀数据的有向图模型和因果发现。

Proceedings of machine learning research

Pub Date : 2023-01-01

Shiqing Yu, Mathias Drton, Ali Shojaie

With advances in technology, gene expression measurements from single cells can be used to gain refined insights into regulatory relationships among genes. Directed graphical models are well-suited to explore such (cause-effect) relationships. However, statistical analyses of single cell data are complicated by the fact that the data often show zero-inflated expression patterns. To address this challenge, we propose directed graphical models that are based on Hurdle conditional distributions parametrized in terms of polynomials in parent variables and their 0/1 indicators of being zero or nonzero. While directed graphs for Gaussian models are only identifiable up to an equivalence class in general, we show that, under a natural and weak assumption, the exact directed acyclic graph of our zero-inflated models can be identified. We propose methods for graph recovery, apply our model to real single-cell gene expression data on T helper cells, and show simulated experiments that validate the identifiability and graph estimation methods in practice.

随着技术的进步，单细胞基因表达测量可用于深入了解基因之间的调控关系。定向图模型非常适合用来探索这种（因果）关系。然而，由于单细胞数据通常显示零膨胀表达模式，因此单细胞数据的统计分析非常复杂。为了应对这一挑战，我们提出了基于赫尔德条件分布的有向图模型，其参数为父变量的多项式及其为零或非零的 0/1 指标。虽然高斯模型的有向图一般只能识别到等价类，但我们证明，在一个自然的弱假设下，我们的零膨胀模型的精确有向无环图是可以识别的。我们提出了恢复图的方法，将我们的模型应用于 T 辅助细胞的真实单细胞基因表达数据，并展示了在实践中验证可识别性和图估计方法的模拟实验。

引用次数: 0

Skill, or Style? Classification of Fetal Sonography Eye-Tracking Data. 技能，还是风格？胎儿超声眼动追踪数据的分类。

Proceedings of machine learning research

Pub Date : 2023-01-01

Clare Teng, Lior Drukker, Aris T Papageorghiou, J Alison Noble

We present a method for classifying human skill at fetal ultrasound scanning from eye-tracking and pupillary data of sonographers. Human skill characterization for this clinical task typically creates groupings of clinician skills such as expert and beginner based on the number of years of professional experience; experts typically have more than 10 years and beginners between 0-5 years. In some cases, they also include trainees who are not yet fully-qualified professionals. Prior work has considered eye movements that necessitates separating eye-tracking data into eye movements, such as fixations and saccades. Our method does not use prior assumptions about the relationship between years of experience and does not require the separation of eye-tracking data. Our best performing skill classification model achieves an F1 score of 98% and 70% for expert and trainee classes respectively. We also show that years of experience as a direct measure of skill, is significantly correlated to the expertise of a sonographer.

我们介绍了一种根据超声技师的眼动追踪和瞳孔数据对人类胎儿超声扫描技能进行分类的方法。针对这项临床任务的人类技能特征描述通常会根据专业经验年限对临床医生的技能进行分组，如专家和初学者；专家通常拥有 10 年以上的专业经验，初学者则在 0-5 年之间。在某些情况下，他们还包括尚未完全获得专业资格的受训人员。之前的工作考虑了眼球运动，这就需要将眼球跟踪数据分离成眼球运动，如定点和眼球移动。我们的方法不使用关于工作年限之间关系的先验假设，也不需要分离眼动跟踪数据。我们性能最好的技能分类模型在专家和学员类别中的 F1 分数分别达到了 98% 和 70%。我们还表明，作为技能的直接衡量标准，工作经验年限与超声波技师的专业技能有显著相关性。

引用次数: 0

Contrastive Representation Learning for Gaze Estimation. 用于凝视估计的对比表征学习

Proceedings of machine learning research

Pub Date : 2023-01-01

Swati Jindal, Roberto Manduchi

Self-supervised learning (SSL) has become prevalent for learning representations in computer vision. Notably, SSL exploits contrastive learning to encourage visual representations to be invariant under various image transformations. The task of gaze estimation, on the other hand, demands not just invariance to various appearances but also equivariance to the geometric transformations. In this work, we propose a simple contrastive representation learning framework for gaze estimation, named Gaze Contrastive Learning (GazeCLR). GazeCLR exploits multi-view data to promote equivariance and relies on selected data augmentation techniques that do not alter gaze directions for invariance learning. Our experiments demonstrate the effectiveness of GazeCLR for several settings of the gaze estimation task. Particularly, our results show that GazeCLR improves the performance of cross-domain gaze estimation and yields as high as 17.2% relative improvement. Moreover, the GazeCLR framework is competitive with state-of-the-art representation learning methods for few-shot evaluation. The code and pre-trained models are available at https://github.com/jswati31/gazeclr.

自我监督学习（SSL）已成为计算机视觉表征学习的主流。值得注意的是，SSL 利用对比学习鼓励视觉表征在各种图像变换下保持不变。另一方面，凝视估计任务不仅要求对各种外观保持不变，还要求对几何变换保持等差数列。在这项工作中，我们为注视估计提出了一个简单的对比表示学习框架，命名为注视对比学习（Gaze Contrastive Learning，GazeCLR）。GazeCLR 利用多视角数据来促进等差性，并依靠不改变注视方向的选定数据增强技术来进行不变量学习。我们的实验证明了 GazeCLR 在几种凝视估计任务设置中的有效性。特别是，我们的实验结果表明，GazeCLR 提高了跨域注视估计的性能，相对提高率高达 17.2%。此外，GazeCLR 框架在少镜头评估方面与最先进的表示学习方法相比具有竞争力。代码和预训练模型可在 https://github.com/jswati31/gazeclr 上获取。

{"title":"Contrastive Representation Learning for Gaze Estimation.","authors":"Swati Jindal, Roberto Manduchi","doi":"","DOIUrl":"","url":null,"abstract":"Self-supervised learning (SSL) has become prevalent for learning representations in computer vision. Notably, SSL exploits contrastive learning to encourage visual representations to be invariant under various image transformations. The task of gaze estimation, on the other hand, demands not just invariance to various appearances but also equivariance to the geometric transformations. In this work, we propose a simple contrastive representation learning framework for gaze estimation, named Gaze Contrastive Learning (GazeCLR). GazeCLR exploits multi-view data to promote equivariance and relies on selected data augmentation techniques that do not alter gaze directions for invariance learning. Our experiments demonstrate the effectiveness of GazeCLR for several settings of the gaze estimation task. Particularly, our results show that GazeCLR improves the performance of cross-domain gaze estimation and yields as high as 17.2% relative improvement. Moreover, the GazeCLR framework is competitive with state-of-the-art representation learning methods for few-shot evaluation. The code and pre-trained models are available at https://github.com/jswati31/gazeclr.","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"210 ","pages":"37-49"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10270367/pdf/nihms-1862058.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9715239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Privacy-preserving patient clustering for personalized federated learning. 针对个性化联合学习的隐私保护患者聚类。

Proceedings of machine learning research

Pub Date : 2023-01-01

Ahmed Elhussein, Gamze Gürsoy

Federated Learning (FL) is a machine learning framework that enables multiple organizations to train a model without sharing their data with a central server. However, it experiences significant performance degradation if the data is non-identically independently distributed (non-IID). This is a problem in medical settings, where variations in the patient population contribute significantly to distribution differences across hospitals. Personalized FL addresses this issue by accounting for site-specific distribution differences. Clustered FL, a Personalized FL variant, was used to address this problem by clustering patients into groups across hospitals and training separate models on each group. However, privacy concerns remained as a challenge as the clustering process requires exchange of patient-level information. This was previously solved by forming clusters using aggregated data, which led to inaccurate groups and performance degradation. In this study, we propose Privacy-preserving Community-Based Federated machine Learning (PCBFL), a novel Clustered FL framework that can cluster patients using patient-level data while protecting privacy. PCBFL uses Secure Multiparty Computation, a cryptographic technique, to securely calculate patient-level similarity scores across hospitals. We then evaluate PCBFL by training a federated mortality prediction model using 20 sites from the eICU dataset. We compare the performance gain from PCBFL against traditional and existing Clustered FL frameworks. Our results show that PCBFL successfully forms clinically meaningful cohorts of low, medium, and high-risk patients. PCBFL outperforms traditional and existing Clustered FL frameworks with an average AUC improvement of 4.3% and AUPRC improvement of 7.8%.

联合学习（FL）是一种机器学习框架，它能让多个组织在不与中央服务器共享数据的情况下训练一个模型。但是，如果数据是非相同独立分布的（非 IID），它的性能就会明显下降。这在医疗环境中是个问题，因为病人群体的变化会极大地导致医院间的分布差异。个性化 FL 通过考虑特定地点的分布差异来解决这一问题。聚类 FL 是个性化 FL 的一种变体，通过将不同医院的患者聚类为不同组别，并对每个组别进行单独的模型训练，从而解决了这一问题。然而，由于聚类过程需要交换患者级别的信息，隐私问题仍然是一个挑战。以前解决这个问题的方法是使用聚合数据形成聚类，但这会导致分组不准确和性能下降。在本研究中，我们提出了保护隐私的基于社区的联合机器学习（PCBFL），这是一种新颖的聚类 FL 框架，可在保护隐私的同时使用患者级数据对患者进行聚类。PCBFL 使用加密技术 "安全多方计算"（Secure Multiparty Computation）来安全地计算医院间患者级别的相似性得分。然后，我们使用 eICU 数据集中的 20 个站点训练了一个联合死亡率预测模型，对 PCBFL 进行了评估。我们将 PCBFL 的性能增益与传统和现有的聚类 FL 框架进行了比较。我们的结果表明，PCBFL 成功地形成了具有临床意义的低、中、高风险患者队列。PCBFL 优于传统和现有的聚类 FL 框架，平均 AUC 提高了 4.3%，AUPRC 提高了 7.8%。

{"title":"Privacy-preserving patient clustering for personalized federated learning.","authors":"Ahmed Elhussein, Gamze Gürsoy","doi":"","DOIUrl":"","url":null,"abstract":"Federated Learning (FL) is a machine learning framework that enables multiple organizations to train a model without sharing their data with a central server. However, it experiences significant performance degradation if the data is non-identically independently distributed (non-IID). This is a problem in medical settings, where variations in the patient population contribute significantly to distribution differences across hospitals. Personalized FL addresses this issue by accounting for site-specific distribution differences. Clustered FL, a Personalized FL variant, was used to address this problem by clustering patients into groups across hospitals and training separate models on each group. However, privacy concerns remained as a challenge as the clustering process requires exchange of patient-level information. This was previously solved by forming clusters using aggregated data, which led to inaccurate groups and performance degradation. In this study, we propose Privacy-preserving Community-Based Federated machine Learning (PCBFL), a novel Clustered FL framework that can cluster patients using patient-level data while protecting privacy. PCBFL uses Secure Multiparty Computation, a cryptographic technique, to securely calculate patient-level similarity scores across hospitals. We then evaluate PCBFL by training a federated mortality prediction model using 20 sites from the eICU dataset. We compare the performance gain from PCBFL against traditional and existing Clustered FL frameworks. Our results show that PCBFL successfully forms clinically meaningful cohorts of low, medium, and high-risk patients. PCBFL outperforms traditional and existing Clustered FL frameworks with an average AUC improvement of 4.3% and AUPRC improvement of 7.8%.","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"219 ","pages":"150-166"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11376435/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142141965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SRDA: Mobile Sensing based Fluid Overload Detection for End Stage Kidney Disease Patients using Sensor Relation Dual Autoencoder. SRDA：基于移动传感的末期肾病患者体液超负荷检测（使用传感器关系双自动编码器）。

Proceedings of machine learning research

Pub Date : 2023-01-01

Mingyue Tang, Jiechao Gao, Guimin Dong, Carl Yang, Bradford Campbell, Brendan Bowman, Jamie Marie Zoellner, Emaad Abdel-Rahman, Mehdi Boukhechba

Chronic kidney disease (CKD) is a life-threatening and prevalent disease. CKD patients, especially endstage kidney disease (ESKD) patients on hemodialysis, suffer from kidney failures and are unable to remove excessive fluid, causing fluid overload and multiple morbidities including death. Current solutions for fluid overtake monitoring such as ultrasonography and biomarkers assessment are cumbersome, discontinuous, and can only be performed in the clinic. In this paper, we propose SRDA, a latent graph learning powered fluid overload detection system based on Sensor Relation Dual Autoencoder to detect excessive fluid consumption of EKSD patients based on passively collected bio-behavioral data from smartwatch sensors. Experiments using real-world mobile sensing data indicate that SRDA outperforms the state-of-the-art baselines in both F1 score and recall, and demonstrate the potential of ubiquitous sensing for ESKD fluid intake management.

慢性肾脏病（CKD）是一种威胁生命的常见疾病。慢性肾脏病患者，尤其是接受血液透析的终末期肾脏病（ESKD）患者，因肾功能衰竭而无法排出过多的液体，导致体液超负荷和包括死亡在内的多种病症。目前的体液超负荷监测解决方案，如超声波检查和生物标志物评估，都非常繁琐、不连续，而且只能在临床上进行。在本文中，我们提出了基于传感器关系双自动编码器的潜图学习驱动的体液过量检测系统 SRDA，该系统可根据智能手表传感器被动收集的生物行为数据检测 EKSD 患者的过量液体消耗。使用真实世界移动传感数据进行的实验表明，SRDA 在 F1 分数和召回率方面都优于最先进的基线系统，并证明了无处不在的传感在 ESKD 摄入液体管理方面的潜力。

引用次数: 0

Temporal Supervised Contrastive Learning for Modeling Patient Risk Progression. 为患者风险进展建模的时间监督对比学习。

Proceedings of machine learning research

Pub Date : 2023-01-01

Shahriar Noroozizadeh, Jeremy C Weiss, George H Chen

We consider the problem of predicting how the likelihood of an outcome of interest for a patient changes over time as we observe more of the patient's data. To solve this problem, we propose a supervised contrastive learning framework that learns an embedding representation for each time step of a patient time series. Our framework learns the embedding space to have the following properties: (1) nearby points in the embedding space have similar predicted class probabilities, (2) adjacent time steps of the same time series map to nearby points in the embedding space, and (3) time steps with very different raw feature vectors map to far apart regions of the embedding space. To achieve property (3), we employ a nearest neighbor pairing mechanism in the raw feature space. This mechanism also serves as an alternative to "data augmentation", a key ingredient of contrastive learning, which lacks a standard procedure that is adequately realistic for clinical tabular data, to our knowledge. We demonstrate that our approach outperforms state-of-the-art baselines in predicting mortality of septic patients (MIMIC-III dataset) and tracking progression of cognitive impairment (ADNI dataset). Our method also consistently recovers the correct synthetic dataset embedding structure across experiments, a feat not achieved by baselines. Our ablation experiments show the pivotal role of our nearest neighbor pairing.

我们考虑的问题是，当我们观察到更多病人的数据时，如何预测病人感兴趣的结果的可能性会随着时间的推移而发生变化。为了解决这个问题，我们提出了一个有监督的对比学习框架，该框架可以为患者时间序列的每个时间步学习一个嵌入表征。我们的框架学习的嵌入空间具有以下特性：(1) 嵌入空间中的邻近点具有相似的预测类别概率；(2) 同一时间序列的相邻时间步映射到嵌入空间中的邻近点；(3) 原始特征向量截然不同的时间步映射到嵌入空间中相距甚远的区域。为了实现特性（3），我们在原始特征空间中采用了近邻配对机制。这种机制也是 "数据扩增 "的替代方法，而 "数据扩增 "是对比学习的一个关键要素，据我们所知，临床表格数据缺乏足够现实的标准程序。我们证明，在预测败血症患者死亡率（MIMIC-III 数据集）和跟踪认知障碍进展（ADNI 数据集）方面，我们的方法优于最先进的基线方法。我们的方法还能在各种实验中持续恢复正确的合成数据集嵌入结构，这是基线方法无法实现的。我们的消融实验显示了近邻配对的关键作用。

{"title":"Temporal Supervised Contrastive Learning for Modeling Patient Risk Progression.","authors":"Shahriar Noroozizadeh, Jeremy C Weiss, George H Chen","doi":"","DOIUrl":"","url":null,"abstract":"We consider the problem of predicting how the likelihood of an outcome of interest for a patient changes over time as we observe more of the patient's data. To solve this problem, we propose a supervised contrastive learning framework that learns an embedding representation for each time step of a patient time series. Our framework learns the embedding space to have the following properties: (1) nearby points in the embedding space have similar predicted class probabilities, (2) adjacent time steps of the same time series map to nearby points in the embedding space, and (3) time steps with very different raw feature vectors map to far apart regions of the embedding space. To achieve property (3), we employ a nearest neighbor pairing mechanism in the raw feature space. This mechanism also serves as an alternative to \"data augmentation\", a key ingredient of contrastive learning, which lacks a standard procedure that is adequately realistic for clinical tabular data, to our knowledge. We demonstrate that our approach outperforms state-of-the-art baselines in predicting mortality of septic patients (MIMIC-III dataset) and tracking progression of cognitive impairment (ADNI dataset). Our method also consistently recovers the correct synthetic dataset embedding structure across experiments, a feat not achieved by baselines. Our ablation experiments show the pivotal role of our nearest neighbor pairing.","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"225 ","pages":"403-427"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10976929/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140320101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multiple Imputation with Neural Network Gaussian Process for High-dimensional Incomplete Data. 高维不完整数据的神经网络高斯过程多重插值。

Proceedings of machine learning research

Pub Date : 2022-12-01

Zongyu Dai, Zhiqi Bu, Qi Long

Missing data are ubiquitous in real world applications and, if not adequately handled, may lead to the loss of information and biased findings in downstream analysis. Particularly, high-dimensional incomplete data with a moderate sample size, such as analysis of multi-omics data, present daunting challenges. Imputation is arguably the most popular method for handling missing data, though existing imputation methods have a number of limitations. Single imputation methods such as matrix completion methods do not adequately account for imputation uncertainty and hence would yield improper statistical inference. In contrast, multiple imputation (MI) methods allow for proper inference but existing methods do not perform well in high-dimensional settings. Our work aims to address these significant methodological gaps, leveraging recent advances in neural network Gaussian process (NNGP) from a Bayesian viewpoint. We propose two NNGP-based MI methods, namely MI-NNGP, that can apply multiple imputations for missing values from a joint (posterior predictive) distribution. The MI-NNGP methods are shown to significantly outperform existing state-of-the-art methods on synthetic and real datasets, in terms of imputation error, statistical inference, robustness to missing rates, and computation costs, under three missing data mechanisms, MCAR, MAR, and MNAR. Code is available in the GitHub repository https://github.com/bestadcarry/MI-NNGP.

在现实世界的应用程序中，丢失的数据无处不在，如果处理不当，可能会导致信息丢失，并在下游分析中导致有偏差的结果。特别是，中等样本量的高维不完整数据，如多组学数据的分析，提出了艰巨的挑战。尽管现有的插入方法有许多局限性，但插入可以说是处理缺失数据的最流行的方法。单一的输入方法，如矩阵补全方法不能充分考虑输入的不确定性，因此会产生不适当的统计推断。相比之下，多重插值(MI)方法允许适当的推理，但现有方法在高维设置中表现不佳。我们的工作旨在解决这些重要的方法差距，从贝叶斯的角度利用神经网络高斯过程(NNGP)的最新进展。我们提出了两种基于nngp的MI方法，即MI- nngp，该方法可以对联合(后验预测)分布中的缺失值进行多次插值。在三种缺失数据机制(MCAR、MAR和MNAR)下，MI-NNGP方法在输入误差、统计推断、对缺失率的鲁棒性和计算成本方面明显优于现有的最先进的合成和真实数据集方法。代码可在GitHub存储库https://github.com/bestadcarry/MI-NNGP中获得。

{"title":"Multiple Imputation with Neural Network Gaussian Process for High-dimensional Incomplete Data.","authors":"Zongyu Dai, Zhiqi Bu, Qi Long","doi":"","DOIUrl":"","url":null,"abstract":"Missing data are ubiquitous in real world applications and, if not adequately handled, may lead to the loss of information and biased findings in downstream analysis. Particularly, high-dimensional incomplete data with a moderate sample size, such as analysis of multi-omics data, present daunting challenges. Imputation is arguably the most popular method for handling missing data, though existing imputation methods have a number of limitations. Single imputation methods such as matrix completion methods do not adequately account for imputation uncertainty and hence would yield improper statistical inference. In contrast, multiple imputation (MI) methods allow for proper inference but existing methods do not perform well in high-dimensional settings. Our work aims to address these significant methodological gaps, leveraging recent advances in neural network Gaussian process (NNGP) from a Bayesian viewpoint. We propose two NNGP-based MI methods, namely MI-NNGP, that can apply multiple imputations for missing values from a joint (posterior predictive) distribution. The MI-NNGP methods are shown to significantly outperform existing state-of-the-art methods on synthetic and real datasets, in terms of imputation error, statistical inference, robustness to missing rates, and computation costs, under three missing data mechanisms, MCAR, MAR, and MNAR. Code is available in the GitHub repository https://github.com/bestadcarry/MI-NNGP.","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"189 ","pages":"265-279"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10348708/pdf/nihms-1861886.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9821494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Meta-analysis of individualized treatment rules via sign-coherency 基于符号一致性的个体化治疗规则元分析

Proceedings of machine learning research

Pub Date : 2022-11-28 DOI: 10.48550/arXiv.2211.15476

Jay Jojo Cheng, J. Huling, Guanhua Chen

Medical treatments tailored to a patient's baseline characteristics hold the potential of improving patient outcomes while reducing negative side effects. Learning individualized treatment rules (ITRs) often requires aggregation of multiple datasets(sites); however, current ITR methodology does not take between-site heterogeneity into account, which can hurt model generalizability when deploying back to each site. To address this problem, we develop a method for individual-level meta-analysis of ITRs, which jointly learns site-specific ITRs while borrowing information about feature sign-coherency via a scientifically-motivated directionality principle. We also develop an adaptive procedure for model tuning, using information criteria tailored to the ITR learning problem. We study the proposed methods through numerical experiments to understand their performance under different levels of between-site heterogeneity and apply the methodology to estimate ITRs in a large multi-center database of electronic health records. This work extends several popular methodologies for estimating ITRs (A-learning, weighted learning) to the multiple-sites setting.

根据患者的基线特征量身定制的医学治疗有可能改善患者的预后，同时减少负面副作用。学习个性化治疗规则(itr)通常需要汇总多个数据集(站点);然而，当前的ITR方法没有考虑到站点之间的异质性，这可能会在部署到每个站点时损害模型的泛化性。为了解决这一问题，我们开发了一种个体层面的itr元分析方法，该方法通过科学动机的方向性原则，在借鉴特征符号一致性信息的同时，共同学习特定地点的itr。我们还开发了一个自适应的模型调优过程，使用针对ITR学习问题量身定制的信息标准。我们通过数值实验研究了所提出的方法，以了解它们在不同站点间异质性水平下的性能，并将该方法应用于大型多中心电子健康记录数据库中的itr估计。这项工作将几种流行的估计itr的方法(a -学习，加权学习)扩展到多站点设置。

{"title":"Meta-analysis of individualized treatment rules via sign-coherency","authors":"Jay Jojo Cheng, J. Huling, Guanhua Chen","doi":"10.48550/arXiv.2211.15476","DOIUrl":"https://doi.org/10.48550/arXiv.2211.15476","url":null,"abstract":"Medical treatments tailored to a patient's baseline characteristics hold the potential of improving patient outcomes while reducing negative side effects. Learning individualized treatment rules (ITRs) often requires aggregation of multiple datasets(sites); however, current ITR methodology does not take between-site heterogeneity into account, which can hurt model generalizability when deploying back to each site. To address this problem, we develop a method for individual-level meta-analysis of ITRs, which jointly learns site-specific ITRs while borrowing information about feature sign-coherency via a scientifically-motivated directionality principle. We also develop an adaptive procedure for model tuning, using information criteria tailored to the ITR learning problem. We study the proposed methods through numerical experiments to understand their performance under different levels of between-site heterogeneity and apply the methodology to estimate ITRs in a large multi-center database of electronic health records. This work extends several popular methodologies for estimating ITRs (A-learning, weighted learning) to the multiple-sites setting.","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"193 1","pages":"171-198"},"PeriodicalIF":0.0,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47047493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of machine learning research

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀