首页 > 最新文献

Proceedings of machine learning research最新文献

英文 中文
A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data. 基于小先导数据的大数据集分类器准确率预测的概率方法。
Ethan Harvey, Wansu Chen, David M Kent, Michael C Hughes

Practitioners building classifiers often start with a smaller pilot dataset and plan to grow to larger data in the near future. Such projects need a toolkit for extrapolating how much classifier accuracy may improve from a 2x, 10x, or 50x increase in data size. While existing work has focused on finding a single "best-fit" curve using various functional forms like power laws, we argue that modeling and assessing the uncertainty of predictions is critical yet has seen less attention. In this paper, we propose a Gaussian process model to obtain probabilistic extrapolations of accuracy or similar performance metrics as dataset size increases. We evaluate our approach in terms of error, likelihood, and coverage across six datasets. Though we focus on medical tasks and image modalities, our open source approach generalizes to any kind of classifier.

构建分类器的从业者通常从一个较小的试验数据集开始,并计划在不久的将来发展到更大的数据集。这样的项目需要一个工具包来推断数据大小增加2倍、10倍或50倍会提高多少分类器的准确性。虽然现有的工作主要集中在寻找一个单一的“最佳拟合”曲线,使用各种函数形式,如幂律,我们认为,建模和评估预测的不确定性是至关重要的,但很少有人关注。在本文中,我们提出了一个高斯过程模型,以获得随着数据集大小增加的准确性或类似性能指标的概率外推。我们根据六个数据集的误差、可能性和覆盖率来评估我们的方法。虽然我们专注于医疗任务和图像模式,但我们的开源方法可以推广到任何类型的分类器。
{"title":"A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data.","authors":"Ethan Harvey, Wansu Chen, David M Kent, Michael C Hughes","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Practitioners building classifiers often start with a smaller pilot dataset and plan to grow to larger data in the near future. Such projects need a toolkit for extrapolating how much classifier accuracy may improve from a 2x, 10x, or 50x increase in data size. While existing work has focused on finding a single \"best-fit\" curve using various functional forms like power laws, we argue that modeling and assessing the <i>uncertainty</i> of predictions is critical yet has seen less attention. In this paper, we propose a Gaussian process model to obtain probabilistic extrapolations of accuracy or similar performance metrics as dataset size increases. We evaluate our approach in terms of error, likelihood, and coverage across six datasets. Though we focus on medical tasks and image modalities, our open source approach generalizes to any kind of classifier.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"225 ","pages":"129-144"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11826957/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143434464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MULTIPAR: Supervised Irregular Tensor Factorization with Multi-task Learning for Computational Phenotyping. 基于多任务学习的不规则张量分解。
Yifei Ren, Jian Lou, Li Xiong, Joyce C Ho, Xiaoqian Jiang, Sivasubramanium Venkatraman Bhavani

Tensor factorization has received increasing interest due to its intrinsic ability to capture latent factors in multi-dimensional data with many applications including Electronic Health Records (EHR) mining. PARAFAC2 and its variants have been proposed to address irregular tensors where one of the tensor modes is not aligned, e.g., different patients in EHRs may have different length of records. PARAFAC2 has been successfully applied to EHRs for extracting meaningful medical concepts (phenotypes). Despite recent advancements, current models' predictability and interpretability are not satisfactory, which limits its utility for downstream analysis. In this paper, we propose MULTIPAR: a supervised irregular tensor factorization with multi-task learning for computational phenotyping. MULTIPAR is flexible to incorporate both static (e.g. in-hospital mortality prediction) and continuous or dynamic (e.g. the need for ventilation) tasks. By supervising the tensor factorization with downstream prediction tasks and leveraging information from multiple related predictive tasks, MULTIPAR can yield not only more meaningful phenotypes but also better predictive performance for downstream tasks. We conduct extensive experiments on two real-world temporal EHR datasets to demonstrate that MULTIPAR is scalable and achieves better tensor fit with more meaningful subgroups and stronger predictive performance compared to existing state-of-the-art methods. The implementation of MULTIPAR is available.

张量分解由于其固有的捕获多维数据中潜在因素的能力而受到越来越多的关注,包括电子健康记录(EHR)挖掘在内的许多应用。PARAFAC2及其变体已被提出用于解决其中一个张量模式未对齐的不规则张量,例如,电子病历中的不同患者可能具有不同长度的记录。PARAFAC2已成功应用于电子病历,用于提取有意义的医学概念(表型)。尽管最近取得了进展,但当前模型的可预测性和可解释性并不令人满意,这限制了其在下游分析中的效用。在本文中,我们提出了MULTIPAR:一种具有多任务学习的有监督不规则张量分解算法。MULTIPAR可以灵活地纳入静态(如住院死亡率预测)和连续或动态(如需要通风)任务。通过监督下游预测任务的张量分解,并利用来自多个相关预测任务的信息,MULTIPAR不仅可以产生更有意义的表型,还可以为下游任务提供更好的预测性能。我们在两个真实世界的时间EHR数据集上进行了广泛的实验,以证明MULTIPAR是可扩展的,与现有的最先进的方法相比,它具有更好的张量拟合和更有意义的子组,并且具有更强的预测性能。MULTIPAR的实现是可用的。
{"title":"MULTIPAR: Supervised Irregular Tensor Factorization with Multi-task Learning for Computational Phenotyping.","authors":"Yifei Ren, Jian Lou, Li Xiong, Joyce C Ho, Xiaoqian Jiang, Sivasubramanium Venkatraman Bhavani","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Tensor factorization has received increasing interest due to its intrinsic ability to capture latent factors in multi-dimensional data with many applications including Electronic Health Records (EHR) mining. PARAFAC2 and its variants have been proposed to address irregular tensors where one of the tensor modes is not aligned, e.g., different patients in EHRs may have different length of records. PARAFAC2 has been successfully applied to EHRs for extracting meaningful medical concepts (phenotypes). Despite recent advancements, current models' predictability and interpretability are not satisfactory, which limits its utility for downstream analysis. In this paper, we propose MULTIPAR: a supervised irregular tensor factorization with multi-task learning for computational phenotyping. MULTIPAR is flexible to incorporate both static (e.g. in-hospital mortality prediction) and continuous or dynamic (e.g. the need for ventilation) tasks. By supervising the tensor factorization with downstream prediction tasks and leveraging information from multiple related predictive tasks, MULTIPAR can yield not only more meaningful phenotypes but also better predictive performance for downstream tasks. We conduct extensive experiments on two real-world temporal EHR datasets to demonstrate that MULTIPAR is scalable and achieves better tensor fit with more meaningful subgroups and stronger predictive performance compared to existing state-of-the-art methods. The implementation of MULTIPAR is available.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"225 ","pages":"498-511"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11611252/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142775499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GANcMRI: Cardiac magnetic resonance video generation and physiologic guidance using latent space prompting. GANcMRI:利用潜在空间提示生成心脏磁共振视频和生理指导。
Milos Vukadinovic, Alan C Kwan, Debiao Li, David Ouyang

Generative artificial intelligence can be applied to medical imaging on tasks such as privacy-preserving image generation and superresolution and denoising of existing images. Few prior approaches have used cardiac magnetic resonance imaging (cMRI) as a modality given the complexity of videos (the addition of the temporal dimension) as well as the limited scale of publicly available datasets. We introduce GANcMRI, a generative adversarial network that can synthesize cMRI videos with physiological guidance based on latent space prompting. GANcMRI uses a StyleGAN framework to learn the latent space from individual video frames and leverages the timedependent trajectory between end-systolic and end-diastolic frames in the latent space to predict progression and generate motion over time. We proposed various methods for modeling latent time-dependent trajectories and found that our Frame-to-frame approach generates the best motion and video quality. GANcMRI generated high-quality cMRI image frames that are indistinguishable by cardiologists, however, artifacts in video generation allow cardiologists to still recognize the difference between real and generated videos. The generated cMRI videos can be prompted to apply physiologybased adjustments which produces clinically relevant phenotypes recognizable by cardiologists. GANcMRI has many potential applications such as data augmentation, education, anomaly detection, and preoperative planning.

生成式人工智能可应用于医学成像任务,如保护隐私的图像生成以及现有图像的超分辨率和去噪。鉴于视频的复杂性(增加了时间维度)以及公开可用数据集的规模有限,此前很少有方法将心脏磁共振成像(cMRI)作为一种模式。我们介绍的 GANcMRI 是一种生成式对抗网络,它可以根据潜在空间提示合成具有生理指导的 cMRI 视频。GANcMRI 使用 StyleGAN 框架从单个视频帧中学习潜空间,并利用潜空间中收缩末期和舒张末期帧之间与时间相关的轨迹来预测进展并随时间产生运动。我们提出了多种方法来模拟潜在的随时间变化的轨迹,结果发现我们的 "帧到帧 "方法生成的运动和视频质量最好。GANcMRI 生成的高质量 cMRI 图像帧是心脏病专家无法分辨的,但是,视频生成过程中的伪影仍能让心脏病专家识别出真实视频和生成视频之间的差异。生成的 cMRI 视频可提示应用基于生理学的调整,从而产生心脏病专家可识别的临床相关表型。GANcMRI 有许多潜在应用,如数据增强、教育、异常检测和术前规划。
{"title":"GANcMRI: Cardiac magnetic resonance video generation and physiologic guidance using latent space prompting.","authors":"Milos Vukadinovic, Alan C Kwan, Debiao Li, David Ouyang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Generative artificial intelligence can be applied to medical imaging on tasks such as privacy-preserving image generation and superresolution and denoising of existing images. Few prior approaches have used cardiac magnetic resonance imaging (cMRI) as a modality given the complexity of videos (the addition of the temporal dimension) as well as the limited scale of publicly available datasets. We introduce GANcMRI, a generative adversarial network that can synthesize cMRI videos with physiological guidance based on latent space prompting. GANcMRI uses a StyleGAN framework to learn the latent space from individual video frames and leverages the timedependent trajectory between end-systolic and end-diastolic frames in the latent space to predict progression and generate motion over time. We proposed various methods for modeling latent time-dependent trajectories and found that our Frame-to-frame approach generates the best motion and video quality. GANcMRI generated high-quality cMRI image frames that are indistinguishable by cardiologists, however, artifacts in video generation allow cardiologists to still recognize the difference between real and generated videos. The generated cMRI videos can be prompted to apply physiologybased adjustments which produces clinically relevant phenotypes recognizable by cardiologists. GANcMRI has many potential applications such as data augmentation, education, anomaly detection, and preoperative planning.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"225 ","pages":"594-606"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10783442/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139426220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diffusion Models To Predict 3D Late Mechanical Activation From Sparse 2D Cardiac MRIs. 从稀疏的二维心脏磁共振成像预测三维晚期机械激活的扩散模型
Nivetha Jayakumar, Jiarui Xing, Tonmoy Hossain, Fred Epstein, Kenneth Bilchick, Miaomiao Zhang

Identifying regions of late mechanical activation (LMA) of the left ventricular (LV) myocardium is critical in determining the optimal pacing site for cardiac resynchronization therapy in patients with heart failure. Several deep learning-based approaches have been developed to predict 3D LMA maps of LV myocardium from a stack of sparse 2D cardiac magnetic resonance imaging (MRIs). However, these models often loosely consider the geometric shape structure of the myocardium. This makes the reconstructed activation maps suboptimal; hence leading to a reduced accuracy of predicting the late activating regions of hearts. In this paper, we propose to use shape-constrained diffusion models to better reconstruct a 3D LMA map, given a limited number of 2D cardiac MRI slices. In contrast to previous methods that primarily rely on spatial correlations of image intensities for 3D reconstruction, our model leverages object shape as priors learned from the training data to guide the reconstruction process. To achieve this, we develop a joint learning network that simultaneously learns a mean shape under deformation models. Each reconstructed image is then considered as a deformed variant of the mean shape. To validate the performance of our model, we train and test the proposed framework on a publicly available mesh dataset of 3D myocardium and compare it with state-of-the-art deep learning-based reconstruction models. Experimental results show that our model achieves superior performance in reconstructing the 3D LMA maps as compared to the state-of-the-art models.

识别左心室(LV)心肌的晚期机械激活(LMA)区域对于确定心力衰竭患者心脏再同步疗法的最佳起搏部位至关重要。目前已开发出几种基于深度学习的方法,可从一叠稀疏的二维心脏磁共振成像(MRI)中预测左心室心肌的三维 LMA 图。然而,这些模型通常没有考虑心肌的几何形状结构。这使得重建的激活图不够理想,从而降低了预测心脏晚期激活区域的准确性。在本文中,我们建议在二维心脏磁共振成像切片数量有限的情况下,使用形状约束扩散模型来更好地重建三维 LMA 图。与以往主要依靠图像强度的空间相关性进行三维重建的方法不同,我们的模型利用从训练数据中学到的物体形状作为先验来指导重建过程。为此,我们开发了一个联合学习网络,同时学习变形模型下的平均形状。然后,每个重建图像都被视为平均形状的变形变体。为了验证我们模型的性能,我们在一个公开的三维心肌网状数据集上对所提出的框架进行了训练和测试,并将其与最先进的基于深度学习的重建模型进行了比较。实验结果表明,与最先进的模型相比,我们的模型在重建三维 LMA 图方面表现出色。
{"title":"Diffusion Models To Predict 3D Late Mechanical Activation From Sparse 2D Cardiac MRIs.","authors":"Nivetha Jayakumar, Jiarui Xing, Tonmoy Hossain, Fred Epstein, Kenneth Bilchick, Miaomiao Zhang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Identifying regions of late mechanical activation (LMA) of the left ventricular (LV) myocardium is critical in determining the optimal pacing site for cardiac resynchronization therapy in patients with heart failure. Several deep learning-based approaches have been developed to predict 3D LMA maps of LV myocardium from a stack of sparse 2D cardiac magnetic resonance imaging (MRIs). However, these models often loosely consider the geometric shape structure of the myocardium. This makes the reconstructed activation maps suboptimal; hence leading to a reduced accuracy of predicting the late activating regions of hearts. In this paper, we propose to use shape-constrained diffusion models to better reconstruct a 3D LMA map, given a limited number of 2D cardiac MRI slices. In contrast to previous methods that primarily rely on spatial correlations of image intensities for 3D reconstruction, our model leverages object shape as priors learned from the training data to guide the reconstruction process. To achieve this, we develop a joint learning network that simultaneously learns a mean shape under deformation models. Each reconstructed image is then considered as a deformed variant of the mean shape. To validate the performance of our model, we train and test the proposed framework on a publicly available mesh dataset of 3D myocardium and compare it with state-of-the-art deep learning-based reconstruction models. Experimental results show that our model achieves superior performance in reconstructing the 3D LMA maps as compared to the state-of-the-art models.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"225 ","pages":"190-200"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10958778/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140208466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Meta-Evaluation of Faithfulness Metrics for Long-Form Hospital-Course Summarization. 长篇医院病历摘要忠实度指标的元评价。
Griffin Adams, Jason Zucker, Noémie Elhadad

Long-form clinical summarization of hospital admissions has real-world significance because of its potential to help both clinicians and patients. The factual consistency of summaries-their faithfulness-is critical to their safe usage in clinical settings. To better understand the limitations of state-of-the-art natural language processing (NLP) systems, as well as the suitability of existing evaluation metrics, we benchmark faithfulness metrics against fine-grained human annotations for model-generated summaries of a patient's Brief Hospital Course. We create a corpus of patient hospital admissions and summaries for a cohort of HIV patients, each with complex medical histories. Annotators are presented with summaries and source notes, and asked to categorize manually highlighted summary elements (clinical entities like conditions and medications as well as actions like "following up") into one of three categories: "Incorrect," "Missing," and "Not in Notes." We meta-evaluate a broad set of faithfulness metrics-proposed for the general NLP domain-by measuring the correlation of metric scores to clinician ratings. Across metrics, we explore the importance of domain adaptation (e.g. the impact of in-domain pre-training and metric fine-tuning), the use of source-summary alignments, and the effects of distilling a single metric from an ensemble. We find that off-the-shelf metrics with no exposure to clinical text correlate well to clinician ratings yet overly rely on copy-and-pasted text. As a practical guide, we observe that most metrics correlate best to clinicians when provided with one summary sentence at a time and a minimal set of supporting sentences from the notes before discharge.

对住院病例进行长篇临床总结具有现实意义,因为它可以帮助临床医生和患者。摘要的事实一致性--即其忠实性--对其在临床环境中的安全使用至关重要。为了更好地了解最先进的自然语言处理(NLP)系统的局限性以及现有评估指标的适用性,我们针对模型生成的患者住院病程摘要的细粒度人工注释,制定了忠实度指标基准。我们创建了一个患者入院病历和摘要语料库,其中包含了一批艾滋病患者,每个人都有复杂的病史。我们向注释者展示了摘要和源注释,并要求他们将手动突出显示的摘要元素(如病情和药物等临床实体以及 "随访 "等操作)归入三个类别之一:"不正确"、"缺失 "和 "不在注释中"。通过衡量指标得分与临床医生评分的相关性,我们对为一般 NLP 领域提出的一系列广泛的忠实度指标进行了元评估。在各种度量标准中,我们探讨了领域适应性的重要性(例如,领域内预训练和度量标准微调的影响)、源摘要排列的使用以及从组合中提炼单一度量标准的效果。我们发现,没有接触过临床文本的现成度量标准与临床医生的评分有很好的相关性,但却过度依赖复制粘贴的文本。作为实用指南,我们观察到,如果每次只向临床医生提供一个摘要句子和出院前病历中最基本的辅助句子集,大多数指标与临床医生的相关性最佳。
{"title":"A Meta-Evaluation of Faithfulness Metrics for Long-Form Hospital-Course Summarization.","authors":"Griffin Adams, Jason Zucker, Noémie Elhadad","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Long-form clinical summarization of hospital admissions has real-world significance because of its potential to help both clinicians and patients. The factual consistency of summaries-their faithfulness-is critical to their safe usage in clinical settings. To better understand the limitations of state-of-the-art natural language processing (NLP) systems, as well as the suitability of existing evaluation metrics, we benchmark faithfulness metrics against fine-grained human annotations for model-generated summaries of a patient's Brief Hospital Course. We create a corpus of patient hospital admissions and summaries for a cohort of HIV patients, each with complex medical histories. Annotators are presented with summaries and source notes, and asked to categorize manually highlighted summary elements (clinical entities like conditions and medications as well as actions like \"following up\") into one of three categories: \"Incorrect,\" \"Missing,\" and \"Not in Notes.\" We meta-evaluate a broad set of faithfulness metrics-proposed for the general NLP domain-by measuring the correlation of metric scores to clinician ratings. Across metrics, we explore the importance of domain adaptation (e.g. the impact of in-domain pre-training and metric fine-tuning), the use of source-summary alignments, and the effects of distilling a single metric from an ensemble. We find that off-the-shelf metrics with no exposure to clinical text correlate well to clinician ratings yet overly rely on copy-and-pasted text. As a practical guide, we observe that most metrics correlate best to clinicians when provided with one summary sentence at a time and a minimal set of supporting sentences from the notes before discharge.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"219 ","pages":"2-30"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11441639/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142334002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting Heart Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance Learning. 通过监督注意力多实例学习从多视角超声波图像中检测心脏病
Zhe Huang, Benjamin S Wessler, Michael C Hughes

Aortic stenosis (AS) is a degenerative valve condition that causes substantial morbidity and mortality. This condition is under-diagnosed and under-treated. In clinical practice, AS is diagnosed with expert review of transthoracic echocardiography, which produces dozens of ultrasound images of the heart. Only some of these views show the aortic valve. To automate screening for AS, deep networks must learn to mimic a human expert's ability to identify views of the aortic valve then aggregate across these relevant images to produce a study-level diagnosis. We find previous approaches to AS detection yield insufficient accuracy due to relying on inflexible averages across images. We further find that off-the-shelf attention-based multiple instance learning (MIL) performs poorly. We contribute a new end-to-end MIL approach with two key methodological innovations. First, a supervised attention technique guides the learned attention mechanism to favor relevant views. Second, a novel self-supervised pretraining strategy applies contrastive learning on the representation of the whole study instead of individual images as commonly done in prior literature. Experiments on an open-access dataset and a temporally-external heldout set show that our approach yields higher accuracy while reducing model size.

主动脉瓣狭窄(AS)是一种瓣膜退行性病变,会导致严重的发病率和死亡率。这种疾病诊断不足,治疗不足。在临床实践中,主动脉瓣狭窄是通过专家对经胸超声心动图的检查来诊断的。这些图像中只有部分能显示主动脉瓣。要实现强直性脊柱炎的自动筛查,深度网络必须学会模仿人类专家识别主动脉瓣视图的能力,然后汇总这些相关图像,得出研究级别的诊断结果。我们发现,以往的强直性脊柱炎检测方法由于依赖于图像间不灵活的平均值,因此准确性不足。我们还发现,现成的基于注意力的多实例学习(MIL)效果不佳。我们提出了一种新的端到端 MIL 方法,并在方法上进行了两项关键创新。首先,监督注意力技术会引导学习到的注意力机制偏向相关视图。其次,一种新颖的自监督预训练策略将对比学习应用于整个研究的表征上,而不是之前文献中常见的单个图像。在开放访问数据集和时间外部保留集上进行的实验表明,我们的方法在降低模型大小的同时,还能获得更高的准确性。
{"title":"Detecting Heart Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance Learning.","authors":"Zhe Huang, Benjamin S Wessler, Michael C Hughes","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Aortic stenosis (AS) is a degenerative valve condition that causes substantial morbidity and mortality. This condition is under-diagnosed and under-treated. In clinical practice, AS is diagnosed with expert review of transthoracic echocardiography, which produces dozens of ultrasound images of the heart. Only some of these views show the aortic valve. To automate screening for AS, deep networks must learn to mimic a human expert's ability to identify views of the aortic valve then aggregate across these relevant images to produce a study-level diagnosis. We find previous approaches to AS detection yield insufficient accuracy due to relying on inflexible averages across images. We further find that off-the-shelf attention-based multiple instance learning (MIL) performs poorly. We contribute a new end-to-end MIL approach with two key methodological innovations. First, a supervised attention technique guides the learned attention mechanism to favor relevant views. Second, a novel self-supervised pretraining strategy applies contrastive learning on the representation of the whole study instead of individual images as commonly done in prior literature. Experiments on an open-access dataset and a temporally-external heldout set show that our approach yields higher accuracy while reducing model size.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"219 ","pages":"285-307"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10923076/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140095343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fairness-Aware Class Imbalanced Learning on Multiple Subgroups. 多个子群上的公平感知类不平衡学习
Davoud Ataee Tarzanagh, Bojian Hou, Boning Tong, Qi Long, Li Shen

We present a novel Bayesian-based optimization framework that addresses the challenge of generalization in overparameterized models when dealing with imbalanced subgroups and limited samples per subgroup. Our proposed tri-level optimization framework utilizes local predictors, which are trained on a small amount of data, as well as a fair and class-balanced predictor at the middle and lower levels. To effectively overcome saddle points for minority classes, our lower-level formulation incorporates sharpness-aware minimization. Meanwhile, at the upper level, the framework dynamically adjusts the loss function based on validation loss, ensuring a close alignment between the global predictor and local predictors. Theoretical analysis demonstrates the framework's ability to enhance classification and fairness generalization, potentially resulting in improvements in the generalization bound. Empirical results validate the superior performance of our tri-level framework compared to existing state-of-the-art approaches. The source code can be found at https://github.com/PennShenLab/FACIMS.

我们提出了一种新颖的基于贝叶斯的优化框架,以解决在处理不平衡子群和每个子群样本有限的情况下,过参数化模型的泛化难题。我们提出的三层优化框架利用了在少量数据基础上训练的局部预测器,以及中层和低层的公平和类平衡预测器。为了有效克服少数群体的鞍点问题,我们的低层次方案采用了锐度感知最小化。同时,在上层,该框架根据验证损失动态调整损失函数,确保全局预测器和局部预测器之间的紧密配合。理论分析表明,该框架能够增强分类和公平泛化能力,从而有可能改善泛化边界。实证结果验证了与现有的最先进方法相比,我们的三层框架具有更优越的性能。源代码见 https://github.com/PennShenLab/FACIMS。
{"title":"Fairness-Aware Class Imbalanced Learning on Multiple Subgroups.","authors":"Davoud Ataee Tarzanagh, Bojian Hou, Boning Tong, Qi Long, Li Shen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>We present a novel Bayesian-based optimization framework that addresses the challenge of generalization in overparameterized models when dealing with imbalanced subgroups and limited samples per subgroup. Our proposed tri-level optimization framework utilizes <i>local</i> predictors, which are trained on a small amount of data, as well as a fair and class-balanced predictor at the middle and lower levels. To effectively overcome saddle points for minority classes, our lower-level formulation incorporates sharpness-aware minimization. Meanwhile, at the upper level, the framework dynamically adjusts the loss function based on validation loss, ensuring a close alignment between the <i>global</i> predictor and local predictors. Theoretical analysis demonstrates the framework's ability to enhance classification and fairness generalization, potentially resulting in improvements in the generalization bound. Empirical results validate the superior performance of our tri-level framework compared to existing state-of-the-art approaches. The source code can be found at https://github.com/PennShenLab/FACIMS.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"216 ","pages":"2123-2133"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11003754/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140857599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-modal Differentiable Unsupervised Feature Selection. 多模态可微无监督特征选择。
Junchen Yang, Ofir Lindenbaum, Yuval Kluger, Ariel Jaffe

Multi-modal high throughput biological data presents a great scientific opportunity and a significant computational challenge. In multi-modal measurements, every sample is observed simultaneously by two or more sets of sensors. In such settings, many observed variables in both modalities are often nuisance and do not carry information about the phenomenon of interest. Here, we propose a multi-modal unsupervised feature selection framework: identifying informative variables based on coupled high-dimensional measurements. Our method is designed to identify features associated with two types of latent low-dimensional structures: (i) shared structures that govern the observations in both modalities, and (ii) differential structures that appear in only one modality. To that end, we propose two Laplacian-based scoring operators. We incorporate the scores with differentiable gates that mask nuisance features and enhance the accuracy of the structure captured by the graph Laplacian. The performance of the new scheme is illustrated using synthetic and real datasets, including an extended biological application to single-cell multi-omics.

多模态高通量生物数据提供了巨大的科学机遇和重大的计算挑战。在多模态测量中,每个样品由两组或多组传感器同时观测。在这种情况下,两种模式下的许多观察变量通常都是令人讨厌的,并且不携带有关感兴趣现象的信息。在此,我们提出了一个多模态无监督特征选择框架:基于耦合高维测量识别信息变量。我们的方法旨在识别与两种潜在低维结构相关的特征:(i)在两种模态下控制观测的共享结构,以及(ii)仅在一种模态中出现的差异结构。为此,我们提出了两个基于拉普拉斯的评分算子。我们将分数与可微门结合起来,这些可微门掩盖了讨厌的特征,并提高了图拉普拉斯函数捕获的结构的准确性。使用合成和真实数据集说明了新方案的性能,包括扩展到单细胞多组学的生物学应用。
{"title":"Multi-modal Differentiable Unsupervised Feature Selection.","authors":"Junchen Yang, Ofir Lindenbaum, Yuval Kluger, Ariel Jaffe","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Multi-modal high throughput biological data presents a great scientific opportunity and a significant computational challenge. In multi-modal measurements, every sample is observed simultaneously by two or more sets of sensors. In such settings, many observed variables in both modalities are often nuisance and do not carry information about the phenomenon of interest. Here, we propose a multi-modal unsupervised feature selection framework: identifying informative variables based on coupled high-dimensional measurements. Our method is designed to identify features associated with two types of latent low-dimensional structures: (i) shared structures that govern the observations in both modalities, and (ii) differential structures that appear in only one modality. To that end, we propose two Laplacian-based scoring operators. We incorporate the scores with differentiable gates that mask nuisance features and enhance the accuracy of the structure captured by the graph Laplacian. The performance of the new scheme is illustrated using synthetic and real datasets, including an extended biological application to single-cell multi-omics.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"216 ","pages":"2400-2410"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12859758/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146108622","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Maximum Likelihood Estimation of Flexible Survival Densities with Importance Sampling. 利用重要性采样对灵活生存密度进行最大似然估计
Mert Ketenci, Shreyas Bhave, Noémie Elhadad, Adler Perotte

Survival analysis is a widely-used technique for analyzing time-to-event data in the presence of censoring. In recent years, numerous survival analysis methods have emerged which scale to large datasets and relax traditional assumptions such as proportional hazards. These models, while being performant, are very sensitive to model hyperparameters including: (1) number of bins and bin size for discrete models and (2) number of cluster assignments for mixture-based models. Each of these choices requires extensive tuning by practitioners to achieve optimal performance. In addition, we demonstrate in empirical studies that: (1) optimal bin size may drastically differ based on the metric of interest (e.g., concordance vs brier score), and (2) mixture models may suffer from mode collapse and numerical instability. We propose a survival analysis approach which eliminates the need to tune hyperparameters such as mixture assignments and bin sizes, reducing the burden on practitioners. We show that the proposed approach matches or outperforms baselines on several real-world datasets.

生存分析是一种广泛使用的技术,用于分析存在剔除的时间到事件数据。近年来,出现了许多生存分析方法,这些方法可扩展到大型数据集,并放宽了比例危险等传统假设。这些模型虽然性能优越,但对模型超参数非常敏感,包括:(1) 离散模型的箱数和箱大小;(2) 基于混合模型的聚类分配数。这些选择中的每一个都需要实践者进行大量的调整才能达到最佳性能。此外,我们还通过实证研究证明了以下几点:(1) 最佳分仓大小可能会因相关指标(如一致性与布赖尔得分)的不同而大相径庭,(2) 混合物模型可能会出现模式崩溃和数值不稳定性。我们提出的生存分析方法无需调整混合分配和分仓大小等超参数,从而减轻了从业人员的负担。我们的研究表明,在几个真实世界数据集上,我们提出的方法与基线相匹配,甚至优于基线。
{"title":"Maximum Likelihood Estimation of Flexible Survival Densities with Importance Sampling.","authors":"Mert Ketenci, Shreyas Bhave, Noémie Elhadad, Adler Perotte","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Survival analysis is a widely-used technique for analyzing time-to-event data in the presence of censoring. In recent years, numerous survival analysis methods have emerged which scale to large datasets and relax traditional assumptions such as proportional hazards. These models, while being performant, are very sensitive to model hyperparameters including: (1) number of bins and bin size for discrete models and (2) number of cluster assignments for mixture-based models. Each of these choices requires extensive tuning by practitioners to achieve optimal performance. In addition, we demonstrate in empirical studies that: (1) optimal bin size may drastically differ based on the metric of interest (e.g., concordance vs brier score), and (2) mixture models may suffer from mode collapse and numerical instability. We propose a survival analysis approach which eliminates the need to tune hyperparameters such as mixture assignments and bin sizes, reducing the burden on practitioners. We show that the proposed approach matches or outperforms baselines on several real-world datasets.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"219 ","pages":"360-380"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11441640/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142334003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bringing At-home Pediatric Sleep Apnea Testing Closer to Reality: A Multi-modal Transformer Approach. 让居家小儿睡眠呼吸暂停检测更接近现实:多模式变压器方法。
Hamed Fayyaz, Abigail Strang, Rahmatollah Beheshti

Sleep apnea in children is a major health problem affecting one to five percent of children (in the US). If not treated in a timely manner, it can also lead to other physical and mental health issues. Pediatric sleep apnea has different clinical causes and characteristics than adults. Despite a large group of studies dedicated to studying adult apnea, pediatric sleep apnea has been studied in a much less limited fashion. Relatedly, at-home sleep apnea testing tools and algorithmic methods for automatic detection of sleep apnea are widely present for adults, but not children. In this study, we target this gap by presenting a machine learning-based model for detecting apnea events from commonly collected sleep signals. We show that our method outperforms state-of-the-art methods across two public datasets, as determined by the F1-score and AUROC measures. Additionally, we show that using two of the signals that are easier to collect at home (ECG and SpO2) can also achieve very competitive results, potentially addressing the concerns about collecting various sleep signals from children outside the clinic. Therefore, our study can greatly inform ongoing progress toward increasing the accessibility of pediatric sleep apnea testing and improving the timeliness of the treatment interventions.

儿童睡眠呼吸暂停是一个重大的健康问题,影响着 1% 到 5% 的儿童(在美国)。如果不及时治疗,还可能导致其他身心健康问题。小儿睡眠呼吸暂停的临床原因和特点与成人不同。尽管有大量研究致力于研究成人呼吸暂停,但对小儿睡眠呼吸暂停的研究却少得多。与此相关的是,用于自动检测睡眠呼吸暂停的家用睡眠呼吸暂停测试工具和算法方法广泛应用于成人,但儿童却没有。在本研究中,我们针对这一空白,提出了一种基于机器学习的模型,用于从通常收集的睡眠信号中检测呼吸暂停事件。根据 F1 分数和 AUROC 指标,我们的方法在两个公共数据集上的表现优于最先进的方法。此外,我们还表明,使用两种在家中更容易收集的信号(ECG 和 SpO2)也能获得非常有竞争力的结果,从而有可能解决在诊所外收集儿童各种睡眠信号的问题。因此,我们的研究可以为不断提高小儿睡眠呼吸暂停检测的可及性和改善治疗干预的及时性提供重要信息。
{"title":"Bringing At-home Pediatric Sleep Apnea Testing Closer to Reality: A Multi-modal Transformer Approach.","authors":"Hamed Fayyaz, Abigail Strang, Rahmatollah Beheshti","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Sleep apnea in children is a major health problem affecting one to five percent of children (in the US). If not treated in a timely manner, it can also lead to other physical and mental health issues. Pediatric sleep apnea has different clinical causes and characteristics than adults. Despite a large group of studies dedicated to studying adult apnea, pediatric sleep apnea has been studied in a much less limited fashion. Relatedly, at-home sleep apnea testing tools and algorithmic methods for automatic detection of sleep apnea are widely present for adults, but not children. In this study, we target this gap by presenting a machine learning-based model for detecting apnea events from commonly collected sleep signals. We show that our method outperforms state-of-the-art methods across two public datasets, as determined by the F1-score and AUROC measures. Additionally, we show that using two of the signals that are easier to collect at home (ECG and SpO<sub>2</sub>) can also achieve very competitive results, potentially addressing the concerns about collecting various sleep signals from children outside the clinic. Therefore, our study can greatly inform ongoing progress toward increasing the accessibility of pediatric sleep apnea testing and improving the timeliness of the treatment interventions.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"219 ","pages":"167-185"},"PeriodicalIF":0.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10854997/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139725300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of machine learning research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1