首页 > 最新文献

Proceedings of machine learning research最新文献

英文 中文
Contextual Bandits with Budgeted Information Reveal. 有预算信息揭示的情境大盗。
Kyra Gan, Esmaeil Keyvanshokooh, Xueqing Liu, Susan Murphy

Contextual bandit algorithms are commonly used in digital health to recommend personalized treatments. However, to ensure the effectiveness of the treatments, patients are often requested to take actions that have no immediate benefit to them, which we refer to as pro-treatment actions. In practice, clinicians have a limited budget to encourage patients to take these actions and collect additional information. We introduce a novel optimization and learning algorithm to address this problem. This algorithm effectively combines the strengths of two algorithmic approaches in a seamless manner, including 1) an online primal-dual algorithm for deciding the optimal timing to reach out to patients, and 2) a contextual bandit learning algorithm to deliver personalized treatment to the patient. We prove that this algorithm admits a sub-linear regret bound. We illustrate the usefulness of this algorithm on both synthetic and real-world data.

数字医疗领域通常使用情境强盗算法来推荐个性化治疗方案。然而,为了确保治疗的有效性,患者往往会被要求采取对他们没有直接益处的行动,我们称之为支持治疗行动。在实践中,临床医生的预算有限,无法鼓励患者采取这些行动并收集更多信息。我们引入了一种新颖的优化和学习算法来解决这一问题。该算法有效地将两种算法方法的优势完美地结合在一起,包括:1)在线原始二元算法,用于决定接触患者的最佳时机;2)情境强盗学习算法,用于向患者提供个性化治疗。我们证明了这种算法具有亚线性遗憾约束。我们在合成数据和真实世界数据上说明了该算法的实用性。
{"title":"Contextual Bandits with Budgeted Information Reveal.","authors":"Kyra Gan, Esmaeil Keyvanshokooh, Xueqing Liu, Susan Murphy","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Contextual bandit algorithms are commonly used in digital health to recommend personalized treatments. However, to ensure the effectiveness of the treatments, patients are often requested to take actions that have no immediate benefit to them, which we refer to as <i>pro-treatment</i> actions. In practice, clinicians have a limited budget to encourage patients to take these actions and collect additional information. We introduce a novel optimization and learning algorithm to address this problem. This algorithm effectively combines the strengths of two algorithmic approaches in a seamless manner, including 1) an online primal-dual algorithm for deciding the optimal timing to reach out to patients, and 2) a contextual bandit learning algorithm to deliver personalized treatment to the patient. We prove that this algorithm admits a sub-linear regret bound. We illustrate the usefulness of this algorithm on both synthetic and real-world data.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"238 ","pages":"3970-3978"},"PeriodicalIF":0.0,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11503011/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142514329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
BOBA: Byzantine-Robust Federated Learning with Label Skewness. 带有标签偏度的拜占庭鲁棒联邦学习。
Wenxuan Bao, Jun Wu, Jingrui He

In federated learning, most existing robust aggregation rules (AGRs) combat Byzantine attacks in the IID setting, where client data is assumed to be independent and identically distributed. In this paper, we address label skewness, a more realistic and challenging non-IID setting, where each client only has access to a few classes of data. In this setting, state-of-the-art AGRs suffer from selection bias, leading to significant performance drop for particular classes; they are also more vulnerable to Byzantine attacks due to the increased variation among gradients of honest clients. To address these limitations, we propose an efficient two-stage method named BOBA. Theoretically, we prove the convergence of BOBA with an error of the optimal order. Our empirical evaluations demonstrate BOBA's superior unbiasedness and robustness across diverse models and datasets when compared to various baselines. Our code is available at https://github.com/baowenxuan/BOBA.

在联邦学习中,大多数现有的鲁棒聚合规则(agr)在IID设置中对抗拜占庭攻击,其中假定客户端数据是独立且相同分布的。在本文中,我们解决了标签偏度,这是一种更现实和更具挑战性的非iid设置,其中每个客户端只能访问几类数据。在这种情况下,最先进的agr受到选择偏差的影响,导致特定类别的性能显著下降;它们也更容易受到拜占庭攻击,因为诚实客户端的梯度变化越来越大。为了解决这些限制,我们提出了一种高效的两阶段方法——BOBA。从理论上证明了BOBA具有最优阶误差的收敛性。我们的实证评估表明,与各种基线相比,BOBA在不同模型和数据集上具有优越的无偏性和稳健性。我们的代码可在https://github.com/baowenxuan/BOBA上获得。
{"title":"BOBA: Byzantine-Robust Federated Learning with Label Skewness.","authors":"Wenxuan Bao, Jun Wu, Jingrui He","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>In federated learning, most existing robust aggregation rules (AGRs) combat Byzantine attacks in the IID setting, where client data is assumed to be independent and identically distributed. In this paper, we address label skewness, a more realistic and challenging non-IID setting, where each client only has access to a few classes of data. In this setting, state-of-the-art AGRs suffer from selection bias, leading to significant performance drop for particular classes; they are also more vulnerable to Byzantine attacks due to the increased variation among gradients of honest clients. To address these limitations, we propose an efficient two-stage method named <i>BOBA</i>. Theoretically, we prove the convergence of BOBA with an error of the optimal order. Our empirical evaluations demonstrate BOBA's superior unbiasedness and robustness across diverse models and datasets when compared to various baselines. Our code is available at https://github.com/baowenxuan/BOBA.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"1 ","pages":"892-900"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12403066/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144994647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
E(3) × SO(3)-Equivariant Networks for Spherical Deconvolution in Diffusion MRI. 用于扩散核磁共振成像中球形解卷积的 E(3) × SO(3) - Equivariant 网络
Axel Elaldi, Guido Gerig, Neel Dey

We present Roto-Translation Equivariant Spherical Deconvolution (RT-ESD), an E(3)×SO(3) equivariant framework for sparse deconvolution of volumes where each voxel contains a spherical signal. Such 6D data naturally arises in diffusion MRI (dMRI), a medical imaging modality widely used to measure microstructure and structural connectivity. As each dMRI voxel is typically a mixture of various overlapping structures, there is a need for blind deconvolution to recover crossing anatomical structures such as white matter tracts. Existing dMRI work takes either an iterative or deep learning approach to sparse spherical deconvolution, yet it typically does not account for relationships between neighboring measurements. This work constructs equivariant deep learning layers which respect to symmetries of spatial rotations, reflections, and translations, alongside the symmetries of voxelwise spherical rotations. As a result, RT-ESD improves on previous work across several tasks including fiber recovery on the DiSCo dataset, deconvolution-derived partial volume estimation on real-world in vivo human brain dMRI, and improved downstream reconstruction of fiber tractograms on the Tractometer dataset. Our implementation is available at https://github.com/AxelElaldi/e3so3_conv.

我们提出了旋转-平移等变球形解卷积(RT-ESD),这是一种 E(3)×SO(3) 等变框架,用于对每个体素都包含球形信号的体积进行稀疏解卷积。这种 6D 数据自然出现在弥散核磁共振成像(dMRI)中,这是一种广泛用于测量微观结构和结构连接性的医学成像模式。由于每个 dMRI 象素通常是各种重叠结构的混合物,因此需要进行盲解卷以恢复交叉解剖结构,如白质束。现有的 dMRI 研究采用迭代或深度学习方法进行稀疏球形去卷积,但通常不会考虑相邻测量值之间的关系。这项工作构建了等变深度学习层,在尊重空间旋转、反射和平移对称性的同时,也尊重体素球面旋转的对称性。因此,RT-ESD 在多个任务上都比以前的工作有所改进,包括 DiSCo 数据集上的纤维恢复、真实世界活体人脑 dMRI 上的去卷积衍生部分体积估计,以及 Tractometer 数据集上纤维束图的改进下游重建。我们的实施方案可在 https://github.com/AxelElaldi/e3so3_conv 上查阅。
{"title":"E(3) × SO(3)-Equivariant Networks for Spherical Deconvolution in Diffusion MRI.","authors":"Axel Elaldi, Guido Gerig, Neel Dey","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>We present Roto-Translation Equivariant Spherical Deconvolution (RT-ESD), an <math><mi>E</mi><mo>(</mo><mn>3</mn><mo>)</mo><mo>×</mo><mi>S</mi><mi>O</mi><mo>(</mo><mn>3</mn><mo>)</mo></math> equivariant framework for sparse deconvolution of volumes where each voxel contains a spherical signal. Such 6D data naturally arises in diffusion MRI (dMRI), a medical imaging modality widely used to measure microstructure and structural connectivity. As each dMRI voxel is typically a mixture of various overlapping structures, there is a need for blind deconvolution to recover crossing anatomical structures such as white matter tracts. Existing dMRI work takes either an iterative or deep learning approach to sparse spherical deconvolution, yet it typically does not account for relationships between neighboring measurements. This work constructs equivariant deep learning layers which respect to symmetries of spatial rotations, reflections, and translations, alongside the symmetries of voxelwise spherical rotations. As a result, RT-ESD improves on previous work across several tasks including fiber recovery on the DiSCo dataset, deconvolution-derived partial volume estimation on real-world <i>in vivo</i> human brain dMRI, and improved downstream reconstruction of fiber tractograms on the Tractometer dataset. Our implementation is available at https://github.com/AxelElaldi/e3so3_conv.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"227 ","pages":"301-319"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10901527/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139991995","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization. 缩小差距:鲁棒性和标准通用性中的拉德马赫复杂性。
Jiancong Xiao, Ruoyu Sun, Qi Long, Weijie J Su

Training Deep Neural Networks (DNNs) with adversarial examples often results in poor generalization to test-time adversarial data. This paper investigates this issue, known as adversarially robust generalization, through the lens of Rademacher complexity. Building upon the studies by Khim and Loh (2018); Yin et al. (2019), numerous works have been dedicated to this problem, yet achieving a satisfactory bound remains an elusive goal. Existing works on DNNs either apply to a surrogate loss instead of the robust loss or yield bounds that are notably looser compared to their standard counterparts. In the latter case, the bounds have a higher dependency on the width m of the DNNs or the dimension d of the data, with an extra factor of at least 𝒪 ( m ) or 𝒪 ( d ) . This paper presents upper bounds for adversarial Rademacher complexity of DNNs that match the best-known upper bounds in standard settings, as established in the work of Bartlett et al. (2017), with the dependency on width and dimension being 𝒪 ( ln ( d m ) ) . The central challenge addressed is calculating the covering number of adversarial function classes. We aim to construct a new cover that possesses two properties: 1) compatibility with adversarial examples, and 2) precision comparable to covers used in standard settings. To this end, we introduce a new variant of covering number called the uniform covering number, specifically designed and proven to reconcile these two properties. Consequently, our method effectively bridges the gap between Rademacher complexity in robust and standard generalization.

用对抗性示例训练深度神经网络(DNN)往往会导致对测试时对抗性数据的泛化效果不佳。本文通过拉德马赫复杂性的视角研究了这一问题,即所谓的对抗性鲁棒泛化(adversarially robust generalization)。在 Khim 和 Loh(2018 年)、Yin 等人(2019 年)的研究基础上,已有大量作品致力于解决这一问题,但要达到令人满意的界限仍是一个难以实现的目标。关于 DNN 的现有研究要么适用于替代损失而非稳健损失,要么产生的边界明显比标准边界宽松。在后一种情况下,边界对 DNNs 的宽度 m 或数据维度 d 有更高的依赖性,至少有 𝒪 ( m ) 或 𝒪 ( d ) 的额外系数。本文提出了 DNN 的对抗性拉德马赫复杂度上界,与 Bartlett 等人(2017)的研究中建立的标准设置中最著名的上界相匹配,对宽度和维度的依赖性为 𝒪 ( ln ( d m ) ) 。我们面临的核心挑战是计算对抗函数类的覆盖数。我们的目标是构建一个具有以下两个特性的新覆盖:1) 与对抗示例兼容,以及 2) 精度可与标准设置中使用的覆盖相媲美。为此,我们引入了一种新的覆盖数变体,称为统一覆盖数,它是为协调这两个特性而专门设计并经过验证的。因此,我们的方法有效地弥合了鲁棒性和标准泛函的拉德马赫复杂性之间的差距。
{"title":"Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization.","authors":"Jiancong Xiao, Ruoyu Sun, Qi Long, Weijie J Su","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Training Deep Neural Networks (DNNs) with adversarial examples often results in poor generalization to test-time adversarial data. This paper investigates this issue, known as adversarially robust generalization, through the lens of Rademacher complexity. Building upon the studies by Khim and Loh (2018); Yin et al. (2019), numerous works have been dedicated to this problem, yet achieving a satisfactory bound remains an elusive goal. Existing works on DNNs either apply to a surrogate loss instead of the robust loss or yield bounds that are notably looser compared to their standard counterparts. In the latter case, the bounds have a higher dependency on the width <math><mi>m</mi></math> of the DNNs or the dimension <math><mi>d</mi></math> of the data, with an extra factor of at least <math><mi>𝒪</mi> <mo>(</mo> <msqrt><mi>m</mi></msqrt> <mo>)</mo></math> or <math><mi>𝒪</mi> <mo>(</mo> <msqrt><mi>d</mi></msqrt> <mo>)</mo></math> . This paper presents upper bounds for adversarial Rademacher complexity of DNNs that match the best-known upper bounds in standard settings, as established in the work of Bartlett et al. (2017), with the dependency on width and dimension being <math><mi>𝒪</mi> <mo>(</mo> <mtext>ln</mtext> <mspace></mspace> <mo>(</mo> <mi>d</mi> <mi>m</mi> <mo>)</mo> <mo>)</mo></math> . The central challenge addressed is calculating the covering number of adversarial function classes. We aim to construct a new cover that possesses two properties: 1) compatibility with adversarial examples, and 2) precision comparable to covers used in standard settings. To this end, we introduce a new variant of covering number called the <i>uniform covering number</i>, specifically designed and proven to reconcile these two properties. Consequently, our method effectively bridges the gap between Rademacher complexity in robust and standard generalization.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"247 ","pages":"5074-5075"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11350389/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142115745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alleviating tiling effect by random walk sliding window in high-resolution histological whole slide image synthesis. 在高分辨率组织学整张切片图像合成中利用随机漫步滑动窗口缓解平铺效应
Shunxing Bao, Ho Hin Lee, Qi Yang, Lucas W Remedios, Ruining Deng, Can Cui, Leon Y Cai, Kaiwen Xu, Xin Yu, Sophie Chiron, Yike Li, Nathan Heath Patterson, Yaohong Wang, Jia Li, Qi Liu, Ken S Lau, Joseph T Roland, Lori A Coburn, Keith T Wilson, Bennett A Landman, Yuankai Huo

Multiplex immunofluorescence (MxIF) is an advanced molecular imaging technique that can simultaneously provide biologists with multiple (i.e., more than 20) molecular markers on a single histological tissue section. Unfortunately, due to imaging restrictions, the more routinely used hematoxylin and eosin (H&E) stain is typically unavailable with MxIF on the same tissue section. As biological H&E staining is not feasible, previous efforts have been made to obtain H&E whole slide image (WSI) from MxIF via deep learning empowered virtual staining. However, the tiling effect is a long-lasting problem in high-resolution WSI-wise synthesis. The MxIF to H&E synthesis is no exception. Limited by computational resources, the cross-stain image synthesis is typically performed at the patch-level. Thus, discontinuous intensities might be visually identified along with the patch boundaries assembling all individual patches back to a WSI. In this work, we propose a deep learning based unpaired high-resolution image synthesis method to obtain virtual H&E WSIs from MxIF WSIs (each with 27 markers/stains) with reduced tiling effects. Briefly, we first extend the CycleGAN framework by adding simultaneous nuclei and mucin segmentation supervision as spatial constraints. Then, we introduce a random walk sliding window shifting strategy during the optimized inference stage, to alleviate the tiling effects. The validation results show that our spatially constrained synthesis method achieves a 56% performance gain for the downstream cell segmentation task. The proposed inference method reduces the tiling effects by using 50% fewer computation resources without compromising performance. The proposed random sliding window inference method is a plug-and-play module, which can be generalized for other high-resolution WSI image synthesis applications. The source code with our proposed model are available at https://github.com/MASILab/RandomWalkSlidingWindow.git.

多重免疫荧光(MxIF)是一种先进的分子成像技术,可在单个组织切片上同时为生物学家提供多种(即 20 多种)分子标记。遗憾的是,由于成像限制,在同一组织切片上通常无法使用常规使用的苏木精和伊红(H&E)染色。由于生物 H&E 染色不可行,以前曾有人通过深度学习虚拟染色,从 MxIF 获取 H&E 全切片图像(WSI)。然而,平铺效应是高分辨率 WSI 合成中的一个长期问题。从 MxIF 到 H&E 的合成也不例外。受限于计算资源,交叉染色图像合成通常在斑块级进行。因此,在将所有单个斑块组装回 WSI 的过程中,可能会在视觉上识别出不连续的强度和斑块边界。在这项工作中,我们提出了一种基于深度学习的无配对高分辨率图像合成方法,以从 MxIF WSI(每个 WSI 有 27 个标记/污点)中获得虚拟 H&E WSI,并减少平铺效应。简而言之,我们首先扩展了 CycleGAN 框架,添加了同步的细胞核和粘蛋白分割监督作为空间约束。然后,我们在优化推理阶段引入了随机漫步滑动窗口移动策略,以减轻堆叠效应。验证结果表明,我们的空间约束合成方法在下游细胞分割任务中实现了 56% 的性能提升。所提出的推理方法在不影响性能的前提下减少了 50% 的计算资源,从而降低了平铺效应。所提出的随机滑动窗口推理方法是一个即插即用的模块,可以推广到其他高分辨率 WSI 图像合成应用中。我们提出的模型的源代码可在 https://github.com/MASILab/RandomWalkSlidingWindow.git 上获取。
{"title":"Alleviating tiling effect by random walk sliding window in high-resolution histological whole slide image synthesis.","authors":"Shunxing Bao, Ho Hin Lee, Qi Yang, Lucas W Remedios, Ruining Deng, Can Cui, Leon Y Cai, Kaiwen Xu, Xin Yu, Sophie Chiron, Yike Li, Nathan Heath Patterson, Yaohong Wang, Jia Li, Qi Liu, Ken S Lau, Joseph T Roland, Lori A Coburn, Keith T Wilson, Bennett A Landman, Yuankai Huo","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Multiplex immunofluorescence (MxIF) is an advanced molecular imaging technique that can simultaneously provide biologists with multiple (i.e., more than 20) molecular markers on a single histological tissue section. Unfortunately, due to imaging restrictions, the more routinely used hematoxylin and eosin (H&E) stain is typically unavailable with MxIF on the same tissue section. As biological H&E staining is not feasible, previous efforts have been made to obtain H&E whole slide image (WSI) from MxIF via deep learning empowered virtual staining. However, the tiling effect is a long-lasting problem in high-resolution WSI-wise synthesis. The MxIF to H&E synthesis is no exception. Limited by computational resources, the cross-stain image synthesis is typically performed at the patch-level. Thus, discontinuous intensities might be visually identified along with the patch boundaries assembling all individual patches back to a WSI. In this work, we propose a deep learning based unpaired high-resolution image synthesis method to obtain virtual H&E WSIs from MxIF WSIs (each with 27 markers/stains) with reduced tiling effects. Briefly, we first extend the CycleGAN framework by adding simultaneous nuclei and mucin segmentation supervision as spatial constraints. Then, we introduce a random walk sliding window shifting strategy during the optimized inference stage, to alleviate the tiling effects. The validation results show that our spatially constrained synthesis method achieves a 56% performance gain for the downstream cell segmentation task. The proposed inference method reduces the tiling effects by using 50% fewer computation resources without compromising performance. The proposed random sliding window inference method is a plug-and-play module, which can be generalized for other high-resolution WSI image synthesis applications. The source code with our proposed model are available at https://github.com/MASILab/RandomWalkSlidingWindow.git.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"227 ","pages":"1406-1422"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11238901/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141592304","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Consistent Deep Rigid MRI Motion Correction. 数据一致的深度刚性磁共振成像运动校正。
Nalini M Singh, Neel Dey, Malte Hoffmann, Bruce Fischl, Elfar Adalsteinsson, Robert Frost, Adrian V Dalca, Polina Golland

Motion artifacts are a pervasive problem in MRI, leading to misdiagnosis or mischaracterization in population-level imaging studies. Current retrospective rigid intra-slice motion correction techniques jointly optimize estimates of the image and the motion parameters. In this paper, we use a deep network to reduce the joint image-motion parameter search to a search over rigid motion parameters alone. Our network produces a reconstruction as a function of two inputs: corrupted k-space data and motion parameters. We train the network using simulated, motion-corrupted k-space data generated with known motion parameters. At test-time, we estimate unknown motion parameters by minimizing a data consistency loss between the motion parameters, the network-based image reconstruction given those parameters, and the acquired measurements. Intra-slice motion correction experiments on simulated and realistic 2D fast spin echo brain MRI achieve high reconstruction fidelity while providing the benefits of explicit data consistency optimization. Our code is publicly available at https://www.github.com/nalinimsingh/neuroMoCo.

运动伪影是核磁共振成像中普遍存在的问题,会导致群体成像研究中的误诊或错误定性。目前的回顾性刚性片内运动校正技术需要联合优化图像和运动参数的估计值。在本文中,我们使用深度网络将图像-运动参数联合搜索简化为仅对刚性运动参数进行搜索。我们的网络根据两个输入的函数生成重建结果:损坏的 k 空间数据和运动参数。我们使用已知运动参数生成的模拟运动损坏 k 空间数据来训练网络。测试时,我们通过最小化运动参数、给定这些参数的基于网络的图像重建和获取的测量值之间的数据一致性损失来估计未知运动参数。在模拟和现实的二维快速自旋回波脑磁共振成像上进行的片内运动校正实验实现了高重建保真度,同时提供了显式数据一致性优化的优势。我们的代码可在 https://www.github.com/nalinimsingh/neuroMoCo 公开获取。
{"title":"Data Consistent Deep Rigid MRI Motion Correction.","authors":"Nalini M Singh, Neel Dey, Malte Hoffmann, Bruce Fischl, Elfar Adalsteinsson, Robert Frost, Adrian V Dalca, Polina Golland","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Motion artifacts are a pervasive problem in MRI, leading to misdiagnosis or mischaracterization in population-level imaging studies. Current retrospective rigid intra-slice motion correction techniques jointly optimize estimates of the image and the motion parameters. In this paper, we use a deep network to reduce the joint image-motion parameter search to a search over rigid motion parameters alone. Our network produces a reconstruction as a function of two inputs: corrupted k-space data and motion parameters. We train the network using simulated, motion-corrupted k-space data generated with known motion parameters. At test-time, we estimate unknown motion parameters by minimizing a data consistency loss between the motion parameters, the network-based image reconstruction given those parameters, and the acquired measurements. Intra-slice motion correction experiments on simulated and realistic 2D fast spin echo brain MRI achieve high reconstruction fidelity while providing the benefits of explicit data consistency optimization. Our code is publicly available at https://www.github.com/nalinimsingh/neuroMoCo.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"227 ","pages":"368-381"},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11482239/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142482672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Inferring Dynamic Regulatory Interaction Graphs from Time Series Data with Perturbations. 从有扰动的时间序列数据推断动态调节相互作用图。
Dhananjay Bhaskar, Daniel Sumner Magruder, Matheo Morales, Edward De Brouwer, Aarthi Venkat, Frederik Wenkel, James Noonan, Guy Wolf, Natalia Ivanova, Smita Krishnaswamy

Complex systems are characterized by intricate interactions between entities that evolve dynamically over time. Accurate inference of these dynamic relationships is crucial for understanding and predicting system behavior. In this paper, we propose Regulatory Temporal Interaction Network Inference (RiTINI) for inferring time-varying interaction graphs in complex systems using a novel combination of space-and-time graph attentions and graph neural ordinary differential equations (ODEs). RiTINI leverages time-lapse signals on a graph prior, as well as perturbations of signals at various nodes in order to effectively capture the dynamics of the underlying system. This approach is distinct from traditional causal inference networks, which are limited to inferring acyclic and static graphs. In contrast, RiTINI can infer cyclic, directed, and time-varying graphs, providing a more comprehensive and accurate representation of complex systems. The graph attention mechanism in RiTINI allows the model to adaptively focus on the most relevant interactions in time and space, while the graph neural ODEs enable continuous-time modeling of the system's dynamics. We evaluate RiTINI's performance on simulations of dynamical systems, neuronal networks, and gene regulatory networks, demonstrating its state-of-the-art capability in inferring interaction graphs compared to previous methods.

复杂系统的特点是实体之间复杂的相互作用,这些相互作用随时间动态演变。这些动态关系的准确推断对于理解和预测系统行为至关重要。本文提出了一种将时空图关注与图神经常微分方程(ODEs)相结合的新方法,用于复杂系统时变交互图的推理。为了有效地捕捉底层系统的动态,RiTINI利用了先验图上的延时信号,以及各个节点上的信号扰动。这种方法与传统的因果推理网络不同,传统的因果推理网络仅限于推断非循环和静态图。相比之下,RiTINI可以推断循环,有向和时变图,为复杂系统提供更全面和准确的表示。RiTINI中的图注意机制允许模型自适应地关注时间和空间中最相关的交互,而图神经ode则允许对系统动态进行连续时间建模。我们评估了RiTINI在模拟动力系统、神经网络和基因调控网络上的性能,与以前的方法相比,展示了其在推断相互作用图方面的最先进能力。
{"title":"Inferring Dynamic Regulatory Interaction Graphs from Time Series Data with Perturbations.","authors":"Dhananjay Bhaskar, Daniel Sumner Magruder, Matheo Morales, Edward De Brouwer, Aarthi Venkat, Frederik Wenkel, James Noonan, Guy Wolf, Natalia Ivanova, Smita Krishnaswamy","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Complex systems are characterized by intricate interactions between entities that evolve dynamically over time. Accurate inference of these dynamic relationships is crucial for understanding and predicting system behavior. In this paper, we propose Regulatory Temporal Interaction Network Inference (RiTINI) for inferring time-varying interaction graphs in complex systems using a novel combination of space-and-time graph attentions and graph neural ordinary differential equations (ODEs). RiTINI leverages time-lapse signals on a graph prior, as well as perturbations of signals at various nodes in order to effectively capture the dynamics of the underlying system. This approach is distinct from traditional causal inference networks, which are limited to inferring acyclic and static graphs. In contrast, RiTINI can infer cyclic, directed, and time-varying graphs, providing a more comprehensive and accurate representation of complex systems. The graph attention mechanism in RiTINI allows the model to adaptively focus on the most relevant interactions in time and space, while the graph neural ODEs enable continuous-time modeling of the system's dynamics. We evaluate RiTINI's performance on simulations of dynamical systems, neuronal networks, and gene regulatory networks, demonstrating its state-of-the-art capability in inferring interaction graphs compared to previous methods.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"231 ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12269789/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144661234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data. 基于小先导数据的大数据集分类器准确率预测的概率方法。
Ethan Harvey, Wansu Chen, David M Kent, Michael C Hughes

Practitioners building classifiers often start with a smaller pilot dataset and plan to grow to larger data in the near future. Such projects need a toolkit for extrapolating how much classifier accuracy may improve from a 2x, 10x, or 50x increase in data size. While existing work has focused on finding a single "best-fit" curve using various functional forms like power laws, we argue that modeling and assessing the uncertainty of predictions is critical yet has seen less attention. In this paper, we propose a Gaussian process model to obtain probabilistic extrapolations of accuracy or similar performance metrics as dataset size increases. We evaluate our approach in terms of error, likelihood, and coverage across six datasets. Though we focus on medical tasks and image modalities, our open source approach generalizes to any kind of classifier.

构建分类器的从业者通常从一个较小的试验数据集开始,并计划在不久的将来发展到更大的数据集。这样的项目需要一个工具包来推断数据大小增加2倍、10倍或50倍会提高多少分类器的准确性。虽然现有的工作主要集中在寻找一个单一的“最佳拟合”曲线,使用各种函数形式,如幂律,我们认为,建模和评估预测的不确定性是至关重要的,但很少有人关注。在本文中,我们提出了一个高斯过程模型,以获得随着数据集大小增加的准确性或类似性能指标的概率外推。我们根据六个数据集的误差、可能性和覆盖率来评估我们的方法。虽然我们专注于医疗任务和图像模式,但我们的开源方法可以推广到任何类型的分类器。
{"title":"A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data.","authors":"Ethan Harvey, Wansu Chen, David M Kent, Michael C Hughes","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Practitioners building classifiers often start with a smaller pilot dataset and plan to grow to larger data in the near future. Such projects need a toolkit for extrapolating how much classifier accuracy may improve from a 2x, 10x, or 50x increase in data size. While existing work has focused on finding a single \"best-fit\" curve using various functional forms like power laws, we argue that modeling and assessing the <i>uncertainty</i> of predictions is critical yet has seen less attention. In this paper, we propose a Gaussian process model to obtain probabilistic extrapolations of accuracy or similar performance metrics as dataset size increases. We evaluate our approach in terms of error, likelihood, and coverage across six datasets. Though we focus on medical tasks and image modalities, our open source approach generalizes to any kind of classifier.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"225 ","pages":"129-144"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11826957/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143434464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MULTIPAR: Supervised Irregular Tensor Factorization with Multi-task Learning for Computational Phenotyping. 基于多任务学习的不规则张量分解。
Yifei Ren, Jian Lou, Li Xiong, Joyce C Ho, Xiaoqian Jiang, Sivasubramanium Venkatraman Bhavani

Tensor factorization has received increasing interest due to its intrinsic ability to capture latent factors in multi-dimensional data with many applications including Electronic Health Records (EHR) mining. PARAFAC2 and its variants have been proposed to address irregular tensors where one of the tensor modes is not aligned, e.g., different patients in EHRs may have different length of records. PARAFAC2 has been successfully applied to EHRs for extracting meaningful medical concepts (phenotypes). Despite recent advancements, current models' predictability and interpretability are not satisfactory, which limits its utility for downstream analysis. In this paper, we propose MULTIPAR: a supervised irregular tensor factorization with multi-task learning for computational phenotyping. MULTIPAR is flexible to incorporate both static (e.g. in-hospital mortality prediction) and continuous or dynamic (e.g. the need for ventilation) tasks. By supervising the tensor factorization with downstream prediction tasks and leveraging information from multiple related predictive tasks, MULTIPAR can yield not only more meaningful phenotypes but also better predictive performance for downstream tasks. We conduct extensive experiments on two real-world temporal EHR datasets to demonstrate that MULTIPAR is scalable and achieves better tensor fit with more meaningful subgroups and stronger predictive performance compared to existing state-of-the-art methods. The implementation of MULTIPAR is available.

张量分解由于其固有的捕获多维数据中潜在因素的能力而受到越来越多的关注,包括电子健康记录(EHR)挖掘在内的许多应用。PARAFAC2及其变体已被提出用于解决其中一个张量模式未对齐的不规则张量,例如,电子病历中的不同患者可能具有不同长度的记录。PARAFAC2已成功应用于电子病历,用于提取有意义的医学概念(表型)。尽管最近取得了进展,但当前模型的可预测性和可解释性并不令人满意,这限制了其在下游分析中的效用。在本文中,我们提出了MULTIPAR:一种具有多任务学习的有监督不规则张量分解算法。MULTIPAR可以灵活地纳入静态(如住院死亡率预测)和连续或动态(如需要通风)任务。通过监督下游预测任务的张量分解,并利用来自多个相关预测任务的信息,MULTIPAR不仅可以产生更有意义的表型,还可以为下游任务提供更好的预测性能。我们在两个真实世界的时间EHR数据集上进行了广泛的实验,以证明MULTIPAR是可扩展的,与现有的最先进的方法相比,它具有更好的张量拟合和更有意义的子组,并且具有更强的预测性能。MULTIPAR的实现是可用的。
{"title":"MULTIPAR: Supervised Irregular Tensor Factorization with Multi-task Learning for Computational Phenotyping.","authors":"Yifei Ren, Jian Lou, Li Xiong, Joyce C Ho, Xiaoqian Jiang, Sivasubramanium Venkatraman Bhavani","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Tensor factorization has received increasing interest due to its intrinsic ability to capture latent factors in multi-dimensional data with many applications including Electronic Health Records (EHR) mining. PARAFAC2 and its variants have been proposed to address irregular tensors where one of the tensor modes is not aligned, e.g., different patients in EHRs may have different length of records. PARAFAC2 has been successfully applied to EHRs for extracting meaningful medical concepts (phenotypes). Despite recent advancements, current models' predictability and interpretability are not satisfactory, which limits its utility for downstream analysis. In this paper, we propose MULTIPAR: a supervised irregular tensor factorization with multi-task learning for computational phenotyping. MULTIPAR is flexible to incorporate both static (e.g. in-hospital mortality prediction) and continuous or dynamic (e.g. the need for ventilation) tasks. By supervising the tensor factorization with downstream prediction tasks and leveraging information from multiple related predictive tasks, MULTIPAR can yield not only more meaningful phenotypes but also better predictive performance for downstream tasks. We conduct extensive experiments on two real-world temporal EHR datasets to demonstrate that MULTIPAR is scalable and achieves better tensor fit with more meaningful subgroups and stronger predictive performance compared to existing state-of-the-art methods. The implementation of MULTIPAR is available.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"225 ","pages":"498-511"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11611252/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142775499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GANcMRI: Cardiac magnetic resonance video generation and physiologic guidance using latent space prompting. GANcMRI:利用潜在空间提示生成心脏磁共振视频和生理指导。
Milos Vukadinovic, Alan C Kwan, Debiao Li, David Ouyang

Generative artificial intelligence can be applied to medical imaging on tasks such as privacy-preserving image generation and superresolution and denoising of existing images. Few prior approaches have used cardiac magnetic resonance imaging (cMRI) as a modality given the complexity of videos (the addition of the temporal dimension) as well as the limited scale of publicly available datasets. We introduce GANcMRI, a generative adversarial network that can synthesize cMRI videos with physiological guidance based on latent space prompting. GANcMRI uses a StyleGAN framework to learn the latent space from individual video frames and leverages the timedependent trajectory between end-systolic and end-diastolic frames in the latent space to predict progression and generate motion over time. We proposed various methods for modeling latent time-dependent trajectories and found that our Frame-to-frame approach generates the best motion and video quality. GANcMRI generated high-quality cMRI image frames that are indistinguishable by cardiologists, however, artifacts in video generation allow cardiologists to still recognize the difference between real and generated videos. The generated cMRI videos can be prompted to apply physiologybased adjustments which produces clinically relevant phenotypes recognizable by cardiologists. GANcMRI has many potential applications such as data augmentation, education, anomaly detection, and preoperative planning.

生成式人工智能可应用于医学成像任务,如保护隐私的图像生成以及现有图像的超分辨率和去噪。鉴于视频的复杂性(增加了时间维度)以及公开可用数据集的规模有限,此前很少有方法将心脏磁共振成像(cMRI)作为一种模式。我们介绍的 GANcMRI 是一种生成式对抗网络,它可以根据潜在空间提示合成具有生理指导的 cMRI 视频。GANcMRI 使用 StyleGAN 框架从单个视频帧中学习潜空间,并利用潜空间中收缩末期和舒张末期帧之间与时间相关的轨迹来预测进展并随时间产生运动。我们提出了多种方法来模拟潜在的随时间变化的轨迹,结果发现我们的 "帧到帧 "方法生成的运动和视频质量最好。GANcMRI 生成的高质量 cMRI 图像帧是心脏病专家无法分辨的,但是,视频生成过程中的伪影仍能让心脏病专家识别出真实视频和生成视频之间的差异。生成的 cMRI 视频可提示应用基于生理学的调整,从而产生心脏病专家可识别的临床相关表型。GANcMRI 有许多潜在应用,如数据增强、教育、异常检测和术前规划。
{"title":"GANcMRI: Cardiac magnetic resonance video generation and physiologic guidance using latent space prompting.","authors":"Milos Vukadinovic, Alan C Kwan, Debiao Li, David Ouyang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Generative artificial intelligence can be applied to medical imaging on tasks such as privacy-preserving image generation and superresolution and denoising of existing images. Few prior approaches have used cardiac magnetic resonance imaging (cMRI) as a modality given the complexity of videos (the addition of the temporal dimension) as well as the limited scale of publicly available datasets. We introduce GANcMRI, a generative adversarial network that can synthesize cMRI videos with physiological guidance based on latent space prompting. GANcMRI uses a StyleGAN framework to learn the latent space from individual video frames and leverages the timedependent trajectory between end-systolic and end-diastolic frames in the latent space to predict progression and generate motion over time. We proposed various methods for modeling latent time-dependent trajectories and found that our Frame-to-frame approach generates the best motion and video quality. GANcMRI generated high-quality cMRI image frames that are indistinguishable by cardiologists, however, artifacts in video generation allow cardiologists to still recognize the difference between real and generated videos. The generated cMRI videos can be prompted to apply physiologybased adjustments which produces clinically relevant phenotypes recognizable by cardiologists. GANcMRI has many potential applications such as data augmentation, education, anomaly detection, and preoperative planning.</p>","PeriodicalId":74504,"journal":{"name":"Proceedings of machine learning research","volume":"225 ","pages":"594-606"},"PeriodicalIF":0.0,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10783442/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139426220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of machine learning research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1