Disentangled latent energy-based style translation: An image-level structural MRI harmonization framework

IF 6.3 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Neural Networks Pub Date : 2025-04-01 Epub Date: 2024-12-16 DOI:10.1016/j.neunet.2024.107039
Mengqi Wu , Lintao Zhang , Pew-Thian Yap , Hongtu Zhu , Mingxia Liu
{"title":"Disentangled latent energy-based style translation: An image-level structural MRI harmonization framework","authors":"Mengqi Wu ,&nbsp;Lintao Zhang ,&nbsp;Pew-Thian Yap ,&nbsp;Hongtu Zhu ,&nbsp;Mingxia Liu","doi":"10.1016/j.neunet.2024.107039","DOIUrl":null,"url":null,"abstract":"<div><div>Brain magnetic resonance imaging (MRI) has been extensively employed across clinical and research fields, but often exhibits sensitivity to <em>site effects</em> arising from non-biological variations such as differences in field strength and scanner vendors. Numerous retrospective MRI harmonization techniques have demonstrated encouraging outcomes in reducing the site effects at image level. However, existing methods generally suffer from high computational requirements and limited generalizability, restricting their applicability to unseen MRIs. In this paper, we design a novel disentangled latent energy-based style translation (DLEST) framework for unpaired image-level MRI harmonization, consisting of (a) <em>site-invariant image generation</em> (SIG), (b) <em>site-specific style translation</em> (SST), and (c) <em>site-specific MRI synthesis</em> (SMS). Specifically, the SIG employs a latent autoencoder to encode MRIs into a low-dimensional latent space and reconstruct MRIs from latent codes. The SST utilizes an energy-based model to comprehend global latent distribution of a target domain and translate source latent codes towards the target domain, while SMS enables MRI synthesis with a target-specific style. By disentangling image generation and style translation in latent space, the DLEST can achieve efficient style translation. Our model was trained on T1-weighted MRIs from a public dataset (with 3,984 subjects across 58 acquisition sites/settings) and validated on an independent dataset (with 9 traveling subjects scanned in 11 sites/settings) in four tasks: histogram and feature visualization, site classification, brain tissue segmentation, and site-specific structural MRI synthesis. Qualitative and quantitative results demonstrate the superiority of our method over several state-of-the-arts.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"184 ","pages":"Article 107039"},"PeriodicalIF":6.3000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608024009687","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/16 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Brain magnetic resonance imaging (MRI) has been extensively employed across clinical and research fields, but often exhibits sensitivity to site effects arising from non-biological variations such as differences in field strength and scanner vendors. Numerous retrospective MRI harmonization techniques have demonstrated encouraging outcomes in reducing the site effects at image level. However, existing methods generally suffer from high computational requirements and limited generalizability, restricting their applicability to unseen MRIs. In this paper, we design a novel disentangled latent energy-based style translation (DLEST) framework for unpaired image-level MRI harmonization, consisting of (a) site-invariant image generation (SIG), (b) site-specific style translation (SST), and (c) site-specific MRI synthesis (SMS). Specifically, the SIG employs a latent autoencoder to encode MRIs into a low-dimensional latent space and reconstruct MRIs from latent codes. The SST utilizes an energy-based model to comprehend global latent distribution of a target domain and translate source latent codes towards the target domain, while SMS enables MRI synthesis with a target-specific style. By disentangling image generation and style translation in latent space, the DLEST can achieve efficient style translation. Our model was trained on T1-weighted MRIs from a public dataset (with 3,984 subjects across 58 acquisition sites/settings) and validated on an independent dataset (with 9 traveling subjects scanned in 11 sites/settings) in four tasks: histogram and feature visualization, site classification, brain tissue segmentation, and site-specific structural MRI synthesis. Qualitative and quantitative results demonstrate the superiority of our method over several state-of-the-arts.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于解纠缠潜能的风格翻译:一个图像级结构MRI协调框架。
脑磁共振成像(MRI)已广泛应用于临床和研究领域,但往往对非生物变异(如场强和扫描仪供应商的差异)引起的部位效应敏感。许多回顾性MRI协调技术在图像水平上显示了令人鼓舞的减少部位影响的结果。然而,现有的方法通常存在计算要求高、通用性有限的问题,限制了它们对未见过的核磁共振成像的适用性。在本文中,我们设计了一种新的基于解纠缠潜能的风格翻译(DLEST)框架,用于非配对图像级MRI协调,包括(a)位点不变图像生成(SIG), (b)位点特定风格翻译(SST)和(c)位点特定MRI合成(SMS)。具体地说,SIG采用潜伏自编码器将核磁共振成像编码到一个低维的潜伏空间中,并从潜伏编码中重构核磁共振成像。SST利用基于能量的模型来理解目标域的全局潜在分布,并将源潜在代码转换为目标域,而SMS则以特定目标的方式进行MRI合成。通过对隐空间的图像生成和风格转换进行解纠缠,实现了高效的风格转换。我们的模型在来自公共数据集(包括58个采集地点/设置的3984名受试者)的t1加权MRI上进行了训练,并在独立数据集(包括11个地点/设置的9名旅行受试者)上进行了四项任务的验证:直方图和特征可视化、部位分类、脑组织分割和部位特异性结构MRI合成。定性和定量结果表明,我们的方法优于几种最先进的技术。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Neural Networks
Neural Networks 工程技术-计算机:人工智能
CiteScore
13.90
自引率
7.70%
发文量
425
审稿时长
67 days
期刊介绍: Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.
期刊最新文献
Minimizing command timing variability is a key factor in skilled actions Inferring gene regulatory networks via adversarially regularized directed graph autoencoder A continual learning framework with long-term and multiple short-term memory networks A Multi-Agent Continual reinforcement learning framework with multi-Timescale replay and dynamic task classification A survey of recent advances in adversarial attack and defense on vision-language models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1