S.S. Lövdal, R. van Veen, G. Carli, R. J. Renken, T. Shiner, N. Bregman, R. Orad, D. Arnaldi, B. Orso, S. Morbelli, P. Mattioli, K. L. Leenders, R. Dierckx, S. K. Meles, M. Biehl
{"title":"IRMA: Machine learning-based harmonization of $$^{18}$$ F-FDG PET brain scans in multi-center studies","authors":"S.S. Lövdal, R. van Veen, G. Carli, R. J. Renken, T. Shiner, N. Bregman, R. Orad, D. Arnaldi, B. Orso, S. Morbelli, P. Mattioli, K. L. Leenders, R. Dierckx, S. K. Meles, M. Biehl","doi":"10.1007/s00259-025-07114-4","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Purpose</h3><p>Center-specific effects in PET brain scans arise due to differences in technical and procedural aspects. This restricts the merging of data between centers and introduces source-specific bias.</p><h3 data-test=\"abstract-sub-heading\">Methods</h3><p>We demonstrate the use of the recently proposed machine learning method Iterated Relevance Matrix Analysis (IRMA) for harmonization of center-specific effects in brain <span>\\(^{18}\\)</span>F-Fluorodeoxyglucose (<span>\\(^{18}\\)</span>F-FDG) PET scans. The center difference is learned by applying IRMA on PCA-based feature vectors of healthy controls (HC), resulting in a subspace <span>\\(\\varvec{V}\\)</span>, representing information not comparable between centers, and the remaining subspace <span>\\(\\varvec{U}\\)</span>, where no center differences are present. In this proof-of-concept study, we demonstrate the properties of the method using data from four centers. After center-harmonization, a Generalized Matrix Learning Vector Quantization (GMLVQ) model was trained to discriminate between Parkinson’s disease, Alzheimer’s disease and Dementia with Lewy Bodies.</p><h3 data-test=\"abstract-sub-heading\">Results</h3><p>At the initial IRMA iteration, the system was able to determine the center origin of the four HC cohorts almost perfectly. The method required six iterations, corresponding to a six-dimensional subspace <span>\\(\\varvec{V}\\)</span>, to determine the entire center difference. An uncorrected disease classification model was highly biased to center-specific effects, creating a falsely inflated performance when applying internal (cross-) validation. The cross-validation performance of the center-harmonized model remained high, while it generalized significantly better to unseen test cohorts. Furthermore, the framework is highly transparent, providing analytic reconstructions of the correction and visualizations of the data in voxel space.</p><h3 data-test=\"abstract-sub-heading\">Conclusion</h3><p>IRMA can be used to learn and disregard center-specific information in features extracted from brain <span>\\(^{18}\\)</span>F-FDG PET scans, while retaining disease-specific information.</p>","PeriodicalId":11909,"journal":{"name":"European Journal of Nuclear Medicine and Molecular Imaging","volume":"25 1","pages":""},"PeriodicalIF":8.6000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Nuclear Medicine and Molecular Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00259-025-07114-4","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
Center-specific effects in PET brain scans arise due to differences in technical and procedural aspects. This restricts the merging of data between centers and introduces source-specific bias.
Methods
We demonstrate the use of the recently proposed machine learning method Iterated Relevance Matrix Analysis (IRMA) for harmonization of center-specific effects in brain \(^{18}\)F-Fluorodeoxyglucose (\(^{18}\)F-FDG) PET scans. The center difference is learned by applying IRMA on PCA-based feature vectors of healthy controls (HC), resulting in a subspace \(\varvec{V}\), representing information not comparable between centers, and the remaining subspace \(\varvec{U}\), where no center differences are present. In this proof-of-concept study, we demonstrate the properties of the method using data from four centers. After center-harmonization, a Generalized Matrix Learning Vector Quantization (GMLVQ) model was trained to discriminate between Parkinson’s disease, Alzheimer’s disease and Dementia with Lewy Bodies.
Results
At the initial IRMA iteration, the system was able to determine the center origin of the four HC cohorts almost perfectly. The method required six iterations, corresponding to a six-dimensional subspace \(\varvec{V}\), to determine the entire center difference. An uncorrected disease classification model was highly biased to center-specific effects, creating a falsely inflated performance when applying internal (cross-) validation. The cross-validation performance of the center-harmonized model remained high, while it generalized significantly better to unseen test cohorts. Furthermore, the framework is highly transparent, providing analytic reconstructions of the correction and visualizations of the data in voxel space.
Conclusion
IRMA can be used to learn and disregard center-specific information in features extracted from brain \(^{18}\)F-FDG PET scans, while retaining disease-specific information.
期刊介绍:
The European Journal of Nuclear Medicine and Molecular Imaging serves as a platform for the exchange of clinical and scientific information within nuclear medicine and related professions. It welcomes international submissions from professionals involved in the functional, metabolic, and molecular investigation of diseases. The journal's coverage spans physics, dosimetry, radiation biology, radiochemistry, and pharmacy, providing high-quality peer review by experts in the field. Known for highly cited and downloaded articles, it ensures global visibility for research work and is part of the EJNMMI journal family.