Qianqian Wang , Junhao Zhang , Long Li , Lishan Qiao , Pew-Thian Yap , Mingxia Liu
{"title":"Graph augmentation guided federated knowledge distillation for multisite functional MRI analysis","authors":"Qianqian Wang , Junhao Zhang , Long Li , Lishan Qiao , Pew-Thian Yap , Mingxia Liu","doi":"10.1016/j.patcog.2025.111559","DOIUrl":null,"url":null,"abstract":"<div><div>Resting-state functional MRI (rs-fMRI) is a non-invasive tool increasingly used to detect abnormalities in brain connectivity for disorder analysis. Many learning models have been explored for fMRI analysis but usually require extensive data for reliable training. Multisite studies increase sample sizes by pooling data from multiple sites, but often face data security and privacy challenges. Federated learning (FL) facilitates collaborative model training without pooling fMRI data from different sites/clients. However, many FL methods share model parameters between clients, posing significant security risks during communication and greatly increasing communication costs. Besides, fMRI data for local model training is usually limited at each site, which may hinder local model training. To this end, we propose a graph augmentation guided federated distillation (GAFD) framework for multisite fMRI analysis and brain disorder identification. At each client, we augment each input functional connectivity network/graph derived from fMRI by perturbing node features and edges, followed by a feature encoder for graph representation learning. A contrastive loss is used to maximize the agreement of learned representations from the same subject, further enhancing discriminative power of fMRI representations. On the server side, the server receives model outputs (<em>i.e.</em>, logit scores) corresponding to augmented graphs from each client and merges them. The merged logit score is then sent back to each client for knowledge distillation. This can promote knowledge sharing among clients, reduce the risk of privacy leakage, and greatly decrease communication costs. Experimental results on two multisite fMRI datasets indicate that our approach outperforms several state-of-the-arts.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"164 ","pages":"Article 111559"},"PeriodicalIF":7.5000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325002195","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Resting-state functional MRI (rs-fMRI) is a non-invasive tool increasingly used to detect abnormalities in brain connectivity for disorder analysis. Many learning models have been explored for fMRI analysis but usually require extensive data for reliable training. Multisite studies increase sample sizes by pooling data from multiple sites, but often face data security and privacy challenges. Federated learning (FL) facilitates collaborative model training without pooling fMRI data from different sites/clients. However, many FL methods share model parameters between clients, posing significant security risks during communication and greatly increasing communication costs. Besides, fMRI data for local model training is usually limited at each site, which may hinder local model training. To this end, we propose a graph augmentation guided federated distillation (GAFD) framework for multisite fMRI analysis and brain disorder identification. At each client, we augment each input functional connectivity network/graph derived from fMRI by perturbing node features and edges, followed by a feature encoder for graph representation learning. A contrastive loss is used to maximize the agreement of learned representations from the same subject, further enhancing discriminative power of fMRI representations. On the server side, the server receives model outputs (i.e., logit scores) corresponding to augmented graphs from each client and merges them. The merged logit score is then sent back to each client for knowledge distillation. This can promote knowledge sharing among clients, reduce the risk of privacy leakage, and greatly decrease communication costs. Experimental results on two multisite fMRI datasets indicate that our approach outperforms several state-of-the-arts.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.