Memory-Guided Transformer with group attention for knee MRI diagnosis

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pattern Recognition Pub Date : 2025-02-05 DOI:10.1016/j.patcog.2025.111417
Rui Huang , Zonghai Huang , Hantang Zhou , Qiang Zhai , Fengjun Mu , Huayi Zhan , Hong Cheng , Xiao Yang
{"title":"Memory-Guided Transformer with group attention for knee MRI diagnosis","authors":"Rui Huang ,&nbsp;Zonghai Huang ,&nbsp;Hantang Zhou ,&nbsp;Qiang Zhai ,&nbsp;Fengjun Mu ,&nbsp;Huayi Zhan ,&nbsp;Hong Cheng ,&nbsp;Xiao Yang","doi":"10.1016/j.patcog.2025.111417","DOIUrl":null,"url":null,"abstract":"<div><div>Magnetic resonance imaging (MRI) plays an important role in the diagnosis of knee injuries, due to its detailed information, which greatly enhances physicians’ diagnostic accuracy. However, the complex image information also makes it difficult for physicians to interpret MRI. There is an urgent need for a computer-assisted method to help physicians extract key information from MRIs and make diagnostic decisions. Knee MRI includes features across three different levels: anatomical plane-level, dataset-level, and case-level. In this paper, we approach the intelligent diagnosis of knee injuries as an interpretable MRI classification task, using a three-stage Memory-Guided Transformer (MGT) for implementation. The first stage focuses on extracting anatomical plane-level and dataset-level features through group attention and cross-attention, which are then stored in the memory matrix. In the second stage, the trained memory matrix guides the extraction of case-level features from different anatomical planes for each case. Finally, the probability of knee injury is determined using linear regression. The MGT was trained with the publicly available MRNet dataset. Compared with the original optimal model PERMIT, it shows a 5.7% improvement in the Youden index. A high level of consistency was observed between the physician-labeled diagnostic regions and the regions identified by group attention. Visualization of the trained memory revealed specific patterns, with column 62 corresponding to healthy subjects and column 81 to patients. These results demonstrate that MGT can effectively assist physicians in diagnosing knee injuries while offering excellent interpretability.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"162 ","pages":"Article 111417"},"PeriodicalIF":7.6000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325000779","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Magnetic resonance imaging (MRI) plays an important role in the diagnosis of knee injuries, due to its detailed information, which greatly enhances physicians’ diagnostic accuracy. However, the complex image information also makes it difficult for physicians to interpret MRI. There is an urgent need for a computer-assisted method to help physicians extract key information from MRIs and make diagnostic decisions. Knee MRI includes features across three different levels: anatomical plane-level, dataset-level, and case-level. In this paper, we approach the intelligent diagnosis of knee injuries as an interpretable MRI classification task, using a three-stage Memory-Guided Transformer (MGT) for implementation. The first stage focuses on extracting anatomical plane-level and dataset-level features through group attention and cross-attention, which are then stored in the memory matrix. In the second stage, the trained memory matrix guides the extraction of case-level features from different anatomical planes for each case. Finally, the probability of knee injury is determined using linear regression. The MGT was trained with the publicly available MRNet dataset. Compared with the original optimal model PERMIT, it shows a 5.7% improvement in the Youden index. A high level of consistency was observed between the physician-labeled diagnostic regions and the regions identified by group attention. Visualization of the trained memory revealed specific patterns, with column 62 corresponding to healthy subjects and column 81 to patients. These results demonstrate that MGT can effectively assist physicians in diagnosing knee injuries while offering excellent interpretability.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
记忆引导变压器与群体关注在膝关节MRI诊断中的应用
磁共振成像(MRI)在膝关节损伤的诊断中发挥着重要的作用,因为它能提供详细的信息,大大提高了医生的诊断准确性。然而,复杂的图像信息也给医生解释MRI带来了困难。迫切需要一种计算机辅助的方法来帮助医生从核磁共振成像中提取关键信息并做出诊断决定。膝关节MRI包括三个不同水平的特征:解剖平面水平、数据集水平和病例水平。在本文中,我们将膝关节损伤的智能诊断作为一项可解释的MRI分类任务,使用三级记忆引导变压器(MGT)来实现。第一阶段主要通过群体注意和交叉注意提取解剖平面级和数据集级特征,并将其存储在记忆矩阵中。在第二阶段,训练好的记忆矩阵指导从每个病例的不同解剖平面提取病例级特征。最后,用线性回归法确定膝关节损伤的概率。MGT使用公开可用的MRNet数据集进行训练。与原最优模型PERMIT相比,优登指数提高了5.7%。在医生标记的诊断区域和群体注意识别的区域之间观察到高度的一致性。训练记忆的可视化显示了特定的模式,第62列对应健康受试者,第81列对应患者。这些结果表明,MGT可以有效地帮助医生诊断膝关节损伤,同时提供良好的可解释性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Pattern Recognition
Pattern Recognition 工程技术-工程:电子与电气
CiteScore
14.40
自引率
16.20%
发文量
683
审稿时长
5.6 months
期刊介绍: The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.
期刊最新文献
Editorial Board Contrastive calibration on consensus and complementary multi-view representations Adversarial supervised contrastive feature learning for cross-modal retrieval A visual-textual mutual guidance fusion network for remote sensing visual question answering Generalizable face forgery detection via mining single-step reconstruction difference
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1