Pub Date : 2025-11-01Epub Date: 2025-06-03DOI: 10.1117/1.JMI.12.6.061404
Ali Mammadov, Loïc Le Folgoc, Julien Adam, Anne Buronfosse, Gilles Hayem, Guillaume Hocquet, Pietro Gori
Purpose: Multiple instance learning (MIL) has emerged as the best solution for whole slide image (WSI) classification. It consists of dividing each slide into patches, which are treated as a bag of instances labeled with a global label. MIL includes two main approaches: instance-based and embedding-based. In the former, each patch is classified independently, and then, the patch scores are aggregated to predict the bag label. In the latter, bag classification is performed after aggregating patch embeddings. Even if instance-based methods are naturally more interpretable, embedding-based MILs have usually been preferred in the past due to their robustness to poor feature extractors. Recently, the quality of feature embeddings has drastically increased using self-supervised learning (SSL). Nevertheless, many authors continue to endorse the superiority of embedding-based MIL.
Approach: We conduct 710 experiments across 4 datasets, comparing 10 MIL strategies, 6 self-supervised methods with 4 backbones, 4 foundation models, and various pathology-adapted techniques. Furthermore, we introduce 4 instance-based MIL methods, never used before in the pathology domain.
Results: We show that with a good SSL feature extractor, simple instance-based MILs, with very few parameters, obtain similar or better performance than complex, state-of-the-art (SOTA) embedding-based MIL methods, setting new SOTA results on the BRACS and Camelyon16 datasets.
Conclusion: As simple instance-based MIL methods are naturally more interpretable and explainable to clinicians, our results suggest that more effort should be put into well-adapted SSL methods for WSI rather than into complex embedding-based MIL methods.
{"title":"Self-supervision enhances instance-based multiple instance learning methods in digital pathology: a benchmark study.","authors":"Ali Mammadov, Loïc Le Folgoc, Julien Adam, Anne Buronfosse, Gilles Hayem, Guillaume Hocquet, Pietro Gori","doi":"10.1117/1.JMI.12.6.061404","DOIUrl":"10.1117/1.JMI.12.6.061404","url":null,"abstract":"<p><strong>Purpose: </strong>Multiple instance learning (MIL) has emerged as the best solution for whole slide image (WSI) classification. It consists of dividing each slide into patches, which are treated as a bag of instances labeled with a global label. MIL includes two main approaches: instance-based and embedding-based. In the former, each patch is classified independently, and then, the patch scores are aggregated to predict the bag label. In the latter, bag classification is performed after aggregating patch embeddings. Even if instance-based methods are naturally more interpretable, embedding-based MILs have usually been preferred in the past due to their robustness to poor feature extractors. Recently, the quality of feature embeddings has drastically increased using self-supervised learning (SSL). Nevertheless, many authors continue to endorse the superiority of embedding-based MIL.</p><p><strong>Approach: </strong>We conduct 710 experiments across 4 datasets, comparing 10 MIL strategies, 6 self-supervised methods with 4 backbones, 4 foundation models, and various pathology-adapted techniques. Furthermore, we introduce 4 instance-based MIL methods, never used before in the pathology domain.</p><p><strong>Results: </strong>We show that with a good SSL feature extractor, simple instance-based MILs, with very few parameters, obtain similar or better performance than complex, state-of-the-art (SOTA) embedding-based MIL methods, setting new SOTA results on the BRACS and Camelyon16 datasets.</p><p><strong>Conclusion: </strong>As simple instance-based MIL methods are naturally more interpretable and explainable to clinicians, our results suggest that more effort should be put into well-adapted SSL methods for WSI rather than into complex embedding-based MIL methods.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 6","pages":"061404"},"PeriodicalIF":1.9,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12134610/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144235588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-01Epub Date: 2025-11-24DOI: 10.1117/1.JMI.12.6.064003
Mingzhe Hu, Shaoyan Pan, Chih-Wei Chang, Richard L J Qiu, Junbo Peng, Tonghe Wang, Justin Roper, Hui Mao, David Yu, Xiaofeng Yang
Purpose: We propose a deep learning framework, the cycle-guided denoising diffusion probability model (CG-DDPM), for cross-modality magnetic resonance imaging (MRI) synthesis. The CG-DDPM aims to generate high-quality MRIs of a target modality from an existing modality, addressing the challenge of missing MRI sequences in clinical practice.
Approach: The CG-DDPM employs two interconnected conditional diffusion probabilistic models, with a cycle-guided reverse latent noise regularization to enhance synthesis consistency and anatomical fidelity. The framework was evaluated using the BraTS2020 dataset, which includes three-dimensional brain MRIs with -weighted, -weighted, and FLAIR modalities. The synthetic images were quantitatively assessed using metrics such as multi-scale structural similarity measure (MSSIM), peak signal-to-noise ratio (PSNR), and mean absolute error (MAE). The CG-DDPM was benchmarked against state-of-the-art methods, including IDDPM, IDDIM, and MRI-cGAN.
Results: The CG-DDPM demonstrated superior performance across all cross-modality synthesis tasks (T1 → T2, T2 → T1, T1 → FLAIR, and FLAIR → T1). It consistently achieved the highest MSSIM values (ranging from 0.966 to 0.971), the lowest MAE (0.011 to 0.013), and competitive PSNR values (27.7 to 28.8 dB). Across all tasks, CG-DDPM outperformed IDDPM, IDDIM, and MRI-cGAN in most metrics and exhibited significantly lower uncertainty and inconsistency in MC-based sampling. Statistical analyses confirmed the robustness of CG-DDPM, with in key comparisons.
Conclusions: The proposed CG-DDPM provides a robust and efficient solution for cross-modality MRI synthesis, offering improved accuracy, stability, and clinical applicability compared with existing methods. This approach has the potential to streamline MRI-based workflows, enhance diagnostic imaging, and support precision treatment planning in medical physics and radiation oncology.
{"title":"Cross-modality 3D MRI synthesis via cycle-guided denoising diffusion probability model.","authors":"Mingzhe Hu, Shaoyan Pan, Chih-Wei Chang, Richard L J Qiu, Junbo Peng, Tonghe Wang, Justin Roper, Hui Mao, David Yu, Xiaofeng Yang","doi":"10.1117/1.JMI.12.6.064003","DOIUrl":"10.1117/1.JMI.12.6.064003","url":null,"abstract":"<p><strong>Purpose: </strong>We propose a deep learning framework, the cycle-guided denoising diffusion probability model (CG-DDPM), for cross-modality magnetic resonance imaging (MRI) synthesis. The CG-DDPM aims to generate high-quality MRIs of a target modality from an existing modality, addressing the challenge of missing MRI sequences in clinical practice.</p><p><strong>Approach: </strong>The CG-DDPM employs two interconnected conditional diffusion probabilistic models, with a cycle-guided reverse latent noise regularization to enhance synthesis consistency and anatomical fidelity. The framework was evaluated using the BraTS2020 dataset, which includes three-dimensional brain MRIs with <math><mrow><mi>T</mi> <mn>1</mn></mrow> </math> -weighted, <math><mrow><mi>T</mi> <mn>2</mn></mrow> </math> -weighted, and FLAIR modalities. The synthetic images were quantitatively assessed using metrics such as multi-scale structural similarity measure (MSSIM), peak signal-to-noise ratio (PSNR), and mean absolute error (MAE). The CG-DDPM was benchmarked against state-of-the-art methods, including IDDPM, IDDIM, and MRI-cGAN.</p><p><strong>Results: </strong>The CG-DDPM demonstrated superior performance across all cross-modality synthesis tasks (T1 → T2, T2 → T1, T1 → FLAIR, and FLAIR → T1). It consistently achieved the highest MSSIM values (ranging from 0.966 to 0.971), the lowest MAE (0.011 to 0.013), and competitive PSNR values (27.7 to 28.8 dB). Across all tasks, CG-DDPM outperformed IDDPM, IDDIM, and MRI-cGAN in most metrics and exhibited significantly lower uncertainty and inconsistency in MC-based sampling. Statistical analyses confirmed the robustness of CG-DDPM, with <math><mrow><mi>p</mi> <mrow><mtext>-</mtext></mrow> <mrow><mtext>values</mtext></mrow> <mo><</mo> <mn>0.05</mn></mrow> </math> in key comparisons.</p><p><strong>Conclusions: </strong>The proposed CG-DDPM provides a robust and efficient solution for cross-modality MRI synthesis, offering improved accuracy, stability, and clinical applicability compared with existing methods. This approach has the potential to streamline MRI-based workflows, enhance diagnostic imaging, and support precision treatment planning in medical physics and radiation oncology.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 6","pages":"064003"},"PeriodicalIF":1.7,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12643384/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145606948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-01Epub Date: 2025-12-18DOI: 10.1117/1.JMI.12.6.064006
Lucas W Remedios, Chloe Cho, Trent M Schwartz, Dingjie Su, Gaurav Rudravaram, Chenyu Gao, Aravind R Krishnan, Adam M Saunders, Michael E Kim, Shunxing Bao, Alvin C Powers, Bennett A Landman, John Virostko
<p><strong>Purpose: </strong>Although elevated body mass index (BMI) is a well-known risk factor for type 2 diabetes, the disease's presence in some lean adults and absence in others with obesity suggests that more detailed measurements of body composition may uncover abdominal phenotypes of type 2 diabetes. With artificial intelligence (AI) and computed tomography (CT), we can now leverage robust image segmentation to extract detailed measurements of size, shape, and tissue composition from abdominal organs, abdominal muscle, and abdominal fat depots in 3D clinical imaging at scale. This creates an opportunity to empirically define body composition signatures linked to type 2 diabetes risk and protection using large-scale clinical data.</p><p><strong>Approach: </strong>We studied imaging records of 1728 de-identified patients from Vanderbilt University Medical Center with BMI collected from the electronic health record. To uncover BMI-specific diabetic abdominal patterns from clinical CT, we applied our design four times: once on the full cohort ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>1728</mn></mrow> </math> ) and once on lean ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>497</mn></mrow> </math> ), overweight ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>611</mn></mrow> </math> ), and obese ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>620</mn></mrow> </math> ) subgroups separately. Briefly, our experimental design transforms abdominal scans into collections of explainable measurements, identifies which measurements most strongly predict type 2 diabetes and how they contribute to risk or protection, groups scans by shared model decision patterns, and links those decision patterns back to interpretable abdominal phenotypes in the original explainable measurement space of the abdomen using the following steps. (1) To capture abdominal composition: we represented each scan as a collection of 88 automatically extracted measurements of the size, shape, and fat content of abdominal structures using TotalSegmentator. (2) To learn key predictors: we trained a 10-fold cross-validated random forest classifier with SHapley Additive exPlanations (SHAP) analysis to rank features and estimate their risk-versus-protective effects for type 2 diabetes. (3) To validate individual effects: for the 20 highest-ranked features, we ran univariate logistic regressions to quantify their independent associations with type 2 diabetes. (4) To identify decision-making patterns: we embedded the top-20 SHAP profiles with uniform manifold approximation and projection and applied silhouette-guided K-means to cluster the random forest's decision space. (5) To link decisions to abdominal phenotypes: we fit one-versus-rest classifiers on the original anatomical measurements from each decision cluster and applied a second SHAP analysis to explore whether the random forest's logic had identified abdominal phenotypes.</p><p><strong>Results: </strong>Across the full, lean, overweight, and obese cohort
{"title":"Data-driven abdominal phenotypes of type 2 diabetes in lean, overweight, and obese cohorts from computed tomography.","authors":"Lucas W Remedios, Chloe Cho, Trent M Schwartz, Dingjie Su, Gaurav Rudravaram, Chenyu Gao, Aravind R Krishnan, Adam M Saunders, Michael E Kim, Shunxing Bao, Alvin C Powers, Bennett A Landman, John Virostko","doi":"10.1117/1.JMI.12.6.064006","DOIUrl":"10.1117/1.JMI.12.6.064006","url":null,"abstract":"<p><strong>Purpose: </strong>Although elevated body mass index (BMI) is a well-known risk factor for type 2 diabetes, the disease's presence in some lean adults and absence in others with obesity suggests that more detailed measurements of body composition may uncover abdominal phenotypes of type 2 diabetes. With artificial intelligence (AI) and computed tomography (CT), we can now leverage robust image segmentation to extract detailed measurements of size, shape, and tissue composition from abdominal organs, abdominal muscle, and abdominal fat depots in 3D clinical imaging at scale. This creates an opportunity to empirically define body composition signatures linked to type 2 diabetes risk and protection using large-scale clinical data.</p><p><strong>Approach: </strong>We studied imaging records of 1728 de-identified patients from Vanderbilt University Medical Center with BMI collected from the electronic health record. To uncover BMI-specific diabetic abdominal patterns from clinical CT, we applied our design four times: once on the full cohort ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>1728</mn></mrow> </math> ) and once on lean ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>497</mn></mrow> </math> ), overweight ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>611</mn></mrow> </math> ), and obese ( <math><mrow><mi>n</mi> <mo>=</mo> <mn>620</mn></mrow> </math> ) subgroups separately. Briefly, our experimental design transforms abdominal scans into collections of explainable measurements, identifies which measurements most strongly predict type 2 diabetes and how they contribute to risk or protection, groups scans by shared model decision patterns, and links those decision patterns back to interpretable abdominal phenotypes in the original explainable measurement space of the abdomen using the following steps. (1) To capture abdominal composition: we represented each scan as a collection of 88 automatically extracted measurements of the size, shape, and fat content of abdominal structures using TotalSegmentator. (2) To learn key predictors: we trained a 10-fold cross-validated random forest classifier with SHapley Additive exPlanations (SHAP) analysis to rank features and estimate their risk-versus-protective effects for type 2 diabetes. (3) To validate individual effects: for the 20 highest-ranked features, we ran univariate logistic regressions to quantify their independent associations with type 2 diabetes. (4) To identify decision-making patterns: we embedded the top-20 SHAP profiles with uniform manifold approximation and projection and applied silhouette-guided K-means to cluster the random forest's decision space. (5) To link decisions to abdominal phenotypes: we fit one-versus-rest classifiers on the original anatomical measurements from each decision cluster and applied a second SHAP analysis to explore whether the random forest's logic had identified abdominal phenotypes.</p><p><strong>Results: </strong>Across the full, lean, overweight, and obese cohort","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 6","pages":"064006"},"PeriodicalIF":1.7,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12712129/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145806053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-01Epub Date: 2025-11-08DOI: 10.1117/1.JMI.12.6.064501
Garazi Casillas Martinez, Anthony Winder, Emma A M Stanley, Raissa Souza, Matthias Wilms, Myka Estes, Sarah J MacEachern, Nils D Forkert
Purpose: Autism is one of the most common neurodevelopmental conditions, and it is characterized by restricted, repetitive behaviors and social difficulties that affect daily functioning. It is challenging to provide an early and accurate diagnosis due to the wide diversity of symptoms and the developmental changes that occur during childhood. We evaluate the feasibility of an explainable deep learning (DL) model using structural MRI (sMRI) to identify meaningful brain biomarkers relevant to autism in children and thus support its diagnosis.
Approach: A total of 452 -weighted sMRI scans from children aged 9 to 11 years were obtained from the Autism Brain Imaging Data Exchange database. A DL model was trained to differentiate between autistic and typically developing children. Model explainability was assessed using saliency maps to identify key brain regions contributing to classification. Model performance was evaluated across 20 folds and compared with traditional machine learning models trained with regional volumetric features extracted from the sMRI scans.
Results: The model achieved a mean area under the receiver operating curve of 71.2%. The saliency maps highlighted brain regions that are known neuroanatomical and functional biomarkers associated with autism, such as the cuneus, pericalcarine, ventricles, lingual, vermal lobules, caudate, and thalamus.
Conclusions: We show the potential of interpretable DL models trained on sMRI data to aid in autism diagnosis within a narrowly defined pediatric age group. Our findings contribute to the field of explainable artificial intelligence methods in neurodevelopmental research and may help in clinical decision-making for autism and other neurodevelopmental conditions.
{"title":"Interpretable convolutional neural network for autism diagnosis support in children using structural magnetic resonance imaging datasets.","authors":"Garazi Casillas Martinez, Anthony Winder, Emma A M Stanley, Raissa Souza, Matthias Wilms, Myka Estes, Sarah J MacEachern, Nils D Forkert","doi":"10.1117/1.JMI.12.6.064501","DOIUrl":"10.1117/1.JMI.12.6.064501","url":null,"abstract":"<p><strong>Purpose: </strong>Autism is one of the most common neurodevelopmental conditions, and it is characterized by restricted, repetitive behaviors and social difficulties that affect daily functioning. It is challenging to provide an early and accurate diagnosis due to the wide diversity of symptoms and the developmental changes that occur during childhood. We evaluate the feasibility of an explainable deep learning (DL) model using structural MRI (sMRI) to identify meaningful brain biomarkers relevant to autism in children and thus support its diagnosis.</p><p><strong>Approach: </strong>A total of 452 <math><mrow><mi>T</mi> <mn>1</mn></mrow> </math> -weighted sMRI scans from children aged 9 to 11 years were obtained from the Autism Brain Imaging Data Exchange database. A DL model was trained to differentiate between autistic and typically developing children. Model explainability was assessed using saliency maps to identify key brain regions contributing to classification. Model performance was evaluated across 20 folds and compared with traditional machine learning models trained with regional volumetric features extracted from the sMRI scans.</p><p><strong>Results: </strong>The model achieved a mean area under the receiver operating curve of 71.2%. The saliency maps highlighted brain regions that are known neuroanatomical and functional biomarkers associated with autism, such as the cuneus, pericalcarine, ventricles, lingual, vermal lobules, caudate, and thalamus.</p><p><strong>Conclusions: </strong>We show the potential of interpretable DL models trained on sMRI data to aid in autism diagnosis within a narrowly defined pediatric age group. Our findings contribute to the field of explainable artificial intelligence methods in neurodevelopmental research and may help in clinical decision-making for autism and other neurodevelopmental conditions.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 6","pages":"064501"},"PeriodicalIF":1.7,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12596041/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145483379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-01Epub Date: 2025-11-13DOI: 10.1117/1.JMI.12.6.067001
Mary N Henderson, David B Jordan, Zong-Ming Li
Purpose: The purpose of this study was to assess the variations in shear-wave speed (SWS) in individual thenar muscles under varied pinch forces in healthy adults. It was hypothesized that (1) SWS would vary among the individual thenar muscles, and (2) there would be an increase in SWS with increased pinch force.
Approach: Thirteen healthy participants' dominant hands were imaged using an ultrasound probe aligned longitudinally along the muscle fibers of the abductor pollicis brevis (APB), opponens pollicis (OPP), and flexor pollicis brevis (FPB). The SWS of each muscle was derived. Each participant completed trials consisting of randomly ordered pinch forces at 0, 10, and 20 N, 10% of maximum pinch force (MPF), and 20% MPF.
Results: The SWSs varied significantly among individual thenar muscles ( ) under absolute ( ) and relative forces ( ). There was a significant increase in SWS as the force increased from 0 to 20 N in the APB ( ) and OPP ( ), and not in the FPB ( ). There was a significant increase in SWS as the force increased from 0 to 20% MPF in the APB ( ), and not in the OPP ( ) or the FPB ( ).
Conclusions: The SWS of the APB and OPP increased as force increased and was different among the thenar muscles. This suggests SWS evaluations may be an appropriate method for evaluating muscles under tension, or different voluntary force conditions, specifically for the APB and OPP muscles.
{"title":"Shear-wave elastography of healthy individual thenar muscles.","authors":"Mary N Henderson, David B Jordan, Zong-Ming Li","doi":"10.1117/1.JMI.12.6.067001","DOIUrl":"https://doi.org/10.1117/1.JMI.12.6.067001","url":null,"abstract":"<p><strong>Purpose: </strong>The purpose of this study was to assess the variations in shear-wave speed (SWS) in individual thenar muscles under varied pinch forces in healthy adults. It was hypothesized that (1) SWS would vary among the individual thenar muscles, and (2) there would be an increase in SWS with increased pinch force.</p><p><strong>Approach: </strong>Thirteen healthy participants' dominant hands were imaged using an ultrasound probe aligned longitudinally along the muscle fibers of the abductor pollicis brevis (APB), opponens pollicis (OPP), and flexor pollicis brevis (FPB). The SWS of each muscle was derived. Each participant completed trials consisting of randomly ordered pinch forces at 0, 10, and 20 N, 10% of maximum pinch force (MPF), and 20% MPF.</p><p><strong>Results: </strong>The SWSs varied significantly among individual thenar muscles ( <math><mrow><mi>p</mi> <mo><</mo> <mn>0.01</mn></mrow> </math> ) under absolute ( <math><mrow><mi>p</mi> <mo><</mo> <mn>0.01</mn></mrow> </math> ) and relative forces ( <math><mrow><mi>p</mi> <mo><</mo> <mn>0.05</mn></mrow> </math> ). There was a significant increase in SWS as the force increased from 0 to 20 N in the APB ( <math><mrow><mi>p</mi> <mo><</mo> <mn>0.001</mn></mrow> </math> ) and OPP ( <math><mrow><mi>p</mi> <mo><</mo> <mn>0.001</mn></mrow> </math> ), and not in the FPB ( <math><mrow><mi>p</mi> <mo>=</mo> <mn>0.873</mn></mrow> </math> ). There was a significant increase in SWS as the force increased from 0 to 20% MPF in the APB ( <math><mrow><mi>p</mi> <mo>=</mo> <mn>0.005</mn></mrow> </math> ), and not in the OPP ( <math><mrow><mi>p</mi> <mo>=</mo> <mn>0.586</mn></mrow> </math> ) or the FPB ( <math><mrow><mi>p</mi> <mo>=</mo> <mn>0.984</mn></mrow> </math> ).</p><p><strong>Conclusions: </strong>The SWS of the APB and OPP increased as force increased and was different among the thenar muscles. This suggests SWS evaluations may be an appropriate method for evaluating muscles under tension, or different voluntary force conditions, specifically for the APB and OPP muscles.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 6","pages":"067001"},"PeriodicalIF":1.7,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12614004/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145542916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-01Epub Date: 2025-12-05DOI: 10.1117/1.JMI.12.6.061411
Bulut Aygunes, Ramazan Gokberk Cinbis, Selim Aksoy
Purpose: Weakly supervised learning (WSL) is widely used for histopathological image analysis by modeling images as sets of fixed-size patches and utilizing image-level diagnoses as weak labels. However, in multiclass classification scenarios, patches corresponding to a wide spectrum of diagnostic categories can co-exist in a single image, complicating the learning process. We aim to address label uncertainty in such multiclass settings.
Approach: We propose a two-branch architecture and a complementary training strategy to improve patch-based WSL. One branch estimates patch-level class likelihoods, whereas the other predicts per-class patch relevance weights. These outputs are combined into image-level class predictions via a relevance-weighted sum of per-patch class likelihoods. To further improve performance, we introduce a multilabel augmentation strategy that forms new training samples by combining patch sets and labels from pairs of images, resulting in multilabel samples that enrich the training set by increasing the chance of having more patches that are relevant to the augmented label sets.
Results: We evaluate our method on two challenging multiclass breast histopathology datasets for region of interest classification. The proposed architecture and training strategy outperform conventional weakly supervised methods, demonstrating improved classification accuracy and robustness, particularly in underrepresented classes.
Conclusions: The proposed architecture effectively models the complex relationship between image-level labels and patch-level content in multiclass histopathological image analysis. Combined with the image-level multilabel augmentation strategy, it improves learning under label uncertainty. These contributions hold potential for more accurate and scalable diagnostic support systems in digital pathology.
{"title":"Patch relevance estimation and multilabel augmentation for weakly supervised histopathology image classification.","authors":"Bulut Aygunes, Ramazan Gokberk Cinbis, Selim Aksoy","doi":"10.1117/1.JMI.12.6.061411","DOIUrl":"https://doi.org/10.1117/1.JMI.12.6.061411","url":null,"abstract":"<p><strong>Purpose: </strong>Weakly supervised learning (WSL) is widely used for histopathological image analysis by modeling images as sets of fixed-size patches and utilizing image-level diagnoses as weak labels. However, in multiclass classification scenarios, patches corresponding to a wide spectrum of diagnostic categories can co-exist in a single image, complicating the learning process. We aim to address label uncertainty in such multiclass settings.</p><p><strong>Approach: </strong>We propose a two-branch architecture and a complementary training strategy to improve patch-based WSL. One branch estimates patch-level class likelihoods, whereas the other predicts per-class patch relevance weights. These outputs are combined into image-level class predictions via a relevance-weighted sum of per-patch class likelihoods. To further improve performance, we introduce a multilabel augmentation strategy that forms new training samples by combining patch sets and labels from pairs of images, resulting in multilabel samples that enrich the training set by increasing the chance of having more patches that are relevant to the augmented label sets.</p><p><strong>Results: </strong>We evaluate our method on two challenging multiclass breast histopathology datasets for region of interest classification. The proposed architecture and training strategy outperform conventional weakly supervised methods, demonstrating improved classification accuracy and robustness, particularly in underrepresented classes.</p><p><strong>Conclusions: </strong>The proposed architecture effectively models the complex relationship between image-level labels and patch-level content in multiclass histopathological image analysis. Combined with the image-level multilabel augmentation strategy, it improves learning under label uncertainty. These contributions hold potential for more accurate and scalable diagnostic support systems in digital pathology.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 6","pages":"061411"},"PeriodicalIF":1.7,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12680080/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145702351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-01Epub Date: 2025-12-15DOI: 10.1117/1.JMI.12.6.063501
Thomas Larsen, Hsin Wu Tseng, Jing-Tzyh Alan Chiang, Srinivasan Vedantham
Purpose: We aim to investigate the performance of dedicated breast computed tomography (CT) for the detection of soft-tissue lesions and compare it to the detection of microcalcification clusters using cascaded systems analysis, with the intent of identifying which lesion type should be used for system optimization.
Approach: Signal and noise were propagated through the imaging chain using a cascaded systems model to obtain the modulation transfer function and noise power spectrum. Two imaging tasks were considered: a soft-tissue mass lesion modeled as a disk of 4 mm diameter and a cluster of microcalcifications modeled as calcium carbonate spheres of diameter. Detectability indices using three numerical observer models were obtained for various scintillator thicknesses and acquisition conditions at a fixed 4.5 mGy mean glandular dose.
Results: Detectability index trends are reversed between soft-tissue lesion and microcalcification cluster for the range of X-ray tube voltages and filtrations studied, indicating a potential need for compromise. However, for each of the 150 combinations studied (6 kV settings Cu filter thicknesses CsI:Tl scintillator thicknesses) and for each of the three numerical observer models, the detectability index for soft-tissue lesions always exceeded the microcalcification cluster.
Conclusion: When the lesion type is unknown, such as during breast cancer screening, it is more appropriate to optimize the system parameters for the task of detecting a microcalcification cluster, as the detectability index for the soft-tissue lesion exceeded that for the microcalcification cluster for all conditions investigated.
{"title":"Soft-tissue lesion and microcalcification detectability in cone-beam breast CT: cascaded system analysis.","authors":"Thomas Larsen, Hsin Wu Tseng, Jing-Tzyh Alan Chiang, Srinivasan Vedantham","doi":"10.1117/1.JMI.12.6.063501","DOIUrl":"10.1117/1.JMI.12.6.063501","url":null,"abstract":"<p><strong>Purpose: </strong>We aim to investigate the performance of dedicated breast computed tomography (CT) for the detection of soft-tissue lesions and compare it to the detection of microcalcification clusters using cascaded systems analysis, with the intent of identifying which lesion type should be used for system optimization.</p><p><strong>Approach: </strong>Signal and noise were propagated through the imaging chain using a cascaded systems model to obtain the modulation transfer function and noise power spectrum. Two imaging tasks were considered: a soft-tissue mass lesion modeled as a disk of 4 mm diameter and a cluster of microcalcifications modeled as calcium carbonate spheres of <math><mrow><mn>220</mn> <mtext> </mtext> <mi>μ</mi> <mi>m</mi></mrow> </math> diameter. Detectability indices using three numerical observer models were obtained for various scintillator thicknesses and acquisition conditions at a fixed 4.5 mGy mean glandular dose.</p><p><strong>Results: </strong>Detectability index trends are reversed between soft-tissue lesion and microcalcification cluster for the range of X-ray tube voltages and filtrations studied, indicating a potential need for compromise. However, for each of the 150 combinations studied (6 kV settings <math><mrow><mo>×</mo> <mtext> </mtext> <mn>5</mn></mrow> </math> Cu filter thicknesses <math><mrow><mo>×</mo> <mtext> </mtext> <mn>5</mn></mrow> </math> CsI:Tl scintillator thicknesses) and for each of the three numerical observer models, the detectability index for soft-tissue lesions always exceeded the microcalcification cluster.</p><p><strong>Conclusion: </strong>When the lesion type is unknown, such as during breast cancer screening, it is more appropriate to optimize the system parameters for the task of detecting a microcalcification cluster, as the detectability index for the soft-tissue lesion exceeded that for the microcalcification cluster for all conditions investigated.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 6","pages":"063501"},"PeriodicalIF":1.7,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12704369/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145769526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-01Epub Date: 2025-12-27DOI: 10.1117/1.JMI.12.6.063502
Kristina Tri Wigati, Hilde Bosmans, Joke Binst, Kim Lemmens, Annelies Jacobs, Djarwani S Soejoko, Nicholas Marshall
Purpose: X-ray detector noise decomposition and normalized noise power spectrum (NNPS) are two metrics proposed in the European Guidelines for the quality control (QC) of digital mammography (DM) systems. We aim to examine the reproducibility of these metrics in longitudinal testing and the relevance of the limiting values in the Guidelines and produce device-specific performance data.
Approach: Semiannual QC data for 55 DM systems (16 models, 6 manufacturers) were retrieved from our medical physics archive, giving a total of 455 QC tests covering a period of 5 years (10 tests). Average values of the detector response function fit coefficients, the electronic, quantum and structure noise coefficients, and the NNPS data were analyzed across DM models and longitudinally for a given device. The fraction of quantum noise at the clinical detector air kerma ( ) level was determined, along with the longitudinal change in NNPS.
Results: Coefficient of variation for the NNPS at was 0.04, averaged over all systems. Quantum noise evaluated at was the largest noise fraction for all devices studied, ranging from 61.2% to 98.2%. Electronic noise was generally lower for the latest X-ray detectors. Consequently, the fraction of quantum noise at has improved from 6.2% to 28.0% and corresponds to a broader quantum noise-limited range.
Conclusion: The evaluated noise metrics were reproducible, identified changes in X-ray detector performance, and have a useful role to play in QC testing. The average values have application as reference performance data.
目的:x射线探测器噪声分解和归一化噪声功率谱(NNPS)是欧洲数字乳房x线照相术(DM)系统质量控制(QC)指南中提出的两个指标。我们的目标是检查这些指标在纵向测试中的可重复性以及指南中限制值的相关性,并生成特定于设备的性能数据。方法:从我们的医学物理档案中检索55个DM系统(16种型号,6家制造商)的半年QC数据,共提供455次QC测试,涵盖5年(10次测试)。对给定器件的探测器响应函数拟合系数、电子、量子和结构噪声系数的平均值以及NNPS数据进行了跨DM模型和纵向分析。测定临床检测器空气孔径(DAK 50 mm)水平上的量子噪声分数,以及NNPS的纵向变化。结果:2.0 mm - 1时NNPS的变异系数为0.04,为所有系统的平均值。在DAK 50 mm处评估的量子噪声是所研究的所有器件中最大的噪声部分,范围从61.2%到98.2%。最新的x射线探测器的电子噪声一般较低。因此,量子噪声在DAK 50 mm处的比例从6.2%提高到28.0%,对应于更宽的量子噪声限制范围。结论:评价的噪声指标具有可重复性,可识别x射线探测器性能的变化,在QC检测中具有重要作用。其平均值可作为参考性能数据。
{"title":"In-depth look at the use of pixel variance and the noise power spectrum in digital mammography quality control.","authors":"Kristina Tri Wigati, Hilde Bosmans, Joke Binst, Kim Lemmens, Annelies Jacobs, Djarwani S Soejoko, Nicholas Marshall","doi":"10.1117/1.JMI.12.6.063502","DOIUrl":"https://doi.org/10.1117/1.JMI.12.6.063502","url":null,"abstract":"<p><strong>Purpose: </strong>X-ray detector noise decomposition and normalized noise power spectrum (NNPS) are two metrics proposed in the European Guidelines for the quality control (QC) of digital mammography (DM) systems. We aim to examine the reproducibility of these metrics in longitudinal testing and the relevance of the limiting values in the Guidelines and produce device-specific performance data.</p><p><strong>Approach: </strong>Semiannual QC data for 55 DM systems (16 models, 6 manufacturers) were retrieved from our medical physics archive, giving a total of 455 QC tests covering a period of 5 years (10 tests). Average values of the detector response function fit coefficients, the electronic, quantum and structure noise coefficients, and the NNPS data were analyzed across DM models and longitudinally for a given device. The fraction of quantum noise at the clinical detector air kerma ( <math> <mrow> <msub><mrow><mi>DAK</mi></mrow> <mrow><mn>50</mn> <mtext> </mtext> <mi>mm</mi></mrow> </msub> </mrow> </math> ) level was determined, along with the longitudinal change in NNPS.</p><p><strong>Results: </strong>Coefficient of variation for the NNPS at <math><mrow><mn>2.0</mn> <mtext> </mtext> <msup><mrow><mi>mm</mi></mrow> <mrow><mo>-</mo> <mn>1</mn></mrow> </msup> </mrow> </math> was 0.04, averaged over all systems. Quantum noise evaluated at <math> <mrow> <msub><mrow><mi>DAK</mi></mrow> <mrow><mn>50</mn> <mtext> </mtext> <mi>mm</mi></mrow> </msub> </mrow> </math> was the largest noise fraction for all devices studied, ranging from 61.2% to 98.2%. Electronic noise was generally lower for the latest X-ray detectors. Consequently, the fraction of quantum noise at <math> <mrow> <msub><mrow><mi>DAK</mi></mrow> <mrow><mn>50</mn> <mtext> </mtext> <mi>mm</mi></mrow> </msub> </mrow> </math> has improved from 6.2% to 28.0% and corresponds to a broader quantum noise-limited range.</p><p><strong>Conclusion: </strong>The evaluated noise metrics were reproducible, identified changes in X-ray detector performance, and have a useful role to play in QC testing. The average values have application as reference performance data.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 6","pages":"063502"},"PeriodicalIF":1.7,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12743580/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-01Epub Date: 2025-10-22DOI: 10.1117/1.JMI.12.6.061409
Anthony Bilic, Guangyu Sun, Ming Li, Md Sanzid Bin Hossain, Yu Tian, Wei Zhang, Laura Brattain, Dexter Hadley, Chen Chen
Purpose: Whole slide image (WSI) classification relies on multiple instance learning (MIL) with spatial patch features, but current methods struggle to capture global dependencies due to the immense size of WSIs and the local nature of patch embeddings. This limitation hinders the modeling of coarse structures essential for robust diagnostic prediction.
Approach: We propose Fourier Transform Multiple Instance Learning (FFT-MIL), a framework that augments MIL with a frequency-domain branch to provide compact global context. Low-frequency crops are extracted from WSIs via the Fast Fourier Transform and processed through a modular FFT-Block composed of convolutional layers and Min-Max normalization to mitigate the high variance of frequency data. The learned global frequency feature is fused with spatial patch features through lightweight integration strategies, enabling compatibility with diverse MIL architectures.
Results: FFT-MIL was evaluated across six state-of-the-art MIL methods on three public datasets (BRACS, LUAD, and IMP). Integration of the FFT-Block improved macro scores by an average of 3.51% and area under the curve by 1.51%, demonstrating consistent gains across architectures and datasets.
Conclusions: FFT-MIL establishes frequency-domain learning as an effective and efficient mechanism for capturing global dependencies in WSI classification, complementing spatial features and advancing the scalability and accuracy of MIL-based computational pathology. The source code is publicly available at https://github.com/irulenot/FFT-MIL.
{"title":"Fourier Transform Multiple Instance Learning for whole slide image classification.","authors":"Anthony Bilic, Guangyu Sun, Ming Li, Md Sanzid Bin Hossain, Yu Tian, Wei Zhang, Laura Brattain, Dexter Hadley, Chen Chen","doi":"10.1117/1.JMI.12.6.061409","DOIUrl":"https://doi.org/10.1117/1.JMI.12.6.061409","url":null,"abstract":"<p><strong>Purpose: </strong>Whole slide image (WSI) classification relies on multiple instance learning (MIL) with spatial patch features, but current methods struggle to capture global dependencies due to the immense size of WSIs and the local nature of patch embeddings. This limitation hinders the modeling of coarse structures essential for robust diagnostic prediction.</p><p><strong>Approach: </strong>We propose Fourier Transform Multiple Instance Learning (FFT-MIL), a framework that augments MIL with a frequency-domain branch to provide compact global context. Low-frequency crops are extracted from WSIs via the Fast Fourier Transform and processed through a modular FFT-Block composed of convolutional layers and Min-Max normalization to mitigate the high variance of frequency data. The learned global frequency feature is fused with spatial patch features through lightweight integration strategies, enabling compatibility with diverse MIL architectures.</p><p><strong>Results: </strong>FFT-MIL was evaluated across six state-of-the-art MIL methods on three public datasets (BRACS, LUAD, and IMP). Integration of the FFT-Block improved macro <math><mrow><mi>F</mi> <mn>1</mn></mrow> </math> scores by an average of 3.51% and area under the curve by 1.51%, demonstrating consistent gains across architectures and datasets.</p><p><strong>Conclusions: </strong>FFT-MIL establishes frequency-domain learning as an effective and efficient mechanism for capturing global dependencies in WSI classification, complementing spatial features and advancing the scalability and accuracy of MIL-based computational pathology. The source code is publicly available at https://github.com/irulenot/FFT-MIL.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 6","pages":"061409"},"PeriodicalIF":1.7,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12543031/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145356570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-01Epub Date: 2025-03-20DOI: 10.1117/1.JMI.12.S2.S22004
Kai Yang, Craig K Abbey, Bruno Barufaldi, Xinhua Li, Theodore A Marschall, Bob Liu
Purpose: We aim to analyze higher-order textural components of digital breast tomosynthesis (DBT) images to quantify differences in the appearance of breast parenchyma produced by different vendors.
Approach: We included consecutive women who had normal screening DBT exams in January 2018 from a GE system and in adjacent years from Hologic systems. Laplacian fractional entropy (LFE), as a measure of non-Gaussian statistical properties of breast tissue texture, was calculated from for-presentation Craniocaudal (CC) view DBT slices and synthetic mammograms (SMs) through frequency-based filtering with Gabor filters, which were considered mathematical models for human visual response to image textures. The LFE values were compared within and across subjects and vendors along with secondary parameters (laterality, year-to-year, modality, and breast density) via two-way analysis of variance (ANOVA) tests using frequency as one of the two independent variables, and a -value was considered statistically significant.
Results: A total of 8529 CC view DBT slices and SM images from 73 screening exams in 25 women were analyzed. Significant differences in LFE were observed for different frequencies ( ) and across vendors (GE versus Hologic DBT: , GE versus Hologic SM: ).
Conclusion: Significant differences in perception of breast parenchyma textures among two DBT vendors were demonstrated via higher-order non-Gaussian statistical properties. This finding extends previously observed differences in anatomical noise power spectra in DBT images and provides quantitative evidence to support caution in across-vendor comparative reading and will be beneficial to facilitate future development of vendor-neutral artificial intelligence algorithms for breast cancer screening.
目的:我们旨在分析数字乳腺断层合成(DBT)图像的高阶纹理成分,以量化不同供应商生产的乳腺实质外观的差异。方法:我们纳入了2018年1月GE系统和Hologic系统连续进行正常筛查DBT检查的女性。Laplacian分数熵(LFE)作为乳腺组织纹理非高斯统计特性的度量,通过基于频率的Gabor滤波器滤波,从呈现颅侧(CC)视图的DBT切片和合成乳房x光片(SMs)中计算得到,这被认为是人类视觉响应图像纹理的数学模型。使用频率作为两个自变量之一,通过双向方差分析(ANOVA)检验,比较受试者和供应商内部和之间的LFE值以及次要参数(横向性、年度、模态和乳腺密度),P值0.05被认为具有统计学意义。结果:共分析了25例女性73次筛查检查的8529张CC视图DBT切片和SM图像。不同频率和不同供应商(GE与Hologic DBT: P 0.001, GE与Hologic SM: P 0.001)的LFE存在显著差异。结论:高阶非高斯统计特性证明了两种DBT供应商对乳腺实质纹理的感知存在显著差异。这一发现扩展了先前观察到的DBT图像中解剖噪声功率谱的差异,并提供了定量证据,支持跨供应商比较阅读的谨慎性,并将有助于促进供应商中立的乳腺癌筛查人工智能算法的未来发展。
{"title":"Frequency-based texture analysis of non-Gaussian properties of digital breast tomosynthesis images and comparison across two vendors.","authors":"Kai Yang, Craig K Abbey, Bruno Barufaldi, Xinhua Li, Theodore A Marschall, Bob Liu","doi":"10.1117/1.JMI.12.S2.S22004","DOIUrl":"10.1117/1.JMI.12.S2.S22004","url":null,"abstract":"<p><strong>Purpose: </strong>We aim to analyze higher-order textural components of digital breast tomosynthesis (DBT) images to quantify differences in the appearance of breast parenchyma produced by different vendors.</p><p><strong>Approach: </strong>We included consecutive women who had normal screening DBT exams in January 2018 from a GE system and in adjacent years from Hologic systems. Laplacian fractional entropy (LFE), as a measure of non-Gaussian statistical properties of breast tissue texture, was calculated from for-presentation Craniocaudal (CC) view DBT slices and synthetic mammograms (SMs) through frequency-based filtering with Gabor filters, which were considered mathematical models for human visual response to image textures. The LFE values were compared within and across subjects and vendors along with secondary parameters (laterality, year-to-year, modality, and breast density) via two-way analysis of variance (ANOVA) tests using frequency as one of the two independent variables, and a <math><mrow><mi>P</mi></mrow> </math> -value <math><mrow><mo><</mo> <mn>0.05</mn></mrow> </math> was considered statistically significant.</p><p><strong>Results: </strong>A total of 8529 CC view DBT slices and SM images from 73 screening exams in 25 women were analyzed. Significant differences in LFE were observed for different frequencies ( <math><mrow><mi>P</mi> <mo><</mo> <mn>0.001</mn></mrow> </math> ) and across vendors (GE versus Hologic DBT: <math><mrow><mi>P</mi> <mo><</mo> <mn>0.001</mn></mrow> </math> , GE versus Hologic SM: <math><mrow><mi>P</mi> <mo><</mo> <mn>0.001</mn></mrow> </math> ).</p><p><strong>Conclusion: </strong>Significant differences in perception of breast parenchyma textures among two DBT vendors were demonstrated via higher-order non-Gaussian statistical properties. This finding extends previously observed differences in anatomical noise power spectra in DBT images and provides quantitative evidence to support caution in across-vendor comparative reading and will be beneficial to facilitate future development of vendor-neutral artificial intelligence algorithms for breast cancer screening.</p>","PeriodicalId":47707,"journal":{"name":"Journal of Medical Imaging","volume":"12 Suppl 2","pages":"S22004"},"PeriodicalIF":1.9,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11925074/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143694062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}