Journal of X-Ray Science and Technology最新文献_第4页

MAFA-Uformer: Multi-attention and dual-branch feature aggregation U-shaped transformer for sparse-view CT reconstruction.

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2025-01-01 Epub Date: 2025-01-08 DOI: 10.1177/08953996241300016

Xuan Zhang, Chenyun Fang, Zhiwei Qiao

Background: Although computed tomography (CT) is widely employed in disease detection, X-ray radiation may pose a risk to the health of patients. Reducing the projection views is a common method, however, the reconstructed images often suffer from streak artifacts.

Purpose: In previous related works, it can be found that the convolutional neural network (CNN) is proficient in extracting local features, while the Transformer is adept at capturing global information. To suppress streak artifacts for sparse-view CT, this study aims to develop a method that combines the advantages of CNN and Transformer.

Methods: In this paper, we propose a Multi-Attention and Dual-Branch Feature Aggregation U-shaped Transformer network (MAFA-Uformer), which consists of two branches: CNN and Transformer. Firstly, with a coordinate attention mechanism, the Transformer branch can capture the overall structure and orientation information to provide a global context understanding of the image under reconstruction. Secondly, the CNN branch focuses on extracting crucial local features of images through channel spatial attention, thus enhancing detail recognition capabilities. Finally, through a feature fusion module, the global information from the Transformer and the local features from the CNN are integrated effectively.

Results: Experimental results demonstrate that our method achieves outstanding performance in terms of peak signal-to-noise ratio (PSNR), structural similarity (SSIM), and root mean square error (RMSE). Compared with Restormer, our model achieves significant improvements: PSNR increases by 0.76 dB, SSIM improves by 0.44%, and RMSE decreases by 8.55%.

Conclusion: Our method not only effectively suppresses artifacts but also better preserves details and features, thereby providing robust support for accurate diagnosis of CT images.

{"title":"MAFA-Uformer: Multi-attention and dual-branch feature aggregation U-shaped transformer for sparse-view CT reconstruction.","authors":"Xuan Zhang, Chenyun Fang, Zhiwei Qiao","doi":"10.1177/08953996241300016","DOIUrl":"10.1177/08953996241300016","url":null,"abstract":"Background: Although computed tomography (CT) is widely employed in disease detection, X-ray radiation may pose a risk to the health of patients. Reducing the projection views is a common method, however, the reconstructed images often suffer from streak artifacts.Purpose: In previous related works, it can be found that the convolutional neural network (CNN) is proficient in extracting local features, while the Transformer is adept at capturing global information. To suppress streak artifacts for sparse-view CT, this study aims to develop a method that combines the advantages of CNN and Transformer.Methods: In this paper, we propose a Multi-Attention and Dual-Branch Feature Aggregation U-shaped Transformer network (MAFA-Uformer), which consists of two branches: CNN and Transformer. Firstly, with a coordinate attention mechanism, the Transformer branch can capture the overall structure and orientation information to provide a global context understanding of the image under reconstruction. Secondly, the CNN branch focuses on extracting crucial local features of images through channel spatial attention, thus enhancing detail recognition capabilities. Finally, through a feature fusion module, the global information from the Transformer and the local features from the CNN are integrated effectively.Results: Experimental results demonstrate that our method achieves outstanding performance in terms of peak signal-to-noise ratio (PSNR), structural similarity (SSIM), and root mean square error (RMSE). Compared with Restormer, our model achieves significant improvements: PSNR increases by 0.76 dB, SSIM improves by 0.44%, and RMSE decreases by 8.55%.Conclusion: Our method not only effectively suppresses artifacts but also better preserves details and features, thereby providing robust support for accurate diagnosis of CT images.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"157-166"},"PeriodicalIF":1.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143460445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Mask R-CNN assisted diagnosis of spinal tuberculosis. 脊柱结核的假面 R-CNN 辅助诊断。

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2025-01-01 Epub Date: 2024-12-24 DOI: 10.1177/08953996241290326

Wenjun Li, Yanfan Li, Huan Peng, Wenjun Liang

The prevalence of spinal tuberculosis (ST) is particularly high in underdeveloped regions with inadequate medical conditions. This not only leads to misdiagnosis and delays in treatment progress but also contributes to the continued transmission of tuberculosis bacteria, posing a risk to other individuals. Currently, CT imaging is extensively utilized in computer-aided diagnosis (CAD). The main features of ST on CT images include bone destruction, osteosclerosis, sequestration formation, and intervertebral disc damage. However, manual diagnosis by doctors may result in subjective judgments and misdiagnosis. Therefore, an accurate and objective method is needed for diagnosing of spinal tuberculosis. In this paper, we put forward an assistive diagnostic approach for spinal tuberculosis that is based on deep learning. The approach uses the Mask R-CNN model. Moreover, we modify the original model network by incorporating the ResPath and cbam* to improve the performance metrics, namely $m A P_{small}$ and F1-score. Meanwhile, other deep learning models such as Faster-RCNN and SSD were also compared. Experimental results demonstrate that the enhanced model can effectively identify spinal tuberculosis lesions, with an $m A P_{small}$ of 0.9175, surpassing the original model's 0.8340, and an F1-score of 0.9335, outperforming the original model's 0.8657.

{"title":"Mask R-CNN assisted diagnosis of spinal tuberculosis.","authors":"Wenjun Li, Yanfan Li, Huan Peng, Wenjun Liang","doi":"10.1177/08953996241290326","DOIUrl":"10.1177/08953996241290326","url":null,"abstract":"The prevalence of spinal tuberculosis (ST) is particularly high in underdeveloped regions with inadequate medical conditions. This not only leads to misdiagnosis and delays in treatment progress but also contributes to the continued transmission of tuberculosis bacteria, posing a risk to other individuals. Currently, CT imaging is extensively utilized in computer-aided diagnosis (CAD). The main features of ST on CT images include bone destruction, osteosclerosis, sequestration formation, and intervertebral disc damage. However, manual diagnosis by doctors may result in subjective judgments and misdiagnosis. Therefore, an accurate and objective method is needed for diagnosing of spinal tuberculosis. In this paper, we put forward an assistive diagnostic approach for spinal tuberculosis that is based on deep learning. The approach uses the Mask R-CNN model. Moreover, we modify the original model network by incorporating the ResPath and cbam* to improve the performance metrics, namely <math><mi>m</mi><mi>A</mi><msub><mi>P</mi><mrow><mrow><mi>small</mi></mrow></mrow></msub></math> and F1-score. Meanwhile, other deep learning models such as Faster-RCNN and SSD were also compared. Experimental results demonstrate that the enhanced model can effectively identify spinal tuberculosis lesions, with an <math><mi>m</mi><mi>A</mi><msub><mi>P</mi><mrow><mrow><mi>small</mi></mrow></mrow></msub></math> of 0.9175, surpassing the original model's 0.8340, and an F1-score of 0.9335, outperforming the original model's 0.8657.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"120-133"},"PeriodicalIF":1.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143460308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimization of dynamic multi-leaf collimator based on multi-objective particle swarm optimization algorithm.

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2025-01-01 Epub Date: 2024-12-26 DOI: 10.1177/08953996241304986

Jun Lv, Liuli Chen, Zhiqiang Zhu, Pengcheng Long, Liqin Hu

Background: The dynamic multi-leaf collimator (DMLC) plays a crucial role in shaping X-rays, significantly enhancing the precision, efficiency, and quality of tumor radiotherapy.

Objective: To improve the shaping effect of X-rays by optimizing the end structure of the DMLC leaf, which significantly impacts the collimator's performance.

Methods: This study introduces the innovative application of the multi-objective particle swarm optimization (MOPSO) algorithm to optimize DMLC parameters, including leaf end radius, source-to-leaf distance, leaf height, and tangent angle between the leaf end and the central axis. The main optimization objectives are to minimize the width and variance of the penumbra, defined as the distance between the 80% and 20% dose of X-rays on the isocenter plane, which directly impacts treatment accuracy.

Results: Structural optimization across various scenarios showed significant improvements in the size and uniformity of the penumbra, ensuring a more precise radiation dose. Based on the optimized structure, a three-dimensional model of the MLC was designed and an experimental prototype was fabricated for performance testing. The results indicate that the optimized MLC exhibits a smaller penumbra.

Conclusion: The proposed optimization method significantly enhances the precision of radiotherapy while minimizing radiation exposure to healthy tissue, representing a notable advancement in radiotherapy technology.

{"title":"Optimization of dynamic multi-leaf collimator based on multi-objective particle swarm optimization algorithm.","authors":"Jun Lv, Liuli Chen, Zhiqiang Zhu, Pengcheng Long, Liqin Hu","doi":"10.1177/08953996241304986","DOIUrl":"10.1177/08953996241304986","url":null,"abstract":"Background: The dynamic multi-leaf collimator (DMLC) plays a crucial role in shaping X-rays, significantly enhancing the precision, efficiency, and quality of tumor radiotherapy.Objective: To improve the shaping effect of X-rays by optimizing the end structure of the DMLC leaf, which significantly impacts the collimator's performance.Methods: This study introduces the innovative application of the multi-objective particle swarm optimization (MOPSO) algorithm to optimize DMLC parameters, including leaf end radius, source-to-leaf distance, leaf height, and tangent angle between the leaf end and the central axis. The main optimization objectives are to minimize the width and variance of the penumbra, defined as the distance between the 80% and 20% dose of X-rays on the isocenter plane, which directly impacts treatment accuracy.Results: Structural optimization across various scenarios showed significant improvements in the size and uniformity of the penumbra, ensuring a more precise radiation dose. Based on the optimized structure, a three-dimensional model of the MLC was designed and an experimental prototype was fabricated for performance testing. The results indicate that the optimized MLC exhibits a smaller penumbra.Conclusion: The proposed optimization method significantly enhances the precision of radiotherapy while minimizing radiation exposure to healthy tissue, representing a notable advancement in radiotherapy technology.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"145-156"},"PeriodicalIF":1.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143460312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Radiomics and deep learning features of pericoronary adipose tissue on non-contrast computerized tomography for predicting non-calcified plaques.

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2025-01-01 Epub Date: 2024-12-18 DOI: 10.1177/08953996241292476

Junli Yu, Yan Ding, Li Wang, Shunxin Hu, Ning Dong, Jiangnan Sheng, Yingna Ren, Ziyue Wang

Background: Inflammation of coronary arterial plaque is considered a key factor in the development of coronary heart disease. Early the plaque detection and timely treatment of the atherosclerosis could effectively reduce the risk of cardiovascular events. However, there is no study combining radiomics with deep learning techniques to predict non-calcified plaques (NCP) in coronary artery at present.

Objective: To investigate the value of combination of radiomics and deep learning features based on non-contrast computerized tomography (CT) scans of pericoronary adipose tissue (PCAT), integrating with clinical risk factors of patients, in identifying coronary inflammation and predicting the presence of NCP.

Methods: The clinical and imaging data of 353 patients were analyzed. The region of interest (ROI) of PCAT was manually outlined on non-contrast CT scan images, like coronary CT calcium score sequential images, then the radiomics and deep learning features in ROIs were extracted respectively. In training set (Center 1), after performing feature selection, radiomics and deep learning feature models were established, meanwhile, clinical models were built. Finally, combined models were developed out via integrating clinical, radiomics, and deep learning features. The predictive performance of the four feature model groups (clinical, radiomics, deep learning, and three combination) was assessed by seven different machine learning models through generation of receiver operating characteristic curves (ROC) and the calculation of area under the curve (AUC), sensitivity, specificity, and accuracy. Furthermore, the predictive performance of each model was validated in an external validation set (Center 2).

Results: For the single model comparation, eXtreme Gradient Boosting (XGBoost) showed the best performance among the clinical model group in the validation set. And Random Forest (RF) exhibited the best indicative performance not only among the radiomics feature group but also in the deep learning feature model group. What's more, among the combined model group, RF still displayed the best predictive performance, with the value of AUC, sensitivity, specificity, and accuracy in the validation set are 0.963, 0.857, 0.929, and 0.905.

Conclusion: The RF model in the combined model group based on non-contrast CT scan PCAT can predict the presence of NCP more accurately and has the potential for preliminary screening of the NCP.

{"title":"Radiomics and deep learning features of pericoronary adipose tissue on non-contrast computerized tomography for predicting non-calcified plaques.","authors":"Junli Yu, Yan Ding, Li Wang, Shunxin Hu, Ning Dong, Jiangnan Sheng, Yingna Ren, Ziyue Wang","doi":"10.1177/08953996241292476","DOIUrl":"10.1177/08953996241292476","url":null,"abstract":"Background: Inflammation of coronary arterial plaque is considered a key factor in the development of coronary heart disease. Early the plaque detection and timely treatment of the atherosclerosis could effectively reduce the risk of cardiovascular events. However, there is no study combining radiomics with deep learning techniques to predict non-calcified plaques (NCP) in coronary artery at present.Objective: To investigate the value of combination of radiomics and deep learning features based on non-contrast computerized tomography (CT) scans of pericoronary adipose tissue (PCAT), integrating with clinical risk factors of patients, in identifying coronary inflammation and predicting the presence of NCP.Methods: The clinical and imaging data of 353 patients were analyzed. The region of interest (ROI) of PCAT was manually outlined on non-contrast CT scan images, like coronary CT calcium score sequential images, then the radiomics and deep learning features in ROIs were extracted respectively. In training set (Center 1), after performing feature selection, radiomics and deep learning feature models were established, meanwhile, clinical models were built. Finally, combined models were developed out via integrating clinical, radiomics, and deep learning features. The predictive performance of the four feature model groups (clinical, radiomics, deep learning, and three combination) was assessed by seven different machine learning models through generation of receiver operating characteristic curves (ROC) and the calculation of area under the curve (AUC), sensitivity, specificity, and accuracy. Furthermore, the predictive performance of each model was validated in an external validation set (Center 2).Results: For the single model comparation, eXtreme Gradient Boosting (XGBoost) showed the best performance among the clinical model group in the validation set. And Random Forest (RF) exhibited the best indicative performance not only among the radiomics feature group but also in the deep learning feature model group. What's more, among the combined model group, RF still displayed the best predictive performance, with the value of AUC, sensitivity, specificity, and accuracy in the validation set are 0.963, 0.857, 0.929, and 0.905.Conclusion: The RF model in the combined model group based on non-contrast CT scan PCAT can predict the presence of NCP more accurately and has the potential for preliminary screening of the NCP.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"96-108"},"PeriodicalIF":1.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143460277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A class of Landweber-type iterative methods based on the Radon transform for incomplete view tomography.

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2025-01-01 Epub Date: 2025-01-10 DOI: 10.1177/08953996241301697

Duo Liu, Gangrong Qu

Background: We study the reconstruction problem for incomplete view tomography, including sparse view tomography and limited angle tomography, by the Landweber iteration and its accelerated version. Traditional implementations of these Landweber-type iterative methods necessitate multiple large-scale matrix-vector multiplications, which in turn require substantial time and storage resources.

Objective: This paper aims to develop and test a novel and efficient discretization approach for a class of Landweber-type methods that minimizes storage requirements by incorporating the specific structure of the incomplete view Radon transform.

Methods: We prove that the normal operator of incomplete view Radon transform in these methods is a compact convolution operator, and derive the explicit representation of its convolution kernel. Discretized by the pixel basis, these Landweber-type iterative methods can be implemented quickly and accurately by introducing a discretized convolution operation between two small-scale matrices with minimal storage requirements.

Results: For the simulated complete and limited angle data, the reconstruction results using various Landweber-type methods with our proposed discretization scheme achieve a 1-5dB improvement in PSNR and require one-third of computation time compared to the traditional approach. For the simulated sparse view data, our discretization scheme yields a valid image with the highest PSNR.

Conclusions: The Landweber-type iterative methods, when combined with our proposed discretization approach based on the Radon transform, are effective for addressing the incomplete view tomography problem.

{"title":"A class of Landweber-type iterative methods based on the Radon transform for incomplete view tomography.","authors":"Duo Liu, Gangrong Qu","doi":"10.1177/08953996241301697","DOIUrl":"10.1177/08953996241301697","url":null,"abstract":"Background: We study the reconstruction problem for incomplete view tomography, including sparse view tomography and limited angle tomography, by the Landweber iteration and its accelerated version. Traditional implementations of these Landweber-type iterative methods necessitate multiple large-scale matrix-vector multiplications, which in turn require substantial time and storage resources.Objective: This paper aims to develop and test a novel and efficient discretization approach for a class of Landweber-type methods that minimizes storage requirements by incorporating the specific structure of the incomplete view Radon transform.Methods: We prove that the normal operator of incomplete view Radon transform in these methods is a compact convolution operator, and derive the explicit representation of its convolution kernel. Discretized by the pixel basis, these Landweber-type iterative methods can be implemented quickly and accurately by introducing a discretized convolution operation between two small-scale matrices with minimal storage requirements.Results: For the simulated complete and limited angle data, the reconstruction results using various Landweber-type methods with our proposed discretization scheme achieve a 1-5dB improvement in PSNR and require one-third of computation time compared to the traditional approach. For the simulated sparse view data, our discretization scheme yields a valid image with the highest PSNR.Conclusions: The Landweber-type iterative methods, when combined with our proposed discretization approach based on the Radon transform, are effective for addressing the incomplete view tomography problem.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"187-203"},"PeriodicalIF":1.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143460283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A multimodal similarity-aware and knowledge-driven pre-training approach for reliable pneumoconiosis diagnosis.

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2025-01-01 Epub Date: 2025-01-13 DOI: 10.1177/08953996241296400

Xueting Ren, Guohua Ji, Surong Chu, Shinichi Yoshida, Juanjuan Zhao, Baoping Jia, Yan Qiang

Background: Pneumoconiosis staging is challenging due to the low clarity of X-ray images and the small, diffuse nature of the lesions. Additionally, the scarcity of annotated data makes it difficult to develop accurate staging models. Although clinical text reports provide valuable contextual information, existing works primarily focus on designing multimodal image-text contrastive learning tasks, neglecting the high similarity of pneumoconiosis imaging representations. This results in inadequate extraction of fine-grained multimodal information and underutilization of domain knowledge, limiting their application in medical tasks.

Objective: The study aims to address the limitations of current multimodal methods by proposing a new approach that improves the precision of pneumoconiosis diagnosis and staging through enhanced fine-grained learning and better utilization of domain knowledge.

Methods: The proposed Multimodal Similarity-aware and Knowledge-driven Pre-Training (MSK-PT) approach involves two stages. In the first stage, we deeply analyze the similar features of pneumoconiosis images and use a similarity-aware modality alignment strategy to explore the fine-grained representations and associated disturbances of pneumoconiosis lesions between images and texts, guiding the model to match more appropriate feature representations. In the second stage, we utilize data-associated features and pre-stored domain knowledge features as priors and constraints to guide the downstream model in the visual domain without annotations. To address potential erroneous labels generated by model predictions, we further introduce an uncertainty threshold strategy to mitigate the negative impact of imperfect prediction labels and enhance model interpretability.

Results: We collected and created the pneumoconiosis chest X-ray (PneumoCXR) dataset to evaluate our proposed MSK-PT method. The experimental results show that our method achieved a classification accuracy of 81.73%, outperforming the state-of-the-art algorithms by 2.53%.

Conclusions: MSK-PT showed diagnostic performance that matches or exceeds the average radiologist's level, even with limited labeled data, highlighting the method's effectiveness and robustness.

{"title":"A multimodal similarity-aware and knowledge-driven pre-training approach for reliable pneumoconiosis diagnosis.","authors":"Xueting Ren, Guohua Ji, Surong Chu, Shinichi Yoshida, Juanjuan Zhao, Baoping Jia, Yan Qiang","doi":"10.1177/08953996241296400","DOIUrl":"10.1177/08953996241296400","url":null,"abstract":"Background: Pneumoconiosis staging is challenging due to the low clarity of X-ray images and the small, diffuse nature of the lesions. Additionally, the scarcity of annotated data makes it difficult to develop accurate staging models. Although clinical text reports provide valuable contextual information, existing works primarily focus on designing multimodal image-text contrastive learning tasks, neglecting the high similarity of pneumoconiosis imaging representations. This results in inadequate extraction of fine-grained multimodal information and underutilization of domain knowledge, limiting their application in medical tasks.Objective: The study aims to address the limitations of current multimodal methods by proposing a new approach that improves the precision of pneumoconiosis diagnosis and staging through enhanced fine-grained learning and better utilization of domain knowledge.Methods: The proposed Multimodal Similarity-aware and Knowledge-driven Pre-Training (MSK-PT) approach involves two stages. In the first stage, we deeply analyze the similar features of pneumoconiosis images and use a similarity-aware modality alignment strategy to explore the fine-grained representations and associated disturbances of pneumoconiosis lesions between images and texts, guiding the model to match more appropriate feature representations. In the second stage, we utilize data-associated features and pre-stored domain knowledge features as priors and constraints to guide the downstream model in the visual domain without annotations. To address potential erroneous labels generated by model predictions, we further introduce an uncertainty threshold strategy to mitigate the negative impact of imperfect prediction labels and enhance model interpretability.Results: We collected and created the pneumoconiosis chest X-ray (PneumoCXR) dataset to evaluate our proposed MSK-PT method. The experimental results show that our method achieved a classification accuracy of 81.73%, outperforming the state-of-the-art algorithms by 2.53%.Conclusions: MSK-PT showed diagnostic performance that matches or exceeds the average radiologist's level, even with limited labeled data, highlighting the method's effectiveness and robustness.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"229-248"},"PeriodicalIF":1.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143460306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhanced swin transformer based tuberculosis classification with segmentation using chest X-ray.

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2025-01-01 Epub Date: 2025-01-08 DOI: 10.1177/08953996241300018

P Visu, V Sathiya, P Ajitha, R Surendran

Background:: Tuberculosis disease is the disease that causes significant morbidity and mortality worldwide. Thus, early detection of the disease is crucial for proper treatment and controlling the spread of Tuberculosis disease. Chest X-ray imaging is one of the most widely used diagnostic tools for detecting the Tuberculosis, which is time-consuming, and prone to errors. Nowadays, deep learning model provides the automated classification of medical images with promising outcome.

Objective:: Thus, this research introduced a deep learning based segmentation and classification model. Initially, the Adaptive Gaussian Filtering based pre-processing and data augmentation is performed to remove artefacts and biased outcome. Then, Attention UNet (A_UNet) based segmentation is proposed for segmenting the required region of Chest X-ray.

Methods:: Using the segmented outcome, Enhanced Swin Transformer (EnSTrans) model based Tuberculosis classification model is designed with Residual Pyramid Network based Multi-layer perceptron (MLP) layer for enhancing the classification accuracy.

Results:: Enhanced Lotus Effect Optimization (EnLeO) Algorithm is employed for the loss function optimization of the EnSTrans model.

Conclusions:: The proposed methods acquired the Accuracy, Recall, Precision, F-score, and Specificity of 99.0576%, 98.9459%, 99.145%, 98.96%, and 99.152% respectively.

{"title":"Enhanced swin transformer based tuberculosis classification with segmentation using chest X-ray.","authors":"P Visu, V Sathiya, P Ajitha, R Surendran","doi":"10.1177/08953996241300018","DOIUrl":"10.1177/08953996241300018","url":null,"abstract":"Background:: Tuberculosis disease is the disease that causes significant morbidity and mortality worldwide. Thus, early detection of the disease is crucial for proper treatment and controlling the spread of Tuberculosis disease. Chest X-ray imaging is one of the most widely used diagnostic tools for detecting the Tuberculosis, which is time-consuming, and prone to errors. Nowadays, deep learning model provides the automated classification of medical images with promising outcome.Objective:: Thus, this research introduced a deep learning based segmentation and classification model. Initially, the Adaptive Gaussian Filtering based pre-processing and data augmentation is performed to remove artefacts and biased outcome. Then, Attention UNet (A_UNet) based segmentation is proposed for segmenting the required region of Chest X-ray.Methods:: Using the segmented outcome, Enhanced Swin Transformer (EnSTrans) model based Tuberculosis classification model is designed with Residual Pyramid Network based Multi-layer perceptron (MLP) layer for enhancing the classification accuracy.Results:: Enhanced Lotus Effect Optimization (EnLeO) Algorithm is employed for the loss function optimization of the EnSTrans model.Conclusions:: The proposed methods acquired the Accuracy, Recall, Precision, F-score, and Specificity of 99.0576%, 98.9459%, 99.145%, 98.96%, and 99.152% respectively.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"167-186"},"PeriodicalIF":1.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143460441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Selecting projection views based on error equidistribution for computed tomography.

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2025-01-01 Epub Date: 2025-01-15 DOI: 10.1177/08953996241289267

Yinghui Zhang, Xing Zhao, Ke Chen, Hongwei Li

Background: Nonuniform sampling is a useful technique to optimize the acquisition of projections with a limited budget. Existing methods for selecting important projection views have limitations, such as relying on blueprint images or excessive computing resources.

Methods: We aim to develop a simple nonuniform sampling method for selecting informative projection views suitable for practical CT applications. The proposed algorithm is inspired by two key observations: projection errors contain angle-specific information, and adding views around error peaks effectively reduces errors and improves reconstruction. Given a budget and an initial view set, the proposed method involves: estimating projection errors based on current set of projection views, adding more projection views based on error equidistribution to smooth out errors, and final image reconstruction based on the new set of projection views. This process can be recursive, and the initial view can be obtained uniformly or from a prior for greater efficiency.

Results: Comparison with popular view selection algorithms using simulated and real data demonstrates consistently superior performance in identifying optimal views and generating high-quality reconstructions. Notably, the new algorithm performs well in both PSNR and SSIM metrics while being computationally efficient, enhancing its practicality for CT optimization.

Conclusions: A projection view selection algorithm based on error equidistribution is proposed, offering superior reconstruction quality and efficiency over existing methods. It is ready for real CT applications to optimize dose utilization.

{"title":"Selecting projection views based on error equidistribution for computed tomography.","authors":"Yinghui Zhang, Xing Zhao, Ke Chen, Hongwei Li","doi":"10.1177/08953996241289267","DOIUrl":"10.1177/08953996241289267","url":null,"abstract":"Background: Nonuniform sampling is a useful technique to optimize the acquisition of projections with a limited budget. Existing methods for selecting important projection views have limitations, such as relying on blueprint images or excessive computing resources.Methods: We aim to develop a simple nonuniform sampling method for selecting informative projection views suitable for practical CT applications. The proposed algorithm is inspired by two key observations: projection errors contain angle-specific information, and adding views around error peaks effectively reduces errors and improves reconstruction. Given a budget and an initial view set, the proposed method involves: estimating projection errors based on current set of projection views, adding more projection views based on error equidistribution to smooth out errors, and final image reconstruction based on the new set of projection views. This process can be recursive, and the initial view can be obtained uniformly or from a prior for greater efficiency.Results: Comparison with popular view selection algorithms using simulated and real data demonstrates consistently superior performance in identifying optimal views and generating high-quality reconstructions. Notably, the new algorithm performs well in both PSNR and SSIM metrics while being computationally efficient, enhancing its practicality for CT optimization.Conclusions: A projection view selection algorithm based on error equidistribution is proposed, offering superior reconstruction quality and efficiency over existing methods. It is ready for real CT applications to optimize dose utilization.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"249-269"},"PeriodicalIF":1.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143459999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MRI classification and discrimination of spinal schwannoma and meningioma based on deep learning. 基于深度学习的脊髓分裂瘤和脑膜瘤的磁共振成像分类和鉴别。

IF 1.7 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2025-01-01 Epub Date: 2024-11-27 DOI: 10.1177/08953996241289745

Yidan Liu, Zhenhua Zhou, Yuanjun Wang

Backgroud: Schwannoma (SCH) and meningiomas (MEN) are the two most common primary spinal cord tumors. Differentiating between them preoperatively remains a clinical challenge due to the substantial overlap in their clinical presentation and imaging characteristics.

Objective: The objective of this study is to facilitate early diagnosis of patients and reduce clinician stress by constructing a deep learning-based classification model for automatic diagnosis of schwannoma and meningiomas using magnetic resonance images (MRI).

Methods: We retrospectively collected MRI images of 74 patients with pathologically confirmed schwannoma and meningiomas from 2015 to 2020 at a local hosipital, and constructed a CNN model based on the PyTorch's deep learning framework for the discrimination between the two. First, a modified feature fusion CNN model (ResNet34-SKConv) was trained by introducing a selective convolutional kernel module into the original CNN model. The introduction of the selective convolutional kernel module enhances the network's focus on tumor features and effectively improves the network's performance. Finally, the trained model was used to process all the MRI image slices to achieve the classification of SCH and MEN patients by the voting prediction method.

Results: Using the 5-fold cross-validation method, this new ResNet34-SKConv model achieves a classification accuracy of 92.32%, a specificity of 95.87%, and a F1-score of 93.54, respectively.

Conclusion: This study demonstrated that a classification model using a deep learning network can be effective in achieving differential diagnosis of SCH and MEN. Thus, the new method has great potential for developing new computer-aided diagnosis and applications with future clinical practice.

{"title":"MRI classification and discrimination of spinal schwannoma and meningioma based on deep learning.","authors":"Yidan Liu, Zhenhua Zhou, Yuanjun Wang","doi":"10.1177/08953996241289745","DOIUrl":"10.1177/08953996241289745","url":null,"abstract":"Backgroud: Schwannoma (SCH) and meningiomas (MEN) are the two most common primary spinal cord tumors. Differentiating between them preoperatively remains a clinical challenge due to the substantial overlap in their clinical presentation and imaging characteristics.Objective: The objective of this study is to facilitate early diagnosis of patients and reduce clinician stress by constructing a deep learning-based classification model for automatic diagnosis of schwannoma and meningiomas using magnetic resonance images (MRI).Methods: We retrospectively collected MRI images of 74 patients with pathologically confirmed schwannoma and meningiomas from 2015 to 2020 at a local hosipital, and constructed a CNN model based on the PyTorch's deep learning framework for the discrimination between the two. First, a modified feature fusion CNN model (ResNet34-SKConv) was trained by introducing a selective convolutional kernel module into the original CNN model. The introduction of the selective convolutional kernel module enhances the network's focus on tumor features and effectively improves the network's performance. Finally, the trained model was used to process all the MRI image slices to achieve the classification of SCH and MEN patients by the voting prediction method.Results: Using the 5-fold cross-validation method, this new ResNet34-SKConv model achieves a classification accuracy of 92.32%, a specificity of 95.87%, and a F1-score of 93.54, respectively.Conclusion: This study demonstrated that a classification model using a deep learning network can be effective in achieving differential diagnosis of SCH and MEN. Thus, the new method has great potential for developing new computer-aided diagnosis and applications with future clinical practice.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"26-36"},"PeriodicalIF":1.7,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143460311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A comprehensive guide to content-based image retrieval algorithms with visualsift ensembling. 基于内容的图像检索算法与视觉漂移集合综合指南。

IF 3 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology

Pub Date : 2024-09-11 DOI: 10.3233/xst-240189

C Ramesh Babu Durai,R Sathesh Raaj,Sindhu Chandra Sekharan,V S Nishok

BACKGROUNDContent-based image retrieval (CBIR) systems are vital for managing the large volumes of data produced by medical imaging technologies. They enable efficient retrieval of relevant medical images from extensive databases, supporting clinical diagnosis, treatment planning, and medical research.OBJECTIVEThis study aims to enhance CBIR systems' effectiveness in medical image analysis by introducing the VisualSift Ensembling Integration with Attention Mechanisms (VEIAM). VEIAM seeks to improve diagnostic accuracy and retrieval efficiency by integrating robust feature extraction with dynamic attention mechanisms.METHODSVEIAM combines Scale-Invariant Feature Transform (SIFT) with selective attention mechanisms to emphasize crucial regions within medical images dynamically. Implemented in Python, the model integrates seamlessly into existing medical image analysis workflows, providing a robust and accessible tool for clinicians and researchers.RESULTSThe proposed VEIAM model demonstrated an impressive accuracy of 97.34% in classifying and retrieving medical images. This performance indicates VEIAM's capability to discern subtle patterns and textures critical for accurate diagnostics.CONCLUSIONSBy merging SIFT-based feature extraction with attention processes, VEIAM offers a discriminatively powerful approach to medical image analysis. Its high accuracy and efficiency in retrieving relevant medical images make it a promising tool for enhancing diagnostic processes and supporting medical research in CBIR systems.

背景基于内容的图像检索（CBIR）系统对管理医疗成像技术产生的大量数据至关重要。本研究旨在通过引入 VisualSift Ensembling Integration with Attention Mechanisms (VEIAM)，提高 CBIR 系统在医学图像分析中的有效性。方法VEIAM将规模不变特征变换（SIFT）与选择性注意机制相结合，动态强调医学图像中的关键区域。该模型采用 Python 语言实现，可无缝集成到现有的医学图像分析工作流程中，为临床医生和研究人员提供了一个强大且易于使用的工具。结果提出的 VEIAM 模型在医学图像分类和检索方面的准确率高达 97.34%，令人印象深刻。结论通过将基于 SIFT 的特征提取与注意过程相结合，VEIAM 为医学图像分析提供了一种具有强大判别能力的方法。VEIAM 在检索相关医学图像方面的高准确性和高效率使其成为一种很有前途的工具，可用于增强诊断过程和支持 CBIR 系统中的医学研究。

{"title":"A comprehensive guide to content-based image retrieval algorithms with visualsift ensembling.","authors":"C Ramesh Babu Durai,R Sathesh Raaj,Sindhu Chandra Sekharan,V S Nishok","doi":"10.3233/xst-240189","DOIUrl":"https://doi.org/10.3233/xst-240189","url":null,"abstract":"BACKGROUNDContent-based image retrieval (CBIR) systems are vital for managing the large volumes of data produced by medical imaging technologies. They enable efficient retrieval of relevant medical images from extensive databases, supporting clinical diagnosis, treatment planning, and medical research.OBJECTIVEThis study aims to enhance CBIR systems' effectiveness in medical image analysis by introducing the VisualSift Ensembling Integration with Attention Mechanisms (VEIAM). VEIAM seeks to improve diagnostic accuracy and retrieval efficiency by integrating robust feature extraction with dynamic attention mechanisms.METHODSVEIAM combines Scale-Invariant Feature Transform (SIFT) with selective attention mechanisms to emphasize crucial regions within medical images dynamically. Implemented in Python, the model integrates seamlessly into existing medical image analysis workflows, providing a robust and accessible tool for clinicians and researchers.RESULTSThe proposed VEIAM model demonstrated an impressive accuracy of 97.34% in classifying and retrieving medical images. This performance indicates VEIAM's capability to discern subtle patterns and textures critical for accurate diagnostics.CONCLUSIONSBy merging SIFT-based feature extraction with attention processes, VEIAM offers a discriminatively powerful approach to medical image analysis. Its high accuracy and efficiency in retrieving relevant medical images make it a promising tool for enhancing diagnostic processes and supporting medical research in CBIR systems.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":"79 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142258308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0