Pub Date : 2025-02-01Epub Date: 2024-10-11DOI: 10.1007/s11517-024-03213-w
Seungkeon Lee, Young Do Song, Eui Chul Lee
Stress is linked to health problems, increasing the need for immediate monitoring. Traditional methods like electrocardiograms or contact photoplethysmography require device attachment, causing discomfort, and ultra-short-term stress measurement research remains inadequate. This paper proposes a method for ultra-short-term stress monitoring using remote photoplethysmography (rPPG). Previous predictions of ultra-short-term stress have typically used pulse rate variability (PRV) features derived from time-segmented heart rate data. However, PRV varies at the same stress levels depending on heart rates, necessitating a new method to account for these differences. This study addressed this by segmenting rPPG data based on normal-to-normal intervals (NNIs), converted from peak-to-peak intervals, to predict ultra-short-term stress indices. We used NNI counts corresponding to average durations of 10, 20, and 30 s (13, 26, and 39 NNIs) to extract PRV features, predicting the Baevsky stress index through regressors. The Extra Trees Regressor achieved R2 scores of 0.6699 for 13 NNIs, 0.8751 for 26 NNIs, and 0.9358 for 39 NNIs, surpassing the time-segmented approach, which yielded 0.4162, 0.6528, and 0.7943 for 10, 20, and 30-s intervals, respectively. These findings demonstrate that using NNI counts for ultra-short-term stress prediction improves accuracy by accounting for individual bio-signal variations.
压力与健康问题息息相关,因此更需要即时监测。传统方法,如心电图或接触式光电血压计,需要安装设备,会造成不适,而且超短期压力测量研究仍然不足。本文提出了一种利用远程光心动图(rPPG)进行超短期应激监测的方法。以往对超短期压力的预测通常使用从时间片段心率数据中得出的脉率变异性(PRV)特征。然而,在相同的压力水平下,PRV 会因心率的不同而变化,因此需要一种新的方法来解释这些差异。为了解决这个问题,本研究根据正常到正常间期(NNI)对 rPPG 数据进行分段,并从峰值到峰值间期进行转换,以预测超短期压力指数。我们使用与 10、20 和 30 秒(13、26 和 39 个 NNI)平均持续时间相对应的 NNI 计数提取 PRV 特征,并通过回归因子预测 Baevsky 压力指数。额外树回归器对 13 个 NNI 的 R2 得分为 0.6699,对 26 个 NNI 的 R2 得分为 0.8751,对 39 个 NNI 的 R2 得分为 0.9358,超过了时间分段方法,后者对 10、20 和 30 秒间隔的 R2 得分分别为 0.4162、0.6528 和 0.7943。这些研究结果表明,使用 NNI 计数进行超短期应激预测可通过考虑个体生物信号的变化来提高准确性。
{"title":"Ultra-short-term stress measurement using RGB camera-based remote photoplethysmography with reduced effects of Individual differences in heart rate.","authors":"Seungkeon Lee, Young Do Song, Eui Chul Lee","doi":"10.1007/s11517-024-03213-w","DOIUrl":"10.1007/s11517-024-03213-w","url":null,"abstract":"<p><p>Stress is linked to health problems, increasing the need for immediate monitoring. Traditional methods like electrocardiograms or contact photoplethysmography require device attachment, causing discomfort, and ultra-short-term stress measurement research remains inadequate. This paper proposes a method for ultra-short-term stress monitoring using remote photoplethysmography (rPPG). Previous predictions of ultra-short-term stress have typically used pulse rate variability (PRV) features derived from time-segmented heart rate data. However, PRV varies at the same stress levels depending on heart rates, necessitating a new method to account for these differences. This study addressed this by segmenting rPPG data based on normal-to-normal intervals (NNIs), converted from peak-to-peak intervals, to predict ultra-short-term stress indices. We used NNI counts corresponding to average durations of 10, 20, and 30 s (13, 26, and 39 NNIs) to extract PRV features, predicting the Baevsky stress index through regressors. The Extra Trees Regressor achieved R<sup>2</sup> scores of 0.6699 for 13 NNIs, 0.8751 for 26 NNIs, and 0.9358 for 39 NNIs, surpassing the time-segmented approach, which yielded 0.4162, 0.6528, and 0.7943 for 10, 20, and 30-s intervals, respectively. These findings demonstrate that using NNI counts for ultra-short-term stress prediction improves accuracy by accounting for individual bio-signal variations.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"497-510"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142401794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-01Epub Date: 2024-10-14DOI: 10.1007/s11517-024-03207-8
Lizhi Pan, Zhongyi Ding, Haifeng Zhao, Ruinan Mu, Jianmin Li
Human-machine interface (HMI) has been extensively developed and applied in rehabilitation. However, the performance of amputees on continuous movement decoding was significantly decreased compared with that of able-bodied individuals. To explore the impact of the absence of joint movements on the performance of HMI in rehabilitation, a generic musculoskeletal model (MM) was employed in this study to evaluate and compare the performance of subjects completing a series of on-line tasks with the wrist and metacarpophalangeal (MCP) joints unconstrained and constrained. The performance of the generic MM has been demonstrated in previous studies. The electromyography (EMG) signals of four muscles were employed as inputs of the generic MM to realize the continuous movement decoding of wrist and MCP joints. Ten able-bodied subjects were recruited to perform the on-line tasks. The completion time, the number of overshoots, and the path efficiency of the tasks were taken as the indexes to quantify the subjects' performance. The muscle activation associated with the movement was analyzed. Across all tasks and subjects, the average values of the three indexes with the joints unconstrained were 7.7 s, 0.59, and 0.38, respectively, while those with the joints constrained were 17.86 s, 1.47, and 0.22, respectively. The results demonstrated that the subjects performed better with the wrist and MCP joints unconstrained than with those joints constrained in the on-line tasks, suggesting that the absence of joint movements can be a reason of the decreased performance of continuous movement decoding with HMIs. Meanwhile, it is revealed that the different performance on motion behaviors is caused by the absence of joint movements.
人机界面(HMI)已在康复领域得到广泛开发和应用。然而,与健全人相比,截肢者在连续运动解码方面的表现明显下降。为了探索关节运动缺失对康复人机界面性能的影响,本研究采用了通用肌肉骨骼模型(MM)来评估和比较受试者在腕关节和掌指关节(MCP)无约束和受约束的情况下完成一系列在线任务的性能。通用 MM 的性能已在之前的研究中得到证实。通用 MM 采用四块肌肉的肌电图(EMG)信号作为输入,以实现腕关节和掌指关节的连续运动解码。研究人员招募了 10 名健全的受试者完成在线任务。任务的完成时间、过冲次数和路径效率作为量化受试者表现的指标。对与动作相关的肌肉激活进行了分析。在所有任务和受试者中,关节未受约束时,这三项指标的平均值分别为 7.7 秒、0.59 和 0.38,而关节受约束时,这三项指标的平均值分别为 17.86 秒、1.47 和 0.22。结果表明,在联机任务中,受试者在腕关节和 MCP 关节未受约束的情况下比关节受约束的情况下表现更好,这表明关节运动的缺失可能是人机界面连续运动解码性能下降的一个原因。同时,研究还揭示了运动行为上的不同表现是由关节运动的缺失造成的。
{"title":"Comparing on-line continuous movement decoding with joints unconstrained and constrained based on a generic musculoskeletal model.","authors":"Lizhi Pan, Zhongyi Ding, Haifeng Zhao, Ruinan Mu, Jianmin Li","doi":"10.1007/s11517-024-03207-8","DOIUrl":"10.1007/s11517-024-03207-8","url":null,"abstract":"<p><p>Human-machine interface (HMI) has been extensively developed and applied in rehabilitation. However, the performance of amputees on continuous movement decoding was significantly decreased compared with that of able-bodied individuals. To explore the impact of the absence of joint movements on the performance of HMI in rehabilitation, a generic musculoskeletal model (MM) was employed in this study to evaluate and compare the performance of subjects completing a series of on-line tasks with the wrist and metacarpophalangeal (MCP) joints unconstrained and constrained. The performance of the generic MM has been demonstrated in previous studies. The electromyography (EMG) signals of four muscles were employed as inputs of the generic MM to realize the continuous movement decoding of wrist and MCP joints. Ten able-bodied subjects were recruited to perform the on-line tasks. The completion time, the number of overshoots, and the path efficiency of the tasks were taken as the indexes to quantify the subjects' performance. The muscle activation associated with the movement was analyzed. Across all tasks and subjects, the average values of the three indexes with the joints unconstrained were 7.7 s, 0.59, and 0.38, respectively, while those with the joints constrained were 17.86 s, 1.47, and 0.22, respectively. The results demonstrated that the subjects performed better with the wrist and MCP joints unconstrained than with those joints constrained in the on-line tasks, suggesting that the absence of joint movements can be a reason of the decreased performance of continuous movement decoding with HMIs. Meanwhile, it is revealed that the different performance on motion behaviors is caused by the absence of joint movements.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"525-533"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-01Epub Date: 2024-10-10DOI: 10.1007/s11517-024-03210-z
Khushi Yadav, Yasha Hasija
This study explores the bidirectional relation of esophageal squamous cell carcinoma (ESCC) and oral squamous cell carcinoma (OSCC), examining shared risk factors and underlying molecular mechanisms. By employing random forest (RF) classifier, enhanced with interpretable machine learning (IML) through SHapley Additive exPlanations (SHAP), we analyzed gene expression from two GEO datasets (GSE30784 and GSE44021). The GSE30784 dataset comprises 167 OSCC samples and 45 control group, whereas the GSE44021 dataset encompasses 113 ESCC samples and 113 control group. Our analysis led to identification of 20 key genes, such as XBP1, VGLL1, and RAD1, which are significantly associated with development of ESCC and OSCC. Further investigations were conducted using tools like NetworkAnalyst 3.0, Single Cell Portal, and miRNET 2.0, which highlighted complex interactions between these genes and specific miRNA targets including hsa-mir-124-3p and hsa-mir-1-3p. Our model achieved high precision in identifying genes linked to crucial processes like programmed cell death and cancer pathways, suggesting new avenues for diagnosis and treatment. This study confirms the bidirectional relationship between OSCC and ESCC, laying groundwork for targeted therapeutic approaches. This study helps to identify shared biological pathways and genetic factors of these conditions for designing personalized medicine strategies and to improve disease management.
{"title":"Integrated analysis of gene expressions and targeted mirnas for explaining crosstalk between oral and esophageal squamous cell carcinomas through an interpretable machine learning approach.","authors":"Khushi Yadav, Yasha Hasija","doi":"10.1007/s11517-024-03210-z","DOIUrl":"10.1007/s11517-024-03210-z","url":null,"abstract":"<p><p>This study explores the bidirectional relation of esophageal squamous cell carcinoma (ESCC) and oral squamous cell carcinoma (OSCC), examining shared risk factors and underlying molecular mechanisms. By employing random forest (RF) classifier, enhanced with interpretable machine learning (IML) through SHapley Additive exPlanations (SHAP), we analyzed gene expression from two GEO datasets (GSE30784 and GSE44021). The GSE30784 dataset comprises 167 OSCC samples and 45 control group, whereas the GSE44021 dataset encompasses 113 ESCC samples and 113 control group. Our analysis led to identification of 20 key genes, such as XBP1, VGLL1, and RAD1, which are significantly associated with development of ESCC and OSCC. Further investigations were conducted using tools like NetworkAnalyst 3.0, Single Cell Portal, and miRNET 2.0, which highlighted complex interactions between these genes and specific miRNA targets including hsa-mir-124-3p and hsa-mir-1-3p. Our model achieved high precision in identifying genes linked to crucial processes like programmed cell death and cancer pathways, suggesting new avenues for diagnosis and treatment. This study confirms the bidirectional relationship between OSCC and ESCC, laying groundwork for targeted therapeutic approaches. This study helps to identify shared biological pathways and genetic factors of these conditions for designing personalized medicine strategies and to improve disease management.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"483-495"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142394778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-01Epub Date: 2024-10-10DOI: 10.1007/s11517-024-03215-8
Roberta Scuoppo, Salvatore Castelbuono, Stefano Cannata, Giovanni Gentile, Valentina Agnese, Diego Bellavia, Caterina Gandolfo, Salvatore Pasta
Purpose: In silico trials using computational modeling and simulations can complement clinical trials to improve the time-to-market of complex cardiovascular devices in humans. This study aims to investigate the significance of synthetic data in developing in silico trials for assessing the safety and efficacy of cardiovascular devices, focusing on bioprostheses designed for transcatheter aortic valve implantation (TAVI).
Methods: A statistical shape model (SSM) was employed to extract uncorrelated shape features from TAVI patients, enabling the augmentation of the original patient population into a clinically validated synthetic cohort. Machine learning techniques were utilized not only for risk stratification and classification but also for predicting the physiological variability within the original patient population.
Results: By randomly varying the statistical shape modes within a range of ± 2σ, a hundred virtual patients were generated, forming the synthetic cohort. Validation against the original patient population was conducted using morphological measurements. Support vector machine regression, based on selected shape modes (principal component scores), effectively predicted the peak pressure gradient across the stenosis (R-squared of 0.551 and RMSE of 11.67 mmHg). Multilayer perceptron neural network accurately predicted the optimal device size for implantation with high sensitivity and specificity (AUC = 0.98).
Conclusion: The study highlights the potential of integrating computational predictions, advanced machine learning techniques, and synthetic data generation to improve predictive accuracy and assess TAVI-related outcomes through in silico trials.
{"title":"Generation of a virtual cohort of TAVI patients for in silico trials: a statistical shape and machine learning analysis.","authors":"Roberta Scuoppo, Salvatore Castelbuono, Stefano Cannata, Giovanni Gentile, Valentina Agnese, Diego Bellavia, Caterina Gandolfo, Salvatore Pasta","doi":"10.1007/s11517-024-03215-8","DOIUrl":"10.1007/s11517-024-03215-8","url":null,"abstract":"<p><strong>Purpose: </strong>In silico trials using computational modeling and simulations can complement clinical trials to improve the time-to-market of complex cardiovascular devices in humans. This study aims to investigate the significance of synthetic data in developing in silico trials for assessing the safety and efficacy of cardiovascular devices, focusing on bioprostheses designed for transcatheter aortic valve implantation (TAVI).</p><p><strong>Methods: </strong>A statistical shape model (SSM) was employed to extract uncorrelated shape features from TAVI patients, enabling the augmentation of the original patient population into a clinically validated synthetic cohort. Machine learning techniques were utilized not only for risk stratification and classification but also for predicting the physiological variability within the original patient population.</p><p><strong>Results: </strong>By randomly varying the statistical shape modes within a range of ± 2σ, a hundred virtual patients were generated, forming the synthetic cohort. Validation against the original patient population was conducted using morphological measurements. Support vector machine regression, based on selected shape modes (principal component scores), effectively predicted the peak pressure gradient across the stenosis (R-squared of 0.551 and RMSE of 11.67 mmHg). Multilayer perceptron neural network accurately predicted the optimal device size for implantation with high sensitivity and specificity (AUC = 0.98).</p><p><strong>Conclusion: </strong>The study highlights the potential of integrating computational predictions, advanced machine learning techniques, and synthetic data generation to improve predictive accuracy and assess TAVI-related outcomes through in silico trials.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"467-482"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11750893/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142401783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-01Epub Date: 2024-10-17DOI: 10.1007/s11517-024-03202-z
Angel Rio-Alvarez, Pablo García Marcos, Paula Puerta González, Esther Serrano-Pertierra, Antonello Novelli, M Teresa Fernández-Sánchez, Víctor M González
The counting and characterization of neurons in primary cultures have long been areas of significant scientific interest due to their multifaceted applications, ranging from neuronal viability assessment to the study of neuronal development. Traditional methods, often relying on fluorescence or colorimetric staining and manual segmentation, are time consuming, labor intensive, and prone to error, raising the need for the development of automated and reliable methods. This paper delves into the evaluation of three pivotal deep learning techniques: semantic segmentation, which allows for pixel-level classification and is solely suited for characterization; object detection, which focuses on counting and locating neurons; and instance segmentation, which amalgamates the features of the other two but employing more intricate structures. The goal of this research is to discern what technique or combination of those techniques yields the optimal results for automatic counting and characterization of neurons in images of neuronal cultures. Following rigorous experimentation, we conclude that instance segmentation stands out, providing superior outcomes for both challenges.
{"title":"Evaluating deep learning techniques for optimal neurons counting and characterization in complex neuronal cultures.","authors":"Angel Rio-Alvarez, Pablo García Marcos, Paula Puerta González, Esther Serrano-Pertierra, Antonello Novelli, M Teresa Fernández-Sánchez, Víctor M González","doi":"10.1007/s11517-024-03202-z","DOIUrl":"10.1007/s11517-024-03202-z","url":null,"abstract":"<p><p>The counting and characterization of neurons in primary cultures have long been areas of significant scientific interest due to their multifaceted applications, ranging from neuronal viability assessment to the study of neuronal development. Traditional methods, often relying on fluorescence or colorimetric staining and manual segmentation, are time consuming, labor intensive, and prone to error, raising the need for the development of automated and reliable methods. This paper delves into the evaluation of three pivotal deep learning techniques: semantic segmentation, which allows for pixel-level classification and is solely suited for characterization; object detection, which focuses on counting and locating neurons; and instance segmentation, which amalgamates the features of the other two but employing more intricate structures. The goal of this research is to discern what technique or combination of those techniques yields the optimal results for automatic counting and characterization of neurons in images of neuronal cultures. Following rigorous experimentation, we conclude that instance segmentation stands out, providing superior outcomes for both challenges.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"545-560"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11750910/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The aim of this study was to investigate and compare the biomechanical properties of the conventional and novel hip prosthetic socket by using the finite element and gait analysis. According to the CT scan model of the subject's residual limb, the bones, soft tissues, and the socket model were reconstructed in three dimensions by using inverse modeling. The distribution of normal and shear stresses at the residual limb-socket interface under the standing condition was investigated using the finite element method and verified by designing a pressure acquisition module system. The gait experiment compared and analyzed the conventional and novel sockets. The results show that the simulation results are consistent with the experimental data. The novel socket exhibited superior stress performance and gait outcomes compared to the conventional design. Our findings provide a research basis for evaluating the comfort of the hip prosthetic socket, optimizing and designing the structure of the socket of the hip.
{"title":"Comparative biomechanical analysis of a conventional/novel hip prosthetic socket.","authors":"Yu Qian, Yunzhang Cheng, Shiyao Chen, Mingwei Zhang, Yingyu Fang, Tianyi Zhang","doi":"10.1007/s11517-024-03206-9","DOIUrl":"10.1007/s11517-024-03206-9","url":null,"abstract":"<p><p>The aim of this study was to investigate and compare the biomechanical properties of the conventional and novel hip prosthetic socket by using the finite element and gait analysis. According to the CT scan model of the subject's residual limb, the bones, soft tissues, and the socket model were reconstructed in three dimensions by using inverse modeling. The distribution of normal and shear stresses at the residual limb-socket interface under the standing condition was investigated using the finite element method and verified by designing a pressure acquisition module system. The gait experiment compared and analyzed the conventional and novel sockets. The results show that the simulation results are consistent with the experimental data. The novel socket exhibited superior stress performance and gait outcomes compared to the conventional design. Our findings provide a research basis for evaluating the comfort of the hip prosthetic socket, optimizing and designing the structure of the socket of the hip.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"417-428"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142367226","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-01Epub Date: 2024-10-03DOI: 10.1007/s11517-024-03197-7
Liangquan Yan, Jumin Zhao, Danyang Shi, Dengao Li, Yi Liu
Heart failure represents the ultimate stage in the progression of diverse cardiac ailments. Throughout the management of heart failure, physicians require observation of medical imagery to formulate therapeutic regimens for patients. Automated report generation technology serves as a tool aiding physicians in patient management. However, previous studies failed to generate targeted reports for specific diseases. To produce high-quality medical reports with greater relevance across diverse conditions, we introduce an automatic report generation model HF-CMN, tailored to heart failure. Firstly, the generated report includes comprehensive information pertaining to heart failure gleaned from chest radiographs. Additionally, we construct a storage query matrix grouping based on a multi-label type, enhancing the accuracy of our model in aligning images with text. Experimental results demonstrate that our method can generate reports strongly correlated with heart failure and outperforms most other advanced methods on benchmark datasets MIMIC-CXR and IU X-Ray. Further analysis confirms that our method achieves superior alignment between images and texts, resulting in higher-quality reports.
心力衰竭是各种心脏疾病发展的终极阶段。在心力衰竭的整个治疗过程中,医生需要观察医疗图像,为患者制定治疗方案。自动报告生成技术是帮助医生管理病人的一种工具。然而,以往的研究未能针对特定疾病生成有针对性的报告。为了在各种疾病中生成更有针对性的高质量医疗报告,我们引入了一种针对心力衰竭的自动报告生成模型 HF-CMN。首先,生成的报告包括从胸片中收集到的有关心力衰竭的全面信息。此外,我们还构建了基于多标签类型的存储查询矩阵分组,从而提高了模型在图像与文本对齐方面的准确性。实验结果表明,我们的方法可以生成与心衰密切相关的报告,在基准数据集 MIMIC-CXR 和 IU X-Ray 上的表现优于其他大多数先进方法。进一步的分析证实,我们的方法实现了图像与文本之间的卓越对齐,从而生成了更高质量的报告。
{"title":"HF-CMN: a medical report generation model for heart failure.","authors":"Liangquan Yan, Jumin Zhao, Danyang Shi, Dengao Li, Yi Liu","doi":"10.1007/s11517-024-03197-7","DOIUrl":"10.1007/s11517-024-03197-7","url":null,"abstract":"<p><p>Heart failure represents the ultimate stage in the progression of diverse cardiac ailments. Throughout the management of heart failure, physicians require observation of medical imagery to formulate therapeutic regimens for patients. Automated report generation technology serves as a tool aiding physicians in patient management. However, previous studies failed to generate targeted reports for specific diseases. To produce high-quality medical reports with greater relevance across diverse conditions, we introduce an automatic report generation model HF-CMN, tailored to heart failure. Firstly, the generated report includes comprehensive information pertaining to heart failure gleaned from chest radiographs. Additionally, we construct a storage query matrix grouping based on a multi-label type, enhancing the accuracy of our model in aligning images with text. Experimental results demonstrate that our method can generate reports strongly correlated with heart failure and outperforms most other advanced methods on benchmark datasets MIMIC-CXR and IU X-Ray. Further analysis confirms that our method achieves superior alignment between images and texts, resulting in higher-quality reports.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"399-415"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142367227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In response to the challenge of low accuracy in retinal vessel segmentation attributed to the minute nature of the vessels, this paper proposes a retinal vessel segmentation model based on an improved U-Net, which combines multi-scale feature extraction and fusion techniques. An improved dilated residual module was first used to replace the original convolutional layer of U-Net, and this module, coupled with a dual attention mechanism and diverse expansion rates, facilitates the extraction of multi-scale vascular features. Moreover, an adaptive feature fusion module was added at the skip connections of the model to improve vessel connectivity. To further optimize network training, a hybrid loss function is employed to mitigate the class imbalance between vessels and the background. Experimental results on the DRIVE dataset and CHASE_DB1 dataset show that the proposed model has an accuracy of 96.27% and 96.96%, sensitivity of 81.32% and 82.59%, and AUC of 98.34% and 98.70%, respectively, demonstrating superior segmentation performance.
{"title":"A multi-scale feature extraction and fusion-based model for retinal vessel segmentation in fundus images.","authors":"Jinzhi Zhou, Guangcen Ma, Haoyang He, Saifeng Li, Guopeng Zhang","doi":"10.1007/s11517-024-03223-8","DOIUrl":"10.1007/s11517-024-03223-8","url":null,"abstract":"<p><p>In response to the challenge of low accuracy in retinal vessel segmentation attributed to the minute nature of the vessels, this paper proposes a retinal vessel segmentation model based on an improved U-Net, which combines multi-scale feature extraction and fusion techniques. An improved dilated residual module was first used to replace the original convolutional layer of U-Net, and this module, coupled with a dual attention mechanism and diverse expansion rates, facilitates the extraction of multi-scale vascular features. Moreover, an adaptive feature fusion module was added at the skip connections of the model to improve vessel connectivity. To further optimize network training, a hybrid loss function is employed to mitigate the class imbalance between vessels and the background. Experimental results on the DRIVE dataset and CHASE_DB1 dataset show that the proposed model has an accuracy of 96.27% and 96.96%, sensitivity of 81.32% and 82.59%, and AUC of 98.34% and 98.70%, respectively, demonstrating superior segmentation performance.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"595-608"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Using echocardiography to assess the left ventricular function is one of the most crucial cardiac examinations in clinical diagnosis, and LV segmentation plays a particularly vital role in medical image processing as many important clinical diagnostic parameters are derived from the segmentation results, such as ejection function. However, echocardiography typically has a lower resolution and contains a significant amount of noise and motion artifacts, making it a challenge to accurate segmentation, especially in the region of the cardiac chamber boundary, which significantly restricts the accurate calculation of subsequent clinical parameters. In this paper, our goal is to achieve accurate LV segmentation through a simplified approach by introducing a branch sub-network into the decoder of the traditional U-Net. Specifically, we employed the LV contour features to supervise the branch decoding process and used a cross attention module to facilitate the interaction relationship between the branch and the original decoding process, thereby improving the segmentation performance in the region LV boundaries. In the experiments, the proposed branch U-Net (BU-Net) demonstrated superior performance on CAMUS and EchoNet-dynamic public echocardiography segmentation datasets in comparison to state-of-the-art segmentation models, without the need for complex residual connections or transformer-based architectures. Our codes are publicly available at Anonymous Github https://anonymous.4open.science/r/Anoymous_two-BFF2/ .
{"title":"Contour-constrained branch U-Net for accurate left ventricular segmentation in echocardiography.","authors":"Mingjun Qu, Jinzhu Yang, Honghe Li, Yiqiu Qi, Qi Yu","doi":"10.1007/s11517-024-03201-0","DOIUrl":"10.1007/s11517-024-03201-0","url":null,"abstract":"<p><p>Using echocardiography to assess the left ventricular function is one of the most crucial cardiac examinations in clinical diagnosis, and LV segmentation plays a particularly vital role in medical image processing as many important clinical diagnostic parameters are derived from the segmentation results, such as ejection function. However, echocardiography typically has a lower resolution and contains a significant amount of noise and motion artifacts, making it a challenge to accurate segmentation, especially in the region of the cardiac chamber boundary, which significantly restricts the accurate calculation of subsequent clinical parameters. In this paper, our goal is to achieve accurate LV segmentation through a simplified approach by introducing a branch sub-network into the decoder of the traditional U-Net. Specifically, we employed the LV contour features to supervise the branch decoding process and used a cross attention module to facilitate the interaction relationship between the branch and the original decoding process, thereby improving the segmentation performance in the region LV boundaries. In the experiments, the proposed branch U-Net (BU-Net) demonstrated superior performance on CAMUS and EchoNet-dynamic public echocardiography segmentation datasets in comparison to state-of-the-art segmentation models, without the need for complex residual connections or transformer-based architectures. Our codes are publicly available at Anonymous Github https://anonymous.4open.science/r/Anoymous_two-BFF2/ .</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"561-573"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142479176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-01Epub Date: 2024-09-26DOI: 10.1007/s11517-024-03205-w
Yunchai Li, Run Fang, Nangang Zhang, Chengsheng Liao, Xiaochang Chen, Xiaoyu Wang, Yunfei Luo, Leheng Li, Min Mao, Yunlong Zhang
With the rapid advancement of modern medical technology, microscopy imaging systems have become one of the key technologies in medical image analysis. However, manual use of microscopes presents issues such as operator dependency, inefficiency, and time consumption. To enhance the efficiency and accuracy of medical image capture and reduce the burden of subsequent quantitative analysis, this paper proposes an improved microscope salient object detection algorithm based on U2-Net, incorporating deep learning technology. The improved algorithm first enhances the network's key information extraction capability by incorporating the Convolutional Block Attention Module (CBAM) into U2-Net. It then optimizes network complexity by constructing a Simple Pyramid Pooling Module (SPPM) and uses Ghost convolution to achieve model lightweighting. Additionally, data augmentation is applied to the slides to improve the algorithm's robustness and generalization. The experimental results show that the size of the improved algorithm model is 72.5 MB, which represents a 56.85% reduction compared to the original U2-Net model size of 168.0 MB. Additionally, the model's prediction accuracy has increased from 92.24 to 97.13%, providing an efficient means for subsequent image processing and analysis tasks in microscopy imaging systems.
{"title":"An improved algorithm for salient object detection of microscope based on U<sup>2</sup>-Net.","authors":"Yunchai Li, Run Fang, Nangang Zhang, Chengsheng Liao, Xiaochang Chen, Xiaoyu Wang, Yunfei Luo, Leheng Li, Min Mao, Yunlong Zhang","doi":"10.1007/s11517-024-03205-w","DOIUrl":"10.1007/s11517-024-03205-w","url":null,"abstract":"<p><p>With the rapid advancement of modern medical technology, microscopy imaging systems have become one of the key technologies in medical image analysis. However, manual use of microscopes presents issues such as operator dependency, inefficiency, and time consumption. To enhance the efficiency and accuracy of medical image capture and reduce the burden of subsequent quantitative analysis, this paper proposes an improved microscope salient object detection algorithm based on U<sup>2</sup>-Net, incorporating deep learning technology. The improved algorithm first enhances the network's key information extraction capability by incorporating the Convolutional Block Attention Module (CBAM) into U<sup>2</sup>-Net. It then optimizes network complexity by constructing a Simple Pyramid Pooling Module (SPPM) and uses Ghost convolution to achieve model lightweighting. Additionally, data augmentation is applied to the slides to improve the algorithm's robustness and generalization. The experimental results show that the size of the improved algorithm model is 72.5 MB, which represents a 56.85% reduction compared to the original U<sup>2</sup>-Net model size of 168.0 MB. Additionally, the model's prediction accuracy has increased from 92.24 to 97.13%, providing an efficient means for subsequent image processing and analysis tasks in microscopy imaging systems.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":"383-397"},"PeriodicalIF":2.6,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142331240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}