首页 > 最新文献

IEEE Journal of Translational Engineering in Health and Medicine-Jtehm最新文献

英文 中文
Commute Booster: A Mobile Application for First/Last Mile and Middle Mile Navigation Support for People With Blindness and Low Vision 通勤助推器:一款为盲人和弱视人士提供第一/最后一英里和中间一英里导航支持的移动应用程序
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-07-07 DOI: 10.1109/JTEHM.2023.3293450
Junchi Feng;Mahya Beheshti;Mira Philipson;Yuvraj Ramsaywack;Maurizio Porfiri;John-Ross Rizzo
Objective: People with blindness and low vision face substantial challenges when navigating both indoor and outdoor environments. While various solutions are available to facilitate travel to and from public transit hubs, there is a notable absence of solutions for navigating within transit hubs, often referred to as the “middle mile”. Although research pilots have explored the middle mile journey, no solutions exist at scale, leaving a critical gap for commuters with disabilities. In this paper, we proposed a novel mobile application, Commute Booster, that offers full trip planning and real-time guidance inside the station. Methods and procedures: Our system consists of two key components: the general transit feed specification (GTFS) and optical character recognition (OCR). The GTFS dataset generates a comprehensive list of wayfinding signage within subway stations that users will encounter during their intended journey. The OCR functionality enables users to identify relevant navigation signs in their immediate surroundings. By seamlessly integrating these two components, Commute Booster provides real-time feedback to users regarding the presence or absence of relevant navigation signs within the field of view of their phone camera during their journey. Results: As part of our technical validation process, we conducted tests at three subway stations in New York City. The sign detection achieved an impressive overall accuracy rate of 0.97. Additionally, the system exhibited a maximum detection range of 11 meters and supported an oblique angle of approximately 110 degrees for field of view detection. Conclusion: The Commute Booster mobile application relies on computer vision technology and does not require additional sensors or infrastructure. It holds tremendous promise in assisting individuals with blindness and low vision during their daily commutes. Clinical and Translational Impact Statement: Commute Booster translates the combination of OCR and GTFS into an assistive tool, which holds great promise for assisting people with blindness and low vision in their daily commute.
目的:失明和低视力人群在室内和室外环境中都面临着巨大的挑战。虽然有各种各样的解决方案可以方便人们往返于公共交通枢纽,但明显缺乏在交通枢纽内导航的解决方案,这些解决方案通常被称为“中间一英里”。尽管试点研究已经探索了中间一英里的旅程,但没有大规模的解决方案,这给残疾通勤者留下了一个严重的缺口。在本文中,我们提出了一个新颖的移动应用程序,通勤助推器,提供完整的行程规划和实时指导站内。方法和步骤:我们的系统由两个关键部分组成:通用过境馈电规范(GTFS)和光学字符识别(OCR)。GTFS数据集生成了用户在预定旅程中会遇到的地铁站内的寻路标志的综合列表。OCR功能使用户能够识别周围环境中的相关导航标志。通过无缝集成这两个组件,通勤助推器可以向用户提供实时反馈,告知他们在旅途中手机摄像头视野内是否存在相关导航标志。结果:作为技术验证过程的一部分,我们在纽约市的三个地铁站进行了测试。符号检测的总体准确率达到了令人印象深刻的0.97。此外,该系统的最大探测范围为11米,并支持约110度的斜角进行视场探测。结论:通勤助推器移动应用程序依赖于计算机视觉技术,不需要额外的传感器或基础设施。它在帮助失明和视力低下的人日常通勤方面有着巨大的希望。临床和转化影响声明:通勤助推器将OCR和GTFS的结合转化为一种辅助工具,它为帮助失明和弱视人士在日常通勤中提供了很大的希望。
{"title":"Commute Booster: A Mobile Application for First/Last Mile and Middle Mile Navigation Support for People With Blindness and Low Vision","authors":"Junchi Feng;Mahya Beheshti;Mira Philipson;Yuvraj Ramsaywack;Maurizio Porfiri;John-Ross Rizzo","doi":"10.1109/JTEHM.2023.3293450","DOIUrl":"10.1109/JTEHM.2023.3293450","url":null,"abstract":"Objective: People with blindness and low vision face substantial challenges when navigating both indoor and outdoor environments. While various solutions are available to facilitate travel to and from public transit hubs, there is a notable absence of solutions for navigating within transit hubs, often referred to as the “middle mile”. Although research pilots have explored the middle mile journey, no solutions exist at scale, leaving a critical gap for commuters with disabilities. In this paper, we proposed a novel mobile application, Commute Booster, that offers full trip planning and real-time guidance inside the station. Methods and procedures: Our system consists of two key components: the general transit feed specification (GTFS) and optical character recognition (OCR). The GTFS dataset generates a comprehensive list of wayfinding signage within subway stations that users will encounter during their intended journey. The OCR functionality enables users to identify relevant navigation signs in their immediate surroundings. By seamlessly integrating these two components, Commute Booster provides real-time feedback to users regarding the presence or absence of relevant navigation signs within the field of view of their phone camera during their journey. Results: As part of our technical validation process, we conducted tests at three subway stations in New York City. The sign detection achieved an impressive overall accuracy rate of 0.97. Additionally, the system exhibited a maximum detection range of 11 meters and supported an oblique angle of approximately 110 degrees for field of view detection. Conclusion: The Commute Booster mobile application relies on computer vision technology and does not require additional sensors or infrastructure. It holds tremendous promise in assisting individuals with blindness and low vision during their daily commutes. Clinical and Translational Impact Statement: Commute Booster translates the combination of OCR and GTFS into an assistive tool, which holds great promise for assisting people with blindness and low vision in their daily commute.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"523-535"},"PeriodicalIF":3.4,"publicationDate":"2023-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10175612","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"62231443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design and Evaluation of a Balanced Compliant Laparoscopic Grasper 平衡顺应性腹腔镜抓取器的设计与评价。
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-07-03 DOI: 10.1109/JTEHM.2023.3291925
Jan-Willem Klok;Roelf Postema;Asþor T. Steinþorsson;Jenny Dankelman;Tim Horeman
In laparoscopic surgery, quality of haptic feedback is reduced compared to conventional surgery, leading to unintentional tissue damage during grasping. From the perspective of haptics, poor mechanical design of laparoscopic instrument joints induces friction and a nonlinear actuation-tip force relation. In this study, a novel laparoscopic grasper using compliant joints and a magnetic balancer is presented, and the reduction in hysteresis and friction is evaluated. The hysteresis loop of the novel compliant grasper and two conventional laparoscopic graspers (high quality leading commercial brand and low quality unbranded grasper) were measured. In order to assess quality of haptic feedback, the lowest grasper tip load perceivable by instrument users was measured with the novel and the conventional laparoscopic graspers. The hysteresis loop measurement yielded a mechanical efficiency of 43% for the novel grasper, compared to- 25% and 23% for the Aesculap and the unbranded grasper, respectively. The forces perceivable by the user through the novel grasper were significantly lower (mean 1.37N, SD 0.44N) than those of conventional graspers (mean 2.15N, SD 0.71N and mean 2.65N, SD 1.20N, respectively). The balanced compliant grasper technology has the ability to improve the quality of haptic feedback compared to conventional laparoscopic graspers. Research is needed to relate these results to soft and delicate tissue grasping in a clinical setting, for which this instrument is intended.
在腹腔镜手术中,与传统手术相比,触觉反馈的质量降低,导致抓握过程中无意的组织损伤。从触觉的角度来看,腹腔镜器械关节的机械设计不佳会导致摩擦和非线性的致动-尖端-力关系。在这项研究中,提出了一种新型的腹腔镜抓取器,该抓取器使用柔性关节和磁性平衡器,并评估了磁滞和摩擦的减少。测量了新型顺应性抓取器和两种传统腹腔镜抓取器(高质量领先商业品牌和低质量无品牌抓取器)的磁滞回线。为了评估触觉反馈的质量,使用新型和传统腹腔镜抓握器测量仪器用户可感知的最低抓握器尖端负载。磁滞回线测量结果表明,新型抓持器的机械效率为43%,而Aesculap和无品牌抓持器分别为-25%和23%。使用者通过新型抓握器可感知的力(平均1.37N,SD 0.44N)显著低于传统抓握器的力(分别为平均2.15N,SD 0.71N和平均2.65N,SD 1.20N)。与传统腹腔镜抓握器相比,平衡柔顺抓握器技术能够提高触觉反馈的质量。需要进行研究,将这些结果与临床环境中柔软细腻的组织抓取联系起来,这就是该仪器的用途。
{"title":"Design and Evaluation of a Balanced Compliant Laparoscopic Grasper","authors":"Jan-Willem Klok;Roelf Postema;Asþor T. Steinþorsson;Jenny Dankelman;Tim Horeman","doi":"10.1109/JTEHM.2023.3291925","DOIUrl":"10.1109/JTEHM.2023.3291925","url":null,"abstract":"In laparoscopic surgery, quality of haptic feedback is reduced compared to conventional surgery, leading to unintentional tissue damage during grasping. From the perspective of haptics, poor mechanical design of laparoscopic instrument joints induces friction and a nonlinear actuation-tip force relation. In this study, a novel laparoscopic grasper using compliant joints and a magnetic balancer is presented, and the reduction in hysteresis and friction is evaluated. The hysteresis loop of the novel compliant grasper and two conventional laparoscopic graspers (high quality leading commercial brand and low quality unbranded grasper) were measured. In order to assess quality of haptic feedback, the lowest grasper tip load perceivable by instrument users was measured with the novel and the conventional laparoscopic graspers. The hysteresis loop measurement yielded a mechanical efficiency of 43% for the novel grasper, compared to- 25% and 23% for the Aesculap and the unbranded grasper, respectively. The forces perceivable by the user through the novel grasper were significantly lower (mean 1.37N, SD 0.44N) than those of conventional graspers (mean 2.15N, SD 0.71N and mean 2.65N, SD 1.20N, respectively). The balanced compliant grasper technology has the ability to improve the quality of haptic feedback compared to conventional laparoscopic graspers. Research is needed to relate these results to soft and delicate tissue grasping in a clinical setting, for which this instrument is intended.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"451-459"},"PeriodicalIF":3.4,"publicationDate":"2023-07-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10172011","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41220007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simultaneous Control of Venous Reservoir Level and Arterial Flow Rate in Cardiopulmonary Bypass With a Centrifugal Pump 离心泵在体外循环中同时控制静脉储备水平和动脉流速。
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-06-30 DOI: 10.1109/JTEHM.2023.3290951
Hidenobu Takahashi;Takuya Kinoshita;Zu Soh;Shigeyuki Okahara;Satoshi Miyamoto;Shinji Ninomiya;Toshio Tsuji
Cardiopulmonary bypass (CPB) is an indispensable technique in cardiac surgery, providing the ability to temporarily replace cardiopulmonary function and create a bloodless surgical field. Traditionally, the operation of CPB systems has depended on the expertise and experience of skilled perfusionists. In particular, simultaneously controlling the arterial and venous occluders is difficult because the blood flow rate and reservoir level both change, and failure may put the patient’s life at risk. This study proposes an automatic control system with a two-degree-of-freedom model matching controller nested in an I-PD feedback controller to simultaneously regulate the blood flow rate and reservoir level. CPB operations were performed using glycerin and bovine blood as perfusate to simulate flow-up and flow-down phases. The results confirmed that the arterial blood flow rate followed the manually adjusted target venous blood flow rate, with an error of less than 5.32%, and the reservoir level was maintained, with an error of less than 3.44% from the target reservoir level. Then, we assessed the robustness of the control system against disturbances caused by venting/suction of blood. The resulting flow rate error was 5.95%, and the reservoir level error 2.02%. The accuracy of the proposed system is clinically satisfactory and within the allowable error range of 10% or less, meeting the standards set for perfusionists. Moreover, because of the system’s simple configuration, consisting of a camera and notebook PC, the system can easily be integrated with general CPB equipment. This practical design enables seamless adoption in clinical settings. With these advancements, the proposed system represents a significant step towards the automation of CPB.
体外循环(CPB)是心脏外科手术中不可或缺的技术,它能够暂时取代心肺功能,创造一个不流血的手术环境。传统上,CPB系统的操作依赖于熟练灌注师的专业知识和经验。特别地,同时控制动脉和静脉封堵器是困难的,因为血液流速和储液器水平都会改变,并且失败可能会使患者的生命处于危险之中。本研究提出了一种自动控制系统,该系统将两自由度模型匹配控制器嵌套在I-PD反馈控制器中,以同时调节血液流速和储液器液位。CPB手术使用甘油和牛血作为灌注液来模拟上行和下行阶段。结果证实,动脉血流量遵循手动调整的目标静脉血流量,误差小于5.32%,并保持了水库水位,与目标水库水位的误差小于3.44%。然后,我们评估了控制系统对血液排出/吸入引起的干扰的鲁棒性。由此产生的流速误差为5.95%,水库水位误差为2.02%。所提出的系统的准确性在临床上是令人满意的,并且在10%或更小的允许误差范围内,满足灌注师的标准。此外,由于该系统配置简单,由摄像头和笔记本电脑组成,因此可以很容易地与通用CPB设备集成。这种实用的设计能够在临床环境中无缝采用。随着这些进步,所提出的系统代表着CPB自动化的重要一步。
{"title":"Simultaneous Control of Venous Reservoir Level and Arterial Flow Rate in Cardiopulmonary Bypass With a Centrifugal Pump","authors":"Hidenobu Takahashi;Takuya Kinoshita;Zu Soh;Shigeyuki Okahara;Satoshi Miyamoto;Shinji Ninomiya;Toshio Tsuji","doi":"10.1109/JTEHM.2023.3290951","DOIUrl":"10.1109/JTEHM.2023.3290951","url":null,"abstract":"Cardiopulmonary bypass (CPB) is an indispensable technique in cardiac surgery, providing the ability to temporarily replace cardiopulmonary function and create a bloodless surgical field. Traditionally, the operation of CPB systems has depended on the expertise and experience of skilled perfusionists. In particular, simultaneously controlling the arterial and venous occluders is difficult because the blood flow rate and reservoir level both change, and failure may put the patient’s life at risk. This study proposes an automatic control system with a two-degree-of-freedom model matching controller nested in an I-PD feedback controller to simultaneously regulate the blood flow rate and reservoir level. CPB operations were performed using glycerin and bovine blood as perfusate to simulate flow-up and flow-down phases. The results confirmed that the arterial blood flow rate followed the manually adjusted target venous blood flow rate, with an error of less than 5.32%, and the reservoir level was maintained, with an error of less than 3.44% from the target reservoir level. Then, we assessed the robustness of the control system against disturbances caused by venting/suction of blood. The resulting flow rate error was 5.95%, and the reservoir level error 2.02%. The accuracy of the proposed system is clinically satisfactory and within the allowable error range of 10% or less, meeting the standards set for perfusionists. Moreover, because of the system’s simple configuration, consisting of a camera and notebook PC, the system can easily be integrated with general CPB equipment. This practical design enables seamless adoption in clinical settings. With these advancements, the proposed system represents a significant step towards the automation of CPB.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"435-440"},"PeriodicalIF":3.4,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10168928","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9945552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Epileptic Seizure Prediction Method Based on CBAM-3D CNN-LSTM Model 一种基于CBAM-3D CNN-LSTM模型的癫痫发作预测方法。
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-06-27 DOI: 10.1109/JTEHM.2023.3290036
Xiang Lu;Anhao Wen;Lei Sun;Hao Wang;Yinjing Guo;Yande Ren
Epilepsy as a common disease of the nervous system, with high incidence, sudden and recurrent characteristics. Therefore, timely prediction of seizures and intervention treatment can significantly reduce the accidental injury of patients and protect the life and health of patients. Epilepsy seizures is the result of temporal and spatial evolution, Existing deep learning methods often ignore its spatial features, in order to make better use of the temporal and spatial characteristics of epileptic EEG signals. We propose a CBAM-3D CNN-LSTM model to predict epilepsy seizures. First, we apply short-time Fourier transform(STFT) to preprocess EEG signals. Secondly, the 3D CNN model was used to extract the features of preictal stage and interictal stage from the preprocessed signals. Thirdly, Bi-LSTM is connected to 3D CNN for classification. Finally CBAM is introduced into the model. Different attention is given to the data channel and space to extract key information, so that the model can accurately extract interictal and pre-ictal features. Our proposed approach achieved an accuracy of 97.95%, a sensitivity of 98.40%, and a false alarm rate of 0.017 h−1 on 11 patients from the public CHB-MIT scalp EEG dataset. Clinical and Translational Impact Statement—Timely prediction of epileptic seizures and intervention treatment can significantly reduce the accidental injury of patients and protect the life and health of patients.
癫痫作为一种常见的神经系统疾病,具有发病率高、突发性和复发性的特点。因此,及时预测癫痫发作并进行干预治疗,可以显著减少患者的意外伤害,保护患者的生命健康。癫痫发作是时间和空间进化的结果,现有的深度学习方法为了更好地利用癫痫脑电信号的时间和空间特征,往往忽略了其空间特征。我们提出了一个CBAM-3DCNN-LSTM模型来预测癫痫发作。首先,我们应用短时傅立叶变换(STFT)对脑电信号进行预处理。其次,利用三维CNN模型从预处理后的信号中提取发作前和发作间期的特征。第三,将Bi-LSTM连接到3D CNN进行分类。最后将CBAM引入到模型中。对提取关键信息的数据通道和空间给予了不同的关注,使模型能够准确地提取发作间期和发作前的特征。我们提出的方法在来自公共CHB-MIT头皮EEG数据集的11名患者中实现了97.95%的准确率、98.40%的灵敏度和0.017h-1的误报率。临床和转化影响声明及时预测癫痫发作并进行干预治疗,可以显著减少患者的意外伤害,保护患者的生命健康。
{"title":"An Epileptic Seizure Prediction Method Based on CBAM-3D CNN-LSTM Model","authors":"Xiang Lu;Anhao Wen;Lei Sun;Hao Wang;Yinjing Guo;Yande Ren","doi":"10.1109/JTEHM.2023.3290036","DOIUrl":"10.1109/JTEHM.2023.3290036","url":null,"abstract":"Epilepsy as a common disease of the nervous system, with high incidence, sudden and recurrent characteristics. Therefore, timely prediction of seizures and intervention treatment can significantly reduce the accidental injury of patients and protect the life and health of patients. Epilepsy seizures is the result of temporal and spatial evolution, Existing deep learning methods often ignore its spatial features, in order to make better use of the temporal and spatial characteristics of epileptic EEG signals. We propose a CBAM-3D CNN-LSTM model to predict epilepsy seizures. First, we apply short-time Fourier transform(STFT) to preprocess EEG signals. Secondly, the 3D CNN model was used to extract the features of preictal stage and interictal stage from the preprocessed signals. Thirdly, Bi-LSTM is connected to 3D CNN for classification. Finally CBAM is introduced into the model. Different attention is given to the data channel and space to extract key information, so that the model can accurately extract interictal and pre-ictal features. Our proposed approach achieved an accuracy of 97.95%, a sensitivity of 98.40%, and a false alarm rate of 0.017 h\u0000<sup>−1</sup>\u0000 on 11 patients from the public CHB-MIT scalp EEG dataset. \u0000<italic><b>Clinical and Translational Impact Statement</b></i>\u0000—Timely prediction of epileptic seizures and intervention treatment can significantly reduce the accidental injury of patients and protect the life and health of patients.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"417-423"},"PeriodicalIF":3.4,"publicationDate":"2023-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10164022","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9814105","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
TransU²-Net: An Effective Medical Image Segmentation Framework Based on Transformer and U²-Net TransU-Net:一种基于Transformer和U-Net的有效医学图像分割框架。
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-06-27 DOI: 10.1109/JTEHM.2023.3289990
Xiang Li;Xianjin Fang;Gaoming Yang;Shuzhi Su;Li Zhu;Zekuan Yu
Background: In the past few years, U-Net based U-shaped architecture and skip-connections have made incredible progress in the field of medical image segmentation. U2-Net achieves good performance in computer vision. However, in the medical image segmentation task, U2-Net with over nesting is easy to overfit. Purpose: A 2D network structure TransU2-Net combining transformer and a lighter weight U2-Net is proposed for automatic segmentation of brain tumor magnetic resonance image (MRI). Methods: The light-weight U2-Net architecture not only obtains multi-scale information but also reduces redundant feature extraction. Meanwhile, the transformer block embedded in the stacked convolutional layer obtains more global information; the transformer with skip-connection enhances spatial domain information representation. A new multi-scale feature map fusion strategy as a postprocessing method was proposed for better fusing high and low-dimensional spatial information. Results: Our proposed model TransU2-Net achieves better segmentation results, on the BraTS2021 dataset, our method achieves an average dice coefficient of 88.17%; Evaluation on the publicly available MSD dataset, we perform tumor evaluation, we achieve a dice coefficient of 74.69%; in addition to comparing the TransU2-Net results are compared with previously proposed 2D segmentation methods. Conclusions: We propose an automatic medical image segmentation method combining transformers and U2-Net, which has good performance and is of clinical importance. The experimental results show that the proposed method outperforms other 2D medical image segmentation methods. Clinical Translation Statement: We use the BarTS2021 dataset and the MSD dataset which are publicly available databases. All experiments in this paper are in accordance with medical ethics.
背景:在过去的几年里,基于U-Net的U型架构和跳跃连接在医学图像分割领域取得了令人难以置信的进展。U2 Net在计算机视觉方面取得了良好的性能。然而,在医学图像分割任务中,具有过嵌套的U2-Net容易过拟合。目的:提出了一种将变压器和较轻重量的U2 Net相结合的二维网络结构TransU2 Net,用于脑肿瘤磁共振图像(MRI)的自动分割。方法:轻量级的U2 Net结构不仅可以获得多尺度信息,而且可以减少冗余特征提取。同时,嵌入堆叠卷积层中的变换器块获得了更多的全局信息;具有跳跃连接的变换器增强了空间域信息的表示。为了更好地融合高维和低维空间信息,提出了一种新的多尺度特征图融合策略作为后处理方法。结果:我们提出的模型TransU2-Net获得了更好的分割结果,在BraTS2021数据集上,我们的方法获得了88.17%的平均骰子系数;在公开的MSD数据集上进行评估,我们进行肿瘤评估,我们获得了74.69%的骰子系数;除了比较TransU2-Net的结果之外,还将其与先前提出的2D分割方法进行比较。结论:我们提出了一种将transformers和U2 Net相结合的医学图像自动分割方法,该方法具有良好的性能,具有重要的临床意义。实验结果表明,该方法优于其他二维医学图像分割方法。临床翻译声明:我们使用BarTS2021数据集和MSD数据集,它们是公开可用的数据库。本文中的所有实验都符合医学伦理。
{"title":"TransU²-Net: An Effective Medical Image Segmentation Framework Based on Transformer and U²-Net","authors":"Xiang Li;Xianjin Fang;Gaoming Yang;Shuzhi Su;Li Zhu;Zekuan Yu","doi":"10.1109/JTEHM.2023.3289990","DOIUrl":"10.1109/JTEHM.2023.3289990","url":null,"abstract":"Background: In the past few years, U-Net based U-shaped architecture and skip-connections have made incredible progress in the field of medical image segmentation. U\u0000<sup>2</sup>\u0000-Net achieves good performance in computer vision. However, in the medical image segmentation task, U\u0000<sup>2</sup>\u0000-Net with over nesting is easy to overfit. Purpose: A 2D network structure TransU\u0000<sup>2</sup>\u0000-Net combining transformer and a lighter weight U\u0000<sup>2</sup>\u0000-Net is proposed for automatic segmentation of brain tumor magnetic resonance image (MRI). Methods: The light-weight U\u0000<sup>2</sup>\u0000-Net architecture not only obtains multi-scale information but also reduces redundant feature extraction. Meanwhile, the transformer block embedded in the stacked convolutional layer obtains more global information; the transformer with skip-connection enhances spatial domain information representation. A new multi-scale feature map fusion strategy as a postprocessing method was proposed for better fusing high and low-dimensional spatial information. Results: Our proposed model TransU\u0000<sup>2</sup>\u0000-Net achieves better segmentation results, on the BraTS2021 dataset, our method achieves an average dice coefficient of 88.17%; Evaluation on the publicly available MSD dataset, we perform tumor evaluation, we achieve a dice coefficient of 74.69%; in addition to comparing the TransU\u0000<sup>2</sup>\u0000-Net results are compared with previously proposed 2D segmentation methods. Conclusions: We propose an automatic medical image segmentation method combining transformers and U\u0000<sup>2</sup>\u0000-Net, which has good performance and is of clinical importance. The experimental results show that the proposed method outperforms other 2D medical image segmentation methods. \u0000<bold>Clinical Translation Statement</b>\u0000: We use the BarTS2021 dataset and the MSD dataset which are publicly available databases. All experiments in this paper are in accordance with medical ethics.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"441-450"},"PeriodicalIF":3.4,"publicationDate":"2023-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/75/25/jtehm-fang-3289990.PMC10561737.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41220010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple Field-of-View Based Attention Driven Network for Weakly Supervised Common Bile Duct Stone Detection 基于多视野的注意力驱动网络用于弱监督胆总管结石检测。
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-06-15 DOI: 10.1109/JTEHM.2023.3286423
Ya-Han Chang;Meng-Ying Lin;Ming-Tsung Hsieh;Ming-Ching Ou;Chun-Rong Huang;Bor-Shyang Sheu
Objective: Common bile duct (CBD) stones caused diseases are life-threatening. Because CBD stones locate in the distal part of the CBD and have relatively small sizes, detecting CBD stones from CT scans is a challenging issue in the medical domain. Methods and procedures: We propose a deep learning based weakly-supervised method called multiple field-of-view based attention driven network (MFADNet) to detect CBD stones from CT scans based on image-level labels. Three dominant modules including a multiple field-of-view encoder, an attention driven decoder and a classification network are collaborated in the network. The encoder learns the feature of multi-scale contextual information while the decoder with the classification network is applied to locate the CBD stones based on spatial-channel attentions. To drive the learning of the whole network in a weakly-supervised and end-to-end trainable manner, four losses including the foreground loss, background loss, consistency loss and classification loss are proposed. Results: Compared with state-of-the-art weakly-supervised methods in the experiments, the proposed method can accurately classify and locate CBD stones based on the quantitative and qualitative results. Conclusion: We propose a novel multiple field-of-view based attention driven network for a new medical application of CBD stone detection from CT scans while only image-levels are required to reduce the burdens of labeling and help physicians automatically diagnose CBD stones. The source code is available at https://github.com/nchucvml/MFADNet after acceptance. Clinical impact: Our deep learning method can help physicians localize relatively small CBD stones for effectively diagnosing CBD stone caused diseases.
目的:胆总管结石引起的疾病危及生命。由于CBD结石位于CBD的远端,体积相对较小,因此从CT扫描中检测CBD结石在医学领域是一个具有挑战性的问题。方法和程序:我们提出了一种基于深度学习的弱监督方法,称为基于多视场的注意力驱动网络(MFADNet),用于基于图像级别标签从CT扫描中检测CBD结石。网络中协作了三个主要模块,包括多视场编码器、注意力驱动解码器和分类网络。编码器学习多尺度上下文信息的特征,而具有分类网络的解码器则基于空间通道注意力来定位CBD石头。为了以弱监督和端到端可训练的方式驱动整个网络的学习,提出了四种损失,包括前景损失、背景损失、一致性损失和分类损失。结果:与实验中最先进的弱监督方法相比,该方法能够根据定量和定性结果准确地对CBD结石进行分类和定位。结论:我们提出了一种新的基于多视场的注意力驱动网络,用于CT扫描中CBD结石检测的新医学应用,同时只需要图像级别即可减轻标记负担,并帮助医生自动诊断CBD结石。源代码位于https://github.com/nchucvml/MFADNet验收后。临床影响:我们的深度学习方法可以帮助医生定位相对较小的CBD结石,从而有效诊断CBD结石引起的疾病。
{"title":"Multiple Field-of-View Based Attention Driven Network for Weakly Supervised Common Bile Duct Stone Detection","authors":"Ya-Han Chang;Meng-Ying Lin;Ming-Tsung Hsieh;Ming-Ching Ou;Chun-Rong Huang;Bor-Shyang Sheu","doi":"10.1109/JTEHM.2023.3286423","DOIUrl":"10.1109/JTEHM.2023.3286423","url":null,"abstract":"Objective: Common bile duct (CBD) stones caused diseases are life-threatening. Because CBD stones locate in the distal part of the CBD and have relatively small sizes, detecting CBD stones from CT scans is a challenging issue in the medical domain. Methods and procedures: We propose a deep learning based weakly-supervised method called multiple field-of-view based attention driven network (MFADNet) to detect CBD stones from CT scans based on image-level labels. Three dominant modules including a multiple field-of-view encoder, an attention driven decoder and a classification network are collaborated in the network. The encoder learns the feature of multi-scale contextual information while the decoder with the classification network is applied to locate the CBD stones based on spatial-channel attentions. To drive the learning of the whole network in a weakly-supervised and end-to-end trainable manner, four losses including the foreground loss, background loss, consistency loss and classification loss are proposed. Results: Compared with state-of-the-art weakly-supervised methods in the experiments, the proposed method can accurately classify and locate CBD stones based on the quantitative and qualitative results. Conclusion: We propose a novel multiple field-of-view based attention driven network for a new medical application of CBD stone detection from CT scans while only image-levels are required to reduce the burdens of labeling and help physicians automatically diagnose CBD stones. The source code is available at \u0000<uri>https://github.com/nchucvml/MFADNet</uri>\u0000 after acceptance. Clinical impact: Our deep learning method can help physicians localize relatively small CBD stones for effectively diagnosing CBD stone caused diseases.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"394-404"},"PeriodicalIF":3.4,"publicationDate":"2023-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10153581","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9854605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Graph Convolutional Network Based on Univariate Neurodegeneration Biomarker for Alzheimer’s Disease Diagnosis 基于单变量神经变性生物标志物的图卷积网络在阿尔茨海默病诊断中的应用。
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-06-13 DOI: 10.1109/JTEHM.2023.3285723
Zongshuai Qu;Tao Yao;Xinghui Liu;Gang Wang
Objective: Alzheimer’s disease (AD) is a progressive and irreversible neurodegenerative disease that is not easily detectable in the early stage. This study proposed an efficient method of applying a graph convolutional network (GCN) on the early prediction of AD. Methods: We proposed a univariate neurodegeneration biomarker (UNB) based GCN semi-supervised classification framework. We generated UNB by comparing the similarity of individual morphological atrophy pattern and the atrophy pattern of $text{A}beta +$ AD group according to the brain morphological abnormalities induced by AD. For the GCN semi-supervised classification model, we took the UNBs of individuals as the features of nodes and constructed the weight of edges according to the similarity of phenotypic information between individuals, which explored the essential features of individuals through spectral graph convolution. The attention module was constructed and embedded into the GCN framework, which may refine the input morphological features to highlight the main impact of AD on the cerebral cortex and weaken the instability caused by individual diversities, thereby identifying the significant ROIs affected by AD and improving the classification accuracy. Results: We tested the UNB-GCN framework on the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. The estimated minimum sample sizes were 156, 349 and 423 for the longitudinal $text{A}beta +$ AD, $text{A}beta +$ mild cognitive impairment (MCI) and $text{A}beta +$ cognitively unimpaired (CU) groups, respectively. And the proposed UNB-GCN framework combined with the attention module can effectively improve the classification performance with 93.90% classification accuracy for AD vs. CU and 82.05% for AD vs. MCI on the validation set. Conclusion: The proposed UNB measures were superior to the conventional volume measures in describing the AD-induced cerebral cortex morphological changes. And the UNB-GCN framework combined with attention module may effectively improve the classification performance between MCI subjects and AD patients. Clinical and Translational Impact Statement: This study aims to predict the early AD patients, so as to help clinicians develop effective interventions to delay the deterioration of AD symptoms.
目的:阿尔茨海默病(AD)是一种进行性、不可逆的神经退行性疾病,早期不易发现。本研究提出了一种将图卷积网络(GCN)应用于AD早期预测的有效方法。方法:我们提出了一个基于单变量神经退行性变生物标志物(UNB)的GCN半监督分类框架。根据AD诱导的脑形态异常,我们通过比较个体形态萎缩模式与[公式:见正文]AD组萎缩模式的相似性来生成UNB。对于GCN半监督分类模型,我们将个体的UNBs作为节点的特征,并根据个体之间表型信息的相似性构建边缘的权重,通过谱图卷积来探索个体的本质特征。将注意力模块构建并嵌入GCN框架中,可以细化输入的形态学特征,突出AD对大脑皮层的主要影响,削弱个体多样性引起的不稳定性,从而识别出受AD影响的显著ROI,提高分类精度。结果:我们在阿尔茨海默病神经成像倡议(ADNI)数据库上测试了UNB-GCN框架。纵向[公式:见正文]AD、[公式:看正文]轻度认知障碍(MCI)和[公式:未见正文]认知未受损(CU)组的估计最小样本量分别为156、349和423。所提出的UNB-GCN框架与注意力模块相结合,可以有效地提高分类性能,在验证集上,AD与CU的分类准确率为93.90%,AD与MCI的分类准确度为82.05%。结论:在描述AD诱导的大脑皮层形态学变化方面,所提出的UNB测量优于传统的体积测量。UNB-GCN框架与注意力模块相结合可以有效提高MCI受试者与AD患者之间的分类性能。临床和转化影响声明:本研究旨在预测早期AD患者,以帮助临床医生制定有效的干预措施,延缓AD症状的恶化。
{"title":"A Graph Convolutional Network Based on Univariate Neurodegeneration Biomarker for Alzheimer’s Disease Diagnosis","authors":"Zongshuai Qu;Tao Yao;Xinghui Liu;Gang Wang","doi":"10.1109/JTEHM.2023.3285723","DOIUrl":"10.1109/JTEHM.2023.3285723","url":null,"abstract":"Objective: Alzheimer’s disease (AD) is a progressive and irreversible neurodegenerative disease that is not easily detectable in the early stage. This study proposed an efficient method of applying a graph convolutional network (GCN) on the early prediction of AD. Methods: We proposed a univariate neurodegeneration biomarker (UNB) based GCN semi-supervised classification framework. We generated UNB by comparing the similarity of individual morphological atrophy pattern and the atrophy pattern of \u0000<inline-formula> <tex-math>$text{A}beta +$ </tex-math></inline-formula>\u0000 AD group according to the brain morphological abnormalities induced by AD. For the GCN semi-supervised classification model, we took the UNBs of individuals as the features of nodes and constructed the weight of edges according to the similarity of phenotypic information between individuals, which explored the essential features of individuals through spectral graph convolution. The attention module was constructed and embedded into the GCN framework, which may refine the input morphological features to highlight the main impact of AD on the cerebral cortex and weaken the instability caused by individual diversities, thereby identifying the significant ROIs affected by AD and improving the classification accuracy. Results: We tested the UNB-GCN framework on the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database. The estimated minimum sample sizes were 156, 349 and 423 for the longitudinal \u0000<inline-formula> <tex-math>$text{A}beta +$ </tex-math></inline-formula>\u0000 AD, \u0000<inline-formula> <tex-math>$text{A}beta +$ </tex-math></inline-formula>\u0000 mild cognitive impairment (MCI) and \u0000<inline-formula> <tex-math>$text{A}beta +$ </tex-math></inline-formula>\u0000 cognitively unimpaired (CU) groups, respectively. And the proposed UNB-GCN framework combined with the attention module can effectively improve the classification performance with 93.90% classification accuracy for AD vs. CU and 82.05% for AD vs. MCI on the validation set. Conclusion: The proposed UNB measures were superior to the conventional volume measures in describing the AD-induced cerebral cortex morphological changes. And the UNB-GCN framework combined with attention module may effectively improve the classification performance between MCI subjects and AD patients. Clinical and Translational Impact Statement: This study aims to predict the early AD patients, so as to help clinicians develop effective interventions to delay the deterioration of AD symptoms.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"405-416"},"PeriodicalIF":3.4,"publicationDate":"2023-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10149537","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9892878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Sparse Independence Component Analysis for Competitive Endogenous RNA Co-Module Identification in Liver Hepatocellular Carcinoma 肝细胞癌竞争性内源性RNA共模块鉴定的稀疏独立成分分析。
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-06-07 DOI: 10.1109/JTEHM.2023.3283519
Yuhu Shi;Lili Zhou;Weiming Zeng;Boyang Wei;Jin Deng
Objective: Long non-coding RNAs (lncRNAs) have been shown to be associated with the pathogenesis of different kinds of diseases and play important roles in various biological processes. Although numerous lncRNAs have been found, the functions of most lncRNAs and physiological/pathological significance are still in its infancy. Meanwhile, their expression patterns and regulation mechanisms are also far from being fully understood. Methods: In order to reveal functional lncRNAs and identify the key lncRNAs, we develop a new sparse independence component analysis (ICA) method to identify lncRNA-mRNA-miRNA expression co-modules based on the competitive endogenous RNA (ceRNA) theory using the sample-matched lncRNA, mRNA and miRNA expression profiles. The expression data of the three RNA combined together is approximated sparsely to obtain the corresponding sparsity coefficient, and then it is decomposed by using ICA constraint optimization to obtain the common basis and modules. Subsequently, affine propagation clustering is used to perform cluster analysis on the common basis under multiple running conditions to obtain the co-modules for the selection of different RNA elements. Results: We applied sparse ICA to Liver Hepatocellular Carcinoma (LIHC) dataset and the experiment results demonstrate that the proposed sparse ICA method can effectively discover biologically functional expression common modules. Conclusion: It may provide insights into the function of lncRNAs and molecular mechanism of LIHC. Clinical and Translational Impact Statement–The results on LIHC dataset demonstrate that the proposed sparse ICA method can effectively discover biologically functional expression common modules, which may provide insights into the function of IncRNAs and molecular mechanism of LIHC.
目的:长非编码RNA(lncRNA)已被证明与不同类型疾病的发病机制有关,并在各种生物学过程中发挥重要作用。尽管已经发现了许多lncRNA,但大多数lncRNA的功能和生理/病理意义仍处于初级阶段。同时,它们的表达模式和调控机制也远未被完全理解。方法:为了揭示功能性lncRNA并鉴定关键lncRNA,我们基于竞争内源性RNA(ceRNA)理论,利用样本匹配的lncRNA、mRNA和miRNA表达谱,开发了一种新的稀疏独立成分分析(ICA)方法来鉴定lncRNA-mRNA-miRNA表达共模块。将三种RNA组合在一起的表达数据稀疏地近似,以获得相应的稀疏系数,然后使用ICA约束优化对其进行分解,以获得共同的基础和模块。随后,使用仿射传播聚类在多个运行条件下共同进行聚类分析,以获得用于选择不同RNA元素的协同模块。结果:我们将稀疏ICA应用于肝细胞癌(LIHC)数据集,实验结果表明,所提出的稀疏ICA方法可以有效地发现生物功能表达的公共模块。结论:这可能为深入了解lncRNA的功能和LIHC的分子机制提供依据。临床和翻译影响声明在LIHC数据集上的结果表明,所提出的稀疏ICA方法可以有效地发现生物功能表达的共同模块,这可能为深入了解IncRNA的功能和LIHC的分子机制提供信息。
{"title":"Sparse Independence Component Analysis for Competitive Endogenous RNA Co-Module Identification in Liver Hepatocellular Carcinoma","authors":"Yuhu Shi;Lili Zhou;Weiming Zeng;Boyang Wei;Jin Deng","doi":"10.1109/JTEHM.2023.3283519","DOIUrl":"10.1109/JTEHM.2023.3283519","url":null,"abstract":"Objective: Long non-coding RNAs (lncRNAs) have been shown to be associated with the pathogenesis of different kinds of diseases and play important roles in various biological processes. Although numerous lncRNAs have been found, the functions of most lncRNAs and physiological/pathological significance are still in its infancy. Meanwhile, their expression patterns and regulation mechanisms are also far from being fully understood. Methods: In order to reveal functional lncRNAs and identify the key lncRNAs, we develop a new sparse independence component analysis (ICA) method to identify lncRNA-mRNA-miRNA expression co-modules based on the competitive endogenous RNA (ceRNA) theory using the sample-matched lncRNA, mRNA and miRNA expression profiles. The expression data of the three RNA combined together is approximated sparsely to obtain the corresponding sparsity coefficient, and then it is decomposed by using ICA constraint optimization to obtain the common basis and modules. Subsequently, affine propagation clustering is used to perform cluster analysis on the common basis under multiple running conditions to obtain the co-modules for the selection of different RNA elements. Results: We applied sparse ICA to Liver Hepatocellular Carcinoma (LIHC) dataset and the experiment results demonstrate that the proposed sparse ICA method can effectively discover biologically functional expression common modules. Conclusion: It may provide insights into the function of lncRNAs and molecular mechanism of LIHC. Clinical and Translational Impact Statement–The results on LIHC dataset demonstrate that the proposed sparse ICA method can effectively discover biologically functional expression common modules, which may provide insights into the function of IncRNAs and molecular mechanism of LIHC.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"384-393"},"PeriodicalIF":3.4,"publicationDate":"2023-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10145103","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9854603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parameter-Free Matrix Decomposition for Specular Reflections Removal in Endoscopic Images 用于去除内窥镜图像镜面反射的无参数矩阵分解。
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-06-06 DOI: 10.1109/JTEHM.2023.3283444
Jithin Joseph;Sudhish N. George;Kiran Raja
Objective: Endoscopy is a medical diagnostic procedure used to see inside the human body with the help of a camera-attached system called the endoscope. Endoscopic images and videos suffer from specular reflections (or highlight) and can have an adverse impact on the diagnostic quality of images. These scattered white regions severely affect the visual appearance of images for both endoscopists and the computer-aided diagnosis of diseases. Methods & Results: We introduce a new parameter-free matrix decomposition technique to remove the specular reflections. The proposed method decomposes the original image into a highlight-free pseudo-low-rank component and a highlight component. Along with the highlight removal, the approach also removes the boundary artifacts present around the highlight regions, unlike the previous works based on family of Robust Principal Component Analysis (RPCA). The approach is evaluated on three publicly available endoscopy datasets: Kvasir Polyp, Kvasir Normal-Pylorus and Kvasir Capsule datasets. Our evaluation is benchmarked against 4 different state-of-the-art approaches using three different well-used metrics such as Structural Similarity Index Measure (SSIM), Percentage of highlights remaining and Coefficient of Variation (CoV). Conclusions: The results show significant improvements over the compared methods on all three metrics. The approach is further validated for statistical significance where it emerges better than other state-of-the-art approaches.Clinical and Translational Impact Statement—The mathematical concepts of low rank and rank decomposition in matrix algebra are translated to remove specularities in the endoscopic images The result shows the impact of the proposed method in removing specular reflections from endoscopic images indicating improved diagnosis efficiency for both endoscopists and computer-aided diagnosis systems
目的:内窥镜是一种医学诊断程序,用于在称为内窥镜的摄像头连接系统的帮助下观察人体内部。内窥镜图像和视频会受到镜面反射(或高光)的影响,并可能对图像的诊断质量产生不利影响。这些分散的白色区域严重影响内窥镜医生和疾病计算机辅助诊断的图像视觉外观。方法与结果:我们引入了一种新的无参数矩阵分解技术来去除镜面反射。该方法将原始图像分解为无高光伪低阶分量和高光分量。除了去除高光之外,该方法还去除了高光区域周围的边界伪影,这与以前基于稳健主成分分析(RPCA)家族的工作不同。该方法在三个公开的内窥镜检查数据集上进行了评估:Kvasir Polyp、Kvasir Normal Pylorus和Kvasir Capsule数据集。我们的评估以4种不同的最先进方法为基准,使用三种不同的常用指标,如结构相似性指数测量(SSIM)、剩余亮点百分比和变异系数(CoV)。结论:结果表明,在所有三个指标上,与比较方法相比,都有显著改进。该方法在统计显著性方面得到了进一步验证,其表现优于其他最先进的方法。临床和转化影响声明矩阵代数中的低秩和秩分解的数学概念被转化为去除内窥镜图像中的镜面反射。结果显示了所提出的方法在去除内窥镜中的镜面反射方面的影响,这表明内窥镜医生和计算机辅助诊断系统的诊断效率都有所提高。
{"title":"Parameter-Free Matrix Decomposition for Specular Reflections Removal in Endoscopic Images","authors":"Jithin Joseph;Sudhish N. George;Kiran Raja","doi":"10.1109/JTEHM.2023.3283444","DOIUrl":"10.1109/JTEHM.2023.3283444","url":null,"abstract":"<italic>Objective:</i>\u0000 Endoscopy is a medical diagnostic procedure used to see inside the human body with the help of a camera-attached system called the endoscope. Endoscopic images and videos suffer from specular reflections (or highlight) and can have an adverse impact on the diagnostic quality of images. These scattered white regions severely affect the visual appearance of images for both endoscopists and the computer-aided diagnosis of diseases. Methods & Results: We introduce a new parameter-free matrix decomposition technique to remove the specular reflections. The proposed method decomposes the original image into a highlight-free pseudo-low-rank component and a highlight component. Along with the highlight removal, the approach also removes the boundary artifacts present around the highlight regions, unlike the previous works based on family of Robust Principal Component Analysis (RPCA). The approach is evaluated on three publicly available endoscopy datasets: Kvasir Polyp, Kvasir Normal-Pylorus and Kvasir Capsule datasets. Our evaluation is benchmarked against 4 different state-of-the-art approaches using three different well-used metrics such as Structural Similarity Index Measure (SSIM), Percentage of highlights remaining and Coefficient of Variation (CoV). Conclusions: The results show significant improvements over the compared methods on all three metrics. The approach is further validated for statistical significance where it emerges better than other state-of-the-art approaches.\u0000<italic>Clinical and Translational Impact Statement—</i>\u0000The mathematical concepts of low rank and rank decomposition in matrix algebra are translated to remove specularities in the endoscopic images The result shows the impact of the proposed method in removing specular reflections from endoscopic images indicating improved diagnosis efficiency for both endoscopists and computer-aided diagnosis systems","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"360-374"},"PeriodicalIF":3.4,"publicationDate":"2023-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10144758","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10191715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CNN-LSTM Model for Recognizing Video-Recorded Actions Performed in a Traditional Chinese Exercise 用于识别在传统汉语练习中执行的视频录制动作的CNN-LSTM模型。
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2023-06-02 DOI: 10.1109/JTEHM.2023.3282245
Jing Chen;Jiping Wang;Qun Yuan;Zhao Yang
Identifying human actions from video data is an important problem in the fields of intelligent rehabilitation assessment. Motion feature extraction and pattern recognition are the two key procedures to achieve such goals. Traditional action recognition models are usually based on the geometric features manually extracted from video frames, which are however difficult to adapt to complex scenarios and cannot achieve high-precision recognition and robustness. We investigate a motion recognition model and apply it to recognize the sequence of complicated actions of a traditional Chinese exercise (ie, Baduanjin). We first developed a combined convolutional neural network (CNN) and long short-term memory (LSTM) model for recognizing the sequence of actions captured in video frames, and applied it to recognize the actions of Baduanjin. Moreover, this method has been compared with the traditional action recognition model based on geometric motion features in which Openpose is used to identify the joint positions in the skeletons. Its performance of high recognition accuracy has been verified on the testing video dataset, containing the video clips from 18 different practicers. The CNN-LSTM recognition model achieved 96.43% accuracy on the testing set; while those manually extracted features in the traditional action recognition model were only able to achieve 66.07% classification accuracy on the testing video dataset. The abstract image features extracted by the CNN module are more effective on improving the classification accuracy of the LSTM model. The proposed CNN-LSTM based method can be a useful tool in recognizing the complicated actions.
从视频数据中识别人类行为是智能康复评估领域的一个重要问题。运动特征提取和模式识别是实现这一目标的两个关键步骤。传统的动作识别模型通常基于从视频帧中手动提取的几何特征,但难以适应复杂的场景,无法实现高精度的识别和鲁棒性。我们研究了一个运动识别模型,并将其应用于中国传统体操(即八段锦)的复杂动作序列识别。我们首先开发了一种用于识别视频帧中捕捉的动作序列的卷积神经网络(CNN)和长短期记忆(LSTM)组合模型,并将其应用于识别八段锦的动作。此外,该方法还与传统的基于几何运动特征的动作识别模型进行了比较,在该模型中,Openpose用于识别骨骼中的关节位置。它的高识别精度性能已经在测试视频数据集上得到了验证,该数据集包含来自18个不同练习者的视频片段。CNN-LSTM识别模型在测试集上的准确率达到96.43%;而传统动作识别模型中手动提取的特征在测试视频数据集上只能达到66.07%的分类准确率。CNN模块提取的抽象图像特征在提高LSTM模型的分类精度方面更为有效。所提出的基于CNN-LSTM的方法可以成为识别复杂动作的有用工具。
{"title":"CNN-LSTM Model for Recognizing Video-Recorded Actions Performed in a Traditional Chinese Exercise","authors":"Jing Chen;Jiping Wang;Qun Yuan;Zhao Yang","doi":"10.1109/JTEHM.2023.3282245","DOIUrl":"10.1109/JTEHM.2023.3282245","url":null,"abstract":"Identifying human actions from video data is an important problem in the fields of intelligent rehabilitation assessment. Motion feature extraction and pattern recognition are the two key procedures to achieve such goals. Traditional action recognition models are usually based on the geometric features manually extracted from video frames, which are however difficult to adapt to complex scenarios and cannot achieve high-precision recognition and robustness. We investigate a motion recognition model and apply it to recognize the sequence of complicated actions of a traditional Chinese exercise (ie, Baduanjin). We first developed a combined convolutional neural network (CNN) and long short-term memory (LSTM) model for recognizing the sequence of actions captured in video frames, and applied it to recognize the actions of Baduanjin. Moreover, this method has been compared with the traditional action recognition model based on geometric motion features in which Openpose is used to identify the joint positions in the skeletons. Its performance of high recognition accuracy has been verified on the testing video dataset, containing the video clips from 18 different practicers. The CNN-LSTM recognition model achieved 96.43% accuracy on the testing set; while those manually extracted features in the traditional action recognition model were only able to achieve 66.07% classification accuracy on the testing video dataset. The abstract image features extracted by the CNN module are more effective on improving the classification accuracy of the LSTM model. The proposed CNN-LSTM based method can be a useful tool in recognizing the complicated actions.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"11 ","pages":"351-359"},"PeriodicalIF":3.4,"publicationDate":"2023-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10143200","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10173160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
IEEE Journal of Translational Engineering in Health and Medicine-Jtehm
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1