首页 > 最新文献

IEEE Journal of Translational Engineering in Health and Medicine-Jtehm最新文献

英文 中文
An Actuated Variable-View Rigid Scope System to Assist Visualization in Diagnostic Procedures 辅助诊断程序可视化的可变视角刚性显微镜系统
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-03-31 DOI: 10.1109/JTEHM.2024.3407951
Sofia Basha;Mohammad Khorasani;Nihal Abdurahiman;Jhasketan Padhan;Victor Baez;Abdulla Al-Ansari;Panagiotis Tsiamyrtzis;Aaron T. Becker;Nikhil V. Navkar
Objective: Variable-view rigid scopes offer advantages compared to traditional angled laparoscopes for examining a diagnostic site. However, altering the scope’s view requires a high level of dexterity and understanding of spatial orientation. This requires an intuitive mechanism to allow an operator to easily understand the anatomical surroundings and smoothly adjust the scope’s focus during diagnosis. To address this challenge, the objective of this work is to develop a mechanized arm that assists in visualization using variable-view rigid scopes during diagnostic procedures.Methods: A system with a mechanized arm to maneuver a variable-view rigid scope (EndoCAMeleon - Karl Storz) was developed. A user study was conducted to assess the ability of the proposed mechanized arm for diagnosis in a preclinical navigation task and a simulated cystoscopy procedure.Results: The mechanized arm performed significantly better than direct maneuvering of the rigid scope. In the preclinical navigation task, it reduced the percentage of time the scope’s focus shifted outside a predefined track. Similarly, for simulated cystoscopy procedure, it reduced the duration and the perceived workload.Conclusion: The proposed mechanized arm enhances the operator’s ability to accurately maneuver a variable-view rigid scope and reduces the effort in performing diagnostic procedures.Clinical and Translational Impact Statement: The preclinical research introduces a mechanized arm to intuitively maneuver a variable-view rigid scope during diagnostic procedures, while minimizing the mental and physical workload to the operator.
目的:与传统的倾斜腹腔镜相比,可变视角刚性腹腔镜在检查诊断部位方面具有优势。然而,改变瞄准镜的视角需要高度的灵活性和对空间方位的理解。这就需要一种直观的机制,让操作员能够轻松了解周围的解剖环境,并在诊断过程中顺利调整瞄准镜的焦点。为了应对这一挑战,这项工作的目标是开发一种机械化手臂,在诊断过程中使用可变视角刚性显微镜辅助观察:方法:开发了一种带有机械臂的系统,用于操纵可变视角硬镜(EndoCAMeleon - Karl Storz)。结果:机械化手臂在临床前导航任务和模拟膀胱镜检查过程中的表现明显优于机械化手臂:结果:机械化手臂的表现明显优于直接操纵硬镜。在临床前导航任务中,它减少了瞄准镜焦点偏离预定轨道的时间百分比。同样,在模拟膀胱镜检查过程中,机械臂缩短了持续时间,减轻了感知工作量:结论:拟议的机械化手臂提高了操作员准确操纵可变视角刚性镜的能力,并减少了执行诊断程序的工作量:临床前研究引入了一种机械化手臂,可在诊断过程中直观地操纵可变视角硬镜,同时最大限度地减轻操作者的脑力和体力负担。
{"title":"An Actuated Variable-View Rigid Scope System to Assist Visualization in Diagnostic Procedures","authors":"Sofia Basha;Mohammad Khorasani;Nihal Abdurahiman;Jhasketan Padhan;Victor Baez;Abdulla Al-Ansari;Panagiotis Tsiamyrtzis;Aaron T. Becker;Nikhil V. Navkar","doi":"10.1109/JTEHM.2024.3407951","DOIUrl":"10.1109/JTEHM.2024.3407951","url":null,"abstract":"Objective: Variable-view rigid scopes offer advantages compared to traditional angled laparoscopes for examining a diagnostic site. However, altering the scope’s view requires a high level of dexterity and understanding of spatial orientation. This requires an intuitive mechanism to allow an operator to easily understand the anatomical surroundings and smoothly adjust the scope’s focus during diagnosis. To address this challenge, the objective of this work is to develop a mechanized arm that assists in visualization using variable-view rigid scopes during diagnostic procedures.Methods: A system with a mechanized arm to maneuver a variable-view rigid scope (EndoCAMeleon - Karl Storz) was developed. A user study was conducted to assess the ability of the proposed mechanized arm for diagnosis in a preclinical navigation task and a simulated cystoscopy procedure.Results: The mechanized arm performed significantly better than direct maneuvering of the rigid scope. In the preclinical navigation task, it reduced the percentage of time the scope’s focus shifted outside a predefined track. Similarly, for simulated cystoscopy procedure, it reduced the duration and the perceived workload.Conclusion: The proposed mechanized arm enhances the operator’s ability to accurately maneuver a variable-view rigid scope and reduces the effort in performing diagnostic procedures.Clinical and Translational Impact Statement: The preclinical research introduces a mechanized arm to intuitively maneuver a variable-view rigid scope during diagnostic procedures, while minimizing the mental and physical workload to the operator.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"499-507"},"PeriodicalIF":3.4,"publicationDate":"2024-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10543117","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141190485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Study on Intelligent Optical Bone Densitometry 智能光学骨密度测量法研究
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-03-21 DOI: 10.1109/JTEHM.2024.3368106
Takhellambam Gautam Meitei;Wei-Chun Chang;Pou-Leng Cheong;Yi-Min Wang;Chia-Wei Sun
Osteoporosis is a prevalent chronic disease worldwide, particularly affecting the aging population. The gold standard diagnostic tool for osteoporosis is Dual-energy X-ray Absorptiometry (DXA). However, the expensive cost of the DXA machine and the need for skilled professionals to operate it restrict its accessibility to the general public. This paper builds upon previous research and proposes a novel approach for rapidly screening bone density. The method involves utilizing near-infrared light to capture local body information within the human body. Deep learning techniques are employed to analyze the obtained data and extract meaningful insights related to bone density. Our initial prediction, utilizing multi-linear regression, demonstrated a strong correlation (r = 0.98, p-value = 0.003**) with the measured Bone Mineral Density (BMD) obtained from Dual-energy X-ray Absorptiometry (DXA). This indicates a highly significant relationship between the predicted values and the actual BMD measurements. A deep learning-based algorithm is applied to analyze the underlying information further to predict bone density at the wrist, hip, and spine. The prediction of bone densities in the hip and spine holds significant importance due to their status as gold-standard sites for assessing an individual’s bone density. Our prediction rate had an error margin below 10% for the wrist and below 20% for the hip and spine bone density.
骨质疏松症是一种全球流行的慢性疾病,对老龄人口的影响尤为严重。骨质疏松症的金标准诊断工具是双能 X 射线吸收测定法(DXA)。然而,DXA 仪器价格昂贵,而且需要熟练的专业人员操作,这限制了普通大众对它的使用。本文在以往研究的基础上,提出了一种快速筛查骨密度的新方法。该方法利用近红外线捕捉人体局部信息。利用深度学习技术来分析获得的数据,并提取与骨密度相关的有意义的见解。我们利用多线性回归进行的初步预测显示,该预测与通过双能 X 射线吸收仪(DXA)测量的骨密度(BMD)之间存在很强的相关性(r = 0.98,p 值 = 0.003**)。这表明预测值与实际 BMD 测量值之间存在着非常显著的关系。应用基于深度学习的算法进一步分析基础信息,以预测手腕、髋部和脊柱的骨密度。由于髋部和脊柱是评估个人骨密度的黄金标准部位,因此预测这两个部位的骨密度具有重要意义。我们对腕部骨密度的预测误差率低于 10%,对髋部和脊柱骨密度的预测误差率低于 20%。
{"title":"A Study on Intelligent Optical Bone Densitometry","authors":"Takhellambam Gautam Meitei;Wei-Chun Chang;Pou-Leng Cheong;Yi-Min Wang;Chia-Wei Sun","doi":"10.1109/JTEHM.2024.3368106","DOIUrl":"10.1109/JTEHM.2024.3368106","url":null,"abstract":"Osteoporosis is a prevalent chronic disease worldwide, particularly affecting the aging population. The gold standard diagnostic tool for osteoporosis is Dual-energy X-ray Absorptiometry (DXA). However, the expensive cost of the DXA machine and the need for skilled professionals to operate it restrict its accessibility to the general public. This paper builds upon previous research and proposes a novel approach for rapidly screening bone density. The method involves utilizing near-infrared light to capture local body information within the human body. Deep learning techniques are employed to analyze the obtained data and extract meaningful insights related to bone density. Our initial prediction, utilizing multi-linear regression, demonstrated a strong correlation (r = 0.98, p-value = 0.003**) with the measured Bone Mineral Density (BMD) obtained from Dual-energy X-ray Absorptiometry (DXA). This indicates a highly significant relationship between the predicted values and the actual BMD measurements. A deep learning-based algorithm is applied to analyze the underlying information further to predict bone density at the wrist, hip, and spine. The prediction of bone densities in the hip and spine holds significant importance due to their status as gold-standard sites for assessing an individual’s bone density. Our prediction rate had an error margin below 10% for the wrist and below 20% for the hip and spine bone density.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"401-412"},"PeriodicalIF":3.4,"publicationDate":"2024-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10477504","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140202996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CHIVID: A Rapid Deployment of Community and Home Isolation During COVID-19 Pandemics CHIVID:COVID-19 大流行期间社区和家庭隔离的快速部署
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-03-13 DOI: 10.1109/JTEHM.2024.3377258
Parpada Piamjinda;Chiraphat Boonnag;Piyalitt Ittichaiwong;Seandee Rattanasonrerk;Kanyakorn Veerakanjana;Khanita Duangchaemkarn;Warissara Limpornchitwilai;Kamonwan Thanontip;Napasara Asawalertsak;Thitikorn Kaewlee;Theerawit Wilaiprasitporn
Background: CHIVID is a telemedicine solution developed under tight time constraints that assists Thai healthcare practitioners in monitoring non-severe COVID-19 patients in isolation programs during crises. It assesses patient health and notifies healthcare practitioners of high-risk scenarios through a chatbot. The system was designed to integrate with the famous Thai messaging app LINE, reducing development time and enhancing user-friendliness, and the system allowed patients to upload a pulse oximeter image automatically processed by the PACMAN function to extract oxygen saturation and heart rate values to reduce patient input errors. Methods: This article describes the proposed system and presents a mixed-methods study that evaluated the system’s performance by collecting survey responses from 70 healthcare practitioners and analyzing 14,817 patient records. Results: Approximately 71.4% of healthcare practitioners use the system more than twice daily, with the majority managing 1–10 patients, while 11.4% handle over 101 patients. The progress note is a function that healthcare practitioners most frequently use and are satisfied with. Regarding patient data, 58.9%(8,724/14,817) are male, and 49.7%(7,367/14,817) within the 18 to 34 age range. The average length of isolation was 7.6 days, and patients submitted progress notes twice daily on average. Notably, individuals aged 18 to 34 demonstrated the highest utilization rates for the PACMAN function. Furthermore, most patients, totaling over 95.52%(14,153/14,817), were discharged normally. Conclusion: The findings indicate that CHIVID could be one of the telemedicine solutions for hospitals with patient overflow and healthcare practitioners unfamiliar with telemedicine technology to improve patient care during a critical crisis. Clinical and Translational Impact Statement— CHIVID’s success arises from seamlessly integrating telemedicine into third-party application within a limited timeframe and effectively using clinical decision support systems to address challenges during the COVID-19 crisis.
背景:CHIVID 是在时间紧迫的情况下开发的远程医疗解决方案,可在危机期间协助泰国医疗从业人员监控隔离项目中的非重症 COVID-19 患者。它通过聊天机器人评估病人的健康状况并通知医疗从业人员高风险情况。该系统旨在与泰国著名的消息应用程序 LINE 集成,从而缩短开发时间并提高用户友好性。该系统允许患者上传脉搏血氧仪图像,并由 PACMAN 功能自动处理,以提取血氧饱和度和心率值,从而减少患者输入错误。方法:本文介绍了所提议的系统,并介绍了一项混合方法研究,该研究通过收集 70 名医疗从业人员的调查反馈和分析 14,817 份患者记录来评估该系统的性能。研究结果约 71.4% 的医疗从业人员每天使用该系统两次以上,其中大多数人管理 1-10 名患者,11.4% 的人管理 101 名以上患者。进度记录是医疗从业人员最常用且最满意的功能。在患者数据方面,58.9%(8,724/14,817)为男性,49.7%(7,367/14,817)在 18 至 34 岁之间。平均隔离时间为 7.6 天,患者平均每天提交两次进展记录。值得注意的是,18 至 34 岁人群对 PACMAN 功能的使用率最高。此外,大多数患者(总计超过 95.52%(14,153/14,817))都能正常出院。结论研究结果表明,CHIVID 可以作为远程医疗解决方案之一,帮助病人过多的医院和不熟悉远程医疗技术的医疗从业人员在危急关头改善病人护理。临床和转化影响声明--CHIVID 的成功源于在有限的时间内将远程医疗无缝集成到第三方应用中,并有效利用临床决策支持系统应对 COVID-19 危机期间的挑战。
{"title":"CHIVID: A Rapid Deployment of Community and Home Isolation During COVID-19 Pandemics","authors":"Parpada Piamjinda;Chiraphat Boonnag;Piyalitt Ittichaiwong;Seandee Rattanasonrerk;Kanyakorn Veerakanjana;Khanita Duangchaemkarn;Warissara Limpornchitwilai;Kamonwan Thanontip;Napasara Asawalertsak;Thitikorn Kaewlee;Theerawit Wilaiprasitporn","doi":"10.1109/JTEHM.2024.3377258","DOIUrl":"10.1109/JTEHM.2024.3377258","url":null,"abstract":"Background: CHIVID is a telemedicine solution developed under tight time constraints that assists Thai healthcare practitioners in monitoring non-severe COVID-19 patients in isolation programs during crises. It assesses patient health and notifies healthcare practitioners of high-risk scenarios through a chatbot. The system was designed to integrate with the famous Thai messaging app LINE, reducing development time and enhancing user-friendliness, and the system allowed patients to upload a pulse oximeter image automatically processed by the PACMAN function to extract oxygen saturation and heart rate values to reduce patient input errors. Methods: This article describes the proposed system and presents a mixed-methods study that evaluated the system’s performance by collecting survey responses from 70 healthcare practitioners and analyzing 14,817 patient records. Results: Approximately 71.4% of healthcare practitioners use the system more than twice daily, with the majority managing 1–10 patients, while 11.4% handle over 101 patients. The progress note is a function that healthcare practitioners most frequently use and are satisfied with. Regarding patient data, 58.9%(8,724/14,817) are male, and 49.7%(7,367/14,817) within the 18 to 34 age range. The average length of isolation was 7.6 days, and patients submitted progress notes twice daily on average. Notably, individuals aged 18 to 34 demonstrated the highest utilization rates for the PACMAN function. Furthermore, most patients, totaling over 95.52%(14,153/14,817), were discharged normally. Conclusion: The findings indicate that CHIVID could be one of the telemedicine solutions for hospitals with patient overflow and healthcare practitioners unfamiliar with telemedicine technology to improve patient care during a critical crisis. Clinical and Translational Impact Statement— CHIVID’s success arises from seamlessly integrating telemedicine into third-party application within a limited timeframe and effectively using clinical decision support systems to address challenges during the COVID-19 crisis.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"390-400"},"PeriodicalIF":3.4,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10472531","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140126520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Dysarthric Speech Segmentation With Emulated and Synthetic Augmentation 利用仿真和合成增强技术改进肢体障害语音分割
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-03-11 DOI: 10.1109/JTEHM.2024.3375323
Saeid Alavi Naeini;Leif Simmatis;Deniz Jafari;Yana Yunusova;Babak Taati
Acoustic features extracted from speech can help with the diagnosis of neurological diseases and monitoring of symptoms over time. Temporal segmentation of audio signals into individual words is an important pre-processing step needed prior to extracting acoustic features. Machine learning techniques could be used to automate speech segmentation via automatic speech recognition (ASR) and sequence to sequence alignment. While state-of-the-art ASR models achieve good performance on healthy speech, their performance significantly drops when evaluated on dysarthric speech. Fine-tuning ASR models on impaired speech can improve performance in dysarthric individuals, but it requires representative clinical data, which is difficult to collect and may raise privacy concerns. This study explores the feasibility of using two augmentation methods to increase ASR performance on dysarthric speech: 1) healthy individuals varying their speaking rate and loudness (as is often used in assessments of pathological speech); 2) synthetic speech with variations in speaking rate and accent (to ensure more diverse vocal representations and fairness). Experimental evaluations showed that fine-tuning a pre-trained ASR model with data from these two sources outperformed a model fine-tuned only on real clinical data and matched the performance of a model fine-tuned on the combination of real clinical data and synthetic speech. When evaluated on held-out acoustic data from 24 individuals with various neurological diseases, the best performing model achieved an average word error rate of 5.7% and a mean correct count accuracy of 94.4%. In segmenting the data into individual words, a mean intersection-over-union of 89.2% was obtained against manual parsing (ground truth). It can be concluded that emulated and synthetic augmentations can significantly reduce the need for real clinical data of dysarthric speech when fine-tuning ASR models and, in turn, for speech segmentation.
从语音中提取声学特征有助于诊断神经系统疾病和监测症状的变化。将音频信号按时间分割成单个单词是提取声学特征前所需的重要预处理步骤。机器学习技术可用于通过自动语音识别(ASR)和序列对序列配准自动进行语音分割。虽然最先进的 ASR 模型在健康语音上取得了良好的性能,但在评估听力障碍语音时,其性能却明显下降。在受损语音上对 ASR 模型进行微调可以提高发育障碍患者的性能,但这需要有代表性的临床数据,而这些数据很难收集,而且可能会引起隐私方面的担忧。本研究探讨了使用两种增强方法提高肢体运动障碍语音的 ASR 性能的可行性:1) 改变健康人的说话速度和响度(病理语音评估中常用的方法);2) 改变说话速度和口音的合成语音(以确保更多样化的声音表现和公平性)。实验评估结果表明,利用这两种来源的数据对预先训练好的 ASR 模型进行微调,其效果优于仅根据真实临床数据进行微调的模型,并且与根据真实临床数据和合成语音组合进行微调的模型效果相当。在对 24 名患有各种神经系统疾病的患者的语音数据进行评估时,表现最好的模型的平均单词错误率为 5.7%,平均正确计数准确率为 94.4%。在将数据分割成单个单词时,与人工解析(地面实况)相比,平均交叉-重合率达到 89.2%。可以得出这样的结论:在微调 ASR 模型时,仿真和合成增强可以大大减少对真实临床语音数据的需求,进而减少对语音分段的需求。
{"title":"Improving Dysarthric Speech Segmentation With Emulated and Synthetic Augmentation","authors":"Saeid Alavi Naeini;Leif Simmatis;Deniz Jafari;Yana Yunusova;Babak Taati","doi":"10.1109/JTEHM.2024.3375323","DOIUrl":"10.1109/JTEHM.2024.3375323","url":null,"abstract":"Acoustic features extracted from speech can help with the diagnosis of neurological diseases and monitoring of symptoms over time. Temporal segmentation of audio signals into individual words is an important pre-processing step needed prior to extracting acoustic features. Machine learning techniques could be used to automate speech segmentation via automatic speech recognition (ASR) and sequence to sequence alignment. While state-of-the-art ASR models achieve good performance on healthy speech, their performance significantly drops when evaluated on dysarthric speech. Fine-tuning ASR models on impaired speech can improve performance in dysarthric individuals, but it requires representative clinical data, which is difficult to collect and may raise privacy concerns. This study explores the feasibility of using two augmentation methods to increase ASR performance on dysarthric speech: 1) healthy individuals varying their speaking rate and loudness (as is often used in assessments of pathological speech); 2) synthetic speech with variations in speaking rate and accent (to ensure more diverse vocal representations and fairness). Experimental evaluations showed that fine-tuning a pre-trained ASR model with data from these two sources outperformed a model fine-tuned only on real clinical data and matched the performance of a model fine-tuned on the combination of real clinical data and synthetic speech. When evaluated on held-out acoustic data from 24 individuals with various neurological diseases, the best performing model achieved an average word error rate of 5.7% and a mean correct count accuracy of 94.4%. In segmenting the data into individual words, a mean intersection-over-union of 89.2% was obtained against manual parsing (ground truth). It can be concluded that emulated and synthetic augmentations can significantly reduce the need for real clinical data of dysarthric speech when fine-tuning ASR models and, in turn, for speech segmentation.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"382-389"},"PeriodicalIF":3.4,"publicationDate":"2024-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10464345","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140105378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multispectral Imaging-Based System for Detecting Tissue Oxygen Saturation With Wound Segmentation for Monitoring Wound Healing 基于多光谱成像的组织氧饱和度检测系统与用于监测伤口愈合的伤口分割技术
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-03-09 DOI: 10.1109/JTEHM.2024.3399232
Chih-Lung Lin;Meng-Hsuan Wu;Yuan-Hao Ho;Fang-Yi Lin;Yu-Hsien Lu;Yuan-Yu Hsueh;Chia-Chen Chen
Objective: Blood circulation is an important indicator of wound healing. In this study, a tissue oxygen saturation detecting (TOSD) system that is based on multispectral imaging (MSI) is proposed to quantify the degree of tissue oxygen saturation (StO2) in cutaneous tissues. Methods: A wound segmentation algorithm is used to segment automatically wound and skin areas, eliminating the need for manual labeling and applying adaptive tissue optics. Animal experiments were conducted on six mice in which they were observed seven times, once every two days. The TOSD system illuminated cutaneous tissues with two wavelengths of light - red ( $mathrm {lambda } = 660$ nm) and near-infrared ( $mathrm {lambda } = 880$ nm), and StO2 levels were calculated using images that were captured using a monochrome camera. The wound segmentation algorithm using ResNet34-based U-Net was integrated with computer vision techniques to improve its performance. Results: Animal experiments revealed that the wound segmentation algorithm achieved a Dice score of 93.49%. The StO2 levels that were determined using the TOSD system varied significantly among the phases of wound healing. Changes in StO2 levels were detected before laser speckle contrast imaging (LSCI) detected changes in blood flux. Moreover, statistical features that were extracted from the TOSD system and LSCI were utilized in principal component analysis (PCA) to visualize different wound healing phases. The average silhouette coefficients of the TOSD system with segmentation (ResNet34-based U-Net) and LSCI were 0.2890 and 0.0194, respectively. Conclusion: By detecting the StO2 levels of cutaneous tissues using the TOSD system with segmentation, the phases of wound healing were accurately distinguished. This method can support medical personnel in conducting precise wound assessments. Clinical and Translational Impact Statement—This study supports efforts in monitoring StO2 levels, wound segmentation, and wound healing phase classification to improve the efficiency and accuracy of preclinical research in the field.
目的:血液循环是伤口愈合的重要指标:血液循环是伤口愈合的重要指标。本研究提出了一种基于多光谱成像(MSI)的组织氧饱和度检测(TOSD)系统,用于量化皮肤组织的组织氧饱和度(StO2)。方法:采用伤口分割算法自动分割伤口和皮肤区域,无需人工标记,并应用自适应组织光学技术。对六只小鼠进行了动物实验,每两天观察一次,共观察七次。TOSD系统用两种波长的光--红光($mathrm {lambda } = 660$ nm)和近红外线($mathrm {lambda } = 880$ nm)照射皮肤组织,并使用单色相机捕捉的图像计算StO2水平。使用基于 ResNet34 的 U-Net 的伤口分割算法与计算机视觉技术相结合,以提高其性能。结果显示动物实验表明,伤口分割算法的 Dice 得分为 93.49%。使用 TOSD 系统测定的 StO2 水平在伤口愈合的不同阶段有显著差异。在激光斑点对比成像(LSCI)检测到血流变化之前,就能检测到 StO2 水平的变化。此外,从 TOSD 系统和 LSCI 提取的统计特征被用于主成分分析(PCA),以直观显示不同的伤口愈合阶段。带有分割功能的 TOSD 系统(基于 ResNet34 的 U-Net)和 LSCI 的平均轮廓系数分别为 0.2890 和 0.0194。结论通过使用带分割功能的 TOSD 系统检测皮肤组织的 StO2 水平,可以准确区分伤口愈合的各个阶段。这种方法可帮助医务人员进行精确的伤口评估。临床和转化影响声明--这项研究为监测 StO2 水平、伤口分割和伤口愈合阶段分类提供了支持,从而提高了该领域临床前研究的效率和准确性。
{"title":"Multispectral Imaging-Based System for Detecting Tissue Oxygen Saturation With Wound Segmentation for Monitoring Wound Healing","authors":"Chih-Lung Lin;Meng-Hsuan Wu;Yuan-Hao Ho;Fang-Yi Lin;Yu-Hsien Lu;Yuan-Yu Hsueh;Chia-Chen Chen","doi":"10.1109/JTEHM.2024.3399232","DOIUrl":"10.1109/JTEHM.2024.3399232","url":null,"abstract":"Objective: Blood circulation is an important indicator of wound healing. In this study, a tissue oxygen saturation detecting (TOSD) system that is based on multispectral imaging (MSI) is proposed to quantify the degree of tissue oxygen saturation (StO2) in cutaneous tissues. Methods: A wound segmentation algorithm is used to segment automatically wound and skin areas, eliminating the need for manual labeling and applying adaptive tissue optics. Animal experiments were conducted on six mice in which they were observed seven times, once every two days. The TOSD system illuminated cutaneous tissues with two wavelengths of light - red (\u0000<inline-formula> <tex-math>$mathrm {lambda } = 660$ </tex-math></inline-formula>\u0000 nm) and near-infrared (\u0000<inline-formula> <tex-math>$mathrm {lambda } = 880$ </tex-math></inline-formula>\u0000 nm), and StO2 levels were calculated using images that were captured using a monochrome camera. The wound segmentation algorithm using ResNet34-based U-Net was integrated with computer vision techniques to improve its performance. Results: Animal experiments revealed that the wound segmentation algorithm achieved a Dice score of 93.49%. The StO2 levels that were determined using the TOSD system varied significantly among the phases of wound healing. Changes in StO2 levels were detected before laser speckle contrast imaging (LSCI) detected changes in blood flux. Moreover, statistical features that were extracted from the TOSD system and LSCI were utilized in principal component analysis (PCA) to visualize different wound healing phases. The average silhouette coefficients of the TOSD system with segmentation (ResNet34-based U-Net) and LSCI were 0.2890 and 0.0194, respectively. Conclusion: By detecting the StO2 levels of cutaneous tissues using the TOSD system with segmentation, the phases of wound healing were accurately distinguished. This method can support medical personnel in conducting precise wound assessments. Clinical and Translational Impact Statement—This study supports efforts in monitoring StO2 levels, wound segmentation, and wound healing phase classification to improve the efficiency and accuracy of preclinical research in the field.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"468-479"},"PeriodicalIF":3.4,"publicationDate":"2024-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10528306","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925707","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weakly-Supervised Segmentation-Based Quantitative Characterization of Pulmonary Cavity Lesions in CT Scans 基于弱监督分割的 CT 扫描肺腔病变定量特征描述
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-03-09 DOI: 10.1109/JTEHM.2024.3399261
Wenyu Xing;Yanping Yang;Yannan Zhou;Tao Jiang;Yifang Li;Yuanlin Song;Dongni Hou;Dean TA
Objective: Pulmonary cavity lesion is one of the commonly seen lesions in lung caused by a variety of malignant and non-malignant diseases. Diagnosis of a cavity lesion is commonly based on accurate recognition of the typical morphological characteristics. A deep learning-based model to automatically detect, segment, and quantify the region of cavity lesion on CT scans has potential in clinical diagnosis, monitoring, and treatment efficacy assessment. Methods: A weakly-supervised deep learning-based method named CSA2-ResNet was proposed to quantitatively characterize cavity lesions in this paper. The lung parenchyma was firstly segmented using a pretrained 2D segmentation model, and then the output with or without cavity lesions was fed into the developed deep neural network containing hybrid attention modules. Next, the visualized lesion was generated from the activation region of the classification network using gradient-weighted class activation mapping, and image processing was applied for post-processing to obtain the expected segmentation results of cavity lesions. Finally, the automatic characteristic measurement of cavity lesions (e.g., area and thickness) was developed and verified. Results: the proposed weakly-supervised segmentation method achieved an accuracy, precision, specificity, recall, and F1-score of 98.48%, 96.80%, 97.20%, 100%, and 98.36%, respectively. There is a significant improvement (P < 0.05) compared to other methods. Quantitative characterization of morphology also obtained good analysis effects. Conclusions: The proposed easily-trained and high-performance deep learning model provides a fast and effective way for the diagnosis and dynamic monitoring of pulmonary cavity lesions in clinic. Clinical and Translational Impact Statement: This model used artificial intelligence to achieve the detection and quantitative analysis of pulmonary cavity lesions in CT scans. The morphological features revealed in experiments can be utilized as potential indicators for diagnosis and dynamic monitoring of patients with cavity lesions
目的:肺空洞病变是肺部常见病变之一,由多种恶性和非恶性疾病引起。空洞病变的诊断通常基于对典型形态特征的准确识别。基于深度学习的模型可自动检测、分割和量化 CT 扫描中的空洞病变区域,在临床诊断、监测和疗效评估方面具有潜力。研究方法本文提出了一种基于弱监督深度学习的方法,名为 CSA2-ResNet,用于定量描述空洞病变。首先使用预训练的二维分割模型对肺实质进行分割,然后将有无空洞病变的输出结果输入所开发的包含混合注意力模块的深度神经网络。接着,利用梯度加权类激活映射从分类网络的激活区域生成可视化病灶,并进行图像后处理,以获得预期的空洞病灶分割结果。最后,开发并验证了空洞病变的自动特征测量(如面积和厚度)。结果:所提出的弱监督分割方法的准确度、精确度、特异性、召回率和 F1 分数分别达到了 98.48%、96.80%、97.20%、100% 和 98.36%。与其他方法相比,有了明显的提高(P < 0.05)。形态的定量表征也获得了良好的分析效果。结论所提出的易于训练的高性能深度学习模型为临床诊断和动态监测肺空洞病变提供了一种快速有效的方法。临床与转化影响声明:该模型利用人工智能实现了对CT扫描中肺部空洞病变的检测和定量分析。实验中揭示的形态特征可作为诊断和动态监测肺空洞病变患者的潜在指标。
{"title":"Weakly-Supervised Segmentation-Based Quantitative Characterization of Pulmonary Cavity Lesions in CT Scans","authors":"Wenyu Xing;Yanping Yang;Yannan Zhou;Tao Jiang;Yifang Li;Yuanlin Song;Dongni Hou;Dean TA","doi":"10.1109/JTEHM.2024.3399261","DOIUrl":"10.1109/JTEHM.2024.3399261","url":null,"abstract":"Objective: Pulmonary cavity lesion is one of the commonly seen lesions in lung caused by a variety of malignant and non-malignant diseases. Diagnosis of a cavity lesion is commonly based on accurate recognition of the typical morphological characteristics. A deep learning-based model to automatically detect, segment, and quantify the region of cavity lesion on CT scans has potential in clinical diagnosis, monitoring, and treatment efficacy assessment. Methods: A weakly-supervised deep learning-based method named CSA2-ResNet was proposed to quantitatively characterize cavity lesions in this paper. The lung parenchyma was firstly segmented using a pretrained 2D segmentation model, and then the output with or without cavity lesions was fed into the developed deep neural network containing hybrid attention modules. Next, the visualized lesion was generated from the activation region of the classification network using gradient-weighted class activation mapping, and image processing was applied for post-processing to obtain the expected segmentation results of cavity lesions. Finally, the automatic characteristic measurement of cavity lesions (e.g., area and thickness) was developed and verified. Results: the proposed weakly-supervised segmentation method achieved an accuracy, precision, specificity, recall, and F1-score of 98.48%, 96.80%, 97.20%, 100%, and 98.36%, respectively. There is a significant improvement (P < 0.05) compared to other methods. Quantitative characterization of morphology also obtained good analysis effects. Conclusions: The proposed easily-trained and high-performance deep learning model provides a fast and effective way for the diagnosis and dynamic monitoring of pulmonary cavity lesions in clinic. Clinical and Translational Impact Statement: This model used artificial intelligence to achieve the detection and quantitative analysis of pulmonary cavity lesions in CT scans. The morphological features revealed in experiments can be utilized as potential indicators for diagnosis and dynamic monitoring of patients with cavity lesions","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"457-467"},"PeriodicalIF":3.4,"publicationDate":"2024-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10528288","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of Non-Sustained Supraventricular Tachycardia in Atrial Fibrillation Screening 检测心房颤动筛查中的非持续性室上性心动过速
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-03-07 DOI: 10.1109/JTEHM.2024.3397739
Hesam Halvaei;Tove Hygrell;Emma Svennberg;Valentina D.A. Corino;Leif Sörnmo;Martin Stridh
Objective: Non-sustained supraventricular tachycardia (nsSVT) is associated with a higher risk of developing atrial fibrillation (AF), and, therefore, detection of nsSVT can improve AF screening efficiency. However, the detection is challenged by the lower signal quality of ECGs recorded using handheld devices and the presence of ectopic beats which may mimic the rhythm characteristics of nsSVT.Methods: The present study introduces a new nsSVT detector for use in single-lead, 30-s ECGs, based on the assumption that beats in an nsSVT episode exhibits similar morphology, implying that episodes with beats of deviating morphology, either due to ectopic beats or noise/artifacts, are excluded. A support vector machine is used to classify successive 5-beat sequences in a sliding window with respect to similar morphology. Due to the lack of adequate training data, the classifier is trained using simulated ECGs with varying signal-to-noise ratio. In a subsequent step, a set of rhythm criteria is applied to similar beat sequences to ensure that episode duration and heart rate is acceptable.Results: The performance of the proposed detector is evaluated using the StrokeStop II database, resulting in sensitivity, specificity, and positive predictive value of 84.6%, 99.4%, and 18.5%, respectively. Conclusion: The results show that a significant reduction in expert review burden (factor of 6) can be achieved using the proposed detector.Clinical and Translational Impact: The reduction in the expert review burden shows that nsSVT detection in AF screening can be made considerably more efficiently.
目的:非持续性室上性心动过速(nsSVT非持续性室上性心动过速(nsSVT)与罹患心房颤动(AF)的高风险相关,因此,检测 nsSVT 可以提高 AF 筛查的效率。然而,使用手持设备记录的心电图信号质量较低,异位搏动的存在可能会模仿 nsSVT 的节律特征,这给检测带来了挑战:本研究介绍了一种用于单导联 30 秒心电图的新型 nsSVT 检测器,该检测器基于以下假设:nsSVT 事件中的搏动表现出相似的形态,这意味着由于异位搏动或噪声/伪影导致的搏动形态偏差的事件将被排除在外。支持向量机用于对滑动窗口中的连续 5 次搏动序列进行形态相似性分类。由于缺乏足够的训练数据,分类器使用信噪比不同的模拟心电图进行训练。在随后的步骤中,一组节律标准被应用于相似的节拍序列,以确保发作持续时间和心率是可接受的:结果:使用 StrokeStop II 数据库对所提出的检测器的性能进行了评估,结果显示灵敏度、特异性和阳性预测值分别为 84.6%、99.4% 和 18.5%。结论结果表明,使用所提出的检测器可以显著减轻专家的审核负担(系数为 6):临床和转化影响:专家审核负担的减轻表明,房颤筛查中的 nsSVT 检测可以大大提高效率。
{"title":"Detection of Non-Sustained Supraventricular Tachycardia in Atrial Fibrillation Screening","authors":"Hesam Halvaei;Tove Hygrell;Emma Svennberg;Valentina D.A. Corino;Leif Sörnmo;Martin Stridh","doi":"10.1109/JTEHM.2024.3397739","DOIUrl":"10.1109/JTEHM.2024.3397739","url":null,"abstract":"Objective: Non-sustained supraventricular tachycardia (nsSVT) is associated with a higher risk of developing atrial fibrillation (AF), and, therefore, detection of nsSVT can improve AF screening efficiency. However, the detection is challenged by the lower signal quality of ECGs recorded using handheld devices and the presence of ectopic beats which may mimic the rhythm characteristics of nsSVT.Methods: The present study introduces a new nsSVT detector for use in single-lead, 30-s ECGs, based on the assumption that beats in an nsSVT episode exhibits similar morphology, implying that episodes with beats of deviating morphology, either due to ectopic beats or noise/artifacts, are excluded. A support vector machine is used to classify successive 5-beat sequences in a sliding window with respect to similar morphology. Due to the lack of adequate training data, the classifier is trained using simulated ECGs with varying signal-to-noise ratio. In a subsequent step, a set of rhythm criteria is applied to similar beat sequences to ensure that episode duration and heart rate is acceptable.Results: The performance of the proposed detector is evaluated using the StrokeStop II database, resulting in sensitivity, specificity, and positive predictive value of 84.6%, 99.4%, and 18.5%, respectively. Conclusion: The results show that a significant reduction in expert review burden (factor of 6) can be achieved using the proposed detector.Clinical and Translational Impact: The reduction in the expert review burden shows that nsSVT detection in AF screening can be made considerably more efficiently.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"480-487"},"PeriodicalIF":3.4,"publicationDate":"2024-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10521724","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Dual-Camera Eye-Tracking Platform for Rapid Real-Time Diagnosis of Acute Delirium: A Pilot Study 用于快速实时诊断急性谵妄的双摄像头眼动仪平台:一项试点研究
IF 3.7 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-03-07 DOI: 10.1109/JTEHM.2024.3397737
Ahmed Al-Hindawi;Marcela Vizcaychipi;Yiannis Demiris
Objective: Delirium, an acute confusional state, affects 20-80% of patients in Intensive Care Units (ICUs), one in three medically hospitalized patients, and up to 50% of all patients who have had surgery. Its development is associated with short- and long-term morbidity, and increased risk of death. Yet, we lack any rapid, objective, and automated method to diagnose delirium. Here, we detail the prospective deployment of a novel dual-camera contextual eye-tracking platform. We then use the data from this platform to contemporaneously classify delirium.Results: We recruited 42 patients, resulting in 210 (114 with delirium, 96 without) recordings of hospitalized patients in ICU across two centers, as part of a prospective multi-center feasibility pilot study. All recordings made with our platform were usable for analysis. We divided the collected data into training and validation cohorts based on the data originating center. We trained two Temporal Convolutional Network (TCN) models that can classify delirium using a pre-existing manual scoring system (Confusion Assessment Method in ICU (CAM-ICU)) as the training target. The first model uses eye movements only which achieves an Area Under the Receiver Operator Curve (AUROC) of 0.67 and a mean Average Precision (mAP) of 0.68. The second model uses the point of regard, the part of the scene the patient is looking at, and increases the AUROC to 0.76 and the mAP to 0.81. These models are the first to classify delirium using continuous non-invasive eye-tracking but will require further clinical prospective validation prior to use as a decision-support tool.Clinical impact: Eye-tracking is a biological signal that can be used to identify delirium in patients in ICU. The platform, alongside the trained neural networks, can automatically, objectively, and continuously classify delirium aiding in the early detection of the deteriorating patient. Future work is aimed at prospective evaluation and clinical translation.
目的:谵妄是一种急性精神错乱状态,影响着重症监护病房(ICU)20%-80% 的病人,每三名住院病人中就有一名谵妄患者,而在所有接受过手术的病人中,谵妄患者的比例高达 50%。它的发生与短期和长期发病率有关,并增加了死亡风险。然而,我们缺乏快速、客观和自动化的谵妄诊断方法。在此,我们详细介绍了新型双摄像头情境眼动追踪平台的前瞻性部署。然后,我们利用该平台的数据对谵妄进行实时分类:作为前瞻性多中心可行性试点研究的一部分,我们招募了 42 名患者,在两个中心对重症监护室的住院患者进行了 210 次(114 次有谵妄,96 次无谵妄)记录。使用我们的平台进行的所有录音均可用于分析。我们根据数据来源中心将收集到的数据分为训练组和验证组。我们训练了两个时序卷积网络(TCN)模型,这两个模型可以使用已有的人工评分系统(ICU 混乱评估方法(CAM-ICU))作为训练目标,对谵妄进行分类。第一个模型仅使用眼球运动,其受体运算曲线下面积 (AUROC) 为 0.67,平均精度 (mAP) 为 0.68。第二个模型使用视点,即患者正在注视的场景部分,将 AUROC 提高到 0.76,将 mAP 提高到 0.81。这些模型是首个利用连续无创眼动追踪技术对谵妄进行分类的模型,但在用作决策支持工具之前还需要进一步的临床前瞻性验证:临床影响:眼球追踪是一种生物信号,可用于识别重症监护病房患者的谵妄。该平台和训练有素的神经网络可自动、客观、持续地对谵妄进行分类,有助于及早发现病情恶化的病人。未来的工作旨在进行前瞻性评估和临床转化。
{"title":"A Dual-Camera Eye-Tracking Platform for Rapid Real-Time Diagnosis of Acute Delirium: A Pilot Study","authors":"Ahmed Al-Hindawi;Marcela Vizcaychipi;Yiannis Demiris","doi":"10.1109/JTEHM.2024.3397737","DOIUrl":"10.1109/JTEHM.2024.3397737","url":null,"abstract":"Objective: Delirium, an acute confusional state, affects 20-80% of patients in Intensive Care Units (ICUs), one in three medically hospitalized patients, and up to 50% of all patients who have had surgery. Its development is associated with short- and long-term morbidity, and increased risk of death. Yet, we lack any rapid, objective, and automated method to diagnose delirium. Here, we detail the prospective deployment of a novel dual-camera contextual eye-tracking platform. We then use the data from this platform to contemporaneously classify delirium.Results: We recruited 42 patients, resulting in 210 (114 with delirium, 96 without) recordings of hospitalized patients in ICU across two centers, as part of a prospective multi-center feasibility pilot study. All recordings made with our platform were usable for analysis. We divided the collected data into training and validation cohorts based on the data originating center. We trained two Temporal Convolutional Network (TCN) models that can classify delirium using a pre-existing manual scoring system (Confusion Assessment Method in ICU (CAM-ICU)) as the training target. The first model uses eye movements only which achieves an Area Under the Receiver Operator Curve (AUROC) of 0.67 and a mean Average Precision (mAP) of 0.68. The second model uses the point of regard, the part of the scene the patient is looking at, and increases the AUROC to 0.76 and the mAP to 0.81. These models are the first to classify delirium using continuous non-invasive eye-tracking but will require further clinical prospective validation prior to use as a decision-support tool.Clinical impact: Eye-tracking is a biological signal that can be used to identify delirium in patients in ICU. The platform, alongside the trained neural networks, can automatically, objectively, and continuously classify delirium aiding in the early detection of the deteriorating patient. Future work is aimed at prospective evaluation and clinical translation.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"488-498"},"PeriodicalIF":3.7,"publicationDate":"2024-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10521720","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140925709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acoustic and Text Features Analysis for Adult ADHD Screening: A Data-Driven Approach Utilizing DIVA Interview 用于成人多动症筛查的声音和文本特征分析:利用 DIVA 访谈的数据驱动方法
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-02-26 DOI: 10.1109/JTEHM.2024.3369764
Shuanglin Li;Rajesh Nair;Syed Mohsen Naqvi
Attention Deficit Hyperactivity Disorder (ADHD) is a neurodevelopmental disorder commonly seen in childhood that leads to behavioural changes in social development and communication patterns, often continues into undiagnosed adulthood due to a global shortage of psychiatrists, resulting in delayed diagnoses with lasting consequences on individual’s well-being and the societal impact. Recently, machine learning methodologies have been incorporated into healthcare systems to facilitate the diagnosis and enhance the potential prediction of treatment outcomes for mental health conditions. In ADHD detection, the previous research focused on utilizing functional magnetic resonance imaging (fMRI) or Electroencephalography (EEG) signals, which require costly equipment and trained personnel for data collection. In recent years, speech and text modalities have garnered increasing attention due to their cost-effectiveness and non-wearable sensing in data collection. In this research, conducted in collaboration with the Cumbria, Northumberland, Tyne and Wear NHS Foundation Trust, we gathered audio data from both ADHD patients and normal controls based on the clinically popular Diagnostic Interview for ADHD in adults (DIVA). Subsequently, we transformed the speech data into text modalities through the utilization of the Google Cloud Speech API. We extracted both acoustic and text features from the data, encompassing traditional acoustic features (e.g., MFCC), specialized feature sets (e.g., eGeMAPS), as well as deep-learned linguistic and semantic features derived from pre-trained deep learning models. These features are employed in conjunction with a support vector machine for ADHD classification, yielding promising outcomes in the utilization of audio and text data for effective adult ADHD screening. Clinical impact: This research introduces a transformative approach in ADHD diagnosis, employing speech and text analysis to facilitate early and more accessible detection, particularly beneficial in areas with limited psychiatric resources. Clinical and Translational Impact Statement: The successful application of machine learning techniques in analyzing audio and text data for ADHD screening represents a significant advancement in mental health diagnostics, paving the way for its integration into clinical settings and potentially improving patient outcomes on a broader scale.
注意力缺陷多动障碍(ADHD)是一种常见于儿童期的神经发育障碍,会导致社交发展和沟通模式的行为改变,由于全球精神科医生短缺,这种障碍往往会持续到成年而得不到诊断,导致诊断延迟,对个人福祉和社会影响造成持久后果。最近,机器学习方法已被纳入医疗保健系统,以促进诊断并增强对精神健康状况治疗结果的潜在预测。在多动症检测方面,以往的研究侧重于利用功能磁共振成像(fMRI)或脑电图(EEG)信号,这需要昂贵的设备和训练有素的人员来收集数据。近年来,语音和文本模式因其成本效益高且在数据收集过程中不需要穿戴感应设备而受到越来越多的关注。在这项与坎布里亚、诺森伯兰、泰恩和威尔国家医疗服务系统基金会合作进行的研究中,我们根据临床上流行的成人多动症诊断访谈(DIVA),收集了多动症患者和正常对照组的音频数据。随后,我们利用谷歌云语音应用程序接口(Google Cloud Speech API)将语音数据转换为文本模式。我们从数据中提取了声学和文本特征,包括传统的声学特征(如 MFCC)、专业特征集(如 eGeMAPS),以及从预先训练的深度学习模型中提取的深度学习语言和语义特征。这些特征与支持向量机一起用于多动症分类,在利用音频和文本数据进行有效的成人多动症筛查方面取得了可喜的成果。临床影响:这项研究在多动症诊断中引入了一种变革性方法,利用语音和文本分析促进早期和更方便的检测,尤其有利于精神科资源有限的地区。临床和转化影响声明:在多动症筛查中成功应用机器学习技术分析音频和文本数据,是心理健康诊断领域的一大进步,为其融入临床环境铺平了道路,并有可能在更大范围内改善患者的治疗效果。
{"title":"Acoustic and Text Features Analysis for Adult ADHD Screening: A Data-Driven Approach Utilizing DIVA Interview","authors":"Shuanglin Li;Rajesh Nair;Syed Mohsen Naqvi","doi":"10.1109/JTEHM.2024.3369764","DOIUrl":"10.1109/JTEHM.2024.3369764","url":null,"abstract":"Attention Deficit Hyperactivity Disorder (ADHD) is a neurodevelopmental disorder commonly seen in childhood that leads to behavioural changes in social development and communication patterns, often continues into undiagnosed adulthood due to a global shortage of psychiatrists, resulting in delayed diagnoses with lasting consequences on individual’s well-being and the societal impact. Recently, machine learning methodologies have been incorporated into healthcare systems to facilitate the diagnosis and enhance the potential prediction of treatment outcomes for mental health conditions. In ADHD detection, the previous research focused on utilizing functional magnetic resonance imaging (fMRI) or Electroencephalography (EEG) signals, which require costly equipment and trained personnel for data collection. In recent years, speech and text modalities have garnered increasing attention due to their cost-effectiveness and non-wearable sensing in data collection. In this research, conducted in collaboration with the Cumbria, Northumberland, Tyne and Wear NHS Foundation Trust, we gathered audio data from both ADHD patients and normal controls based on the clinically popular Diagnostic Interview for ADHD in adults (DIVA). Subsequently, we transformed the speech data into text modalities through the utilization of the Google Cloud Speech API. We extracted both acoustic and text features from the data, encompassing traditional acoustic features (e.g., MFCC), specialized feature sets (e.g., eGeMAPS), as well as deep-learned linguistic and semantic features derived from pre-trained deep learning models. These features are employed in conjunction with a support vector machine for ADHD classification, yielding promising outcomes in the utilization of audio and text data for effective adult ADHD screening. Clinical impact: This research introduces a transformative approach in ADHD diagnosis, employing speech and text analysis to facilitate early and more accessible detection, particularly beneficial in areas with limited psychiatric resources. Clinical and Translational Impact Statement: The successful application of machine learning techniques in analyzing audio and text data for ADHD screening represents a significant advancement in mental health diagnostics, paving the way for its integration into clinical settings and potentially improving patient outcomes on a broader scale.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"359-370"},"PeriodicalIF":3.4,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10445184","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139980103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Wavelet-Based Approach for Motion Artifact Reduction in Ambulatory Seismocardiography 基于小波的动态心动图运动伪影消除方法
IF 3.4 3区 医学 Q2 ENGINEERING, BIOMEDICAL Pub Date : 2024-02-20 DOI: 10.1109/JTEHM.2024.3368291
James Skoric;Yannick D’Mello;David V. Plant
Wearable sensing has become a vital approach to cardiac health monitoring, and seismocardiography (SCG) is emerging as a promising technology in this field. However, the applicability of SCG is hindered by motion artifacts, including those encountered in practice of which the strongest source is walking. This holds back the translation of SCG to clinical settings. We therefore investigated techniques to enhance the quality of SCG signals in the presence of motion artifacts. To simulate ambulant recordings, we corrupted a clean SCG dataset with real-walking-vibrational noise. We decomposed the signal using several empirical-mode-decomposition methods and the maximum overlap discrete wavelet transform (MODWT). By combining MODWT, time-frequency masking, and nonnegative matrix factorization, we developed a novel algorithm which leveraged the vertical axis accelerometer to reduce walking vibrations in dorsoventral SCG. The accuracy and applicability of our method was verified using heart rate estimation. We used an interactive selection approach to improve estimation accuracy. The best decomposition method for reduction of motion artifact noise was the MODWT. Our algorithm improved heart rate estimation from 0.1 to 0.8 r-squared at −15 dB signal-to-noise ratio (SNR). Our method reduces motion artifacts in SCG signals up to a SNR of −19 dB without requiring any external assistance from electrocardiography (ECG). Such a standalone solution is directly applicable to the usage of SCG in daily life, as a content-rich replacement for other wearables in clinical settings, and other continuous monitoring scenarios. In applications with higher noise levels, ECG may be incorporated to further enhance SCG and extend its usable range. This work addresses the challenges posed by motion artifacts, enabling SCG to offer reliable cardiovascular insights in more difficult scenarios, and thereby facilitating wearable monitoring in daily life and the clinic.
可穿戴传感技术已成为心脏健康监测的重要方法,而地震心动图(SCG)正成为该领域一项前景广阔的技术。然而,运动伪影阻碍了地震心动图的应用,包括在实践中遇到的运动伪影,其中最主要的来源是行走。这阻碍了 SCG 在临床环境中的应用。因此,我们研究了在存在运动伪影的情况下提高 SCG 信号质量的技术。为了模拟伏卧记录,我们用真实行走振动噪声破坏了一个干净的 SCG 数据集。我们使用几种经验模式分解方法和最大重叠离散小波变换(MODWT)对信号进行分解。通过结合 MODWT、时频掩蔽和非负矩阵因式分解,我们开发出了一种新型算法,该算法利用垂直轴加速度计来减少 SCG 背腹部的行走振动。我们使用心率估算验证了该方法的准确性和适用性。我们采用交互式选择方法来提高估算的准确性。减少运动伪噪声的最佳分解方法是 MODWT。在信噪比(SNR)为-15 dB的情况下,我们的算法将心率估计值的r平方从0.1提高到0.8。我们的方法可将 SCG 信号中的运动伪影降低至信噪比为 -19 dB,而无需心电图(ECG)的任何外部辅助。这种独立的解决方案可直接应用于日常生活中的 SCG 使用,在临床环境中作为其他可穿戴设备的内容丰富的替代品,以及其他连续监测场景。在噪声水平较高的应用中,可结合心电图进一步增强 SCG 并扩大其可用范围。这项工作解决了运动伪影带来的挑战,使 SCG 能够在更困难的情况下提供可靠的心血管见解,从而促进日常生活和临床中的可穿戴监测。
{"title":"A Wavelet-Based Approach for Motion Artifact Reduction in Ambulatory Seismocardiography","authors":"James Skoric;Yannick D’Mello;David V. Plant","doi":"10.1109/JTEHM.2024.3368291","DOIUrl":"10.1109/JTEHM.2024.3368291","url":null,"abstract":"Wearable sensing has become a vital approach to cardiac health monitoring, and seismocardiography (SCG) is emerging as a promising technology in this field. However, the applicability of SCG is hindered by motion artifacts, including those encountered in practice of which the strongest source is walking. This holds back the translation of SCG to clinical settings. We therefore investigated techniques to enhance the quality of SCG signals in the presence of motion artifacts. To simulate ambulant recordings, we corrupted a clean SCG dataset with real-walking-vibrational noise. We decomposed the signal using several empirical-mode-decomposition methods and the maximum overlap discrete wavelet transform (MODWT). By combining MODWT, time-frequency masking, and nonnegative matrix factorization, we developed a novel algorithm which leveraged the vertical axis accelerometer to reduce walking vibrations in dorsoventral SCG. The accuracy and applicability of our method was verified using heart rate estimation. We used an interactive selection approach to improve estimation accuracy. The best decomposition method for reduction of motion artifact noise was the MODWT. Our algorithm improved heart rate estimation from 0.1 to 0.8 r-squared at −15 dB signal-to-noise ratio (SNR). Our method reduces motion artifacts in SCG signals up to a SNR of −19 dB without requiring any external assistance from electrocardiography (ECG). Such a standalone solution is directly applicable to the usage of SCG in daily life, as a content-rich replacement for other wearables in clinical settings, and other continuous monitoring scenarios. In applications with higher noise levels, ECG may be incorporated to further enhance SCG and extend its usable range. This work addresses the challenges posed by motion artifacts, enabling SCG to offer reliable cardiovascular insights in more difficult scenarios, and thereby facilitating wearable monitoring in daily life and the clinic.","PeriodicalId":54255,"journal":{"name":"IEEE Journal of Translational Engineering in Health and Medicine-Jtehm","volume":"12 ","pages":"348-358"},"PeriodicalIF":3.4,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10440609","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139953785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Journal of Translational Engineering in Health and Medicine-Jtehm
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1