首页 > 最新文献

2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)最新文献

英文 中文
COVID-19 Detection Exploiting Self-Supervised Learning Representations of Respiratory Sounds 利用呼吸声音的自监督学习表征检测COVID-19
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926967
Adria Mallol-Ragolta, Shuo Liu, B. Schuller
In this work, we focus on the automatic detection of COVID-19 patients from the analysis of cough, breath, and speech samples. Our goal is to investigate the suitability of Self-Supervised Learning (SSL) representations extracted using Wav2Vec 2.0 for the task at hand. For this, in addition to the SSL representations, the models trained exploit the Low-Level Descriptors (LLD) of the eGeMAPS feature set, and Mel-spectrogram coefficients. The extracted representations are analysed using Convolutional Neural Networks (CNN) reinforced with contextual attention. Our experiments are performed using the data released as part of the Second Diagnosing COVID-19 using Acoustics (DiCOVA) Challenge, and we use the Area Under the Curve (AUC) as the evaluation metric. When using the CNNs without contextual attention, the multi-type model exploiting the SSL Wav2Vec 2.0 representations from the cough, breath, and speech sounds scores the highest AUC, 80.37 %. When reinforcing the embedded representations learnt with contextual attention, the AUC obtained using this same model slightly decreases to 80.01 %. The best performance on the test set is obtained with a multi-type model fusing the embedded representations extracted from the LLDs of the cough, breath, and speech samples and reinforced using contextual attention, scoring an AUC of 81.27 %.
在这项工作中,我们专注于从咳嗽、呼吸和语音样本的分析中自动检测COVID-19患者。我们的目标是研究使用Wav2Vec 2.0提取的自监督学习(SSL)表示对手头任务的适用性。为此,除了SSL表示之外,训练的模型还利用了eGeMAPS特征集的低级描述符(LLD)和mel谱图系数。提取的表征使用卷积神经网络(CNN)与上下文注意增强进行分析。我们的实验是使用作为第二次使用声学诊断COVID-19 (DiCOVA)挑战的一部分发布的数据进行的,我们使用曲线下面积(AUC)作为评估指标。当使用没有上下文关注的cnn时,利用来自咳嗽、呼吸和语音的SSL Wav2Vec 2.0表示的多类型模型的AUC得分最高,为80.37%。当用上下文注意强化学习到的嵌入表征时,使用同一模型获得的AUC略有下降至80.01%。多类型模型融合了从咳嗽、呼吸和语音样本的lld中提取的嵌入表征,并使用上下文注意进行强化,在测试集中获得了最佳性能,AUC为81.27%。
{"title":"COVID-19 Detection Exploiting Self-Supervised Learning Representations of Respiratory Sounds","authors":"Adria Mallol-Ragolta, Shuo Liu, B. Schuller","doi":"10.1109/BHI56158.2022.9926967","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926967","url":null,"abstract":"In this work, we focus on the automatic detection of COVID-19 patients from the analysis of cough, breath, and speech samples. Our goal is to investigate the suitability of Self-Supervised Learning (SSL) representations extracted using Wav2Vec 2.0 for the task at hand. For this, in addition to the SSL representations, the models trained exploit the Low-Level Descriptors (LLD) of the eGeMAPS feature set, and Mel-spectrogram coefficients. The extracted representations are analysed using Convolutional Neural Networks (CNN) reinforced with contextual attention. Our experiments are performed using the data released as part of the Second Diagnosing COVID-19 using Acoustics (DiCOVA) Challenge, and we use the Area Under the Curve (AUC) as the evaluation metric. When using the CNNs without contextual attention, the multi-type model exploiting the SSL Wav2Vec 2.0 representations from the cough, breath, and speech sounds scores the highest AUC, 80.37 %. When reinforcing the embedded representations learnt with contextual attention, the AUC obtained using this same model slightly decreases to 80.01 %. The best performance on the test set is obtained with a multi-type model fusing the embedded representations extracted from the LLDs of the cough, breath, and speech samples and reinforced using contextual attention, scoring an AUC of 81.27 %.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124399190","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Machine Learning-based Detection of In-Utero Fetal Presentation from Non-Invasive Fetal ECG 基于机器学习的无创胎儿心电图宫内胎儿表现检测
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926804
Katharina M. Jaeger, Michael Nissen, R. Richer, Simone Rahm, Adriana Titzmann, P. Fasching, Bjoern M. Eskofier, Heike Leutheuser
Preterm births account for more than 10 % of all newborns. An adverse fetal presentation is a risk factor for intrapartum and neonatal mortality. To date, no technology enables a longitudinal, ubiquitous, and unobtrusive monitoring of fetal presentation. This study presents a first approach to fetal orientation detection based on non-invasive fetal electrocardiography (NI-fECG) using the non-invasive multi-modal foetal ECG-Doppler data set for antenatal cardiology research. The data set contains 60 recordings from 39 pregnant women (21–27 weeks), including NI-fECG and ultrasound position ground truth. We evaluated both handcrafted and generic features for five different classifiers (k-Nearest-Neighbor, Decision Tree Classifier, Support Vector Classification, AdaBoost Classifier, and Multilayer Perceptron) using cross-validation on subject splits on a cleaned subset. Best results for the distinction between vertex (head down) and breech (head up) were achieved using an AdaBoost classifier with a balanced accuracy of 86.5 ± 15.0 %. With this work, we take a first step towards longitudinal fetal presentation monitoring, which contributes to a better understanding of reduced fetal movements and extends the potential applications of NI-fECG in prenatal care. In future work, we will expand our classification system to detect more detailed fetal presentations using a newly created data set.
早产占所有新生儿的10%以上。不良胎儿呈现是产时和新生儿死亡的危险因素。到目前为止,还没有一种技术能够对胎儿的表现进行纵向的、无所不在的、不显眼的监测。本研究提出了一种基于无创胎儿心电图(NI-fECG)的胎儿取向检测方法,该方法使用无创多模态胎儿心电图多普勒数据集用于产前心脏病学研究。数据集包含39名孕妇(21-27周)的60条记录,包括NI-fECG和超声位置地面真实值。我们对五种不同分类器(k-Nearest-Neighbor, Decision Tree Classifier, Support Vector Classification, AdaBoost Classifier和Multilayer Perceptron)的手工特征和通用特征进行了评估,并对清理后的子集上的主题分割进行了交叉验证。使用AdaBoost分类器区分顶点(头部向下)和后臀(头部向上)的最佳结果,平衡精度为86.5±15.0%。通过这项工作,我们向纵向胎儿呈现监测迈出了第一步,这有助于更好地了解胎儿运动减少,并扩展NI-fECG在产前护理中的潜在应用。在未来的工作中,我们将扩展我们的分类系统,使用新创建的数据集来检测更详细的胎儿表现。
{"title":"Machine Learning-based Detection of In-Utero Fetal Presentation from Non-Invasive Fetal ECG","authors":"Katharina M. Jaeger, Michael Nissen, R. Richer, Simone Rahm, Adriana Titzmann, P. Fasching, Bjoern M. Eskofier, Heike Leutheuser","doi":"10.1109/BHI56158.2022.9926804","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926804","url":null,"abstract":"Preterm births account for more than 10 % of all newborns. An adverse fetal presentation is a risk factor for intrapartum and neonatal mortality. To date, no technology enables a longitudinal, ubiquitous, and unobtrusive monitoring of fetal presentation. This study presents a first approach to fetal orientation detection based on non-invasive fetal electrocardiography (NI-fECG) using the non-invasive multi-modal foetal ECG-Doppler data set for antenatal cardiology research. The data set contains 60 recordings from 39 pregnant women (21–27 weeks), including NI-fECG and ultrasound position ground truth. We evaluated both handcrafted and generic features for five different classifiers (k-Nearest-Neighbor, Decision Tree Classifier, Support Vector Classification, AdaBoost Classifier, and Multilayer Perceptron) using cross-validation on subject splits on a cleaned subset. Best results for the distinction between vertex (head down) and breech (head up) were achieved using an AdaBoost classifier with a balanced accuracy of 86.5 ± 15.0 %. With this work, we take a first step towards longitudinal fetal presentation monitoring, which contributes to a better understanding of reduced fetal movements and extends the potential applications of NI-fECG in prenatal care. In future work, we will expand our classification system to detect more detailed fetal presentations using a newly created data set.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"85 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130321269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance vs. Privacy: Evaluating the Performance of Predicting Second Primary Cancer in Lung Cancer Survivors with Privacy-preserving Approaches 性能与隐私:评估使用隐私保护方法预测肺癌幸存者第二原发性癌症的性能
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926935
Jui-Fu Hong, Y. Tseng
Deep learning has been widely used in the medical field to support medical decision making. Simultaneously, with the rise of data privacy protection, accessing clinical records across different institutions has become a possible challenge. Several approaches, such as federated and transfer learning, have been proposed to train models without accessing all the records from each institution, but the performance of these privacy-preserved models may not be as good as centralized approaches, which aggregate all records to build a centralized model. To explore the potential of privacy-preserving second primary cancer (SPC) prediction of lung cancer survivors using real-world data, we evaluated the performance of federated learning, transfer learning, learning with a single institution, and traditional centralized learning. We trained machine learning models using data from four hospitals and compared the model performances of learning from a single institution, centralized learning, federated learning, and transfer learning. The results show that federated learning outperformed other learning strategies in three of the four sites (AUROC from 0.733 to 0.777). However, only Site 6 showed that federated learning significantly outperformed all the other learning strategies (P < 0.05). In summary, federated learning can develop a unified model for the multiple institutions while maintaining data security.
深度学习已被广泛应用于医疗领域,以支持医疗决策。同时,随着数据隐私保护的兴起,访问不同机构的临床记录已成为一个可能的挑战。已经提出了几种方法,如联邦学习和迁移学习,在不访问每个机构的所有记录的情况下训练模型,但是这些保护隐私的模型的性能可能不如集中方法好,集中方法将所有记录聚集在一起以构建集中模型。为了探索使用真实世界数据对肺癌幸存者进行隐私保护的第二原发癌(SPC)预测的潜力,我们评估了联邦学习、迁移学习、单一机构学习和传统集中式学习的性能。我们使用来自四家医院的数据训练机器学习模型,并比较了从单一机构学习、集中学习、联合学习和迁移学习的模型性能。结果表明,联邦学习在四个站点中的三个站点上优于其他学习策略(AUROC从0.733到0.777)。然而,只有Site 6显示联邦学习显著优于其他所有学习策略(P < 0.05)。总之,联邦学习可以在维护数据安全的同时为多个机构开发统一的模型。
{"title":"Performance vs. Privacy: Evaluating the Performance of Predicting Second Primary Cancer in Lung Cancer Survivors with Privacy-preserving Approaches","authors":"Jui-Fu Hong, Y. Tseng","doi":"10.1109/BHI56158.2022.9926935","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926935","url":null,"abstract":"Deep learning has been widely used in the medical field to support medical decision making. Simultaneously, with the rise of data privacy protection, accessing clinical records across different institutions has become a possible challenge. Several approaches, such as federated and transfer learning, have been proposed to train models without accessing all the records from each institution, but the performance of these privacy-preserved models may not be as good as centralized approaches, which aggregate all records to build a centralized model. To explore the potential of privacy-preserving second primary cancer (SPC) prediction of lung cancer survivors using real-world data, we evaluated the performance of federated learning, transfer learning, learning with a single institution, and traditional centralized learning. We trained machine learning models using data from four hospitals and compared the model performances of learning from a single institution, centralized learning, federated learning, and transfer learning. The results show that federated learning outperformed other learning strategies in three of the four sites (AUROC from 0.733 to 0.777). However, only Site 6 showed that federated learning significantly outperformed all the other learning strategies (P < 0.05). In summary, federated learning can develop a unified model for the multiple institutions while maintaining data security.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"146 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131706749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
One-side Virtual Histological Staining Model for Complex Human Samples 复杂人体标本的单侧虚拟组织学染色模型
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926959
Lulin Shi, Ivy H. M. Wong, Claudia T. K. Lo, T. T. Wong
Virtual histological staining technique with a label-free auto-fluorescence image as an input is a challenging scientific pursuit to visualize complicated biological structures with distinct features. Recently, most of the related methods follow the two-side image translation architecture to get rid of paired data restriction, which is necessary for unprocessed and thick tissue virtual histological staining style transformation. However, the associated cycle consistency loss will inevitably lead to huge calculation consumption and cannot address the problem of incorrect translation among intracellular and extracellular components, which we termed as incorrect staining. In this paper, we propose a novel and computational-efficient one-side image translation framework to transfer unstained auto-fluorescence images into virtual hematoxylin- and eosin-stained counterparts for both thin and thick human samples. To address the incorrect nuclear staining issue, we design a region-classification loss to incorporate partial supervision information. Experimental data on both thin and thick human lung samples are used to demonstrate that our method is computationally efficient while achieving a comparable transformation performance over the two-side framework.
以无标记的自动荧光图像作为输入的虚拟组织学染色技术是一项具有挑战性的科学追求,以可视化具有不同特征的复杂生物结构。目前,相关方法大多采用双面图像平移架构,以摆脱成对数据的限制,这是未处理和厚组织虚拟组织学染色风格转换所必需的。然而,相关的周期一致性损失将不可避免地导致巨大的计算消耗,并且无法解决细胞内和细胞外成分之间不正确翻译的问题,我们称之为不正确染色。在本文中,我们提出了一种新的计算效率高的单面图像转换框架,将未染色的自动荧光图像转换为薄和厚的人体样本的虚拟苏木精和伊红染色的对应物。为了解决不正确的核染色问题,我们设计了一个区域分类损失来合并部分监督信息。在薄的和厚的人肺样本上的实验数据被用来证明我们的方法是计算效率高的,同时在两侧框架上实现了相当的转换性能。
{"title":"One-side Virtual Histological Staining Model for Complex Human Samples","authors":"Lulin Shi, Ivy H. M. Wong, Claudia T. K. Lo, T. T. Wong","doi":"10.1109/BHI56158.2022.9926959","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926959","url":null,"abstract":"Virtual histological staining technique with a label-free auto-fluorescence image as an input is a challenging scientific pursuit to visualize complicated biological structures with distinct features. Recently, most of the related methods follow the two-side image translation architecture to get rid of paired data restriction, which is necessary for unprocessed and thick tissue virtual histological staining style transformation. However, the associated cycle consistency loss will inevitably lead to huge calculation consumption and cannot address the problem of incorrect translation among intracellular and extracellular components, which we termed as incorrect staining. In this paper, we propose a novel and computational-efficient one-side image translation framework to transfer unstained auto-fluorescence images into virtual hematoxylin- and eosin-stained counterparts for both thin and thick human samples. To address the incorrect nuclear staining issue, we design a region-classification loss to incorporate partial supervision information. Experimental data on both thin and thick human lung samples are used to demonstrate that our method is computationally efficient while achieving a comparable transformation performance over the two-side framework.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126494948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Interpretability with Relevance Aggregation in Neural Networks for Absenteeism Prediction 基于关联聚合的神经网络可解释性缺勤预测
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926870
Julio Marcos Gomes Junior, Fabricio M. Lopes
The lack of attendance of employees is called absenteeism and occurs for various reasons, such as vigorous physical activity, advanced age and high psychological demands of the work. The absenteeism affects the direct and indirect costs of the companies, and may reach 15% of the payroll. Therefore, it is fundamental to know its main causes and contribute to control and mitigation strategies. Neural networks have been successfully applied in the classification of several problems, but they are black boxes, because they do not explain which aspects are considered in their decisions. This aspect is very important in health applications, in which it is necessary to explain and clearly interpret the results. In this context, this work presents an approach to classify absenteeism through neural networks and Layer-wise relevance propagation (LRP) aggregation in order to identify the most relevant features and to assign relevance scores individually per class and among all classes. The proposed approach was assessed by considering a dataset widely used as a benchmark and compared to the existing literature methods. The proposed approach presented the highest assertiveness rates among the compared methods, reaching an average accuracy of 0.83, identifying the most relevant features for the classification of absenteeism through a relevance score. Therefore, the results allow the interpretability of the causes of each class of absenteeism, which contribute to the management of human resources, occupational medicine and the development of strategies for its mitigation.
员工缺勤被称为旷工,旷工的原因多种多样,如体力活动剧烈、年龄较大、工作心理要求高等。旷工影响公司的直接和间接成本,可能达到工资的15%。因此,了解其主要原因并为控制和缓解战略作出贡献至关重要。神经网络已经成功地应用于几个问题的分类,但它们是黑盒子,因为它们不能解释在决策中考虑哪些方面。这方面在健康应用中非常重要,在这方面有必要解释和清楚地解释结果。在此背景下,本研究提出了一种通过神经网络和分层相关传播(LRP)聚合对旷工进行分类的方法,以识别最相关的特征,并为每个班级和所有班级单独分配相关分数。通过将广泛使用的数据集作为基准,并与现有文献方法进行比较,对所提出的方法进行了评估。所提出的方法在比较的方法中呈现出最高的自信率,达到0.83的平均准确率,通过相关性评分识别出与缺勤分类最相关的特征。因此,调查结果可以解释每一类缺勤的原因,这有助于人力资源管理、职业医学和制定减轻缺勤的战略。
{"title":"Interpretability with Relevance Aggregation in Neural Networks for Absenteeism Prediction","authors":"Julio Marcos Gomes Junior, Fabricio M. Lopes","doi":"10.1109/BHI56158.2022.9926870","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926870","url":null,"abstract":"The lack of attendance of employees is called absenteeism and occurs for various reasons, such as vigorous physical activity, advanced age and high psychological demands of the work. The absenteeism affects the direct and indirect costs of the companies, and may reach 15% of the payroll. Therefore, it is fundamental to know its main causes and contribute to control and mitigation strategies. Neural networks have been successfully applied in the classification of several problems, but they are black boxes, because they do not explain which aspects are considered in their decisions. This aspect is very important in health applications, in which it is necessary to explain and clearly interpret the results. In this context, this work presents an approach to classify absenteeism through neural networks and Layer-wise relevance propagation (LRP) aggregation in order to identify the most relevant features and to assign relevance scores individually per class and among all classes. The proposed approach was assessed by considering a dataset widely used as a benchmark and compared to the existing literature methods. The proposed approach presented the highest assertiveness rates among the compared methods, reaching an average accuracy of 0.83, identifying the most relevant features for the classification of absenteeism through a relevance score. Therefore, the results allow the interpretability of the causes of each class of absenteeism, which contribute to the management of human resources, occupational medicine and the development of strategies for its mitigation.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"107 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134278325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of Video Capsule Endoscopy Images Using Visual Transformers 视频胶囊内窥镜图像的视觉变换分类
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926791
Daniel Lopes Soares Lima, A. Pessoa, A. C. D. Paiva, António Cunha, Geraldo Braz Júnior, J. Almeida
Cancers related to the gastrointestinal tract have a high incidence rate in the population, with a high mortality rate. Videos obtained through endoscopic capsules are essential for evaluating anomalies that can progress to cancer. However, due to their duration, which can reach 10 hours, they demand great attention from the medical specialist in their analysis. Machine learning techniques have been successfully applied in developing computer-aided diagnostic systems since the 1990s, where Convolutional Neural Networks (CNNs) have become very successful for pattern recognition in images. CNNs use convolutions to extract features from the analyzed data, operating in a fixed-size window and thus having problems capturing pixel-level relationships considering the spatial and temporal domains. Otherwise, transformers use attention mechanisms, where data is structured in a vector space that can aggregate information from adjacent data to determine meaning in a given context. This work proposes a computational method for analyzing images extracted from videos obtained by endoscopic capsules, using a transformer-based model that helps diagnose of gastrointestinal tract abnormalities. Preliminary results are promising. The classification task of 11 classes evaluated on the publicly available Kvasir-Capsule dataset yielded an average value of 99.70% of accuracy, 99.64% of precision, 99.86% of sensitivity, and 99.54% of f1-score.
胃肠道相关癌症在人群中发病率高,死亡率高。通过内窥镜胶囊获得的视频对于评估可能发展为癌症的异常是必不可少的。然而,由于其持续时间可达10个小时,因此需要医学专家在分析时给予高度关注。自20世纪90年代以来,机器学习技术已经成功地应用于开发计算机辅助诊断系统,其中卷积神经网络(cnn)在图像模式识别方面非常成功。cnn使用卷积从分析数据中提取特征,在固定大小的窗口中操作,因此考虑到空间和时间域,在捕获像素级关系方面存在问题。否则,转换器使用注意机制,其中数据在向量空间中结构化,可以聚合来自相邻数据的信息以确定给定上下文中的含义。这项工作提出了一种计算方法,用于分析从内窥镜胶囊获得的视频中提取的图像,使用基于变压器的模型来帮助诊断胃肠道异常。初步结果令人鼓舞。在公开可用的Kvasir-Capsule数据集上评估的11个类别的分类任务的平均准确度为99.70%,精密度为99.64%,灵敏度为99.86%,f1-score为99.54%。
{"title":"Classification of Video Capsule Endoscopy Images Using Visual Transformers","authors":"Daniel Lopes Soares Lima, A. Pessoa, A. C. D. Paiva, António Cunha, Geraldo Braz Júnior, J. Almeida","doi":"10.1109/BHI56158.2022.9926791","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926791","url":null,"abstract":"Cancers related to the gastrointestinal tract have a high incidence rate in the population, with a high mortality rate. Videos obtained through endoscopic capsules are essential for evaluating anomalies that can progress to cancer. However, due to their duration, which can reach 10 hours, they demand great attention from the medical specialist in their analysis. Machine learning techniques have been successfully applied in developing computer-aided diagnostic systems since the 1990s, where Convolutional Neural Networks (CNNs) have become very successful for pattern recognition in images. CNNs use convolutions to extract features from the analyzed data, operating in a fixed-size window and thus having problems capturing pixel-level relationships considering the spatial and temporal domains. Otherwise, transformers use attention mechanisms, where data is structured in a vector space that can aggregate information from adjacent data to determine meaning in a given context. This work proposes a computational method for analyzing images extracted from videos obtained by endoscopic capsules, using a transformer-based model that helps diagnose of gastrointestinal tract abnormalities. Preliminary results are promising. The classification task of 11 classes evaluated on the publicly available Kvasir-Capsule dataset yielded an average value of 99.70% of accuracy, 99.64% of precision, 99.86% of sensitivity, and 99.54% of f1-score.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128764777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Continuous Human Activity Recognition and Step-Time Variability Analysis with FMCW Radar 基于FMCW雷达的连续人体活动识别与步长变异性分析
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926892
S. Gurbuz, Mohammad Mahbubur Rahman, Emre Kurtoğlu, D. Martelli
Human activity recognition (HAR) and gait analysis are important functions that support aging-in-place and remote health monitoring. Although there have been many works investigating HAR with radar based on single-activity snapshots in time, few works address recognition in continuous streams of radio frequency (RF) data, where in daily life many different activities are conducted. This work proposes a novel variable window averaging method to segment RF data streams containing a mixture of large-scale gross motor activities as well as fine-grain hand gestures, a physics-aware generative adversarial network (PhGAN) to recognize daily activities, and a new technique to estimate step-time variability from RF data. Our results show that extraction of motion detected intervals and GAN-synthesized samples during training boosts HAR accuracy, while the estimation variance of time-step variability from radar compares well with that obtained from a Vicon motion capture system.
人体活动识别(HAR)和步态分析是支持原地衰老和远程健康监测的重要功能。虽然有很多研究工作是利用基于单活动快照的雷达来研究HAR,但很少有研究工作是在连续的射频(RF)数据流中进行识别的,而在日常生活中,射频数据流中进行了许多不同的活动。这项工作提出了一种新的可变窗口平均方法来分割包含大规模大动作活动和细粒度手势的RF数据流,一种物理感知生成对抗网络(PhGAN)来识别日常活动,以及一种新的技术来估计RF数据的步长变异性。我们的研究结果表明,在训练过程中提取运动检测间隔和gan合成样本提高了HAR精度,而雷达的时间步变率估计方差与Vicon运动捕捉系统的估计方差相当。
{"title":"Continuous Human Activity Recognition and Step-Time Variability Analysis with FMCW Radar","authors":"S. Gurbuz, Mohammad Mahbubur Rahman, Emre Kurtoğlu, D. Martelli","doi":"10.1109/BHI56158.2022.9926892","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926892","url":null,"abstract":"Human activity recognition (HAR) and gait analysis are important functions that support aging-in-place and remote health monitoring. Although there have been many works investigating HAR with radar based on single-activity snapshots in time, few works address recognition in continuous streams of radio frequency (RF) data, where in daily life many different activities are conducted. This work proposes a novel variable window averaging method to segment RF data streams containing a mixture of large-scale gross motor activities as well as fine-grain hand gestures, a physics-aware generative adversarial network (PhGAN) to recognize daily activities, and a new technique to estimate step-time variability from RF data. Our results show that extraction of motion detected intervals and GAN-synthesized samples during training boosts HAR accuracy, while the estimation variance of time-step variability from radar compares well with that obtained from a Vicon motion capture system.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134123724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Modeling of Plaque Progression in the Carotid Artery Using Coupled Agent Based with Finite Element Method 基于有限元法的耦合剂对颈动脉斑块进展的建模
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926817
N. Filipovic, Smiljana Tomasevic, Andjela Blagojević, Branko Arsić, Miloš Anić, T. Djukić
In study, we presented a new computational model for atheromatic plaque growth progression in the carotid artery using specialized mathematical models and computational simulations which will enable the accurate prediction of the cardiovascular disease evolution. The simulated model with coupled Agent Based Method (ABM) and Finite Element Method (FEM) has been presented. The ABM was coupled with an initial WSS profile, which triggers a pathologic vascular remodeling by perturbing the baseline cellular activity and favoring lipid infiltration and accumulation within the arterial wall. The ABM model takes shear stress and LDL initial distribution from the lumen and starts iterative calculation inside the wall for lipid infiltration and accumulation using a random number generator for each time step. After ABM iterations, both wall lipid distribution and wall geometry are changed. This directly influences the wall artery geometry which is also modeled with finite element, with ABM elements inside these large finite elements. Then, fluid-structure solver is running and lumen domain is calculated again. The change of the shape of the cross-sections of the arterial wall is shown in three specific moments in time (baseline, after 3 months and after 6 months). One main pros of this new approach are the use of realistic 3D reconstructed artery providing in this way a more realistic, patient-specific simulation of plaque progression.
在研究中,我们利用专门的数学模型和计算模拟,提出了一种新的颈动脉粥样硬化斑块生长进展的计算模型,可以准确预测心血管疾病的演变。提出了基于Agent的方法和有限元法的耦合仿真模型。ABM与最初的WSS相结合,通过扰乱基线细胞活性和促进动脉壁内脂质浸润和积聚,引发病理性血管重塑。ABM模型从管腔获取剪切应力和LDL的初始分布,使用随机数生成器对每个时间步进行壁内脂质浸润和积累的迭代计算。经过ABM迭代后,壁脂分布和壁的几何形状都发生了变化。这直接影响了壁动脉的几何形状,这也是用有限元建模的,在这些大的有限元中有ABM元素。然后运行流固求解器,重新计算流腔域。动脉壁横截面形状的变化表现在三个特定时刻(基线、3个月后和6个月后)。这种新方法的一个主要优点是使用真实的3D重建动脉,以这种方式提供更真实的、针对患者的斑块进展模拟。
{"title":"Modeling of Plaque Progression in the Carotid Artery Using Coupled Agent Based with Finite Element Method","authors":"N. Filipovic, Smiljana Tomasevic, Andjela Blagojević, Branko Arsić, Miloš Anić, T. Djukić","doi":"10.1109/BHI56158.2022.9926817","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926817","url":null,"abstract":"In study, we presented a new computational model for atheromatic plaque growth progression in the carotid artery using specialized mathematical models and computational simulations which will enable the accurate prediction of the cardiovascular disease evolution. The simulated model with coupled Agent Based Method (ABM) and Finite Element Method (FEM) has been presented. The ABM was coupled with an initial WSS profile, which triggers a pathologic vascular remodeling by perturbing the baseline cellular activity and favoring lipid infiltration and accumulation within the arterial wall. The ABM model takes shear stress and LDL initial distribution from the lumen and starts iterative calculation inside the wall for lipid infiltration and accumulation using a random number generator for each time step. After ABM iterations, both wall lipid distribution and wall geometry are changed. This directly influences the wall artery geometry which is also modeled with finite element, with ABM elements inside these large finite elements. Then, fluid-structure solver is running and lumen domain is calculated again. The change of the shape of the cross-sections of the arterial wall is shown in three specific moments in time (baseline, after 3 months and after 6 months). One main pros of this new approach are the use of realistic 3D reconstructed artery providing in this way a more realistic, patient-specific simulation of plaque progression.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126520933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Localisation Study of Deep Learning Models for Chest X-ray Image Classification 胸部x线图像分类深度学习模型的局部化研究
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926904
James Gascoigne-Burns, Stamos Katsigiannis
Deep learning models have demonstrated superhuman performance in a multitude of image classification tasks, including the classification of chest X-ray images. Despite this, medical professionals are reluctant to embrace these models in clinical settings due to a lack of interpretability, citing being able to visualise the image areas contributing most to a model's predictions as one of the best ways to establish trust. To aid the discussion of their suitability for real-world use, in this work, we attempt to address this issue by conducting a localisation study of two state-of-the-art deep learning models for chest X-ray image classification, ResNet-38-large-meta and CheXNet, on a set of 984 radiologist annotated X-ray images from the publicly available ChestX-ray14 dataset. We do this by applying and comparing several state-of-the-art visualisation methods, combined with a novel dynamic thresholding approach for generating bounding boxes, which we show to outperform the static thresholding method used by similar localisation studies in the literature. Results also seem to indicate that localisation quality is more sensitive to the choice of thresholding scheme than the visualisation method used, and that a high discriminative ability as measured by classification performance is not necessarily sufficient for models to produce useful and accurate localisations.
深度学习模型在许多图像分类任务中表现出了超人的性能,包括胸部x射线图像的分类。尽管如此,由于缺乏可解释性,医学专业人员不愿意在临床环境中采用这些模型,理由是能够可视化对模型预测贡献最大的图像区域,这是建立信任的最佳方式之一。为了帮助讨论它们在现实世界中的适用性,在这项工作中,我们试图通过对两种最先进的胸部x射线图像分类深度学习模型resnet -38-大元模型和CheXNet进行本地化研究来解决这个问题,这些模型来自公开可用的ChestX-ray14数据集,包含984张放射科医生注释的x射线图像。我们通过应用和比较几种最先进的可视化方法,结合一种用于生成边界框的新型动态阈值方法来实现这一点,我们证明该方法优于文献中类似定位研究使用的静态阈值方法。结果似乎还表明,定位质量对阈值方案的选择比所使用的可视化方法更敏感,并且通过分类性能衡量的高判别能力并不一定足以使模型产生有用和准确的定位。
{"title":"A Localisation Study of Deep Learning Models for Chest X-ray Image Classification","authors":"James Gascoigne-Burns, Stamos Katsigiannis","doi":"10.1109/BHI56158.2022.9926904","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926904","url":null,"abstract":"Deep learning models have demonstrated superhuman performance in a multitude of image classification tasks, including the classification of chest X-ray images. Despite this, medical professionals are reluctant to embrace these models in clinical settings due to a lack of interpretability, citing being able to visualise the image areas contributing most to a model's predictions as one of the best ways to establish trust. To aid the discussion of their suitability for real-world use, in this work, we attempt to address this issue by conducting a localisation study of two state-of-the-art deep learning models for chest X-ray image classification, ResNet-38-large-meta and CheXNet, on a set of 984 radiologist annotated X-ray images from the publicly available ChestX-ray14 dataset. We do this by applying and comparing several state-of-the-art visualisation methods, combined with a novel dynamic thresholding approach for generating bounding boxes, which we show to outperform the static thresholding method used by similar localisation studies in the literature. Results also seem to indicate that localisation quality is more sensitive to the choice of thresholding scheme than the visualisation method used, and that a high discriminative ability as measured by classification performance is not necessarily sufficient for models to produce useful and accurate localisations.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132822163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Class Activation Maps for the disentanglement and occlusion of identity attributes in medical imagery 分类激活图用于医学图像中身份属性的解纠缠和闭塞
Pub Date : 2022-09-27 DOI: 10.1109/BHI56158.2022.9926856
Laura Carolina Martínez Esmeral, A. Uhl
Deriving patients' identity from medical imagery threatens privacy, as these data are acquired to support diagnosis but not to reveal identity-related features. Still, for many medical imaging modalities, such identity breaches have been reported. To cope with this, some de-identification methods based on the generation of synthetic data have been explored in the literature. However, in this paper, we try to perform, instead, an occlusion of the personal identifiers directly on the data by means of Class Activation Maps, in such a way that diagnosis related features do not get altered.
从医学图像中获取患者的身份会威胁到隐私,因为这些数据是为了支持诊断而获得的,而不是为了揭示与身份相关的特征。尽管如此,对于许多医学成像模式,此类身份泄露已被报道。为了解决这个问题,文献中已经探索了一些基于合成数据生成的去识别方法。然而,在本文中,我们试图通过类激活图直接在数据上执行个人标识符的遮挡,这样与诊断相关的特征就不会被改变。
{"title":"Class Activation Maps for the disentanglement and occlusion of identity attributes in medical imagery","authors":"Laura Carolina Martínez Esmeral, A. Uhl","doi":"10.1109/BHI56158.2022.9926856","DOIUrl":"https://doi.org/10.1109/BHI56158.2022.9926856","url":null,"abstract":"Deriving patients' identity from medical imagery threatens privacy, as these data are acquired to support diagnosis but not to reveal identity-related features. Still, for many medical imaging modalities, such identity breaches have been reported. To cope with this, some de-identification methods based on the generation of synthetic data have been explored in the literature. However, in this paper, we try to perform, instead, an occlusion of the personal identifiers directly on the data by means of Class Activation Maps, in such a way that diagnosis related features do not get altered.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127712059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1