首页 > 最新文献

2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)最新文献

英文 中文
Four potential prognostic markers for breast cancer identified by hybrid gene and module expression analysis 通过杂交基因和模块表达分析确定乳腺癌的四个潜在预后标志物
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.10113358
Lin Xi, Xiangyang Yuan, Jing Liu, X. Tang
With the aim of screening the prognostic genes for breast cancer (BRCA) and exploring the possible mechanism and clinical value of these genes in the growth and regression stage of disease, we study the genes in the public gene expression omnibus (GEO) GSE22820 and the cancer genome atlas (TCGA). To achieve high-confidence gene candidates for BRCA, we present a hybrid gene and module analysis pipeline that strategically considers data mining on different datasets. Ultimately, four gene candidates, i.e., PLIN1, GPD1, LIPE, and CHRDL1, are targeted for BRCA. Afterwards, Kaplan-Meier survival analysis is performed on these genes for verification, revealing that the overall survival time of patients with low expression of these genes was shorter than that of patients with high expression (with P<0.05). Moreover, in order to study the role of these genes in the mechanisms and functionality related to cytoplasmic lipid metabolism, functional enrichment and pathway analysis are implemented. The results indicate that the expression of the four discovered genes plays an adverse role in BRCA development and could serve as effective biomarkers for predicting the formation and progression of BRCA.
为了筛选乳腺癌(BRCA)的预后基因,探讨这些基因在疾病生长和消退阶段的可能机制和临床价值,我们研究了公共基因表达图谱(GEO) GSE22820和癌症基因组图谱(TCGA)中的基因。为了获得高可信度的BRCA候选基因,我们提出了一种混合基因和模块分析管道,该管道战略性地考虑了对不同数据集的数据挖掘。最终,四个候选基因PLIN1、GPD1、LIPE和CHRDL1成为BRCA的靶点。随后对这些基因进行Kaplan-Meier生存分析验证,发现这些基因低表达患者的总生存时间短于高表达患者(P<0.05)。此外,为了研究这些基因在细胞质脂代谢相关机制和功能中的作用,我们进行了功能富集和途径分析。结果表明,这四个基因的表达在BRCA的发生发展中起着不利的作用,可以作为预测BRCA形成和进展的有效生物标志物。
{"title":"Four potential prognostic markers for breast cancer identified by hybrid gene and module expression analysis","authors":"Lin Xi, Xiangyang Yuan, Jing Liu, X. Tang","doi":"10.1109/BIBM55620.2022.10113358","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.10113358","url":null,"abstract":"With the aim of screening the prognostic genes for breast cancer (BRCA) and exploring the possible mechanism and clinical value of these genes in the growth and regression stage of disease, we study the genes in the public gene expression omnibus (GEO) GSE22820 and the cancer genome atlas (TCGA). To achieve high-confidence gene candidates for BRCA, we present a hybrid gene and module analysis pipeline that strategically considers data mining on different datasets. Ultimately, four gene candidates, i.e., PLIN1, GPD1, LIPE, and CHRDL1, are targeted for BRCA. Afterwards, Kaplan-Meier survival analysis is performed on these genes for verification, revealing that the overall survival time of patients with low expression of these genes was shorter than that of patients with high expression (with P<0.05). Moreover, in order to study the role of these genes in the mechanisms and functionality related to cytoplasmic lipid metabolism, functional enrichment and pathway analysis are implemented. The results indicate that the expression of the four discovered genes plays an adverse role in BRCA development and could serve as effective biomarkers for predicting the formation and progression of BRCA.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129872611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gene Ontology based protein functional annotation using pretrained embeddings 基于基因本体的预训练嵌入蛋白功能标注
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995108
Thi Thuy Duong Vu, Jaehee Jung
The Gene Ontology (GO) database contains approximately 40,000 classes of terms arranged in a hierarchical relationship. These terms mainly define protein functions and are used in bioinformatics to automatically predict protein functions using their sequences. Recently, several models have been studied, such as ProtBert and ProteinBERT, which predict protein functions by fine-tuning a pretrained model of the nucleotide sequence using a self-supervised deep method. We proposed two models to predict GO using protein features extracted by the ProtBert model to annotate proteins with their GO terms. Additionally, we customized the ProteinBERT model and fine-tuned it to predict GO terms. The experiment showed that protein embeddings created using pretrained transformer models can be used as a source of data for tasks involving sequence prediction, with a focus on protein functions. The suggested models allow flexible sequence lengths and provide improved performance compared to other comparison methods.
基因本体(GO)数据库包含大约40,000类按层次关系排列的术语。这些术语主要定义蛋白质的功能,并在生物信息学中使用它们的序列来自动预测蛋白质的功能。最近,人们研究了ProtBert和ProteinBERT等模型,它们通过使用自监督深度方法对核苷酸序列的预训练模型进行微调来预测蛋白质功能。我们提出了两种预测氧化石墨烯的模型,使用ProtBert模型提取的蛋白质特征来用它们的氧化石墨烯术语注释蛋白质。此外,我们定制了ProteinBERT模型,并对其进行了微调,以预测GO术语。实验表明,使用预训练的变压器模型创建的蛋白质嵌入可以用作涉及序列预测的任务的数据来源,重点是蛋白质功能。与其他比较方法相比,建议的模型允许灵活的序列长度,并提供更好的性能。
{"title":"Gene Ontology based protein functional annotation using pretrained embeddings","authors":"Thi Thuy Duong Vu, Jaehee Jung","doi":"10.1109/BIBM55620.2022.9995108","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995108","url":null,"abstract":"The Gene Ontology (GO) database contains approximately 40,000 classes of terms arranged in a hierarchical relationship. These terms mainly define protein functions and are used in bioinformatics to automatically predict protein functions using their sequences. Recently, several models have been studied, such as ProtBert and ProteinBERT, which predict protein functions by fine-tuning a pretrained model of the nucleotide sequence using a self-supervised deep method. We proposed two models to predict GO using protein features extracted by the ProtBert model to annotate proteins with their GO terms. Additionally, we customized the ProteinBERT model and fine-tuned it to predict GO terms. The experiment showed that protein embeddings created using pretrained transformer models can be used as a source of data for tasks involving sequence prediction, with a focus on protein functions. The suggested models allow flexible sequence lengths and provide improved performance compared to other comparison methods.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128311954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Regulation Networks of Chinese Medicines Against Rheumatoid Arthritis with Syndrome of Deficiency of Liver and Kidney 类风湿性关节炎肝肾虚证的中药调控网络研究
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995565
Guang Zheng, Ye Lv, Junmei Zhao, Hongtao Guo
In China, the integration of traditional Chinese medicine (TCM) and western medicine against rheumatoid arthritis (RA) delivers both low-level inflammation indexes and less side effects. Thus, exploring the mechanism of TCM against RA might help to explore the pathology of RA. As the diagnosis of TCM is based on syndrome differentiation and deficiency of liver and kidney is the leading one, then, explore associated Chinese medicines against RA may shed light on the therapeutic regulation network. In this study, network pharmacology analysis was carried out based on bioactive compounds, targeted proteins and protein interactions towards RA. As a result, the regulation network against RA delivered by the top five Chinese medicines was constructed. Further bioinformatics analysis on participant genes not only elucidate the relationships to RA and immune response, but also the reduced side effects e.g. osteoporosis. Validation of the therapeutic effect on RA patients was done via check indexes on C-reactive protein and erythrocyte sedimentation rate. Potential effects of the delivered regulation network were demonstrated with heatmap on the microarray data of RA synovium tissue.
在中国,中西医结合治疗类风湿性关节炎(RA)既能降低炎症指数,又能减少副作用。因此,探讨中药抗RA的机制可能有助于探究RA的病理机制。中医诊断以辨证论治为主,以肝肾虚证为主,探索相关中药治疗类风湿性关节炎可能有助于揭示其治疗调控网络。本研究基于RA的生物活性化合物、靶向蛋白和蛋白相互作用进行网络药理学分析。从而构建了五大中药对RA的调控网络。对参与基因的进一步生物信息学分析不仅阐明了与RA和免疫反应的关系,而且还减少了骨质疏松等副作用。通过检查c反应蛋白和红细胞沉降率等指标来验证治疗RA的疗效。通过RA滑膜组织微阵列数据的热图显示了传递的调节网络的潜在影响。
{"title":"The Regulation Networks of Chinese Medicines Against Rheumatoid Arthritis with Syndrome of Deficiency of Liver and Kidney","authors":"Guang Zheng, Ye Lv, Junmei Zhao, Hongtao Guo","doi":"10.1109/BIBM55620.2022.9995565","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995565","url":null,"abstract":"In China, the integration of traditional Chinese medicine (TCM) and western medicine against rheumatoid arthritis (RA) delivers both low-level inflammation indexes and less side effects. Thus, exploring the mechanism of TCM against RA might help to explore the pathology of RA. As the diagnosis of TCM is based on syndrome differentiation and deficiency of liver and kidney is the leading one, then, explore associated Chinese medicines against RA may shed light on the therapeutic regulation network. In this study, network pharmacology analysis was carried out based on bioactive compounds, targeted proteins and protein interactions towards RA. As a result, the regulation network against RA delivered by the top five Chinese medicines was constructed. Further bioinformatics analysis on participant genes not only elucidate the relationships to RA and immune response, but also the reduced side effects e.g. osteoporosis. Validation of the therapeutic effect on RA patients was done via check indexes on C-reactive protein and erythrocyte sedimentation rate. Potential effects of the delivered regulation network were demonstrated with heatmap on the microarray data of RA synovium tissue.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128389814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AIAT: Adaptive Iteration Adversarial Training for Robust Pulmonary Nodule Detection 稳健肺结节检测的自适应迭代对抗训练
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995043
Guoxing Yang, Xiaohong Liu, Jianyu Shi, Xianchao Zhang, Guangyu Wang
Lung cancer is one of the leading causes of death worldwide. Early diagnosis through cancer screening can significantly improve lung cancer patients’ survival. Recently, deep learning based diagnostic systems for nodule detection have shown great potential in assisting radiologists to screen cancer more efficiently. However, studies have found that deep learning models lack robustness against imperceptible crafted adversarial attacks and few studied improving the robustness of pulmonary nodule detection. Therefore, making pulmonary nodule detection models robust remains challenges. Moreover, traditional adversarial training methods either hurt the natural generalization or need expensive computational cost. To address these challenges, here we propose a novel adversarial training method called, Adaptive Iteration Adversarial Training (AIAT). AIAT generates adversarial samples by adding adversarial noise with an adaptive iteration strategy, so that it can stably and fast train models with improving robustness. Extensive experiments on the LUNA 16 dataset show that AIAT improves robustness for pulmonary nodule detection without compromising the natural generalization, and largely reduces training time.
肺癌是世界范围内导致死亡的主要原因之一。通过癌症筛查进行早期诊断可以显著提高肺癌患者的生存率。最近,基于深度学习的结节检测诊断系统在帮助放射科医生更有效地筛查癌症方面显示出巨大的潜力。然而,研究发现深度学习模型对难以察觉的精心制作的对抗性攻击缺乏鲁棒性,并且很少有研究提高肺结节检测的鲁棒性。因此,使肺结节检测模型具有鲁棒性仍然是一个挑战。此外,传统的对抗性训练方法要么损害自然泛化,要么需要昂贵的计算成本。为了解决这些挑战,我们提出了一种新的对抗训练方法,称为自适应迭代对抗训练(AIAT)。AIAT采用自适应迭代策略,通过加入对抗噪声来生成对抗样本,从而在提高鲁棒性的同时稳定快速地训练模型。在LUNA 16数据集上的大量实验表明,AIAT在不影响自然泛化的情况下提高了肺结节检测的鲁棒性,并大大减少了训练时间。
{"title":"AIAT: Adaptive Iteration Adversarial Training for Robust Pulmonary Nodule Detection","authors":"Guoxing Yang, Xiaohong Liu, Jianyu Shi, Xianchao Zhang, Guangyu Wang","doi":"10.1109/BIBM55620.2022.9995043","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995043","url":null,"abstract":"Lung cancer is one of the leading causes of death worldwide. Early diagnosis through cancer screening can significantly improve lung cancer patients’ survival. Recently, deep learning based diagnostic systems for nodule detection have shown great potential in assisting radiologists to screen cancer more efficiently. However, studies have found that deep learning models lack robustness against imperceptible crafted adversarial attacks and few studied improving the robustness of pulmonary nodule detection. Therefore, making pulmonary nodule detection models robust remains challenges. Moreover, traditional adversarial training methods either hurt the natural generalization or need expensive computational cost. To address these challenges, here we propose a novel adversarial training method called, Adaptive Iteration Adversarial Training (AIAT). AIAT generates adversarial samples by adding adversarial noise with an adaptive iteration strategy, so that it can stably and fast train models with improving robustness. Extensive experiments on the LUNA 16 dataset show that AIAT improves robustness for pulmonary nodule detection without compromising the natural generalization, and largely reduces training time.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128230364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Context-Guided Attention Method for Integrating Features of Histopathological Patches 一种整合组织病理斑块特征的上下文引导注意方法
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995300
Yuqi Chen, Juan Liu, Peng Jiang, Jing Feng, Dehua Cao, Baochuan Pang
Lots of researchers have studied for classifying histopathological whole slide images (WSIs). Since a WSI is too large to be processed directly, researchers usually cut it into many small-sized patches and then integrate the discriminative features extracted from the patches to obtain a slide-level feature of the WSI. The integration strategy generating the slide-level features is crucial for the WSI classification model. Lots of attention-based methods have been proposed for such purpose. However, most attention-based methods do not take the patches relationship into consideration, which affects the classification performance of the models. In this work, we propose a novel Context-Guided attention (CGattention) method to integrate the patch-level features, which constructs a context vector to simulate the global context information of the whole WSI and implicitly characterizes the relationship between patches in the WSI. When evaluated on two publicly available datasets, the CGattention based model obtained the better performance than other attention-based models.
许多研究者对组织病理学全切片图像(wsi)的分类进行了研究。由于WSI太大而无法直接处理,研究人员通常将其切割成许多小块,然后将从这些小块中提取的判别特征进行整合,从而获得WSI的滑动级特征。生成滑动级特征的集成策略对WSI分类模型至关重要。为此,人们提出了许多基于注意力的方法。然而,大多数基于注意力的方法没有考虑补丁关系,从而影响了模型的分类性能。在这项工作中,我们提出了一种新的上下文引导注意力(CGattention)方法来整合补丁级特征,该方法构建了一个上下文向量来模拟整个WSI的全局上下文信息,并隐式地表征了WSI中补丁之间的关系。当在两个公开的数据集上进行评估时,基于CGattention的模型比其他基于attention的模型获得了更好的性能。
{"title":"A Context-Guided Attention Method for Integrating Features of Histopathological Patches","authors":"Yuqi Chen, Juan Liu, Peng Jiang, Jing Feng, Dehua Cao, Baochuan Pang","doi":"10.1109/BIBM55620.2022.9995300","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995300","url":null,"abstract":"Lots of researchers have studied for classifying histopathological whole slide images (WSIs). Since a WSI is too large to be processed directly, researchers usually cut it into many small-sized patches and then integrate the discriminative features extracted from the patches to obtain a slide-level feature of the WSI. The integration strategy generating the slide-level features is crucial for the WSI classification model. Lots of attention-based methods have been proposed for such purpose. However, most attention-based methods do not take the patches relationship into consideration, which affects the classification performance of the models. In this work, we propose a novel Context-Guided attention (CGattention) method to integrate the patch-level features, which constructs a context vector to simulate the global context information of the whole WSI and implicitly characterizes the relationship between patches in the WSI. When evaluated on two publicly available datasets, the CGattention based model obtained the better performance than other attention-based models.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129943795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
iCircDA-ENR: identification of circRNA-disease associations based on ensemble network representation iCircDA-ENR:基于集合网络表示的circrna -疾病关联识别
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995330
Hang Wei, Xiayue Fan, Shuai Wu
Circular RNAs (circRNAs) are severing as important regulators for various physiological and pathological life activities. Identifying associations between circRNAs and diseases can help uncover the disease mechanism, and promote the diagnosis and treatment of human diseases. To provide assisting guidance and optimize biological experiments, some computational methods have been proposed to predict circRNA-disease associations. However, most predictors focus on identifying missing associations for known circRNA and diseases. It is still challenging to effectively detect potential circRNA-disease association pattern because of their limited generation ability and insufficient pair representation. In this regard, we propose a novel computational method named iCircDA-ENR for identifying circRNA-disease associations based on ensemble network representation. Different from other predictors, iCircDA-ENR is a ranking method. Multiple biological information and meta-paths are introduced to construct heterogeneous relation network, and then different network representation algorithms are incorporated into ranking framework to capture informative network features. The learned ranking predictor prioritizes the candidate diseases for query circRNAs according to their relevance degree. Experimental results illustrate that iCircDA-ENR achieves better performance and wider applicability, benefited from its sufficient representation and effective learning.
环状rna (circRNAs)是多种生理和病理生命活动的重要调控因子。识别circrna与疾病之间的关联有助于揭示疾病机制,促进人类疾病的诊断和治疗。为了提供辅助指导和优化生物学实验,已经提出了一些计算方法来预测circrna与疾病的关联。然而,大多数预测因子侧重于确定已知circRNA与疾病之间缺失的关联。由于circrna的生成能力有限,且对表征不充分,因此有效检测潜在的circrna -疾病关联模式仍然具有挑战性。在这方面,我们提出了一种新的计算方法,名为iCircDA-ENR,用于识别基于集成网络表示的circrna -疾病关联。与其他预测方法不同,iCircDA-ENR是一种排序方法。引入多种生物信息和元路径构建异构关系网络,并在排序框架中引入不同的网络表示算法来捕获信息网络特征。学习的排序预测器根据候选疾病的相关度对查询circrna进行优先排序。实验结果表明,iCircDA-ENR具有充分的表征和有效的学习能力,具有更好的性能和更广泛的适用性。
{"title":"iCircDA-ENR: identification of circRNA-disease associations based on ensemble network representation","authors":"Hang Wei, Xiayue Fan, Shuai Wu","doi":"10.1109/BIBM55620.2022.9995330","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995330","url":null,"abstract":"Circular RNAs (circRNAs) are severing as important regulators for various physiological and pathological life activities. Identifying associations between circRNAs and diseases can help uncover the disease mechanism, and promote the diagnosis and treatment of human diseases. To provide assisting guidance and optimize biological experiments, some computational methods have been proposed to predict circRNA-disease associations. However, most predictors focus on identifying missing associations for known circRNA and diseases. It is still challenging to effectively detect potential circRNA-disease association pattern because of their limited generation ability and insufficient pair representation. In this regard, we propose a novel computational method named iCircDA-ENR for identifying circRNA-disease associations based on ensemble network representation. Different from other predictors, iCircDA-ENR is a ranking method. Multiple biological information and meta-paths are introduced to construct heterogeneous relation network, and then different network representation algorithms are incorporated into ranking framework to capture informative network features. The learned ranking predictor prioritizes the candidate diseases for query circRNAs according to their relevance degree. Experimental results illustrate that iCircDA-ENR achieves better performance and wider applicability, benefited from its sufficient representation and effective learning.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130127720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multidimensional Feature Extraction Method Based on MSTBN and EEMD-WPT for Emotion Recognition from EEG Signals 基于MSTBN和EEMD-WPT的脑电信号情感识别多维特征提取方法
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995251
Shilin Zhang, Qingcheng Zhang
Emotion recognition is an important component of human-computer interaction (HCI) systems. However, current emotion recognition methods have some drawbacks such as inconsistency in brain network size, lack of effective mining of features in different dimensions. In this paper, we propose a multidimensional feature extraction method based on MSTBN and EEMD-WPT for emotion recognition. Firstly, the wavelet packet transform (WPT) is utilized to decompose the pre-processed electroencephalography (EEG) signals into four frequency bands ($theta,alpha,beta$, and $gamma$), and phase locking value (PLV) is used to construct multi-band connectivity matrix. Secondly, to remove redundant information, the minimum spanning tree based brain network (MSTBN) is established and MSTBN features are extracted including global features and local features. Thirdly, ensemble empirical mode decomposition (EEMD) and WPT (EEMD-WPT) are applied to EEG signals for a more refined decomposition of modes and bands. Then, the modified multi-scale sample entropy (MMSE) and fractal dimension (FD) are extracted to capture the neural activity processes in the brain. Finally, the MSTBN features are fused with the nonlinear features MMSE and FD, which are input into random forest (RF) to identify emotions. Experimental results on DEAP dataset indicate that the accuracy is 87.24% and 89.84% for valance and arousal. Experimental analysis reveals that MSTBN of negative emotions is more divergent and emotional information is transmitted more rapidly in the brain. Women are more susceptible to emotional perception than men. The proposed multidimensional feature extraction method has potential to be applied to HCI systems.
情感识别是人机交互(HCI)系统的重要组成部分。然而,目前的情绪识别方法存在脑网络大小不一致、缺乏对不同维度特征的有效挖掘等缺点。本文提出了一种基于MSTBN和EEMD-WPT的情感识别多维特征提取方法。首先,利用小波包变换(WPT)将预处理后的脑电图信号分解为4个频段($theta,alpha,beta$、$gamma$),并利用锁相值(PLV)构建多频段连接矩阵;其次,为了去除冗余信息,建立基于最小生成树的脑网络(MSTBN),提取MSTBN特征,包括全局特征和局部特征;第三,将集成经验模态分解(EEMD)和WPT (EEMD-WPT)技术应用于脑电信号中,得到更精细的模态和频带分解。然后,提取改进的多尺度样本熵(MMSE)和分形维数(FD)来捕捉大脑的神经活动过程;最后,将MSTBN特征与非线性特征MMSE和FD融合,输入到随机森林(RF)中进行情绪识别。在DEAP数据集上的实验结果表明,该方法的准确率为87.24% and 89.84% for valance and arousal. Experimental analysis reveals that MSTBN of negative emotions is more divergent and emotional information is transmitted more rapidly in the brain. Women are more susceptible to emotional perception than men. The proposed multidimensional feature extraction method has potential to be applied to HCI systems.
{"title":"A Multidimensional Feature Extraction Method Based on MSTBN and EEMD-WPT for Emotion Recognition from EEG Signals","authors":"Shilin Zhang, Qingcheng Zhang","doi":"10.1109/BIBM55620.2022.9995251","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995251","url":null,"abstract":"Emotion recognition is an important component of human-computer interaction (HCI) systems. However, current emotion recognition methods have some drawbacks such as inconsistency in brain network size, lack of effective mining of features in different dimensions. In this paper, we propose a multidimensional feature extraction method based on MSTBN and EEMD-WPT for emotion recognition. Firstly, the wavelet packet transform (WPT) is utilized to decompose the pre-processed electroencephalography (EEG) signals into four frequency bands ($theta,alpha,beta$, and $gamma$), and phase locking value (PLV) is used to construct multi-band connectivity matrix. Secondly, to remove redundant information, the minimum spanning tree based brain network (MSTBN) is established and MSTBN features are extracted including global features and local features. Thirdly, ensemble empirical mode decomposition (EEMD) and WPT (EEMD-WPT) are applied to EEG signals for a more refined decomposition of modes and bands. Then, the modified multi-scale sample entropy (MMSE) and fractal dimension (FD) are extracted to capture the neural activity processes in the brain. Finally, the MSTBN features are fused with the nonlinear features MMSE and FD, which are input into random forest (RF) to identify emotions. Experimental results on DEAP dataset indicate that the accuracy is 87.24% and 89.84% for valance and arousal. Experimental analysis reveals that MSTBN of negative emotions is more divergent and emotional information is transmitted more rapidly in the brain. Women are more susceptible to emotional perception than men. The proposed multidimensional feature extraction method has potential to be applied to HCI systems.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128951512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PROCONSUL: PRObabilistic exploration of CONnectivity Significance patterns for disease modULe discovery 疾病模块发现的连通性显著性模式的概率探索
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995586
R. D. Luca, Marco Carfora, Gonzalo Blanco, A. Mastropietro, M. Petti, P. Tieri
The possibility to computationally prioritize candidate disease genes capitalizing on existing information has led to a speedup in the discovery of new methods. Many gene discovery techniques exploit network data, like protein-protein interactions (PPIs), in order to extract knowledge from the network structure relying on several network metrics. We here present PROCONSUL, a method that builds on top of the concept of connectivity significance (CS) and exploits the idea of probabilistic exploration of the space of putative disease genes. We show that our methodology is able to outperform the state-of-the-art tool based on CS in several settings, and propose different, effective gene discovery strategies according to specific disease network properties.
利用现有信息计算候选疾病基因优先级的可能性导致了新方法发现的加速。许多基因发现技术利用网络数据,如蛋白质-蛋白质相互作用(PPIs),以便依靠几个网络指标从网络结构中提取知识。我们在此提出PROCONSUL,这是一种建立在连接重要性(CS)概念之上的方法,并利用了对假定疾病基因空间进行概率探索的思想。我们表明,我们的方法能够在几个设置中优于基于CS的最先进的工具,并根据特定的疾病网络属性提出不同的,有效的基因发现策略。
{"title":"PROCONSUL: PRObabilistic exploration of CONnectivity Significance patterns for disease modULe discovery","authors":"R. D. Luca, Marco Carfora, Gonzalo Blanco, A. Mastropietro, M. Petti, P. Tieri","doi":"10.1109/BIBM55620.2022.9995586","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995586","url":null,"abstract":"The possibility to computationally prioritize candidate disease genes capitalizing on existing information has led to a speedup in the discovery of new methods. Many gene discovery techniques exploit network data, like protein-protein interactions (PPIs), in order to extract knowledge from the network structure relying on several network metrics. We here present PROCONSUL, a method that builds on top of the concept of connectivity significance (CS) and exploits the idea of probabilistic exploration of the space of putative disease genes. We show that our methodology is able to outperform the state-of-the-art tool based on CS in several settings, and propose different, effective gene discovery strategies according to specific disease network properties.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129150079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Dataset for Falling Risk Assessment of the Elderly using Wearable Plantar Pressure 基于可穿戴足底压力的老年人跌倒风险评估数据集
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995052
Guohua Hu, Jianxiu Jin, Zhen Song, Shibin Wu, Lin Shu, Junan Xie, Jianlin Ou, Zhuoming Chen, Xiangmin Xu
Falling is characterized by high incidence and great harm among the elderly. Timely assessing falling risk in daily life is helpful for reducing the occurrence of severe health outcomes. Establishing dataset for falling risk assessment based on wearable devices in the elderly is important work. However, current existing datasets might not reflect the natural gait of the subject due to the discomfort in wearing. Relevant data processing methods based on these datasets have limited practicability and might not be applied to real scenes in daily life. To make daily falling risk assessment possible, we proposed a novel approach to set up a continuous and wearable plantar pressure dataset of 48 older adults along with falling risk labels. The dataset was collected by plantar pressure monitoring shoes which were suitable for daily living spaces. Moreover, the Conv-LSTM algorithm was applied on the dataset, and the average classification result was up to 95.57%, reflecting the effectiveness of this dataset. The dataset is helpful for the studies of falling risk assessment and health monitoring among the elderly.
老年人跌倒具有发病率高、危害大的特点。及时评估日常生活中的跌倒风险有助于减少严重健康后果的发生。建立基于可穿戴设备的老年人跌倒风险评估数据集是一项重要的工作。然而,目前现有的数据集可能无法反映受试者的自然步态,因为穿着不舒服。基于这些数据集的相关数据处理方法实用性有限,可能无法应用于日常生活中的真实场景。为了使每日跌倒风险评估成为可能,我们提出了一种新方法,建立了48名老年人的连续可穿戴足底压力数据集,并附有跌倒风险标签。数据集由适合日常生活空间的足底压力监测鞋收集。此外,在数据集上应用了convl - lstm算法,平均分类结果高达95.57%,反映了该数据集的有效性。该数据集有助于老年人跌倒风险评估和健康监测的研究。
{"title":"A Dataset for Falling Risk Assessment of the Elderly using Wearable Plantar Pressure","authors":"Guohua Hu, Jianxiu Jin, Zhen Song, Shibin Wu, Lin Shu, Junan Xie, Jianlin Ou, Zhuoming Chen, Xiangmin Xu","doi":"10.1109/BIBM55620.2022.9995052","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995052","url":null,"abstract":"Falling is characterized by high incidence and great harm among the elderly. Timely assessing falling risk in daily life is helpful for reducing the occurrence of severe health outcomes. Establishing dataset for falling risk assessment based on wearable devices in the elderly is important work. However, current existing datasets might not reflect the natural gait of the subject due to the discomfort in wearing. Relevant data processing methods based on these datasets have limited practicability and might not be applied to real scenes in daily life. To make daily falling risk assessment possible, we proposed a novel approach to set up a continuous and wearable plantar pressure dataset of 48 older adults along with falling risk labels. The dataset was collected by plantar pressure monitoring shoes which were suitable for daily living spaces. Moreover, the Conv-LSTM algorithm was applied on the dataset, and the average classification result was up to 95.57%, reflecting the effectiveness of this dataset. The dataset is helpful for the studies of falling risk assessment and health monitoring among the elderly.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130585245","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Health Informatics on Big COVID-19 Pandemic Data via N-Shot Learning 基于N-Shot学习的COVID-19大流行大数据卫生信息学
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995592
C. Leung, Evan W. R. Madill, N. D. Tran, Christine Y. Zhang
Nowadays, very large amounts of data are generating at a fast rate from a wide variety of rich data sources. Valuable information and knowledge embedded in these big data can be discovered by data science, data mining and machine learning techniques. Biomedical records are examples of the big data. With the technological advancements, more healthcare practice has gradually been supported by electronic processes and communication. This enables health informatics, in which computer science meets the healthcare sector to address healthcare and medical problems. As a concrete example, there have been more than 635 millions cumulative cases of coronavirus disease 2019 (COVID-19) worldwide over the past 3 years since COVID-19 has declared as a pandemic. Hence, effective strategies, solutions, tools and methods—such as artificial intelligence (AI) and/or big data approaches—to tackle the COVID-19 pandemic and possible future pandemics are in demand. In this paper, we present models to analyze big COVID-19 pandemic data and make predictions via N-shot learning. Specifically, our binary model predicts whether patients are COVID-19 or not. If so, the model predicts whether they require hospitalization or not, whereas our multi-class model predicts severity and thus the corresponding levels of hospitalization required by the patients. Our models uses N-shot learning with autoencoders. Evaluation results on real-life pandemic data demonstrate the practicality of our models towards effective allocation of resources (e.g., hospital facilities, staff). These showcase the benefits of AI and/or big data approaches in tackling the pandemic.
如今,大量的数据正在从各种各样的丰富数据源中快速生成。通过数据科学、数据挖掘和机器学习技术,可以发现嵌入在这些大数据中的有价值的信息和知识。生物医学记录就是大数据的例子。随着技术的进步,越来越多的医疗保健实践逐渐得到电子流程和通信的支持。这使得健康信息学成为可能,其中计算机科学与医疗保健部门相结合,以解决医疗保健和医疗问题。一个具体的例子是,自2019冠状病毒病(COVID-19)被宣布为大流行以来,过去3年里,全球累计确诊病例超过6.35亿例。因此,需要有效的战略、解决方案、工具和方法,如人工智能和/或大数据方法,来应对COVID-19大流行和未来可能出现的大流行。在本文中,我们提出了分析COVID-19大流行数据并通过N-shot学习进行预测的模型。具体来说,我们的二元模型可以预测患者是否感染COVID-19。如果是,模型预测他们是否需要住院,而我们的多类别模型预测严重程度,从而预测患者所需的相应住院水平。我们的模型使用n次自动编码器学习。对现实大流行数据的评估结果表明,我们的模型在有效分配资源(例如医院设施、工作人员)方面具有实用性。这些展示了人工智能和/或大数据方法在应对大流行方面的好处。
{"title":"Health Informatics on Big COVID-19 Pandemic Data via N-Shot Learning","authors":"C. Leung, Evan W. R. Madill, N. D. Tran, Christine Y. Zhang","doi":"10.1109/BIBM55620.2022.9995592","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995592","url":null,"abstract":"Nowadays, very large amounts of data are generating at a fast rate from a wide variety of rich data sources. Valuable information and knowledge embedded in these big data can be discovered by data science, data mining and machine learning techniques. Biomedical records are examples of the big data. With the technological advancements, more healthcare practice has gradually been supported by electronic processes and communication. This enables health informatics, in which computer science meets the healthcare sector to address healthcare and medical problems. As a concrete example, there have been more than 635 millions cumulative cases of coronavirus disease 2019 (COVID-19) worldwide over the past 3 years since COVID-19 has declared as a pandemic. Hence, effective strategies, solutions, tools and methods—such as artificial intelligence (AI) and/or big data approaches—to tackle the COVID-19 pandemic and possible future pandemics are in demand. In this paper, we present models to analyze big COVID-19 pandemic data and make predictions via N-shot learning. Specifically, our binary model predicts whether patients are COVID-19 or not. If so, the model predicts whether they require hospitalization or not, whereas our multi-class model predicts severity and thus the corresponding levels of hospitalization required by the patients. Our models uses N-shot learning with autoencoders. Evaluation results on real-life pandemic data demonstrate the practicality of our models towards effective allocation of resources (e.g., hospital facilities, staff). These showcase the benefits of AI and/or big data approaches in tackling the pandemic.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130620948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1