首页 > 最新文献

2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)最新文献

英文 中文
EpiMCBN: A Kind of Epistasis Mining Approach Using MCMC Sampling Optimizing Bayesian Network EpiMCBN:一种基于MCMC采样优化贝叶斯网络的上位挖掘方法
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995264
Xuan Yang, Keqin Li, Yang Zhang, Xi Yu, Junli Deng, Jianxiao Liu
Proposing a more effective and accurate epistatic loci detection method is of great significance in improving crop quality, disease treatment, etc. Due to the characteristics of high accuracy and processing non-linear relationship, Bayesian network (BN) has been widely used in constructing the network of SNPs and phenotypes and thus to mine epistasis. However, the shortcoming of BN is that the search space is too large and unable to process large-scale SNPs. In this work, we propose a kind of epistasis mining method using Markov Chain Monte Carlo (MCMC) sampling optimizing Bayesian network (EpiMCBN). Firstly, we use the space of node order composed of SNPs and phenotype to replace the space of network structure. Then MCMC algorithm is used to do sampling to generate multiple different initial orders in linear space or partial space. We use Markov state transition matrix to transfer the initial samples along the Markov chain, thus obtaining multiple order samples. Then we use the $alpha$-BICBN scoring function to score the Bayesian networks corresponding to these node orders. Through estimating the probability of edge occurrence in the Bayesian networks, we get an approximate Bayesian network of SNPs and phenotype, then obtain the epistatic loci affecting phenotype. Finally, we compare EpiMCBN with the current popular epistasis mining algorithms using both simulated and real age-related macular disease (AMD) datasets. Experiment results show that EpiMCBN has better epistasis detection accuracy, lower false positive rate, and higher F1-score compared to other methods. Availability and implementation: Source code and dataset are available at: http://122.205.95.139/EpiMCBN/.
提出一种更有效、准确的上位性位点检测方法对提高作物品质、病害防治等具有重要意义。贝叶斯网络(Bayesian network, BN)由于具有精度高、处理非线性关系的特点,被广泛应用于构建snp与表型网络,进而挖掘上位性。然而,BN的缺点是搜索空间太大,无法处理大规模的snp。本文提出了一种基于马尔可夫链蒙特卡罗(MCMC)采样优化贝叶斯网络(EpiMCBN)的上位性挖掘方法。首先,我们使用由snp和表型组成的节点顺序空间来代替网络结构空间。然后利用MCMC算法进行采样,在线性空间或部分空间中生成多个不同的初始阶数。我们利用马尔可夫状态转移矩阵沿马尔可夫链传递初始样本,从而获得多阶样本。然后使用$alpha$-BICBN评分函数对这些节点顺序对应的贝叶斯网络进行评分。通过估计贝叶斯网络中边缘出现的概率,得到snp与表型的近似贝叶斯网络,进而得到影响表型的上位基因座。最后,我们使用模拟和真实年龄相关性黄斑疾病(AMD)数据集将EpiMCBN与当前流行的上位性挖掘算法进行比较。实验结果表明,与其他方法相比,EpiMCBN具有更好的上位性检测准确率、更低的假阳性率和更高的f1评分。可用性和实现:源代码和数据集可在:http://122.205.95.139/EpiMCBN/。
{"title":"EpiMCBN: A Kind of Epistasis Mining Approach Using MCMC Sampling Optimizing Bayesian Network","authors":"Xuan Yang, Keqin Li, Yang Zhang, Xi Yu, Junli Deng, Jianxiao Liu","doi":"10.1109/BIBM55620.2022.9995264","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995264","url":null,"abstract":"Proposing a more effective and accurate epistatic loci detection method is of great significance in improving crop quality, disease treatment, etc. Due to the characteristics of high accuracy and processing non-linear relationship, Bayesian network (BN) has been widely used in constructing the network of SNPs and phenotypes and thus to mine epistasis. However, the shortcoming of BN is that the search space is too large and unable to process large-scale SNPs. In this work, we propose a kind of epistasis mining method using Markov Chain Monte Carlo (MCMC) sampling optimizing Bayesian network (EpiMCBN). Firstly, we use the space of node order composed of SNPs and phenotype to replace the space of network structure. Then MCMC algorithm is used to do sampling to generate multiple different initial orders in linear space or partial space. We use Markov state transition matrix to transfer the initial samples along the Markov chain, thus obtaining multiple order samples. Then we use the $alpha$-BICBN scoring function to score the Bayesian networks corresponding to these node orders. Through estimating the probability of edge occurrence in the Bayesian networks, we get an approximate Bayesian network of SNPs and phenotype, then obtain the epistatic loci affecting phenotype. Finally, we compare EpiMCBN with the current popular epistasis mining algorithms using both simulated and real age-related macular disease (AMD) datasets. Experiment results show that EpiMCBN has better epistasis detection accuracy, lower false positive rate, and higher F1-score compared to other methods. Availability and implementation: Source code and dataset are available at: http://122.205.95.139/EpiMCBN/.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130747733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial bee colony algorithm based on self-adjusting random grouping for high-order epistasis detection 基于自调整随机分组的高阶上位检测人工蜂群算法
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995075
J. Shang, Yijun Gu, Y. Sun, Feng Li, Jin-Xin Liu, Boxin Guan
In the genome-wide association studies (GWAS), epistasis detection is of great significance to study the pathogenesis of complex diseases. Epistasis refers to the effect of interactions between multiple single nucleotide polymorphisms (SNPs) on complex diseases. In this paper, an artificial bee colony algorithm based on self-adjusting random grouping (ABC-SRG) is proposed for high-order epistasis detection. ABC-SRG adopts a new self-adjusting random grouping strategy, which realizes the division of the original data according to the fitness value of each grouping. In addition, a variance-based adaptive iteration strategy is proposed, which implements the adaptive iteration through the variance of the fitness value of each iteration of the algorithm. To demonstrate the effectiveness of the algorithm, the experiments on simulated data and real data were conducted. In the simulation experiments, ABC-SRG was compared with the other five methods for second-order and third-order SNP interaction detection. Age-related macular degeneration (AMD) data were selected for the real data experiment, and most of the SNP interactions detected in the experiment have been confirmed to be related to the AMD disease. Therefore, ABC-SRG is an effective method to detect high-order epistasis.
在全基因组关联研究(GWAS)中,上位性检测对于研究复杂疾病的发病机制具有重要意义。上位性是指多个单核苷酸多态性(snp)相互作用对复杂疾病的影响。本文提出了一种基于自调整随机分组的人工蜂群算法(ABC-SRG),用于高阶上位性检测。ABC-SRG采用一种新的自调整随机分组策略,根据每个分组的适应度值对原始数据进行划分。此外,提出了一种基于方差的自适应迭代策略,通过算法每次迭代适应度值的方差来实现自适应迭代。为了验证该算法的有效性,分别在仿真数据和实际数据上进行了实验。在模拟实验中,将ABC-SRG与其他五种方法进行了二阶和三阶SNP相互作用检测的比较。选取年龄相关性黄斑变性(Age-related macular degeneration, AMD)数据进行真实数据实验,实验中检测到的SNP相互作用大部分已被证实与AMD疾病相关。因此,ABC-SRG是检测高阶上位性的有效方法。
{"title":"Artificial bee colony algorithm based on self-adjusting random grouping for high-order epistasis detection","authors":"J. Shang, Yijun Gu, Y. Sun, Feng Li, Jin-Xin Liu, Boxin Guan","doi":"10.1109/BIBM55620.2022.9995075","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995075","url":null,"abstract":"In the genome-wide association studies (GWAS), epistasis detection is of great significance to study the pathogenesis of complex diseases. Epistasis refers to the effect of interactions between multiple single nucleotide polymorphisms (SNPs) on complex diseases. In this paper, an artificial bee colony algorithm based on self-adjusting random grouping (ABC-SRG) is proposed for high-order epistasis detection. ABC-SRG adopts a new self-adjusting random grouping strategy, which realizes the division of the original data according to the fitness value of each grouping. In addition, a variance-based adaptive iteration strategy is proposed, which implements the adaptive iteration through the variance of the fitness value of each iteration of the algorithm. To demonstrate the effectiveness of the algorithm, the experiments on simulated data and real data were conducted. In the simulation experiments, ABC-SRG was compared with the other five methods for second-order and third-order SNP interaction detection. Age-related macular degeneration (AMD) data were selected for the real data experiment, and most of the SNP interactions detected in the experiment have been confirmed to be related to the AMD disease. Therefore, ABC-SRG is an effective method to detect high-order epistasis.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"207 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131954702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
TransMixer: A Hybrid Transformer and CNN Architecture for Polyp Segmentation TransMixer:用于多边形分割的混合变压器和CNN架构
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995247
Yanglin Huang, Donghui Tan, Yuan Zhang, Xuanya Li, Kai Hu
Learning how to fully extract global representations and local features is a key factor in improving the performance of polyp segmentation. In this paper, we explore the potential of combined techniques of Transformers and convolutional neural networks (CNNs) to address the challenges of polyp segmentation. Specifically, we present TransMixer, a hybrid interaction fusion architecture of the Transformer branch and the CNN branch, which is able to enhance the local details of global representations and the global context awareness of local features. To achieve this, we first bridge the semantic gap between the Transformer branch and the CNN branch through the Interaction Fusion Module (IFM), and then make full use of both respective properties to enhance polyp feature representations. After that, we further propose the Hierarchical Attention Module (HAM) to collect polyp semantic information from high-level features to gradually guide the recovery of polyp spatial information in low-level features. Quantitative and qualitative results show that the proposed model is more robust to various complex situations compared to existing methods, and achieves state-of-the-art performance in polyp segmentation.
学习如何充分提取全局表征和局部特征是提高息肉分割性能的关键因素。在本文中,我们探讨了变压器和卷积神经网络(cnn)结合技术的潜力,以解决息肉分割的挑战。具体来说,我们提出了TransMixer,这是Transformer分支和CNN分支的混合交互融合架构,它能够增强全局表示的局部细节和局部特征的全局上下文感知。为了实现这一点,我们首先通过交互融合模块(Interaction Fusion Module, IFM)弥合Transformer分支和CNN分支之间的语义差距,然后充分利用两者各自的属性来增强息肉特征表示。在此基础上,我们进一步提出了分层注意模块(Hierarchical Attention Module, HAM),从高阶特征中收集息肉的语义信息,逐步指导低阶特征中息肉空间信息的恢复。定量和定性结果表明,与现有方法相比,该模型对各种复杂情况具有更强的鲁棒性,在息肉分割中达到了最先进的性能。
{"title":"TransMixer: A Hybrid Transformer and CNN Architecture for Polyp Segmentation","authors":"Yanglin Huang, Donghui Tan, Yuan Zhang, Xuanya Li, Kai Hu","doi":"10.1109/BIBM55620.2022.9995247","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995247","url":null,"abstract":"Learning how to fully extract global representations and local features is a key factor in improving the performance of polyp segmentation. In this paper, we explore the potential of combined techniques of Transformers and convolutional neural networks (CNNs) to address the challenges of polyp segmentation. Specifically, we present TransMixer, a hybrid interaction fusion architecture of the Transformer branch and the CNN branch, which is able to enhance the local details of global representations and the global context awareness of local features. To achieve this, we first bridge the semantic gap between the Transformer branch and the CNN branch through the Interaction Fusion Module (IFM), and then make full use of both respective properties to enhance polyp feature representations. After that, we further propose the Hierarchical Attention Module (HAM) to collect polyp semantic information from high-level features to gradually guide the recovery of polyp spatial information in low-level features. Quantitative and qualitative results show that the proposed model is more robust to various complex situations compared to existing methods, and achieves state-of-the-art performance in polyp segmentation.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131972200","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Association rule analysis for fetal heart rate pattern of late FGR FGR晚期胎儿心率模式的关联规则分析
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995462
Liyan Zhong, Shiyao Huang, Xia Li, Guiqing Liu, Qinqun Chen, Xiaomu Luo, Yuexing Hao, Jiaming Hong, Hang Wei
Late fetal growth restriction (FGR) is a common complication of pregnancy characterized by chronic hypoxia. However, late FGR is in a dilemma of the high incidence but low detection rate. Depending on the non-invasiveness and convenient operation, the routine cardiotocography (CTG) allows continuous monitoring fetal heart rate (FHR) to assess fetal intrauterine stockpiling ability. In this paper, we aimed to explore the FHR pattern of late FGR in routine CTG. For analysis, the FHR features were acquired using routine CTG in a population of 160 healthy and 102 late FGR fetuses published in IEEE Dataport. First, we explored the relationships among FHR features and their importance on late FGR assessment by utilizing hypothesis testing, principal component analysis (PCA) and Spearman correlation analysis. Second, we presented a regression coefficient-based backward-stepwise-selection of association rules analysis (ARA) called backward-stepwise Max-R2 Apriori ARA, to find the optimum itemset that helps diagnose late FGRs from healthy fetuses. The hypothesis testing, PCA and Spearman correlation analysis found eight FHR features were highly relevant to the late FGR. Moreover, the backward-stepwise Max-R2 Apriori ARA validated the correlation and interpretation about FHR features of late FGR. In conclusion, the analysis results are consistent with clinical knowledge on late FGR and help screen late FGR in antepartum fetal monitoring.
晚期胎儿生长受限(FGR)是妊娠期以慢性缺氧为特征的常见并发症。然而,晚期FGR处于高发低检出率的困境。由于无创性和操作方便,常规心脏造影(CTG)允许连续监测胎儿心率(FHR)来评估胎儿宫内储存能力。本文旨在探讨常规CTG中晚期FGR的FHR模式。为了进行分析,在IEEE Dataport上发表的160名健康胎儿和102名晚期FGR胎儿中使用常规CTG获得FHR特征。首先,通过假设检验、主成分分析(PCA)和Spearman相关分析,探讨了FHR特征之间的关系及其对FGR后期评价的重要性。其次,我们提出了一种基于回归系数的反向逐步选择关联规则分析(ARA),称为反向逐步Max-R2 Apriori ARA,以找到有助于诊断健康胎儿晚期fgr的最佳项目集。假设检验、PCA和Spearman相关分析发现8个FHR特征与FGR晚期高度相关。此外,后向逐步Max-R2 Apriori ARA验证了FGR后期FHR特征的相关性和解释。综上所述,分析结果与临床对晚期FGR的认识一致,有助于产前胎儿监护中筛查晚期FGR。
{"title":"Association rule analysis for fetal heart rate pattern of late FGR","authors":"Liyan Zhong, Shiyao Huang, Xia Li, Guiqing Liu, Qinqun Chen, Xiaomu Luo, Yuexing Hao, Jiaming Hong, Hang Wei","doi":"10.1109/BIBM55620.2022.9995462","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995462","url":null,"abstract":"Late fetal growth restriction (FGR) is a common complication of pregnancy characterized by chronic hypoxia. However, late FGR is in a dilemma of the high incidence but low detection rate. Depending on the non-invasiveness and convenient operation, the routine cardiotocography (CTG) allows continuous monitoring fetal heart rate (FHR) to assess fetal intrauterine stockpiling ability. In this paper, we aimed to explore the FHR pattern of late FGR in routine CTG. For analysis, the FHR features were acquired using routine CTG in a population of 160 healthy and 102 late FGR fetuses published in IEEE Dataport. First, we explored the relationships among FHR features and their importance on late FGR assessment by utilizing hypothesis testing, principal component analysis (PCA) and Spearman correlation analysis. Second, we presented a regression coefficient-based backward-stepwise-selection of association rules analysis (ARA) called backward-stepwise Max-R2 Apriori ARA, to find the optimum itemset that helps diagnose late FGRs from healthy fetuses. The hypothesis testing, PCA and Spearman correlation analysis found eight FHR features were highly relevant to the late FGR. Moreover, the backward-stepwise Max-R2 Apriori ARA validated the correlation and interpretation about FHR features of late FGR. In conclusion, the analysis results are consistent with clinical knowledge on late FGR and help screen late FGR in antepartum fetal monitoring.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"49 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131009264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MOVE: Integrating Multi-source Information for Predicting DTI via Cross-view Contrastive Learning MOVE:通过交叉视角对比学习整合多源信息预测DTI
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995438
Yuening Qu, Chengxin He, Jin Yin, Zhenjiang Zhao, Jingyu Chen, Lei Duan
Drug-target interaction (DTI) prediction serves as the foundation of new drug findings and drug repositioning. For drugs/targets, the sequence data contains the biological structural information, while the heterogeneous network contains the biochemical functional information. These two types of information describe different aspects of drugs and targets. Due to the complexity of DTI machinery, it is necessary to learn the representation from multiple perspectives. We hereby try to design a way to leverage information from multi-source data to the maximum extent and find a strategy to fuse them. To address the above challenges, we propose a model, named MOVE (short for integrating multi-source information for predicting DTI via cross-view contrastive 1earning), for learning comprehensive representations of each drug and target from multi-source data. MOVE extracts information from the sequence view and the network view, then utilizes a fusion module with auxiliary contrastive learning to facilitate the fusion of representations. Experimental results on the benchmark dataset demonstrate that MOVE is effective in DTI prediction.
药物-靶标相互作用(DTI)预测是新药发现和药物重新定位的基础。对于药物/靶点,序列数据包含生物结构信息,异构网络包含生物化学功能信息。这两种类型的信息描述了药物和靶点的不同方面。由于DTI机制的复杂性,有必要从多个角度学习其表示。因此,我们试图设计一种最大限度地利用多源数据信息的方法,并找到一种融合策略。为了解决上述挑战,我们提出了一个名为MOVE(通过交叉视图对比学习集成多源信息预测DTI的缩写)的模型,用于从多源数据中学习每种药物和靶标的综合表示。MOVE从序列视图和网络视图中提取信息,然后利用带有辅助对比学习的融合模块来促进表征的融合。在基准数据集上的实验结果表明,MOVE在DTI预测中是有效的。
{"title":"MOVE: Integrating Multi-source Information for Predicting DTI via Cross-view Contrastive Learning","authors":"Yuening Qu, Chengxin He, Jin Yin, Zhenjiang Zhao, Jingyu Chen, Lei Duan","doi":"10.1109/BIBM55620.2022.9995438","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995438","url":null,"abstract":"Drug-target interaction (DTI) prediction serves as the foundation of new drug findings and drug repositioning. For drugs/targets, the sequence data contains the biological structural information, while the heterogeneous network contains the biochemical functional information. These two types of information describe different aspects of drugs and targets. Due to the complexity of DTI machinery, it is necessary to learn the representation from multiple perspectives. We hereby try to design a way to leverage information from multi-source data to the maximum extent and find a strategy to fuse them. To address the above challenges, we propose a model, named MOVE (short for integrating multi-source information for predicting DTI via cross-view contrastive 1earning), for learning comprehensive representations of each drug and target from multi-source data. MOVE extracts information from the sequence view and the network view, then utilizes a fusion module with auxiliary contrastive learning to facilitate the fusion of representations. Experimental results on the benchmark dataset demonstrate that MOVE is effective in DTI prediction.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127612824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal Learning for Predicting Mortality in Patients with Pulmonary Arterial Hypertension 多模式学习预测肺动脉高压患者死亡率
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995597
M. N. I. Suvon, P. C. Tripathi, S. Alabed, A. Swift, Haiping Lu
Pulmonary Arterial Hypertension (PAH) is a lifethreatening disorder. The prediction of mortality in PAH patients can play a crucial role in the clinical management of this disease. The prediction of mortality from one modality is a difficult task that may only provide limited performance. Therefore, we propose a multimodal learning approach in this work to predict one-year mortality in PAH patients. We have utilised three modalities, which include extracted numerical imaging features, echo report categorical features, and echo report text features from Electronic Health Records (EHRs) of patients. We have proposed a feature integration module to combine features from multiple modalities. The text features have been extracted from the echo reports using the Bidirectional Encoder Representations from Transformers (BERT). An attention mechanism and a weighted summation method are also adopted during the process of feature integration. We have performed different experiments to evaluate the performance of the proposed framework for mortality prediction. The experimental results indicate that we can achieve the best AUC score of 0.89 for predicting one-year mortality by combining all three modalities. The source code of this paper is available at https://github.com/Mdnaimulislam/MultimodalTab.
肺动脉高压(PAH)是一种危及生命的疾病。PAH患者的死亡率预测对该病的临床治疗具有重要意义。从一种模式预测死亡率是一项困难的任务,可能只能提供有限的性能。因此,我们在这项工作中提出了一种多模式学习方法来预测PAH患者的一年死亡率。我们使用了三种模式,包括从患者的电子健康记录(EHRs)中提取的数值成像特征、回声报告分类特征和回声报告文本特征。我们提出了一个特征集成模块来组合来自多个模态的特征。文本特征是从回波报告中提取的,使用了变形金刚的双向编码器表示(BERT)。在特征整合过程中,采用了注意机制和加权求和方法。我们进行了不同的实验来评估所提出的死亡率预测框架的性能。实验结果表明,综合三种方法预测1年死亡率的最佳AUC得分为0.89。本文的源代码可从https://github.com/Mdnaimulislam/MultimodalTab获得。
{"title":"Multimodal Learning for Predicting Mortality in Patients with Pulmonary Arterial Hypertension","authors":"M. N. I. Suvon, P. C. Tripathi, S. Alabed, A. Swift, Haiping Lu","doi":"10.1109/BIBM55620.2022.9995597","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995597","url":null,"abstract":"Pulmonary Arterial Hypertension (PAH) is a lifethreatening disorder. The prediction of mortality in PAH patients can play a crucial role in the clinical management of this disease. The prediction of mortality from one modality is a difficult task that may only provide limited performance. Therefore, we propose a multimodal learning approach in this work to predict one-year mortality in PAH patients. We have utilised three modalities, which include extracted numerical imaging features, echo report categorical features, and echo report text features from Electronic Health Records (EHRs) of patients. We have proposed a feature integration module to combine features from multiple modalities. The text features have been extracted from the echo reports using the Bidirectional Encoder Representations from Transformers (BERT). An attention mechanism and a weighted summation method are also adopted during the process of feature integration. We have performed different experiments to evaluate the performance of the proposed framework for mortality prediction. The experimental results indicate that we can achieve the best AUC score of 0.89 for predicting one-year mortality by combining all three modalities. The source code of this paper is available at https://github.com/Mdnaimulislam/MultimodalTab.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"538 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127647165","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Single-Cell Topological Simplicial Analysis Reveals Higher-Order Cellular Complexity 单细胞拓扑简单分析揭示高阶细胞复杂性
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995365
Baihan Lin
The absence of a conventional association between the cell-cell cohabitation and its emergent dynamics into cliques during development has hindered our understanding of how cell populations proliferate, differentiate, and compete, i.e. the cell ecology. With the recent advancement of the single-cell RNA-sequencing (RNA-seq), we can potentially describe such a link by constructing network graphs that characterize the similarity of the gene expression profiles of the cell-specific transcriptional programs, and analyzing these graphs systematically using the summary statistics informed by the algebraic topology. We propose the single-cell topological simplicial analysis (scTSA). Applying this approach to the single-cell gene expression profiles from local networks of cells in different developmental stages with different outcomes reveals a previously unseen topology of cellular ecology. These networks contain an abundance of cliques of single-cell profiles bound into cavities that guide the emergence of more complicated habitation forms. We visualize these ecological patterns with topological simplicial architectures of these networks, compared with the null models. Benchmarked on the single-cell RNA-seq data of zebrafish embryogenesis spanning 38,731 cells, 25 cell types and 12 time steps, our approach highlights the gastrulation as the most critical stage, consistent with consensus in developmental biology. As a nonlinear, model-independent, and unsupervised framework, our approach can also be applied to tracing multi-scale cell lineage, identifying critical stages, or creating pseudo-time series.11For an extended version of this work and a systematic evaluation of our approach, please refer to [1] for more details.
细胞-细胞共存及其在发育过程中形成集团的动态之间缺乏传统的联系,这阻碍了我们对细胞群体如何增殖、分化和竞争的理解,即细胞生态学。随着单细胞rna测序(RNA-seq)的最新进展,我们可以通过构建表征细胞特异性转录程序基因表达谱相似性的网络图,并利用代数拓扑的汇总统计系统地分析这些图,来潜在地描述这种联系。我们提出单细胞拓扑简单分析(scTSA)。将这种方法应用于不同发育阶段具有不同结果的细胞局部网络的单细胞基因表达谱,揭示了以前未见过的细胞生态学拓扑结构。这些网络包含了大量的单细胞群,这些单细胞群被捆绑在洞穴中,引导着更复杂的居住形式的出现。与零模型相比,我们用这些网络的拓扑简单架构来可视化这些生态模式。以斑马鱼胚胎发生的单细胞RNA-seq数据为基准,跨越38,731个细胞,25种细胞类型和12个时间步骤,我们的方法强调原肠胚形成是最关键的阶段,与发育生物学的共识一致。作为一个非线性、模型无关和无监督的框架,我们的方法也可以应用于追踪多尺度细胞谱系、识别关键阶段或创建伪时间序列。有关这项工作的扩展版本和对我们方法的系统评估,请参阅[1]了解更多细节。
{"title":"Single-Cell Topological Simplicial Analysis Reveals Higher-Order Cellular Complexity","authors":"Baihan Lin","doi":"10.1109/BIBM55620.2022.9995365","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995365","url":null,"abstract":"The absence of a conventional association between the cell-cell cohabitation and its emergent dynamics into cliques during development has hindered our understanding of how cell populations proliferate, differentiate, and compete, i.e. the cell ecology. With the recent advancement of the single-cell RNA-sequencing (RNA-seq), we can potentially describe such a link by constructing network graphs that characterize the similarity of the gene expression profiles of the cell-specific transcriptional programs, and analyzing these graphs systematically using the summary statistics informed by the algebraic topology. We propose the single-cell topological simplicial analysis (scTSA). Applying this approach to the single-cell gene expression profiles from local networks of cells in different developmental stages with different outcomes reveals a previously unseen topology of cellular ecology. These networks contain an abundance of cliques of single-cell profiles bound into cavities that guide the emergence of more complicated habitation forms. We visualize these ecological patterns with topological simplicial architectures of these networks, compared with the null models. Benchmarked on the single-cell RNA-seq data of zebrafish embryogenesis spanning 38,731 cells, 25 cell types and 12 time steps, our approach highlights the gastrulation as the most critical stage, consistent with consensus in developmental biology. As a nonlinear, model-independent, and unsupervised framework, our approach can also be applied to tracing multi-scale cell lineage, identifying critical stages, or creating pseudo-time series.11For an extended version of this work and a systematic evaluation of our approach, please refer to [1] for more details.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127989598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Predicting Lymph Node Metastasis and Distant Metastasis using Differential Correlations of miRNAs and Their Target RNAs in Cancer 利用肿瘤中mirna及其靶rna的差异相关性预测淋巴结转移和远处转移
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9995322
Seokwoo Lee, Myounghoon Cho, Wook Lee, B. Park, Kyungsook Han
As the most common cause of cancer death, metastasis is a complex process that involves the spread of cancer cells from the original site to other parts of the body. Diagnosis of metastasis is usually confirmed by clinical examinations and imaging, but such diagnosis is made after metastasis occurs. Early detection of metastasis plays an important role in treatment planning, which in turn has an impact on the survival of patients. So far a few methods have been developed to predict lymph node metastasis, but few methods are available for predicting distant metastasis. Motivated by a recently known gene regulation mechanism involving miRNAs, we developed a new method for predicting both lymph node metastasis and distant metastasis. We identified differential correlations of miRNAs and their target RNAs in cancer, and built prediction models using the differential correlations. Testing the method on several types of cancer showed that differential correlations of miRNAs and their target RNAs are much more powerful than expressions of known metastasis predictive genes in predicting distant metastasis as well as lymph node metastasis. Although preliminary, the method developed in this study will be useful in predicting metastasis and thereby in determining treatment options for cancer patients.
作为癌症死亡的最常见原因,转移是一个复杂的过程,涉及癌细胞从原发部位扩散到身体的其他部位。转移的诊断通常通过临床检查和影像学证实,但这种诊断是在转移发生后才做出的。转移的早期发现在治疗计划中起着重要作用,进而影响患者的生存。目前已有几种预测淋巴结转移的方法,但用于预测远处转移的方法很少。受最近已知的涉及miRNAs的基因调控机制的启发,我们开发了一种预测淋巴结转移和远处转移的新方法。我们确定了mirna及其靶rna在癌症中的差异相关性,并利用这种差异相关性建立了预测模型。对几种类型的癌症进行的测试表明,在预测远处转移和淋巴结转移方面,miRNAs及其靶rna的差异相关性比已知转移预测基因的表达更强大。虽然是初步的,但本研究开发的方法将有助于预测癌症转移,从而确定癌症患者的治疗方案。
{"title":"Predicting Lymph Node Metastasis and Distant Metastasis using Differential Correlations of miRNAs and Their Target RNAs in Cancer","authors":"Seokwoo Lee, Myounghoon Cho, Wook Lee, B. Park, Kyungsook Han","doi":"10.1109/BIBM55620.2022.9995322","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9995322","url":null,"abstract":"As the most common cause of cancer death, metastasis is a complex process that involves the spread of cancer cells from the original site to other parts of the body. Diagnosis of metastasis is usually confirmed by clinical examinations and imaging, but such diagnosis is made after metastasis occurs. Early detection of metastasis plays an important role in treatment planning, which in turn has an impact on the survival of patients. So far a few methods have been developed to predict lymph node metastasis, but few methods are available for predicting distant metastasis. Motivated by a recently known gene regulation mechanism involving miRNAs, we developed a new method for predicting both lymph node metastasis and distant metastasis. We identified differential correlations of miRNAs and their target RNAs in cancer, and built prediction models using the differential correlations. Testing the method on several types of cancer showed that differential correlations of miRNAs and their target RNAs are much more powerful than expressions of known metastasis predictive genes in predicting distant metastasis as well as lymph node metastasis. Although preliminary, the method developed in this study will be useful in predicting metastasis and thereby in determining treatment options for cancer patients.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131352261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A morphometrics approach for inclusion of localised characteristics from medical imaging studies into genome-wide association studies 将医学影像学研究中的局部特征纳入全基因组关联研究的形态计量学方法
Pub Date : 2022-12-06 DOI: 10.1109/BIBM55620.2022.9994977
Gabrielle Dagasso, M. Wilms, N. Forkert
Medical images, such as magnetic resonance or computed tomography, are increasingly being used to investigate the genetic architecture of neurological diseases like Alzheimer’s disease, or psychiatric disorders like attention-deficit hyperactivity disorder. The quantified global or regional brain imaging measures are commonly known as imaging-specific or -derived phenotypes (IDPs) when conducting genotype-phenotype association studies. Inclusion of whole medical images rather than derived tabular data as IDPs has been done by either a voxelwise approach or a global approach of whole medical images via principal component analysis. Limitations with multiple testing and inability to isolate high variation regions within the principal components arise with either of these approaches. This work proposes a principal component analysis-like localised approach of dimensionality reduction using diffeomorphic morphometry allowing for the selection of distances to model more regional effects. The main benefit of the proposed method is that it can can reduce the dimensionality of the problem considerably in comparison to the medical image’s variability it is describing while grouping spatial information potentially lost in dimensionality reduction techniques like principal component analyses. Moreover, the approach not only allows to include locality in the analysis but can also be used as a generative model to explore the morphometric changes across an axis of particular components of interest. To demonstrate the feasibility of this pipeline for inclusion in a multivariate genome-wide association study, it was applied to 1,359 subjects from the Adolescent Brain Cognitive Development Study for traits related to attention-deficit disorder. The results show that the proposed method can identify more specific morphometric features associated with genome regions.
磁共振或计算机断层扫描等医学图像正越来越多地用于研究阿尔茨海默病等神经系统疾病或注意力缺陷多动障碍等精神疾病的遗传结构。在进行基因型-表型关联研究时,量化的全球或区域脑成像测量通常被称为成像特异性或衍生表型(IDPs)。通过体素方法或通过主成分分析的整体医学图像方法,将整个医学图像而不是派生的表格数据纳入国内流离失所者。这些方法中的任何一种都存在多重测试的局限性和无法在主成分中隔离高变化区域的能力。这项工作提出了一种类似于主成分分析的局部降维方法,使用微分形态测量法,允许选择距离来模拟更多的区域效应。该方法的主要优点是,与医学图像的可变性相比,它可以大大降低问题的维数,同时分组在主成分分析等降维技术中可能丢失的空间信息。此外,该方法不仅允许在分析中包含局部性,而且还可以用作生成模型,以探索跨特定感兴趣组件轴的形态测量学变化。为了证明该方法在多变量全基因组关联研究中的可行性,研究人员将其应用于青少年大脑认知发展研究中的1359名受试者,以研究与注意力缺陷障碍相关的特征。结果表明,该方法可以识别与基因组区域相关的更具体的形态特征。
{"title":"A morphometrics approach for inclusion of localised characteristics from medical imaging studies into genome-wide association studies","authors":"Gabrielle Dagasso, M. Wilms, N. Forkert","doi":"10.1109/BIBM55620.2022.9994977","DOIUrl":"https://doi.org/10.1109/BIBM55620.2022.9994977","url":null,"abstract":"Medical images, such as magnetic resonance or computed tomography, are increasingly being used to investigate the genetic architecture of neurological diseases like Alzheimer’s disease, or psychiatric disorders like attention-deficit hyperactivity disorder. The quantified global or regional brain imaging measures are commonly known as imaging-specific or -derived phenotypes (IDPs) when conducting genotype-phenotype association studies. Inclusion of whole medical images rather than derived tabular data as IDPs has been done by either a voxelwise approach or a global approach of whole medical images via principal component analysis. Limitations with multiple testing and inability to isolate high variation regions within the principal components arise with either of these approaches. This work proposes a principal component analysis-like localised approach of dimensionality reduction using diffeomorphic morphometry allowing for the selection of distances to model more regional effects. The main benefit of the proposed method is that it can can reduce the dimensionality of the problem considerably in comparison to the medical image’s variability it is describing while grouping spatial information potentially lost in dimensionality reduction techniques like principal component analyses. Moreover, the approach not only allows to include locality in the analysis but can also be used as a generative model to explore the morphometric changes across an axis of particular components of interest. To demonstrate the feasibility of this pipeline for inclusion in a multivariate genome-wide association study, it was applied to 1,359 subjects from the Adolescent Brain Cognitive Development Study for traits related to attention-deficit disorder. The results show that the proposed method can identify more specific morphometric features associated with genome regions.","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128766219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Computational Solutions to Explore Genomic 3D Organization 探索基因组三维组织的计算解决方案
Pub Date : 2022-12-06 DOI: 10.1109/bibm55620.2022.9995500
{"title":"Computational Solutions to Explore Genomic 3D Organization","authors":"","doi":"10.1109/bibm55620.2022.9995500","DOIUrl":"https://doi.org/10.1109/bibm55620.2022.9995500","url":null,"abstract":"","PeriodicalId":210337,"journal":{"name":"2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125359277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1