首页 > 最新文献

2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)最新文献

英文 中文
DASE: Condition-specific differential alternative splicing variants estimation method without reference genome sequence, and its application to non-model organisms 无参考基因组序列的条件特异性差异选择性剪接变异体估计方法及其在非模式生物中的应用
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822540
Kouki Yonezawa, Tsukasa Mori, S. Shigeno, A. Ogura
Alternative splicing is a mechanism to produce gene expression diversity under the constraint of a limited number of genes, causing spatiotemporal gene expression in many tissues and developmental processes in organisms. This mechanism is well studied in model organisms but not in non-model organisms because the current standard method requires genomic sequences as well as fully annotated information of exons and introns, that are not accessible from non-model organisms. However, it is essential to uncover the landscape of alternative splicing of organisms to understand its evolutionary impacts and roles. We developed a method for condition-specific alternative splicing estimation based on de novo transcriptome assembly, and it would help to enlarge knowledge of alternative splicing functionalized in non-model organisms. The software is deposited to https://github.com/koukiyonezawa/DASE.
选择性剪接是在有限数量基因的约束下产生基因表达多样性的机制,在生物体的许多组织和发育过程中引起基因的时空表达。这一机制在模式生物中得到了很好的研究,但在非模式生物中却没有得到很好的研究,因为目前的标准方法需要基因组序列以及外显子和内含子的完整注释信息,而这些信息在非模式生物中是无法获得的。然而,揭示生物选择性剪接的景观对于理解其进化影响和作用是至关重要的。我们开发了一种基于从头转录组组装的条件特异性选择性剪接估计方法,这将有助于扩大对非模式生物中功能化的选择性剪接的了解。软件存放在https://github.com/koukiyonezawa/DASE。
{"title":"DASE: Condition-specific differential alternative splicing variants estimation method without reference genome sequence, and its application to non-model organisms","authors":"Kouki Yonezawa, Tsukasa Mori, S. Shigeno, A. Ogura","doi":"10.1109/BIBM.2016.7822540","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822540","url":null,"abstract":"Alternative splicing is a mechanism to produce gene expression diversity under the constraint of a limited number of genes, causing spatiotemporal gene expression in many tissues and developmental processes in organisms. This mechanism is well studied in model organisms but not in non-model organisms because the current standard method requires genomic sequences as well as fully annotated information of exons and introns, that are not accessible from non-model organisms. However, it is essential to uncover the landscape of alternative splicing of organisms to understand its evolutionary impacts and roles. We developed a method for condition-specific alternative splicing estimation based on de novo transcriptome assembly, and it would help to enlarge knowledge of alternative splicing functionalized in non-model organisms. The software is deposited to https://github.com/koukiyonezawa/DASE.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115044240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Some comparisons of gene expression classifiers 基因表达分类器的一些比较
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822783
Shinuk Kim, M. Kon, Hyowon Lee
Numerous computational studies related to cancer have been published, but increasing prediction accuracy of molecular datasets remains a challenge. Here we present a comparison of prediction based on a feature selection method combined with machine learning, for microRNA-Seq (miRNA-Seq) and mRNA-Seq data. We have tested three different approaches: support vector machine, decision tree and k nearest neighbors, under two different feature selection methods: fisher feature selection and infinite feature selection.
许多与癌症相关的计算研究已经发表,但提高分子数据集的预测准确性仍然是一个挑战。在这里,我们对microRNA-Seq (miRNA-Seq)和mRNA-Seq数据进行了基于特征选择方法和机器学习相结合的预测比较。我们在两种不同的特征选择方法:fisher特征选择和无限特征选择下测试了三种不同的方法:支持向量机、决策树和k近邻。
{"title":"Some comparisons of gene expression classifiers","authors":"Shinuk Kim, M. Kon, Hyowon Lee","doi":"10.1109/BIBM.2016.7822783","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822783","url":null,"abstract":"Numerous computational studies related to cancer have been published, but increasing prediction accuracy of molecular datasets remains a challenge. Here we present a comparison of prediction based on a feature selection method combined with machine learning, for microRNA-Seq (miRNA-Seq) and mRNA-Seq data. We have tested three different approaches: support vector machine, decision tree and k nearest neighbors, under two different feature selection methods: fisher feature selection and infinite feature selection.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124053063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Facial expression recognition based on LLENet 基于LLENet的面部表情识别
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822814
Dan Meng, Guitao Cao, Zhihai He, W. Cao
Facial expression recognition plays an important role in lie detection, and computer-aided diagnosis. Many deep learning facial expression feature extraction methods have a great improvement in recognition accuracy and robutness than traditional feature extraction methods. However, most of current deep learning methods need special parameter tuning and ad hoc fine-tuning tricks. This paper proposes a novel feature extraction model called Locally Linear Embedding Network (LLENet) for facial expression recognition. The proposed LLENet first reconstructs image sets for the cropped images. Unlike previous deep convolutional neural networks that initialized convolutional kernels randomly, we learn multi-stage kernels from reconstructed image sets directly in a supervised way. Also, we create an improved LLE to select kernels, from which we can obtain the most representative feature maps. Furthermore, to better measure the contribution of these kernels, a new distance based on kernel Euclidean is proposed. After the procedure of multi-scale feature analysis, feature representations are finally sent into a linear classifier. Experimental results on facial expression datasets (CK+) show that the proposed model can capture most representative features and thus improves previous results.
面部表情识别在测谎和计算机辅助诊断中起着重要的作用。许多深度学习面部表情特征提取方法在识别精度和鲁棒性上都比传统特征提取方法有了很大的提高。然而,目前大多数深度学习方法需要特殊的参数调整和特殊的微调技巧。提出了一种新的面部表情特征提取模型——局部线性嵌入网络(LLENet)。提出的LLENet首先为裁剪后的图像重建图像集。与以往随机初始化卷积核的深度卷积神经网络不同,我们直接以监督的方式从重构图像集中学习多阶段核。此外,我们还创建了一个改进的LLE来选择内核,从中我们可以获得最具代表性的特征映射。此外,为了更好地衡量这些核的贡献,提出了一种基于核欧几里得的距离。经过多尺度特征分析后,将特征表示送入线性分类器。在面部表情数据集(CK+)上的实验结果表明,该模型能够捕获大多数具有代表性的特征,从而改进了先前的结果。
{"title":"Facial expression recognition based on LLENet","authors":"Dan Meng, Guitao Cao, Zhihai He, W. Cao","doi":"10.1109/BIBM.2016.7822814","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822814","url":null,"abstract":"Facial expression recognition plays an important role in lie detection, and computer-aided diagnosis. Many deep learning facial expression feature extraction methods have a great improvement in recognition accuracy and robutness than traditional feature extraction methods. However, most of current deep learning methods need special parameter tuning and ad hoc fine-tuning tricks. This paper proposes a novel feature extraction model called Locally Linear Embedding Network (LLENet) for facial expression recognition. The proposed LLENet first reconstructs image sets for the cropped images. Unlike previous deep convolutional neural networks that initialized convolutional kernels randomly, we learn multi-stage kernels from reconstructed image sets directly in a supervised way. Also, we create an improved LLE to select kernels, from which we can obtain the most representative feature maps. Furthermore, to better measure the contribution of these kernels, a new distance based on kernel Euclidean is proposed. After the procedure of multi-scale feature analysis, feature representations are finally sent into a linear classifier. Experimental results on facial expression datasets (CK+) show that the proposed model can capture most representative features and thus improves previous results.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128906021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A method of removing Ocular Artifacts from EEG using Discrete Wavelet Transform and Kalman Filtering 基于离散小波变换和卡尔曼滤波的脑电信号眼部伪影去除方法
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822742
Yan Chen, Qinglin Zhao, Bin Hu, Jianpeng Li, Hua Jiang, Wenhua Lin, Yang Li, Shuangshuang Zhou, Hong Peng
Electroencephalogram (EEG) is a noninvasive method to record electrical activity of brain and it has been used extensively in research of brain function due to its high time resolution. However raw EEG is a mixture of signals, which contains noises such as Ocular Artifact (OA) that is irrelevant to the cognitive function of brain. To remove OAs from EEG, many methods have been proposed, such as Independent Components Analysis (ICA), Discrete Wavelet Transform (DWT), Adaptive Noise Cancellation (ANC) and Wavelet Packet Transform (WPT). In this paper, we present a novel hybrid de-noising method which uses Discrete Wavelet Transform (DWT) and Kalman Filtering to remove OAs in EEG. Firstly, we used this method on simulated data. The Mean Squared Error (MSE) of DWT-Kalman method was 0.0017, significantly lower compared to results using WPT-ICA and DWT-ANC, which were 0.0468 and 0.0052, respectively. Meanwhile, the Mean Absolute Error (MAE) using DWT-Kalman achieved an average of 0.0052, which also performed better than WPT-ICA and DWT-ANC, which were 0.0218 and 0.0115, respectively. Then we applied the proposed approach to the raw data collected by our prototype three-channel EEG collector and 64-channel Braincap from BRAIN PRODUCTS. On both data, our method achieved satisfying results. This method does not rely on any particular electrode or the number of electrodes in certain system, so it is recommended for ubiquitous applications.
脑电图(EEG)是一种记录脑电活动的无创方法,由于其高时间分辨率,在脑功能研究中得到了广泛的应用。然而,原始脑电图是一种混合信号,其中包含与大脑认知功能无关的噪声,如眼伪影(OA)。为了消除脑电信号中的噪声,人们提出了许多方法,如独立分量分析(ICA)、离散小波变换(DWT)、自适应噪声消除(ANC)和小波包变换(WPT)。本文提出了一种基于离散小波变换和卡尔曼滤波的混合去噪方法。首先,我们将该方法应用于模拟数据。DWT-Kalman方法的均方误差(MSE)为0.0017,显著低于WPT-ICA和DWT-ANC方法的结果(分别为0.0468和0.0052)。同时,使用DWT-Kalman的平均绝对误差(Mean Absolute Error, MAE)平均为0.0052,也优于WPT-ICA和DWT-ANC,分别为0.0218和0.0115。然后,我们将所提出的方法应用于我们的原型三通道EEG采集器和来自BRAIN PRODUCTS的64通道Braincap收集的原始数据。在这两个数据上,我们的方法都取得了令人满意的结果。该方法不依赖于任何特定的电极或特定系统中电极的数量,因此推荐用于普遍应用。
{"title":"A method of removing Ocular Artifacts from EEG using Discrete Wavelet Transform and Kalman Filtering","authors":"Yan Chen, Qinglin Zhao, Bin Hu, Jianpeng Li, Hua Jiang, Wenhua Lin, Yang Li, Shuangshuang Zhou, Hong Peng","doi":"10.1109/BIBM.2016.7822742","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822742","url":null,"abstract":"Electroencephalogram (EEG) is a noninvasive method to record electrical activity of brain and it has been used extensively in research of brain function due to its high time resolution. However raw EEG is a mixture of signals, which contains noises such as Ocular Artifact (OA) that is irrelevant to the cognitive function of brain. To remove OAs from EEG, many methods have been proposed, such as Independent Components Analysis (ICA), Discrete Wavelet Transform (DWT), Adaptive Noise Cancellation (ANC) and Wavelet Packet Transform (WPT). In this paper, we present a novel hybrid de-noising method which uses Discrete Wavelet Transform (DWT) and Kalman Filtering to remove OAs in EEG. Firstly, we used this method on simulated data. The Mean Squared Error (MSE) of DWT-Kalman method was 0.0017, significantly lower compared to results using WPT-ICA and DWT-ANC, which were 0.0468 and 0.0052, respectively. Meanwhile, the Mean Absolute Error (MAE) using DWT-Kalman achieved an average of 0.0052, which also performed better than WPT-ICA and DWT-ANC, which were 0.0218 and 0.0115, respectively. Then we applied the proposed approach to the raw data collected by our prototype three-channel EEG collector and 64-channel Braincap from BRAIN PRODUCTS. On both data, our method achieved satisfying results. This method does not rely on any particular electrode or the number of electrodes in certain system, so it is recommended for ubiquitous applications.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128961348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Integration of multiple heterogeneous omics data 多个异构组学数据的集成
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822582
Chuanchao Zhang, Juan Liu, Qianqian Shi, Xiangtian Yu, T. Zeng, Luonan Chen
Integration of different genomic profiles is challenging to understand complex diseases in a multi-view manner. Computational method is needed to preserve useful information of data types as well as correct bias. Thus, we proposed a novel framework pattern fusion analysis (PFA), to fuse the local sample patterns into a global pattern of patients with respect to the underlying data, by adaptively aligning the information in each type of biological data. In particular, PFA can adjust the distinct data types and achieve more robust sample pattern within different profiles. To validate the effectiveness of PFA, we tested PFA on various synthetic datasets and found that PFA is able to effectively capture the intrinsic clustering structure than the state-of-the-art integrative methods, such as moCluster, iClusterPlus and SNF. Moreover, in a case study on kidney cancer, PFA not only identified the multi-way feature modules among the prior-known disease associated genes, methylations and miRNAs, but also outperformed in cancer subtypes identification and could get effective clinical prognosis prediction. Totally, PFA not only provides new insights on the more holistic & systems-level sample pattern, but also supplies a new way for selecting more informative types of biological data.
整合不同的基因组图谱对以多视角理解复杂疾病具有挑战性。计算方法既要保留数据类型的有用信息,又要纠正偏差。因此,我们提出了一种新的框架模式融合分析(PFA),通过自适应地对齐每种生物数据中的信息,将局部样本模式融合到相对于基础数据的患者全局模式中。特别是,PFA可以调整不同的数据类型,并在不同的配置文件中实现更健壮的样本模式。为了验证PFA的有效性,我们在各种合成数据集上测试了PFA,发现PFA比最先进的综合方法(如moCluster, iClusterPlus和SNF)能够有效地捕获内在聚类结构。此外,在肾癌的案例研究中,PFA不仅识别了已知疾病相关基因、甲基化和mirna之间的多向特征模块,而且在癌症亚型识别方面也表现出色,能够得到有效的临床预后预测。总的来说,PFA不仅提供了更全面和系统级的样本模式的新见解,而且为选择更多信息类型的生物数据提供了一种新的方法。
{"title":"Integration of multiple heterogeneous omics data","authors":"Chuanchao Zhang, Juan Liu, Qianqian Shi, Xiangtian Yu, T. Zeng, Luonan Chen","doi":"10.1109/BIBM.2016.7822582","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822582","url":null,"abstract":"Integration of different genomic profiles is challenging to understand complex diseases in a multi-view manner. Computational method is needed to preserve useful information of data types as well as correct bias. Thus, we proposed a novel framework pattern fusion analysis (PFA), to fuse the local sample patterns into a global pattern of patients with respect to the underlying data, by adaptively aligning the information in each type of biological data. In particular, PFA can adjust the distinct data types and achieve more robust sample pattern within different profiles. To validate the effectiveness of PFA, we tested PFA on various synthetic datasets and found that PFA is able to effectively capture the intrinsic clustering structure than the state-of-the-art integrative methods, such as moCluster, iClusterPlus and SNF. Moreover, in a case study on kidney cancer, PFA not only identified the multi-way feature modules among the prior-known disease associated genes, methylations and miRNAs, but also outperformed in cancer subtypes identification and could get effective clinical prognosis prediction. Totally, PFA not only provides new insights on the more holistic & systems-level sample pattern, but also supplies a new way for selecting more informative types of biological data.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129923959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Differential Co-Expression Networks using RNA-seq and microarrays in Alzheimer's disease 使用RNA-seq和微阵列的阿尔茨海默病差异共表达网络
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822811
Hyojin Kang, Junehawk Lee, S. Yu
Differential Co-Expression Networks (DCENs) are graphical representations of genes showing differential co-expression pattern in response to experimental conditions or genetic changes. They have been successfully applied to identify condition-specific modules and provide a picture of the dynamic changes in gene regulatory networks. DCENs analysis investigates the differences among gene interconnections by calculating the expression correlation change of each gene pair between conditions. In this study, we collected many different datasets from NCBI GEO including 25 RNA-seq and 2,102 microarray samples derived from human brain and blood in Alzheimer's disease and performed differential co-expression analyses to identify functional modules responsible for the characterization of Alzheimer's disease. The DCENs were generated using Pearson correlation coefficient and meta-analysis was conducted using rank-based method. The preliminary results show that the structural characteristics of DCENs can provide new insights into the underlying gene regulatory dynamics in Alzheimer's disease. There is low size overlap between microarray- and RNA-seq-derived DCENs however, DCENs from RNA-seq would complement ones from microarray due to the higher coverage and dynamic range of RNA-seq.
差异共表达网络(dcen)是基因在实验条件或遗传变化下表现出差异共表达模式的图形表示。它们已被成功地应用于识别条件特异性模块,并提供了基因调控网络动态变化的图片。dcns分析通过计算各基因对在不同条件下的表达相关性变化来研究基因互连之间的差异。在这项研究中,我们从NCBI GEO收集了许多不同的数据集,包括来自阿尔茨海默病患者的人类大脑和血液的25个RNA-seq和2102个微阵列样本,并进行了差异共表达分析,以确定负责表征阿尔茨海默病的功能模块。采用Pearson相关系数生成dcn,采用基于秩的方法进行meta分析。初步结果表明,dcns的结构特征可以为阿尔茨海默病的潜在基因调控动力学提供新的见解。微阵列和RNA-seq衍生的dccn之间存在较小的大小重叠,但由于RNA-seq的覆盖率和动态范围更高,来自RNA-seq的dccn可以补充来自微阵列的dccn。
{"title":"Differential Co-Expression Networks using RNA-seq and microarrays in Alzheimer's disease","authors":"Hyojin Kang, Junehawk Lee, S. Yu","doi":"10.1109/BIBM.2016.7822811","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822811","url":null,"abstract":"Differential Co-Expression Networks (DCENs) are graphical representations of genes showing differential co-expression pattern in response to experimental conditions or genetic changes. They have been successfully applied to identify condition-specific modules and provide a picture of the dynamic changes in gene regulatory networks. DCENs analysis investigates the differences among gene interconnections by calculating the expression correlation change of each gene pair between conditions. In this study, we collected many different datasets from NCBI GEO including 25 RNA-seq and 2,102 microarray samples derived from human brain and blood in Alzheimer's disease and performed differential co-expression analyses to identify functional modules responsible for the characterization of Alzheimer's disease. The DCENs were generated using Pearson correlation coefficient and meta-analysis was conducted using rank-based method. The preliminary results show that the structural characteristics of DCENs can provide new insights into the underlying gene regulatory dynamics in Alzheimer's disease. There is low size overlap between microarray- and RNA-seq-derived DCENs however, DCENs from RNA-seq would complement ones from microarray due to the higher coverage and dynamic range of RNA-seq.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131161465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Factorial analysis of error correction performance using simulated next-generation sequencing data 利用模拟新一代测序数据进行误差校正性能的析因分析
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822685
Isaac Akogwu, Nan Wang, Chaoyang Zhang, Hwanseok Choi, H. Hong, P. Gong
Error correction is a critical initial step in next-generation sequencing (NGS) data analysis. Although more than 60 tools have been developed, there is no systematic evidence-based comparison with regard to their strength and weakness, especially in terms of correction accuracy. Here we report a full factorial simulation study to examine how NGS dataset characteristics (genome size, coverage depth and read length in particular) affect error correction performance (precision and F-score), as well as to compare performance sensitivity/resistance of six k-mer spectrum-based methods to variations in dataset characteristics. Multi-way ANOVA tests indicate that choice of correction method and dataset characteristics had significant effects on performance metrics. Overall, BFC, Bless, Bloocoo and Musket performed better than Lighter and Trowel on 27 synthetic datasets. For each chosen method, read length and coverage depth showed more pronounced impact on performance than genome size. This study shed insights to the performance behavior of error correction methods in response to the common variables one would encounter in real-world NGS datasets. It also warrants further studies of wet lab-generated experimental NGS data to validate findings obtained from this simulation study.
错误校正是新一代测序(NGS)数据分析的关键步骤。虽然已经开发了60多种工具,但对于它们的优缺点,特别是在校正准确性方面,还没有系统的基于证据的比较。在这里,我们报告了一项全因子模拟研究,以研究NGS数据集特征(基因组大小,覆盖深度和读取长度)如何影响纠错性能(精度和f分数),以及比较六种基于k-mer谱的方法对数据集特征变化的性能敏感性/阻力。多因素方差分析表明,校正方法和数据集特征的选择对性能指标有显著影响。总体而言,BFC、Bless、Bloocoo和Musket在27个合成数据集上的表现优于Lighter和Trowel。对于每种选择的方法,读取长度和覆盖深度比基因组大小对性能的影响更明显。这项研究揭示了错误校正方法在响应现实世界NGS数据集中可能遇到的常见变量时的性能行为。它还需要进一步研究湿实验室生成的实验NGS数据,以验证从模拟研究中获得的结果。
{"title":"Factorial analysis of error correction performance using simulated next-generation sequencing data","authors":"Isaac Akogwu, Nan Wang, Chaoyang Zhang, Hwanseok Choi, H. Hong, P. Gong","doi":"10.1109/BIBM.2016.7822685","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822685","url":null,"abstract":"Error correction is a critical initial step in next-generation sequencing (NGS) data analysis. Although more than 60 tools have been developed, there is no systematic evidence-based comparison with regard to their strength and weakness, especially in terms of correction accuracy. Here we report a full factorial simulation study to examine how NGS dataset characteristics (genome size, coverage depth and read length in particular) affect error correction performance (precision and F-score), as well as to compare performance sensitivity/resistance of six k-mer spectrum-based methods to variations in dataset characteristics. Multi-way ANOVA tests indicate that choice of correction method and dataset characteristics had significant effects on performance metrics. Overall, BFC, Bless, Bloocoo and Musket performed better than Lighter and Trowel on 27 synthetic datasets. For each chosen method, read length and coverage depth showed more pronounced impact on performance than genome size. This study shed insights to the performance behavior of error correction methods in response to the common variables one would encounter in real-world NGS datasets. It also warrants further studies of wet lab-generated experimental NGS data to validate findings obtained from this simulation study.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128689090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Is EEG causal to fNIRs? 脑电图与近红外光谱有因果关系吗?
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822648
Borzou Alipourfard, Jean X. Gao, Olajide Babawale, Hanli Liu
Causality analysis of simultaneous measurements of the brain's electrical activity and its hemodynamic activity provides the opportunity to study the neural underpinning of hemodynamic fluctuations. This multimodal analysis can also be used to extract valuable information regarding the location of the generators of various electrical events such as Alpha rhythms or epileptiform activity. To best of our knowledge, we are the first propose a method to assess causality from EEG to the hemodynamic activity measured using functional near-infrared spectroscopy (fNIRs). The main challenge in studying causality within this setting arises from the low sampling rate of the fNIRs and the mixed frequency nature of the data. Our method of analysis consists of two parts. Through a simple modification of Geweke's formulation of contamination, we first show that the low sampling frequency of the fNIRs does not cause contamination in estimating causality from EEG to fNIRs. We then apply a novel causality test to avoid the down-sampling of the EEG when measuring for causality. The method of analysis proposed here can be generalized to study causality in other biomedical signal analysis applications and mixed frequency settings.
同时测量脑电活动和血流动力学活动的因果关系分析为研究血流动力学波动的神经基础提供了机会。这种多模态分析也可用于提取有关各种电事件(如α节律或癫痫样活动)的发生器位置的有价值的信息。据我们所知,我们是第一个提出一种方法来评估从脑电图到使用功能近红外光谱(fNIRs)测量的血流动力学活性的因果关系。在这种情况下研究因果关系的主要挑战来自近红外光谱的低采样率和数据的混合频率性质。我们的分析方法由两部分组成。通过对Geweke的污染公式的简单修改,我们首先证明了低采样频率的近红外光谱在估计EEG到近红外光谱的因果关系时不会造成污染。然后,我们应用了一种新的因果关系检验,以避免在测量因果关系时脑电图的下采样。本文提出的分析方法可以推广到其他生物医学信号分析应用和混合频率设置中的因果关系研究。
{"title":"Is EEG causal to fNIRs?","authors":"Borzou Alipourfard, Jean X. Gao, Olajide Babawale, Hanli Liu","doi":"10.1109/BIBM.2016.7822648","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822648","url":null,"abstract":"Causality analysis of simultaneous measurements of the brain's electrical activity and its hemodynamic activity provides the opportunity to study the neural underpinning of hemodynamic fluctuations. This multimodal analysis can also be used to extract valuable information regarding the location of the generators of various electrical events such as Alpha rhythms or epileptiform activity. To best of our knowledge, we are the first propose a method to assess causality from EEG to the hemodynamic activity measured using functional near-infrared spectroscopy (fNIRs). The main challenge in studying causality within this setting arises from the low sampling rate of the fNIRs and the mixed frequency nature of the data. Our method of analysis consists of two parts. Through a simple modification of Geweke's formulation of contamination, we first show that the low sampling frequency of the fNIRs does not cause contamination in estimating causality from EEG to fNIRs. We then apply a novel causality test to avoid the down-sampling of the EEG when measuring for causality. The method of analysis proposed here can be generalized to study causality in other biomedical signal analysis applications and mixed frequency settings.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114493302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A modified rough-fuzzy clustering algorithm with spatial information for HEp-2 cell image segmentation 一种基于空间信息的改进的HEp-2细胞图像粗模糊聚类算法
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822549
Shaswati Roy, P. Maji
Indirect immunofluorescence (IIF) analysis is the most effective test for antinuclear autoantibodies (ANAs) analysis, in order to reveal the occurrence of some autoimmune diseases, such as connective tissue disorders. In the tests of antinuclear antibodies, the human epithelial type 2 (HEp-2) cells is mostly used as substrate. However, the recognition of the staining pattern of ANAs in the IIF image requires proper detection of the region of interest. In this regard, automatic segmentation of IIF images is an essential prerequisite as manual segmentation is labor intensive, time consuming, and subjective. Recently, rough-fuzzy clustering has been shown to provide significant results for image segmentation by handling different uncertainties present in the images. But, the existing robust rough-fuzzy clustering algorithm does not consider spatial distribution of the image. This is useful when the image is distorted by noise and other artifacts. In this regard, the paper proposes a segmentation algorithm by incorporating the spatial constraint with the advantages of robust rough-fuzzy clustering. In the current study, class label of a pixel is influenced by its neighboring pixels depending on their spatial distance. In this way, more number of neighboring pixels can be incorporated into the calculation of a pixel feature. The performance of the proposed method is evaluated on several HEp-2 cell images and compared with the existing algorithms by presenting both qualitative and quantitative results.
间接免疫荧光(IIF)分析是抗核自身抗体(ANAs)分析中最有效的方法,可用于揭示结缔组织疾病等自身免疫性疾病的发生。在抗核抗体试验中,人上皮细胞2型(HEp-2)细胞多被用作底物。然而,识别IIF图像中ANAs的染色模式需要对感兴趣的区域进行适当的检测。在这方面,IIF图像的自动分割是必不可少的先决条件,因为人工分割是劳动密集、耗时且主观的。近年来,粗糙模糊聚类通过处理图像中存在的不同不确定性,在图像分割方面取得了显著的效果。但是,现有的鲁棒粗糙模糊聚类算法没有考虑图像的空间分布。当图像被噪声和其他伪影扭曲时,这是有用的。为此,本文提出了一种结合空间约束和鲁棒粗模糊聚类优点的分割算法。在目前的研究中,像素的类标号受到其相邻像素的空间距离的影响。这样,可以将更多的相邻像素合并到像素特征的计算中。在多幅HEp-2细胞图像上评价了该方法的性能,并与现有算法进行了定性和定量对比。
{"title":"A modified rough-fuzzy clustering algorithm with spatial information for HEp-2 cell image segmentation","authors":"Shaswati Roy, P. Maji","doi":"10.1109/BIBM.2016.7822549","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822549","url":null,"abstract":"Indirect immunofluorescence (IIF) analysis is the most effective test for antinuclear autoantibodies (ANAs) analysis, in order to reveal the occurrence of some autoimmune diseases, such as connective tissue disorders. In the tests of antinuclear antibodies, the human epithelial type 2 (HEp-2) cells is mostly used as substrate. However, the recognition of the staining pattern of ANAs in the IIF image requires proper detection of the region of interest. In this regard, automatic segmentation of IIF images is an essential prerequisite as manual segmentation is labor intensive, time consuming, and subjective. Recently, rough-fuzzy clustering has been shown to provide significant results for image segmentation by handling different uncertainties present in the images. But, the existing robust rough-fuzzy clustering algorithm does not consider spatial distribution of the image. This is useful when the image is distorted by noise and other artifacts. In this regard, the paper proposes a segmentation algorithm by incorporating the spatial constraint with the advantages of robust rough-fuzzy clustering. In the current study, class label of a pixel is influenced by its neighboring pixels depending on their spatial distance. In this way, more number of neighboring pixels can be incorporated into the calculation of a pixel feature. The performance of the proposed method is evaluated on several HEp-2 cell images and compared with the existing algorithms by presenting both qualitative and quantitative results.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121116899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
2Path: A terpenoid metabolic network modeled as graph database 2 . path:一个以图数据库为模型的萜类代谢网络
Pub Date : 2016-12-01 DOI: 10.1109/BIBM.2016.7822709
Waldeyr M. C. Silva, Danilo Vilar, Daniel S. Souza, M. E. Walter, M. Brigido, M. Holanda
Terpenoids are involved in interactions as signaling for communication intra/inter species, signal molecules to attract pollinating insects, and defense against herbivores and microbes. Due to their chemical composition, many terpenoids possess vast pharmacological applicability in medicine and biotechnology, besides important roles in ecology, industry and commerce. The biosynthesis of terpenes has been widely studied over the years, and it is well known that they can be synthesized from two metabolic pathways: mevalonate pathway (MVA) and non-mevalonate pathway (MEP). On the other hand, genome-scale reconstruction of metabolic networks faces many challenges, including organizational data storage and data modeling, to properly represent the complexity of systems biology. Recent NoSQL database paradigms have introduced new concepts of scalable storage and data queries. Among them graph databases, which are versatile enough to cope with biological data. In this paper, we propose 2Path, a graph database model designed to represent terpenoid metabolic networks, with thousands of secondary metabolism reactions, such that it preserves important terpenoid biosynthesis characteristics.
萜类化合物参与相互作用,作为物种内/物种间交流的信号,吸引传粉昆虫的信号分子,以及防御食草动物和微生物。由于其化学组成,许多萜类化合物在医学和生物技术方面具有广泛的药理适用性,除了在生态、工业和商业方面具有重要作用。萜烯的生物合成已被广泛研究多年,众所周知,它们可以通过两种代谢途径合成:甲羟戊酸途径(MVA)和非甲羟戊酸途径(MEP)。另一方面,代谢网络的基因组尺度重建面临着许多挑战,包括组织数据存储和数据建模,以正确地表示系统生物学的复杂性。最近的NoSQL数据库范例引入了可伸缩存储和数据查询的新概念。其中包括图形数据库,它具有足够的通用性来处理生物数据。在本文中,我们提出了2Path,这是一个旨在表示萜类代谢网络的图形数据库模型,具有数千个次级代谢反应,因此它保留了重要的萜类生物合成特征。
{"title":"2Path: A terpenoid metabolic network modeled as graph database","authors":"Waldeyr M. C. Silva, Danilo Vilar, Daniel S. Souza, M. E. Walter, M. Brigido, M. Holanda","doi":"10.1109/BIBM.2016.7822709","DOIUrl":"https://doi.org/10.1109/BIBM.2016.7822709","url":null,"abstract":"Terpenoids are involved in interactions as signaling for communication intra/inter species, signal molecules to attract pollinating insects, and defense against herbivores and microbes. Due to their chemical composition, many terpenoids possess vast pharmacological applicability in medicine and biotechnology, besides important roles in ecology, industry and commerce. The biosynthesis of terpenes has been widely studied over the years, and it is well known that they can be synthesized from two metabolic pathways: mevalonate pathway (MVA) and non-mevalonate pathway (MEP). On the other hand, genome-scale reconstruction of metabolic networks faces many challenges, including organizational data storage and data modeling, to properly represent the complexity of systems biology. Recent NoSQL database paradigms have introduced new concepts of scalable storage and data queries. Among them graph databases, which are versatile enough to cope with biological data. In this paper, we propose 2Path, a graph database model designed to represent terpenoid metabolic networks, with thousands of secondary metabolism reactions, such that it preserves important terpenoid biosynthesis characteristics.","PeriodicalId":345384,"journal":{"name":"2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123752458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1