首页 > 最新文献

2012 IEEE 6th International Conference on Systems Biology (ISB)最新文献

英文 中文
A machine learning framework of functional biomarker discovery for different microbial communities based on metagenomic data 基于宏基因组数据的不同微生物群落功能性生物标志物发现的机器学习框架
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314121
Wei Fang, Xingzhi Chang, Xiaoquan Su, Jian Xu, Deli Zhang, K. Ning
As more than 90% of microbial community could not be isolated and cultivated, the metagenomic methods have been commonly used to analyze the microbial community as a whole. With the fast acumination of metagenomic samples, it is now intriguing to find simple biomarkers, especially functional biomarkers, which could distinguish different metagenomic samples. Next-generation sequencing techniques have enabled the detection of very accurate gene-presence (abundance) values in metagenomic studies. And the presence/absence or different abundance values for a set of genes could be used as appropriate biomarker for identification of the corresponding microbial community's phenotype. However, it is not yet clear how to select such a set of genes (features), and how accurate would it be for such a set of selected genes on prediction of microbial community's phenotype. In this study, we have evaluated different machine learning methods, including feature selection methods and classification methods, for selection of biomarkers that could distinguish different samples. Then we proposed a machine learning framework, which could discover biomarkers for different microbial communities from the mining of metagenomic data. Given a set of features (genes) and their presence values in multiple samples, we first selected discriminative features as candidate by feature selection, and then selected the feature sets with low error rate and classification accuracies as biomarkers by classification method. We have selected whole genome sequencing data from simulation, public domain and in-house metagenomic data generation facilities. We tested the framework on prediction and evaluation of the biomarkers. Results have shown that the framework could select functional biomarkers with very high accuracy. Therefore, this framework would be a suitable tool to discover functional biomarkers to distinguish different microbial communities.
由于90%以上的微生物群落无法分离和培养,宏基因组学方法已被广泛用于对微生物群落进行整体分析。随着宏基因组样本的快速积累,寻找能够区分不同宏基因组样本的简单生物标志物,尤其是功能性生物标志物已成为人们关注的焦点。新一代测序技术已经能够在宏基因组研究中检测非常准确的基因存在(丰度)值。一组基因的存在/缺失或不同的丰度值可作为鉴定相应微生物群落表型的合适生物标志物。然而,如何选择这样一组基因(特征),以及这样一组选择的基因对微生物群落表型的预测准确度如何,目前还不清楚。在这项研究中,我们评估了不同的机器学习方法,包括特征选择方法和分类方法,用于选择可以区分不同样本的生物标志物。然后,我们提出了一个机器学习框架,该框架可以从元基因组数据的挖掘中发现不同微生物群落的生物标志物。给定一组特征(基因)及其在多个样本中的存在值,首先通过特征选择选择判别特征作为候选特征,然后通过分类方法选择错误率和分类准确率较低的特征集作为生物标志物。我们从模拟、公共领域和内部宏基因组数据生成设施中选择了全基因组测序数据。我们测试了生物标记物的预测和评估框架。结果表明,该框架能够以非常高的准确率选择功能性生物标志物。因此,该框架将是发现功能生物标志物以区分不同微生物群落的合适工具。
{"title":"A machine learning framework of functional biomarker discovery for different microbial communities based on metagenomic data","authors":"Wei Fang, Xingzhi Chang, Xiaoquan Su, Jian Xu, Deli Zhang, K. Ning","doi":"10.1109/ISB.2012.6314121","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314121","url":null,"abstract":"As more than 90% of microbial community could not be isolated and cultivated, the metagenomic methods have been commonly used to analyze the microbial community as a whole. With the fast acumination of metagenomic samples, it is now intriguing to find simple biomarkers, especially functional biomarkers, which could distinguish different metagenomic samples. Next-generation sequencing techniques have enabled the detection of very accurate gene-presence (abundance) values in metagenomic studies. And the presence/absence or different abundance values for a set of genes could be used as appropriate biomarker for identification of the corresponding microbial community's phenotype. However, it is not yet clear how to select such a set of genes (features), and how accurate would it be for such a set of selected genes on prediction of microbial community's phenotype. In this study, we have evaluated different machine learning methods, including feature selection methods and classification methods, for selection of biomarkers that could distinguish different samples. Then we proposed a machine learning framework, which could discover biomarkers for different microbial communities from the mining of metagenomic data. Given a set of features (genes) and their presence values in multiple samples, we first selected discriminative features as candidate by feature selection, and then selected the feature sets with low error rate and classification accuracies as biomarkers by classification method. We have selected whole genome sequencing data from simulation, public domain and in-house metagenomic data generation facilities. We tested the framework on prediction and evaluation of the biomarkers. Results have shown that the framework could select functional biomarkers with very high accuracy. Therefore, this framework would be a suitable tool to discover functional biomarkers to distinguish different microbial communities.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125049168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Pigmented network structure detection using semi-smart adaptive filters 基于半智能自适应滤波器的色素网络结构检测
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314155
L. Nowak, M. Ogorzałek, M. P. Pawlowski
This paper demonstrates a method for detecting pigment based dermatoscopic structure called pigment network. This structure is used in dermatoscopy as one of the criteria in clinical evaluation of pigmented skin lesions and can indicate if a lesion is of malignant nature. For detection process we have developed an adaptive filter, inspired by Swarm Intelligence (SI) optimization algorithms. The introduced filtering method is applied in a non-linear manner, to processed dermatoscopic image of a skin lesion. The non-linear approach derives from SI algorithms, and allows selective image filtering. In the beginning of filtration process, the filters (agents) are randomly applied to sections of the image, where each of them adapts its output based on the neighborhood surrounding it. Agents share its information with other agents that are located in immediate vicinity. This is a new approach to the problem of dermatoscopic structure detection, and it is highly flexible, as it can be applied to images without the need of previous pre-processing stage. This feature is highly desirable, mainly due to the fact that in most cases of computer aided diagnostic, input images need to be pre-processed (e.g.: brightness normalization, histogram equation, contrast enhancement, color normalization) and results of this can introduce unwanted artifacts, so this step need to be verified by human. Results of applying the introduced method can be used as one of the differential structures criteria for calculating the Total Dermatoscopy Score (TDS) of the ABCD rule.
本文介绍了一种基于皮肤镜结构的色素网络检测方法。这种结构在皮肤镜检查中被用作临床评估色素皮肤病变的标准之一,可以指示病变是否为恶性。对于检测过程,我们开发了一种自适应滤波器,灵感来自群体智能(SI)优化算法。所介绍的滤波方法以非线性方式应用于处理过的皮肤病变的皮肤镜图像。非线性方法源自SI算法,并允许选择性图像滤波。在过滤过程的开始,过滤器(代理)随机应用于图像的各个部分,其中每个过滤器(代理)根据其周围的邻域调整其输出。代理与邻近的其他代理共享其信息。这是一种解决皮肤镜结构检测问题的新方法,它具有高度的灵活性,可以在不需要之前的预处理阶段的情况下应用于图像。这一特征是非常可取的,主要是因为在大多数计算机辅助诊断的情况下,输入图像需要进行预处理(例如:亮度归一化、直方图方程、对比度增强、颜色归一化),其结果可能会引入不必要的伪影,因此这一步需要人工验证。应用该方法的结果可作为计算ABCD规则的总皮肤镜评分(TDS)的鉴别结构标准之一。
{"title":"Pigmented network structure detection using semi-smart adaptive filters","authors":"L. Nowak, M. Ogorzałek, M. P. Pawlowski","doi":"10.1109/ISB.2012.6314155","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314155","url":null,"abstract":"This paper demonstrates a method for detecting pigment based dermatoscopic structure called pigment network. This structure is used in dermatoscopy as one of the criteria in clinical evaluation of pigmented skin lesions and can indicate if a lesion is of malignant nature. For detection process we have developed an adaptive filter, inspired by Swarm Intelligence (SI) optimization algorithms. The introduced filtering method is applied in a non-linear manner, to processed dermatoscopic image of a skin lesion. The non-linear approach derives from SI algorithms, and allows selective image filtering. In the beginning of filtration process, the filters (agents) are randomly applied to sections of the image, where each of them adapts its output based on the neighborhood surrounding it. Agents share its information with other agents that are located in immediate vicinity. This is a new approach to the problem of dermatoscopic structure detection, and it is highly flexible, as it can be applied to images without the need of previous pre-processing stage. This feature is highly desirable, mainly due to the fact that in most cases of computer aided diagnostic, input images need to be pre-processed (e.g.: brightness normalization, histogram equation, contrast enhancement, color normalization) and results of this can introduce unwanted artifacts, so this step need to be verified by human. Results of applying the introduced method can be used as one of the differential structures criteria for calculating the Total Dermatoscopy Score (TDS) of the ABCD rule.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125348400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
cGRNexp: a web platform for building combinatorial gene regulation networks based on user-uploaded gene expression datasets cGRNexp:基于用户上传的基因表达数据集构建组合基因调控网络的web平台
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314122
Huayong Xu, Hui Yu, K. Tu, Qianqian Shi, Chaochun Wei, Yuan-yuan Li, Yixue Li
While we witness rapid progresses in development of methodologies/algorithms for constructing and analyzing the combinatorial regulation network which includes both TF regulators and miRNA regulators, we find a lack of tools or servers available for facilitating related works. A web service is especially needed that allows user to upload their own expression datasets and mine the combinatorial gene reglatory networks regarding the particular experimental context. Herein we report cGRNexp, a web platform for building combinatorial gene regulation networks based on user-uploaded gene expression datasets. In cGRNexp, we deposit three types of sequence-matching-based regulatory relationships and implement two functional modules for processing microRNA-perturbed gene expression datasets and parallel miRNA/mRNA expression datasets. With the microarrays and next-generation sequencing platforms being increasingly accessible, a large amount of miRNA or mRNA expression datasets will be attainable in the near future, and thus, our web platform cGRNexp will be very useful for helping people mine the conditional combinatorial regulatory networks from their own expression datasets. cGRNexp is accessible at http://www.scbit.org/cgrnexp/.
虽然我们见证了构建和分析组合调控网络(包括TF调控因子和miRNA调控因子)的方法/算法的快速发展,但我们发现缺乏可用的工具或服务器来促进相关工作。特别需要一个网络服务,允许用户上传他们自己的表达数据集,并根据特定的实验环境挖掘组合基因调控网络。在此,我们报告了cGRNexp,一个基于用户上传的基因表达数据集构建组合基因调控网络的web平台。在cGRNexp中,我们建立了三种基于序列匹配的调控关系,并实现了两个功能模块,用于处理microrna干扰的基因表达数据集和并行miRNA/mRNA表达数据集。随着微阵列和下一代测序平台的日益便利,在不久的将来将可以获得大量的miRNA或mRNA表达数据集,因此,我们的web平台cGRNexp将非常有助于帮助人们从自己的表达数据集中挖掘条件组合调节网络。cGRNexp可通过http://www.scbit.org/cgrnexp/访问。
{"title":"cGRNexp: a web platform for building combinatorial gene regulation networks based on user-uploaded gene expression datasets","authors":"Huayong Xu, Hui Yu, K. Tu, Qianqian Shi, Chaochun Wei, Yuan-yuan Li, Yixue Li","doi":"10.1109/ISB.2012.6314122","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314122","url":null,"abstract":"While we witness rapid progresses in development of methodologies/algorithms for constructing and analyzing the combinatorial regulation network which includes both TF regulators and miRNA regulators, we find a lack of tools or servers available for facilitating related works. A web service is especially needed that allows user to upload their own expression datasets and mine the combinatorial gene reglatory networks regarding the particular experimental context. Herein we report cGRNexp, a web platform for building combinatorial gene regulation networks based on user-uploaded gene expression datasets. In cGRNexp, we deposit three types of sequence-matching-based regulatory relationships and implement two functional modules for processing microRNA-perturbed gene expression datasets and parallel miRNA/mRNA expression datasets. With the microarrays and next-generation sequencing platforms being increasingly accessible, a large amount of miRNA or mRNA expression datasets will be attainable in the near future, and thus, our web platform cGRNexp will be very useful for helping people mine the conditional combinatorial regulatory networks from their own expression datasets. cGRNexp is accessible at http://www.scbit.org/cgrnexp/.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126643291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A comparison of three weighted human gene functional association networks 三种加权人类基因功能关联网络的比较
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314108
Jing Zhao, Chun-Lin Wang, Tinghong Yang, Bo Li, Xing Chen, Xiaona Shen, Ling Fang
Gene-gene association or protein-protein interaction databases have been important resource for the study of cellular functions and human diseases. A number of gene association databases have been available in the public domain. Each of these databases has its own unique virtues, but no single database could provide enough confidence and coverage. These years some meta-databases have been built by integrating various resources of gene functional associations and weighing the evidence of each association by some score systems. In this work, we compared three weighted genome-scale human gene association networks constructed from three such meta-databases, STRING, FunCoup and FLN, respectively. We found that the three networks share a large fraction of common genes but only quite limited overlapped interactions. However, most genes involved in important cellular processes and human diseases, as well as their pairwise interactions, is included in all of the three networks. This explains why all the three networks have been successfully applied in the study of cellular functions and diseases mechanisms. We believe that further integration of these meta-databases would provide higher confidence and coverage of gene associations in human proteome and facilitate the study of human gene association networks.
基因-基因关联或蛋白-蛋白相互作用数据库已成为研究细胞功能和人类疾病的重要资源。一些基因关联数据库已经在公共领域可用。这些数据库都有其独特的优点,但是没有一个数据库能够提供足够的信心和覆盖范围。近年来,通过整合基因功能关联的各种资源,并通过一些评分系统对每种关联的证据进行权衡,建立了一些元数据库。在这项工作中,我们比较了三个加权基因组尺度的人类基因关联网络,分别由三个这样的元数据库,STRING, FunCoup和FLN构建。我们发现这三个网络共享很大一部分共同基因,但只有相当有限的重叠相互作用。然而,大多数参与重要细胞过程和人类疾病的基因,以及它们的成对相互作用,都包括在这三个网络中。这就解释了为什么这三种网络都成功地应用于细胞功能和疾病机制的研究。我们相信这些元数据库的进一步整合将为人类蛋白质组基因关联提供更高的可信度和覆盖率,并促进人类基因关联网络的研究。
{"title":"A comparison of three weighted human gene functional association networks","authors":"Jing Zhao, Chun-Lin Wang, Tinghong Yang, Bo Li, Xing Chen, Xiaona Shen, Ling Fang","doi":"10.1109/ISB.2012.6314108","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314108","url":null,"abstract":"Gene-gene association or protein-protein interaction databases have been important resource for the study of cellular functions and human diseases. A number of gene association databases have been available in the public domain. Each of these databases has its own unique virtues, but no single database could provide enough confidence and coverage. These years some meta-databases have been built by integrating various resources of gene functional associations and weighing the evidence of each association by some score systems. In this work, we compared three weighted genome-scale human gene association networks constructed from three such meta-databases, STRING, FunCoup and FLN, respectively. We found that the three networks share a large fraction of common genes but only quite limited overlapped interactions. However, most genes involved in important cellular processes and human diseases, as well as their pairwise interactions, is included in all of the three networks. This explains why all the three networks have been successfully applied in the study of cellular functions and diseases mechanisms. We believe that further integration of these meta-databases would provide higher confidence and coverage of gene associations in human proteome and facilitate the study of human gene association networks.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126346809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
System identification of the fermentation system of Thermoanaerobacter sp. X514 热厌氧菌sp. X514发酵体系的体系鉴定
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314129
Jing Yang, X. Ling, L. Yao, Hua-Liang Wei, V. Kadirkamanathan
Bioethanol production by means of anaerobic thermophilic microorganisms with pentose or hexose as the substrate are of paramount importance in sustainable fuel innovation. Manipulation of microorganisms and the associated experiment conditions by means of various ad-hoc technology is obviously the most straightforward way with the aim of maximizing bioethanol yield. However, methodology by means of mathematical modeling and analysis is often neglected among these routines. In this paper, typical input-output models are applied in the metabolic system analysis of Thermoanaerobacter sp. X514 under sole glucose substrate, sole xylose substrate and mixed glucose and xylose substrates conditions. Orthogonal Least Squares (OLS) approach is used for model parameter estimation. Model selection is proposed in order to testify the generality of the suggested model. System identification results illustrate that various forms of Nonlinear AutoRegressive with eXogenous input models (NARX) are applicable in delineating the system where different substrates (glucose or xylose) are utilized during the experiments. The proposed model structure infers that the yields of various products in X514 are mainly driven by the history information of the substrate consumption change. Moreover, the interaction between the main fermentation products of X514 is indirectly connected through the proposed models.
以戊糖或己糖为底物的厌氧嗜热微生物生产生物乙醇在可持续燃料创新中具有至关重要的意义。通过各种特殊技术来操纵微生物和相关的实验条件显然是实现生物乙醇产量最大化的最直接的方法。然而,通过数学建模和分析的方法在这些例程中往往被忽视。本文采用典型的投入产出模型对热厌氧菌sp. X514在单一葡萄糖底物、单一木糖底物以及葡萄糖和木糖混合底物条件下的代谢系统进行了分析。采用正交最小二乘法对模型参数进行估计。提出了模型选择,以证明所建议模型的通用性。系统辨识结果表明,各种形式的非线性自回归外源输入模型(NARX)适用于描述实验中使用不同底物(葡萄糖或木糖)的系统。根据所提出的模型结构推断,X514中各种产品的良率主要由衬底消耗变化的历史信息驱动。此外,通过所建立的模型间接连接了X514主要发酵产物之间的相互作用。
{"title":"System identification of the fermentation system of Thermoanaerobacter sp. X514","authors":"Jing Yang, X. Ling, L. Yao, Hua-Liang Wei, V. Kadirkamanathan","doi":"10.1109/ISB.2012.6314129","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314129","url":null,"abstract":"Bioethanol production by means of anaerobic thermophilic microorganisms with pentose or hexose as the substrate are of paramount importance in sustainable fuel innovation. Manipulation of microorganisms and the associated experiment conditions by means of various ad-hoc technology is obviously the most straightforward way with the aim of maximizing bioethanol yield. However, methodology by means of mathematical modeling and analysis is often neglected among these routines. In this paper, typical input-output models are applied in the metabolic system analysis of Thermoanaerobacter sp. X514 under sole glucose substrate, sole xylose substrate and mixed glucose and xylose substrates conditions. Orthogonal Least Squares (OLS) approach is used for model parameter estimation. Model selection is proposed in order to testify the generality of the suggested model. System identification results illustrate that various forms of Nonlinear AutoRegressive with eXogenous input models (NARX) are applicable in delineating the system where different substrates (glucose or xylose) are utilized during the experiments. The proposed model structure infers that the yields of various products in X514 are mainly driven by the history information of the substrate consumption change. Moreover, the interaction between the main fermentation products of X514 is indirectly connected through the proposed models.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131832656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel feature selection method based on CFS in cancer recognition 一种基于CFS的癌症识别特征选择新方法
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314141
Xinguo Lu, Xianghua Peng, Ping Liu, Yong Deng, Bingtao Feng, Bo Liao
In recent years, the gene expression profiles are used for cancer recognition. But the researchers are disturbed by their large variables and small observes. In this paper, a novel feature selection method based on correlation-based feature selection(CFS) was proposed. Firstly, the measures of variable to variable and variable to observe were calculated respectively. Then we utilized heuristic search method to search the space of variable for selecting informative gene subset and the subset weight was computed using these measures. Through regression we obtained a subset of distinguished genes. Finally, the stratified sampling strategy was presented to obtain the most informative genes. And classification performance was tested to evaluate the proposed method. Ten-fold cross-validation experiment was performed in three datasets including leukemia, colon cancer and prostate tumor. The experimental results show that the proposed method can obtain the distinguished gene subset and different classifier can acquire better classification performance with this subset.
近年来,基因表达谱被用于癌症识别。但研究人员对他们的大变量和小观察结果感到不安。提出了一种基于相关特征选择(CFS)的特征选择方法。首先,分别计算变量对变量和变量对观察的测度。然后利用启发式搜索方法搜索变量空间,选择信息基因子集,并利用这些度量计算子集权值。通过回归,我们得到了一个区分基因的子集。最后,提出了分层采样策略,以获得信息量最大的基因。并对该方法进行了分类性能测试。在白血病、结肠癌和前列腺肿瘤三个数据集进行十倍交叉验证实验。实验结果表明,所提出的方法可以获得可区分的基因子集,不同的分类器使用该子集可以获得更好的分类性能。
{"title":"A novel feature selection method based on CFS in cancer recognition","authors":"Xinguo Lu, Xianghua Peng, Ping Liu, Yong Deng, Bingtao Feng, Bo Liao","doi":"10.1109/ISB.2012.6314141","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314141","url":null,"abstract":"In recent years, the gene expression profiles are used for cancer recognition. But the researchers are disturbed by their large variables and small observes. In this paper, a novel feature selection method based on correlation-based feature selection(CFS) was proposed. Firstly, the measures of variable to variable and variable to observe were calculated respectively. Then we utilized heuristic search method to search the space of variable for selecting informative gene subset and the subset weight was computed using these measures. Through regression we obtained a subset of distinguished genes. Finally, the stratified sampling strategy was presented to obtain the most informative genes. And classification performance was tested to evaluate the proposed method. Ten-fold cross-validation experiment was performed in three datasets including leukemia, colon cancer and prostate tumor. The experimental results show that the proposed method can obtain the distinguished gene subset and different classifier can acquire better classification performance with this subset.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"147 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115934675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Using NMFAS to identify key biological pathways associated with human diseases 利用NMFAS识别与人类疾病相关的关键生物学途径
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314116
Hao Guo, Yun-ping Zhu, Dong Li, F. He, Qi-jun Liu
Gene expression microarray enables us to measure the gene expression levels for thousands of genes at the same time. Here, we constructed the non-negative matrix factorization analysis strategy (NMFAS) to dig the underlying biological pathways related with various diseases by factorizing the pathway expression matrix, which was extracted from microarray matrix using pathway membership information, into the product of row and column vectors. We defined row vector as the pathway activity and column vector as the gene contribution weight. Via comparing the pathway activity of two different sample groups, we can identify significantly expressed pathways. We applied this strategy on two different cases: smoking and type 2 diabetes (DM2). We found 152 differentially expressed pathways by the comparison of pathway activity between smoker and never smoker, including pathways that have been validated in literature, such as “O-Glycans biosynthesis” and “Glutathione metabolism”. We also found important genes related to smoking phenotype, such as NQO, HSPA1A, ALDH3A1. As for DM2 analysis, our results suggested 9 pathways were significantly expressed, including typical pathways like “Oxidative phosphorylation” and “mTOR signaling pathway”, and found genes like CAPNS1, APP, COX7A1, COX7B, which might play important roles in the cellular regulations of DM2. In conclusion, Our strategy can be efficiently used to integrate gene expression profiles and biological pathway information to identify the key processes underlying human disease and can identify gene pathways missed by alternative approaches.
基因表达微阵列使我们能够同时测量数千个基因的基因表达水平。本研究构建非负矩阵因子化分析策略(NMFAS),通过利用途径隶属度信息从微阵列矩阵中提取途径表达矩阵,将其因子化为行向量和列向量的乘积,从而挖掘与各种疾病相关的潜在生物学途径。我们将行向量定义为途径活性,列向量定义为基因贡献权。通过比较两个不同样本组的通路活性,我们可以确定显著表达的通路。我们将这一策略应用于两种不同的情况:吸烟和2型糖尿病(DM2)。通过比较吸烟者和从不吸烟者之间的通路活性,我们发现了152个差异表达通路,包括已在文献中验证的通路,如“o -甘聚糖生物合成”和谷胱甘肽代谢”。我们还发现了与吸烟表型相关的重要基因,如NQO、HSPA1A、ALDH3A1。对于DM2的分析,我们的结果发现9条通路显著表达,包括“氧化磷酸化”和“mTOR信号通路”等典型通路,并发现CAPNS1、APP、COX7A1、COX7B等基因可能在DM2的细胞调控中发挥重要作用。总之,我们的策略可以有效地用于整合基因表达谱和生物学途径信息,以识别人类疾病的关键过程,并可以识别其他方法遗漏的基因途径。
{"title":"Using NMFAS to identify key biological pathways associated with human diseases","authors":"Hao Guo, Yun-ping Zhu, Dong Li, F. He, Qi-jun Liu","doi":"10.1109/ISB.2012.6314116","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314116","url":null,"abstract":"Gene expression microarray enables us to measure the gene expression levels for thousands of genes at the same time. Here, we constructed the non-negative matrix factorization analysis strategy (NMFAS) to dig the underlying biological pathways related with various diseases by factorizing the pathway expression matrix, which was extracted from microarray matrix using pathway membership information, into the product of row and column vectors. We defined row vector as the pathway activity and column vector as the gene contribution weight. Via comparing the pathway activity of two different sample groups, we can identify significantly expressed pathways. We applied this strategy on two different cases: smoking and type 2 diabetes (DM2). We found 152 differentially expressed pathways by the comparison of pathway activity between smoker and never smoker, including pathways that have been validated in literature, such as “O-Glycans biosynthesis” and “Glutathione metabolism”. We also found important genes related to smoking phenotype, such as NQO, HSPA1A, ALDH3A1. As for DM2 analysis, our results suggested 9 pathways were significantly expressed, including typical pathways like “Oxidative phosphorylation” and “mTOR signaling pathway”, and found genes like CAPNS1, APP, COX7A1, COX7B, which might play important roles in the cellular regulations of DM2. In conclusion, Our strategy can be efficiently used to integrate gene expression profiles and biological pathway information to identify the key processes underlying human disease and can identify gene pathways missed by alternative approaches.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125855373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human encoded miRNAs that regulate the inflenenza virus genome 调节流感病毒基因组的人类编码mirna
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314107
H. Zhang, Xin Li, Yuanning Liu, Zhi Li, Minggang Hu, Dong Xu
Motivation:MiRNAs can downregulate gene expression by mRNA cleavage or translational repression. Discovering human encoded miRNAs that regulate the influenza virus genome is important for molecular targets for drug development, and it also plays positive role in influenza control and prevention. Methods: We propose a new method based on scoring to discover human encoded miRNAs that regulate the influenza virus genome. The scoring based on the same complementary sites, the secondary structure of the complementary sites and the binding sites of all sequences respectively. Among them, taking the secondary structure as a vital factor is a new attempt. Results: Has-miR-489, has-miR-325, has-miR-876-3p and has-miR-2117 are targeted HA, PB2, MP and NS of influenza A, respectively.
动机:MiRNAs可以通过mRNA切割或翻译抑制下调基因表达。发现调节流感病毒基因组的人类编码mirna对药物开发的分子靶点具有重要意义,对流感防控也具有积极作用。方法:我们提出了一种基于评分的新方法来发现人类编码的调节流感病毒基因组的mirna。分别基于相同的互补位点、互补位点的二级结构和所有序列的结合位点进行评分。其中,将二级结构作为关键因素是一种新的尝试。结果:Has-miR-489、has-miR-325、has-miR-876-3p和has-miR-2117分别靶向甲型流感HA、PB2、MP和NS。
{"title":"Human encoded miRNAs that regulate the inflenenza virus genome","authors":"H. Zhang, Xin Li, Yuanning Liu, Zhi Li, Minggang Hu, Dong Xu","doi":"10.1109/ISB.2012.6314107","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314107","url":null,"abstract":"Motivation:MiRNAs can downregulate gene expression by mRNA cleavage or translational repression. Discovering human encoded miRNAs that regulate the influenza virus genome is important for molecular targets for drug development, and it also plays positive role in influenza control and prevention. Methods: We propose a new method based on scoring to discover human encoded miRNAs that regulate the influenza virus genome. The scoring based on the same complementary sites, the secondary structure of the complementary sites and the binding sites of all sequences respectively. Among them, taking the secondary structure as a vital factor is a new attempt. Results: Has-miR-489, has-miR-325, has-miR-876-3p and has-miR-2117 are targeted HA, PB2, MP and NS of influenza A, respectively.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128501214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A stable simplification of a fas-signaling pathway model for apoptosis 凋亡fas信号通路模型的稳定简化
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314124
Ya-Jing Huang, W. Yong
Apoptosis is important for maintaining normal embryonic development, tissue homeostasis and normal immune-system operation in multicellular organisms. Its malfunction may result in serious diseases such as cancer, autoimmunity, and neurodegeneration. In apoptosis, tens of species are present in many biochemical reactions with times scales of widely differing orders of magnitude. According to the law of mass action, apoptosis is usually described with a large and stiff system of ODEs (ordinary differential equations). The goal of this work is to derive a simple system of ODEs by using the classical PEA (partial equilibrium approximation) method. For this purpose, we firstly justify the mathematical correctness of the PEA in a quite general framework. On the basis of this result, we simplify the Fas-signaling pathway model proposed by Hua et al. (2005) by assuming the fastness of several reversible reactions. Numerical simulations and sensitivity analysis show that our simplification model is reliable.
在多细胞生物中,细胞凋亡对于维持正常的胚胎发育、组织稳态和正常的免疫系统运作至关重要。它的功能障碍可能导致严重的疾病,如癌症、自身免疫和神经变性。在细胞凋亡中,许多生物化学反应中存在数十种物质,其时间尺度差异很大。根据质量作用定律,细胞凋亡通常用一个大而僵硬的常微分方程系统来描述。本工作的目的是利用经典的部分平衡近似方法推导出一个简单的ode系统。为此,我们首先在一个相当普遍的框架中证明PEA的数学正确性。在此结果的基础上,我们通过假设几个可逆反应的牢固性,简化了Hua等(2005)提出的fas信号通路模型。数值模拟和灵敏度分析表明,简化模型是可靠的。
{"title":"A stable simplification of a fas-signaling pathway model for apoptosis","authors":"Ya-Jing Huang, W. Yong","doi":"10.1109/ISB.2012.6314124","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314124","url":null,"abstract":"Apoptosis is important for maintaining normal embryonic development, tissue homeostasis and normal immune-system operation in multicellular organisms. Its malfunction may result in serious diseases such as cancer, autoimmunity, and neurodegeneration. In apoptosis, tens of species are present in many biochemical reactions with times scales of widely differing orders of magnitude. According to the law of mass action, apoptosis is usually described with a large and stiff system of ODEs (ordinary differential equations). The goal of this work is to derive a simple system of ODEs by using the classical PEA (partial equilibrium approximation) method. For this purpose, we firstly justify the mathematical correctness of the PEA in a quite general framework. On the basis of this result, we simplify the Fas-signaling pathway model proposed by Hua et al. (2005) by assuming the fastness of several reversible reactions. Numerical simulations and sensitivity analysis show that our simplification model is reliable.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"156 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123396677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Module of cellular networks in saccharomyces cerevisiae 酿酒酵母菌的细胞网络模块
Pub Date : 2012-09-27 DOI: 10.1109/ISB.2012.6314133
Yueying Yang, Di Liu, Jun Meng
The focus of the network of research is to determine their community or module, it helps the functional organization and evolution of the network. Modular can be seen as a function of a dynamic cell system executing complex functions in a living cell. How to identify the precious knowledge resources to build a more reliable module is still one of the most important and difficult problems in bioinformatics. We put forward a state space model combining the topological method to describe the time and space module in the cell cycle of the process. Not only our module function sets of genes related to identify a condition to activate or suppress in the cell cycle process in S.cerevisiae, but also have many different solutions, which have evolved into different molecular components will be the assembly at the right time in the cell cycle. The resulting module mapping analysis showed several assumptions connection biological process to a particular cell cycle conditions.
网络研究的重点是确定其社区或模块,这有助于网络的功能组织和演化。模块化可以看作是动态细胞系统在活细胞中执行复杂功能的功能。如何识别宝贵的知识资源,构建更可靠的模块仍然是生物信息学中最重要和最困难的问题之一。提出了一种结合拓扑方法的状态空间模型来描述过程中细胞周期的时间和空间模块。我们的模块功能集不仅与识别酵母细胞周期过程中激活或抑制条件相关的基因有关,而且还有许多不同的解决方案,这些解决方案已演变成不同的分子组分将在细胞周期的正确时间组装。由此产生的模块映射分析显示了几个将生物过程与特定细胞周期条件联系起来的假设。
{"title":"Module of cellular networks in saccharomyces cerevisiae","authors":"Yueying Yang, Di Liu, Jun Meng","doi":"10.1109/ISB.2012.6314133","DOIUrl":"https://doi.org/10.1109/ISB.2012.6314133","url":null,"abstract":"The focus of the network of research is to determine their community or module, it helps the functional organization and evolution of the network. Modular can be seen as a function of a dynamic cell system executing complex functions in a living cell. How to identify the precious knowledge resources to build a more reliable module is still one of the most important and difficult problems in bioinformatics. We put forward a state space model combining the topological method to describe the time and space module in the cell cycle of the process. Not only our module function sets of genes related to identify a condition to activate or suppress in the cell cycle process in S.cerevisiae, but also have many different solutions, which have evolved into different molecular components will be the assembly at the right time in the cell cycle. The resulting module mapping analysis showed several assumptions connection biological process to a particular cell cycle conditions.","PeriodicalId":224011,"journal":{"name":"2012 IEEE 6th International Conference on Systems Biology (ISB)","volume":"72 3-4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120891747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2012 IEEE 6th International Conference on Systems Biology (ISB)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1