Applications of signal processing techniques to bioinformatics, genomics, and proteomics.

Erchin Serpedin, Javier Garcia-Frias, Yufei Huang, Ulisses Braga-Neto
EURASIP Journal on Bioinformatics & Systems Biology, 2009, article 250306. Published online 23 April 2009. DOI: https://doi.org/10.1155/2009/250306
The recent development of high-throughput molecular genetics technologies has had a major impact on bioinformatics and systems biology. These technologies have made it possible to measure the expression profiles of genes and proteins in a highly parallel and integrated fashion. Examining the huge amounts of genomic and proteomic data holds promise for understanding the complex interactions between genes and proteins, the functional processes of a cell, and the impact of various factors on a cell, and ultimately for enabling the design of new technologies for the intelligent management of diseases.

This special issue focuses on the modeling and processing of data arising in bioinformatics, genomics, and proteomics using signal processing methods. Signal processing techniques are important because of their role in extracting, processing, and interpreting the information contained in genomic and proteomic data. It is our hope that signal processing methods will lead to new advances and insights in uncovering the structure, functioning, and evolution of biological systems.

The special issue consists of nine papers that span a wide range of problems and applications in bioinformatics, genomics, and proteomics, such as the design of compressive sensing microarrays, the analysis of missing values in microarray data and the effect of imputation techniques on post-genomic inference methods, RNA sequence alignment, the detection of periodicity in genomic sequences and gene expression profiles, the clustering and classification of gene and protein expression data, and intervention in probabilistic Boolean networks. Next, we briefly introduce the papers in this special issue.

W. Dai et al. analyze how to design a microarray that is fit for compressive sensing and that also captures the biochemistry of probe-target DNA hybridization. Algorithms and design results are reported for determining probe sequences that satisfy the binding requirements and for evaluating the target concentrations.

M. S. B. Sehgal et al. address the general problem of improving post-genomic knowledge discovery procedures, such as the selection of the most significant genes and the inference of gene regulatory networks, by using imputation techniques for missing microarray data. It is shown that, instead of neglecting missing data, recycling microarray data via robust imputation techniques can yield substantial performance improvements in the subsequent post-genomic discovery procedures.

B.-J. Yoon develops a novel, efficient, and robust approach for fast and accurate structural alignment of RNAs, including pseudoknots. The proposed method accelerates the dynamic programming algorithm for family-specific models such as profile-csHMMs and CMs, and is robust to small parameter changes in the model used to predict the constraint.

The paper by J. Epps explains in detail the origins of ambiguity in period estimation for symbolic sequences, and proposes a novel hybrid autocorrelation-IPDFT technique for periodicity characterization of sequences.

W. Zhao et al. develop a novel algorithm for identifying genes involved in cyclic processes by combining gene expression analysis with prior knowledge. The proposed cyclic-gene detection algorithm is validated on data sets from Saccharomyces cerevisiae and Drosophila melanogaster, and is shown to be a valuable technique for unveiling pathways related to cyclic processes.

T. J. Hestilow and Y. Huang propose a novel method for gene clustering using the shape information of gene expression profiles. The shape information, which is represented in terms of normalized and time-scaled forward first-order differences, is then exploited by a variational Bayes clustering approach and a non-Bayesian (Silhouette) cluster statistic, and is shown to yield promising results in clustering time-series microarray data.

A second paper by W. Zhao et al. proposes a new clustering approach that combines traditional clustering methods with power spectral analysis of time-series gene expression measurements. Simulation results confirm that the proposed approach provides superior performance relative to hierarchical clustering, K-means, and self-organizing maps, and yields additional information about temporally regulated genetic processes, for example, the cell cycle.

T. T. Vu and U. M. Braga-Neto address the important problem of assessing the effectiveness of bagging in the classification of small-sample genomic and proteomic data sets. Representative experimental results are presented and discussed.

Finally, the paper by B. Faryabi et al. studies how a reduction in the values of the model parameters affects intervention performance in probabilistic Boolean networks.
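
As a rough illustration of the shape representation mentioned for the Hestilow and Huang paper, the sketch below computes normalized, time-scaled forward first-order differences for a single expression profile. It is not the authors' implementation: the function name `shape_features` and the choice of unit Euclidean norm for the normalization step are assumptions made here for illustration, since the abstract does not specify them.

```python
import math


def shape_features(values, times):
    """Return normalized, time-scaled forward first-order differences.

    `values` are expression measurements and `times` the matching
    sampling times. The name and the normalization are hypothetical.
    """
    # Forward difference scaled by the sampling interval:
    # (x[i+1] - x[i]) / (t[i+1] - t[i])
    slopes = [
        (values[i + 1] - values[i]) / (times[i + 1] - times[i])
        for i in range(len(values) - 1)
    ]
    # Assumed normalization: divide by the Euclidean norm so that only
    # the shape of the profile remains, not its amplitude.
    norm = math.sqrt(sum(s * s for s in slopes))
    return [s / norm for s in slopes] if norm > 0 else slopes


# Two profiles with the same shape but a tenfold difference in amplitude,
# sampled at unevenly spaced time points.
profile_a = shape_features([1.0, 2.0, 4.0, 3.0], [0, 1, 3, 4])
profile_b = shape_features([10.0, 20.0, 40.0, 30.0], [0, 1, 3, 4])
```

Under this assumed normalization, the two profiles map to the same feature vector, which is the property a shape-based clustering step would rely on.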