首页 > 最新文献

International Journal of Bioinformatics Research and Applications最新文献

英文 中文
Mining nutrigenetics patterns related to obesity: use of parallel multifactor dimensionality reduction 挖掘与肥胖相关的营养遗传学模式:使用平行多因素降维
Q4 Health Professions Pub Date : 2015-05-01 DOI: 10.1504/IJBRA.2015.069194
K. Karayianni, K. Grimaldi, K. Nikita, I. Valavanis
This paper aims to enlighten the complex etiology beneath obesity by analysing data from a large nutrigenetics study, in which nutritional and genetic factors associated with obesity were recorded for around two thousand individuals. In our previous work, these data have been analysed using artificial neural network methods, which identified optimised subsets of factors to predict one's obesity status. These methods did not reveal though how the selected factors interact with each other in the obtained predictive models. For that reason, parallel Multifactor Dimensionality Reduction (pMDR) was used here to further analyse the pre-selected subsets of nutrigenetic factors. Within pMDR, predictive models using up to eight factors were constructed, further reducing the input dimensionality, while rules describing the interactive effects of the selected factors were derived. In this way, it was possible to identify specific genetic variations and their interactive effects with particular nutritional factors, which are now under further study.
本文旨在通过分析来自一项大型营养遗传学研究的数据来揭示肥胖背后的复杂病因,该研究记录了大约2000人与肥胖相关的营养和遗传因素。在我们之前的工作中,这些数据已经使用人工神经网络方法进行了分析,该方法确定了预测一个人肥胖状况的优化因素子集。虽然这些方法并没有揭示所选择的因素如何在获得的预测模型中相互作用。为此,本文采用并行多因子降维法(pMDR)进一步分析了预先选择的营养因子子集。在pMDR中,构建了使用多达8个因素的预测模型,进一步降低了输入维度,同时导出了描述所选因素相互作用的规则。通过这种方式,有可能确定特定的遗传变异及其与特定营养因素的相互作用,目前正在进一步研究中。
{"title":"Mining nutrigenetics patterns related to obesity: use of parallel multifactor dimensionality reduction","authors":"K. Karayianni, K. Grimaldi, K. Nikita, I. Valavanis","doi":"10.1504/IJBRA.2015.069194","DOIUrl":"https://doi.org/10.1504/IJBRA.2015.069194","url":null,"abstract":"This paper aims to enlighten the complex etiology beneath obesity by analysing data from a large nutrigenetics study, in which nutritional and genetic factors associated with obesity were recorded for around two thousand individuals. In our previous work, these data have been analysed using artificial neural network methods, which identified optimised subsets of factors to predict one's obesity status. These methods did not reveal though how the selected factors interact with each other in the obtained predictive models. For that reason, parallel Multifactor Dimensionality Reduction (pMDR) was used here to further analyse the pre-selected subsets of nutrigenetic factors. Within pMDR, predictive models using up to eight factors were constructed, further reducing the input dimensionality, while rules describing the interactive effects of the selected factors were derived. In this way, it was possible to identify specific genetic variations and their interactive effects with particular nutritional factors, which are now under further study.","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.069194","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66702249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysing extremely small sized ratio datasets 分析极小的比率数据集
Q4 Health Professions Pub Date : 2015-05-01 DOI: 10.1504/IJBRA.2015.069225
Piero Ricchiuto, J. Sng, W. Goh
The naïve use of expression ratios in high-throughput biological studies can greatly limit analytical outcome especially when sample size is small. In the worst-case scenario, with only one reference and one test state each (often due to the severe lack of study material); such limitations make it difficult to perform statistically meaningful analysis. Workarounds include the single sample Z-test or through network inference. Here, we describe a complementary plot-based approach for analysing such extremely small sized ratio (ESSR) data - a generalisation of the Bland-Altman plot, which we shall refer to as the Dodeca-Panels. Included in this paper is an R implementation of the Dodeca-Panels method.
naïve在高通量生物学研究中使用表达比会极大地限制分析结果,特别是在样本量小的情况下。在最坏的情况下,每个人只有一个参考和一个测试状态(通常是由于严重缺乏学习材料);这些限制使得很难进行统计上有意义的分析。变通方法包括单样本z检验或通过网络推理。在这里,我们描述了一种互补的基于图的方法来分析这种极小尺寸的比率(ESSR)数据——Bland-Altman图的概括,我们将其称为十二面板。本文中包含了Dodeca-Panels方法的R实现。
{"title":"Analysing extremely small sized ratio datasets","authors":"Piero Ricchiuto, J. Sng, W. Goh","doi":"10.1504/IJBRA.2015.069225","DOIUrl":"https://doi.org/10.1504/IJBRA.2015.069225","url":null,"abstract":"The naïve use of expression ratios in high-throughput biological studies can greatly limit analytical outcome especially when sample size is small. In the worst-case scenario, with only one reference and one test state each (often due to the severe lack of study material); such limitations make it difficult to perform statistically meaningful analysis. Workarounds include the single sample Z-test or through network inference. Here, we describe a complementary plot-based approach for analysing such extremely small sized ratio (ESSR) data - a generalisation of the Bland-Altman plot, which we shall refer to as the Dodeca-Panels. Included in this paper is an R implementation of the Dodeca-Panels method.","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.069225","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66702316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
BioInt: an integrative biological object-oriented application framework and interpreter BioInt:一个集成的生物面向对象的应用程序框架和解释器
Q4 Health Professions Pub Date : 2015-05-01 DOI: 10.1504/IJBRA.2015.069195
Sanket Desai, P. Burra
BioInt, a biological programming application framework and interpreter, is an attempt to equip the researchers with seamless integration, efficient extraction and effortless analysis of the data from various biological databases and algorithms. Based on the type of biological data, algorithms and related functionalities, a biology-specific framework was developed which has nine modules. The modules are a compilation of numerous reusable BioADTs. This software ecosystem containing more than 450 biological objects underneath the interpreter makes it flexible, integrative and comprehensive. Similar to Python, BioInt eliminates the compilation and linking steps cutting the time significantly. The researcher can write the scripts using available BioADTs (following C++ syntax) and execute them interactively or use as a command line application. It has features that enable automation, extension of the framework with new/external BioADTs/libraries and deployment of complex work flows.
BioInt是一个生物编程应用框架和解释器,旨在使研究人员能够无缝集成、高效提取和轻松分析来自各种生物数据库和算法的数据。根据生物数据的类型、算法和相关功能,开发了一个包含九个模块的生物专用框架。这些模块是许多可重复使用的bioadt的汇编。这个包含450多个生物对象的软件生态系统使其灵活、综合、全面。与Python类似,BioInt消除了编译和链接步骤,大大减少了时间。研究人员可以使用可用的BioADTs(遵循c++语法)编写脚本,并以交互方式执行它们或作为命令行应用程序使用。它具有支持自动化、使用新的/外部bioadt /库扩展框架以及部署复杂工作流的功能。
{"title":"BioInt: an integrative biological object-oriented application framework and interpreter","authors":"Sanket Desai, P. Burra","doi":"10.1504/IJBRA.2015.069195","DOIUrl":"https://doi.org/10.1504/IJBRA.2015.069195","url":null,"abstract":"BioInt, a biological programming application framework and interpreter, is an attempt to equip the researchers with seamless integration, efficient extraction and effortless analysis of the data from various biological databases and algorithms. Based on the type of biological data, algorithms and related functionalities, a biology-specific framework was developed which has nine modules. The modules are a compilation of numerous reusable BioADTs. This software ecosystem containing more than 450 biological objects underneath the interpreter makes it flexible, integrative and comprehensive. Similar to Python, BioInt eliminates the compilation and linking steps cutting the time significantly. The researcher can write the scripts using available BioADTs (following C++ syntax) and execute them interactively or use as a command line application. It has features that enable automation, extension of the framework with new/external BioADTs/libraries and deployment of complex work flows.","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.069195","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66702260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Predicting DNA mutations during cancer evolution 预测癌症进化过程中的DNA突变
Q4 Health Professions Pub Date : 2015-05-01 DOI: 10.1504/IJBRA.2015.069186
J. Martínez, Nelson Lopez-Jimenez, Tao Meng, S. S. Iyengar
Bio-systems are inherently complex information processing systems. Their physiological complexities limit the formulation and testing of a hypothesis for their behaviour. Our goal here was to test a computational framework utilising published data from a longitudinal study of patients with acute myeloid leukaemia (AML), whose DNA from both normal and malignant tissues were subjected to NGS analysis at various points in time. By processing the sequencing data before relapse time, we tested our framework by predicting the regions of the genome to be mutated at relapse time and, later, by comparing our results with the actual regions that showed mutations (discovered by genome sequencing at relapse time). After a detailed statistical analysis, the resulting correlation coefficient (degree of matching of proposed framework with real data) is 0.9816 ± 0.009 at 95% confidence interval. This high performance from our proposed framework opens new research opportunities for bioinformatics researchers and clinical doctors.
生物系统本质上是复杂的信息处理系统。它们生理上的复杂性限制了它们行为假说的形成和检验。我们的目标是利用急性髓性白血病(AML)患者纵向研究的已发表数据来测试计算框架,这些患者来自正常和恶性组织的DNA在不同时间点进行了NGS分析。通过在复发前处理测序数据,我们通过预测在复发时发生突变的基因组区域来测试我们的框架,然后通过将我们的结果与显示突变的实际区域(在复发时通过基因组测序发现)进行比较来测试我们的框架。经过详细的统计分析,得出的相关系数(所提出框架与实际数据的匹配程度)在95%置信区间为0.9816±0.009。我们提出的框架的这种高性能为生物信息学研究人员和临床医生开辟了新的研究机会。
{"title":"Predicting DNA mutations during cancer evolution","authors":"J. Martínez, Nelson Lopez-Jimenez, Tao Meng, S. S. Iyengar","doi":"10.1504/IJBRA.2015.069186","DOIUrl":"https://doi.org/10.1504/IJBRA.2015.069186","url":null,"abstract":"Bio-systems are inherently complex information processing systems. Their physiological complexities limit the formulation and testing of a hypothesis for their behaviour. Our goal here was to test a computational framework utilising published data from a longitudinal study of patients with acute myeloid leukaemia (AML), whose DNA from both normal and malignant tissues were subjected to NGS analysis at various points in time. By processing the sequencing data before relapse time, we tested our framework by predicting the regions of the genome to be mutated at relapse time and, later, by comparing our results with the actual regions that showed mutations (discovered by genome sequencing at relapse time). After a detailed statistical analysis, the resulting correlation coefficient (degree of matching of proposed framework with real data) is 0.9816 ± 0.009 at 95% confidence interval. This high performance from our proposed framework opens new research opportunities for bioinformatics researchers and clinical doctors.","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.069186","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66702184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Understanding the virulence of the entero-aggregative E. coli O104: H4 了解肠道聚集性大肠杆菌O104: H4的毒力
Q4 Health Professions Pub Date : 2015-05-01 DOI: 10.1504/IJBRA.2015.069185
B. Kesavan, K. Srividhya, S. Krishnaswamy, M. Raja, N. Vidya, A. Mohan
O104:H4 is a new strain of E. coli that has caused an outbreak in Germany. It was isolated from patients with bloody diarrhoea and Haemolytic Uremic Syndrome (HUS). BGI (www.genomics.cn) sequenced and assembled this new strain. It was reported to show resistance to a number of drugs that are toxic to other E. coli and causes serious complications during infections, which ultimately lead to death. Multi-drug resistance and high virulence of this strain is thought to be acquired from different sources, by horizontal gene transfer. A total of 38 prophage elements were detected from the new strain by using three computational tools viz., DRAD, Prophage Finder and PHAST. Analysis on these prophage elements shows a number of virulence proteins like Shiga toxin and multi-drug resistance protein encoding genes. The high virulence of the strain could be attributed by the prophage elements acquired from its micro environment.
O104:H4是一种新的大肠杆菌菌株,在德国引起了疫情。它是从血性腹泻和溶血性尿毒症综合征(HUS)患者中分离出来的。华大基因(www.genomics.cn)对这一新菌株进行了测序和组装。据报道,它对许多对其他大肠杆菌有毒的药物有抗药性,并在感染期间引起严重并发症,最终导致死亡。该菌株具有多药耐药性和高毒力,被认为是通过水平基因转移从不同来源获得的。利用DRAD、prophage Finder和PHAST三种计算工具,从新菌株中共检测到38个噬菌体元件。对这些原噬菌体元件的分析显示了许多毒力蛋白如志贺毒素和多重耐药蛋白编码基因。该菌株的高毒力可能与从其微环境中获得的前噬菌体元素有关。
{"title":"Understanding the virulence of the entero-aggregative E. coli O104: H4","authors":"B. Kesavan, K. Srividhya, S. Krishnaswamy, M. Raja, N. Vidya, A. Mohan","doi":"10.1504/IJBRA.2015.069185","DOIUrl":"https://doi.org/10.1504/IJBRA.2015.069185","url":null,"abstract":"O104:H4 is a new strain of E. coli that has caused an outbreak in Germany. It was isolated from patients with bloody diarrhoea and Haemolytic Uremic Syndrome (HUS). BGI (www.genomics.cn) sequenced and assembled this new strain. It was reported to show resistance to a number of drugs that are toxic to other E. coli and causes serious complications during infections, which ultimately lead to death. Multi-drug resistance and high virulence of this strain is thought to be acquired from different sources, by horizontal gene transfer. A total of 38 prophage elements were detected from the new strain by using three computational tools viz., DRAD, Prophage Finder and PHAST. Analysis on these prophage elements shows a number of virulence proteins like Shiga toxin and multi-drug resistance protein encoding genes. The high virulence of the strain could be attributed by the prophage elements acquired from its micro environment.","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.069185","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"66702131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
TDAC: Co-Expressed Gene Pattern Finding Using Attribute Clustering TDAC:利用属性聚类发现共表达基因模式
Q4 Health Professions Pub Date : 2015-02-01 DOI: 10.1007/978-81-322-1602-5_64
Tahleen A. Rahman, D. Bhattacharyya
{"title":"TDAC: Co-Expressed Gene Pattern Finding Using Attribute Clustering","authors":"Tahleen A. Rahman, D. Bhattacharyya","doi":"10.1007/978-81-322-1602-5_64","DOIUrl":"https://doi.org/10.1007/978-81-322-1602-5_64","url":null,"abstract":"","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75595974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HDVDB: a data warehouse for hepatitis delta virus. 丁型肝炎病毒数据仓库。
Q4 Health Professions Pub Date : 2015-01-01 DOI: 10.1504/IJBRA.2015.068091
Sarita Singh, Sunil Kumar Gupta, Anuradha Nischal, Kamlesh Kumar Pant, Prahlad Kishore Seth

Hepatitis Delta Virus (HDV) is an RNA virus and causes delta hepatitis in humans. Although a lot of data is available for HDV, but retrieval of information is a complicated task. Current web database 'HDVDB' provides a comprehensive web-resource for HDV. The database is basically concerned with basic information about HDV and disease caused by this virus, genome structure, pathogenesis, epidemiology, symptoms and prevention, etc. Database also supplies sequence data and bibliographic information about HDV. A tool 'siHDV Predict' to design the effective siRNA molecule to control the activity of HDV, is also integrated in database. It is a user friendly information system available at public domain and provides annotated information about HDV for research scholars, scientists, pharma industry people for further study.

丁型肝炎病毒(HDV)是一种RNA病毒,可引起人类丁型肝炎。虽然HDV有大量的可用数据,但信息的检索是一项复杂的任务。当前的网络数据库“HDVDB”为HDV提供了一个全面的网络资源。该数据库主要涉及HDV及其引起的疾病的基本信息、基因组结构、发病机制、流行病学、症状和预防等。数据库还提供HDV序列数据和书目信息。数据库中还集成了siHDV预测工具,用于设计有效的siRNA分子来控制HDV的活性。它是一个用户友好的信息系统,可在公共领域使用,并为研究学者,科学家,制药行业人士提供有关HDV的注释信息,以供进一步研究。
{"title":"HDVDB: a data warehouse for hepatitis delta virus.","authors":"Sarita Singh,&nbsp;Sunil Kumar Gupta,&nbsp;Anuradha Nischal,&nbsp;Kamlesh Kumar Pant,&nbsp;Prahlad Kishore Seth","doi":"10.1504/IJBRA.2015.068091","DOIUrl":"https://doi.org/10.1504/IJBRA.2015.068091","url":null,"abstract":"<p><p>Hepatitis Delta Virus (HDV) is an RNA virus and causes delta hepatitis in humans. Although a lot of data is available for HDV, but retrieval of information is a complicated task. Current web database 'HDVDB' provides a comprehensive web-resource for HDV. The database is basically concerned with basic information about HDV and disease caused by this virus, genome structure, pathogenesis, epidemiology, symptoms and prevention, etc. Database also supplies sequence data and bibliographic information about HDV. A tool 'siHDV Predict' to design the effective siRNA molecule to control the activity of HDV, is also integrated in database. It is a user friendly information system available at public domain and provides annotated information about HDV for research scholars, scientists, pharma industry people for further study. </p>","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.068091","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33018043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
TDAC: co-expressed gene pattern finding using attribute clustering. TDAC:利用属性聚类发现共表达基因模式。
Q4 Health Professions Pub Date : 2015-01-01 DOI: 10.1504/IJBRA.2015.067339
Tahleen A Rahman, Dhruba K Bhattacharyya

A number of clustering methods introduced for analysis of gene expression data for extracting potential relationships among the genes are studied and reported in this paper. An effective unsupervised method (TDAC) is proposed for simultaneous detection of outliers and biologically relevant co-expressed patterns. Effectiveness of TDAC is established in comparison to its other competing algorithms over six publicly available benchmark gene expression datasets in terms of both internal and external validity measures. Main attractions of TDAC are: (a) it does not require discretisation, (b) it is capable of identifying biologically relevant gene co-expressed patterns as well as outlier genes(s), (c) it is cost-effective in terms of time and space, (d) it does not require the number of clusters a priori, and (e) it is free from the restrictions of using any proximity measure.

本文研究和报道了一些用于基因表达数据分析的聚类方法,以提取基因之间的潜在关系。提出了一种有效的无监督方法(TDAC),用于同时检测异常值和生物学相关的共表达模式。在内部和外部有效性测量方面,TDAC的有效性是通过与六个公开可用的基准基因表达数据集的其他竞争算法进行比较而建立的。TDAC的主要优点是:(a)它不需要离散化,(b)它能够识别生物学相关的基因共表达模式以及异常基因,(c)在时间和空间方面具有成本效益,(d)它不需要先验集群的数量,(e)它不受使用任何邻近测量的限制。
{"title":"TDAC: co-expressed gene pattern finding using attribute clustering.","authors":"Tahleen A Rahman,&nbsp;Dhruba K Bhattacharyya","doi":"10.1504/IJBRA.2015.067339","DOIUrl":"https://doi.org/10.1504/IJBRA.2015.067339","url":null,"abstract":"<p><p>A number of clustering methods introduced for analysis of gene expression data for extracting potential relationships among the genes are studied and reported in this paper. An effective unsupervised method (TDAC) is proposed for simultaneous detection of outliers and biologically relevant co-expressed patterns. Effectiveness of TDAC is established in comparison to its other competing algorithms over six publicly available benchmark gene expression datasets in terms of both internal and external validity measures. Main attractions of TDAC are: (a) it does not require discretisation, (b) it is capable of identifying biologically relevant gene co-expressed patterns as well as outlier genes(s), (c) it is cost-effective in terms of time and space, (d) it does not require the number of clusters a priori, and (e) it is free from the restrictions of using any proximity measure. </p>","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.067339","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33043360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying protein complexes based on the integration of PPI network and gene expression data. 基于PPI网络和基因表达数据的蛋白质复合物鉴定。
Q4 Health Professions Pub Date : 2015-01-01 DOI: 10.1504/IJBRA.2015.067337
Weijie Chen, Min Li, Xuehong Wu, Jianxin Wang

Identification of protein complexes is crucial to understand principles of cellular organisation and predict protein functions. In this paper, a novel protein complex discovery algorithm IPCIPG is proposed based on the integration of Protein-Protein Interaction network (PPI network) and gene expression data. IPCIPG is a local search algorithm which has two versions: IPCIPG-n for identifying non-overlapping clusters and IPCIPG-o for detecting overlapping clusters. The experimental results on the yeast PPI network show that IPCIPG can identify protein complexes with specific biological meaning more effectively, precisely and comprehensively than six other algorithms: HUNTER, HC-PIN, CMC, SPICi, MOCDE and MCL.

鉴定蛋白质复合物对于理解细胞组织原理和预测蛋白质功能至关重要。本文提出了一种基于蛋白质-蛋白质相互作用网络(protein - protein Interaction network, PPI网络)和基因表达数据集成的蛋白质复合物发现算法IPCIPG。IPCIPG是一种局部搜索算法,有两个版本:IPCIPG-n(用于识别非重叠簇)和IPCIPG-o(用于检测重叠簇)。酵母PPI网络的实验结果表明,与HUNTER、HC-PIN、CMC、SPICi、mode和MCL等6种算法相比,IPCIPG能够更有效、更精确、更全面地识别具有特定生物学意义的蛋白质复合物。
{"title":"Identifying protein complexes based on the integration of PPI network and gene expression data.","authors":"Weijie Chen,&nbsp;Min Li,&nbsp;Xuehong Wu,&nbsp;Jianxin Wang","doi":"10.1504/IJBRA.2015.067337","DOIUrl":"https://doi.org/10.1504/IJBRA.2015.067337","url":null,"abstract":"<p><p>Identification of protein complexes is crucial to understand principles of cellular organisation and predict protein functions. In this paper, a novel protein complex discovery algorithm IPCIPG is proposed based on the integration of Protein-Protein Interaction network (PPI network) and gene expression data. IPCIPG is a local search algorithm which has two versions: IPCIPG-n for identifying non-overlapping clusters and IPCIPG-o for detecting overlapping clusters. The experimental results on the yeast PPI network show that IPCIPG can identify protein complexes with specific biological meaning more effectively, precisely and comprehensively than six other algorithms: HUNTER, HC-PIN, CMC, SPICi, MOCDE and MCL. </p>","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.067337","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33043359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Proteins involved in more domain types tend to be more essential. 参与更多结构域类型的蛋白质往往更重要。
Q4 Health Professions Pub Date : 2015-01-01 DOI: 10.1504/IJBRA.2015.068086
Lu Chen, Yingjiao Cheng, Min Li, Jianxin Wang

Investigation of essential proteins is significantly valuable for understanding of cellular life, drug design and other practical purposes. In most of current studies, essential proteins are generally mined in protein-protein interaction (PPI) networks with diverse topology features. In this study, we investigate what kind of proteins is inclined to be essential from a new perspective. The investigation implies that protein essentiality is correlated with protein domains, which are functional, structural and evolutionary units of proteins. Proteins with a larger Number of Domain Types (NDT) tend to be essential. The analyses on 22 species show that essential proteins identified by NDT are much more than those identified by ten random identifications. The consideration of the structural feature makes us less dependent on network data and thus enables us to investigate protein essentiality of more species with incomplete and/or inconsistent network data.

研究必需蛋白质对于理解细胞生命、药物设计和其他实际目的具有重要价值。在目前的大多数研究中,必需蛋白质通常是在具有不同拓扑特征的蛋白质-蛋白质相互作用(PPI)网络中挖掘的。在这项研究中,我们从一个新的角度研究了什么样的蛋白质是必不可少的。研究表明,蛋白质的重要性与蛋白质结构域有关,蛋白质结构域是蛋白质的功能、结构和进化单位。具有较多结构域类型(NDT)的蛋白质往往是必需的。对22个物种的分析表明,通过无损检测鉴定出的必需蛋白要比随机鉴定出的多得多。结构特征的考虑使我们减少了对网络数据的依赖,从而使我们能够在网络数据不完整和/或不一致的情况下研究更多物种的蛋白质必要性。
{"title":"Proteins involved in more domain types tend to be more essential.","authors":"Lu Chen,&nbsp;Yingjiao Cheng,&nbsp;Min Li,&nbsp;Jianxin Wang","doi":"10.1504/IJBRA.2015.068086","DOIUrl":"https://doi.org/10.1504/IJBRA.2015.068086","url":null,"abstract":"<p><p>Investigation of essential proteins is significantly valuable for understanding of cellular life, drug design and other practical purposes. In most of current studies, essential proteins are generally mined in protein-protein interaction (PPI) networks with diverse topology features. In this study, we investigate what kind of proteins is inclined to be essential from a new perspective. The investigation implies that protein essentiality is correlated with protein domains, which are functional, structural and evolutionary units of proteins. Proteins with a larger Number of Domain Types (NDT) tend to be essential. The analyses on 22 species show that essential proteins identified by NDT are much more than those identified by ten random identifications. The consideration of the structural feature makes us less dependent on network data and thus enables us to investigate protein essentiality of more species with incomplete and/or inconsistent network data. </p>","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.068086","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33143603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
International Journal of Bioinformatics Research and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1