首页 > 最新文献

2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)最新文献

英文 中文
A parametric Bayesian method to test the association of rare variants 一个参数贝叶斯方法来检验罕见变异的关联
Yufeng Shen, Y. Cheung, Shuang Wang, I. Pe’er
Testing statistical association of individual rare variants is underpowered due to low frequency. A common approach is to test the aggregated effects of individual variants in a locus such as genes. Current methods have distinct power profiles that are determined by underlying assumptions about the genetic model and effect size. Here we describe a parametric Bayesian approach to detect the association of rare variants. We express the assumptions about effect size by setting the prior distribution in the model, which can be adjusted based on the experimental design. This flexibility allows our method to achieve optimal power. The algorithmic contribution includes a dynamic program for efficient calculation of the association test statistic. We tested the method in simulated data, and demonstrated that it is better powered to detect rare variant association under various scenarios.
由于频率低,个别罕见变异的统计关联检测能力不足。一种常见的方法是测试基因等位点中个体变异的总体效应。目前的方法有不同的功率分布,这是由关于遗传模型和效应大小的基本假设决定的。在这里,我们描述了一种参数贝叶斯方法来检测罕见变异的关联。我们通过设置模型中的先验分布来表达对效应大小的假设,该假设可以根据实验设计进行调整。这种灵活性使我们的方法能够获得最佳功率。该算法的贡献包括一个动态程序,用于有效地计算关联检验统计量。我们在模拟数据中对该方法进行了测试,并证明该方法在各种情况下都能更好地检测罕见变异关联。
{"title":"A parametric Bayesian method to test the association of rare variants","authors":"Yufeng Shen, Y. Cheung, Shuang Wang, I. Pe’er","doi":"10.1109/BIBMW.2011.6112366","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112366","url":null,"abstract":"Testing statistical association of individual rare variants is underpowered due to low frequency. A common approach is to test the aggregated effects of individual variants in a locus such as genes. Current methods have distinct power profiles that are determined by underlying assumptions about the genetic model and effect size. Here we describe a parametric Bayesian approach to detect the association of rare variants. We express the assumptions about effect size by setting the prior distribution in the model, which can be adjusted based on the experimental design. This flexibility allows our method to achieve optimal power. The algorithmic contribution includes a dynamic program for efficient calculation of the association test statistic. We tested the method in simulated data, and demonstrated that it is better powered to detect rare variant association under various scenarios.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"81 1","pages":"137-143"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83402315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Liver targeting effect of vinegar-baked Radix Bupleuri on oxymatrine in mice 醋烤柴胡对小鼠氧化苦参碱的肝脏靶向作用
Ruizhi Zhao, You-jun Chen, Jian Cai
In traditional Chinese medicine co-administration drugs with vinegar-baked Radix Bupleuri (VBRB) is usually used to increase the therapeutic effect in liver disease. However, the scientific data for this effect are not available. In this paper, effect of VBRB on the distribution of oxymatrine was studied. Mice were divided into four groups by random, oxymatrine control and oxymatrine co-administered with three different doses of VBRB. Concentrations of oxymatrine and its metabolite matrine in different tissues were determined by HPLC-MS. Target effencicy was evalutaed by AUC, Cmax, and Relative targeting efficiency (RTE). The results showed that compared to the control group, VBRB significantly increased the distribution of both oxymatrine and matrine in liver and meanwhile decreased their distribution in other tissues, indicating a strong liver targeting enhancing effect. The results of this paper implied that co-administration with VBRB may be a simple and efficiencient method for liver targeting therapy.
中医常用醋炙柴胡配药来提高肝病的治疗效果。然而,这种影响的科学数据是不可用的。本文研究了VBRB对氧化苦参碱分布的影响。小鼠随机分为四组,氧化苦参碱对照和氧化苦参碱与三种不同剂量的VBRB共给药。采用高效液相色谱-质谱法测定氧化苦参碱及其代谢物苦参碱在不同组织中的浓度。用AUC、Cmax和相对靶向效率(RTE)评价靶效。结果显示,与对照组相比,VBRB显著增加了氧化苦参碱和苦参碱在肝脏中的分布,同时降低了它们在其他组织中的分布,显示出较强的肝脏靶向增强作用。本研究结果提示,与VBRB合用可能是一种简单有效的肝靶向治疗方法。
{"title":"Liver targeting effect of vinegar-baked Radix Bupleuri on oxymatrine in mice","authors":"Ruizhi Zhao, You-jun Chen, Jian Cai","doi":"10.1109/BIBMW.2011.6112462","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112462","url":null,"abstract":"In traditional Chinese medicine co-administration drugs with vinegar-baked Radix Bupleuri (VBRB) is usually used to increase the therapeutic effect in liver disease. However, the scientific data for this effect are not available. In this paper, effect of VBRB on the distribution of oxymatrine was studied. Mice were divided into four groups by random, oxymatrine control and oxymatrine co-administered with three different doses of VBRB. Concentrations of oxymatrine and its metabolite matrine in different tissues were determined by HPLC-MS. Target effencicy was evalutaed by AUC, Cmax, and Relative targeting efficiency (RTE). The results showed that compared to the control group, VBRB significantly increased the distribution of both oxymatrine and matrine in liver and meanwhile decreased their distribution in other tissues, indicating a strong liver targeting enhancing effect. The results of this paper implied that co-administration with VBRB may be a simple and efficiencient method for liver targeting therapy.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"31 1","pages":"740-745"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83456520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Employing Machine Learning Techniques for Data Enrichment: Increasing the Number of Samples for Effective Gene Expression Data Analysis 利用机器学习技术进行数据丰富:增加有效基因表达数据分析的样本数量
U. Erdogdu, Mehmet Tan, R. Alhajj, Faruk Polat, D. Demetrick, J. Rokne
For certain domains, e.g. bioinformatics, producing more real samples is costly, error prone and time consuming. Therefore, there is a need for an intelligent automated process capable of substituting the real samples by artificial samples that carry the same characteristics as the real samples and hence could be used for running comprehensive testing of new methodologies. Motivated by this need, we describe a novel approach that integrates Probabilistic Boolean Network and genetic algorithm based techniques into a framework that uses some existing real samples as input and successfully produces new samples as output. The new samples will inspire the characteristics of the existing samples without duplicating them. This leads to diversity in the samples and hence a more rich set of samples to be used in testing. The developed framework incorporates two models (perspectives) for sample generation. We illustrate its applicability for producing new gene expression data samples, a high demanding area that has not received attention. The two perspectives employed in the process are based on models that are not closely related, the independence eliminates the bias of having the produced approach covering only certain characteristics of the domain and leading to samples skewed towards one direction. The produced results are very promising in showing the effectiveness, usefulness and applicability of the proposed multi-model framework.
对于某些领域,例如生物信息学,生产更多的真实样本是昂贵的,容易出错和耗时的。因此,需要一种智能的自动化过程,能够用与真实样本具有相同特征的人工样本代替真实样本,从而可用于运行新方法的综合测试。在这种需求的激励下,我们描述了一种新的方法,该方法将基于概率布尔网络和遗传算法的技术集成到一个框架中,该框架使用一些现有的真实样本作为输入,并成功地产生新的样本作为输出。新样本将激发现有样本的特征,而不会复制它们。这导致了样本的多样性,因此在测试中使用了更丰富的样本集。开发的框架结合了两个模型(透视图)来生成样本。我们说明了它的适用性,以产生新的基因表达数据样本,一个高要求的领域,尚未得到重视。该过程中采用的两个视角是基于不密切相关的模型,独立性消除了产生的方法仅覆盖领域的某些特征并导致样本向一个方向倾斜的偏见。所得结果显示了所提出的多模型框架的有效性、实用性和适用性。
{"title":"Employing Machine Learning Techniques for Data Enrichment: Increasing the Number of Samples for Effective Gene Expression Data Analysis","authors":"U. Erdogdu, Mehmet Tan, R. Alhajj, Faruk Polat, D. Demetrick, J. Rokne","doi":"10.1109/BIBM.2011.105","DOIUrl":"https://doi.org/10.1109/BIBM.2011.105","url":null,"abstract":"For certain domains, e.g. bioinformatics, producing more real samples is costly, error prone and time consuming. Therefore, there is a need for an intelligent automated process capable of substituting the real samples by artificial samples that carry the same characteristics as the real samples and hence could be used for running comprehensive testing of new methodologies. Motivated by this need, we describe a novel approach that integrates Probabilistic Boolean Network and genetic algorithm based techniques into a framework that uses some existing real samples as input and successfully produces new samples as output. The new samples will inspire the characteristics of the existing samples without duplicating them. This leads to diversity in the samples and hence a more rich set of samples to be used in testing. The developed framework incorporates two models (perspectives) for sample generation. We illustrate its applicability for producing new gene expression data samples, a high demanding area that has not received attention. The two perspectives employed in the process are based on models that are not closely related, the independence eliminates the bias of having the produced approach covering only certain characteristics of the domain and leading to samples skewed towards one direction. The produced results are very promising in showing the effectiveness, usefulness and applicability of the proposed multi-model framework.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"9 1","pages":"238-242"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88615678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Mining fetal magnetocardiogram data for high-risk fetuses 挖掘胎儿心脏磁图数据的高危胎儿
D. Snider, Xiaowei Xu
The fetal magnetocardiogram (fMCG) contains a wealth of information regarding the health of a fetus. The purpose of this study is to classify fMCG data into the following two groups: high-risk and normal. In this presentation the authors first describe how the feature vector containing both time and frequency domain attributes is built from the time-series fMCG data. Second, the classification process using support vector machine (SVM) tools to identify the high-risk fetuses is described. Experimental results from 272 data sets taken from 118 fetuses demonstrate the SVM classifier's ability to distinguish between the high-risk and normal fetuses. Artificial neural networks and decision trees are used to validate the SVM results and receiver operating characteristic curve analysis and blind tests are employed to show the strength of the model. The model currently attains a sensitivity of 0.67 and a specificity of 0.65. While this study remains a work in progress, the authors are refining the process to improve the aforementioned results.
胎儿心脏磁图(fMCG)包含关于胎儿健康的丰富信息。本研究的目的是将快速消费品数据分为以下两组:高风险组和正常组。在本报告中,作者首先描述了如何从时间序列fMCG数据中构建包含时域和频域属性的特征向量。其次,描述了使用支持向量机(SVM)工具识别高危胎儿的分类过程。来自118个胎儿的272个数据集的实验结果证明了SVM分类器区分高危胎儿和正常胎儿的能力。采用人工神经网络和决策树对SVM结果进行验证,并采用接收者工作特征曲线分析和盲测试来验证模型的强度。该模型目前的灵敏度为0.67,特异性为0.65。虽然这项研究仍在进行中,但作者正在改进这一过程,以改善上述结果。
{"title":"Mining fetal magnetocardiogram data for high-risk fetuses","authors":"D. Snider, Xiaowei Xu","doi":"10.1109/BIBMW.2011.6112563","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112563","url":null,"abstract":"The fetal magnetocardiogram (fMCG) contains a wealth of information regarding the health of a fetus. The purpose of this study is to classify fMCG data into the following two groups: high-risk and normal. In this presentation the authors first describe how the feature vector containing both time and frequency domain attributes is built from the time-series fMCG data. Second, the classification process using support vector machine (SVM) tools to identify the high-risk fetuses is described. Experimental results from 272 data sets taken from 118 fetuses demonstrate the SVM classifier's ability to distinguish between the high-risk and normal fetuses. Artificial neural networks and decision trees are used to validate the SVM results and receiver operating characteristic curve analysis and blind tests are employed to show the strength of the model. The model currently attains a sensitivity of 0.67 and a specificity of 0.65. While this study remains a work in progress, the authors are refining the process to improve the aforementioned results.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"9 1","pages":"1066-1068"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84797373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fracture detection and quantitative measure of displacement in pelvic CT images 骨盆CT图像的骨折检测与位移定量测量
Jie Wu, Pavani Davuluri, Ashwin Belle, Charles Cockrell, Yang Tang, Kevin Ward, R. Hobson, K. Najarian
Traumatic pelvic injury is a severe and common injury in the United States. The automatic detection of fractures in pelvic CT images is a significant contribution for assisting physicians in making faster and more accurate patient diagnostic decisions and treatment planning. However, due to the low resolution and quality of the original images, the complexity of pelvic structures, and the difference in visual characteristics of fracture by their location, it is difficult to detect and accurately locate the pelvic fractures and determine the severity of the injury. In this paper, an automatic hierarchical algorithm for detecting pelvic bone fractures in CT scans is proposed. The algorithm utilizes symmetric comparison, adaptive windowing, boundary tracing, wavelet transform. Also, the quantitative measure of fracture severity in pelvic CT scans is defined. The results are promising, demonstrating that the proposed method is capable of automatically detecting both major and minor fractures accurately, shows potential for clinical application. Statistical results also indicate the superiority of the proposed method.
在美国,创伤性骨盆损伤是一种严重而常见的损伤。骨盆CT图像中骨折的自动检测对于帮助医生做出更快、更准确的患者诊断决策和治疗计划有着重要的贡献。然而,由于原始图像的分辨率和质量较低,骨盆结构的复杂性,以及骨折的视觉特征因其位置的不同而存在差异,因此很难检测和准确定位骨盆骨折并确定损伤的严重程度。本文提出了一种基于CT扫描的骨盆骨折自动分层检测算法。该算法利用对称比较、自适应加窗、边界跟踪和小波变换。此外,还定义了骨盆CT扫描中骨折严重程度的定量测量。结果表明,该方法能够准确地自动检测出大骨折和小骨折,具有临床应用潜力。统计结果也表明了该方法的优越性。
{"title":"Fracture detection and quantitative measure of displacement in pelvic CT images","authors":"Jie Wu, Pavani Davuluri, Ashwin Belle, Charles Cockrell, Yang Tang, Kevin Ward, R. Hobson, K. Najarian","doi":"10.1109/BIBMW.2011.6112437","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112437","url":null,"abstract":"Traumatic pelvic injury is a severe and common injury in the United States. The automatic detection of fractures in pelvic CT images is a significant contribution for assisting physicians in making faster and more accurate patient diagnostic decisions and treatment planning. However, due to the low resolution and quality of the original images, the complexity of pelvic structures, and the difference in visual characteristics of fracture by their location, it is difficult to detect and accurately locate the pelvic fractures and determine the severity of the injury. In this paper, an automatic hierarchical algorithm for detecting pelvic bone fractures in CT scans is proposed. The algorithm utilizes symmetric comparison, adaptive windowing, boundary tracing, wavelet transform. Also, the quantitative measure of fracture severity in pelvic CT scans is defined. The results are promising, demonstrating that the proposed method is capable of automatically detecting both major and minor fractures accurately, shows potential for clinical application. Statistical results also indicate the superiority of the proposed method.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"14 1","pages":"600-606"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86702180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Balance-bagging-PRFS algorithm for feature optimization on insomnia data intervened by traditional Chinese Medicine 中药干预失眠数据特征优化的Balance-bagging-PRFS算法
Xiao-bo Yang, Shixing Yan, Zheng-yang Zhou, Guozheng Li, Yan Li, Xin-feng Guo
Goal: Traditional Chinese Medicine (TCM) focuses on individual diagnosis. Besides the analysis methods on group level, clinical experimental data could also be researched with Information Technology to optimize the feature for individual healing effect; Method: we propose and apply a new method of feature optimization — Balance-Bagging-PRFS — to optimize the feature of insomnia intervened by TCM, aiming at solving problems typically in TCM data, such as mixing of discrete and continuous features and data imbalance; Result: from the view of all data, it is found that different levels of "ISI baseline score" and "Insomnia severity" have important influence on the curative effect. In treat group, different values of "environment" and "social field baseline" make remarkable difference on curative effect; while in control group, in which patients are treated with the placebo, "social field baseline", "survival quality baseline", and "classification of constitution" make sense; Conclusion: the method of Balance-Bagging-PRFS achieves good results in feature optimization for data from insomnia interfered by TCM, and it provides a basis for TCM individual diagnosis and for further optimization of symptom.
目标:中医注重个体诊断。除了群体层面的分析方法外,还可以利用信息技术对临床实验数据进行研究,优化个体愈合效果的特征;方法:针对中医数据中离散特征与连续特征混合、数据不平衡等问题,提出并应用一种新的特征优化方法- Balance-Bagging-PRFS对中医干预失眠特征进行优化;结果:从所有数据来看,发现不同水平的“ISI基线评分”和“失眠严重程度”对疗效有重要影响。治疗组不同“环境”值和“社会场基线”值对疗效有显著差异;对照组采用安慰剂治疗,“社会场基线”、“生存质量基线”、“体质分类”有意义;结论:Balance-Bagging-PRFS方法对中医干扰失眠数据的特征优化效果较好,可为中医个体化诊断及进一步优化症状提供依据。
{"title":"Balance-bagging-PRFS algorithm for feature optimization on insomnia data intervened by traditional Chinese Medicine","authors":"Xiao-bo Yang, Shixing Yan, Zheng-yang Zhou, Guozheng Li, Yan Li, Xin-feng Guo","doi":"10.1109/BIBMW.2011.6112485","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112485","url":null,"abstract":"Goal: Traditional Chinese Medicine (TCM) focuses on individual diagnosis. Besides the analysis methods on group level, clinical experimental data could also be researched with Information Technology to optimize the feature for individual healing effect; Method: we propose and apply a new method of feature optimization — Balance-Bagging-PRFS — to optimize the feature of insomnia intervened by TCM, aiming at solving problems typically in TCM data, such as mixing of discrete and continuous features and data imbalance; Result: from the view of all data, it is found that different levels of \"ISI baseline score\" and \"Insomnia severity\" have important influence on the curative effect. In treat group, different values of \"environment\" and \"social field baseline\" make remarkable difference on curative effect; while in control group, in which patients are treated with the placebo, \"social field baseline\", \"survival quality baseline\", and \"classification of constitution\" make sense; Conclusion: the method of Balance-Bagging-PRFS achieves good results in feature optimization for data from insomnia interfered by TCM, and it provides a basis for TCM individual diagnosis and for further optimization of symptom.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"28 1","pages":"854-857"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82017161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A New Measurement for Evaluating Clusters in Protein Interaction Networks 一种评价蛋白质相互作用网络簇的新方法
Min Li, Xuehong Wu, Jianxin Wang, Yi Pan
Clustering of protein-protein interaction networks is one of the most prevalent methods for identifying protein complexes and functional modules, which is crucial to understanding the principles of cellular organization and prediction of protein functions. In the past few years, many computational methods have been proposed. However, it is always a challenging task to evaluate how well the clusters are identified. Even for the most popular measurements, F-measure and Pvalue, bias exists for evaluating the identified clusters. In this paper, we propose a new measurement, named hF-measure, to evaluate clusters more finely and distinctly. First, we defined the hierarchical consistency and the hierarchical similarity. Then, we propose a new hierarchical measurement of hF-measure by taking into account the hierarchical organization of functional annotations and the functional similarities among proteins. The new measurement hF-measure can discriminate between different types of errors which cannot be distinguished by F-measure. The experimental results based on Gene Ontology (GO) and yeast functional modules show that hF-measure evaluates clusters more accurately when compared to F-measure.
蛋白质-蛋白质相互作用网络聚类是鉴定蛋白质复合物和功能模块的最常用方法之一,对于理解细胞组织原理和预测蛋白质功能至关重要。在过去的几年中,已经提出了许多计算方法。然而,评估如何很好地识别集群总是一项具有挑战性的任务。即使对于最流行的测量方法,f值和p值,在评估已识别的集群时也存在偏差。在本文中,我们提出了一种新的度量,称为高频度量,以更精细、更清晰地评价聚类。首先定义了层次一致性和层次相似性。然后,我们提出了一种考虑功能注释的层次组织和蛋白质之间的功能相似性的层次化度量方法。新的测量方法hF-measure可以区分F-measure不能区分的不同类型的误差。基于基因本体(GO)和酵母功能模块的实验结果表明,与F-measure相比,hF-measure对聚类的评估更准确。
{"title":"A New Measurement for Evaluating Clusters in Protein Interaction Networks","authors":"Min Li, Xuehong Wu, Jianxin Wang, Yi Pan","doi":"10.1109/BIBM.2011.47","DOIUrl":"https://doi.org/10.1109/BIBM.2011.47","url":null,"abstract":"Clustering of protein-protein interaction networks is one of the most prevalent methods for identifying protein complexes and functional modules, which is crucial to understanding the principles of cellular organization and prediction of protein functions. In the past few years, many computational methods have been proposed. However, it is always a challenging task to evaluate how well the clusters are identified. Even for the most popular measurements, F-measure and Pvalue, bias exists for evaluating the identified clusters. In this paper, we propose a new measurement, named hF-measure, to evaluate clusters more finely and distinctly. First, we defined the hierarchical consistency and the hierarchical similarity. Then, we propose a new hierarchical measurement of hF-measure by taking into account the hierarchical organization of functional annotations and the functional similarities among proteins. The new measurement hF-measure can discriminate between different types of errors which cannot be distinguished by F-measure. The experimental results based on Gene Ontology (GO) and yeast functional modules show that hF-measure evaluates clusters more accurately when compared to F-measure.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"8 1","pages":"63-68"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83621992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
iQuant: A fast yet accurate GUI tool for transcript quantification iQuant:一个快速而准确的GUI工具,用于转录量化
Tin Chi Nguyen, N. Deng, Guorong Xu, Z. Duan, D. Zhu
Transcript quantification using RNA-seq is central to contemporary and future transcriptomics research. The existing tools are useful but have much room for improvement. We present a new statistical model, a fast yet accurate transcript quantification algorithm. Our tool takes RNA-seq reads in fasta or fastq format as input and output transcript abundance through a few mouse clicks. Our method compares favorably with the existing GUI tools in terms of both time complexity and accuracy. Availability: Both simulation data used for method comparisons and the GUI tool are freely available at http://asammate.sourceforge.net/.
使用RNA-seq进行转录量化是当代和未来转录组学研究的核心。现有的工具很有用,但还有很大的改进空间。我们提出了一个新的统计模型,一个快速而准确的转录量化算法。我们的工具将fasta或fastq格式的RNA-seq读取作为输入和输出转录本丰度,只需点击几下鼠标。我们的方法在时间复杂度和准确性方面都优于现有的GUI工具。可用性:用于方法比较的模拟数据和GUI工具都可以在http://asammate.sourceforge.net/上免费获得。
{"title":"iQuant: A fast yet accurate GUI tool for transcript quantification","authors":"Tin Chi Nguyen, N. Deng, Guorong Xu, Z. Duan, D. Zhu","doi":"10.1109/BIBMW.2011.6112556","DOIUrl":"https://doi.org/10.1109/BIBMW.2011.6112556","url":null,"abstract":"Transcript quantification using RNA-seq is central to contemporary and future transcriptomics research. The existing tools are useful but have much room for improvement. We present a new statistical model, a fast yet accurate transcript quantification algorithm. Our tool takes RNA-seq reads in fasta or fastq format as input and output transcript abundance through a few mouse clicks. Our method compares favorably with the existing GUI tools in terms of both time complexity and accuracy. Availability: Both simulation data used for method comparisons and the GUI tool are freely available at http://asammate.sourceforge.net/.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"119 1","pages":"1048-1050"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91121161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Efficient and Fast Analysis for Detecting High Order Gene-by-Gene Interactions in a Genome-Wide Association Study 在全基因组关联研究中检测高阶基因间相互作用的高效快速分析
Sohee Oh, Jaehoon Lee, Min-Seok Kwon, Kyunga Kim, T. Park
Most common complex traits are affected by multiple genes and/or environmental factors. To understand genetic architecture of complex traits, the investigation of gene-gene and gene-environment interactions can be essential. However, conducting gene-gene interaction using genome-wide data requires exploring a huge search space and suffers from a computation burden due to high dimensionality of genetic data. To identify gene-gene interaction more efficiently, we propose a gene-based reduction method which first summarizes the gene effect by combining multiple single nucleotide polymorphism (SNP) and then performs the gene-gene interaction via the summarized gene effect. By reducing the search space from SNPs to gene, our gene-based method becomes efficient and fast for identifying gene-gene interaction in genome wide association studies. The gene-based reduction method is illustrated by hypertension data from a Korean population.
大多数常见的复杂性状受到多种基因和/或环境因素的影响。为了理解复杂性状的遗传结构,基因-基因和基因-环境相互作用的研究是必不可少的。然而,利用全基因组数据进行基因-基因相互作用需要探索巨大的搜索空间,并且由于遗传数据的高维,计算负担很大。为了更有效地识别基因-基因相互作用,我们提出了一种基于基因的还原方法,该方法首先通过组合多个单核苷酸多态性(SNP)来总结基因效应,然后通过总结的基因效应进行基因-基因相互作用。通过减少从SNPs到基因的搜索空间,我们的基于基因的方法在全基因组关联研究中能够高效、快速地识别基因-基因相互作用。以基因为基础的减少方法由韩国人群的高血压数据说明。
{"title":"Efficient and Fast Analysis for Detecting High Order Gene-by-Gene Interactions in a Genome-Wide Association Study","authors":"Sohee Oh, Jaehoon Lee, Min-Seok Kwon, Kyunga Kim, T. Park","doi":"10.1109/BIBM.2011.103","DOIUrl":"https://doi.org/10.1109/BIBM.2011.103","url":null,"abstract":"Most common complex traits are affected by multiple genes and/or environmental factors. To understand genetic architecture of complex traits, the investigation of gene-gene and gene-environment interactions can be essential. However, conducting gene-gene interaction using genome-wide data requires exploring a huge search space and suffers from a computation burden due to high dimensionality of genetic data. To identify gene-gene interaction more efficiently, we propose a gene-based reduction method which first summarizes the gene effect by combining multiple single nucleotide polymorphism (SNP) and then performs the gene-gene interaction via the summarized gene effect. By reducing the search space from SNPs to gene, our gene-based method becomes efficient and fast for identifying gene-gene interaction in genome wide association studies. The gene-based reduction method is illustrated by hypertension data from a Korean population.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"49 1","pages":"83-88"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76874598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Computational Pipeline for LC-MS/MS Based Metabolite Identification 基于LC-MS/MS的代谢物鉴定计算管道
Bin Zhou, J. Xiao, H. Ressom
Metabolite identification is the major bottle-neck in LC-MS based metabolomic investigations. The mass-based search approach often leaves a large fraction of metabolites with either no identification or multiple putative identifications. As manual verification of metabolites is laborious, computational approaches are needed to obtain more reliable putative identifications and prioritize them. In this paper, a computational pipeline is proposed to assist metabolite identification with improved coverage and prioritization capability. The pipeline is based on multiple pieces of publicly-available software and databases. The proposed pipeline is successfully applied in an LC-MS/MS-based metabolomic study, where mass, retention time, and MS/MS spectrum were used to improve the accuracy of metabolite identification and to prioritize putative identifications for subsequent metabolite verification.
代谢物鉴定是基于LC-MS的代谢组学研究的主要瓶颈。基于质量的搜索方法通常会留下很大一部分代谢物,要么没有鉴定,要么有多个假定的鉴定。由于代谢物的人工验证是费力的,需要计算方法来获得更可靠的推定鉴定并对它们进行优先排序。在本文中,提出了一个计算管道,以帮助代谢物鉴定提高覆盖率和优先级能力。该管道基于多个公开可用的软件和数据库。提议的管道已成功应用于基于LC-MS/MS的代谢组学研究,其中使用质量,保留时间和MS/MS谱来提高代谢物鉴定的准确性,并优先考虑后续代谢物验证的推定鉴定。
{"title":"A Computational Pipeline for LC-MS/MS Based Metabolite Identification","authors":"Bin Zhou, J. Xiao, H. Ressom","doi":"10.1109/BIBM.2011.89","DOIUrl":"https://doi.org/10.1109/BIBM.2011.89","url":null,"abstract":"Metabolite identification is the major bottle-neck in LC-MS based metabolomic investigations. The mass-based search approach often leaves a large fraction of metabolites with either no identification or multiple putative identifications. As manual verification of metabolites is laborious, computational approaches are needed to obtain more reliable putative identifications and prioritize them. In this paper, a computational pipeline is proposed to assist metabolite identification with improved coverage and prioritization capability. The pipeline is based on multiple pieces of publicly-available software and databases. The proposed pipeline is successfully applied in an LC-MS/MS-based metabolomic study, where mass, retention time, and MS/MS spectrum were used to improve the accuracy of metabolite identification and to prioritize putative identifications for subsequent metabolite verification.","PeriodicalId":6345,"journal":{"name":"2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)","volume":"70 1","pages":"247-251"},"PeriodicalIF":0.0,"publicationDate":"2011-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77009291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2011 IEEE International Conference on Bioinformatics and Biomedicine Workshops (BIBMW)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1