首页 > 最新文献

2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops最新文献

英文 中文
Integration of multiple annotators by aggregating experts and filtering novices 通过聚合专家和过滤新手来集成多个注释器
Pub Date : 2012-10-04 DOI: 10.1109/BIBM.2012.6392657
Ping Zhang, Z. Obradovic
Learning from noisy labels obtained from multiple annotators and without access to any true labels is an increasingly important problem in bioinformatics and biomedicine. In our method, this challenge is addressed by iteratively filtering low-quality annotators and estimating the consensus labels based only on the remaining experts that provide higher-quality annotations. Experiments on biomedical text classification and CASP9 protein disorder prediction tasks provide evidence that the proposed algorithm is more accurate than the majority voting and previously developed multi-annotator approaches. The benefit of using the new method is particularly large when low-quality annotators dominate. Moreover, the new algorithm also suggests the most relevant annotators for each instance, thus paving the way for understanding the behaviors of each annotator and building more reliable predictive models for bioinformatics applications.
在生物信息学和生物医学中,从来自多个注释者的噪声标签中学习,而无法获得任何真实标签是一个日益重要的问题。在我们的方法中,这一挑战是通过迭代过滤低质量注释器和仅基于提供高质量注释的剩余专家估计共识标签来解决的。生物医学文本分类和CASP9蛋白紊乱预测任务的实验证明,该算法比多数投票和先前开发的多注释器方法更准确。当低质量注释器占主导地位时,使用新方法的好处尤其大。此外,新算法还为每个实例提供了最相关的注释者,从而为理解每个注释者的行为和为生物信息学应用建立更可靠的预测模型铺平了道路。
{"title":"Integration of multiple annotators by aggregating experts and filtering novices","authors":"Ping Zhang, Z. Obradovic","doi":"10.1109/BIBM.2012.6392657","DOIUrl":"https://doi.org/10.1109/BIBM.2012.6392657","url":null,"abstract":"Learning from noisy labels obtained from multiple annotators and without access to any true labels is an increasingly important problem in bioinformatics and biomedicine. In our method, this challenge is addressed by iteratively filtering low-quality annotators and estimating the consensus labels based only on the remaining experts that provide higher-quality annotations. Experiments on biomedical text classification and CASP9 protein disorder prediction tasks provide evidence that the proposed algorithm is more accurate than the majority voting and previously developed multi-annotator approaches. The benefit of using the new method is particularly large when low-quality annotators dominate. Moreover, the new algorithm also suggests the most relevant annotators for each instance, thus paving the way for understanding the behaviors of each annotator and building more reliable predictive models for bioinformatics applications.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79187964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A comparison between hierarchical clustering and community detection method in the collection of gene targets for molecular identification of pathogenic fungi 层次聚类法与群落检测法在病原真菌分子鉴定基因靶点采集中的比较
Pub Date : 2012-10-04 DOI: 10.1109/BIBMW.2012.6470234
I. Thapa, S. Bhowmick, D. Bastola
Ribosomal RNA sequence is a popular primary molecular target in the diagnosis of many fungal and bacterial infections. More recently a number of other molecular targets like `cytochrome b', `rpoB', `actin' is available in public databases such as GenBank. These sequences could be better alternatives to the popular ribosomal RNA as molecular targets. However, existing computational approaches do not provide a convenient method to collect and make these sequences available for the development of new alternative sequence-based diagnostics that are critical for early detection of infectious agents like fungi. The long-term goal of this study is to develop a computational tool for the rapid identification of infectious agents in biological sample. In the present study, we focus on pre-processing of sequence data in public database and compare a number of clustering approaches to classify currently available DNA sequences into different target genes. We evaluate the correctness of these methods based on the target classification of seven different species of Zygomycetes. Use of a clustering comparison metric has shown that community detection and hierarchical clustering methods are on par with high accuracy.
核糖体RNA序列是诊断许多真菌和细菌感染的主要分子靶点。最近,诸如“细胞色素b”、“rpoB”、“肌动蛋白”等其他一些分子靶标也可以在GenBank等公共数据库中找到。这些序列可以更好地替代常用的核糖体RNA作为分子靶标。然而,现有的计算方法并没有提供一种方便的方法来收集这些序列,并使这些序列可用于开发新的基于序列的诊断方法,这些诊断方法对于真菌等感染性病原体的早期检测至关重要。本研究的长期目标是开发一种快速识别生物样本中感染因子的计算工具。在本研究中,我们重点研究了公共数据库中序列数据的预处理,并比较了几种聚类方法,将现有的DNA序列分类为不同的目标基因。我们以7种接合菌的目标分类为基础,对这些方法的正确性进行了评价。聚类比较度量的使用表明,社区检测和分层聚类方法具有相同的高准确性。
{"title":"A comparison between hierarchical clustering and community detection method in the collection of gene targets for molecular identification of pathogenic fungi","authors":"I. Thapa, S. Bhowmick, D. Bastola","doi":"10.1109/BIBMW.2012.6470234","DOIUrl":"https://doi.org/10.1109/BIBMW.2012.6470234","url":null,"abstract":"Ribosomal RNA sequence is a popular primary molecular target in the diagnosis of many fungal and bacterial infections. More recently a number of other molecular targets like `cytochrome b', `rpoB', `actin' is available in public databases such as GenBank. These sequences could be better alternatives to the popular ribosomal RNA as molecular targets. However, existing computational approaches do not provide a convenient method to collect and make these sequences available for the development of new alternative sequence-based diagnostics that are critical for early detection of infectious agents like fungi. The long-term goal of this study is to develop a computational tool for the rapid identification of infectious agents in biological sample. In the present study, we focus on pre-processing of sequence data in public database and compare a number of clustering approaches to classify currently available DNA sequences into different target genes. We evaluate the correctness of these methods based on the target classification of seven different species of Zygomycetes. Use of a clustering comparison metric has shown that community detection and hierarchical clustering methods are on par with high accuracy.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77703680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Monotonicity functional for a transient mathematical model of oxygen depletion during Photodynamic therapy 光动力治疗过程中氧耗瞬态数学模型的单调泛函
Pub Date : 2012-10-04 DOI: 10.1109/BIBMW.2012.6470263
I. Gkigkitzis
Accurate and computationally inexpensive energy density functional are highly desirable in the simulation of biochemical systems. A molecular "energy" integral functional for the reaction diffusion equation of the triplet oxygen [3O2] in the pseudo state equilibrium during treatment with Photodynamic therapy (PDT) is defined, and its monotonicity is analyzed. When the functional is evaluated on the solution of an existing mathematical model of a spheroid that represents the real physical system of a cell during PDT treatment, it gives a time dependent monotonically decreasing expression energy.
在生化系统的模拟中,精确且计算成本低廉的能量密度泛函是非常需要的。定义了三态氧[3O2]在光动力治疗(PDT)伪态平衡下的反应扩散方程的分子“能量”积分泛函,并分析了其单调性。在PDT处理过程中,当函数对代表细胞真实物理系统的球体的现有数学模型的解进行评估时,它给出了一个随时间单调递减的表达能量。
{"title":"Monotonicity functional for a transient mathematical model of oxygen depletion during Photodynamic therapy","authors":"I. Gkigkitzis","doi":"10.1109/BIBMW.2012.6470263","DOIUrl":"https://doi.org/10.1109/BIBMW.2012.6470263","url":null,"abstract":"Accurate and computationally inexpensive energy density functional are highly desirable in the simulation of biochemical systems. A molecular \"energy\" integral functional for the reaction diffusion equation of the triplet oxygen [3O2] in the pseudo state equilibrium during treatment with Photodynamic therapy (PDT) is defined, and its monotonicity is analyzed. When the functional is evaluated on the solution of an existing mathematical model of a spheroid that represents the real physical system of a cell during PDT treatment, it gives a time dependent monotonically decreasing expression energy.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76396222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using expression data to help pathway curation 利用表达数据帮助通路管理
Pub Date : 2012-10-04 DOI: 10.1109/BIBMW.2012.6470379
Keith S. Decker, Preeti Anday, Liang Sun, C. Schmidt
Pathway models for organisms beyond the most popular model organisms are often notoriously incomplete, even for commercially important species such as gallus gallus. This can make experimental expression data hard to interpret. The paper describes ESCAPE [Experimental System for Curation Assistance of Pathways via Espression data], under development to use available expression data, along with existing orthology mappings and curated machine-interpretable pathways, to assist in rapidly curating new species-specific pathways. Some of these techniques can also be extended to help in the analysis of the expression data in a curated pathway context as well.
除了最流行的模式生物之外,生物的途径模型往往是出了名的不完整,即使是对商业上重要的物种,如鸡,也是如此。这使得实验表达数据难以解释。这篇论文描述了正在开发中的ESCAPE(通过表达数据辅助路径管理实验系统),该系统将使用可用的表达数据,以及现有的形态学映射和机器可解释的路径管理,以帮助快速管理新的物种特异性路径。这些技术中的一些也可以扩展到帮助分析在一个精心策划的途径背景下的表达数据。
{"title":"Using expression data to help pathway curation","authors":"Keith S. Decker, Preeti Anday, Liang Sun, C. Schmidt","doi":"10.1109/BIBMW.2012.6470379","DOIUrl":"https://doi.org/10.1109/BIBMW.2012.6470379","url":null,"abstract":"Pathway models for organisms beyond the most popular model organisms are often notoriously incomplete, even for commercially important species such as gallus gallus. This can make experimental expression data hard to interpret. The paper describes ESCAPE [Experimental System for Curation Assistance of Pathways via Espression data], under development to use available expression data, along with existing orthology mappings and curated machine-interpretable pathways, to assist in rapidly curating new species-specific pathways. Some of these techniques can also be extended to help in the analysis of the expression data in a curated pathway context as well.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76418120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mechanism of needle-knife therapy in treating degenerative cervical spondylosis 针刀疗法治疗退行性颈椎病的机制
Pub Date : 2012-10-04 DOI: 10.1109/BIBMW.2012.6470330
Lizhi Chen, Feng Yuan, Xiao-Yan Zhao, Mei-ai Liang, W. Fu
The purpose of this work is to investigate the mechanism of Needle-Knife therapy in treating degenerative cervical spondylosis. With the change on the way of life and work, the incidence of degenerative cervical spondylosis is becoming higher and higher, and it has brought great suffering to people. The clinical efficacy of needle-knife therapy in treating the degenerative cervical spondylosis is satisfactory. Needle-knife therapy is an emerging treatmenti but it shows a significant effect in treating the degenerative cervical spondylosis. We will investigate the mechanism of Needle-Knife therapy in treating degenerative cervical spondylosis from spinal biomechanics and soft tissue injury.
本文旨在探讨针刀疗法治疗退行性颈椎病的作用机制。随着生活和工作方式的改变,退行性颈椎病的发病率越来越高,给人们带来了极大的痛苦。针刀疗法治疗退行性颈椎病的临床疗效满意。针刀疗法是一种新兴的治疗方法,但对退行性颈椎病的治疗效果显著。我们将探讨针刀疗法治疗退行性颈椎病由脊柱生物力学和软组织损伤引起的机制。
{"title":"Mechanism of needle-knife therapy in treating degenerative cervical spondylosis","authors":"Lizhi Chen, Feng Yuan, Xiao-Yan Zhao, Mei-ai Liang, W. Fu","doi":"10.1109/BIBMW.2012.6470330","DOIUrl":"https://doi.org/10.1109/BIBMW.2012.6470330","url":null,"abstract":"The purpose of this work is to investigate the mechanism of Needle-Knife therapy in treating degenerative cervical spondylosis. With the change on the way of life and work, the incidence of degenerative cervical spondylosis is becoming higher and higher, and it has brought great suffering to people. The clinical efficacy of needle-knife therapy in treating the degenerative cervical spondylosis is satisfactory. Needle-knife therapy is an emerging treatmenti but it shows a significant effect in treating the degenerative cervical spondylosis. We will investigate the mechanism of Needle-Knife therapy in treating degenerative cervical spondylosis from spinal biomechanics and soft tissue injury.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81268294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An empirical study on knee arthritis remedies 膝关节关节炎治疗的实证研究
Pub Date : 2012-10-04 DOI: 10.1109/BIBMW.2012.6470371
Shun Long, Guo-ming Chen, Weiheng Zhu, Wen-juan Shi
Knee arthritis is a common disease for elderly people who suffer inflammation of their knees. Various remedies have been proposed and put into practice for knee arthritis treatment and their efficacies vary. We present in this paper an empirical study on various long-established remedies for arthritis. The results given by various analysese we applied are not as clear-cut as expected. Some suggest that some common beliefs are incorrect, whilst others give controversy results hard to explain. These results can serve as a useful guide and reference for further study in related areas.
膝关节关节炎是膝关节发炎的老年人的常见病。针对膝关节炎的治疗,人们提出了各种各样的治疗方法并付诸实践,其疗效各不相同。在本文中,我们提出了对各种长期建立的治疗关节炎的实证研究。我们运用各种分析得出的结果并不像预期的那样明确。一些人认为一些普遍的信念是不正确的,而另一些人则给出了难以解释的争议结果。这些结果可为相关领域的进一步研究提供有益的指导和参考。
{"title":"An empirical study on knee arthritis remedies","authors":"Shun Long, Guo-ming Chen, Weiheng Zhu, Wen-juan Shi","doi":"10.1109/BIBMW.2012.6470371","DOIUrl":"https://doi.org/10.1109/BIBMW.2012.6470371","url":null,"abstract":"Knee arthritis is a common disease for elderly people who suffer inflammation of their knees. Various remedies have been proposed and put into practice for knee arthritis treatment and their efficacies vary. We present in this paper an empirical study on various long-established remedies for arthritis. The results given by various analysese we applied are not as clear-cut as expected. Some suggest that some common beliefs are incorrect, whilst others give controversy results hard to explain. These results can serve as a useful guide and reference for further study in related areas.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79575084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A link prediction based unsupervised rank aggregation algorithm for informative gene selection 基于链接预测的无监督秩聚集信息基因选择算法
Pub Date : 2012-10-04 DOI: 10.1109/BIBM.2012.6392697
Kang Li, Nan Du, A. Zhang
Informative Gene Selection is the process of identifying relevant genes that are significantly and differentially expressed in biological procedures. The microarray experiments conducted for this purpose usually implement only less than a hundred of samples to rank the relevance of over thousands of genes. Many irrelevant genes thus may gain statistical importance due to the randomness caused by the small sample problem, while relevant genes may lose focus in the same way. Overcoming such a problem goes beyond what a single microarray dataset can offer and stresses the use of multiple experiment results, which is defined as rank aggregation. In this paper, we propose a novel link prediction based rank aggregation algorithm for the purpose of informative gene selection. Each rank is transferred into a fully connected and weighted network, in which the nodes represent genes and the weights of links stand for priorities between connected nodes (genes). The integration of multiple gene ranks is then formulated as an optimization problem of link prediction on multiple networks, with criterion function favoring the maximization of weighted consensus among each network. We solve the problem through iterative estimation of weights and maximization of consensus among them. In the experimental evaluation, we demonstrate our method on the Prostate Cancer Dataset and compare it with other baseline methods. The results show that our link prediction based rank aggregation method remarkably outperforms all the compared methods, which proves the effectiveness of our framework in finding informative genes from multiple microarray experimental results.
信息性基因选择是识别在生物学过程中显著和差异表达的相关基因的过程。为此目的进行的微阵列实验通常只实施不到100个样本来对数千个基因的相关性进行排序。因此,由于小样本问题的随机性,许多不相关的基因可能会获得统计重要性,而相关的基因可能会以同样的方式失去重点。克服这样的问题超越了单个微阵列数据集所能提供的,并强调使用多个实验结果,这被定义为秩聚合。本文提出了一种新的基于链接预测的排序聚合算法,用于信息基因选择。每个等级被转移到一个完全连接和加权的网络中,其中节点代表基因,链路的权重代表连接节点(基因)之间的优先级。然后将多基因排序的整合表述为多网络上的链路预测优化问题,准则函数倾向于各网络间的加权一致性最大化。我们通过权重的迭代估计和它们之间一致性的最大化来解决问题。在实验评估中,我们在前列腺癌数据集上演示了我们的方法,并将其与其他基线方法进行了比较。结果表明,基于链接预测的排序聚合方法明显优于所有比较方法,证明了我们的框架在从多个微阵列实验结果中寻找信息基因方面的有效性。
{"title":"A link prediction based unsupervised rank aggregation algorithm for informative gene selection","authors":"Kang Li, Nan Du, A. Zhang","doi":"10.1109/BIBM.2012.6392697","DOIUrl":"https://doi.org/10.1109/BIBM.2012.6392697","url":null,"abstract":"Informative Gene Selection is the process of identifying relevant genes that are significantly and differentially expressed in biological procedures. The microarray experiments conducted for this purpose usually implement only less than a hundred of samples to rank the relevance of over thousands of genes. Many irrelevant genes thus may gain statistical importance due to the randomness caused by the small sample problem, while relevant genes may lose focus in the same way. Overcoming such a problem goes beyond what a single microarray dataset can offer and stresses the use of multiple experiment results, which is defined as rank aggregation. In this paper, we propose a novel link prediction based rank aggregation algorithm for the purpose of informative gene selection. Each rank is transferred into a fully connected and weighted network, in which the nodes represent genes and the weights of links stand for priorities between connected nodes (genes). The integration of multiple gene ranks is then formulated as an optimization problem of link prediction on multiple networks, with criterion function favoring the maximization of weighted consensus among each network. We solve the problem through iterative estimation of weights and maximization of consensus among them. In the experimental evaluation, we demonstrate our method on the Prostate Cancer Dataset and compare it with other baseline methods. The results show that our link prediction based rank aggregation method remarkably outperforms all the compared methods, which proves the effectiveness of our framework in finding informative genes from multiple microarray experimental results.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77323416","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Libgapmis: An ultrafast library for short-read single-gap alignment Libgapmis:一个用于短读单间隙对齐的超快库
Pub Date : 2012-10-04 DOI: 10.1109/BIBMW.2012.6470221
Nikolaos S. Alachiotis, S. Berger, T. Flouri, S. Pissis, A. Stamatakis
A broad variety of short-read alignment programmes has been released recently to address the task of mapping tens of millions of short reads to a reference genome, placing emphasis on various aspects of the problem. Although all programmes allow for a small number of alignment mismatches, some of them either perform poorly when allowing gap insertions or they do not allow for gap insertions at all. The seed-and-extend strategy is applied in most of these programmes: after a fast alignment between a fragment of the reference sequence and a high-quality fragment of a short read-the seed-an important problem is to extend the alignment between a relatively short succeeding fragment of the reference sequence and the remaining low-quality fragment of the read allowing a number of mismatches and the insertion of gaps in the alignment. However, the length of the short reads in combination with the gap occurrence frequency observed in various applications suggest that the single-gap alignment of (parts of) those reads is desirable. In this article, we present libgapmis, an ultrafast library for pairwise short-read single-gap alignment including accelerated SSE-based and GPU-based versions. It implements an algorithm, which computes a modified version of the traditional dynamic programming matrix for sequence alignment to solve the above alignment problem. We show that the library functions of the CPU-based version are up to 20x faster compared to competing programmes, while the respective SSE-based and GPU-based versions are up to 6x and llx faster than our CPU-based implementation, respectively. The functions made available via our library can be seamlessly integrated into any short-read alignment pipeline.
最近发布了各种各样的短读序列比对程序,以解决将数千万个短读序列映射到参考基因组的任务,并强调了该问题的各个方面。尽管所有程序都允许少量的对齐不匹配,但其中一些程序在允许间隙插入时表现不佳,或者根本不允许间隙插入。在大多数程序中都应用了种子-扩展策略:在参考序列的片段与短序列的高质量片段(种子)之间快速比对之后,一个重要的问题是延长参考序列的相对较短的后续片段与剩余的低质量片段之间的比对,从而导致许多不匹配和在比对中插入间隙。然而,在各种应用中观察到的短读段长度与间隙发生频率的结合表明,这些读段的(部分)单间隙对齐是可取的。在本文中,我们介绍libgapmis,这是一个超快的库,用于两两短读单间隙对齐,包括基于sse和基于gpu的加速版本。实现了一种算法,该算法通过计算传统动态规划矩阵的改进版本来求解序列对齐问题。我们表明,与竞争程序相比,基于cpu的版本的库功能快了20倍,而基于sse和基于gpu的版本分别比基于cpu的实现快了6倍和16倍。通过我们的库提供的功能可以无缝地集成到任何短读对齐管道中。
{"title":"Libgapmis: An ultrafast library for short-read single-gap alignment","authors":"Nikolaos S. Alachiotis, S. Berger, T. Flouri, S. Pissis, A. Stamatakis","doi":"10.1109/BIBMW.2012.6470221","DOIUrl":"https://doi.org/10.1109/BIBMW.2012.6470221","url":null,"abstract":"A broad variety of short-read alignment programmes has been released recently to address the task of mapping tens of millions of short reads to a reference genome, placing emphasis on various aspects of the problem. Although all programmes allow for a small number of alignment mismatches, some of them either perform poorly when allowing gap insertions or they do not allow for gap insertions at all. The seed-and-extend strategy is applied in most of these programmes: after a fast alignment between a fragment of the reference sequence and a high-quality fragment of a short read-the seed-an important problem is to extend the alignment between a relatively short succeeding fragment of the reference sequence and the remaining low-quality fragment of the read allowing a number of mismatches and the insertion of gaps in the alignment. However, the length of the short reads in combination with the gap occurrence frequency observed in various applications suggest that the single-gap alignment of (parts of) those reads is desirable. In this article, we present libgapmis, an ultrafast library for pairwise short-read single-gap alignment including accelerated SSE-based and GPU-based versions. It implements an algorithm, which computes a modified version of the traditional dynamic programming matrix for sequence alignment to solve the above alignment problem. We show that the library functions of the CPU-based version are up to 20x faster compared to competing programmes, while the respective SSE-based and GPU-based versions are up to 6x and llx faster than our CPU-based implementation, respectively. The functions made available via our library can be seamlessly integrated into any short-read alignment pipeline.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91288929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
(2) Protein structure determination on demand (2)按需测定蛋白质结构
Pub Date : 2012-10-04 DOI: 10.1109/BIBM.2012.6392662
Ming Li
Protein structure prediction by computers at best may serve as a screening method, and the current high-throughput protein structure determination methods are costly and will never exhaust all proteins. A complementary approach is "protein structure determination on demand", say in a week. We will discuss two approaches that would realize this goal: automatic protein structure determination using NMR data and mass spectrometry data.
计算机预测蛋白质结构充其量只能作为一种筛选方法,而目前的高通量蛋白质结构测定方法成本高,且无法穷尽所有蛋白质。一种补充方法是“按需测定蛋白质结构”,比如在一周内。我们将讨论实现这一目标的两种方法:使用核磁共振数据和质谱数据自动测定蛋白质结构。
{"title":"(2) Protein structure determination on demand","authors":"Ming Li","doi":"10.1109/BIBM.2012.6392662","DOIUrl":"https://doi.org/10.1109/BIBM.2012.6392662","url":null,"abstract":"Protein structure prediction by computers at best may serve as a screening method, and the current high-throughput protein structure determination methods are costly and will never exhaust all proteins. A complementary approach is \"protein structure determination on demand\", say in a week. We will discuss two approaches that would realize this goal: automatic protein structure determination using NMR data and mass spectrometry data.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90608230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Extracting volumetric information from standard two-dimensional radiological annotations within the clinical workflow 从临床工作流程中的标准二维放射注释中提取体积信息
Pub Date : 2012-10-04 DOI: 10.1109/BIBMW.2012.6470226
Sharmili Roy, M. S. Brown, G. Shih
In a typical radiological reporting workflow, radiologists make image-based annotations to denote regions of clinical significance or to perform quantitative measurements. Interestingly, virtually all annotation software allow only 2D geometric primitives such as line segments and ellipses; 3D volume annotation is not supported. As a result, when dealing with anatomic entities that have volumetric properties (e.g. tumors, organs), a radiologist must summarize volumetric quantities in a written text-report or use a third party software outside the standard workflow to perform volumetric segmentation. In this paper, we describe an automated method to extract volumes from radiological annotations. Specifically, we describe a clustering method that parses the annotations of unconnected line segments to determine the locations of volumes. We show how this extracted information can be used to bootstrap and accelerate subsequent 3D segmentation while avoiding the need to perform redundant markup or segmentation seeding outside the standard radiological workflow. This 3D data can be utilized to enhance important clinical applications such as radiological reporting, exam summarization and visualization.
在典型的放射报告工作流程中,放射科医生根据图像进行注释,以表示具有临床意义的区域或进行定量测量。有趣的是,几乎所有的注释软件只允许二维几何原语,如线段和椭圆;不支持3D体标注。因此,当处理具有体积属性的解剖实体(例如肿瘤、器官)时,放射科医生必须在书面文本报告中总结体积数量,或者使用标准工作流程之外的第三方软件来执行体积分割。在本文中,我们描述了一种自动从放射性注释中提取体积的方法。具体来说,我们描述了一种聚类方法,该方法通过分析未连接线段的注释来确定卷的位置。我们展示了如何使用这些提取的信息来引导和加速随后的3D分割,同时避免了在标准放射工作流程之外执行冗余标记或分割播种的需要。这些3D数据可用于增强重要的临床应用,如放射报告、检查总结和可视化。
{"title":"Extracting volumetric information from standard two-dimensional radiological annotations within the clinical workflow","authors":"Sharmili Roy, M. S. Brown, G. Shih","doi":"10.1109/BIBMW.2012.6470226","DOIUrl":"https://doi.org/10.1109/BIBMW.2012.6470226","url":null,"abstract":"In a typical radiological reporting workflow, radiologists make image-based annotations to denote regions of clinical significance or to perform quantitative measurements. Interestingly, virtually all annotation software allow only 2D geometric primitives such as line segments and ellipses; 3D volume annotation is not supported. As a result, when dealing with anatomic entities that have volumetric properties (e.g. tumors, organs), a radiologist must summarize volumetric quantities in a written text-report or use a third party software outside the standard workflow to perform volumetric segmentation. In this paper, we describe an automated method to extract volumes from radiological annotations. Specifically, we describe a clustering method that parses the annotations of unconnected line segments to determine the locations of volumes. We show how this extracted information can be used to bootstrap and accelerate subsequent 3D segmentation while avoiding the need to perform redundant markup or segmentation seeding outside the standard radiological workflow. This 3D data can be utilized to enhance important clinical applications such as radiological reporting, exam summarization and visualization.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85985853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1