首页 > 最新文献

Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.最新文献

英文 中文
Performance comparison of algorithms for finding transcription factor binding sites 寻找转录因子结合位点的算法性能比较
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188949
S. Sinha, M. Tompa
We compare the accuracy of three motif-finding algorithms for the discovery of novel transcription factor binding sites among co-regulated genes. One of the algorithms (YMF) uses a motif model tailored for binding sites and an enumerative search of the motif space, while the other two (MEME and AlignACE) use a more general motif model and local search techniques. The comparison is done on synthetic data with planted motifs, as well as on real data sets of co-regulated genes from the yeast S. cerevisiae. More often than not, the enumerative algorithm is found to be more accurate than the other two on the yeast data sets, though there is a noticeable exclusivity in the accuracy of the different algorithms. The experiments on synthetic data reveal, not surprisingly, that each algorithm outperforms the others when motifs are planted according to its motif model.
我们比较了三种基序查找算法在共调节基因中发现新的转录因子结合位点的准确性。其中一种算法(YMF)使用针对结合位点定制的基序模型和基序空间的枚举搜索,而另外两种算法(MEME和AlignACE)使用更通用的基序模型和局部搜索技术。比较是在与种植基序的合成数据上进行的,以及在酵母酿酒酵母共调节基因的真实数据集上进行的。通常情况下,发现枚举算法比酵母数据集上的其他两种算法更准确,尽管不同算法的准确性存在明显的排他性。在合成数据上的实验表明,毫不奇怪,当根据其基序模型植入基序时,每种算法都优于其他算法。
{"title":"Performance comparison of algorithms for finding transcription factor binding sites","authors":"S. Sinha, M. Tompa","doi":"10.1109/BIBE.2003.1188949","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188949","url":null,"abstract":"We compare the accuracy of three motif-finding algorithms for the discovery of novel transcription factor binding sites among co-regulated genes. One of the algorithms (YMF) uses a motif model tailored for binding sites and an enumerative search of the motif space, while the other two (MEME and AlignACE) use a more general motif model and local search techniques. The comparison is done on synthetic data with planted motifs, as well as on real data sets of co-regulated genes from the yeast S. cerevisiae. More often than not, the enumerative algorithm is found to be more accurate than the other two on the yeast data sets, though there is a noticeable exclusivity in the accuracy of the different algorithms. The experiments on synthetic data reveal, not surprisingly, that each algorithm outperforms the others when motifs are planted according to its motif model.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116620381","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
Determining the asymmetries of skin lesions with fuzzy borders 边界模糊皮肤病变的不对称性判定
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188955
V. Ng, Tim K. Lee, Benny Y. M. Fung
Malignant melanoma is a popular cancer among youth; it is desirable to have a fast and convenience way to determine this disease in its early stage. One of the clinical features in diagnosis is related to the shape of lesions. In previous studies, circularity is commonly used as the asymmetric measurement of skin lesions. However, this measurement depends very much on the accuracy of the segmentation result. In this paper, we present an artificial neural network model to improve the measurements of the asymmetries of lesions that may have fuzzy borders. The main idea is enhancing the symmetric distant (eSD) with a number of variations. Results from experiments, which use the digitized images front the Lesion Clinic in Vancouver, Canada have shown the good discriminating power of the neural network model.
恶性黑色素瘤是年轻人中常见的癌症;希望能有一种快速方便的方法在早期诊断此病。诊断的临床特征之一与病变的形状有关。在以往的研究中,圆形通常被用作皮肤病变的不对称测量。然而,这种测量在很大程度上取决于分割结果的准确性。在本文中,我们提出了一种人工神经网络模型,以改进可能具有模糊边界的病变不对称性的测量。其主要思想是通过一些变化来增强对称距离(eSD)。利用加拿大温哥华病变诊所的数字化图像进行的实验结果表明,该神经网络模型具有良好的识别能力。
{"title":"Determining the asymmetries of skin lesions with fuzzy borders","authors":"V. Ng, Tim K. Lee, Benny Y. M. Fung","doi":"10.1109/BIBE.2003.1188955","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188955","url":null,"abstract":"Malignant melanoma is a popular cancer among youth; it is desirable to have a fast and convenience way to determine this disease in its early stage. One of the clinical features in diagnosis is related to the shape of lesions. In previous studies, circularity is commonly used as the asymmetric measurement of skin lesions. However, this measurement depends very much on the accuracy of the segmentation result. In this paper, we present an artificial neural network model to improve the measurements of the asymmetries of lesions that may have fuzzy borders. The main idea is enhancing the symmetric distant (eSD) with a number of variations. Results from experiments, which use the digitized images front the Lesion Clinic in Vancouver, Canada have shown the good discriminating power of the neural network model.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122999445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Load balancing: a case study of a pharmaceutical drug candidate database 负载平衡:一个药物候选数据库的案例研究
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188973
Zina Ben-Miled, S. Li, Jesse Martin, Chavali Balagopalakrishna, O. Bukhres, Robert J. Oppelt
In the past decade chemical and biological laboratory experiments have generated an explosive amount of data. As a result, a set applications that manipulate these dynamic, heterogeneous and massive amounts of data have emerged. An example of such applications in the pharmaceutical industry is the computational process involved in the early drug discovery of lead drug candidates for a given target disease. The discovery of lead drug candidates requires both consecutive and random data access to the pharmaceutical drug candidate database. This paper focuses on performance enhancement techniques for the pharmaceutical drug candidate database application. In particular, this paper compares static load balancing and dynamic load balancing in the context of the drug candidate database application. This database application is based on multi-queries. Some of these queries are multi-join queries.
在过去的十年里,化学和生物实验室实验产生了爆炸性的数据量。因此,出现了一组处理这些动态、异构和海量数据的应用程序。这种应用在制药工业中的一个例子是针对给定目标疾病的先导候选药物的早期药物发现所涉及的计算过程。先导候选药物的发现需要对候选药物数据库进行连续和随机的数据访问。本文主要研究候选药物数据库应用的性能增强技术。本文特别对候选药物数据库应用中的静态负载平衡和动态负载平衡进行了比较。这个数据库应用程序基于多查询。其中一些查询是多连接查询。
{"title":"Load balancing: a case study of a pharmaceutical drug candidate database","authors":"Zina Ben-Miled, S. Li, Jesse Martin, Chavali Balagopalakrishna, O. Bukhres, Robert J. Oppelt","doi":"10.1109/BIBE.2003.1188973","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188973","url":null,"abstract":"In the past decade chemical and biological laboratory experiments have generated an explosive amount of data. As a result, a set applications that manipulate these dynamic, heterogeneous and massive amounts of data have emerged. An example of such applications in the pharmaceutical industry is the computational process involved in the early drug discovery of lead drug candidates for a given target disease. The discovery of lead drug candidates requires both consecutive and random data access to the pharmaceutical drug candidate database. This paper focuses on performance enhancement techniques for the pharmaceutical drug candidate database application. In particular, this paper compares static load balancing and dynamic load balancing in the context of the drug candidate database application. This database application is based on multi-queries. Some of these queries are multi-join queries.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130194041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predictive model for yeast protein functions using modular neural approach 基于模块化神经网络的酵母蛋白功能预测模型
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188984
Doosung Hwang, F. Fotouhi, R. Finley, W. Grosky
In this paper we use a modular neural network to predict the molecular functions of yeast proteins. To solve this class problem, our proposed approach decomposes the original problem into a set of solvable 2-class subproblems using class information. Each 2-class problem has a set of positive and negative data. The yeast data is not equally distributed in function classes and hinders the learning of each neural network. We adopt a sampling strategy that generates a set of new class data to the subordinate class in order to balance the positive and negative data set. In data preparation, the biological concept of "guilt-by-interaction" is used for covering possible interaction partners among proteins of known functions. The proposed framework has been tested as a predictive model of yeast protein functions where the data source is stored in a relational database. In the experiments, the proposed system shows an average accuracy of 91.0% in the test set.
本文采用模块化神经网络对酵母蛋白的分子功能进行预测。为了解决这类问题,我们提出的方法利用类信息将原始问题分解为一组可解的2类子问题。每个2类问题都有一组正数据和负数据。酵母数据在函数类中分布不均,阻碍了每个神经网络的学习。为了平衡正负数据集,我们采用了一种采样策略,即向从属类生成一组新的类数据。在数据准备中,“相互作用的负罪感”的生物学概念用于覆盖已知功能的蛋白质之间可能的相互作用伙伴。所提出的框架已被测试为酵母蛋白功能的预测模型,其中数据源存储在关系数据库中。在实验中,该系统在测试集中的平均准确率为91.0%。
{"title":"Predictive model for yeast protein functions using modular neural approach","authors":"Doosung Hwang, F. Fotouhi, R. Finley, W. Grosky","doi":"10.1109/BIBE.2003.1188984","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188984","url":null,"abstract":"In this paper we use a modular neural network to predict the molecular functions of yeast proteins. To solve this class problem, our proposed approach decomposes the original problem into a set of solvable 2-class subproblems using class information. Each 2-class problem has a set of positive and negative data. The yeast data is not equally distributed in function classes and hinders the learning of each neural network. We adopt a sampling strategy that generates a set of new class data to the subordinate class in order to balance the positive and negative data set. In data preparation, the biological concept of \"guilt-by-interaction\" is used for covering possible interaction partners among proteins of known functions. The proposed framework has been tested as a predictive model of yeast protein functions where the data source is stored in a relational database. In the experiments, the proposed system shows an average accuracy of 91.0% in the test set.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126360958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A fast algorithm for RNA secondary structure prediction including pseudoknots 包含假结的RNA二级结构快速预测算法
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188924
F. Tahi, S. Engelen, M. Régnier
Many important RNA molecules contain pseudoknots, which are generally excluded by the definition of the secondary structure, mainly for computational reasons. Still, most existing algorithms for secondary structure prediction are not satisfactory in results and complexities, even when pseudoknots are not allowed. We present an algorithm, called P-DCFold, for the prediction of RNA secondary structures including all kinds of pseudoknots. It is based on the comparative approach. The helices are searched recursively, from more "likely" to less "likely", using the "Divide and Conquer" approach. This approach, which allows to limit the amount of searching, is possible when only non-interleaved helices are searched for. The pseudoknots are therefore searched in several steps, each helix of the pseudoknot being selected in a different step. P-DCFold has been applied to tmRNA and RnaseP sequences. In less than two seconds, their respective secondary structures, including their pseudoknots, have been recovered very efficiently.
许多重要的RNA分子含有假结,二级结构的定义通常将其排除在外,主要是由于计算原因。然而,大多数现有的二级结构预测算法在结果和复杂性上都不令人满意,即使在不允许假节的情况下也是如此。我们提出了一种称为P-DCFold的算法,用于预测包括各种假结在内的RNA二级结构。它是基于比较的方法。使用“分而治之”的方法,从“可能性”越高到“可能性”越低,递归地搜索螺旋。这种允许限制搜索量的方法在只搜索非交错的螺旋时是可行的。因此,伪结在几个步骤中搜索,每个假结的螺旋在不同的步骤中被选择。P-DCFold已应用于tmRNA和RnaseP序列。在不到两秒钟的时间里,它们各自的二级结构,包括它们的假结,都被非常有效地恢复了。
{"title":"A fast algorithm for RNA secondary structure prediction including pseudoknots","authors":"F. Tahi, S. Engelen, M. Régnier","doi":"10.1109/BIBE.2003.1188924","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188924","url":null,"abstract":"Many important RNA molecules contain pseudoknots, which are generally excluded by the definition of the secondary structure, mainly for computational reasons. Still, most existing algorithms for secondary structure prediction are not satisfactory in results and complexities, even when pseudoknots are not allowed. We present an algorithm, called P-DCFold, for the prediction of RNA secondary structures including all kinds of pseudoknots. It is based on the comparative approach. The helices are searched recursively, from more \"likely\" to less \"likely\", using the \"Divide and Conquer\" approach. This approach, which allows to limit the amount of searching, is possible when only non-interleaved helices are searched for. The pseudoknots are therefore searched in several steps, each helix of the pseudoknot being selected in a different step. P-DCFold has been applied to tmRNA and RnaseP sequences. In less than two seconds, their respective secondary structures, including their pseudoknots, have been recovered very efficiently.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127128536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Requirements of phylogenetic databases 系统发育数据库的要求
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188940
Luay K. Nakhleh, Daniel P. Miranker, François Barbançon, W. H. Piel, M. Donoghue
We examine the organizational impact on phylogenetic databases of the increasing sophistication in the need and use of phylogenetic data. A primary issue is the use of the unnormalized representation of phylogenies in Newick format as a primitive data type in existing phylogenetic databases. In particular we identify and enumerate a list of potential applications of such databases and queries (use-cases) that biologists may wish to see integrated into a phylogenetic database management system. We show there are many queries that would best be supported by a normalized data model where phylogenies are stored as lists of edges. Since many of the queries require transitive traversals of the phylogenies we demonstrate, constructively, that complex phylogenetic queries can be conveniently constructed as Datalog programs. We address concerns with respect to the cost and performance of the normalized representation by developing and empirically evaluating a feasibility prototype.
我们研究了系统发育数据日益复杂的需求和使用对系统发育数据库的组织影响。一个主要问题是在现有的系统发生数据库中使用Newick格式的系统发生的非规范化表示作为原始数据类型。特别地,我们确定并列举了这些数据库和查询(用例)的潜在应用程序列表,生物学家可能希望将其集成到系统发育数据库管理系统中。我们展示了许多查询最好由规范化数据模型支持,其中系统发生被存储为边列表。由于许多查询需要系统发生的传递遍历,我们建设性地证明,复杂的系统发生查询可以方便地构造为Datalog程序。我们通过开发和经验评估可行性原型来解决规范化表示的成本和性能方面的问题。
{"title":"Requirements of phylogenetic databases","authors":"Luay K. Nakhleh, Daniel P. Miranker, François Barbançon, W. H. Piel, M. Donoghue","doi":"10.1109/BIBE.2003.1188940","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188940","url":null,"abstract":"We examine the organizational impact on phylogenetic databases of the increasing sophistication in the need and use of phylogenetic data. A primary issue is the use of the unnormalized representation of phylogenies in Newick format as a primitive data type in existing phylogenetic databases. In particular we identify and enumerate a list of potential applications of such databases and queries (use-cases) that biologists may wish to see integrated into a phylogenetic database management system. We show there are many queries that would best be supported by a normalized data model where phylogenies are stored as lists of edges. Since many of the queries require transitive traversals of the phylogenies we demonstrate, constructively, that complex phylogenetic queries can be conveniently constructed as Datalog programs. We address concerns with respect to the cost and performance of the normalized representation by developing and empirically evaluating a feasibility prototype.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132951778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Prostate gland and extra-capsular tissue 3D reconstruction and measurement 前列腺及囊外组织三维重建与测量
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188958
F. McKenzie, Rania Hussein, Jennifer Seevinck, P. Schellhammer, J. Diaz
Currently there are little objective parameters that can quantify the success of one form of prostate surgical removal over another. Accordingly, at Old Dominion University (ODU) we have been developing a process resulting in the use of software algorithms to assess the coverage and depth of extra-capsular soft tissue removed with the prostate by the various surgical approaches. Parameters such as the percent of capsule that is bare of soft tissue and where present the depth and extent of coverage have been assessed. First, visualization methods and tools are developed for images of prostate slices that are provided to ODU by the Pathology Department at Eastern Virginia Medical School (EVMS). The visualization tools interpolate and present 3D models of the prostates. Measurement algorithms are then applied to determine statistics about extra-capsular tissue coverage. This paper addresses the modeling, visualization, and analysis of prostate gland tissue to aid in quantifying prostate surgery success. Particular attention is directed towards the accuracy of these measurements and is addressed in the analysis discussions.
目前,很少有客观的参数可以量化一种形式的前列腺手术切除比另一种成功。因此,在老道明大学(ODU),我们一直在开发一种方法,使用软件算法来评估通过各种手术方法与前列腺一起切除的囊外软组织的覆盖范围和深度。已经评估了诸如裸露软组织的胶囊的百分比以及存在的覆盖深度和范围等参数。首先,开发了由东弗吉尼亚医学院(EVMS)病理学部门提供给ODU的前列腺切片图像的可视化方法和工具。可视化工具插入并呈现前列腺的3D模型。然后应用测量算法来确定有关包膜外组织覆盖的统计数据。本文讨论了前列腺组织的建模、可视化和分析,以帮助量化前列腺手术的成功。特别注意的是这些测量的准确性,并在分析讨论中讨论。
{"title":"Prostate gland and extra-capsular tissue 3D reconstruction and measurement","authors":"F. McKenzie, Rania Hussein, Jennifer Seevinck, P. Schellhammer, J. Diaz","doi":"10.1109/BIBE.2003.1188958","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188958","url":null,"abstract":"Currently there are little objective parameters that can quantify the success of one form of prostate surgical removal over another. Accordingly, at Old Dominion University (ODU) we have been developing a process resulting in the use of software algorithms to assess the coverage and depth of extra-capsular soft tissue removed with the prostate by the various surgical approaches. Parameters such as the percent of capsule that is bare of soft tissue and where present the depth and extent of coverage have been assessed. First, visualization methods and tools are developed for images of prostate slices that are provided to ODU by the Pathology Department at Eastern Virginia Medical School (EVMS). The visualization tools interpolate and present 3D models of the prostates. Measurement algorithms are then applied to determine statistics about extra-capsular tissue coverage. This paper addresses the modeling, visualization, and analysis of prostate gland tissue to aid in quantifying prostate surgery success. Particular attention is directed towards the accuracy of these measurements and is addressed in the analysis discussions.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131437790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
MotifMiner: a general toolkit for efficiently identifying common substructures in molecules MotifMiner:用于有效识别分子中常见子结构的通用工具包
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188971
M. Coatney, S. Parthasarathy
Scientific research often involves examining structural relationships in molecules since scientists strongly believe in the causal relationship between structure and function. Traditionally, researchers have identified these patterns, or motifs, manually using biochemical expertise. However, with the massive influx of new biochemical data and the ability to gather data for very large molecules, there is great need for techniques that automatically and efficiently identify commonly occurring structural patterns in molecules. Previous automated substructure discovery approaches have each introduced variations of similar underlying techniques and have embedded domain knowledge. While doing so improves performance for the particular domain, this complicates extensibility to other domains. Also, they do not address scalability or noise, which is critical for certain structural domains like macromolecules. In this paper, we present MotifMiner, a general toolkit for automatically identifying common motifs in most any scientific molecular dataset. We describe both our application framework and services for identifying motifs, as well as demonstrate the flexibility of our system by analyzing several disparate domains, including protein, drug, and MD simulation datasets.
科学研究经常涉及检查分子的结构关系,因为科学家们坚信结构和功能之间的因果关系。传统上,研究人员已经确定了这些模式,或基序,手动使用生化专业知识。然而,随着新的生化数据的大量涌入和收集非常大分子数据的能力,非常需要自动有效地识别分子中常见的结构模式的技术。以前的自动化子结构发现方法都引入了类似底层技术的变体,并嵌入了领域知识。虽然这样做可以提高特定领域的性能,但这会使到其他领域的可扩展性变得复杂。此外,它们没有解决可扩展性或噪声问题,而这对于大分子等特定结构域至关重要。在本文中,我们提出了MotifMiner,一个用于自动识别大多数科学分子数据集中的公共基序的通用工具包。我们描述了用于识别基序的应用程序框架和服务,并通过分析几个不同的领域(包括蛋白质、药物和MD模拟数据集)展示了我们系统的灵活性。
{"title":"MotifMiner: a general toolkit for efficiently identifying common substructures in molecules","authors":"M. Coatney, S. Parthasarathy","doi":"10.1109/BIBE.2003.1188971","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188971","url":null,"abstract":"Scientific research often involves examining structural relationships in molecules since scientists strongly believe in the causal relationship between structure and function. Traditionally, researchers have identified these patterns, or motifs, manually using biochemical expertise. However, with the massive influx of new biochemical data and the ability to gather data for very large molecules, there is great need for techniques that automatically and efficiently identify commonly occurring structural patterns in molecules. Previous automated substructure discovery approaches have each introduced variations of similar underlying techniques and have embedded domain knowledge. While doing so improves performance for the particular domain, this complicates extensibility to other domains. Also, they do not address scalability or noise, which is critical for certain structural domains like macromolecules. In this paper, we present MotifMiner, a general toolkit for automatically identifying common motifs in most any scientific molecular dataset. We describe both our application framework and services for identifying motifs, as well as demonstrate the flexibility of our system by analyzing several disparate domains, including protein, drug, and MD simulation datasets.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116020919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
A new approach to detect similar proteins from 2D gel electrophoresis images 一种从二维凝胶电泳图像中检测相似蛋白的新方法
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188945
Nawaz Khan, S. Rahman
Many algorithms are available for quantitative and qualitative analysis of protein spot in gel electrophoresis images and majority of the these algorithms use geometric and image processing techniques to match protein spots. These algorithms do not take into consideration the electrophoretic mobility of the proteins and they only match similar protein spots rather than matching similar proteins. The approach presented in this paper uses a novel technique based on electrophoretic mobility to match protein spots between source and target images. The algorithm identifies the protein spot in the target image which lies on the same line of path as it is in the source image. A shape matching algorithm using Generalized Hough Transform and Canny Edge Detection method is used to determine the shape variance. The method describes here gives an accuracy of 90% or more to identify the same or similar proteins from the target image. Finally, a dedicated target based database has been created to store a set of finite values of an element spot for correlating the 3D protein structure.
凝胶电泳图像中蛋白质斑点的定量和定性分析算法有很多,其中大多数算法使用几何和图像处理技术来匹配蛋白质斑点。这些算法没有考虑蛋白质的电泳迁移率,它们只匹配相似的蛋白质点,而不是匹配相似的蛋白质。本文提出了一种基于电泳迁移率的新技术来匹配源图像和目标图像之间的蛋白质点。该算法识别目标图像中与源图像在同一条路径上的蛋白质点。采用广义霍夫变换和Canny边缘检测相结合的形状匹配算法来确定形状方差。本文描述的方法从目标图像中识别相同或相似的蛋白质的准确率为90%或更高。最后,建立了一个专用的基于目标的数据库来存储一组元素点的有限值,用于关联三维蛋白质结构。
{"title":"A new approach to detect similar proteins from 2D gel electrophoresis images","authors":"Nawaz Khan, S. Rahman","doi":"10.1109/BIBE.2003.1188945","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188945","url":null,"abstract":"Many algorithms are available for quantitative and qualitative analysis of protein spot in gel electrophoresis images and majority of the these algorithms use geometric and image processing techniques to match protein spots. These algorithms do not take into consideration the electrophoretic mobility of the proteins and they only match similar protein spots rather than matching similar proteins. The approach presented in this paper uses a novel technique based on electrophoretic mobility to match protein spots between source and target images. The algorithm identifies the protein spot in the target image which lies on the same line of path as it is in the source image. A shape matching algorithm using Generalized Hough Transform and Canny Edge Detection method is used to determine the shape variance. The method describes here gives an accuracy of 90% or more to identify the same or similar proteins from the target image. Finally, a dedicated target based database has been created to store a set of finite values of an element spot for correlating the 3D protein structure.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126623149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Respiratory gating for MRI and MRS in rodents 啮齿类动物呼吸门控的MRI和MRS研究
Pub Date : 2003-03-10 DOI: 10.1109/BIBE.2003.1188938
J. Garbow, J. Dugas, M. Conradi
The synchronization of MRI data acquisition to the respiratory cycle is crucial for collecting high-resolution images of small animals. Data collected during at-rest periods (breath-hold or between breaths) do not suffer from the blurring effects of respiratory motion seen in unsynchronized images. Here, we describe a simple, inexpensive, robust respiratory gating unit to achieve this synchronization. The unit can operate in either free breathing or driven breathing (ventilator) modes, with respiration detected optically (breathing motion) or via a pressure transducer. We demonstrate the effectiveness of this unit with in-vivo mouse images of lung and liver.
MRI数据采集与呼吸周期同步是采集小动物高分辨率图像的关键。在休息期间(屏气或两次呼吸之间)收集的数据不会受到在不同步图像中看到的呼吸运动的模糊影响。在这里,我们描述了一个简单,廉价,坚固的呼吸门控单元来实现这种同步。该装置可以在自由呼吸或驱动呼吸(呼吸机)模式下工作,呼吸检测光学(呼吸运动)或通过压力传感器。我们用活体小鼠肺和肝脏图像证明了该单元的有效性。
{"title":"Respiratory gating for MRI and MRS in rodents","authors":"J. Garbow, J. Dugas, M. Conradi","doi":"10.1109/BIBE.2003.1188938","DOIUrl":"https://doi.org/10.1109/BIBE.2003.1188938","url":null,"abstract":"The synchronization of MRI data acquisition to the respiratory cycle is crucial for collecting high-resolution images of small animals. Data collected during at-rest periods (breath-hold or between breaths) do not suffer from the blurring effects of respiratory motion seen in unsynchronized images. Here, we describe a simple, inexpensive, robust respiratory gating unit to achieve this synchronization. The unit can operate in either free breathing or driven breathing (ventilator) modes, with respiration detected optically (breathing motion) or via a pressure transducer. We demonstrate the effectiveness of this unit with in-vivo mouse images of lung and liver.","PeriodicalId":178814,"journal":{"name":"Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114499102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Third IEEE Symposium on Bioinformatics and Bioengineering, 2003. Proceedings.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1