Simsearcher: a local similarity search engine for biological sequence databases

Fifth International Symposium on Multimedia Software Engineering, 2003. Proceedings. Pub Date : 1900-01-01 DOI:10.1109/MMSE.2003.1254456

Tian-Haw Tsai, Suh-Yin Lee

引用次数: 3

Abstract

An efficient local similarity search engine is developed by exploiting some techniques of data mining. All frequent patterns in the database are retrieved and recorded in a one-time preprocessing process. Then a query sequence is checked to see whether any pattern from the preprocessing stage is matched to the query. Two regions coming from the query and a database sequence that both match a pattern form a possible seed for local similarity. Finally, we extend and score each such seed region pair to see whether there really exists local similarity with a score high enough for reporting. For computational efficiency, a novel clustering approach is proposed and integrated into the proposed system, which is based on the local similarity search engine - the DELPHI system proposed by IBM. Extensive experiments are demonstrated to show the performance of our system.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Simsearcher:生物序列数据库的局部相似性搜索引擎

利用数据挖掘技术，开发了一种高效的局部相似度搜索引擎。在一次预处理过程中检索和记录数据库中的所有频繁模式。然后检查查询序列，以查看预处理阶段中的任何模式是否与查询匹配。来自查询和数据库序列的两个区域都匹配一个模式，形成了局部相似性的可能种子。最后，我们对每个这样的种子区域对进行扩展和评分，看看是否真的存在足够高的局部相似度来进行报告。为了提高计算效率，提出了一种新的聚类方法，并将其集成到该系统中，该方法基于IBM的局部相似度搜索引擎- DELPHI系统。大量的实验证明了系统的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Fifth International Symposium on Multimedia Software Engineering, 2003. Proceedings.

自引率

0.00%

发文量