Applications of Web mining - from Web search engine to P2P filtering

H. Kawano
International Conference on Informatics Research for Development of Knowledge Society Infrastructure, 2004. ICKS 2004.
Published: 2004-03-01
DOI: 10.1109/ICKS.2004.1313420 (https://doi.org/10.1109/ICKS.2004.1313420)
Citation count: 1

Abstract

We have developed a Japanese Web search engine, "Mondou (RCAAU)", based on emerging data mining technologies. The search engine provides associative keywords that are closely related to the Web pages of interest. We also implemented a visual interface based on information visualization techniques. To improve the performance of various search strategies by exploiting the characteristics of Web systems, we are working to implement advanced Web information systems with data mining and information technologies. First, we introduce several Web mining algorithms that efficiently reduce the computing cost of Web search. By concentrating on the subset of useful pages, the proposed algorithms improve Web search performance. Second, to preserve the huge volume of born-digital information on the Internet, we focus on Web archiving systems such as WARP. To handle monotonically increasing digital information, we have to resolve many difficult problems of long-term data preservation by improving Web search techniques. Our experience with the Mondou Web search engine and cooperative distributed Web robots is useful and effective here. Finally, P2P (Peer-to-Peer) distributed search systems are rapidly becoming important. For example, it is very hard to discover appropriate information resources with the simple queries of Gnutella, Freenet, and similar systems. Therefore, to realize topic-driven search, we propose more intelligent search systems based on data mining technologies.
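The abstract does not say how the associative keywords are computed. As a rough illustration only, the sketch below uses ordinary association-rule counting (support and confidence over keyword co-occurrence); this is an assumed stand-in technique, not the algorithm used in Mondou. The function name, the input format (one keyword set per page), and the thresholds are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def associative_keywords(pages, query, min_support=2, min_confidence=0.5):
    """Return keywords associated with `query`, ranked by confidence.

    pages: list of keyword sets, one per Web page (hypothetical input format).
    A rule query -> k is kept when at least `min_support` pages contain both
    keywords and confidence = count(query and k) / count(query) is high enough.
    """
    single = Counter()  # number of pages containing each keyword
    pair = Counter()    # number of pages containing each keyword pair
    for keywords in pages:
        unique = set(keywords)
        single.update(unique)
        pair.update(frozenset(p) for p in combinations(sorted(unique), 2))

    if single[query] == 0:
        return []

    results = []
    for other in single:
        if other == query:
            continue
        both = pair[frozenset((query, other))]
        confidence = both / single[query]
        if both >= min_support and confidence >= min_confidence:
            results.append((other, confidence))
    # Highest confidence first; ties broken alphabetically for stable output.
    return sorted(results, key=lambda item: (-item[1], item[0]))


# Toy data, invented purely for illustration.
pages = [
    {"web", "mining", "search"},
    {"web", "mining", "archive"},
    {"web", "search", "p2p"},
    {"mining", "search"},
]
suggestions = associative_keywords(pages, "mining")
```

On this toy data, "search" and "web" each co-occur with "mining" on two of its three pages, so both pass the thresholds, while "archive" (one page) is filtered out. A real engine would count over an inverted index rather than enumerating every keyword pair per page.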