A distributed semantic similar search for high-dimensional resources in low-dimensional content addressable network

Qingyuan Hu, Chunhong Zhang, Yang Ji
{"title":"A distributed semantic similar search for high-dimensional resources in low-dimensional content addressable network","authors":"Qingyuan Hu, Chunhong Zhang, Yang Ji","doi":"10.1109/PIMRC.2013.6666764","DOIUrl":null,"url":null,"abstract":"A mechanism for distributed semantic similar resource search is proposed in P2P network. The mechanism is based on the content addressable network (CAN). CAN, one of P2P networks, has the natural ability to support the semantic similar search with the semantic vector space model (SVSM) of resources. However, there exists a mismatching problem between the low-dimension CAN network and the high-dimension resources, which needs a dimensionality reduction algorithm. For the semantic similar search in distributed environment of CAN, the applied dimensionality reduction algorithm needs to meet two specific requirements: maintenance for semantic similarity of SVSM of resources, and distributed computing with large and dynamic data, which is not well researched. A distributed algorithm called D-PCA is proposed based on the statistical characteristic of resources in each node. It extracts the principal components of original high-dimensional SVSM to reduce the dimension in a distributed way. D-PCA is taken as a novel hash function to project high-dimensional SVSM into low-dimensional space of distributed hash table in CAN. A semantic indexing and searching process based on semantic DHT in CAN are simulated to show the applicability of D-PCA and the effectiveness of semantic similar search.","PeriodicalId":210993,"journal":{"name":"2013 IEEE 24th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 24th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PIMRC.2013.6666764","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A mechanism for distributed semantic similar resource search is proposed in P2P network. The mechanism is based on the content addressable network (CAN). CAN, one of P2P networks, has the natural ability to support the semantic similar search with the semantic vector space model (SVSM) of resources. However, there exists a mismatching problem between the low-dimension CAN network and the high-dimension resources, which needs a dimensionality reduction algorithm. For the semantic similar search in distributed environment of CAN, the applied dimensionality reduction algorithm needs to meet two specific requirements: maintenance for semantic similarity of SVSM of resources, and distributed computing with large and dynamic data, which is not well researched. A distributed algorithm called D-PCA is proposed based on the statistical characteristic of resources in each node. It extracts the principal components of original high-dimensional SVSM to reduce the dimension in a distributed way. D-PCA is taken as a novel hash function to project high-dimensional SVSM into low-dimensional space of distributed hash table in CAN. A semantic indexing and searching process based on semantic DHT in CAN are simulated to show the applicability of D-PCA and the effectiveness of semantic similar search.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
低维内容可寻址网络中高维资源的分布式语义相似搜索
提出了一种基于P2P网络的分布式语义相似资源搜索机制。该机制基于内容可寻址网络(CAN)。作为P2P网络的一种,CAN具有利用资源的语义向量空间模型(SVSM)支持语义相似搜索的天然能力。然而,低维CAN网络与高维资源之间存在不匹配问题,需要一种降维算法。对于CAN分布式环境下的语义相似搜索,所应用的降维算法需要满足资源支持向量机语义相似度的维护和海量动态数据的分布式计算两方面的具体要求,而这方面的研究还不是很充分。基于各节点资源的统计特征,提出了一种分布式的D-PCA算法。提取原始高维支持向量机的主成分,以分布式方式降维。将D-PCA作为一种新颖的哈希函数,将高维svm投影到CAN的分布式哈希表的低维空间中。仿真了CAN中基于语义DHT的语义索引和搜索过程,验证了D-PCA的适用性和语义相似搜索的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Experimental validation of fog models for FSO under laboratory controlled conditions EWMA-triggered waterfilling for reduced-complexity resource management in ad-hoc connections Sleep scheduling in IEEE 802.16j relay networks A comparison of implicit and explicit channel feedback methods for MU-MIMO WLAN systems Optimization of collaborating secondary users in a cooperative sensing under noise uncertainty
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1