Latest papers from the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology

Use of E-mail Social Networks for Enterprise Benefit
M. Laclavik, Stefan Dlugolinsky, M. Kvassay, L. Hluchý
The article discusses the potential methods and benefits of the analysis of social networks hidden in enterprise and personal email archives. A proof-of-concept prototype was developed. Social network extraction and the spreading activation algorithm are discussed and evaluated.
DOI: https://doi.org/10.1109/WI-IAT.2010.126
Published: 2010-08-31
Citations: 11
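The abstract above names spreading activation but does not describe the authors' variant. As a rough illustration of the general technique only, the following sketch pushes activation from seed nodes along weighted edges of a small graph. The function name, the `decay` and `threshold` parameters, and the toy contact graph are all assumptions made for this example and are not taken from the paper.

```python
from collections import defaultdict


def spreading_activation(graph, seeds, decay=0.5, threshold=0.01, max_steps=3):
    """Spread activation from seed nodes over a weighted graph.

    graph: dict mapping node -> dict of neighbor -> edge weight in [0, 1]
    seeds: dict mapping node -> initial activation
    Returns a dict of node -> accumulated activation.
    """
    activation = defaultdict(float)
    for node, value in seeds.items():
        activation[node] = value

    frontier = dict(seeds)
    for _ in range(max_steps):
        next_frontier = defaultdict(float)
        for node, value in frontier.items():
            for neighbor, weight in graph.get(node, {}).items():
                # Each hop attenuates the signal by the edge weight and a decay factor.
                pulse = value * weight * decay
                if pulse >= threshold:
                    next_frontier[neighbor] += pulse
        if not next_frontier:
            break
        for node, value in next_frontier.items():
            activation[node] += value
        frontier = next_frontier
    return dict(activation)


# Hypothetical contact graph: edge weights stand in for how often two people exchange mail.
contacts = {
    "alice": {"bob": 1.0, "carol": 0.5},
    "bob": {"alice": 1.0, "dave": 1.0},
    "carol": {"alice": 0.5},
    "dave": {"bob": 1.0},
}
scores = spreading_activation(contacts, {"alice": 1.0}, max_steps=2)
```

In this toy run, "dave" receives activation only through "bob", which is the kind of indirect relation that activation spreading is meant to surface.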
Identifying Cohesive Subgroups and Their Correspondences in Multiple Related Networks
Prakash Mandayam Comar, Pang-Ning Tan, Anil K. Jain
Identifying cohesive subgroups in networks, also known as clustering, is an active area of research in link mining with many practical applications. However, most of the early work in this area has focused on partitioning a single network or a bipartite graph into clusters/communities. This paper presents a framework that simultaneously clusters nodes from multiple related networks and learns the correspondences between subgroups in different networks. The framework also allows the incorporation of prior information about potential relationships between the subgroups. We have performed extensive experiments on both synthetic and real-life data sets to evaluate the effectiveness of our framework. Our results show superior performance of simultaneous clustering over independent clustering of individual networks.
DOI: https://doi.org/10.1109/WI-IAT.2010.226
Published: 2010-08-31
Citations: 11
Impact of One-Timer/N-Timer Object Classification on the Performance of Web Cache Replacement Algorithms
Saloua Messaoud Abid, H. Youssef
Web cache replacement algorithms proposed in the literature try to maximize the Hit Ratio (HR), the Byte Hit Ratio (BHR), and the Delay Saving Ratio (DSR). However, even with an infinite Web cache storage capacity, values of these metrics could not exceed 70% most of the time. This is due to the fact that, given a workload, the first reference to an object is always a miss. Moreover, a statistical analysis of the workload shows that as much as 76% of objects are One-Timers (OT), i.e. they are referenced only once. Caching OT objects usually degrades the performance of all Web cache replacement algorithms. In fact, it may cause the eviction of N-Timer (NT) objects and hence increase the number of misses. In this paper, we present a technique to classify whether a cached object is an OT or not. We show through simulation that classification may significantly enhance the performance of replacement algorithms with respect to the HR, the BHR and the DSR.
DOI: https://doi.org/10.1109/WI-IAT.2010.124
Published: 2010-08-31
Citations: 8
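The effect described in the abstract above can be seen on a toy request trace. The sketch below is an assumption-laden illustration, not the authors' classifier: it uses LRU as the replacement policy and an oracle that counts references over the whole trace to decide which objects are One-Timers, which a real cache could not do. The function names and the trace are invented for the example.

```python
from collections import Counter, OrderedDict


def infinite_cache_hit_ratio(trace):
    """Upper bound on the hit ratio: every first reference is a miss."""
    return (len(trace) - len(set(trace))) / len(trace)


def simulate_lru(trace, capacity, skip_one_timers=False):
    """Return the hit ratio of an LRU cache over a request trace.

    If skip_one_timers is True, objects referenced exactly once in the
    whole trace are never admitted (oracle classification, for illustration).
    """
    counts = Counter(trace)
    cache = OrderedDict()
    hits = 0
    for obj in trace:
        if obj in cache:
            hits += 1
            cache.move_to_end(obj)  # mark as most recently used
            continue
        if skip_one_timers and counts[obj] == 1:
            continue  # do not let a One-Timer evict an N-Timer
        cache[obj] = True
        if len(cache) > capacity:
            cache.popitem(last=False)  # evict the least recently used object
    return hits / len(trace)


# Hypothetical trace: "a" and "b" are N-Timers; "x", "y", "z" are One-Timers.
trace = ["a", "x", "b", "y", "a", "z", "b"]
plain = simulate_lru(trace, capacity=2)
filtered = simulate_lru(trace, capacity=2, skip_one_timers=True)
bound = infinite_cache_hit_ratio(trace)
```

On this trace the plain LRU cache scores no hits because One-Timers keep evicting "a" and "b", while the filtered cache reaches the infinite-cache bound of 2/7.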
Reasoning with Imprecise Context Using Improved Dempster-Shafer Theory
C. H. Lyu, Minseuk Choi, Z. Li, H. Youn
In pervasive computing environments, contexts are usually imprecise and incomplete due to unreliable connectivity, user mobility, and resource constraints. In this paper we present an approach based on the Dempster-Shafer Theory (DST) for reasoning with imprecise context. To solve the two fundamental issues of the DST, computation intensiveness and the Zadeh paradox, we filter out excrescent subsets based on their energy to reduce the number of subsets, and employ the concept of evidence loss and approval degree of evidence in the combining process.
DOI: https://doi.org/10.1109/WI-IAT.2010.190
Published: 2010-08-31
Citations: 8
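For readers unfamiliar with the Zadeh paradox mentioned in the abstract above, the sketch below implements the classical (unimproved) Dempster rule of combination and applies it to the standard textbook example with two highly conflicting sources. It does not implement the paper's subset filtering, evidence loss, or approval degree; the hypothesis labels and masses are the usual illustrative values, not data from the paper.

```python
from itertools import product


def combine(m1, m2):
    """Classical Dempster rule for two mass functions.

    Each mass function maps a frozenset of hypotheses to its mass.
    Returns (combined mass function, conflict mass K).
    """
    combined = {}
    conflict = 0.0
    for (a, x), (b, y) in product(m1.items(), m2.items()):
        common = a & b
        if common:
            combined[common] = combined.get(common, 0.0) + x * y
        else:
            conflict += x * y  # mass assigned to the empty set
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict; rule is undefined")
    # Normalize by 1 - K so the combined masses sum to one.
    return {s: v / (1.0 - conflict) for s, v in combined.items()}, conflict


A, B, C = frozenset("A"), frozenset("B"), frozenset("C")
source_1 = {A: 0.99, C: 0.01}  # almost certain of A
source_2 = {B: 0.99, C: 0.01}  # almost certain of B
fused, k = combine(source_1, source_2)
```

Both sources consider C very unlikely, yet after normalization essentially all of the combined mass lands on C, with a conflict mass close to 0.9999. This counterintuitive outcome is the Zadeh paradox that improved combination rules try to avoid.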
Making the Most of a Web Search Session
Benno Stein, Matthias Hagen
We tackle problems related to Web query formulation: given the set of keywords from a search session, 1) we find a maximum promising Web query, and, 2) we construct a family of promising Web queries covering all keywords. A query is promising if it fulfills user-defined constraints on the number of returned hits. We assume a real-world setting where the user is not given direct access to a search engine's index, i.e., querying is possible only through an interface. The goal to be optimized is the overall number of submitted Web queries. For both problems we develop search strategies based on co-occurrence probabilities. The achieved performance gain is substantial: compared to the uninformed baselines without co-occurrence probabilities the expected savings are up to 50% in the number of submitted queries, index accesses, and runtime.
DOI: https://doi.org/10.1109/WI-IAT.2010.234
Published: 2010-08-31
Citations: 7
Epsilon-Subjective Equivalence of Models for Interactive Dynamic Influence Diagrams
Prashant Doshi, Muthukumaran Chandrasekaran, Yi-feng Zeng
Interactive dynamic influence diagrams (I-DID) are graphical models for sequential decision making in uncertain settings shared by other agents. Algorithms for solving I-DIDs face the challenge of an exponentially growing space of candidate models ascribed to other agents, over time. Pruning behaviorally equivalent models is one way toward minimizing the model set. We seek to further reduce the complexity by additionally pruning models that are approximately subjectively equivalent. Toward this, we define subjective equivalence in terms of the distribution over the subject agent's future action-observation paths, and introduce the notion of epsilon-subjective equivalence. We present a new approximation technique that reduces the candidate model space by removing models that are epsilon-subjectively equivalent with representative ones.
DOI: https://doi.org/10.1109/WI-IAT.2010.74
Published: 2010-08-31
Citations: 18
A Multi-heuristic Cooperative Ant Colony System for Optimizing Elimination Ordering of Bayesian Networks
Xuchu Dong, D. Ouyang, Yuxin Ye, Haihong Yu, Yonggang Zhang
To solve the problem of searching for an optimal elimination ordering of Bayesian networks, a novel effective heuristic, MinSum Weight, and an ACS approach incorporating a multi-heuristic mechanism are proposed. The ACS approach, named MHC-ACS, utilizes a set of heuristics to direct the ants moving in the search space. The cooperation of multiple heuristics helps ants explore more regions. Moreover, the most appropriate heuristic will be identified and reinforced with the evolution of the whole system. Experiments demonstrate that MHC-ACS performs better than other swarm intelligence methods.
DOI: https://doi.org/10.1109/WI-IAT.2010.33
Published: 2010-08-31
Citations: 0
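To make the elimination-ordering problem in the abstract above concrete, here is a minimal sketch of a single greedy heuristic, the classic min-degree rule, applied to an undirected (moralized) graph. It is neither the MinSum Weight heuristic nor MHC-ACS, whose definitions the abstract does not give; the function name and the example graph are assumptions for illustration.

```python
def min_degree_ordering(adjacency):
    """Greedy elimination ordering using the min-degree heuristic.

    adjacency: dict mapping node -> iterable of neighbors (undirected graph).
    Returns (ordering, width), where width is the largest number of
    neighbors any node had at the moment it was eliminated.
    """
    graph = {v: set(ns) for v, ns in adjacency.items()}
    ordering = []
    width = 0
    while graph:
        # Pick the node with the fewest remaining neighbors; break ties by name.
        node = min(graph, key=lambda v: (len(graph[v]), v))
        neighbors = graph.pop(node)
        width = max(width, len(neighbors))
        for u in neighbors:
            graph[u].discard(node)
            # Eliminating a node connects all of its neighbors to each other.
            graph[u] |= neighbors - {u}
        ordering.append(node)
    return ordering, width


# Hypothetical moral graph: a four-node cycle a-b-c-d-a.
cycle = {"a": ["b", "d"], "b": ["a", "c"], "c": ["b", "d"], "d": ["a", "c"]}
order, width = min_degree_ordering(cycle)
```

A greedy rule like this yields one candidate ordering per heuristic; approaches such as the one in the paper search over many orderings, using heuristics of this kind to guide the search.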
Getting What You Pay For: Is Exploration in Distributed Hill Climbing Really Worth it?
Melanie Smith, R. Mailler
The Distributed Stochastic Algorithm (DSA), Distributed Breakout Algorithm (DBA), and variations such as Distributed Simulated Annealing (DSAN), MGM-1, and DisPeL are distributed hill-climbing techniques for solving large Distributed Constraint Optimization Problems (DCOPs) such as distributed scheduling, resource allocation, and distributed route planning. Like their centralized counterparts, these algorithms employ escape techniques to avoid getting trapped in local minima during the search process. For example, the best known version of DSA, DSA-B, makes hill-climbing moves and lateral escape moves (moves that do not impact the solution quality) with a single probability p. DSAN uses a similar scheme, but also occasionally makes a move that leads to a worse solution in an effort to find a better overall solution. Although these escape moves tend to lead to better solutions in the end, the cost of employing the various strategies is often not well understood. In this work, we investigate the costs and benefits of the various escape strategies by empirically evaluating each of these protocols in distributed graph coloring and sensor tracking domains. Through our testing, we discovered that by reducing or eliminating escape moves, the cost of using these algorithms decreases dramatically without significantly affecting solution quality.
DOI: https://doi.org/10.1109/WI-IAT.2010.31
Published: 2010-08-31
Citations: 8
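The DSA-B move rule described in the abstract above can be sketched as a single-process simulation of graph coloring. This is an illustrative approximation under stated assumptions: agents act in synchronous rounds on a snapshot of their neighbors' colors, message passing is not modeled, and the `allow_lateral` switch, function names, and example graph are invented here to mirror the question the paper asks, not taken from it.

```python
import random


def conflicts_at(node, color, assignment, neighbors):
    """Number of neighbors of `node` that currently hold `color`."""
    return sum(1 for n in neighbors[node] if assignment[n] == color)


def total_conflicts(assignment, neighbors):
    """Number of edges whose two endpoints share a color."""
    per_node = (conflicts_at(v, assignment[v], assignment, neighbors) for v in neighbors)
    return sum(per_node) // 2  # every violated edge is seen from both ends


def dsa_b(neighbors, colors, p=0.5, rounds=50, seed=0, allow_lateral=True):
    """Simulate DSA-B style hill climbing for graph coloring."""
    rng = random.Random(seed)
    nodes = sorted(neighbors)
    assignment = {v: rng.choice(colors) for v in nodes}
    for _ in range(rounds):
        snapshot = dict(assignment)  # all agents decide on last round's values
        for v in nodes:
            cost = {c: conflicts_at(v, c, snapshot, neighbors) for c in colors}
            current = cost[snapshot[v]]
            best = min(cost.values())
            options = [c for c in colors if cost[c] == best and c != snapshot[v]]
            improving = best < current
            # Lateral escape move: same cost, taken only while a conflict remains.
            lateral = allow_lateral and best == current and current > 0
            if options and (improving or lateral) and rng.random() < p:
                assignment[v] = rng.choice(options)
    return assignment


# Hypothetical problem: color a triangle with three colors.
triangle = {"a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b"]}
coloring = dsa_b(triangle, colors=[0, 1, 2], p=0.5, rounds=20, seed=1)
remaining = total_conflicts(coloring, triangle)
```

Running the same problem with `allow_lateral=False` gives a pure hill climber, which is one simple way to compare solution quality with and without escape moves on a given instance.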
Muzk Mesh: Interlinking Semantic Music Data
M. Singhi, Ying Ding, Yuyin Sun
The vision of the Semantic Web is to lift the current Web into semantic repositories where heterogeneous data can be queried and different services can be mashed up. The Web becomes a platform for integrating data and services. The paper discusses the MuzkMesh music portal, which mashes up existing semantic music data from the Linked Open Data (LOD) bubbles and other common APIs. It aims to demonstrate the power of semantic integration and useful usage scenarios for music retrieval and entertainment.
DOI: https://doi.org/10.1109/WI-IAT.2010.162
Published: 2010-08-31
Citations: 1
Predicting Web Search Hit Counts
Tian Tian, J. Geller, Soon Ae Chun
Keyword-based search engines often return an unexpected number of results. Zero hits are naturally undesirable, while too many hits are likely to be overwhelming and of low precision. We present an approach for predicting the number of hits for a given set of query terms. Using word frequencies derived from a large corpus, we construct random samples of combinations of these words as search terms. Then we derive a correlation function between the computed probabilities of search terms and the observed hit counts for them. This regression function is used to predict the hit counts for a user's new searches, with the intention of avoiding information overload. We report the results of experiments with Google, Yahoo! and Bing to validate our methodology. We further investigate the monotonicity of search results for negative search terms by those three search engines.
DOI: https://doi.org/10.1109/WI-IAT.2010.227
Published: 2010-08-31
Citations: 7