
2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology: Latest Papers

Mining Fuzzy Domain Ontology Based on Concept Vector from Wikipedia Category Network
Cheng-Yu Lu, Shou-Wei Ho, Jen-Ming Chung, Fu-Yuan Hsu, Hahn-Ming Lee, Jan-Ming Ho
Ontology is essential in the formalization of domain knowledge for effective human-computer interactions (e.g., expert finding). Many researchers have proposed approaches to measure the similarity between concepts by accessing a fuzzy domain ontology. However, constructing domain ontologies is labor-intensive and tedious. In this paper, we propose an approach to mine domain concepts from the Wikipedia Category Network, and to generate the fuzzy relation based on a concept vector extraction method that measures the relatedness between a single term and a concept. Our methodology can conceptualize domain knowledge by mining the Wikipedia Category Network. An empirical experiment is conducted to evaluate the robustness of the approach using a TREC dataset. Experimental results show that the proposed approach yields a robust fuzzy domain ontology with satisfactory accuracy in information retrieval tasks.
DOI: https://doi.org/10.1109/WI-IAT.2011.140; published 2011-08-22
Citations: 8
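The abstract above describes measuring the relatedness between a term and a concept through concept vectors but does not give the formula. As a rough illustration only, the sketch below assumes bag-of-words vectors and cosine similarity, which yields a value in [0, 1] that could serve as a fuzzy membership degree. The function names `concept_vector` and `relatedness` are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def concept_vector(texts):
    """Build a term-frequency vector from texts describing a concept.

    Assumption: plain whitespace tokenization; the paper's actual
    concept vector extraction method is not specified in the abstract.
    """
    counts = Counter()
    for text in texts:
        counts.update(text.lower().split())
    return counts


def relatedness(term_vec, concept_vec):
    """Cosine similarity between two non-negative sparse vectors.

    The result lies in [0, 1], so it can be read as a fuzzy degree.
    """
    dot = sum(w * concept_vec.get(t, 0.0) for t, w in term_vec.items())
    norm_a = math.sqrt(sum(w * w for w in term_vec.values()))
    norm_b = math.sqrt(sum(w * w for w in concept_vec.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

A term would be represented by a vector built the same way (for example from the pages in which it occurs) and compared against each candidate concept.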
Understanding Crowds' Migration on the Web
Yong Wang, Komal Pal, A. Kuzmanovic
Consider a network where nodes are websites and the weight of a link that connects two nodes corresponds to the average number of users that visit both websites over longer timescales. Such a user-driven Web network is not only invaluable for understanding how crowds' interests collectively spread on the Web, but also useful for applications such as advertising or search. In this paper, we construct such a network by 'putting together' pieces of information publicly available from popular analytics websites. Our contributions are threefold. First, we design a crawler and a normalization methodology that enable us to construct a user-driven Web network based on limited publicly available information, and validate the high accuracy of our approach. Second, we evaluate the unique properties of our network, and demonstrate that it exhibits small-world, seed-free, and scale-free phenomena. Finally, we build an application, the website selector, on top of the user-driven network. The core concept utilized in the website selector is that by exploiting the knowledge that a number of websites share a number of common users, an advertiser might prefer displaying his ads only on a subset of these websites to optimize the budget allocation, and in turn increase the visibility of his ads on other websites. Our website selector system is tailored for ad commissioners and could be easily embedded in their ad selection algorithms.
DOI: https://doi.org/10.1109/WI-IAT.2011.40; published 2011-08-22
Citations: 1
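The network defined in the abstract above (websites as nodes, shared users as link weights) can be sketched in a few lines. Note the strong assumption: the sketch starts from per-user visit sets, which the authors did not have; they reconstructed the weights from public analytics data using a crawler and a normalization step that are not reproduced here. The name `build_covisit_network` is hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_covisit_network(visits):
    """Return {(site_a, site_b): number of users who visited both}.

    visits: dict mapping a user id to the set of websites visited.
    Each undirected link is keyed by the alphabetically sorted pair.
    """
    weights = defaultdict(int)
    for sites in visits.values():
        for a, b in combinations(sorted(sites), 2):
            weights[(a, b)] += 1
    return dict(weights)
```

Averaging such counts over several observation windows would give the "average number of users over longer timescales" that the abstract mentions.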
Evaluating the Trade-Offs between Diversity and Precision for Web Image Search Using Concept-Based Query Expansion
Enamul Hoque, O. Hoeber, Minglun Gong
Even though Web image search queries are often ambiguous, traditional search engines retrieve and present results solely based on relevance ranking, where only the most common and popular interpretations of the query are considered. Rather than assuming that all users are interested in the most common meaning of the query, a more sensible approach may be to produce a diversified set of images that cover the various aspects of the query, under the expectation that at least one of these interpretations will match the searcher's needs. However, such a promotion of diversity in the search results has the side effect of decreasing the precision of the most common sense. In this paper, we evaluate this trade-off in the context of a method for explicitly diversifying image search results via concept-based query expansion using Wikipedia. Experiments with controlling the degree of diversification illustrate this balance between diversity and precision for both ambiguous and specific queries. The ultimate goal of this research is to propose an automatic method for tuning the diversification parameter based on the degree of ambiguity of the original query.
DOI: https://doi.org/10.1109/WI-IAT.2011.11; published 2011-08-22
Citations: 9
Approaches to Relating and Integrating Semantic Data from Heterogeneous Sources
J. Keeney, Aidan Boran, Ivan Bedini, C. Matheus, P. Patel-Schneider
Integrating and relating heterogeneous data using inference is one of the cornerstones of semantic technologies and there are a variety of ways in which this may be achieved. Cross-source relationships can be automatically translated or inferred using the axioms of RDFS/OWL, via user-generated rules, or as the result of SPARQL query result transformations. For a given problem it is not always obvious which approach (or combination of approaches) will be the most effective, and few guidelines exist for making this choice. This paper discusses these three approaches and demonstrates them using an "acquaintance" relationship drawn from data residing in common RDF information sources such as FOAF and DBLP data stores. The implementation of each approach is described along with practical considerations for their use. Quantitative and qualitative evaluation results of each approach are presented and the paper concludes with initial suggestions for guiding principles to help in selecting an appropriate approach for integrating heterogeneous semantic data sources.
DOI: https://doi.org/10.1109/WI-IAT.2011.129; published 2011-08-22
Citations: 10
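To make the rule-based option in the abstract above more concrete, here is a minimal sketch of deriving an "acquaintance" relation from two kinds of source triples. Everything specific is an assumption for illustration: triples are plain Python tuples rather than a real RDF store, `ex:acquaintanceOf` is a made-up predicate, co-authorship is expressed with `dc:creator`, and `foaf:knows` is treated as symmetric. The paper's actual rules are not given in the abstract.

```python
from collections import defaultdict
from itertools import permutations

ACQ = "ex:acquaintanceOf"  # hypothetical predicate for the inferred relation


def infer_acquaintances(triples):
    """Apply two illustrative rules to (subject, predicate, object) triples.

    Rule 1: co-authors of the same publication are acquaintances.
    Rule 2: a foaf:knows link implies acquaintance in both directions
            (symmetry is an assumption made for this sketch).
    """
    authors_of = defaultdict(set)
    inferred = set()
    for s, p, o in triples:
        if p == "dc:creator":
            authors_of[s].add(o)
        elif p == "foaf:knows":
            inferred.add((s, ACQ, o))
            inferred.add((o, ACQ, s))
    for authors in authors_of.values():
        for a, b in permutations(authors, 2):
            inferred.add((a, ACQ, b))
    return inferred
```

The same relation could instead be produced by OWL axioms or by a SPARQL CONSTRUCT query, which are the other two approaches the paper compares.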
Examining Individual and System Level Effects of AmI Traffic Environments
C. Maag, Christian Mark, H. Krüger, M. Fullerton, F. Busch, A. Leonhardt
Drivers on the road are constantly interacting. The development of new sensor, display and communication systems has laid the basis for ambient intelligence (AmI) technology in road traffic. The effects of such devices on driver behavior and emotional response have to be analyzed, e.g. by using driving simulator studies. By implementing the results of these studies into traffic simulation and "scaling up", emerging effects on the system level can be investigated, e.g. on emotional climate, road safety, and traffic flow. This paper describes an environment for the analysis of AmI systems applied to merging points on highways. Results from an advanced driving simulator study are presented and show positive effects of such a system on anger in driving. The implications of such a device on the system level of road traffic can be investigated by using the instrument of traffic simulation. This is illustrated by evaluating the anger present in the traffic simulator interactions using the results from the driving simulator experiment.
DOI: https://doi.org/10.1109/WI-IAT.2011.214; published 2011-08-22
Citations: 1
SemTagP: Semantic Community Detection in Folksonomies
Guillaume Erétéo, Fabien L. Gandon, M. Buffa
Building on our results on semantic social network analysis, we present a community detection algorithm, SemTagP, that takes advantage of the semantic data that were captured while structuring the RDF graphs of social networks. SemTagP not only detects but also labels communities by exploiting (in addition to the structure of the social graph) the tags used by people during the social tagging process as well as the semantic relations inferred between tags. In doing so, we are able to refine the partitioning of the social graph with semantic processing and to label the activity of detected communities. We tested and evaluated this algorithm on the social network built from Ph.D. theses funded by ADEME, the French Environment and Energy Management Agency. We showed how this approach allows us to detect and label communities of interest and control the precision of the labels.
DOI: https://doi.org/10.1109/WI-IAT.2011.98; published 2011-08-22
Citations: 35
Modelling of Complex Systems with AML as Realized in MIRO Project
Sébastien Chipeaux, F. Bouquet, C. Lang, N. Marilleau
In this paper, we propose a modeling approach for a spatial complex system. The targeted system is the city with its mobility patterns. The goal of the MIRO project is to study service accessibility in the city. We simulate the city with multi-agent systems, using them to represent each part of the system (individuals, buildings, streets, etc.). The MIRO team is composed of scientists from several domains (computer science, geography, or economics), so we want to construct a model of the city to share knowledge of each domain. As a next step, we will use a verification approach to validate the model and then the simulator, because we want to generate the simulator from the model. Thus, we propose a method for modeling such a complex system. This method is based on AML (Agent Modeling Language), a language well adapted for modeling multi-agent systems. We then present a spatial AML meta-model coupled with a method. The use case is the MIRO project.
DOI: https://doi.org/10.1109/WI-IAT.2011.195; published 2011-08-22
Citations: 11
Independent Component Analysis Based Seeding Method for K-Means Clustering
T. Onoda, Miho Sakai, S. Yamada
The k-means clustering method is a widely used clustering technique for the Web because of its simplicity and speed. However, the clustering result depends heavily on the chosen initial cluster centers, which are chosen uniformly at random from the data points. We propose a seeding method based on independent component analysis for the k-means clustering method. We evaluate the performance of our proposed method and compare it with other seeding methods by using benchmark datasets. We applied our proposed method to a Web corpus provided by ODP. The experiments show that the normalized mutual information of our proposed method is higher than that of the k-means and k-means++ clustering methods. Therefore, the proposed method is useful for Web corpora.
DOI: https://doi.org/10.1109/WI-IAT.2011.29; published 2011-08-22
Citations: 11
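For context on the seeding problem raised in the abstract above, the sketch below shows k-means++ seeding, one of the baselines the authors compare against. It is not the ICA-based method proposed in the paper, whose details are not given in the abstract. Each new center is drawn with probability proportional to the squared distance to the nearest center already chosen. The function names and the `seed` parameter are choices made for this illustration.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_pp_seeds(points, k, seed=0):
    """Pick k initial centers from points using k-means++ (D^2) sampling."""
    rng = random.Random(seed)
    centers = [rng.choice(points)]  # first center: uniform at random
    while len(centers) < k:
        # Squared distance from each point to its nearest chosen center.
        d2 = [min(squared_distance(p, c) for c in centers) for p in points]
        if sum(d2) == 0:
            # Every point coincides with a center; fall back to uniform.
            centers.append(rng.choice(points))
        else:
            centers.append(rng.choices(points, weights=d2, k=1)[0])
    return centers
```

Plain k-means corresponds to replacing the weighted draw with `rng.choice(points)` every time, which is the uniform initialization the abstract criticizes.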
Evaluation of a Semi-automated Semantic Annotation Approach for Bootstrapping the Analysis of Large-Scale Web Service Networks
Shahab Mokarizadeh, Peep Küngas, M. Matskin
In recent years many methods have been proposed that require semantic annotations of Web services as an input. Such methods include discovery, match-making, composition, and execution of Web services in dynamic settings, to mention just a few. At the same time, automated Web service annotation approaches have been proposed to support the application of these methods in settings where it is not feasible to provide the annotations manually. However, the lack of effective automated evaluation frameworks has seriously limited proper evaluation of the constructed annotations in practical settings where the overall annotation quality of millions of Web services needs to be evaluated. This paper describes an evaluation framework for measuring the quality of semantic annotations of a large number of Web service descriptions provided in the form of WSDL and XSD documents. The evaluation framework is based on analyzing network properties, namely scale-free and small-world properties, of Web service networks, which in turn have been constructed from semantic annotations of Web services. The evaluation approach is demonstrated through evaluation of a semi-automated annotation approach, which was applied to a set of publicly available WSDL documents describing altogether ca. 200,000 Web service operations.
DOI: https://doi.org/10.1109/WI-IAT.2011.237; published 2011-08-22
Citations: 10
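The abstract above relies on scale-free and small-world properties of a network. As a hedged illustration of the kind of measurements involved, the sketch below computes a degree distribution (a heavy tail suggests scale-free behaviour) and the average local clustering coefficient (high clustering together with short paths suggests a small world). The adjacency-dict representation and function names are assumptions; how the authors built their Web service network from annotations is not shown here.

```python
from collections import Counter


def degree_distribution(adj):
    """Map each degree value to the number of nodes having that degree.

    adj: dict mapping a node to the set of its neighbours (undirected).
    """
    return Counter(len(nbrs) for nbrs in adj.values())


def average_clustering(adj):
    """Mean local clustering coefficient; nodes with degree < 2 count as 0."""
    if not adj:
        return 0.0
    total = 0.0
    for nbrs in adj.values():
        k = len(nbrs)
        if k < 2:
            continue
        nbr_list = list(nbrs)
        # Count edges that exist between pairs of neighbours.
        links = sum(
            1
            for i in range(k)
            for j in range(i + 1, k)
            if nbr_list[j] in adj[nbr_list[i]]
        )
        total += 2.0 * links / (k * (k - 1))
    return total / len(adj)
```

A full small-world check would also need the average shortest path length, which is omitted to keep the sketch short.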
Visualization-Based Support of Hypothesis Verification for Research Survey with Co-authorship Networks
Takeshi Kurosawa, Y. Takama
This paper examines the effectiveness of a visualization system for gaining insight into future research activities from co-authorship networks. A co-authorship network is an important source of information when conducting a research survey. In particular, many survey requests relate to researchers' future activities, such as the identification of growing researchers and supervisors. In a previous paper, we proposed a visualization system for co-authorship networks that provides functions for identifying research areas and for identifying temporal variation in both network structure and keyword distribution. This paper examines its effectiveness through field trials with test participants. The results are examined as a process of hypothesis verification, which shows that test participants could perform the task even though they had no background knowledge of InfoVis.
DOI: 10.1109/WI-IAT.2011.121 (https://doi.org/10.1109/WI-IAT.2011.121), published 2011-08-22
Citations: 1
Journal: 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology