Web Intelligence/IAT Workshops最新文献

英文中文

A Tolerance Rough Set Based Overlapping Clustering for the DBLP Data 基于容差粗糙集的DBLP数据重叠聚类

Web Intelligence/IAT Workshops

Pub Date : 2010-08-31 DOI: 10.1109/WI-IAT.2010.286

Gamila Obadi, Pavla Drázdilová, Lukas Hlavacek, J. Martinovič, V. Snás̃el

In the article there is presented comparison of overlapping clustering methods for data mining of DBLP datasets. For the analysis, the DBLP data sets were pre-processed, while each journal has been assigned attributes, defined by its topics. The data collection can be described as vague and uncertain; obtained clusters and applied queries do not necessarily have crisp boundaries. The authors presented clustering through a tolerance rough set method (TRSM) and fuzzy c-mean (FCM) algorithm for journal recommendation based on topic search. The comparison of both clustering methods was presented using different measures of similarity.

本文对DBLP数据集的重叠聚类方法进行了比较。为了进行分析，对DBLP数据集进行了预处理，同时为每个期刊分配了由其主题定义的属性。数据收集可以描述为模糊和不确定;获得的集群和应用的查询不一定有清晰的边界。作者提出了一种基于容忍粗糙集(TRSM)和模糊c均值(FCM)算法的聚类方法，用于基于主题搜索的期刊推荐。采用不同的相似性度量对两种聚类方法进行了比较。

引用次数: 8

Predicting Edges and Vertices in a Network 预测网络中的边和顶点

Web Intelligence/IAT Workshops

Pub Date : 2010-08-31 DOI: 10.1109/WI-IAT.2010.317

Walid K. Sharabati, E. Wegman, Yasmin H. Said

This paper addresses missing edges and vertices in a network. We discuss interchangeability and duality between vertices and edges in a graph. We use covariate information associated with vertices to estimate the probability of missing edges; likewise, we use covariate information associated with edges to estimate the probability of missing vertices. In order to predict missing vertices, we apply the line graph transformation, which converts edges to vertices and vertices to edges. The probability of an edge is obtained by taking the inner product of the vectors of covariates. Moreover, we have extended the methodology of predicting two edges (dyadic ties) to predict edge

本文讨论了网络中缺失的边和顶点。讨论了图中顶点和边的互换性和对偶性。我们使用与顶点相关的协变量信息来估计缺失边的概率;同样，我们使用与边相关的协变量信息来估计缺失顶点的概率。为了预测缺失的顶点，我们应用线形图变换，将边转换为顶点，将顶点转换为边。边的概率是通过取协变量向量的内积得到的。此外，我们还扩展了预测两条边的方法(并矢联系)来预测边

引用次数: 1

Tracing Strength of Relationships in Social Networks 追踪社会网络中关系的强度

Web Intelligence/IAT Workshops

Pub Date : 2010-08-31 DOI: 10.1109/WI-IAT.2010.241

Ivan Srba, M. Bieliková

Current web is known as a space with constantly growing interactivity among its users. It is changing from the data storage into a social interaction place where people not only search interesting information, but also communicate and collaborate. Obviously, social networks are the most used places for common interaction among people. We present a method for analysis of the strength of relationships together with their evolution. This method is based on the various user activities in social networks. We evaluate our approach within the Facebook social network.

当前的网络被认为是一个用户之间交互性不断增强的空间。它正在从数据存储转变为社交互动场所，人们不仅可以搜索有趣的信息，还可以进行交流和协作。显然，社交网络是人们之间最常用的互动场所。我们提出了一种分析关系强度及其演变的方法。该方法基于社交网络中的各种用户活动。我们在Facebook社交网络中评估我们的方法。

引用次数: 13

Careful Seeding Based on Independent Component Analysis for k-Means Clustering 基于独立分量分析的k均值聚类谨慎播种

Web Intelligence/IAT Workshops

Pub Date : 2010-08-31 DOI: 10.1109/WI-IAT.2010.102

T. Onoda, Miho Sakai, S. Yamada

The k-means method is a widely used clustering technique because of its simplicity and speed. However, the clustering result depends heavily on the chosen initial value. In this report, we propose a seeding method with independent component analysis for the k-means method. Using a benchmark dataset, we evaluate the performance of our proposed method and compare it with other seeding methods.

k-means方法是一种应用广泛的聚类方法，具有简单、快速等优点。然而，聚类结果在很大程度上取决于所选择的初始值。在本报告中，我们提出了一种具有独立分量分析的k-means方法的播种方法。使用基准数据集，我们评估了我们提出的方法的性能，并将其与其他播种方法进行了比较。

引用次数: 4

Content Propagation Analysis of E-mail Communications 电子邮件传播的内容传播分析

Web Intelligence/IAT Workshops

Pub Date : 2010-08-31 DOI: 10.1109/WI-IAT.2010.202

Naoki Yoshinaga, S. Itaya, Rie Tanaka, Taku Konishi, Shinichi Doi, Keiji Yamada, P. Davis

We analyze email communications within a large company to reveal how email activity patterns depend on content. We characterize email contents using keywords and examine statistics of email transmissions. As a result, we are able to identify differences in network structures and propagation behaviors depending on the type of keyword.

我们分析了一家大公司内部的电子邮件通信，以揭示电子邮件活动模式如何依赖于内容。我们使用关键字来描述电子邮件内容，并检查电子邮件传输的统计数据。因此，我们能够根据关键字的类型识别网络结构和传播行为的差异。

引用次数: 4

LiSTOMS: A Light-Weighted Self-Tuning Ontology Mapping System 一个轻量级的自调优本体映射系统

Web Intelligence/IAT Workshops

Pub Date : 2010-08-31 DOI: 10.1109/WI-IAT.2010.173

Zhen Zhen, Junyi Shen, Jinwei Zhao, J. Qian

We argue that it is more practical to address the ontology mapping self-tuning problem in a whole system context instead of in a single matcher context. In this paper we introduce RMOMS, a Reference Model for Ontology Mapping Systems, consisting of six parts, the Preprocessor, the Dispatcher, the Matcher(s), the Aggregator, the Pruner, and the User Interface, with which to disassemble the self-tuning problem into more feasible units. We propose Maximum Weight Bipartite Graph Matching method for self-tuning matchers and Stable Match method for self-tuning aggregator, and test them in LiSTOMS, a light-weighted prototype sample of RMOMS. With comparison with some notable systems, LiSTOMS shows leading recall rate and competing precision rate.

我们认为在整个系统上下文中解决本体映射自调优问题比在单个匹配器上下文中解决本体映射自调优问题更实际。本文介绍了本体映射系统的参考模型rmom，该模型由预处理器、调度器、匹配器、聚合器、修剪器和用户界面六部分组成，利用rmom可以将自调优问题分解成更可行的单元。我们提出了自调优匹配器的最大权值二部图匹配方法和自调优聚合器的稳定匹配方法，并在rmom的轻量级原型样本LiSTOMS中进行了测试。通过与一些著名系统的比较，listams显示出领先的召回率和竞争的准确率。

引用次数: 1

Towards Privacy Preserving Information Retrieval through Semantic Microaggregation 基于语义微聚合的隐私保护信息检索

Web Intelligence/IAT Workshops

Pub Date : 2010-08-31 DOI: 10.1109/WI-IAT.2010.132

Daniel Abril, G. Navarro-Arribas, V. Torra

In this paper we introduce the problem of providing privacy preserving information for Web indexing, classification, and other information retrieval task. Web pages are represented by a frequency term vector that preserves k-anonymity for all the Web pages. This vector can then be used, for example, to build indexes of classifiers. Our proposal makes use of semantic micro aggregation.

本文介绍了为Web索引、分类和其他信息检索任务提供隐私保护信息的问题。Web页面由一个频率项向量表示，该向量为所有Web页面保留k-匿名性。然后可以使用这个向量，例如，构建分类器的索引。我们的方案利用了语义微聚合。

引用次数: 6

Detecting Task-Based Query Sessions Using Collaborative Knowledge 使用协作知识检测基于任务的查询会话

Web Intelligence/IAT Workshops

Pub Date : 2010-08-31 DOI: 10.1109/WI-IAT.2010.281

C. Lucchese, S. Orlando, R. Perego, F. Silvestri, Gabriele Tolomei

Our research challenge is to provide a mechanism for splitting into user task-based sessions a long-term log of queries submitted to a Web Search Engine (WSE). The hypothesis is that some query sessions entail the concept of user task. We present an approach that relies on a centroid-based and a density-based clustering algorithm, which consider queries inter-arrival times and use a novel distance function that takes care of query lexical content and exploits the collaborative knowledge collected by Wiktionary and Wikipedia.

我们的研究挑战是提供一种机制，将提交给Web搜索引擎(WSE)的查询的长期日志拆分为基于用户任务的会话。假设某些查询会话包含用户任务的概念。我们提出了一种依赖于基于质心和基于密度的聚类算法的方法，该算法考虑了查询的到达时间，并使用了一种新的距离函数，该函数照顾查询词汇内容，并利用了维基百科和维基百科收集的协作知识。

引用次数: 6

Search Engine Result Aggregation Using Analytical Hierarchy Process 基于层次分析法的搜索引擎结果聚合

Web Intelligence/IAT Workshops

Pub Date : 2010-08-01 DOI: 10.1109/WI-IAT.2010.256

A. De, Elizabeth D. Diaz, Vijay V. Raghavan

A metasearch engine queries search engines and collates information returned by them in one result set for the user. Metasearch can be external or internal. In external metasearch, result lists from external, independent search engines are merged. On the other hand, in an internal metasearch, result lists from using different search algorithms on the same corpus are aggregated. Thus result merging is a key function of metasearch. In this work, we propose a model for result merging that is based on the Analytic Hierarchy Process and compares documents and search engines in pair-wise comparison before merging. Our model has the capability to merge result lists based on ranks as well as scores, as returned by search engines. We use the LETOR 2 (LEarning TO Rank) dataset from Microsoft Research Asia for our experiments. When using document ranks, our model improves by 31.60% and 8.58% over the Borda-Fuse and Weighted Borda-Fuse models respectively. When using document scores the improvements are 42.92% and 18.03% respectively.

元搜索引擎查询搜索引擎，并将它们返回的信息整理成一个结果集供用户使用。元搜索可以是外部的也可以是内部的。在外部元搜索中，来自外部独立搜索引擎的结果列表被合并。另一方面，在内部元搜索中，在同一语料库上使用不同搜索算法的结果列表被聚合。因此，结果合并是元搜索的一个关键功能。在这项工作中，我们提出了一个基于层次分析法的结果合并模型，并在合并之前对文档和搜索引擎进行配对比较。我们的模型具有根据搜索引擎返回的排名和分数合并结果列表的能力。我们使用微软亚洲研究院的LETOR 2 (LEarning TO Rank)数据集进行实验。当使用文档排名时，我们的模型分别比Borda-Fuse和加权Borda-Fuse模型提高了31.60%和8.58%。使用文献评分时，提高率分别为42.92%和18.03%。

引用次数: 7

Chaotic Analysis on E-government System Development 电子政务系统发展的混沌分析

Web Intelligence/IAT Workshops

Pub Date : 1900-01-01 DOI: 10.1109/WI-IAT.2010.107

Yanzhang Wang, Guirong Xiao, Shengju Han

Based on the macro MART model of an organizaion’s informationalization development, and with further application of system dynamics principles, the system dynamics behavior description of MART model and the corresponding calculation method of Jacobian matrix and the characteristic value are offerred. The related balance points or singular points, along with such chaotic features as attractor, saddle point, repulsion point and dissipation, are also analyzed. It is further proved that e-government system development is highly complicated and unstable, the development of information technology does not depend on the e-government system development and the functional level of e-government’s management is the most critical element.

基于组织信息化发展的宏观MART模型，进一步应用系统动力学原理，给出了MART模型的系统动力学行为描述以及相应的雅可比矩阵和特征值的计算方法。分析了相关的平衡点或奇异点，以及吸引点、鞍点、斥力点和耗散等混沌特征。进一步证明了电子政务系统的发展具有高度的复杂性和不稳定性，信息技术的发展并不依赖于电子政务系统的发展，电子政务管理的功能水平是最关键的因素。

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Web Intelligence/IAT Workshops

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀