首页 > 最新文献

IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)最新文献

英文 中文
Social Networks and the Semantic Web 社交网络和语义网
Pub Date : 2007-09-18 DOI: 10.1109/WI.2004.128
P. Mika
A formal, web-based representation of social networks is both a necessity in terms of infrastructure as well as a prominent application for the Semantic Web. In this paper we present three advances in exploiting the opportunity of semantically-enriched network data: (1) an ontology for the representation of social networks and relationships (2) a hybrid system for online data acquisition that combines traditional web mining techniques with the collection of Semantic Web data (2) a case study highlighting some of the possible analysis of this data using methods from Social Network Analysis, the branch of sociology concerned with relational data.
正式的、基于Web的社交网络表示形式既是基础设施的必要条件,也是语义Web的重要应用程序。在本文中,我们提出了利用语义丰富的网络数据机会的三个进展:(1)表示社会网络和关系的本体;(2)将传统网络挖掘技术与语义网络数据收集相结合的在线数据采集混合系统;(2)一个案例研究,强调了使用社会网络分析方法对这些数据进行一些可能的分析,社会网络分析是社会学的一个分支,与关系数据有关。
{"title":"Social Networks and the Semantic Web","authors":"P. Mika","doi":"10.1109/WI.2004.128","DOIUrl":"https://doi.org/10.1109/WI.2004.128","url":null,"abstract":"A formal, web-based representation of social networks is both a necessity in terms of infrastructure as well as a prominent application for the Semantic Web. In this paper we present three advances in exploiting the opportunity of semantically-enriched network data: (1) an ontology for the representation of social networks and relationships (2) a hybrid system for online data acquisition that combines traditional web mining techniques with the collection of Semantic Web data (2) a case study highlighting some of the possible analysis of this data using methods from Social Network Analysis, the branch of sociology concerned with relational data.","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128934296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 325
A Weighted Freshness Metric for Maintaining Search Engine Local Repository 一种用于维护搜索引擎本地资源库的加权新鲜度度量
Pub Date : 2004-09-20 DOI: 10.1109/WI.2004.17
Jianchao Han, N. Cercone, Xiaohua Hu
Current search engines maintain a local repository to improve the search efficiency. A crawler is used to periodically poll the remote web pages to update the contents of the local repository. Due to the resource limitations, some local pages may be stale. To maintain the high freshness of the repository, the crawler is expected to revisit remote web pages in optimized order and frequency. The intuitive metric of freshness of the local repository is defined as the fraction of up-to-date web pages in the repository, which is merely based on the repository content, and does not, unfortunately, reflect the perspective of the search engine users, e.g., how often is a web page queried? We propose a novel weighted metric of the repository freshness with the importance of web pages being the weights. This metric not only takes into account the local web pages themselves but also the perspectives of the search engine users. We study the repository synchronization policy under this new metric, compare this metric with others, analyze its features, and discuss how the web page importance is determined.
当前的搜索引擎维护本地存储库以提高搜索效率。爬虫用于定期轮询远程网页以更新本地存储库的内容。由于资源限制,某些本地页面可能已经过时。为了保持存储库的高新鲜度,期望爬虫以优化的顺序和频率重新访问远程web页面。本地存储库的新鲜度的直观度量被定义为存储库中最新网页的比例,它仅仅基于存储库内容,不幸的是,它并没有反映搜索引擎用户的观点,例如,一个网页被查询的频率有多高?我们提出了一种新的以网页重要性为权重的资源库新鲜度加权度量。这个指标不仅考虑了本地网页本身,还考虑了搜索引擎用户的观点。我们研究了该度量标准下的存储库同步策略,并与其他度量标准进行了比较,分析了其特点,讨论了如何确定网页的重要性。
{"title":"A Weighted Freshness Metric for Maintaining Search Engine Local Repository","authors":"Jianchao Han, N. Cercone, Xiaohua Hu","doi":"10.1109/WI.2004.17","DOIUrl":"https://doi.org/10.1109/WI.2004.17","url":null,"abstract":"Current search engines maintain a local repository to improve the search efficiency. A crawler is used to periodically poll the remote web pages to update the contents of the local repository. Due to the resource limitations, some local pages may be stale. To maintain the high freshness of the repository, the crawler is expected to revisit remote web pages in optimized order and frequency. The intuitive metric of freshness of the local repository is defined as the fraction of up-to-date web pages in the repository, which is merely based on the repository content, and does not, unfortunately, reflect the perspective of the search engine users, e.g., how often is a web page queried? We propose a novel weighted metric of the repository freshness with the importance of web pages being the weights. This metric not only takes into account the local web pages themselves but also the perspectives of the search engine users. We study the repository synchronization policy under this new metric, compare this metric with others, analyze its features, and discuss how the web page importance is determined.","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115628559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Full-Coverage Web Prediction based on Web Usage Mining and Site Topology 基于Web使用挖掘和站点拓扑的全覆盖Web预测
Pub Date : 2004-09-20 DOI: 10.1109/WI.2004.71
Diamanto Oikonomopoulou, Maria Rigou, S. Sirmakessis, A. Tsakalidis
Understanding and modeling user online behavior, as well as predicting future requests remain an open challenge for researchers, analysts and marketers. In this paper, we propose an efficient prediction schema based on the extraction of sequential navigation patterns from server log files, combined with web site topology. Traversed paths are monitored, internally recorded and cleaned before being completed with cashed page views. After session and episode identification follows the construction of n-grams. Prediction is based upon a 5 + n-gram schema with all lower level n-grams participating, a procedure that resembles the construction of an All 5th-order Markov Model. The schema achieves full coverage while maintaining competitive prediction precision.
理解和模拟用户的在线行为,以及预测未来的需求仍然是研究人员、分析师和营销人员面临的一个公开挑战。本文提出了一种基于从服务器日志文件中提取顺序导航模式并结合网站拓扑结构的高效预测模式。在使用兑现的页面视图完成之前,将对遍历的路径进行监视、内部记录和清理。会话和情节之后的识别遵循n-gram的构建。预测基于5 + n-gram模式,所有较低级别的n-gram都参与其中,这一过程类似于全5阶马尔可夫模型的构建。该模式实现了全覆盖,同时保持了具有竞争力的预测精度。
{"title":"Full-Coverage Web Prediction based on Web Usage Mining and Site Topology","authors":"Diamanto Oikonomopoulou, Maria Rigou, S. Sirmakessis, A. Tsakalidis","doi":"10.1109/WI.2004.71","DOIUrl":"https://doi.org/10.1109/WI.2004.71","url":null,"abstract":"Understanding and modeling user online behavior, as well as predicting future requests remain an open challenge for researchers, analysts and marketers. In this paper, we propose an efficient prediction schema based on the extraction of sequential navigation patterns from server log files, combined with web site topology. Traversed paths are monitored, internally recorded and cleaned before being completed with cashed page views. After session and episode identification follows the construction of n-grams. Prediction is based upon a 5 + n-gram schema with all lower level n-grams participating, a procedure that resembles the construction of an All 5th-order Markov Model. The schema achieves full coverage while maintaining competitive prediction precision.","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116922502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
The Abstract Syntax of RuleML - Towards a General Web Rule Language Framework RuleML的抽象语法——走向一个通用的Web规则语言框架
Pub Date : 2004-09-20 DOI: 10.1109/WI.2004.134
Gerd Wagner, G. Antoniou, Said Tabet, H. Boley
This paper discusses the approach taken by the Rule Markup Language (RuleML) Initiative towards a general Web rule language framework and relates it to the MDA and UML by the Object Management Group (OMG). It also presents the abstract syntax of RuleML 0.85 as a MOF/UML model and considers the possibility to integrate RuleML with OCL and Action Semantics.
本文讨论了规则标记语言(RuleML)倡议对通用Web规则语言框架所采取的方法,并将其与对象管理组(OMG)的MDA和UML联系起来。它还将RuleML 0.85的抽象语法表示为MOF/UML模型,并考虑了将RuleML与OCL和动作语义集成的可能性。
{"title":"The Abstract Syntax of RuleML - Towards a General Web Rule Language Framework","authors":"Gerd Wagner, G. Antoniou, Said Tabet, H. Boley","doi":"10.1109/WI.2004.134","DOIUrl":"https://doi.org/10.1109/WI.2004.134","url":null,"abstract":"This paper discusses the approach taken by the Rule Markup Language (RuleML) Initiative towards a general Web rule language framework and relates it to the MDA and UML by the Object Management Group (OMG). It also presents the abstract syntax of RuleML 0.85 as a MOF/UML model and considers the possibility to integrate RuleML with OCL and Action Semantics.","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125953307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 53
Local vs Global Policies and Centralized vs Decentralized Control in Virtual Communities of Agents 代理虚拟社区中的局部与全局策略、集中与分散控制
Pub Date : 2004-09-20 DOI: 10.1109/WI.2004.89
G. Boella, Leendert van der Torre
We are interested in the design of policies for virtual communities of agents based on the grid infrastructure. In a virtual community agents can play both the role of resource consumers and the role of resource providers, and they remain in control of their resources. We argue that this requirement creates a distinction between two dimensions: global vs local and centralized and decentralized control by means of policies. The providers should be enabled to specify their local policies on their own resources, but their policies should be consistent with the global policies. At the same time, some aspects of the decentralized control should be delegated to specialized providers; this delegation requires a distinction between the authorization to access a resource and a permission to do so.
我们感兴趣的是基于网格基础设施的虚拟代理社区的策略设计。在虚拟社区中,代理既可以扮演资源消费者的角色,也可以扮演资源提供者的角色,并且它们仍然控制着自己的资源。我们认为,这一要求在两个维度之间产生了区别:通过政策手段进行的全球与本地以及集中和分散控制。提供者应该能够在他们自己的资源上指定他们的本地策略,但是他们的策略应该与全局策略一致。同时,分散控制的某些方面应委托给专门的提供者;此委托要求在访问资源的授权和访问资源的权限之间进行区分。
{"title":"Local vs Global Policies and Centralized vs Decentralized Control in Virtual Communities of Agents","authors":"G. Boella, Leendert van der Torre","doi":"10.1109/WI.2004.89","DOIUrl":"https://doi.org/10.1109/WI.2004.89","url":null,"abstract":"We are interested in the design of policies for virtual communities of agents based on the grid infrastructure. In a virtual community agents can play both the role of resource consumers and the role of resource providers, and they remain in control of their resources. We argue that this requirement creates a distinction between two dimensions: global vs local and centralized and decentralized control by means of policies. The providers should be enabled to specify their local policies on their own resources, but their policies should be consistent with the global policies. At the same time, some aspects of the decentralized control should be delegated to specialized providers; this delegation requires a distinction between the authorization to access a resource and a permission to do so.","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"187 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126028006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
MICE: Aggregating and Classifying Meta Search Results into Self-Customized Categories MICE:聚合和分类元搜索结果到自定义类别
Pub Date : 2004-09-20 DOI: 10.1109/WI.2004.96
Saravadee Sae Tan, Gan Keng Hoon, E. Tang, Cheong Sook Lin, Chan Siew Lin, Foo Wen Ying
Having broad coverage of search results returned by various search sources, combining and organizing these results in a meaningful way has become a common issue in the field of information retrieval. In this demo paper, we describe our meta search system, MICE, that is able to aggregate and classify search results based on user-customized categories. Categories help user to focus on search results, with respect to the categories concept customized by the user.
广泛覆盖各种搜索源返回的搜索结果,以有意义的方式对这些结果进行组合和组织已成为信息检索领域的普遍问题。在这篇演示论文中,我们描述了我们的元搜索系统MICE,它能够根据用户自定义的类别对搜索结果进行聚合和分类。类别帮助用户关注搜索结果,相对于用户自定义的类别概念。
{"title":"MICE: Aggregating and Classifying Meta Search Results into Self-Customized Categories","authors":"Saravadee Sae Tan, Gan Keng Hoon, E. Tang, Cheong Sook Lin, Chan Siew Lin, Foo Wen Ying","doi":"10.1109/WI.2004.96","DOIUrl":"https://doi.org/10.1109/WI.2004.96","url":null,"abstract":"Having broad coverage of search results returned by various search sources, combining and organizing these results in a meaningful way has become a common issue in the field of information retrieval. In this demo paper, we describe our meta search system, MICE, that is able to aggregate and classify search results based on user-customized categories. Categories help user to focus on search results, with respect to the categories concept customized by the user.","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"175 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123738968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Knowledge on the Web: Making Web Services Knowledge-Aware Web上的知识:使Web服务具有知识意识
Pub Date : 2004-09-20 DOI: 10.1109/WI.2004.87
A. Cuzzocrea
Knowledge personalization is currently the most investigated issue in the context of service-oriented systems on the Web. Knowledge representation and management are the critical issues for knowledge personalization, and actually are currently being widely investigated, mainly due to the explosion of data modeling technologies such as XML and XML Schema. Despite some progress, a widely approved standard for delivering knowledge is still missing. In this paper we propose a new approach for representing, managing, and delivering knowledge on the Web and the correspondent framework, called Distributed Knowledge Networks (DKN), that implements it. We also provide a reference architecture for DKN and some experimental results about knowledge personalization.
在面向服务的Web系统中,知识个性化是目前研究最多的问题。知识表示和管理是知识个性化的关键问题,目前正在得到广泛的研究,这主要是由于数据建模技术(如XML和XML Schema)的爆发。尽管取得了一些进展,但仍然缺少一个广泛认可的知识传递标准。在本文中,我们提出了一种在Web上表示、管理和传递知识的新方法,以及相应的框架,称为分布式知识网络(DKN),它实现了这种方法。本文还提供了一个知识个性化的参考体系结构和一些关于知识个性化的实验结果。
{"title":"Knowledge on the Web: Making Web Services Knowledge-Aware","authors":"A. Cuzzocrea","doi":"10.1109/WI.2004.87","DOIUrl":"https://doi.org/10.1109/WI.2004.87","url":null,"abstract":"Knowledge personalization is currently the most investigated issue in the context of service-oriented systems on the Web. Knowledge representation and management are the critical issues for knowledge personalization, and actually are currently being widely investigated, mainly due to the explosion of data modeling technologies such as XML and XML Schema. Despite some progress, a widely approved standard for delivering knowledge is still missing. In this paper we propose a new approach for representing, managing, and delivering knowledge on the Web and the correspondent framework, called Distributed Knowledge Networks (DKN), that implements it. We also provide a reference architecture for DKN and some experimental results about knowledge personalization.","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125351880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Using WI Technologies to Develop Intelligent Portals - Research Activities at the WIC Japan Center - 利用WI技术开发智能门户——WIC日本中心的研究活动
Pub Date : 2004-09-20 DOI: 10.1109/WI.2004.156
N. Zhong
The objective of the WIC Japan Research Centre is to carry out basic research concerning certain aspects of Web Intelligence (WI). Our research activities focus on investigating WI technologies for developing various portals that enable intelligence for e-science, e-business, e-government, and e-learning, as well as deal with the scalability and complexity of real world, efficiently and effectively. We observe that developing intelligent portals is one of the most sophisticated applications, which needs to be supported by WI technologies. Research work that has been carried out can be categorized and described as follows.
WIC日本研究中心的目标是开展有关Web智能(WI)某些方面的基础研究。我们的研究活动集中于研究WI技术,用于开发各种门户,这些门户为电子科学、电子商务、电子政务和电子学习提供智能,并高效地处理现实世界的可伸缩性和复杂性。我们注意到,开发智能门户是最复杂的应用程序之一,它需要WI技术的支持。已经开展的研究工作可以分类和描述如下。
{"title":"Using WI Technologies to Develop Intelligent Portals - Research Activities at the WIC Japan Center -","authors":"N. Zhong","doi":"10.1109/WI.2004.156","DOIUrl":"https://doi.org/10.1109/WI.2004.156","url":null,"abstract":"The objective of the WIC Japan Research Centre is to carry out basic research concerning certain aspects of Web Intelligence (WI). Our research activities focus on investigating WI technologies for developing various portals that enable intelligence for e-science, e-business, e-government, and e-learning, as well as deal with the scalability and complexity of real world, efficiently and effectively. We observe that developing intelligent portals is one of the most sophisticated applications, which needs to be supported by WI technologies. Research work that has been carried out can be categorized and described as follows.","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"112 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122824097","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Extracting Precise Link Context Using NLP Parsing Technique 利用NLP解析技术提取精确的链接上下文
Pub Date : 2004-09-20 DOI: 10.1109/WI.2004.68
Qingyang Xu, Wanli Zuo
Link context has been exploited extensively ever since the advent of the World Wide Web, but the approach to extracting precise link context has not been fully explored and many state-of-the-art extraction methods are based on simplistic heuristics and require ad-hoc parameters. In this paper, we propose a novel two-step extraction model, which aims to systematically derive link context of quality as high as anchor text. In the macroscopic analysis step, a systematic web page structure analysis is performed to locate the content cohesive text region and potential relevant header or header like tags. In the microscopic extraction step, an English parser is used to extract the relevant sentence fragments in the text region and the nearest heading text is encompassed if the need arises. Preliminary experimental results proved our approach's effectiveness.
自从万维网出现以来,链接上下文就被广泛利用,但是提取精确链接上下文的方法还没有得到充分的探索,许多最先进的提取方法都是基于简单的启发式方法,需要特别的参数。在本文中,我们提出了一种新的两步提取模型,该模型旨在系统地获得质量与锚文本一样高的链接上下文。在宏观分析步骤中,进行系统的网页结构分析,定位内容内聚文本区域和潜在的相关标头或类标头标签。在微观提取步骤中,使用英语解析器提取文本区域中相关的句子片段,并在需要时包含最近的标题文本。初步实验结果证明了该方法的有效性。
{"title":"Extracting Precise Link Context Using NLP Parsing Technique","authors":"Qingyang Xu, Wanli Zuo","doi":"10.1109/WI.2004.68","DOIUrl":"https://doi.org/10.1109/WI.2004.68","url":null,"abstract":"Link context has been exploited extensively ever since the advent of the World Wide Web, but the approach to extracting precise link context has not been fully explored and many state-of-the-art extraction methods are based on simplistic heuristics and require ad-hoc parameters. In this paper, we propose a novel two-step extraction model, which aims to systematically derive link context of quality as high as anchor text. In the macroscopic analysis step, a systematic web page structure analysis is performed to locate the content cohesive text region and potential relevant header or header like tags. In the microscopic extraction step, an English parser is used to extract the relevant sentence fragments in the text region and the nearest heading text is encompassed if the need arises. Preliminary experimental results proved our approach's effectiveness.","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129548754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
MumbleSearch Extraction of High Quality Web information for SME MumbleSearch为中小企业提取高质量的网络信息
Pub Date : 2004-09-20 DOI: 10.1109/WI.2004.102
N. Baldini, M. Gori, Marco Maggini
Although search engines are playing a crucial role for the retrieval of information from the Web, they cannot guarantee the quality required for most relevant business activities as well as for many top-level research projects. In this paper we present MumbleSearch, a Web Content Monitor which is especially conceived to extract and organize topic-based information with emphasis on quality requirements. We present the architecture of the software platform and its deployment for a real-world application, involving Italian Small and Medium Enterprises (SME).
尽管搜索引擎在从Web检索信息方面起着至关重要的作用,但它们不能保证大多数相关业务活动以及许多顶级研究项目所需的质量。在本文中,我们介绍了MumbleSearch,这是一个Web内容监视器,专门用于提取和组织基于主题的信息,强调质量要求。我们介绍了软件平台的架构及其在现实世界应用程序中的部署,涉及意大利中小型企业(SME)。
{"title":"MumbleSearch Extraction of High Quality Web information for SME","authors":"N. Baldini, M. Gori, Marco Maggini","doi":"10.1109/WI.2004.102","DOIUrl":"https://doi.org/10.1109/WI.2004.102","url":null,"abstract":"Although search engines are playing a crucial role for the retrieval of information from the Web, they cannot guarantee the quality required for most relevant business activities as well as for many top-level research projects. In this paper we present MumbleSearch, a Web Content Monitor which is especially conceived to extract and organize topic-based information with emphasis on quality requirements. We present the architecture of the software platform and its deployment for a real-world application, involving Italian Small and Medium Enterprises (SME).","PeriodicalId":229107,"journal":{"name":"IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2004-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129148676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1