首页 > 最新文献

2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology最新文献

英文 中文
Leveraging Web 2.0 Sources for Web Content Classification 利用Web 2.0源进行Web内容分类
Somnath Banerjee, Martin Scholz
This paper addresses practical aspects of Web page classification not captured by the classical text mining framework. Classifiers are supposed to perform well on a broad variety of pages. We argue that constructing training corpora is a bottleneck for building such classifiers, and that care has to be taken if the goal is to generalize to previously unseen kinds of pages on the Web. We study techniques for building training corpora automatically from publicly available Web resources, quantify the discrepancy between them, and demonstrate that encouraging agreement between classifiers given such diverse sources drastically outperforms methods that ignore the different natures of data sources on the Web.
本文讨论了经典文本挖掘框架没有捕捉到的Web页面分类的实际方面。分类器应该在各种各样的页面上表现良好。我们认为,构建训练语料库是构建此类分类器的瓶颈,如果目标是泛化到Web上以前未见过的页面类型,则必须小心。我们研究了从公开可用的Web资源自动构建训练语料库的技术,量化了它们之间的差异,并证明了在给定如此多样化的数据源的情况下,鼓励分类器之间的一致性大大优于忽略Web上数据源的不同性质的方法。
{"title":"Leveraging Web 2.0 Sources for Web Content Classification","authors":"Somnath Banerjee, Martin Scholz","doi":"10.1109/WIIAT.2008.291","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.291","url":null,"abstract":"This paper addresses practical aspects of Web page classification not captured by the classical text mining framework. Classifiers are supposed to perform well on a broad variety of pages. We argue that constructing training corpora is a bottleneck for building such classifiers, and that care has to be taken if the goal is to generalize to previously unseen kinds of pages on the Web. We study techniques for building training corpora automatically from publicly available Web resources, quantify the discrepancy between them, and demonstrate that encouraging agreement between classifiers given such diverse sources drastically outperforms methods that ignore the different natures of data sources on the Web.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124768325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Web Communities Defined by Web Page Content 由网页内容定义的网络社区
M. Kudelka, V. Snás̃el, Z. Horak, A. Hassanien
In this paper we are looking for a relationship between the intent of Web pages, their architecture and the communities who take part in their usage and creation. For us, the Web page is entity carrying information about these communities. Our paper describes techniques, which can be used to extract mentioned information as well as tools usable in analysis of these information. Information about communities could be used in several ways thanks to our approach. Finally we present an experiment which proves the feasibility of our approach.
在本文中,我们正在寻找Web页面的意图、它们的体系结构和参与它们的使用和创建的社区之间的关系。对我们来说,网页是承载这些社区信息的实体。本文描述了提取上述信息的技术,以及分析这些信息的工具。由于我们的方法,有关社区的信息可以以多种方式使用。最后通过实验验证了该方法的可行性。
{"title":"Web Communities Defined by Web Page Content","authors":"M. Kudelka, V. Snás̃el, Z. Horak, A. Hassanien","doi":"10.1109/WIIAT.2008.93","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.93","url":null,"abstract":"In this paper we are looking for a relationship between the intent of Web pages, their architecture and the communities who take part in their usage and creation. For us, the Web page is entity carrying information about these communities. Our paper describes techniques, which can be used to extract mentioned information as well as tools usable in analysis of these information. Information about communities could be used in several ways thanks to our approach. Finally we present an experiment which proves the feasibility of our approach.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129747702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Grasping Major Statements and Their Contradictions Toward Information Credibility Analysis of Web Contents 把握网络内容信息可信度分析的主要表述及其矛盾
Daisuke Kawahara, S. Kurohashi, Kentaro Inui
The World Wide Web contains wide variety of news reports, arguments, opinions, etc. that vary widely in quality. People judge the credibility of information on the Web for decision making in daily life. At present, while the quantity of information on the Web is explosively increasing, it is necessary to develop a system that supports such judgments. We have been developing an information credibility analysis system, WISDOM that considers the viewpoints of information contents, information senders, and information appearances. In this paper, as a viewpoint of information contents, we propose a method for providing a bird's eye view of major statements on a given topic and their contradictions. We evaluate the obtained statements in our experiments, and confirm the effectiveness of our approach. Furthermore, we discuss our future objectives.
万维网包含各种各样的新闻报道、争论、观点等,它们的质量参差不齐。人们在日常生活中通过判断网络信息的可信度来进行决策。目前,随着网络信息量的爆炸式增长,有必要开发一个支持这种判断的系统。我们一直在开发一个信息可信度分析系统,即WISDOM,它考虑了信息内容、信息发送者和信息外观的观点。在本文中,我们从信息内容的角度,提出了一种对给定主题的主要陈述及其矛盾提供鸟瞰图的方法。我们在实验中对得到的结论进行了评价,并证实了该方法的有效性。此外,我们讨论了我们未来的目标。
{"title":"Grasping Major Statements and Their Contradictions Toward Information Credibility Analysis of Web Contents","authors":"Daisuke Kawahara, S. Kurohashi, Kentaro Inui","doi":"10.1109/WIIAT.2008.289","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.289","url":null,"abstract":"The World Wide Web contains wide variety of news reports, arguments, opinions, etc. that vary widely in quality. People judge the credibility of information on the Web for decision making in daily life. At present, while the quantity of information on the Web is explosively increasing, it is necessary to develop a system that supports such judgments. We have been developing an information credibility analysis system, WISDOM that considers the viewpoints of information contents, information senders, and information appearances. In this paper, as a viewpoint of information contents, we propose a method for providing a bird's eye view of major statements on a given topic and their contradictions. We evaluate the obtained statements in our experiments, and confirm the effectiveness of our approach. Furthermore, we discuss our future objectives.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128351502","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Discovering and Modelling Multiple Interests of Users in Collaborative Tagging Systems 协同标记系统中用户多重兴趣的发现与建模
C. Yeung, Nicholas Gibbins, N. Shadbolt
We analyse data obtained from several collaborative tagging systems and discover that user interests can be very diverse. Traditional methods for representing interests of users are usually not able to reflect such diversity. We propose a method to construct user profiles of multiple interests using data in a collaborative tagging system. Our evaluation suggests that the proposed method is able to generate user profiles which reflect the diversity of user interests and can be used to help provide more focused recommendation.
我们分析了从几个协作标签系统获得的数据,发现用户的兴趣可能非常多样化。代表用户利益的传统方法通常无法反映这种多样性。我们提出了一种利用协作标记系统中的数据来构建多兴趣用户档案的方法。我们的评估表明,所提出的方法能够生成反映用户兴趣多样性的用户档案,并可用于帮助提供更有针对性的推荐。
{"title":"Discovering and Modelling Multiple Interests of Users in Collaborative Tagging Systems","authors":"C. Yeung, Nicholas Gibbins, N. Shadbolt","doi":"10.1109/WIIAT.2008.267","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.267","url":null,"abstract":"We analyse data obtained from several collaborative tagging systems and discover that user interests can be very diverse. Traditional methods for representing interests of users are usually not able to reflect such diversity. We propose a method to construct user profiles of multiple interests using data in a collaborative tagging system. Our evaluation suggests that the proposed method is able to generate user profiles which reflect the diversity of user interests and can be used to help provide more focused recommendation.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128458279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Service Interoperability between Agents and Semantic Web Services for Nomadic Environment 游牧环境下代理与语义Web服务之间的服务互操作性
Adeel Shajjar, N. Khalid, H. F. Ahmad, H. Suguri
In the current information age, peoplepsilas lives are driven by the availability and utilization of services to get information anytime, anywhere, in any format, and on any device. For a nomadic user, information becomes useful if certain scales of autonomy and intelligence are present in the information systems. In this context lightweight multi agent systems (L-MAS) become a preferable choice for design and development of intelligent autonomous mobile information systems. In this paper we aim to propose a multi agent based system architecture to bring OWL based semantic Web services to nomadic users. We propose a design for smart nomadic client using, L-MAS, which interacts with a multi agent based mediator system to get access to the OWL based semantic Web services.
在当前的信息时代,人们的生活被服务的可用性和利用所驱动,以便随时随地、以任何格式、在任何设备上获取信息。对于游牧用户来说,如果信息系统中存在一定程度的自主性和智能,那么信息就会变得有用。在此背景下,轻量级多智能体系统(L-MAS)成为自主智能移动信息系统设计与开发的理想选择。本文旨在提出一种基于多代理的系统架构,为游移用户提供基于OWL的语义Web服务。我们提出了一种使用L-MAS的智能游牧客户端设计,它与基于多代理的中介系统交互,以访问基于OWL的语义Web服务。
{"title":"Service Interoperability between Agents and Semantic Web Services for Nomadic Environment","authors":"Adeel Shajjar, N. Khalid, H. F. Ahmad, H. Suguri","doi":"10.1109/WIIAT.2008.327","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.327","url":null,"abstract":"In the current information age, peoplepsilas lives are driven by the availability and utilization of services to get information anytime, anywhere, in any format, and on any device. For a nomadic user, information becomes useful if certain scales of autonomy and intelligence are present in the information systems. In this context lightweight multi agent systems (L-MAS) become a preferable choice for design and development of intelligent autonomous mobile information systems. In this paper we aim to propose a multi agent based system architecture to bring OWL based semantic Web services to nomadic users. We propose a design for smart nomadic client using, L-MAS, which interacts with a multi agent based mediator system to get access to the OWL based semantic Web services.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128466484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
The Role of Blackboard-Based Reasoning and Visual Analytics in RESIN's Predictive Analysis 基于黑板的推理和可视化分析在树脂预测分析中的作用
D. Liu, Jia Yue, Xiaoyu Wang, A. Raja, W. Ribarsky
Knowledge gathering and investigative tasks in open environments can be very complex because the problem-solving context is constantly evolving, and the data may be incomplete, unreliable and/or conflicting. This paper significantly extends our previous work on a mixed-initiative agent by making it capable of assisting humans in foraging task analysis using AI blackboard-based reasoning, visualizations and a mix-initiative user interface. The agent is equipped with the ability to adapt its processing to available resources, deadlines and its current problem-solving context.
在开放环境中,知识收集和调查任务可能非常复杂,因为解决问题的环境不断变化,数据可能不完整、不可靠和/或相互冲突。本文极大地扩展了我们之前在混合主动代理上的工作,使其能够使用基于AI黑板的推理、可视化和混合主动用户界面来协助人类进行任务分析。代理具有使其处理适应可用资源、最后期限和当前解决问题的上下文的能力。
{"title":"The Role of Blackboard-Based Reasoning and Visual Analytics in RESIN's Predictive Analysis","authors":"D. Liu, Jia Yue, Xiaoyu Wang, A. Raja, W. Ribarsky","doi":"10.1109/WIIAT.2008.307","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.307","url":null,"abstract":"Knowledge gathering and investigative tasks in open environments can be very complex because the problem-solving context is constantly evolving, and the data may be incomplete, unreliable and/or conflicting. This paper significantly extends our previous work on a mixed-initiative agent by making it capable of assisting humans in foraging task analysis using AI blackboard-based reasoning, visualizations and a mix-initiative user interface. The agent is equipped with the ability to adapt its processing to available resources, deadlines and its current problem-solving context.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129803792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Relating Cognitive Process Models to Behavioural Models of Agents 认知过程模型与主体行为模型的关系
A. Sharpanskykh, Jan Treur
From an external perspective, cognitive agent behaviour can be described by specifying (temporal) correlations of a certain complexity between stimuli (input states) and (re)actions (output states) of the agent. From an internal perspective the agentpsilas dynamics can be characterized by direct (causal) temporal relations between internal cognitive states of the agent. Internal dynamics and externally observable behaviour of an agent have reciprocal relations with each other. This paper contributes an approach that allows automatic generation of a behavioural specification of an agent from a cognitive process model. Furthermore, by this automated transformation, internal cognitive state properties of an agent can be related by a representation relation to externally observable behavioural patterns.
从外部角度来看,认知代理行为可以通过指定代理的刺激(输入状态)和(再)动作(输出状态)之间一定复杂性的(时间)相关性来描述。从内部的角度来看,主体主体动力学可以通过主体内部认知状态之间的直接(因果)时间关系来表征。一个主体的内部动态和外部可观察的行为具有相互关系。本文提供了一种方法,允许从认知过程模型自动生成代理的行为规范。此外,通过这种自动化转换,代理的内部认知状态属性可以通过表征关系与外部可观察的行为模式相关联。
{"title":"Relating Cognitive Process Models to Behavioural Models of Agents","authors":"A. Sharpanskykh, Jan Treur","doi":"10.1109/WIIAT.2008.246","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.246","url":null,"abstract":"From an external perspective, cognitive agent behaviour can be described by specifying (temporal) correlations of a certain complexity between stimuli (input states) and (re)actions (output states) of the agent. From an internal perspective the agentpsilas dynamics can be characterized by direct (causal) temporal relations between internal cognitive states of the agent. Internal dynamics and externally observable behaviour of an agent have reciprocal relations with each other. This paper contributes an approach that allows automatic generation of a behavioural specification of an agent from a cognitive process model. Furthermore, by this automated transformation, internal cognitive state properties of an agent can be related by a representation relation to externally observable behavioural patterns.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127153952","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
DFCM: Density Based Approach to Identify Outliers and to Get Efficient Clusters in Fuzzy Clustering 基于密度的模糊聚类中离群点识别和高效聚类方法
Prabhjot Kaur
The task of outlier identification is to find small groups of data objects that are exceptional when compared with rest large amount of data. The identification of outliers can lead to the discovery of truly unexpected knowledge in areas such as electronic commerce, credit card frauds, voting irregularity analysis, data cleansing, network intrusion, severe weather prediction & many more. This paper deals with the identification of outliers and to get efficient clusters in fuzzy clustering. In this paper a new density based definition of outlier and an algorithm dasiaDFCMpsila is proposed; which works in two phases. In first phase, it identifies outliers and separate them from original data-set and in the second phase, it creates clusters from noiseless data. DFCM modifies FCM fuzzy clustering technique to create clusters. But it can also be implemented with any other fuzzy clustering technique. Numerical examples and tests show that proposed algorithm gives better result when compared with FCM.
异常值识别的任务是找到与其他大量数据相比异常的小组数据对象。识别异常值可以在电子商务、信用卡欺诈、投票违规分析、数据清理、网络入侵、恶劣天气预测等领域发现真正意想不到的知识。本文研究了模糊聚类中异常值的识别和高效聚类的问题。本文提出了一种新的基于密度的离群点定义和算法dasiaDFCMpsila;它分两个阶段起作用。在第一阶段,它识别异常值并将其从原始数据集中分离出来,在第二阶段,它从无噪声数据中创建聚类。DFCM修改了FCM模糊聚类技术来创建聚类。但它也可以用任何其他模糊聚类技术实现。数值算例和测试结果表明,该算法与FCM相比具有更好的效果。
{"title":"DFCM: Density Based Approach to Identify Outliers and to Get Efficient Clusters in Fuzzy Clustering","authors":"Prabhjot Kaur","doi":"10.1109/WIIAT.2008.58","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.58","url":null,"abstract":"The task of outlier identification is to find small groups of data objects that are exceptional when compared with rest large amount of data. The identification of outliers can lead to the discovery of truly unexpected knowledge in areas such as electronic commerce, credit card frauds, voting irregularity analysis, data cleansing, network intrusion, severe weather prediction & many more. This paper deals with the identification of outliers and to get efficient clusters in fuzzy clustering. In this paper a new density based definition of outlier and an algorithm dasiaDFCMpsila is proposed; which works in two phases. In first phase, it identifies outliers and separate them from original data-set and in the second phase, it creates clusters from noiseless data. DFCM modifies FCM fuzzy clustering technique to create clusters. But it can also be implemented with any other fuzzy clustering technique. Numerical examples and tests show that proposed algorithm gives better result when compared with FCM.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122354443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Solving Sum and Product Riddle via BDD-Based Model Checking 基于bdd模型检验的和积谜语求解
Xiangyu Luo, Kaile Su, A. Sattar, Yan Chen
We model the sum and product riddle in public announcement logic, which is interpreted on an epistemic Kripke model. The model is symbolically represented as a finite state program with n agents. A model checking method to the riddle is developed by using the BDD-based symbolic model checking algorithm for logic of knowledge we developed in [7]. The method is implemented by extending the model checker MCTK [7] and then the solution of the riddle is verified successfully.
本文对公告逻辑中的和与积谜题进行了建模,并用认知Kripke模型对其进行了解释。该模型被符号表示为具有n个代理的有限状态程序。利用我们在[7]中开发的基于bdd的知识逻辑符号模型检查算法,开发了一种谜语的模型检查方法。该方法通过扩展模型检查器MCTK[7]实现,并成功验证了谜语的解。
{"title":"Solving Sum and Product Riddle via BDD-Based Model Checking","authors":"Xiangyu Luo, Kaile Su, A. Sattar, Yan Chen","doi":"10.1109/WIIAT.2008.277","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.277","url":null,"abstract":"We model the sum and product riddle in public announcement logic, which is interpreted on an epistemic Kripke model. The model is symbolically represented as a finite state program with n agents. A model checking method to the riddle is developed by using the BDD-based symbolic model checking algorithm for logic of knowledge we developed in [7]. The method is implemented by extending the model checker MCTK [7] and then the solution of the riddle is verified successfully.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132495058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Comparison of Performance for SVM Based Relevance Feedback Document Retrieval in Several Vector Space Models 基于SVM的相关反馈文档检索在几种向量空间模型中的性能比较
T. Onoda, H. Murata, S. Yamada
We investigate the following data mining problems from the document retrieval: From a large data set of documents, we need to find documents that relate to human interest as few iterations of human testing or checking as possible. In each iteration a comparatively small batch of documents is evaluated for relating to the human interest. We apply active learning techniques based on Support Vector Machine for evaluating successive batches, which is called relevance feedback. Our proposed approach has been very useful for document retrieval with relevance feedback experimentally. In this paper, we adopt several Vector Space Models into our proposed method, and then show the comparison results of the performance of our method in several Vector Space Models.
我们从文档检索中研究以下数据挖掘问题:从文档的大型数据集中,我们需要找到与人类兴趣相关的文档,尽可能少地进行人类测试或检查的迭代。在每次迭代中,相对较小的一批文档被评估为与人类兴趣相关。我们使用基于支持向量机的主动学习技术来评估连续批次,这被称为相关反馈。实验结果表明,本文提出的方法对具有相关反馈的文档检索非常有用。在本文中,我们将几种向量空间模型引入到我们提出的方法中,然后展示了我们的方法在几种向量空间模型中的性能比较结果。
{"title":"Comparison of Performance for SVM Based Relevance Feedback Document Retrieval in Several Vector Space Models","authors":"T. Onoda, H. Murata, S. Yamada","doi":"10.1109/WIIAT.2008.101","DOIUrl":"https://doi.org/10.1109/WIIAT.2008.101","url":null,"abstract":"We investigate the following data mining problems from the document retrieval: From a large data set of documents, we need to find documents that relate to human interest as few iterations of human testing or checking as possible. In each iteration a comparatively small batch of documents is evaluated for relating to the human interest. We apply active learning techniques based on Support Vector Machine for evaluating successive batches, which is called relevance feedback. Our proposed approach has been very useful for document retrieval with relevance feedback experimentally. In this paper, we adopt several Vector Space Models into our proposed method, and then show the comparison results of the performance of our method in several Vector Space Models.","PeriodicalId":393772,"journal":{"name":"2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2008-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130917471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1