首页 > 最新文献

2011 IEEE 23rd International Conference on Tools with Artificial Intelligence最新文献

英文 中文
Domain Adaptation with Good Edit Similarities: A Sparse Way to Deal with Scaling and Rotation Problems in Image Classification 具有良好编辑相似度的域自适应:一种处理图像分类中缩放和旋转问题的稀疏方法
Amaury Habrard, Jean-Philippe Peyrache, M. Sebban
In many real-life applications, the available source training information is either too small or not representative enough of the underlying target test problem. In the past few years, a new line of machine learning research has been developed to overcome such awkward situations, called Domain Adaptation (DA), giving rise to many adaptation algorithms and theoretical results in the form of generalization bounds. In this paper, a novel contribution is proposed in the form of a DA algorithm dealing with string-structured data, inspired from the DA support vector machine (SVM) technique introduced in [Bruzzone et al, PAMI 2010]. To ensure the convergence of SVM-based learning, the similarity functions involved in the process must be valid kernels, i.e. positive semi-definite (PSD) and symmetric. However, in the string-based context that we are considering in this paper, this condition is often not satisfied. Indeed, it has been proven that most string similarity functions based on the edit distance are not PSD. To overcome this drawback, we make use in this paper of the new theory of learning with good similarity functions introduced by Balcan et al., which (i) does not require the use of a valid kernel to learn well and (ii) allows us to induce sparser models. We take advantage of this theoretical framework to propose a new DA algorithm using good edit similarity functions. Using a suitable string-representation of handwritten digits, we show that are our new algorithm is very efficient to deal with the scaling and rotation problems usually encountered in image classification.
在许多实际应用程序中,可用的源训练信息要么太小,要么不足以代表潜在的目标测试问题。在过去的几年里,为了克服这种尴尬的情况,一种新的机器学习研究已经发展起来,称为领域适应(DA),产生了许多适应算法和以泛化边界形式出现的理论结果。本文以处理字符串结构数据的数据处理算法的形式提出了一种新的贡献,其灵感来自于[Bruzzone等人,PAMI 2010]中引入的数据处理支持向量机(SVM)技术。为了保证基于svm学习的收敛性,过程中涉及的相似函数必须是有效核函数,即正半定函数(PSD)和对称函数。然而,在我们在本文中考虑的基于字符串的上下文中,这个条件通常不满足。事实证明,大多数基于编辑距离的字符串相似度函数都不是PSD函数。为了克服这一缺点,我们在本文中使用了Balcan等人引入的具有良好相似函数的学习新理论,该理论(i)不需要使用有效的核来学习,(ii)允许我们诱导稀疏模型。我们利用这一理论框架提出了一种利用良好的编辑相似度函数的新的数据挖掘算法。使用合适的手写数字字符串表示,我们表明我们的新算法非常有效地处理图像分类中经常遇到的缩放和旋转问题。
{"title":"Domain Adaptation with Good Edit Similarities: A Sparse Way to Deal with Scaling and Rotation Problems in Image Classification","authors":"Amaury Habrard, Jean-Philippe Peyrache, M. Sebban","doi":"10.1109/ICTAI.2011.35","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.35","url":null,"abstract":"In many real-life applications, the available source training information is either too small or not representative enough of the underlying target test problem. In the past few years, a new line of machine learning research has been developed to overcome such awkward situations, called Domain Adaptation (DA), giving rise to many adaptation algorithms and theoretical results in the form of generalization bounds. In this paper, a novel contribution is proposed in the form of a DA algorithm dealing with string-structured data, inspired from the DA support vector machine (SVM) technique introduced in [Bruzzone et al, PAMI 2010]. To ensure the convergence of SVM-based learning, the similarity functions involved in the process must be valid kernels, i.e. positive semi-definite (PSD) and symmetric. However, in the string-based context that we are considering in this paper, this condition is often not satisfied. Indeed, it has been proven that most string similarity functions based on the edit distance are not PSD. To overcome this drawback, we make use in this paper of the new theory of learning with good similarity functions introduced by Balcan et al., which (i) does not require the use of a valid kernel to learn well and (ii) allows us to induce sparser models. We take advantage of this theoretical framework to propose a new DA algorithm using good edit similarity functions. Using a suitable string-representation of handwritten digits, we show that are our new algorithm is very efficient to deal with the scaling and rotation problems usually encountered in image classification.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129486663","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Using the H-Divergence to Prune Probabilistic Automata 利用h散度对概率自动机进行剪枝
Pub Date : 2011-11-07 DOI: 10.1109/ICTAI.2011.114
Marc Bernard, Baptiste Jeudy, Jean-Philippe Peyrache, M. Sebban, F. Thollard
A problem usually encountered in probabilistic automata learning is the difficulty to deal with large training samples and/or wide alphabets. This is partially due to the size of the resulting Probabilistic Prefix Tree (PPT) from which state merging-based learning algorithms are generally applied. In this paper, we propose a novel method to prune PPTs by making use of the H-divergence d_H, recently introduced in the field of domain adaptation. d_H is based on the classification error made by an hypothesis learned from unlabeled examples drawn according to two distributions to compare. Through a thorough comparison with state-of-the-art divergence measures, we provide experimental evidences that demonstrate the efficiency of our method based on this simple and intuitive criterion.
在概率自动机学习中经常遇到的一个问题是难以处理大型训练样本和/或广泛的字母。这部分是由于结果的概率前缀树(PPT)的大小,通常应用基于状态合并的学习算法。本文提出了一种利用域自适应领域新近引入的h -散度d_H对PPTs进行剪枝的新方法。d_H是基于从根据两个分布进行比较的未标记示例中学习到的假设所产生的分类误差。通过与最先进的散度测量方法的全面比较,我们提供了实验证据,证明了基于该简单直观准则的方法的有效性。
{"title":"Using the H-Divergence to Prune Probabilistic Automata","authors":"Marc Bernard, Baptiste Jeudy, Jean-Philippe Peyrache, M. Sebban, F. Thollard","doi":"10.1109/ICTAI.2011.114","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.114","url":null,"abstract":"A problem usually encountered in probabilistic automata learning is the difficulty to deal with large training samples and/or wide alphabets. This is partially due to the size of the resulting Probabilistic Prefix Tree (PPT) from which state merging-based learning algorithms are generally applied. In this paper, we propose a novel method to prune PPTs by making use of the H-divergence d_H, recently introduced in the field of domain adaptation. d_H is based on the classification error made by an hypothesis learned from unlabeled examples drawn according to two distributions to compare. Through a thorough comparison with state-of-the-art divergence measures, we provide experimental evidences that demonstrate the efficiency of our method based on this simple and intuitive criterion.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128208730","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Covert Communication Method Based on User-Generated Content Sites 一种基于用户生成内容站点的隐蔽通信方法
Pub Date : 2011-11-07 DOI: 10.1109/ICTAI.2011.179
Qingfeng Tan, Peipeng Liu, Jinqiao Shi, Xiao Wang, Li Guo
with the worldwide increasing of Internet censorship, censorship-resistance technology has attracted more and more attentions, some famous systems, such as Tor and JAP, have been deployed to provide public service for censorship-resistance. However, these systems all rely on dedicated infrastructure and entry points for service accessibility. The network infrastructure and entry points may become the target of censorship attack. In this paper, a UGC-based method is proposed (called user-generated content based covert communication, UGC3) for covert communication in a friends-to-friends (F2F) manner. It uses existing infrastructures (i.e., UGC sites ) to form a fully distributed overlay network. An efficient resource discovery algorithm is proposed to negotiate the rendezvous point. Analysis shows that this method is able to circumvent internet censorship with user repudiation and fault tolerance.
随着世界范围内网络审查的增加,抗审查技术越来越受到人们的关注,一些著名的系统,如Tor和JAP,已经被部署用于提供抗审查公共服务。然而,这些系统都依赖于专用的基础设施和服务可访问性的入口点。网络基础设施和入口可能成为审查攻击的目标。本文提出了一种基于ugc的方法(称为基于用户生成内容的隐蔽通信,UGC3),用于以朋友对朋友(F2F)方式进行隐蔽通信。它利用现有的基础设施(即UGC站点)形成一个完全分布式的覆盖网络。提出了一种高效的资源发现算法来协商集合点。分析表明,该方法具有用户抵赖和容错能力,能够有效规避网络审查。
{"title":"A Covert Communication Method Based on User-Generated Content Sites","authors":"Qingfeng Tan, Peipeng Liu, Jinqiao Shi, Xiao Wang, Li Guo","doi":"10.1109/ICTAI.2011.179","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.179","url":null,"abstract":"with the worldwide increasing of Internet censorship, censorship-resistance technology has attracted more and more attentions, some famous systems, such as Tor and JAP, have been deployed to provide public service for censorship-resistance. However, these systems all rely on dedicated infrastructure and entry points for service accessibility. The network infrastructure and entry points may become the target of censorship attack. In this paper, a UGC-based method is proposed (called user-generated content based covert communication, UGC3) for covert communication in a friends-to-friends (F2F) manner. It uses existing infrastructures (i.e., UGC sites ) to form a fully distributed overlay network. An efficient resource discovery algorithm is proposed to negotiate the rendezvous point. Analysis shows that this method is able to circumvent internet censorship with user repudiation and fault tolerance.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122502570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New Computational Aspects in Master-Slave Systems of Semantic Schemas 语义图式主从系统的新计算方法
Pub Date : 2011-11-07 DOI: 10.1109/ICTAI.2011.105
N. Tandareanu, Cristina Zamfir
In this paper we reconsider the computations accomplished in a semantic schema. We reconsider also the computations in a master-slave systems of semantic schemas introduced in [6] as a cooperating system of such structures. We show that a master-slave system is adequate to represent distributed knowledge. To relieve this fact we describe such a system named DiSys implemented in Java by client-server technology.
在本文中,我们重新考虑了在语义模式中完成的计算。我们还将[6]中引入的语义模式主从系统中的计算重新考虑为这种结构的协作系统。我们证明了主从系统足以表示分布式知识。为了减轻这一事实,我们描述了一个用Java实现的名为DiSys的系统,该系统采用客户机-服务器技术。
{"title":"New Computational Aspects in Master-Slave Systems of Semantic Schemas","authors":"N. Tandareanu, Cristina Zamfir","doi":"10.1109/ICTAI.2011.105","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.105","url":null,"abstract":"In this paper we reconsider the computations accomplished in a semantic schema. We reconsider also the computations in a master-slave systems of semantic schemas introduced in [6] as a cooperating system of such structures. We show that a master-slave system is adequate to represent distributed knowledge. To relieve this fact we describe such a system named DiSys implemented in Java by client-server technology.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"575 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133321847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Design and Development of a Social Intelligence Based Context-Aware Middleware Using BlackBoard 基于社会智能的上下文感知中间件的设计与开发
Pub Date : 2011-11-07 DOI: 10.1109/ICTAI.2011.151
Joohee Suh, Chong-woo Woo
The context-aware computing environment is changing due to recent development of the new computing devices and new concept of services. The systems are developing rapidly, but most of them focus on recognition of the collected information, not on the intelligent capability. In this paper, we defined a new context aware computing environment based on the concept of social intelligence, which implies an ability to share or utilize information by making a relationship, recognizes context by making an inference, and works in collaboration to offer services more efficiently. We have designed and developed a Social Intelligence based Context-Aware Middleware (SI-CAM), under the environment. The SI-CAM provides a service with following functions, multi context-awareness, context based task planning, and grouping intelligent entities for collaboration. The system is developed with blackboard based structure, and tested on virtual environment in the domain of ubiquitous restaurant. The experiment showed some significant results.
由于新的计算设备和新的服务概念的发展,上下文感知计算环境正在发生变化。目前系统发展迅速,但大多侧重于对采集信息的识别,而不是智能能力。在本文中,我们基于社会智能的概念定义了一个新的上下文感知计算环境,这意味着通过建立关系来共享或利用信息的能力,通过推理来识别上下文,并通过协作来更有效地提供服务。在该环境下,我们设计并开发了一个基于社会智能的上下文感知中间件(SI-CAM)。SI-CAM提供的服务具有以下功能:多上下文感知、基于上下文的任务规划和为协作对智能实体进行分组。该系统采用基于黑板的结构进行开发,并在无所不在的餐厅领域的虚拟环境中进行了测试。实验显示了一些重要的结果。
{"title":"Design and Development of a Social Intelligence Based Context-Aware Middleware Using BlackBoard","authors":"Joohee Suh, Chong-woo Woo","doi":"10.1109/ICTAI.2011.151","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.151","url":null,"abstract":"The context-aware computing environment is changing due to recent development of the new computing devices and new concept of services. The systems are developing rapidly, but most of them focus on recognition of the collected information, not on the intelligent capability. In this paper, we defined a new context aware computing environment based on the concept of social intelligence, which implies an ability to share or utilize information by making a relationship, recognizes context by making an inference, and works in collaboration to offer services more efficiently. We have designed and developed a Social Intelligence based Context-Aware Middleware (SI-CAM), under the environment. The SI-CAM provides a service with following functions, multi context-awareness, context based task planning, and grouping intelligent entities for collaboration. The system is developed with blackboard based structure, and tested on virtual environment in the domain of ubiquitous restaurant. The experiment showed some significant results.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128844407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Decomposition-Based Approach to OWL DL Ontology Diagnosis 基于分解的OWL DL本体诊断方法
Pub Date : 2011-11-07 DOI: 10.1109/ICTAI.2011.104
Jianfeng Du, G. Qi, Jeff Z. Pan, Yi-Dong Shen
Computing all diagnoses of an inconsistent ontology is important in ontology-based applications. However, the number of diagnoses can be very large. It is impractical to enumerate all diagnoses before identifying the target one to render the ontology consistent. Hence, we propose to represent all diagnoses by multiple sets of partial diagnoses, where the total number of partial diagnoses can be small and the target diagnosis can be directly retrieved from these partial diagnoses. We also propose methods for computing the new representation of all diagnoses in an OWL DL ontology. Experimental results show that computing the new representation of all diagnoses is much easier than directly computing all diagnoses.
在基于本体的应用中,计算不一致本体的所有诊断是非常重要的。然而,诊断的数量可能非常大。在确定目标诊断之前列举所有诊断以使本体保持一致是不切实际的。因此,我们建议用多组部分诊断来表示所有诊断,其中部分诊断的总数可以很小,并且可以直接从这些部分诊断中检索到目标诊断。我们还提出了计算OWL DL本体中所有诊断的新表示的方法。实验结果表明,计算所有诊断的新表示比直接计算所有诊断要容易得多。
{"title":"A Decomposition-Based Approach to OWL DL Ontology Diagnosis","authors":"Jianfeng Du, G. Qi, Jeff Z. Pan, Yi-Dong Shen","doi":"10.1109/ICTAI.2011.104","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.104","url":null,"abstract":"Computing all diagnoses of an inconsistent ontology is important in ontology-based applications. However, the number of diagnoses can be very large. It is impractical to enumerate all diagnoses before identifying the target one to render the ontology consistent. Hence, we propose to represent all diagnoses by multiple sets of partial diagnoses, where the total number of partial diagnoses can be small and the target diagnosis can be directly retrieved from these partial diagnoses. We also propose methods for computing the new representation of all diagnoses in an OWL DL ontology. Experimental results show that computing the new representation of all diagnoses is much easier than directly computing all diagnoses.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117148724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
How to Reason by HeaRT in a Semantic Knowledge-Based Wiki 如何在语义知识维基中进行心灵推理
W. T. Adrian, Szymon Bobek, G. J. Nalepa, K. Kaczor, Krzysztof Kluza
Semantic wikis constitute an increasingly popular class of systems for collaborative knowledge engineering. We developed Loki, a semantic wiki that uses a logic-based knowledge representation. It is compatible with semantic annotations mechanism as well as Semantic Web languages. We integrated the system with a rule engine called Heart that supports inference with production rules. Several modes for modularized rule bases, suitable for the distributed rule bases present in a wiki, are considered. Embedding the rule engine enables strong reasoning and allows to run production rules over semantic knowledge bases. In the paper, we demonstrate the system concepts and functionality using an illustrative example.
语义wiki构成了协作知识工程中日益流行的一类系统。我们开发了Loki,一个使用基于逻辑的知识表示的语义wiki。它兼容语义注释机制和语义Web语言。我们将该系统与一个名为Heart的规则引擎集成在一起,该引擎支持使用生产规则进行推理。考虑了适合wiki中分布式规则库的模块化规则库的几种模式。嵌入规则引擎支持强大的推理,并允许在语义知识库上运行生产规则。在本文中,我们用一个说明性的例子来说明系统的概念和功能。
{"title":"How to Reason by HeaRT in a Semantic Knowledge-Based Wiki","authors":"W. T. Adrian, Szymon Bobek, G. J. Nalepa, K. Kaczor, Krzysztof Kluza","doi":"10.1109/ICTAI.2011.71","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.71","url":null,"abstract":"Semantic wikis constitute an increasingly popular class of systems for collaborative knowledge engineering. We developed Loki, a semantic wiki that uses a logic-based knowledge representation. It is compatible with semantic annotations mechanism as well as Semantic Web languages. We integrated the system with a rule engine called Heart that supports inference with production rules. Several modes for modularized rule bases, suitable for the distributed rule bases present in a wiki, are considered. Embedding the rule engine enables strong reasoning and allows to run production rules over semantic knowledge bases. In the paper, we demonstrate the system concepts and functionality using an illustrative example.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"159 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115264064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
The Effect of the Characteristics of the Dataset on the Selection Stability 数据集特征对选择稳定性的影响
Pub Date : 2011-11-07 DOI: 10.1109/ICTAI.2011.167
Salem Alelyani, Huan Liu, Lei Wang
Feature selection is an effective technique to reduce the dimensionality of a data set and to select relevant features for the domain problem. Recently, stability of feature selection methods has gained increasing attention. In fact, it has become a crucial factor in determining the goodness of a feature selection algorithm besides the learning performance. In this work, we conduct an extensive experimental study using verity of data sets and different well-known feature selection algorithms in order to study the behavior of these algorithms in terms of the stability.
特征选择是一种有效的降低数据集的维数并为领域问题选择相关特征的技术。近年来,特征选择方法的稳定性越来越受到人们的关注。事实上,除了学习性能之外,它已经成为决定特征选择算法好坏的关键因素。在这项工作中,我们使用数据集的真实性和不同的知名特征选择算法进行了广泛的实验研究,以研究这些算法在稳定性方面的行为。
{"title":"The Effect of the Characteristics of the Dataset on the Selection Stability","authors":"Salem Alelyani, Huan Liu, Lei Wang","doi":"10.1109/ICTAI.2011.167","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.167","url":null,"abstract":"Feature selection is an effective technique to reduce the dimensionality of a data set and to select relevant features for the domain problem. Recently, stability of feature selection methods has gained increasing attention. In fact, it has become a crucial factor in determining the goodness of a feature selection algorithm besides the learning performance. In this work, we conduct an extensive experimental study using verity of data sets and different well-known feature selection algorithms in order to study the behavior of these algorithms in terms of the stability.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115281852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Ranking in Co-effecting Multi-object/Link Types Networks 协同效应多对象/链路类型网络的排序
Bo Zhou, Manna Wu, Xin Xia, Chao Wu
Research on link based object ranking attracts increasing attention these years, which also brings computer science research and business marketing brand-new concepts, opportunities as well as a great deal of challenges. With prosperity of web pages search engine and widely use of social networks, recent graph-theoretic ranking approaches have achieved remarkable successes although most of them are focus on homogeneous networks studying. Previous study on co-ranking methods tries to divide heterogeneous networks into multiple homogeneous sub-networks and ties between different sub-networks. This paper proposes an efficient topic biased ranking method for bringing order to co-effecting heterogeneous networks among authors, papers and accepted institutions (journals/conferences) within one single random surfer. This new method aims to update ranks for different types of objects (author, paper, journals/conferences) at each random walk.
近年来,基于链接的对象排序研究越来越受到人们的关注,这给计算机科学研究和商业营销带来了全新的概念和机遇,同时也带来了许多挑战。随着网页搜索引擎的蓬勃发展和社交网络的广泛应用,近年来的图论排序方法虽然大多集中在同质网络的研究上,但也取得了显著的成功。以往的协同排序方法试图将异构网络划分为多个同质子网络,并将不同子网络之间的联系进行划分。本文提出了一种有效的主题偏向排序方法,用于在单个随机冲浪者中对作者、论文和被接受的机构(期刊/会议)之间的协同效应异构网络进行排序。这种新方法旨在在每次随机漫步时更新不同类型对象(作者、论文、期刊/会议)的排名。
{"title":"Ranking in Co-effecting Multi-object/Link Types Networks","authors":"Bo Zhou, Manna Wu, Xin Xia, Chao Wu","doi":"10.1109/ICTAI.2011.84","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.84","url":null,"abstract":"Research on link based object ranking attracts increasing attention these years, which also brings computer science research and business marketing brand-new concepts, opportunities as well as a great deal of challenges. With prosperity of web pages search engine and widely use of social networks, recent graph-theoretic ranking approaches have achieved remarkable successes although most of them are focus on homogeneous networks studying. Previous study on co-ranking methods tries to divide heterogeneous networks into multiple homogeneous sub-networks and ties between different sub-networks. This paper proposes an efficient topic biased ranking method for bringing order to co-effecting heterogeneous networks among authors, papers and accepted institutions (journals/conferences) within one single random surfer. This new method aims to update ranks for different types of objects (author, paper, journals/conferences) at each random walk.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127341193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved Graph-Based Bilingual Corpus Selection with Sentence Pair Ranking for Statistical Machine Translation 基于统计机器翻译的改进图双语语料库选择与句子对排序
Wen-Han Chao, Zhoujun Li
In statistical machine translation, the number of sentence pairs in the bilingual corpus is very important to the quality of translation. However, when the quantity reaches some extent, enlarging corpus has less effect on the translation, whereas increasing greatly the time and space complexity to building translation systems, which hinders the development of statistical machine translation. In this paper, we propose several ranking approaches to measure the quantity of information of each sentence pair, and apply them into a graph-based bilingual corpus selection framework to form an improved corpus selection approach, which now considers the difference of the initial quantities of information between the sentence pairs. Our experiments in a Chinese-English translation task show that, selecting only 50% of the whole corpus via the graph-based selection approach as training set, we can obtain the near translation result with the one using the whole corpus, and we obtain better results than the baselines after using the IDF-related ranking approach.
在统计机器翻译中,双语语料库中句子对的数量对翻译质量至关重要。然而,当数量达到一定程度时,扩大语料库对翻译的影响不大,反而大大增加了构建翻译系统的时间和空间复杂性,阻碍了统计机器翻译的发展。本文提出了几种衡量句子对信息量的排序方法,并将其应用到基于图的双语语料库选择框架中,形成了一种考虑句子对初始信息量差异的改进语料库选择方法。我们在汉英翻译任务中的实验表明,通过基于图的选择方法只选择整个语料库的50%作为训练集,我们可以获得与使用整个语料库的翻译结果接近的翻译结果,并且使用idf相关排序方法获得比基线更好的结果。
{"title":"Improved Graph-Based Bilingual Corpus Selection with Sentence Pair Ranking for Statistical Machine Translation","authors":"Wen-Han Chao, Zhoujun Li","doi":"10.1109/ICTAI.2011.73","DOIUrl":"https://doi.org/10.1109/ICTAI.2011.73","url":null,"abstract":"In statistical machine translation, the number of sentence pairs in the bilingual corpus is very important to the quality of translation. However, when the quantity reaches some extent, enlarging corpus has less effect on the translation, whereas increasing greatly the time and space complexity to building translation systems, which hinders the development of statistical machine translation. In this paper, we propose several ranking approaches to measure the quantity of information of each sentence pair, and apply them into a graph-based bilingual corpus selection framework to form an improved corpus selection approach, which now considers the difference of the initial quantities of information between the sentence pairs. Our experiments in a Chinese-English translation task show that, selecting only 50% of the whole corpus via the graph-based selection approach as training set, we can obtain the near translation result with the one using the whole corpus, and we obtain better results than the baselines after using the IDF-related ranking approach.","PeriodicalId":332661,"journal":{"name":"2011 IEEE 23rd International Conference on Tools with Artificial Intelligence","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125992505","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
期刊
2011 IEEE 23rd International Conference on Tools with Artificial Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1