Network representation learning (NRL) is a crucial method for learning low-dimensional vertex representations that capture network information. However, conventional NRL models regard each edge as merely a binary or continuous value, neglecting the rich semantic information edges carry. To enhance network representations for the Social Relation Extraction (SRE) task, we present a novel deep-neural-network-based model, EsiNet, which learns the structural and semantic information of edges simultaneously. Compared with previous work, EsiNet focuses on further learning the interactions between vertices and capturing the correlations between labels. By jointly optimizing the objective functions of these two components, EsiNet preserves both the semantic and structural information of edges. Extensive experiments on several public datasets demonstrate that EsiNet significantly outperforms the baselines, by around 3 to 5 absolute percentage points on Hits@10.
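For context, Hits@10 is the standard ranking metric referred to above: the fraction of test cases whose correct label appears in the model's top-10 ranked candidates. A minimal sketch of the metric (the toy relation labels and rankings are illustrative, not from the paper):

```python
def hits_at_k(ranked_labels_per_example, true_labels, k=10):
    """Fraction of examples whose true label appears in the top-k ranking."""
    hits = sum(
        1 for ranking, truth in zip(ranked_labels_per_example, true_labels)
        if truth in ranking[:k]
    )
    return hits / len(true_labels)

# Toy example: 3 test edges, each with a ranked list of candidate relation labels.
rankings = [
    ["friend", "colleague", "family"],
    ["family", "friend", "colleague"],
    ["colleague", "family", "friend"],
]
truths = ["friend", "friend", "friend"]
print(hits_at_k(rankings, truths, k=2))  # 2 of 3 true labels rank in the top 2
```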
Title: "EsiNet: Enhanced Network Representation via Further Learning the Semantic Information of Edges"
Authors: Anqing Zheng, Chong Feng, Fang Yang, Huanhuan Zhang
DOI: 10.1109/ICTAI.2019.00142
Published in: 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI), November 2019
Pub Date: 2019-11-01, DOI: 10.1109/ICTAI.2019.00031
B. Jaumard, Kia Babashahi Ashtiani, Nicolas Huin
In the context of multi-agent systems, Automated Mechanism Design (AMD) is the computer-based design of the rules of a mechanism that reaches an equilibrium even though agents can be selfish and lie about their preferences. Although AMD has been shown to be expressible as a linear program, that program has an exponential number of variables, and consequently no efficient algorithm is known. We revisit this linear programming model of the AMD problem and introduce a new one with a polynomial number of variables. We show that the exponential-size model corresponds to a Dantzig-Wolfe decomposition of the compact one, and we design efficient polynomial-time solution schemes for both models. Numerical experiments compare the solution efficiency of the two models and show that we can solve significantly larger instances than before, up to 2,000 agents or 2,000 resources, in about 35 seconds.
Title: "Automated Mechanism Design: Compact and Decomposition Linear Programming Models"
Authors: B. Jaumard, Kia Babashahi Ashtiani, Nicolas Huin
DOI: 10.1109/ICTAI.2019.00031 (ICTAI 2019)
Pub Date: 2019-11-01, DOI: 10.1109/ICTAI.2019.00216
Nianwen Ning, Chenguang Song, Pengpeng Zhou, Yunlei Zhang, Bin Wu
Network embedding aims to learn a latent representation of each node that preserves structural information. Many real-world networks have multiple dimensions of nodes and multiple types of relations, so it is more appropriate to represent them as multiplex networks. A multiplex network is formed by a set of nodes connected in different layers by links indicating interactions of different types. However, existing random-walk-based multiplex network embedding algorithms suffer from sampling bias and imbalanced relation types, leading to poor performance on downstream tasks. In this paper, we propose a node embedding method for multiplex networks based on adaptive cross-layer forest fire sampling (FFS), called FFME. We first focus on the sampling strategy of FFS to address the bias of random walks. We use a fixed-length queue to record previously visited layers, which balances the edge distribution over layers in the sampled node sequences. In addition, to sample each node's context adaptively, we propose a per-node metric called the Neighbors Partition Coefficient (NPC); the generation of node sequences is supervised by NPC for adaptive cross-layer sampling. Experiments on real-world networks from diverse fields show that our method outperforms state-of-the-art methods on application tasks such as cross-domain link prediction and shared community structure detection.
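The fixed-length queue idea can be illustrated with a small sketch: when the walker picks the layer for its next step, layers appearing in the recent-history queue are avoided, so edge types that were just sampled are temporarily suppressed and the sampled sequence stays balanced across layers. The specific choice rule and layer names below are illustrative assumptions, not the paper's exact procedure:

```python
import random
from collections import deque

def pick_next_layer(layers, recent, rng):
    """Prefer layers not in the recent-history queue; fall back to any layer."""
    fresh = [l for l in layers if l not in recent]
    return rng.choice(fresh if fresh else list(layers))

def balanced_layer_sequence(layers, steps, history_len=2, seed=0):
    rng = random.Random(seed)
    recent = deque(maxlen=history_len)  # fixed-length queue of visited layers
    seq = []
    for _ in range(steps):
        layer = pick_next_layer(layers, recent, rng)
        recent.append(layer)  # oldest entry is evicted automatically
        seq.append(layer)
    return seq

seq = balanced_layer_sequence(["friendship", "coauthor", "citation"], steps=300)
counts = {l: seq.count(l) for l in set(seq)}
print(counts)  # counts are nearly equal across the three layers
```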
Title: "An Adaptive Cross-Layer Sampling-Based Node Embedding for Multiplex Networks"
Authors: Nianwen Ning, Chenguang Song, Pengpeng Zhou, Yunlei Zhang, Bin Wu
DOI: 10.1109/ICTAI.2019.00216 (ICTAI 2019)
Pub Date: 2019-11-01, DOI: 10.1109/ICTAI.2019.00218
Henok Ghebrechristos, G. Alaghband
We present a new system that automatically generates input paths (a syllabus) for a convolutional neural network to follow during curriculum learning, improving training performance. Our system uses information-theoretic content measures of training samples to form the syllabus at training time. We treat every sample as a 2D random variable in which each data point in the sample (such as a pixel) is modelled as an independent and identically distributed (i.i.d.) realization. We use several information-theoretic measures to rank samples and decide when a sample is fed to the network, based on its pixel composition and its relationship to other samples in the training set. Comparative evaluation of several state-of-the-art networks, including GoogleNet and VGG, on benchmark datasets demonstrates that a syllabus ranking samples by measures such as the joint entropy between adjacent samples can improve learning and significantly reduce the number of training steps required to reach a desired training accuracy. We present results indicating that our approach can reduce training loss by as much as a factor of 9 compared to conventional training.
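As an illustration of the joint-entropy measure mentioned above, H(A, B) of two samples can be estimated from the joint histogram of co-located pixel values. This is a generic sketch of the measure on toy flattened "images", not the paper's exact pipeline:

```python
import math
from collections import Counter

def joint_entropy(img_a, img_b):
    """Estimate H(A, B) in bits from co-located pixel pairs of equal-size images."""
    pairs = list(zip(img_a, img_b))  # images as flattened pixel sequences
    counts = Counter(pairs)
    n = len(pairs)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

# Two tiny 2x2 "images" flattened to pixel lists.
a = [0, 0, 255, 255]
b = [0, 255, 0, 255]
print(joint_entropy(a, a))  # 1.0 bit: only pairs (0,0) and (255,255), equally likely
print(joint_entropy(a, b))  # 2.0 bits: all four pixel pairs are distinct
```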
Title: "Optimizing Training using Information Theory-Based Curriculum Learning Factory"
Authors: Henok Ghebrechristos, G. Alaghband
DOI: 10.1109/ICTAI.2019.00218 (ICTAI 2019)
Pub Date: 2019-11-01, DOI: 10.1109/ICTAI.2019.00089
Xiuping Bao, Jiabin Yuan, Bei Chen
Human action recognition has become an important task in computer vision and has attracted significant research interest in recent years. Convolutional Neural Networks (CNNs) have shown their power in image recognition, but video recognition remains a challenging problem. In this paper, we introduce a highly efficient attention-based convolutional network for video understanding, named ECPNet. ECPNet adopts a framework that consecutively connects a 2D CNN and a pseudo-3D CNN, where pseudo-3D means we replace the traditional 3 × 3 × 3 kernel with two 3D convolutional filters of shape 1 × 3 × 3 and 3 × 1 × 1. ECPNet combines the advantages of both 2D and 3D CNNs: (1) it is an end-to-end network that can learn appearance information from images and motion information between frames; (2) it requires fewer computing resources and less memory than many state-of-the-art models; (3) it is easy to scale to different runtime and classification-accuracy requirements. We evaluate the proposed model on three popular video benchmarks for human action recognition: Kinetics-mini (a split of the full Kinetics), UCF101, and HMDB51. ECPNet achieves excellent performance on these datasets at lower time cost.
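The savings from the pseudo-3D factorization can be quantified directly: a full t × h × w kernel costs t·h·w weights per input/output channel pair, while a 1 × 3 × 3 spatial filter plus a 3 × 1 × 1 temporal filter costs 9 + 3 = 12 instead of 27. A small sketch of the parameter arithmetic (the channel widths are made up for illustration; ECPNet's actual layer sizes are not given in the abstract):

```python
def conv3d_params(c_in, c_out, t, h, w):
    """Weight count of a 3D convolution layer (bias terms ignored)."""
    return c_in * c_out * t * h * w

c_in, c_out = 64, 64
full = conv3d_params(c_in, c_out, 3, 3, 3)            # one 3x3x3 kernel
factored = (conv3d_params(c_in, c_out, 1, 3, 3)       # spatial 1x3x3 filter
            + conv3d_params(c_out, c_out, 3, 1, 1))   # temporal 3x1x1 filter
print(full, factored, factored / full)  # 110592 49152 0.444... (12/27 of the weights)
```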
Title: "ECPNet: An Efficient Attention-Based Convolution Network with Pseudo-3D Block for Human Action Recognition"
Authors: Xiuping Bao, Jiabin Yuan, Bei Chen
DOI: 10.1109/ICTAI.2019.00089 (ICTAI 2019)
Pub Date: 2019-11-01, DOI: 10.1109/ICTAI.2019.00010
Sahil Verma, R. Yap
Symbolic execution is a powerful technique for bug finding and program testing, and it has been successful in finding bugs in real-world code. Its core reasoning techniques are constraint solving, path exploration, and search, the same techniques used to solve combinatorial problems such as finite-domain constraint satisfaction problems (CSPs). We propose CSP instances as more challenging benchmarks for evaluating the effectiveness of these core techniques in symbolic execution. We transform CSP benchmarks into C programs suitable for testing the reasoning capabilities of symbolic execution tools; from a single CSP P, we obtain different C programs depending on the transformation choice. Preliminary testing with the KLEE, Tracer-X, and LLBMC tools shows substantial runtime differences across transformation and solver choices. Our C benchmarks are effective at exposing the limitations of existing symbolic execution tools. Our motivation is the belief that benchmarks of this form can spur the development and engineering of improved core reasoning in symbolic execution engines.
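To make the CSP-to-C idea concrete, here is a hedged sketch (not the authors' generator, whose transformation choices are not detailed in the abstract) of how a tiny finite-domain CSP could be emitted as a C program: each variable becomes a symbolic input, each domain bound and constraint becomes a branch, and the path on which all constraints hold becomes a reachability target for the symbolic executor:

```python
def csp_to_c(variables, domains, constraints):
    """Emit C source whose 'SOLUTION' branch is reachable iff the CSP is satisfiable.

    variables:   list of variable names
    domains:     dict mapping each name to an inclusive (lo, hi) range
    constraints: list of C boolean expressions over the variables
    """
    lines = ["#include <stdio.h>", "", "int main(void) {"]
    for v in variables:
        lines.append(f"    int {v};")
        lines.append(f"    scanf(\"%d\", &{v});  /* symbolic input */")
    for v in variables:
        lo, hi = domains[v]
        lines.append(f"    if ({v} < {lo} || {v} > {hi}) return 1;  /* domain bound */")
    for c in constraints:
        lines.append(f"    if (!({c})) return 1;  /* constraint violated */")
    lines.append("    printf(\"SOLUTION\\n\");  /* target path for the executor */")
    lines.append("    return 0;")
    lines.append("}")
    return "\n".join(lines)

# Toy CSP: x, y in [0, 2] with x != y and x + y == 2.
src = csp_to_c(["x", "y"], {"x": (0, 2), "y": (0, 2)}, ["x != y", "x + y == 2"])
print(src)
```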
Title: "Benchmarking Symbolic Execution Using Constraint Problems - Initial Results"
Authors: Sahil Verma, R. Yap
DOI: 10.1109/ICTAI.2019.00010 (ICTAI 2019)
Pub Date: 2019-11-01, DOI: 10.1109/ICTAI.2019.00244
Tianyang Wang, Jun Huan, Bo Li, Kaoning Hu
Real-world image denoising is a challenging but significant problem in computer vision. Unlike the Gaussian noise on which most existing methods focus, real-world noise is non-additive and its distribution is difficult to model. This leads to unsatisfactory performance when a Gaussian denoising approach is applied to real-world denoising problems. In this paper, we propose a simple framework for effective real-world image denoising. Specifically, we investigate the intrinsic properties of a Gaussian denoising prior and demonstrate that this prior can aid real-world image denoising. To leverage the prior, we fine-tune it for only one epoch on a recently proposed real-world image denoising dataset; the learned model improves both visual and quantitative results (peak signal-to-noise ratio, PSNR) on real-world image denoising tasks. Extensive experiments demonstrate the effectiveness of our approach and indicate that a Gaussian denoising prior can be transferred to real-world image denoising with an appropriate training scheme.
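PSNR, the quantitative measure cited above, is the standard definition PSNR = 10·log10(MAX² / MSE). A minimal reference implementation on flattened pixel sequences (the toy values are illustrative):

```python
import math

def psnr(reference, test, max_val=255.0):
    """Peak signal-to-noise ratio in dB between two equal-size pixel sequences."""
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * math.log10(max_val ** 2 / mse)

clean = [52, 55, 61, 59, 79, 61, 76, 61]
noisy = [p + d for p, d in zip(clean, [3, -2, 1, 0, -4, 2, -1, 1])]
print(round(psnr(clean, noisy), 2))  # about 41.6 dB (MSE = 4.5)
```

Higher is better; a denoiser that moves `noisy` closer to `clean` raises this number.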
Title: "Rethink Gaussian Denoising Prior for Real-World Image Denoising"
Authors: Tianyang Wang, Jun Huan, Bo Li, Kaoning Hu
DOI: 10.1109/ICTAI.2019.00244 (ICTAI 2019)
Pub Date: 2019-11-01, DOI: 10.1109/ICTAI.2019.00213
Sheng Xie, Canlong Zhang, Zhixin Li, Zhiwen Wang
When extracting convolutional features from low-resolution person images, a large amount of available information is lost due to pooling, which reduces the accuracy of person classification models. This paper proposes a new classification model that effectively reduces the loss of important information in convolutional neural networks. First, the SE module of the Squeeze-and-Excitation Networks (SENet) is extracted and normalized to produce the Normalized Squeeze-and-Excitation (NSE) module. Then, four NSE modules are applied to the convolutional layers of a ResNet. Finally, a Sparse Normalized Squeeze-and-Excitation Network (SNSENet) is constructed by adding four shortcut connections between the convolutional layers. Experimental results on Market-1501 show that the rank-1 accuracy of SNSE-ResNet-50 is 3.7% and 4.2% higher than that of SE-ResNet-50 and ResNet-50, respectively, and the model also performs well on other person re-identification datasets.
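For readers unfamiliar with the SE block being modified: it globally average-pools each channel, passes the pooled vector through a small two-layer bottleneck with a sigmoid gate, and rescales the channels by the resulting weights. The normalization step shown here (dividing the gates by their sum) is only a guess at what "Normalized" SE might mean; the paper's exact NSE definition may differ, and the weights below are arbitrary:

```python
import math

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def nse_gate(channel_maps, w1, w2):
    """Squeeze-and-excitation gate with an assumed sum-to-one normalization.

    channel_maps: list of per-channel feature maps (flattened pixel lists)
    w1, w2:       weight matrices (lists of rows) of the two FC layers
    """
    # Squeeze: global average pooling per channel.
    z = [sum(m) / len(m) for m in channel_maps]
    # Excitation: FC -> ReLU -> FC -> sigmoid.
    h = [max(0.0, sum(w * x for w, x in zip(row, z))) for row in w1]
    s = [sigmoid(sum(w * x for w, x in zip(row, h))) for row in w2]
    # Normalization (assumed): make the channel gates sum to 1.
    total = sum(s)
    gates = [v / total for v in s]
    # Rescale each channel by its gate.
    return [[g * px for px in m] for g, m in zip(gates, channel_maps)]

maps = [[1.0, 3.0], [2.0, 2.0]]  # two channels, two "pixels" each
w1 = [[0.5, -0.25]]              # bottleneck down to one hidden unit
w2 = [[1.0], [-1.0]]             # back up to two channel gates
out = nse_gate(maps, w1, w2)
print(out)  # each channel scaled by its normalized gate
```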
Title: "Sparse High-Level Attention Networks for Person Re-Identification"
Authors: Sheng Xie, Canlong Zhang, Zhixin Li, Zhiwen Wang
DOI: 10.1109/ICTAI.2019.00213 (ICTAI 2019)
Pub Date: 2019-11-01, DOI: 10.1109/ICTAI.2019.00079
Xiu Li, Guichun Duan, Zhouxia Wang, Jimmy S. J. Ren, Yongbing Zhang, Jiawei Zhang, Kaixiang Song
In the past few years, we have witnessed rapid advances in face super-resolution from very-low-resolution (VLR) images. However, most previous studies solve this problem without explicitly considering the impact of severe real-life image degradation (e.g., blur and noise). We show that robustly recovering details from such VLR images is beyond the ability of current state-of-the-art methods. In this paper, we borrow ideas from "facial composite" and propose an alternative approach to this problem. We endow the degraded VLR images with additional cues by integrating face components from multiple reference images into a novel learning pipeline with both low-level and high-level semantic loss functions, as well as a specialized adversarial training scheme. We show that our method effectively and robustly restores relevant facial details from 16×16 images with extreme degradation. We also test our approach on real-life images, where it performs favorably against previous methods.
Title: "Recovering Extremely Degraded Faces by Joint Super-Resolution and Facial Composite"
Authors: Xiu Li, Guichun Duan, Zhouxia Wang, Jimmy S. J. Ren, Yongbing Zhang, Jiawei Zhang, Kaixiang Song
DOI: 10.1109/ICTAI.2019.00079 (ICTAI 2019)