首页 > 最新文献

2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)最新文献

英文 中文
Automatic Configuration of Bi-Objective Optimisation Algorithms: Impact of Correlation Between Objectives 双目标优化算法的自动配置:目标间相关性的影响
Aymeric Blot, H. Hoos, Marie-Éléonore Kessaci, Laetitia Vermeulen-Jourdan
Multi-objective optimisation algorithms expose various parameters that have to be tuned in order to be efficient. Moreover, in multi-objective optimisation, the correlation between objective functions is known to affect search space structure and algorithm performance. Considering the recent success of automatic algorithm configuration (AAC) techniques for the design of multi-objective optimisation algorithms, this raises two interesting questions: what is the impact of correlation between optimisation objectives on (1) the efficacy of different AAC approaches and (2) on the optimised algorithm designs obtained from these automated approaches? In this work, we study these questions for multi-objective local search algorithms (MOLS) for three well-known bi-objective permutation problems, using two single-objective AAC approaches and one multi-objective approach. Our empirical results clearly show that overall, multi-objective AAC is the most effective approach for the automatic configuration of the highly parametric MOLS framework, and that there is no systematic impact of the degree of correlation on the relative performance of the three AAC approaches. We also find that the best-performing configurations differ, depending on the correlation between objectives and the size of the problem instances to be solved, providing further evidence for the usefulness of automatic configuration of multi-objective optimisation algorithms.
为了提高效率,多目标优化算法暴露了必须调整的各种参数。此外,在多目标优化中,已知目标函数之间的相关性会影响搜索空间结构和算法性能。考虑到最近自动算法配置(AAC)技术在多目标优化算法设计方面的成功,这提出了两个有趣的问题:优化目标之间的相关性对(1)不同AAC方法的有效性和(2)从这些自动化方法中获得的优化算法设计的影响是什么?在这项工作中,我们研究了这些问题的多目标局部搜索算法(MOLS)的三个著名的双目标排列问题,使用两个单目标AAC方法和一个多目标方法。我们的实证结果清楚地表明,总体而言,多目标AAC是高度参数化MOLS框架自动配置的最有效方法,并且相关程度对三种AAC方法的相对性能没有系统影响。我们还发现,根据目标之间的相关性和待解决问题实例的大小,最佳性能配置有所不同,这为多目标优化算法自动配置的有效性提供了进一步的证据。
{"title":"Automatic Configuration of Bi-Objective Optimisation Algorithms: Impact of Correlation Between Objectives","authors":"Aymeric Blot, H. Hoos, Marie-Éléonore Kessaci, Laetitia Vermeulen-Jourdan","doi":"10.1109/ICTAI.2018.00093","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00093","url":null,"abstract":"Multi-objective optimisation algorithms expose various parameters that have to be tuned in order to be efficient. Moreover, in multi-objective optimisation, the correlation between objective functions is known to affect search space structure and algorithm performance. Considering the recent success of automatic algorithm configuration (AAC) techniques for the design of multi-objective optimisation algorithms, this raises two interesting questions: what is the impact of correlation between optimisation objectives on (1) the efficacy of different AAC approaches and (2) on the optimised algorithm designs obtained from these automated approaches? In this work, we study these questions for multi-objective local search algorithms (MOLS) for three well-known bi-objective permutation problems, using two single-objective AAC approaches and one multi-objective approach. Our empirical results clearly show that overall, multi-objective AAC is the most effective approach for the automatic configuration of the highly parametric MOLS framework, and that there is no systematic impact of the degree of correlation on the relative performance of the three AAC approaches. We also find that the best-performing configurations differ, depending on the correlation between objectives and the size of the problem instances to be solved, providing further evidence for the usefulness of automatic configuration of multi-objective optimisation algorithms.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116251134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Web Robot Detection: A Semantic Approach 网络机器人检测:一种语义方法
Athanasios Lagopoulos, Grigorios Tsoumakas, Georgios Papadopoulos
Web robots constitute nowadays more than half of the total web traffic. Malicious robots threaten the security, privacy and performance of the web, while non-malicious ones are involved in analytics skewing. The latter constitutes an important problem for large websites with unique content, as it can lead to false impressions about the popularity and impact of a piece of information. To deal with this problem, we present a novel web robot detection approach for content-rich websites, based on the assumption that human web users are interested in specific topics, while web robots crawl the web randomly. Our approach extends the typical representation of user sessions with a novel set of features that capture the semantics of the content of the requested resources. Empirical results on real-world data from the web portal of an academic publisher, show that the proposed semantic features lead to improved web robot detection accuracy.
如今,网络机器人构成了总网络流量的一半以上。恶意机器人威胁网络的安全、隐私和性能,而非恶意机器人则涉及分析偏差。对于拥有独特内容的大型网站来说,后者构成了一个重要问题,因为它可能导致对一条信息的受欢迎程度和影响的错误印象。为了解决这一问题,我们提出了一种针对内容丰富的网站的新型网络机器人检测方法,该方法基于人类网络用户对特定主题感兴趣的假设,而网络机器人则随机抓取网络。我们的方法扩展了用户会话的典型表示,使用一组新颖的特性来捕获所请求资源内容的语义。在某学术出版社门户网站的真实数据上的实证结果表明,所提出的语义特征提高了网络机器人的检测精度。
{"title":"Web Robot Detection: A Semantic Approach","authors":"Athanasios Lagopoulos, Grigorios Tsoumakas, Georgios Papadopoulos","doi":"10.1109/ICTAI.2018.00150","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00150","url":null,"abstract":"Web robots constitute nowadays more than half of the total web traffic. Malicious robots threaten the security, privacy and performance of the web, while non-malicious ones are involved in analytics skewing. The latter constitutes an important problem for large websites with unique content, as it can lead to false impressions about the popularity and impact of a piece of information. To deal with this problem, we present a novel web robot detection approach for content-rich websites, based on the assumption that human web users are interested in specific topics, while web robots crawl the web randomly. Our approach extends the typical representation of user sessions with a novel set of features that capture the semantics of the content of the requested resources. Empirical results on real-world data from the web portal of an academic publisher, show that the proposed semantic features lead to improved web robot detection accuracy.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123485793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Obstacle-Avoiding Euclidean Steiner Trees by n-Star Bundles 基于n星束的避障欧几里得斯坦纳树
V. Parque, T. Miyashita
Optimal topologies in networked systems is of relevant interest to integrate and coordinate multi-agency. Our interest in this paper is to compute the root location and the topology of minimal-length tree layouts given n nodes in a polygonal map, assuming an n-star network topology. Computational experiments involving 600 minimal tree planning scenarios show the feasibility and efficiency of the proposed approach.
网络系统的最优拓扑是多机构集成与协调的重要问题。在本文中,我们的兴趣是计算多边形映射中给定n个节点的最小长度树布局的根位置和拓扑结构,假设n星网络拓扑结构。涉及600个最小树规划场景的计算实验表明了该方法的可行性和有效性。
{"title":"Obstacle-Avoiding Euclidean Steiner Trees by n-Star Bundles","authors":"V. Parque, T. Miyashita","doi":"10.1109/ICTAI.2018.00057","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00057","url":null,"abstract":"Optimal topologies in networked systems is of relevant interest to integrate and coordinate multi-agency. Our interest in this paper is to compute the root location and the topology of minimal-length tree layouts given n nodes in a polygonal map, assuming an n-star network topology. Computational experiments involving 600 minimal tree planning scenarios show the feasibility and efficiency of the proposed approach.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"344 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124246239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Comparison of Traffic Forecasting Methods in Urban and Suburban Context 城市与郊区交通预测方法的比较
Julien Salotti, S. Fenet, Romain Billot, Nour-Eddin El Faouzi, C. Solnon
In the context of Connected and Smart Cities, the need to predict short term traffic conditions has led to the development of a large variety of forecasting algorithms. In spite of various research efforts, there is however still no clear view of the requirements involved in network-wide traffic forecasting. In this paper, the ability of several state-of-the-art methods to forecast the traffic flow at each road segment is studied. Some of the multivariate methods use the information of all sensors to predict traffic at a specific location, whereas some others rely on the selection of a suitable subset. In addition to classical methods, this paper studies the advantage of learning this subset by using a new variable selection algorithm based on time series graphical models and information theory. This method has already been successfully used in natural science applications with similar goals, but not in the traffic community. A contribution is to evaluate all these methods on two real-world datasets with different characteristics and to compare the forecasting ability of each method in both contexts. The first dataset describes the traffic flow in the city center of Lyon (France), which exhibits complex patterns due to the network structure and urban traffic dynamics. The second dataset describes inter-urban freeway traffic on the outskirts of the French city of Marseille. Experimental results validate the need for variable selection mechanisms and illustrate the complementarity of forecasting algorithms depending on the type of road and the forecasting horizon.
在互联城市和智慧城市的背景下,预测短期交通状况的需求导致了各种预测算法的发展。尽管进行了各种各样的研究工作,但是对于全网络流量预测所涉及的需求仍然没有明确的认识。本文研究了几种最先进的方法预测各路段交通流的能力。一些多变量方法使用所有传感器的信息来预测特定位置的交通,而另一些方法则依赖于选择合适的子集。在经典学习方法的基础上,本文采用一种新的基于时间序列图模型和信息论的变量选择算法,研究了学习该子集的优势。这种方法已经成功地应用于具有类似目标的自然科学应用中,但尚未应用于交通领域。一个贡献是在两个具有不同特征的真实数据集上评估所有这些方法,并比较每种方法在两种情况下的预测能力。第一个数据集描述了法国里昂市中心的交通流,由于网络结构和城市交通动态,该数据集呈现出复杂的模式。第二个数据集描述了法国城市马赛郊区的城际高速公路交通情况。实验结果验证了变量选择机制的必要性,并说明了根据道路类型和预测范围的预测算法的互补性。
{"title":"Comparison of Traffic Forecasting Methods in Urban and Suburban Context","authors":"Julien Salotti, S. Fenet, Romain Billot, Nour-Eddin El Faouzi, C. Solnon","doi":"10.1109/ICTAI.2018.00132","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00132","url":null,"abstract":"In the context of Connected and Smart Cities, the need to predict short term traffic conditions has led to the development of a large variety of forecasting algorithms. In spite of various research efforts, there is however still no clear view of the requirements involved in network-wide traffic forecasting. In this paper, the ability of several state-of-the-art methods to forecast the traffic flow at each road segment is studied. Some of the multivariate methods use the information of all sensors to predict traffic at a specific location, whereas some others rely on the selection of a suitable subset. In addition to classical methods, this paper studies the advantage of learning this subset by using a new variable selection algorithm based on time series graphical models and information theory. This method has already been successfully used in natural science applications with similar goals, but not in the traffic community. A contribution is to evaluate all these methods on two real-world datasets with different characteristics and to compare the forecasting ability of each method in both contexts. The first dataset describes the traffic flow in the city center of Lyon (France), which exhibits complex patterns due to the network structure and urban traffic dynamics. The second dataset describes inter-urban freeway traffic on the outskirts of the French city of Marseille. Experimental results validate the need for variable selection mechanisms and illustrate the complementarity of forecasting algorithms depending on the type of road and the forecasting horizon.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129393898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A New Method for Computing Stable Models in Logic Programming 一种计算逻辑规划中稳定模型的新方法
T. Khaled, B. Benhamou, P. Siegel
In this work, we introduce a new method for searching stable models of logical programs. This method is based on a relatively new semantics that has not been exploited yet. This semantics captures and extends that one of the stable models (Gelfond et al., 1988) and offers a new alternative to implement ASP solvers. The proposed method performs a DPLL enumerative process that is adapted to Answer Set Programming (ASP) framework according to the used semantics. This method has the advantage to use a Horn clause representation having the same size as the input logic program has constant spatial complexity. It avoids the workload induced by the loop management from which suffer most of the ASP solvers based on the Clark completion. Moreover, the enumeration is done on a restricted set of literals called the strong back-door (STB) of the considered logic program. This reduces the algorithm time complexity which is in theory a function of the size of the STB set. We also introduced new inference rules that the method uses to prune its search tree and hence reduces its size in practice. We implemented the proposed method and applied it to enumerate the stable models of some combinatorial problems. The method is compared to other known systems and the obtained results show that our approach is a good alternative for designing ASP solvers.
本文提出了一种寻找逻辑规划稳定模型的新方法。这种方法基于一种尚未被利用的相对较新的语义。这种语义捕获并扩展了一种稳定模型(Gelfond et al., 1988),并为实现ASP求解器提供了一种新的选择。该方法根据所使用的语义执行适合于答案集编程(ASP)框架的DPLL枚举过程。该方法的优点是使用与输入逻辑程序具有恒定空间复杂度的大小相同的Horn子句表示。它避免了大多数基于Clark完井的ASP求解器由于循环管理而带来的工作量。此外,枚举是在被考虑的逻辑程序的一组被称为强后门(STB)的受限字面值上完成的。这降低了算法的时间复杂度,这在理论上是STB集大小的函数。我们还引入了新的推理规则,该方法使用该规则来修剪其搜索树,从而在实践中减小其大小。我们实现了该方法,并将其应用于若干组合问题的稳定模型枚举。将该方法与其他已知系统进行了比较,结果表明该方法是设计ASP求解器的一个很好的选择。
{"title":"A New Method for Computing Stable Models in Logic Programming","authors":"T. Khaled, B. Benhamou, P. Siegel","doi":"10.1109/ICTAI.2018.00125","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00125","url":null,"abstract":"In this work, we introduce a new method for searching stable models of logical programs. This method is based on a relatively new semantics that has not been exploited yet. This semantics captures and extends that one of the stable models (Gelfond et al., 1988) and offers a new alternative to implement ASP solvers. The proposed method performs a DPLL enumerative process that is adapted to Answer Set Programming (ASP) framework according to the used semantics. This method has the advantage to use a Horn clause representation having the same size as the input logic program has constant spatial complexity. It avoids the workload induced by the loop management from which suffer most of the ASP solvers based on the Clark completion. Moreover, the enumeration is done on a restricted set of literals called the strong back-door (STB) of the considered logic program. This reduces the algorithm time complexity which is in theory a function of the size of the STB set. We also introduced new inference rules that the method uses to prune its search tree and hence reduces its size in practice. We implemented the proposed method and applied it to enumerate the stable models of some combinatorial problems. The method is compared to other known systems and the obtained results show that our approach is a good alternative for designing ASP solvers.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129610061","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
ALSTM: Adaptive LSTM for Durative Sequential Data ALSTM:持续序列数据的自适应LSTM
Dejiao Niu, Zheng Xia, Yawen Liu, Tao Cai, Tianquan Liu, Yongzhao Zhan
Long short-term memory (LSTM) network is an effective model architecture for deep learning approaches to sequence modeling tasks. However, the current LSTMs can't use the property of sequential data when dealing with the sequence components, which last for a certain period of time. This may make the model unable to benefit from the inherent characteristics of time series and result in poor performance as well as lower efficiency. In this paper, we present a novel adaptive LSTM for durative sequential data which exploits the temporal continuance of the input data in designing a new LSTM unit. By adding a new mask gate and maintaining span, the cell's memory update is not only determined by the input data but also affected by its duration. An adaptive memory update method is proposed according to the change of the sequence input at each time step. This breaks the limitation that the cells calculate the cell state and hidden output for each input always in a unified manner, making the model more suitable for processing the sequences with continuous data. The experimental results on various sequence training tasks show that under the same iteration epochs, the proposed method can achieve higher accuracy, but need relatively less training time compared with the standard LSTM architecture.
长短期记忆(LSTM)网络是深度学习方法用于序列建模任务的有效模型体系结构。但是,当前的lstm在处理序列组件时不能使用序列数据的属性,序列组件存在一定的时间。这可能使模型无法从时间序列的固有特征中获益,导致性能差,效率低。在本文中,我们提出了一种新的针对持续序列数据的自适应LSTM,在设计新的LSTM单元时利用了输入数据的时间连续性。通过添加一个新的掩码门并保持跨度,单元的内存更新不仅由输入数据决定,而且受其持续时间的影响。根据序列输入在每个时间步长的变化,提出了一种自适应记忆更新方法。这打破了单元对每个输入总是统一计算单元状态和隐藏输出的限制,使模型更适合处理连续数据序列。在各种序列训练任务上的实验结果表明,在相同迭代次数下,与标准LSTM结构相比,该方法可以达到更高的精度,但所需的训练时间相对较少。
{"title":"ALSTM: Adaptive LSTM for Durative Sequential Data","authors":"Dejiao Niu, Zheng Xia, Yawen Liu, Tao Cai, Tianquan Liu, Yongzhao Zhan","doi":"10.1109/ICTAI.2018.00032","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00032","url":null,"abstract":"Long short-term memory (LSTM) network is an effective model architecture for deep learning approaches to sequence modeling tasks. However, the current LSTMs can't use the property of sequential data when dealing with the sequence components, which last for a certain period of time. This may make the model unable to benefit from the inherent characteristics of time series and result in poor performance as well as lower efficiency. In this paper, we present a novel adaptive LSTM for durative sequential data which exploits the temporal continuance of the input data in designing a new LSTM unit. By adding a new mask gate and maintaining span, the cell's memory update is not only determined by the input data but also affected by its duration. An adaptive memory update method is proposed according to the change of the sequence input at each time step. This breaks the limitation that the cells calculate the cell state and hidden output for each input always in a unified manner, making the model more suitable for processing the sequences with continuous data. The experimental results on various sequence training tasks show that under the same iteration epochs, the proposed method can achieve higher accuracy, but need relatively less training time compared with the standard LSTM architecture.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124572103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
An Iterative Instance Selection Based Framework for Multiple-Instance Learning 基于迭代实例选择的多实例学习框架
Liming Yuan, Xianbin Wen, Lu Zhao, Haixia Xu
The instance selection based model is an effective multiple-instance learning (MIL) framework, which solves the MIL problems by embedding examples (bags of instances) into a new feature space formed by some concepts (represented by some selected instances). Most previous studies use single-point concepts for the instance selection, where every possible concept is represented by only a single instance. In this paper, we apply multiple-point concepts for choosing instances, in which each possible concept is jointly represented by a group of similar instances. Furthermore, we establish an iterative instance selection based MIL framework based on multiple-point concepts, which is guaranteed to automatically converge to the needed number of concepts for a given problem. The experimental results demonstrate that the proposed framework can better handle not only common MIL problems but also hybrid ones compared to state-of-the-art MIL algorithms.
基于实例选择的模型是一种有效的多实例学习(MIL)框架,它通过将实例(实例袋)嵌入到由一些概念(由一些选定的实例表示)形成的新特征空间中来解决MIL问题。以往的研究大多使用单点概念进行实例选择,其中每个可能的概念仅由单个实例表示。在本文中,我们采用多点概念选择实例,其中每个可能的概念由一组相似的实例联合表示。此外,我们建立了基于多点概念的迭代实例选择的MIL框架,保证了该框架能够自动收敛到给定问题所需的概念数量。实验结果表明,与现有的MIL算法相比,该框架不仅可以更好地处理常见的MIL问题,而且可以更好地处理混合MIL问题。
{"title":"An Iterative Instance Selection Based Framework for Multiple-Instance Learning","authors":"Liming Yuan, Xianbin Wen, Lu Zhao, Haixia Xu","doi":"10.1109/ICTAI.2018.00121","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00121","url":null,"abstract":"The instance selection based model is an effective multiple-instance learning (MIL) framework, which solves the MIL problems by embedding examples (bags of instances) into a new feature space formed by some concepts (represented by some selected instances). Most previous studies use single-point concepts for the instance selection, where every possible concept is represented by only a single instance. In this paper, we apply multiple-point concepts for choosing instances, in which each possible concept is jointly represented by a group of similar instances. Furthermore, we establish an iterative instance selection based MIL framework based on multiple-point concepts, which is guaranteed to automatically converge to the needed number of concepts for a given problem. The experimental results demonstrate that the proposed framework can better handle not only common MIL problems but also hybrid ones compared to state-of-the-art MIL algorithms.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128716918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Effective Products Categorization with Importance Scores and Morphological Analysis of the Titles 有效产品的重要度分类及标题的形态分析
Leonidas Akritidis, Athanasios Fevgas, Panayiotis Bozanis
During the past few years, the e-commerce platforms and marketplaces have enriched their services with new features to improve their user experience and increase their profitability. Such features include relevant products suggestion, personalized recommendations, query understanding algorithms and numerous others. To effectively implement all these features, a robust products categorization method is required. Due to its importance, the problem of the automatic products classification into a given taxonomy has attracted the attention of multiple researchers. In the current literature, we encounter a broad variety of solutions, ranging from supervised and deep learning algorithms, as well as convolutional and recurrent neural networks. In this paper we introduce a supervised learning method which performs morphological analysis of the product titles by extracting and processing a combination of words and n-grams. In the sequel, each of these tokens receives an importance score according to several criteria which reflect the strength of the correlation of the token with a category. Based on these importance scores, we also propose a dimensionality reduction technique to reduce the size of the feature space without sacrificing much of the performance of the algorithm. The experimental evaluation of our method was conducted by using a real-world dataset, comprised of approximately 320 thousand product titles, which we acquired by crawling a product comparison Web platform. The results of this evaluation indicate that our approach is highly accurate, since it achieves a remarkable classification accuracy of over 95%.
在过去的几年里,电子商务平台和市场为其服务增加了新的功能,以改善用户体验,提高盈利能力。这些功能包括相关产品建议、个性化推荐、查询理解算法等等。为了有效地实现所有这些特性,需要一个健壮的产品分类方法。由于其重要性,产品自动分类问题引起了众多研究者的关注。在目前的文献中,我们遇到了各种各样的解决方案,从监督和深度学习算法,以及卷积和循环神经网络。本文介绍了一种监督学习方法,该方法通过提取和处理词和n-图的组合来对产品标题进行形态分析。在续集中,这些令牌中的每一个都会根据几个标准获得一个重要分数,这些标准反映了令牌与类别的相关性的强度。基于这些重要分数,我们还提出了一种降维技术,在不牺牲算法性能的情况下减少特征空间的大小。我们的方法通过使用一个真实的数据集进行了实验评估,该数据集由大约32万个产品标题组成,我们通过抓取产品比较Web平台获得。这次评估的结果表明,我们的方法是非常准确的,因为它达到了95%以上的分类准确率。
{"title":"Effective Products Categorization with Importance Scores and Morphological Analysis of the Titles","authors":"Leonidas Akritidis, Athanasios Fevgas, Panayiotis Bozanis","doi":"10.1109/ICTAI.2018.00041","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00041","url":null,"abstract":"During the past few years, the e-commerce platforms and marketplaces have enriched their services with new features to improve their user experience and increase their profitability. Such features include relevant products suggestion, personalized recommendations, query understanding algorithms and numerous others. To effectively implement all these features, a robust products categorization method is required. Due to its importance, the problem of the automatic products classification into a given taxonomy has attracted the attention of multiple researchers. In the current literature, we encounter a broad variety of solutions, ranging from supervised and deep learning algorithms, as well as convolutional and recurrent neural networks. In this paper we introduce a supervised learning method which performs morphological analysis of the product titles by extracting and processing a combination of words and n-grams. In the sequel, each of these tokens receives an importance score according to several criteria which reflect the strength of the correlation of the token with a category. Based on these importance scores, we also propose a dimensionality reduction technique to reduce the size of the feature space without sacrificing much of the performance of the algorithm. The experimental evaluation of our method was conducted by using a real-world dataset, comprised of approximately 320 thousand product titles, which we acquired by crawling a product comparison Web platform. The results of this evaluation indicate that our approach is highly accurate, since it achieves a remarkable classification accuracy of over 95%.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"178 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116308083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Co-Ride: Collaborative Preference-Based Taxi-Sharing and Taxi-Dispatch 共乘:基于协同偏好的出租车共享与调度
F. Golpayegani, S. Clarke
Taxi-sharing is an emergent transport mode, which has shown promising results economically, by splitting the travel cost between passengers and environmentally, by serving more people in each trip. Intelligent taxi-dispatch approaches can also manage demand by distributing taxis according to population density in a city. Current approaches to taxi-sharing recommend passengers share a taxi by matching their origin and destination, and taxi-dispatch approaches simply send more taxis to populated areas. However, each passenger may have multiple preferences (e.g., level of convenience, time, cost, and environmental factors), and require a mechanism that offers options considering these preferences. Similarly, taxi drivers may have multiple preferences (e.g., number of hours to work, minimum revenue per day) that need to be considered during a taxi-dispatch planning process. This paper presents a multi-agent collaborative passenger matching and taxi-dispatch model. Passengers and drivers are modeled as autonomous agents having multiple often-conflicting preferences. Passenger agents collaboratively take actions to form a group for a taxi-share, and taxi agents collaborate to achieve a dispatch plan.
出租车共享是一种新兴的交通方式,它通过在乘客和环境之间分摊出行成本,每次出行服务更多的人,在经济上显示出良好的效果。智能出租车调度方法还可以根据城市的人口密度分配出租车来管理需求。目前的出租车共享方法是通过匹配出发地和目的地来推荐乘客共享一辆出租车,而出租车调度方法只是将更多的出租车派往人口密集的地区。然而,每个乘客可能有多种偏好(例如,便利程度、时间、成本和环境因素),并且需要一种机制来提供考虑这些偏好的选项。同样,出租车司机可能有多种偏好(例如,工作小时数,每天的最低收入),这些都需要在出租车调度计划过程中考虑。提出了一种多智能体协同乘客匹配与出租车调度模型。乘客和司机被建模为具有多种经常相互冲突的偏好的自主代理。乘客代理协作采取行动,形成出租车共享组,出租车代理协作实现调度计划。
{"title":"Co-Ride: Collaborative Preference-Based Taxi-Sharing and Taxi-Dispatch","authors":"F. Golpayegani, S. Clarke","doi":"10.1109/ICTAI.2018.00135","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00135","url":null,"abstract":"Taxi-sharing is an emergent transport mode, which has shown promising results economically, by splitting the travel cost between passengers and environmentally, by serving more people in each trip. Intelligent taxi-dispatch approaches can also manage demand by distributing taxis according to population density in a city. Current approaches to taxi-sharing recommend passengers share a taxi by matching their origin and destination, and taxi-dispatch approaches simply send more taxis to populated areas. However, each passenger may have multiple preferences (e.g., level of convenience, time, cost, and environmental factors), and require a mechanism that offers options considering these preferences. Similarly, taxi drivers may have multiple preferences (e.g., number of hours to work, minimum revenue per day) that need to be considered during a taxi-dispatch planning process. This paper presents a multi-agent collaborative passenger matching and taxi-dispatch model. Passengers and drivers are modeled as autonomous agents having multiple often-conflicting preferences. Passenger agents collaboratively take actions to form a group for a taxi-share, and taxi agents collaborate to achieve a dispatch plan.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126479849","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Empirical Activation Function Effects on Unsupervised Convolutional LSTM Learning 经验激活函数对无监督卷积LSTM学习的影响
Nelly Elsayed, A. Maida, M. Bayoumi
This paper empirically evaluates and analyzes the effect of the choice of recurrent activation and unit activation functions on the unsupervised convolutional LSTM learning process. The goal of this work is to provide guidance for selecting the optimal non-linear activation function for the convolutional LSTM models which target the video prediction problem. This paper shows an empirical analysis of different non-linear activation functions that are commonly implemented in different deep learning APIs. We used the moving MNIST dataset as the most common benchmark for video prediction problems.
本文对循环激活函数和单元激活函数的选择对无监督卷积LSTM学习过程的影响进行了实证评价和分析。本工作的目的是为针对视频预测问题的卷积LSTM模型选择最优非线性激活函数提供指导。本文对不同深度学习api中常用的非线性激活函数进行了实证分析。我们使用移动的MNIST数据集作为视频预测问题的最常见基准。
{"title":"Empirical Activation Function Effects on Unsupervised Convolutional LSTM Learning","authors":"Nelly Elsayed, A. Maida, M. Bayoumi","doi":"10.1109/ICTAI.2018.00060","DOIUrl":"https://doi.org/10.1109/ICTAI.2018.00060","url":null,"abstract":"This paper empirically evaluates and analyzes the effect of the choice of recurrent activation and unit activation functions on the unsupervised convolutional LSTM learning process. The goal of this work is to provide guidance for selecting the optimal non-linear activation function for the convolutional LSTM models which target the video prediction problem. This paper shows an empirical analysis of different non-linear activation functions that are commonly implemented in different deep learning APIs. We used the moving MNIST dataset as the most common benchmark for video prediction problems.","PeriodicalId":254686,"journal":{"name":"2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128968988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
期刊
2018 IEEE 30th International Conference on Tools with Artificial Intelligence (ICTAI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1