首页 > 最新文献

2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)最新文献

英文 中文
Reasoning About Future Cyber-Attacks Through Socio-Technical Hacking Information 通过社会技术黑客信息推断未来的网络攻击
E. Marin, Mohammed Almukaynizi, P. Shakarian
With the widespread of cyber-attack incidents, cybersecurity has become a major concern for organizations. The waste of time, money and resources while organizations counter irrelevant cyber threats can turn them into the next victim of malicious hackers. In addition, the online hacking community has grown rapidly, making the cyber threat landscape hard to keep track of. In this work, we describe an AI tool that uses a temporal logical framework to learn rules that correlate malicious hacking activity with real-world cyber incidents, aiming to leverage these rules for predicting future cyber-attacks. The framework considers socio-personal and technical indicators of enterprise attacks, analyzing the hackers and their strategies when they are planning cyber offensives online. Our results demonstrate the viability of the proposed approach, which outperforms baseline systems by an average F1 score increase of 138%, 71% and 17% for intervals of 1, 2 and 3 days respectively, providing security teams mechanisms to predict and avoid cyber-attacks.
随着网络攻击事件的广泛发生,网络安全已成为组织关注的主要问题。组织在应对无关的网络威胁时浪费时间、金钱和资源,可能会使他们成为恶意黑客的下一个受害者。此外,网络黑客社区发展迅速,使得网络威胁形势难以追踪。在这项工作中,我们描述了一个人工智能工具,它使用一个时间逻辑框架来学习将恶意黑客活动与现实世界的网络事件相关联的规则,旨在利用这些规则来预测未来的网络攻击。该框架考虑了企业攻击的社会个人和技术指标,分析了黑客及其在线计划网络攻击时的策略。我们的研究结果证明了该方法的可行性,该方法在1天、2天和3天的间隔内分别比基准系统平均提高了138%、71%和17%的F1分数,为安全团队提供了预测和避免网络攻击的机制。
{"title":"Reasoning About Future Cyber-Attacks Through Socio-Technical Hacking Information","authors":"E. Marin, Mohammed Almukaynizi, P. Shakarian","doi":"10.1109/ICTAI.2019.00030","DOIUrl":"https://doi.org/10.1109/ICTAI.2019.00030","url":null,"abstract":"With the widespread of cyber-attack incidents, cybersecurity has become a major concern for organizations. The waste of time, money and resources while organizations counter irrelevant cyber threats can turn them into the next victim of malicious hackers. In addition, the online hacking community has grown rapidly, making the cyber threat landscape hard to keep track of. In this work, we describe an AI tool that uses a temporal logical framework to learn rules that correlate malicious hacking activity with real-world cyber incidents, aiming to leverage these rules for predicting future cyber-attacks. The framework considers socio-personal and technical indicators of enterprise attacks, analyzing the hackers and their strategies when they are planning cyber offensives online. Our results demonstrate the viability of the proposed approach, which outperforms baseline systems by an average F1 score increase of 138%, 71% and 17% for intervals of 1, 2 and 3 days respectively, providing security teams mechanisms to predict and avoid cyber-attacks.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114986404","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Adaptive Phone Orientation Method for Continuous Authentication Based on Mobile Motion Sensors 基于移动运动传感器的自适应手机方向连续认证方法
Shixuan Wang, Jiabin Yuan, Jing Wen
With the popularity of mobile terminals and the convenience of mobile operating, more and more private data is stored in mobile phones, which makes users pay more attention to mobile security. The data of mobile motion sensors are used to construct the user's behavioral characteristics and biometric characteristics. The principle is to capture the subtle changes in the smartphones caused by the user holding smartphones and touching screens. These changes are unique to different users. This paper studies the effects of mobile phone orientations on continuous authentication based on mobile motion sensors. Additionally, this paper constructs an adaptive phone orientation method for continuous authentication. An orientation detection model is constructed using K-means and Random Forest. The authentication model is constructed by using one-class SVM. Meanwhile, our experiments show that the data of mobile motion sensors are great difference in different phone orientations. We believe considering the situation of multi-orientations data affecting authentication accuracy. The adaptive phone orientation method can better fit for the scenarios where users use mobile devices in different phone orientations. Considering the orientation of the data will improve the robustness of the system and the accuracy of authentication.
随着移动终端的普及和移动操作的便利性,越来越多的私人数据存储在手机上,这使得用户更加重视移动安全。利用移动运动传感器的数据构建用户的行为特征和生物特征。其原理是捕捉用户手持智能手机和触摸屏幕时智能手机的细微变化。这些更改对于不同的用户是独一无二的。本文研究了基于移动运动传感器的手机方向对连续认证的影响。此外,本文还构造了一种自适应手机定位的连续认证方法。利用k均值和随机森林构造了一个方向检测模型。采用单类支持向量机构建认证模型。同时,我们的实验表明,在不同的手机方向下,移动运动传感器的数据差异很大。我们认为考虑到多方向数据影响认证精度的情况。自适应手机朝向方法可以更好地适应用户在不同手机朝向下使用移动设备的场景。考虑数据的方向将提高系统的鲁棒性和认证的准确性。
{"title":"Adaptive Phone Orientation Method for Continuous Authentication Based on Mobile Motion Sensors","authors":"Shixuan Wang, Jiabin Yuan, Jing Wen","doi":"10.1109/ICTAI.2019.00236","DOIUrl":"https://doi.org/10.1109/ICTAI.2019.00236","url":null,"abstract":"With the popularity of mobile terminals and the convenience of mobile operating, more and more private data is stored in mobile phones, which makes users pay more attention to mobile security. The data of mobile motion sensors are used to construct the user's behavioral characteristics and biometric characteristics. The principle is to capture the subtle changes in the smartphones caused by the user holding smartphones and touching screens. These changes are unique to different users. This paper studies the effects of mobile phone orientations on continuous authentication based on mobile motion sensors. Additionally, this paper constructs an adaptive phone orientation method for continuous authentication. An orientation detection model is constructed using K-means and Random Forest. The authentication model is constructed by using one-class SVM. Meanwhile, our experiments show that the data of mobile motion sensors are great difference in different phone orientations. We believe considering the situation of multi-orientations data affecting authentication accuracy. The adaptive phone orientation method can better fit for the scenarios where users use mobile devices in different phone orientations. Considering the orientation of the data will improve the robustness of the system and the accuracy of authentication.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117117701","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Real-Coded Genetic Algorithm Realizing Fast Convergence by Reducing Its Population Size 实数编码遗传算法通过减小种群大小实现快速收敛
Kazuki Nishisaka, H. Iima
Just generation gap (JGG) is a generational alternation model of real-coded genetic algorithms (GAs), and is excellent at finding the global optimum solution of a function optimization problem. However, its population size is large, and therefore its convergence speed is low. A method to accelerate the convergence speed is to reduce the population size. However, if it is reduced throughout the search by JGG, the population diversity is lost, which may cause the failure to find the global optimum solution. The population size should be reduced during only a part of the search period during which the population diversity is not lost. In this paper, we propose a real-coded GA realizing fast convergence by introducing the reduction of the population size into JGG. In the proposed method, the population size is reduced during only an early or late search period. The performance of the proposed method is empirically evaluated by comparing it with only JGG and an existing GA.
正代沟(JGG)是实数编码遗传算法(GAs)的代际交替模型,具有寻找函数优化问题全局最优解的优点。但其种群规模较大,收敛速度较慢。一种加快收敛速度的方法是减小种群规模。但是,如果JGG在整个搜索过程中减小它,则会丢失种群多样性,可能导致无法找到全局最优解。只在种群多样性不丧失的部分搜索期内减少种群规模。在本文中,我们提出了一种实数编码的遗传算法,通过在JGG中引入减少种群大小的方法来实现快速收敛。在所提出的方法中,种群大小只在早期或后期搜索期间减小。通过与JGG和现有遗传算法进行比较,对所提方法的性能进行了实证评价。
{"title":"Real-Coded Genetic Algorithm Realizing Fast Convergence by Reducing Its Population Size","authors":"Kazuki Nishisaka, H. Iima","doi":"10.1109/ICTAI.2019.00264","DOIUrl":"https://doi.org/10.1109/ICTAI.2019.00264","url":null,"abstract":"Just generation gap (JGG) is a generational alternation model of real-coded genetic algorithms (GAs), and is excellent at finding the global optimum solution of a function optimization problem. However, its population size is large, and therefore its convergence speed is low. A method to accelerate the convergence speed is to reduce the population size. However, if it is reduced throughout the search by JGG, the population diversity is lost, which may cause the failure to find the global optimum solution. The population size should be reduced during only a part of the search period during which the population diversity is not lost. In this paper, we propose a real-coded GA realizing fast convergence by introducing the reduction of the population size into JGG. In the proposed method, the population size is reduced during only an early or late search period. The performance of the proposed method is empirically evaluated by comparing it with only JGG and an existing GA.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114552825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Optimizing Value of Information Over an Infinite Time Horizon 在无限时间范围内优化信息价值
Sarthak Ghosh, C. Ramakrishnan
Decision-making based on probabilistic reasoning often involves selecting a subset of expensive observations that best predict the system state. In an earlier work, adopting the general notion of value of information (VoI) first introduced by Krause and Guestrin, Ghosh and Ramakrishnan considered the problem of determining optimal conditional observation plans in temporal graphical models, based on non-myopic (non-greedy) VoI, over a finite time horizon. They cast the problem as determining optimal policies in finite-horizon, non-discounted Markov Decision Processes (MDPs). However, there are many practical scenarios where a time horizon is undefinable. In this paper, we consider the VoI optimization problem over an infinite (or equivalently, undefined) time horizon. Adopting an approach similar to Ghosh and Ramakrishnan's, we cast this problem as determining optimal policies in infinite-horizon, finite-state, discounted MDPs. Although our MDP-based framework addresses Dynamic Bayesian Networks (DBNs) that are more restricted than those addressed by Ghosh and Ramakrishnan, we incorporate Krause and Guestrin's general idea of VoI even though it was fundamentally envisioned for finite-horizon settings. We establish the utility of our approach on two graphical models based on real-world datasets.
基于概率推理的决策通常涉及选择最能预测系统状态的昂贵观测值子集。在早期的工作中,采用Krause和Guestrin, Ghosh和Ramakrishnan首先引入的信息价值(VoI)的一般概念,考虑了在有限时间范围内基于非近视(非贪婪)VoI确定时间图形模型中最优条件观察计划的问题。他们将这个问题描述为在有限视界、非贴现马尔可夫决策过程(mdp)中确定最优策略。然而,在许多实际情况下,时间范围是不确定的。在本文中,我们考虑在无限(或等价的,未定义的)时间范围上的VoI优化问题。采用与Ghosh和Ramakrishnan类似的方法,我们将此问题视为确定无限视界,有限状态,贴现mdp的最优策略。尽管我们基于mdp的框架解决了比Ghosh和Ramakrishnan所解决的更受限制的动态贝叶斯网络(dbn),但我们结合了Krause和Guestrin的VoI总体思想,尽管它从根本上是为有限视界设置设想的。我们在基于真实世界数据集的两个图形模型上建立了我们方法的实用性。
{"title":"Optimizing Value of Information Over an Infinite Time Horizon","authors":"Sarthak Ghosh, C. Ramakrishnan","doi":"10.1109/ICTAI.2019.00101","DOIUrl":"https://doi.org/10.1109/ICTAI.2019.00101","url":null,"abstract":"Decision-making based on probabilistic reasoning often involves selecting a subset of expensive observations that best predict the system state. In an earlier work, adopting the general notion of value of information (VoI) first introduced by Krause and Guestrin, Ghosh and Ramakrishnan considered the problem of determining optimal conditional observation plans in temporal graphical models, based on non-myopic (non-greedy) VoI, over a finite time horizon. They cast the problem as determining optimal policies in finite-horizon, non-discounted Markov Decision Processes (MDPs). However, there are many practical scenarios where a time horizon is undefinable. In this paper, we consider the VoI optimization problem over an infinite (or equivalently, undefined) time horizon. Adopting an approach similar to Ghosh and Ramakrishnan's, we cast this problem as determining optimal policies in infinite-horizon, finite-state, discounted MDPs. Although our MDP-based framework addresses Dynamic Bayesian Networks (DBNs) that are more restricted than those addressed by Ghosh and Ramakrishnan, we incorporate Krause and Guestrin's general idea of VoI even though it was fundamentally envisioned for finite-horizon settings. We establish the utility of our approach on two graphical models based on real-world datasets.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122123740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distant-Supervised Relation Extraction with Hierarchical Attention Based on Knowledge Graph 基于知识图的层次关注的远程监督关系提取
Hong Yao, Lijun Dong, Shiqi Zhen, Xiaojun Kang, Xinchuan Li, Qingzhong Liang
Relation Extraction is concentrated on finding the unknown relational facts automatically from the unstructured texts. Most current methods, especially the distant supervision relation extraction (DSRE), have been successfully applied to achieve this goal. DSRE combines knowledge graph and text corpus to corporately generate plenty of labeled data without human efforts. However, the existing methods of DSRE ignore the noisy words within sentences and suffer from the noisy labelling problem; the additional knowledge is represented in a common semantic space and ignores the semantic-space difference between relations and entities. To address these problems, this study proposes a novel hierarchical attention model, named the Bi-GRU-based Knowledge Graph Attention Model (BG2KGA) for DSRE using the Bidirectional Gated Recurrent Unit (Bi-GRU) network. BG2KGA contains the word-level and sentence-level attentions with the guidance of additional knowledge graph, to highlight the key words and sentences respectively which can contribute more to the final relation representations. Further-more, the additional knowledge graph are embedded in the multi-semantic vector space to capture the relations in 1-N, N-1 and N-N entity pairs. Experiments are conducted on a widely used dataset for distant supervision. The experimental results have shown that the proposed model outperforms the current methods and can improve the Precision/Recall (PR) curve area by 8% to 16% compared to the state-of-the-art models; the AUC of BG2KGA can reach 0.468 in the best case.
关系抽取是指从非结构化文本中自动发现未知的关系事实。目前的大多数方法,特别是远程监督关系提取(DSRE),已经成功地实现了这一目标。DSRE将知识图和文本语料库相结合,可以在不需要人工的情况下共同生成大量的标记数据。然而,现有的DSRE方法忽略了句子中的噪声词,存在噪声标注问题;附加知识在公共语义空间中表示,忽略关系和实体之间的语义空间差异。为了解决这些问题,本研究提出了一种基于双向门控循环单元(Bi-GRU)网络的DSRE知识图注意力模型(BG2KGA)。BG2KGA在附加知识图的引导下包含词级和句子级的关注,分别突出对最终关系表示贡献更大的关键词和句子。此外,将附加的知识图嵌入到多语义向量空间中,以捕获1-N、N-1和N-N实体对中的关系。实验是在一个广泛使用的远程监督数据集上进行的。实验结果表明,所提出的模型优于现有的方法,与现有模型相比,可将Precision/Recall (PR)曲线面积提高8% ~ 16%;最佳情况下,BG2KGA的AUC可达0.468。
{"title":"Distant-Supervised Relation Extraction with Hierarchical Attention Based on Knowledge Graph","authors":"Hong Yao, Lijun Dong, Shiqi Zhen, Xiaojun Kang, Xinchuan Li, Qingzhong Liang","doi":"10.1109/ICTAI.2019.00040","DOIUrl":"https://doi.org/10.1109/ICTAI.2019.00040","url":null,"abstract":"Relation Extraction is concentrated on finding the unknown relational facts automatically from the unstructured texts. Most current methods, especially the distant supervision relation extraction (DSRE), have been successfully applied to achieve this goal. DSRE combines knowledge graph and text corpus to corporately generate plenty of labeled data without human efforts. However, the existing methods of DSRE ignore the noisy words within sentences and suffer from the noisy labelling problem; the additional knowledge is represented in a common semantic space and ignores the semantic-space difference between relations and entities. To address these problems, this study proposes a novel hierarchical attention model, named the Bi-GRU-based Knowledge Graph Attention Model (BG2KGA) for DSRE using the Bidirectional Gated Recurrent Unit (Bi-GRU) network. BG2KGA contains the word-level and sentence-level attentions with the guidance of additional knowledge graph, to highlight the key words and sentences respectively which can contribute more to the final relation representations. Further-more, the additional knowledge graph are embedded in the multi-semantic vector space to capture the relations in 1-N, N-1 and N-N entity pairs. Experiments are conducted on a widely used dataset for distant supervision. The experimental results have shown that the proposed model outperforms the current methods and can improve the Precision/Recall (PR) curve area by 8% to 16% compared to the state-of-the-art models; the AUC of BG2KGA can reach 0.468 in the best case.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128440596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
General Chair’s Foreword 主席的前言
Mohammed Atiquzzaman
The 17 annual IEEE International Conference on Bioinformatics and Bioengineering aims at building synergy between Bioinformatics and Bioengineering/Biomedical, two complementary disciplines that hold great promise for the advancement of research and development in complex medical and biological systems, agriculture, environment, public health, drug design. Research and development in these two areas are impacting the science and technology in fields such as medicine, food production, forensics, etc. by advancing fundamental concepts in molecular biology, by helping us understand living organisms at multiple levels, by developing innovative implants and bio-prosthetics, and by improving tools and techniques for the detection, prevention and treatment of diseases. The BIBE series provides a common platform for the cross fertilization of ideas, and for shaping knowledge and scientific achievements by bridging these two very important and complementary disciplines into an interactive and attractive forum.
第17届年度IEEE国际生物信息学和生物工程会议旨在建立生物信息学和生物工程/生物医学之间的协同作用,这两个互补的学科对复杂的医学和生物系统、农业、环境、公共卫生、药物设计的研究和发展有着巨大的希望。这两个领域的研究和发展通过推进分子生物学的基本概念,通过帮助我们在多个层面上了解生物体,通过开发创新的植入物和生物修复术,以及通过改进检测、预防和治疗疾病的工具和技术,正在影响医学、食品生产、法医学等领域的科学和技术。BIBE系列提供了一个共同的平台,通过将这两个非常重要和互补的学科连接成一个互动和有吸引力的论坛,相互交流思想,形成知识和科学成就。
{"title":"General Chair’s Foreword","authors":"Mohammed Atiquzzaman","doi":"10.1109/ictai.2019.00005","DOIUrl":"https://doi.org/10.1109/ictai.2019.00005","url":null,"abstract":"The 17 annual IEEE International Conference on Bioinformatics and Bioengineering aims at building synergy between Bioinformatics and Bioengineering/Biomedical, two complementary disciplines that hold great promise for the advancement of research and development in complex medical and biological systems, agriculture, environment, public health, drug design. Research and development in these two areas are impacting the science and technology in fields such as medicine, food production, forensics, etc. by advancing fundamental concepts in molecular biology, by helping us understand living organisms at multiple levels, by developing innovative implants and bio-prosthetics, and by improving tools and techniques for the detection, prevention and treatment of diseases. The BIBE series provides a common platform for the cross fertilization of ideas, and for shaping knowledge and scientific achievements by bridging these two very important and complementary disciplines into an interactive and attractive forum.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128714336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TPOT-SH: A Faster Optimization Algorithm to Solve the AutoML Problem on Large Datasets TPOT-SH:一种解决大型数据集上AutoML问题的快速优化算法
Laurent Parmentier, Olivier Nicol, Laetitia Vermeulen-Jourdan, Marie-Éléonore Kessaci
Data are omnipresent nowadays and contain knowledge and patterns that machine learning (ML) algorithms can extract so as to take decisions or perform a task without explicit instructions. To achieve that, these algorithms learn a mathematical model using sample data. However, there are numerous ML algorithms, all learning different models of reality. Furthermore, the behavior of these algorithms can be altered by modifying some of their plethora of hyperparameters. Cleverly tuning these algorithms is costly but essential to reach decent performance. Yet it requires a lot of expertise and remains hard even for experts who tend to resort to exploration-only approaches like random search and grid search. The field of AutoML has consequently emerged in the quest for automatized machine learning processes that would be less expensive than brute force searches. In this paper we continue the research initiated on the Tree-based Pipeline Optimization Tool (TPOT), an AutoML based on Evolutionary Algorithms (EA). EAs are typically slow to converge which makes TPOT incapable of scaling to large datasets. As a consequence, we introduce TPOT-SH inspired from the concept of Successive Halving used in Multi-Armed Bandit problems. This solution allows TPOT to explore the search space faster and have much better performance on larger datasets.
如今,数据无处不在,包含机器学习(ML)算法可以提取的知识和模式,以便在没有明确指令的情况下做出决策或执行任务。为了实现这一点,这些算法使用样本数据学习数学模型。然而,有许多ML算法,都学习不同的现实模型。此外,这些算法的行为可以通过修改它们过多的超参数来改变。巧妙地调整这些算法是昂贵的,但对于达到良好的性能是必要的。然而,它需要大量的专业知识,即使对于那些倾向于采用随机搜索和网格搜索等探索方法的专家来说,它仍然很难。因此,AutoML领域出现在寻求自动化机器学习过程的过程中,这将比暴力搜索更便宜。本文继续对基于树的管道优化工具(TPOT)的研究,这是一种基于进化算法(EA)的自动化工具。ea通常收敛缓慢,这使得TPOT无法扩展到大型数据集。因此,我们引入了TPOT-SH,其灵感来自于多武装强盗问题中使用的连续减半概念。该解决方案允许TPOT更快地探索搜索空间,并且在更大的数据集上具有更好的性能。
{"title":"TPOT-SH: A Faster Optimization Algorithm to Solve the AutoML Problem on Large Datasets","authors":"Laurent Parmentier, Olivier Nicol, Laetitia Vermeulen-Jourdan, Marie-Éléonore Kessaci","doi":"10.1109/ICTAI.2019.00072","DOIUrl":"https://doi.org/10.1109/ICTAI.2019.00072","url":null,"abstract":"Data are omnipresent nowadays and contain knowledge and patterns that machine learning (ML) algorithms can extract so as to take decisions or perform a task without explicit instructions. To achieve that, these algorithms learn a mathematical model using sample data. However, there are numerous ML algorithms, all learning different models of reality. Furthermore, the behavior of these algorithms can be altered by modifying some of their plethora of hyperparameters. Cleverly tuning these algorithms is costly but essential to reach decent performance. Yet it requires a lot of expertise and remains hard even for experts who tend to resort to exploration-only approaches like random search and grid search. The field of AutoML has consequently emerged in the quest for automatized machine learning processes that would be less expensive than brute force searches. In this paper we continue the research initiated on the Tree-based Pipeline Optimization Tool (TPOT), an AutoML based on Evolutionary Algorithms (EA). EAs are typically slow to converge which makes TPOT incapable of scaling to large datasets. As a consequence, we introduce TPOT-SH inspired from the concept of Successive Halving used in Multi-Armed Bandit problems. This solution allows TPOT to explore the search space faster and have much better performance on larger datasets.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128729019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Texture Recognition on Metal Surface using Order-Less Scale Invariant GLAC 基于无阶尺度不变性GLAC的金属表面纹理识别
Shangbo Mao, V. Natarajan, L. Chia, G. Huang
Inspection of metal surface textures using computer vision and machine learning techniques plays an important role in Automated Visual Inspection (AVI) systems. Texture recognition on metal surface is challenging because the characteristics of each texture type are dependent on the properties of the metal surface when captured under different lighting conditions. Since these textures have no obvious repetitive patterns like general textures, this results in high intra-class diversities. Prior knowledge has shown that surface properties such as surface curvature and depth are discriminant to different texture types on metal surface. Since scale, shapes and location of textures within the same type are not fixed, scale property and spatial ordering information are less important for differentiating between texture types. There-fore, surface property, scale invariance and order-less property should be considered when exploring a suitable image feature for metal surface texture recognition. This paper proposes Order-less Scale Invariant Gradient Local Auto-Correlation (OS-GLAC) which meets all three requirements for robust texture recognition. The experiment results show that OS-GLAC is robust to separate different metal surface texture types. In addition, we observed that OS-GLAC is not only useful for texture recognition on metal surface but also for general texture recognition when combined with pre-trained deep learning features as these two features capture complimentary information. The experiment results show that such a combination of OS-GLAC achieves competitive results on three well-established general texture datasets i.e., KTH-TIP-2a, KTH-TIPS-2b and FMD.
利用计算机视觉和机器学习技术检测金属表面纹理在自动视觉检测(AVI)系统中起着重要的作用。金属表面的纹理识别具有挑战性,因为每种纹理类型的特征取决于在不同光照条件下捕获的金属表面的特性。由于这些纹理不像一般纹理那样具有明显的重复图案,这导致了高的类内多样性。先前的知识表明,表面曲率和深度等表面性质对金属表面的不同纹理类型具有区别性。由于同一类型内纹理的尺度、形状和位置不固定,尺度属性和空间排序信息对于纹理类型的区分不太重要。因此,在探索适合金属表面纹理识别的图像特征时,应综合考虑表面特性、尺度不变性和无序性。本文提出的无阶尺度不变梯度局部自相关(OS-GLAC)算法满足了鲁棒纹理识别的这三个要求。实验结果表明,OS-GLAC对不同金属表面织构类型的分离具有较强的鲁棒性。此外,我们观察到OS-GLAC不仅可用于金属表面的纹理识别,而且当与预训练的深度学习特征相结合时,也可用于一般纹理识别,因为这两个特征捕获了互补信息。实验结果表明,这种OS-GLAC组合在KTH-TIP-2a、kth - tip -2b和FMD这三个已建立的通用纹理数据集上取得了具有竞争力的结果。
{"title":"Texture Recognition on Metal Surface using Order-Less Scale Invariant GLAC","authors":"Shangbo Mao, V. Natarajan, L. Chia, G. Huang","doi":"10.1109/ICTAI.2019.00093","DOIUrl":"https://doi.org/10.1109/ICTAI.2019.00093","url":null,"abstract":"Inspection of metal surface textures using computer vision and machine learning techniques plays an important role in Automated Visual Inspection (AVI) systems. Texture recognition on metal surface is challenging because the characteristics of each texture type are dependent on the properties of the metal surface when captured under different lighting conditions. Since these textures have no obvious repetitive patterns like general textures, this results in high intra-class diversities. Prior knowledge has shown that surface properties such as surface curvature and depth are discriminant to different texture types on metal surface. Since scale, shapes and location of textures within the same type are not fixed, scale property and spatial ordering information are less important for differentiating between texture types. There-fore, surface property, scale invariance and order-less property should be considered when exploring a suitable image feature for metal surface texture recognition. This paper proposes Order-less Scale Invariant Gradient Local Auto-Correlation (OS-GLAC) which meets all three requirements for robust texture recognition. The experiment results show that OS-GLAC is robust to separate different metal surface texture types. In addition, we observed that OS-GLAC is not only useful for texture recognition on metal surface but also for general texture recognition when combined with pre-trained deep learning features as these two features capture complimentary information. The experiment results show that such a combination of OS-GLAC achieves competitive results on three well-established general texture datasets i.e., KTH-TIP-2a, KTH-TIPS-2b and FMD.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127024862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Divide and Conquer Algorithm for Dominance Testing in Acyclic CP-Nets 一种非循环cp -网络优势测试的分治算法
Sultan Ahmed, Malek Mouhoub
The Conditional Preference Network (CP-net) represents user's conditional ceteris paribus (all else being equal) preference statements in a graphical manner. In general, an acyclic CP-net induces a strict partial order over the outcomes. The task of comparing two outcomes (dominance testing) is generally PSPACE-complete, which is a limitation for this intuitive model, especially when representing and solving preference-based constrained optimization problems. In order to overcome this limitation in practice, we propose a divide and conquer algorithm that compares two outcomes according to dominance testing. The algorithm divides the original CP-net into sub CP-nets, and recursively calls itself for each of the sub CP-nets until it reaches to a termination criterion. In the termination criterion, the answer of the dominance query is returned. With a theoretical analysis of the time performance, we demonstrate that the proposed algorithm outperforms the existing methods.
条件偏好网络(CP-net)以图形方式表示用户的条件其他条件(其他条件相同)偏好声明。一般来说,一个无环cp网在其结果上推导出严格的偏序。比较两个结果(优势测试)的任务通常是PSPACE-complete的,这是这种直观模型的限制,特别是在表示和解决基于偏好的约束优化问题时。为了在实践中克服这一限制,我们提出了一种分而治之的算法,根据优势度测试对两个结果进行比较。该算法将原始CP-net划分为子CP-net,并对每个子CP-net递归调用自己,直到达到终止准则。在终止条件中,返回支配性查询的答案。通过对时间性能的理论分析,我们证明了该算法优于现有的方法。
{"title":"A Divide and Conquer Algorithm for Dominance Testing in Acyclic CP-Nets","authors":"Sultan Ahmed, Malek Mouhoub","doi":"10.1109/ICTAI.2019.00062","DOIUrl":"https://doi.org/10.1109/ICTAI.2019.00062","url":null,"abstract":"The Conditional Preference Network (CP-net) represents user's conditional ceteris paribus (all else being equal) preference statements in a graphical manner. In general, an acyclic CP-net induces a strict partial order over the outcomes. The task of comparing two outcomes (dominance testing) is generally PSPACE-complete, which is a limitation for this intuitive model, especially when representing and solving preference-based constrained optimization problems. In order to overcome this limitation in practice, we propose a divide and conquer algorithm that compares two outcomes according to dominance testing. The algorithm divides the original CP-net into sub CP-nets, and recursively calls itself for each of the sub CP-nets until it reaches to a termination criterion. In the termination criterion, the answer of the dominance query is returned. With a theoretical analysis of the time performance, we demonstrate that the proposed algorithm outperforms the existing methods.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123361229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Accurate and Robust RGB-D Dense Mapping with Inertial Fusion and Deformation-Graph Optimization 基于惯性融合和变形图优化的精确鲁棒RGB-D密集映射
Yong Liu, Liming Bao, Chaofan Zhang, Wen Zhang, Yingwei Xia
RGB-D dense mapping has become more and more popular, however, when encountering rapid movement or shake, the robustness and accuracy of most RGB-D dense mapping methods are degraded and the generated maps are overlapped or distorted, due to the drift of pose estimation. In this paper, we present a novel RGB-D dense mapping method, which can obtain accurate, robust and global consistency map even in the above complex conditions. Firstly, the improved ORBSLAM method, which tightly-couples RGB-D information and inertial information to estimate the current pose of robot, is firstly introduced for accurate pose estimation rather than traditional frame-to-frame method in most RGB-D dense mapping methods. Besides, the TSDF (Truncated Signed Distance Function) method is used to effectively fuse depth frame into a global model, and to keep the global consistency of the generated map. Furthermore, since the drift error is inevitable, a deformation graph is constructed to minimize the consistent error in global model, to further improve the mapping performance. The performance of the proposed RGB-D dense mapping method was validated by extensive localization and mapping experiments on public datasets and real scene datasets, and it showed strongly accuracy and robustness over other state-of-the-art methods. What's more, the proposed method can achieve real-time performance implemented on GPU.
RGB-D密集映射越来越受欢迎,然而,当遇到快速运动或震动时,由于姿态估计的漂移,大多数RGB-D密集映射方法的鲁棒性和精度下降,生成的地图重叠或扭曲。本文提出了一种新的RGB-D密集映射方法,即使在上述复杂条件下也能得到精确、鲁棒和全局一致性的映射。首先,引入改进的ORBSLAM方法,将RGB-D信息与惯性信息紧密耦合来估计机器人当前姿态,以准确估计姿态,而不是传统的RGB-D密集映射方法中的帧对帧方法。利用TSDF (Truncated Signed Distance Function)方法有效地将深度帧融合到全局模型中,保证了生成地图的全局一致性。此外,由于漂移误差是不可避免的,为了使全局模型的一致性误差最小化,构造了变形图,进一步提高了映射性能。在公共数据集和真实场景数据集上进行了大量的定位和映射实验,验证了所提出的RGB-D密集映射方法的性能,与其他最新方法相比,它具有很强的准确性和鲁棒性。该方法在GPU上实现了实时性。
{"title":"Accurate and Robust RGB-D Dense Mapping with Inertial Fusion and Deformation-Graph Optimization","authors":"Yong Liu, Liming Bao, Chaofan Zhang, Wen Zhang, Yingwei Xia","doi":"10.1109/ICTAI.2019.00249","DOIUrl":"https://doi.org/10.1109/ICTAI.2019.00249","url":null,"abstract":"RGB-D dense mapping has become more and more popular, however, when encountering rapid movement or shake, the robustness and accuracy of most RGB-D dense mapping methods are degraded and the generated maps are overlapped or distorted, due to the drift of pose estimation. In this paper, we present a novel RGB-D dense mapping method, which can obtain accurate, robust and global consistency map even in the above complex conditions. Firstly, the improved ORBSLAM method, which tightly-couples RGB-D information and inertial information to estimate the current pose of robot, is firstly introduced for accurate pose estimation rather than traditional frame-to-frame method in most RGB-D dense mapping methods. Besides, the TSDF (Truncated Signed Distance Function) method is used to effectively fuse depth frame into a global model, and to keep the global consistency of the generated map. Furthermore, since the drift error is inevitable, a deformation graph is constructed to minimize the consistent error in global model, to further improve the mapping performance. The performance of the proposed RGB-D dense mapping method was validated by extensive localization and mapping experiments on public datasets and real scene datasets, and it showed strongly accuracy and robustness over other state-of-the-art methods. What's more, the proposed method can achieve real-time performance implemented on GPU.","PeriodicalId":346657,"journal":{"name":"2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127932370","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1