首页 > 最新文献

2020 International Conference on Machine Learning and Cybernetics (ICMLC)最新文献

英文 中文
Linear Parameter Varying Control of Wind Energy Conversion Systems in Partial Load 部分负荷下风能转换系统的线性参数变化控制
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469556
Jing Xu, Qing Sun
This paper is dealt with the rotor speed tracking problem of variable-speed wind turbine systems operating under the partial load condition. Singular perturbation techniques are used to characterize the two-time-scale property of the wind turbine system, and a linear parameter varying (LPV) model is formed to approximate nonlinear behaviours of a wind turbine system. Based on the slow-fast decomposition method, slow and fast subsystems are constructed: one for the mechanical dynamics and the other for the electrical dynamics. Slow and fast controls are derived, respectively, and then a local state feedback controller, sum of the slow and fast control, is formulated. A design procedure, using the linear parameter varying control to combine local controllers, is proposed to guarantee the robustness of the closed-loop nonlinear wind turbine system. Numerical examples are given to show the validity of the proposed control scheme.
研究了变转速风力发电系统在部分负荷工况下的转子转速跟踪问题。利用奇异摄动技术表征风力发电机组系统的双时间尺度特性,建立线性变参数模型来近似风力发电机组系统的非线性行为。基于慢-快分解方法,分别构建了机械动力学和电动力学的慢-快子系统。分别推导出慢速控制和快速控制,然后构造出一个局部状态反馈控制器,即慢速控制和快速控制的总和。为了保证非线性风力发电机组闭环系统的鲁棒性,提出了一种采用线性变参数控制与局部控制器相结合的设计方法。数值算例表明了所提控制方案的有效性。
{"title":"Linear Parameter Varying Control of Wind Energy Conversion Systems in Partial Load","authors":"Jing Xu, Qing Sun","doi":"10.1109/ICMLC51923.2020.9469556","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469556","url":null,"abstract":"This paper is dealt with the rotor speed tracking problem of variable-speed wind turbine systems operating under the partial load condition. Singular perturbation techniques are used to characterize the two-time-scale property of the wind turbine system, and a linear parameter varying (LPV) model is formed to approximate nonlinear behaviours of a wind turbine system. Based on the slow-fast decomposition method, slow and fast subsystems are constructed: one for the mechanical dynamics and the other for the electrical dynamics. Slow and fast controls are derived, respectively, and then a local state feedback controller, sum of the slow and fast control, is formulated. A design procedure, using the linear parameter varying control to combine local controllers, is proposed to guarantee the robustness of the closed-loop nonlinear wind turbine system. Numerical examples are given to show the validity of the proposed control scheme.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128559694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Segmentation Based Backdoor Attack Detection 基于分段的后门攻击检测
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469037
Natasha Kees, Yaxuan Wang, Yiling Jiang, Fang Lue, P. Chan
Backdoor attacks have become a serious security concern because of the rising popularity of unverified third party machine learning resources such as datasets, pretrained models, and processors. Pre-trained models and shared datasets have become popular due to the high training requirement of deep learning. This raises a serious security concern since the shared models and datasets may be modified intentionally in order to reduce system efficacy. A backdoor attack is difficult to detect since the embedded adversarial decision rule will only be triggered by a pre-chosen pattern, and the contaminated model behaves normally on benign samples. This paper devises a backdoor attack detection method to identify whether a sample is attacked for image-related applications. The information consistence provided by an image without each segment is considered. The absence of the segment containing a trigger strongly affects the consistence since the trigger dominates the decision. Our proposed method is evaluated empirically to confirm the effectiveness in various settings. As there is no restrictive assumption on the trigger of backdoor attacks, we expect our proposed model is generalizable and can defend against a wider range of modern attacks.
由于未经验证的第三方机器学习资源(如数据集、预训练模型和处理器)越来越受欢迎,后门攻击已经成为一个严重的安全问题。由于深度学习的高训练要求,预训练模型和共享数据集已成为流行。这引起了严重的安全问题,因为共享模型和数据集可能会被有意地修改,以降低系统效率。由于嵌入的对抗决策规则只会由预先选择的模式触发,并且受污染的模型在良性样本上表现正常,因此后门攻击很难被检测到。本文针对图像相关应用,设计了一种后门攻击检测方法来识别样本是否受到攻击。不考虑每个片段的图像所提供的信息一致性。缺少包含触发器的片段会严重影响一致性,因为触发器主导决策。我们提出的方法进行了经验评估,以确认在各种设置的有效性。由于对后门攻击的触发没有限制性假设,我们希望我们提出的模型具有通用性,并且可以抵御更大范围的现代攻击。
{"title":"Segmentation Based Backdoor Attack Detection","authors":"Natasha Kees, Yaxuan Wang, Yiling Jiang, Fang Lue, P. Chan","doi":"10.1109/ICMLC51923.2020.9469037","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469037","url":null,"abstract":"Backdoor attacks have become a serious security concern because of the rising popularity of unverified third party machine learning resources such as datasets, pretrained models, and processors. Pre-trained models and shared datasets have become popular due to the high training requirement of deep learning. This raises a serious security concern since the shared models and datasets may be modified intentionally in order to reduce system efficacy. A backdoor attack is difficult to detect since the embedded adversarial decision rule will only be triggered by a pre-chosen pattern, and the contaminated model behaves normally on benign samples. This paper devises a backdoor attack detection method to identify whether a sample is attacked for image-related applications. The information consistence provided by an image without each segment is considered. The absence of the segment containing a trigger strongly affects the consistence since the trigger dominates the decision. Our proposed method is evaluated empirically to confirm the effectiveness in various settings. As there is no restrictive assumption on the trigger of backdoor attacks, we expect our proposed model is generalizable and can defend against a wider range of modern attacks.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130934182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Dynamic Multi-Target Assignment with Decentralised Online Learning to Achieve Multiple Synchronised Goals 动态多目标分配与分散在线学习实现多个同步目标
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469589
D. Nguyen, Arvind Rajagopalan, Jijoong Kim, C. Lim, David Hubczenko
In this paper, we present a decentralised online decision-making strategy for multi-agents carrying out a cooperative mission. Our solution provides the capability for agents to dynamically choose their best targets and arrive at their target locations simultaneously at pre-specified angles. Additionally, the agents are able to cope with any obstacles encountered without compromising the mission goals. The algorithm combines game-theoretic regret minimisation with current best-practice solutions to satisfy complex mission requirements. It is decentralised and readily scalable to a large number of agents for wide area operations. Simulation results show it can be applied to teams of agents in challenging environments and exhibits fast convergence and adaptability.
在本文中,我们提出了一种多智能体执行合作任务的分散在线决策策略。我们的解决方案为智能体提供了动态选择最佳目标并以预先指定的角度同时到达目标位置的能力。此外,特工们能够在不影响任务目标的情况下处理遇到的任何障碍。该算法将博弈论的遗憾最小化与当前的最佳实践解决方案相结合,以满足复杂的任务要求。它是分散的,并且易于扩展到大量代理进行广域操作。仿真结果表明,该方法可以应用于复杂环境下的智能体团队,具有较快的收敛性和适应性。
{"title":"Dynamic Multi-Target Assignment with Decentralised Online Learning to Achieve Multiple Synchronised Goals","authors":"D. Nguyen, Arvind Rajagopalan, Jijoong Kim, C. Lim, David Hubczenko","doi":"10.1109/ICMLC51923.2020.9469589","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469589","url":null,"abstract":"In this paper, we present a decentralised online decision-making strategy for multi-agents carrying out a cooperative mission. Our solution provides the capability for agents to dynamically choose their best targets and arrive at their target locations simultaneously at pre-specified angles. Additionally, the agents are able to cope with any obstacles encountered without compromising the mission goals. The algorithm combines game-theoretic regret minimisation with current best-practice solutions to satisfy complex mission requirements. It is decentralised and readily scalable to a large number of agents for wide area operations. Simulation results show it can be applied to teams of agents in challenging environments and exhibits fast convergence and adaptability.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116990283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Tensor Fusion Network with Hybrid Attention for Multimodal Sentiment Analysis 基于混合关注的多张量融合网络多模态情感分析
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469572
Haiwei Xue, Xueming Yan, Shengyi Jiang, Helang Lai
Multimodal sentiment analysis is a highly sought-after topic in natural language processing. In this paper, a multi-tensor fusion network with hybrid attention architecture for multimodal sentiment analysis is proposed. Firstly, Bi-LSTM is applied to encode contextual representation in different modalities. Following this, modalities features are extracted and interacted with by the hybrid attention mechanism. Finally, multi-tensor fusion approach is used to further enhance the effectiveness of fusing interaction features in different modalities. The proposed approach outperforms the existing advanced approaches on two benchmarks through a series of regression experiments for sentiment intensity prediction, as it improves F1-score by 3.4% and 2.1% points respectively. Our architecture would be open-sourced on Github1 for researchers to use.
多模态情感分析是自然语言处理领域一个备受关注的课题。针对多模态情感分析,提出了一种具有混合注意结构的多张量融合网络。首先,将Bi-LSTM应用于不同模式的上下文表示编码。在此基础上,通过混合注意机制提取模态特征并与之交互。最后,采用多张量融合方法进一步提高了不同模态下相互作用特征融合的有效性。通过一系列情绪强度预测的回归实验,本文提出的方法在两个基准上优于现有的先进方法,因为它将f1得分分别提高了3.4%和2.1%。我们的架构将在Github1上开源,供研究人员使用。
{"title":"Multi-Tensor Fusion Network with Hybrid Attention for Multimodal Sentiment Analysis","authors":"Haiwei Xue, Xueming Yan, Shengyi Jiang, Helang Lai","doi":"10.1109/ICMLC51923.2020.9469572","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469572","url":null,"abstract":"Multimodal sentiment analysis is a highly sought-after topic in natural language processing. In this paper, a multi-tensor fusion network with hybrid attention architecture for multimodal sentiment analysis is proposed. Firstly, Bi-LSTM is applied to encode contextual representation in different modalities. Following this, modalities features are extracted and interacted with by the hybrid attention mechanism. Finally, multi-tensor fusion approach is used to further enhance the effectiveness of fusing interaction features in different modalities. The proposed approach outperforms the existing advanced approaches on two benchmarks through a series of regression experiments for sentiment intensity prediction, as it improves F1-score by 3.4% and 2.1% points respectively. Our architecture would be open-sourced on Github1 for researchers to use.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124492206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Improving Self-Attention Based News Recommendation with Document Classification 基于文档分类改进自关注的新闻推荐
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469577
Hao Ke
Online news services have become the first choice to read news for many internet users. However, thousands of news articles are released and updated on a daily basis, which makes it impossible for users to select relevant and intriguing stories by themselves. The news recommendation models are developed to tackle information overload. News stories on various topics are recommended to users from diversified backgrounds by an automated system. In this paper, we propose a neural news recommendation model with self-attention jointly trained by document classification, SARC. The self-attention mechanism captures the long-term relationships among words. The joint training of recommendation and classification improves representation and generalization capability. We demonstrate our model’s superior performances over other state-of-the-art baselines on a large-scale news recommendation dataset.
在线新闻服务已经成为许多网民阅读新闻的首选。然而,每天都有成千上万的新闻发布和更新,这使得用户无法自己选择相关和有趣的故事。新闻推荐模型是为了解决信息过载问题而开发的。自动系统向不同背景的用户推荐不同主题的新闻故事。在本文中,我们提出了一个由文档分类SARC联合训练的具有自注意的神经新闻推荐模型。自我注意机制捕捉单词之间的长期关系。推荐和分类的联合训练提高了表示和泛化能力。我们在一个大规模的新闻推荐数据集上展示了我们的模型比其他最先进的基线的优越性能。
{"title":"Improving Self-Attention Based News Recommendation with Document Classification","authors":"Hao Ke","doi":"10.1109/ICMLC51923.2020.9469577","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469577","url":null,"abstract":"Online news services have become the first choice to read news for many internet users. However, thousands of news articles are released and updated on a daily basis, which makes it impossible for users to select relevant and intriguing stories by themselves. The news recommendation models are developed to tackle information overload. News stories on various topics are recommended to users from diversified backgrounds by an automated system. In this paper, we propose a neural news recommendation model with self-attention jointly trained by document classification, SARC. The self-attention mechanism captures the long-term relationships among words. The joint training of recommendation and classification improves representation and generalization capability. We demonstrate our model’s superior performances over other state-of-the-art baselines on a large-scale news recommendation dataset.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125733004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Three-stage Method for Classification of Binary Imbalanced Big Data 二元不平衡大数据的三阶段分类方法
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469568
Jun-Hai Zhai, Sufang Zhang, Mo-Han Wang, Yan Li
In the real world, there are many imbalanced data classification problems, such as extreme weather prediction, software defect prediction, machinery fault diagnosis, spam filtering, etc. It has important theoretical and practical value to study the problem of imbalanced data classification. In the framework of binary imbalanced data classification, a three-stage method for classification of binary imbalanced big data was proposed in this paper. Specifically, in the first stage, the negative class big data was clustered into K clusters by K-means algorithm on Hadoop platform. In the second stage, we use instance selection method to select important samples from each cluster in parallel, and obtain K negative class subsets. In the third stage, we first construct K balanced training sets which consist of negative class subset and positive class subset, and then train K classifiers, and finally we integrate these classifiers to classify the unseen samples. Some experiments are conducted to compare the proposed method with two state-of-the-art methods on G-means. The experimental results demonstrate that the proposed method is more effective and efficient than the compared approaches.
在现实世界中,存在着许多不平衡数据分类问题,如极端天气预测、软件缺陷预测、机械故障诊断、垃圾邮件过滤等。研究不平衡数据分类问题具有重要的理论和实用价值。在二元不平衡数据分类框架下,提出了一种二元不平衡大数据的三阶段分类方法。具体而言,第一阶段在Hadoop平台上通过K-means算法将负类大数据聚类成K个簇。在第二阶段,我们使用实例选择方法从每个聚类中并行选择重要样本,并获得K个负类子集。在第三阶段,我们首先构造由负类子集和正类子集组成的K个平衡训练集,然后训练K个分类器,最后我们将这些分类器整合到看不见的样本中进行分类。通过实验将本文提出的方法与两种最先进的g均值方法进行了比较。实验结果表明,该方法比其他方法更有效。
{"title":"A Three-stage Method for Classification of Binary Imbalanced Big Data","authors":"Jun-Hai Zhai, Sufang Zhang, Mo-Han Wang, Yan Li","doi":"10.1109/ICMLC51923.2020.9469568","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469568","url":null,"abstract":"In the real world, there are many imbalanced data classification problems, such as extreme weather prediction, software defect prediction, machinery fault diagnosis, spam filtering, etc. It has important theoretical and practical value to study the problem of imbalanced data classification. In the framework of binary imbalanced data classification, a three-stage method for classification of binary imbalanced big data was proposed in this paper. Specifically, in the first stage, the negative class big data was clustered into K clusters by K-means algorithm on Hadoop platform. In the second stage, we use instance selection method to select important samples from each cluster in parallel, and obtain K negative class subsets. In the third stage, we first construct K balanced training sets which consist of negative class subset and positive class subset, and then train K classifiers, and finally we integrate these classifiers to classify the unseen samples. Some experiments are conducted to compare the proposed method with two state-of-the-art methods on G-means. The experimental results demonstrate that the proposed method is more effective and efficient than the compared approaches.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116314148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Selection for Big Data Based on Mapreduce and Voting Mechanism 基于Mapreduce和投票机制的大数据特征选择
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469541
Sufang Zhang, Jun-Hai Zhai, Shi Tian, Xiang Zhou, Yan Li
With the rapid development of computer network technology and wireless sensor technology, as well as the arrival of the era of big data, the dimension and sample number of data are growing rapidly. Accordingly, it is important to investigate the problem of feature selection for big data and to design feature selection algorithm for big data. Based on MapReduce and voting mechanism, a feature selection method for big data is proposed in this paper. The proposed methods include three steps: Firstly, partition big data set into m subsets, and deploy the subsets to m computing nodes of Hadoop. Secondly, on the m computing nodes, we employ a feature selection algorithm based on genetic algorithm to select important features in parallel using local data subset, and obtain m feature subsets. Finally, for each feature, m feature subsets are used to vote on it, and the final feature subset is selected according to the voting results. Experimental results on four big data sets demonstrate that the proposed method is effective and efficient.
随着计算机网络技术和无线传感器技术的快速发展,以及大数据时代的到来,数据的维度和样本数量都在快速增长。因此,研究大数据特征选择问题,设计大数据特征选择算法具有重要意义。提出了一种基于MapReduce和投票机制的大数据特征选择方法。提出的方法包括三个步骤:首先,将大数据集划分为m个子集,并将这些子集部署到Hadoop的m个计算节点上。其次,在m个计算节点上,采用基于遗传算法的特征选择算法,利用局部数据子集并行选择重要特征,得到m个特征子集;最后,对每个特征使用m个特征子集进行投票,根据投票结果选出最终的特征子集。在四个大数据集上的实验结果表明,该方法是有效的。
{"title":"Feature Selection for Big Data Based on Mapreduce and Voting Mechanism","authors":"Sufang Zhang, Jun-Hai Zhai, Shi Tian, Xiang Zhou, Yan Li","doi":"10.1109/ICMLC51923.2020.9469541","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469541","url":null,"abstract":"With the rapid development of computer network technology and wireless sensor technology, as well as the arrival of the era of big data, the dimension and sample number of data are growing rapidly. Accordingly, it is important to investigate the problem of feature selection for big data and to design feature selection algorithm for big data. Based on MapReduce and voting mechanism, a feature selection method for big data is proposed in this paper. The proposed methods include three steps: Firstly, partition big data set into m subsets, and deploy the subsets to m computing nodes of Hadoop. Secondly, on the m computing nodes, we employ a feature selection algorithm based on genetic algorithm to select important features in parallel using local data subset, and obtain m feature subsets. Finally, for each feature, m feature subsets are used to vote on it, and the final feature subset is selected according to the voting results. Experimental results on four big data sets demonstrate that the proposed method is effective and efficient.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"390 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124581120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Selection on Imbalanced Data and Its Application on Toxicity Prediction 不平衡数据特征选择及其在毒性预测中的应用
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469564
Jincheng Li
The principle of computational toxicity prediction is that chemicals with similar molecular structures may possess similar toxicological pathways and effects. There have been many methods that represented each chemical by a set of descriptors, which are identified by experts as promising properties for predicting biological activity or toxicity. These chemical descriptors play a critical role in computational methods, that task correlated descriptors are favorable to achieve high prediction performance. However, there are few work compare the effectiveness of chemical descriptors and evaluate their performance in toxicity prediction. In this paper, we propose a novel ensemble feature selection method based on random under-sampling to analysis the effectiveness of chemical descriptors adopted in toxicity prediction application. The proposed method is efficient and can relief the imbalanced data problem of toxicity. Experiment results on the tox21 toxicity prediction dataset show that "molecular property", "connectivity" and "topological" descriptor are the three most important descriptors for toxicity prediction tasks among the 12 popular descriptors adopted in toxicity prediction applications. The results of this study can be used as a guide to propose new descriptors for chemical toxicity prediction.
计算毒性预测的原理是具有相似分子结构的化学物质可能具有相似的毒理学途径和作用。有许多方法可以用一组描述符来表示每种化学物质,这些描述符被专家认为是预测生物活性或毒性的有希望的特性。这些化学描述符在计算方法中起着至关重要的作用,任务相关描述符有利于实现较高的预测性能。然而,对化学描述符在毒性预测中的有效性进行比较和评价的工作很少。本文提出了一种基于随机欠采样的集合特征选择方法,分析了化学描述符在毒性预测中的有效性。该方法有效地解决了毒性数据不平衡的问题。在tox21毒性预测数据集上的实验结果表明,在毒性预测应用中常用的12种描述符中,“分子性质”、“连通性”和“拓扑”描述符是毒性预测任务中最重要的三个描述符。本研究结果可为提出新的化学毒性预测描述符提供指导。
{"title":"Feature Selection on Imbalanced Data and Its Application on Toxicity Prediction","authors":"Jincheng Li","doi":"10.1109/ICMLC51923.2020.9469564","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469564","url":null,"abstract":"The principle of computational toxicity prediction is that chemicals with similar molecular structures may possess similar toxicological pathways and effects. There have been many methods that represented each chemical by a set of descriptors, which are identified by experts as promising properties for predicting biological activity or toxicity. These chemical descriptors play a critical role in computational methods, that task correlated descriptors are favorable to achieve high prediction performance. However, there are few work compare the effectiveness of chemical descriptors and evaluate their performance in toxicity prediction. In this paper, we propose a novel ensemble feature selection method based on random under-sampling to analysis the effectiveness of chemical descriptors adopted in toxicity prediction application. The proposed method is efficient and can relief the imbalanced data problem of toxicity. Experiment results on the tox21 toxicity prediction dataset show that \"molecular property\", \"connectivity\" and \"topological\" descriptor are the three most important descriptors for toxicity prediction tasks among the 12 popular descriptors adopted in toxicity prediction applications. The results of this study can be used as a guide to propose new descriptors for chemical toxicity prediction.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"133 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124273288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Distribution System for Japanese Synthetic Population Data with Protection Level 日本具有保护等级的综合人口数据分布系统
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469550
T. Murata, S. Date, Yusuke Goto, T. Hanawa, Takuya Harada, M. Ichikawa, Lee Hao, M. Munetomo, Akiyoshi Sugiki
In this paper, we introduce a distribution system of synthesized data of Japanese population using Interdisciplinary Large-scale Information Infra-structures in Japan. Synthetic population is synthesized based on the statistics of the census that are conducted by the government and publicly released. Therefore, the synthesized data have no privacy data. However, it is easy to estimate the compositions of households, working status in a certain area from the synthetic population. Therefore, we currently distribute the synthesized data only for public or academic purposes. For academic purposes, it is important to encourage scholars or researchers to use a large-scale data of households, we define protection levels for the attributes in the synthetic populations. According to the protection levels, we distribute the data with proper attributes to those who try to use them. We encourage researchers to use the synthetic populations to be familiar to large-scale data processing.
本文介绍了一种基于跨学科大规模信息基础设施的日本人口综合数据分布系统。综合人口是根据政府进行并公开发布的人口普查统计数据综合而成的。因此,合成数据没有隐私数据。然而,从综合人口中很容易估计出某一地区的家庭组成、工作状况。因此,我们目前仅为公共或学术目的分发合成数据。出于学术目的,鼓励学者或研究人员使用大规模的家庭数据是很重要的,我们定义了合成群体中属性的保护水平。根据保护级别,我们将具有适当属性的数据分发给试图使用它们的人。我们鼓励研究人员使用合成种群来熟悉大规模数据处理。
{"title":"Distribution System for Japanese Synthetic Population Data with Protection Level","authors":"T. Murata, S. Date, Yusuke Goto, T. Hanawa, Takuya Harada, M. Ichikawa, Lee Hao, M. Munetomo, Akiyoshi Sugiki","doi":"10.1109/ICMLC51923.2020.9469550","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469550","url":null,"abstract":"In this paper, we introduce a distribution system of synthesized data of Japanese population using Interdisciplinary Large-scale Information Infra-structures in Japan. Synthetic population is synthesized based on the statistics of the census that are conducted by the government and publicly released. Therefore, the synthesized data have no privacy data. However, it is easy to estimate the compositions of households, working status in a certain area from the synthetic population. Therefore, we currently distribute the synthesized data only for public or academic purposes. For academic purposes, it is important to encourage scholars or researchers to use a large-scale data of households, we define protection levels for the attributes in the synthetic populations. According to the protection levels, we distribute the data with proper attributes to those who try to use them. We encourage researchers to use the synthetic populations to be familiar to large-scale data processing.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121255562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transfer Learning with Shapeshift Adapter: A Parameter-Efficient Module for Deep Learning Model 基于Shapeshift适配器的迁移学习:深度学习模型的参数高效模块
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469046
Jingyuan Liu, M. Rajati
Fine-tuning pre-trained models is arguably one of the most significant approaches in transfer learning. Recent studies focus on methods whose performance is superior to standard fine-tuning methods, such as Adaptive Filter Fine-tuning and Fine-tuning last-k. The SpotTune model outperforms most common fine-tuning methods due to a novel adaptive fine-tuning approach. Since there is a trade-off between the number of parameters and performance, the SpotTune model is not parameter efficient. In this paper, we propose a shapeshift adapter module that can help reduce training parameters in deep learning models while pre-serving the high-performance merit of SpotTune. The shapeshift adapter yields a flexible structure, which allows us to find a balance between the number of parameters and performance. We integrate our proposed module with the residual blocks in ResNet and conduct several experiments on the SpotTune model. On the Visual Decathlon Challenge, our proposed method gets a score close to SpotTune and it outperforms the SpotTune model over half of the datasets. Particularly, our proposed method notably uses only about 20% of the parameters that are needed when training using a standard fine-tuning approach.
微调预训练模型可以说是迁移学习中最重要的方法之一。最近的研究集中在性能优于标准微调方法的方法上,如自适应滤波微调和微调最后k。由于采用了一种新颖的自适应微调方法,SpotTune模型优于大多数常见的微调方法。由于参数数量和性能之间存在权衡,因此SpotTune模型不是参数有效的。在本文中,我们提出了一个变形适配器模块,可以帮助减少深度学习模型中的训练参数,同时保留SpotTune的高性能优点。变形适配器产生灵活的结构,使我们能够在参数数量和性能之间找到平衡。我们将提出的模块与ResNet中的残差块集成,并在SpotTune模型上进行了多次实验。在视觉十项全能挑战赛上,我们提出的方法获得了接近SpotTune的分数,并且在一半的数据集上优于SpotTune模型。特别是,我们提出的方法只使用了使用标准微调方法训练时所需参数的20%左右。
{"title":"Transfer Learning with Shapeshift Adapter: A Parameter-Efficient Module for Deep Learning Model","authors":"Jingyuan Liu, M. Rajati","doi":"10.1109/ICMLC51923.2020.9469046","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469046","url":null,"abstract":"Fine-tuning pre-trained models is arguably one of the most significant approaches in transfer learning. Recent studies focus on methods whose performance is superior to standard fine-tuning methods, such as Adaptive Filter Fine-tuning and Fine-tuning last-k. The SpotTune model outperforms most common fine-tuning methods due to a novel adaptive fine-tuning approach. Since there is a trade-off between the number of parameters and performance, the SpotTune model is not parameter efficient. In this paper, we propose a shapeshift adapter module that can help reduce training parameters in deep learning models while pre-serving the high-performance merit of SpotTune. The shapeshift adapter yields a flexible structure, which allows us to find a balance between the number of parameters and performance. We integrate our proposed module with the residual blocks in ResNet and conduct several experiments on the SpotTune model. On the Visual Decathlon Challenge, our proposed method gets a score close to SpotTune and it outperforms the SpotTune model over half of the datasets. Particularly, our proposed method notably uses only about 20% of the parameters that are needed when training using a standard fine-tuning approach.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130735589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2020 International Conference on Machine Learning and Cybernetics (ICMLC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1