首页 > 最新文献

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)最新文献

英文 中文
IsoClustering: A Generalized Framework for Local Data Clustering 等聚类:局部数据聚类的一种通用框架
David Haley, Ehsan Kamalinejad, Jiaofei Zhong
In this paper, we propose a generalized framework for local clustering based on isoperimetric inequalities. We also demonstrate that contemporary approaches are included in its scope and that it can accommodate data sets of different types, including those with overlapping communities. We then present an efficient, greedy algorithm using the new framework and compare the output of the new algorithm with existing methods.
本文提出了一种基于等周不等式的广义局部聚类框架。我们还证明,当代方法包括在其范围内,它可以容纳不同类型的数据集,包括那些重叠的社区。然后,我们使用新框架提出了一个高效的贪婪算法,并将新算法的输出与现有方法进行了比较。
{"title":"IsoClustering: A Generalized Framework for Local Data Clustering","authors":"David Haley, Ehsan Kamalinejad, Jiaofei Zhong","doi":"10.1109/ICMLA.2019.00058","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00058","url":null,"abstract":"In this paper, we propose a generalized framework for local clustering based on isoperimetric inequalities. We also demonstrate that contemporary approaches are included in its scope and that it can accommodate data sets of different types, including those with overlapping communities. We then present an efficient, greedy algorithm using the new framework and compare the output of the new algorithm with existing methods.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115751500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning Models to Identify the Risk of Modern Slavery in Brazilian Cities 识别巴西城市现代奴隶制风险的机器学习模型
Marlu da Silva Santos, M. Ladeira, G. V. Erven, Gladston Luiz da Silva
The scope of modern slavery encompasses human trafficking, forced labor, debt bondage and child labor. This article proposes the use of predictive models to identify the risk of modern slavery in Brazilian cities using real socioeconomic, demographic and rescue operations data. The study uses the embedded technique with Lasso regularization (L1) to select variables. A comparative analyze of techniques for treatment of imbalanced data was applied and the results indicated the Random Over-Sampling (ROS) as the best one. In total, 16 models are evaluated, consisting of 8 different data sets and two classifiers: Logistic Regression (LR) and Gradient Boosting Machine (GBM). The results indicate that the GBM model has better performance and efficiency, with accuracy of 77%, AUC 80% and G-mean of 71%.
现代奴隶制的范围包括人口贩运、强迫劳动、债务奴役和童工。本文建议使用预测模型来识别巴西城市中使用真实的社会经济,人口和救援行动数据的现代奴隶制的风险。本研究采用Lasso正则化(L1)嵌入技术来选择变量。对比分析了几种处理不平衡数据的方法,结果表明随机过采样(ROS)是最好的处理方法。总共评估了16个模型,包括8个不同的数据集和两个分类器:逻辑回归(LR)和梯度增强机(GBM)。结果表明,GBM模型具有较好的性能和效率,准确率为77%,AUC为80%,g均值为71%。
{"title":"Machine Learning Models to Identify the Risk of Modern Slavery in Brazilian Cities","authors":"Marlu da Silva Santos, M. Ladeira, G. V. Erven, Gladston Luiz da Silva","doi":"10.1109/ICMLA.2019.00132","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00132","url":null,"abstract":"The scope of modern slavery encompasses human trafficking, forced labor, debt bondage and child labor. This article proposes the use of predictive models to identify the risk of modern slavery in Brazilian cities using real socioeconomic, demographic and rescue operations data. The study uses the embedded technique with Lasso regularization (L1) to select variables. A comparative analyze of techniques for treatment of imbalanced data was applied and the results indicated the Random Over-Sampling (ROS) as the best one. In total, 16 models are evaluated, consisting of 8 different data sets and two classifiers: Logistic Regression (LR) and Gradient Boosting Machine (GBM). The results indicate that the GBM model has better performance and efficiency, with accuracy of 77%, AUC 80% and G-mean of 71%.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120982635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
End-to-End Reinforcement Learning for Multi-agent Continuous Control 多智能体连续控制的端到端强化学习
Z. Jiao, J. Oh
In end-to-end reinforcement learning, an agent captures the entire mapping from its raw sensor data to actuation commands using a single neural network. End-to-end reinforcement learning is mostly studied in single-agent domains, and its scalability to multi-agent setting is under-explored. Without effective techniques, learning effective policies based on the joint observation of agents can be intractable, particularly when sensor data perceived by each agent is high-dimensional. Extending the multi-agent actor-critic method MADDPG, this paper presents Rec-MADDPG, an end-to-end reinforcement learning method for multi-agent continuous control in a cooperative environment. To ease end-to-end learning in a multi-agent setting, we proposed two embedding mechanisms, joint and independent embedding, to project agents' joint sensor observation to low-dimensional features. For training efficiency, we applied parameter sharing and the A3C-based asynchronous framework to Rec-MADDPG. Considering the challenges that can arise in real-world multi-agent control, we evaluated Rec-MADDPG in robotic navigation tasks based on realistic simulated robots and physics enable environments. Through extensive evaluation, we demonstrated that Rec-MADDPG can significantly outperform MADDPG and was able to learn individual end-to-end policies for continuous control based on raw sensor data. In addition, compared to joint embedding, independent embedding enabled Rec-MADDPG to learn even better optimal policies.
在端到端强化学习中,智能体使用单个神经网络捕获从原始传感器数据到驱动命令的整个映射。端到端强化学习的研究主要集中在单智能体领域,其在多智能体环境下的可扩展性尚未得到充分探索。如果没有有效的技术,基于智能体的联合观察来学习有效的策略可能会很棘手,特别是当每个智能体感知的传感器数据是高维的时候。在多智能体行为者评价方法madpg的基础上,提出了一种用于协作环境下多智能体连续控制的端到端强化学习方法rec - madpg。为了简化多智能体环境下的端到端学习,我们提出了联合嵌入和独立嵌入两种嵌入机制,将智能体的联合传感器观察投射到低维特征上。为了提高训练效率,我们将参数共享和基于a3c的异步框架应用到Rec-MADDPG中。考虑到现实世界中多智能体控制可能出现的挑战,我们基于真实的模拟机器人和物理环境评估了机器人导航任务中的Rec-MADDPG。通过广泛的评估,我们证明Rec-MADDPG可以显著优于MADDPG,并且能够学习基于原始传感器数据的连续控制的单个端到端策略。此外,与联合嵌入相比,独立嵌入使Rec-MADDPG能够学习到更好的最优策略。
{"title":"End-to-End Reinforcement Learning for Multi-agent Continuous Control","authors":"Z. Jiao, J. Oh","doi":"10.1109/ICMLA.2019.00100","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00100","url":null,"abstract":"In end-to-end reinforcement learning, an agent captures the entire mapping from its raw sensor data to actuation commands using a single neural network. End-to-end reinforcement learning is mostly studied in single-agent domains, and its scalability to multi-agent setting is under-explored. Without effective techniques, learning effective policies based on the joint observation of agents can be intractable, particularly when sensor data perceived by each agent is high-dimensional. Extending the multi-agent actor-critic method MADDPG, this paper presents Rec-MADDPG, an end-to-end reinforcement learning method for multi-agent continuous control in a cooperative environment. To ease end-to-end learning in a multi-agent setting, we proposed two embedding mechanisms, joint and independent embedding, to project agents' joint sensor observation to low-dimensional features. For training efficiency, we applied parameter sharing and the A3C-based asynchronous framework to Rec-MADDPG. Considering the challenges that can arise in real-world multi-agent control, we evaluated Rec-MADDPG in robotic navigation tasks based on realistic simulated robots and physics enable environments. Through extensive evaluation, we demonstrated that Rec-MADDPG can significantly outperform MADDPG and was able to learn individual end-to-end policies for continuous control based on raw sensor data. In addition, compared to joint embedding, independent embedding enabled Rec-MADDPG to learn even better optimal policies.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123885115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Hyperparameter Optimization of Topological Features for Machine Learning Applications 机器学习应用中拓扑特征的超参数优化
Francis C. Motta, J. Harer, Nicholas Leiby, F. Marinozzi, Scott Novotney, G. Rocklin, Jed Singer, D. Strickland, M. Vaughn, Christopher J. Tralie, R. Bedini, F. Bini, G. Bini, Hamed Eramian, Marcio Gameiro, S. Haase, Hugh K. Haddox
This paper describes a general pipeline for generating optimal vector representations of topological features of data for use with machine learning algorithms. This pipeline can be viewed as a costly black-box function defined over a complex configuration space, each point of which specifies both how features are generated and how predictive models are trained on those features. We propose using state-of-the-art Bayesian optimization algorithms to inform the choice of topological vectorization hyperparameters while simultaneously choosing learning model parameters. We demonstrate the need for and effectiveness of this pipeline using two difficult biological learning problems, and illustrate the nontrivial interactions between topological feature generation and learning model hyperparameters.
本文描述了一种用于生成用于机器学习算法的数据拓扑特征的最佳向量表示的通用管道。这个管道可以看作是在复杂配置空间上定义的一个昂贵的黑盒函数,它的每个点都指定了如何生成特征以及如何在这些特征上训练预测模型。我们建议使用最先进的贝叶斯优化算法来选择拓扑矢量化超参数,同时选择学习模型参数。我们通过两个困难的生物学习问题证明了这种管道的必要性和有效性,并说明了拓扑特征生成和学习模型超参数之间的重要相互作用。
{"title":"Hyperparameter Optimization of Topological Features for Machine Learning Applications","authors":"Francis C. Motta, J. Harer, Nicholas Leiby, F. Marinozzi, Scott Novotney, G. Rocklin, Jed Singer, D. Strickland, M. Vaughn, Christopher J. Tralie, R. Bedini, F. Bini, G. Bini, Hamed Eramian, Marcio Gameiro, S. Haase, Hugh K. Haddox","doi":"10.1109/ICMLA.2019.00185","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00185","url":null,"abstract":"This paper describes a general pipeline for generating optimal vector representations of topological features of data for use with machine learning algorithms. This pipeline can be viewed as a costly black-box function defined over a complex configuration space, each point of which specifies both how features are generated and how predictive models are trained on those features. We propose using state-of-the-art Bayesian optimization algorithms to inform the choice of topological vectorization hyperparameters while simultaneously choosing learning model parameters. We demonstrate the need for and effectiveness of this pipeline using two difficult biological learning problems, and illustrate the nontrivial interactions between topological feature generation and learning model hyperparameters.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125234175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Physics-Guided Neural Network with Model Discrepancy Based on Upper Troposphere Wind Prediction 基于模式差异的物理导向神经网络对流层高层风预报
Ken-ichi Fukui, Junya Tanaka, T. Tomita, M. Numao
In this paper, we focus on a method that integrates a physical model into a neural network. This study proposes a neural network that can predict two components, namely outputs based on a physical model and its model discrepancy. To achieve such a goal, we propose a novel neural network architecture and associated loss functions designed based on a target physical model. The physical model is used as a regularizer of spatial behavior where output from the neural network is used as an intermediate variable. Then, the model discrepancy is defined as its residual to the observation value. We also propose a network architecture which has Shared and Non-Shared networks, and the neural network can be trained by alternate optimization. We constructed the proposed method with wind prediction in the upper troposphere based on thermal wind equations as an example. The experimental results demonstrate that the proposed method can achieve higher predictive accuracy than normal convolutional neural network or using thermal wind equation, also the obtained model discrepancy expresses convergence and divergence of wind vectors.
在本文中,我们重点研究了一种将物理模型集成到神经网络中的方法。本研究提出了一种神经网络,它可以预测两个组成部分,即基于物理模型的输出和模型差异。为了实现这一目标,我们提出了一种新的神经网络架构和基于目标物理模型设计的相关损失函数。物理模型被用作空间行为的正则化器,其中神经网络的输出被用作中间变量。然后,将模型差异定义为其对观测值的残差。我们还提出了一种共享网络和非共享网络的网络结构,神经网络可以通过交替优化来训练。以基于热风方程的对流层上层风预报为例,构建了该方法。实验结果表明,该方法比普通卷积神经网络或利用热风方程的方法具有更高的预测精度,且模型差异表达了风向量的收敛和发散。
{"title":"Physics-Guided Neural Network with Model Discrepancy Based on Upper Troposphere Wind Prediction","authors":"Ken-ichi Fukui, Junya Tanaka, T. Tomita, M. Numao","doi":"10.1109/ICMLA.2019.00078","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00078","url":null,"abstract":"In this paper, we focus on a method that integrates a physical model into a neural network. This study proposes a neural network that can predict two components, namely outputs based on a physical model and its model discrepancy. To achieve such a goal, we propose a novel neural network architecture and associated loss functions designed based on a target physical model. The physical model is used as a regularizer of spatial behavior where output from the neural network is used as an intermediate variable. Then, the model discrepancy is defined as its residual to the observation value. We also propose a network architecture which has Shared and Non-Shared networks, and the neural network can be trained by alternate optimization. We constructed the proposed method with wind prediction in the upper troposphere based on thermal wind equations as an example. The experimental results demonstrate that the proposed method can achieve higher predictive accuracy than normal convolutional neural network or using thermal wind equation, also the obtained model discrepancy expresses convergence and divergence of wind vectors.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131538635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Using Bidirectional Long Short Term Memory with Attention Layer to Estimate Driver Behavior 基于双向长短期记忆和注意层的驾驶员行为估计
Shokoufeh Monjezi Kouchak, A. Gaffar
Driver distraction is one of the primary causes of fatal car accidents in U.S. Analyzing driver behavior using different types of data including driving data, driver status or a combination of them is an emerging machine learning solution to detect the distraction level and notify the driver. Deep learning methods such as recurrent neural networks outperform other machine learning methods in car safety applications. In this paper, we used time-sequenced driving data that we collected in eight driving contexts to measure the driver distraction level. Our RNN is also capable of detecting the type of behavior that caused distraction. We used the driver interaction with the car infotainment system as the distracting activity. Two types of LSTM networks were used including bidirectional LSTM network and attention network. We compare the performance of these two complex networks to that of the simple LSTM in estimating driver behavior. We show that our attention network outperforms the other two, while adding bidirectional LSTM networks enhanced the training process of simple LSTM network.
驾驶员分心是美国致命车祸的主要原因之一。利用驾驶数据、驾驶员状态或两者的组合等不同类型的数据分析驾驶员行为,是一种新兴的机器学习解决方案,可以检测驾驶员的分心程度并通知驾驶员。深度学习方法,如循环神经网络,在汽车安全应用中优于其他机器学习方法。在本文中,我们使用在八种驾驶环境中收集的时间序列驾驶数据来测量驾驶员的分心水平。我们的RNN也能够检测出引起注意力分散的行为类型。我们使用驾驶员与汽车信息娱乐系统的互动作为分散注意力的活动。使用了两种类型的LSTM网络:双向LSTM网络和注意网络。我们比较了这两种复杂网络与简单LSTM在估计驾驶员行为方面的性能。我们发现我们的注意力网络优于其他两种网络,而添加双向LSTM网络增强了简单LSTM网络的训练过程。
{"title":"Using Bidirectional Long Short Term Memory with Attention Layer to Estimate Driver Behavior","authors":"Shokoufeh Monjezi Kouchak, A. Gaffar","doi":"10.1109/ICMLA.2019.00059","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00059","url":null,"abstract":"Driver distraction is one of the primary causes of fatal car accidents in U.S. Analyzing driver behavior using different types of data including driving data, driver status or a combination of them is an emerging machine learning solution to detect the distraction level and notify the driver. Deep learning methods such as recurrent neural networks outperform other machine learning methods in car safety applications. In this paper, we used time-sequenced driving data that we collected in eight driving contexts to measure the driver distraction level. Our RNN is also capable of detecting the type of behavior that caused distraction. We used the driver interaction with the car infotainment system as the distracting activity. Two types of LSTM networks were used including bidirectional LSTM network and attention network. We compare the performance of these two complex networks to that of the simple LSTM in estimating driver behavior. We show that our attention network outperforms the other two, while adding bidirectional LSTM networks enhanced the training process of simple LSTM network.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127739659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
The Effect of Time on the Maintenance of a Predictive Model 时间对预测模型维持的影响
Joffrey L. Leevy, T. Khoshgoftaar, Richard A. Bauder, Naeem Seliya
Periodic updating of a machine learning model may become necessary because new data could have a distribution that has drifted significantly over time from the original data distribution, thus impacting the model's usefulness. The primary objective of this paper is to evaluate temporal influence on the maintenance of a predictive model. We investigate the impact of using training data from various year-groupings on a model designed to detect Medicare Part B billing fraud. Training datasets are obtained from year-groupings of 2015, 2014-2015, 2013-2015, and 2012-2015. The test dataset is represented by 2016 data. Our study utilizes five popular learners and five class ratios obtained by Random Undersampling. Using the Area Under the Receiver Operating Characteristic (ROC) Curve as the performance metric, our case study indicates that the Logistic Regression learner yields the highest overall value for the yeargrouping of 2013-2015, with a majority-to-minority ratio of 90:10. For the problem of maintaining predictive models for Medicare fraud, we conclude that a sampled dataset should be chosen over the full dataset and that the largest training dataset (i.e., 2012- 2015) does not always produce the best results. To the best of our knowledge, this is the first big data study that examines the influence of time on the maintenance of machine learning models.
机器学习模型的定期更新可能是必要的,因为新数据的分布可能会随着时间的推移而与原始数据分布显著偏离,从而影响模型的有用性。本文的主要目的是评估时间对预测模型维持的影响。我们研究了使用来自不同年份分组的训练数据对设计用于检测医疗保险B部分账单欺诈的模型的影响。训练数据集来自2015年、2014-2015年、2013-2015年和2012-2015年的年份组。测试数据集用2016年的数据表示。我们的研究使用了5个受欢迎的学习者和随机欠抽样得到的5个班级比例。使用接受者工作特征(ROC)曲线下的面积作为绩效指标,我们的案例研究表明,Logistic回归学习器在2013-2015年的年度分组中产生了最高的总体价值,多数与少数比例为90:10。对于维护医疗保险欺诈预测模型的问题,我们得出结论,应该选择抽样数据集而不是完整数据集,并且最大的训练数据集(即2012- 2015)并不总是产生最好的结果。据我们所知,这是第一个检验时间对机器学习模型维护影响的大数据研究。
{"title":"The Effect of Time on the Maintenance of a Predictive Model","authors":"Joffrey L. Leevy, T. Khoshgoftaar, Richard A. Bauder, Naeem Seliya","doi":"10.1109/ICMLA.2019.00304","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00304","url":null,"abstract":"Periodic updating of a machine learning model may become necessary because new data could have a distribution that has drifted significantly over time from the original data distribution, thus impacting the model's usefulness. The primary objective of this paper is to evaluate temporal influence on the maintenance of a predictive model. We investigate the impact of using training data from various year-groupings on a model designed to detect Medicare Part B billing fraud. Training datasets are obtained from year-groupings of 2015, 2014-2015, 2013-2015, and 2012-2015. The test dataset is represented by 2016 data. Our study utilizes five popular learners and five class ratios obtained by Random Undersampling. Using the Area Under the Receiver Operating Characteristic (ROC) Curve as the performance metric, our case study indicates that the Logistic Regression learner yields the highest overall value for the yeargrouping of 2013-2015, with a majority-to-minority ratio of 90:10. For the problem of maintaining predictive models for Medicare fraud, we conclude that a sampled dataset should be chosen over the full dataset and that the largest training dataset (i.e., 2012- 2015) does not always produce the best results. To the best of our knowledge, this is the first big data study that examines the influence of time on the maintenance of machine learning models.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127774979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Legislative Vote Prediction using Campaign Donations and Fuzzy Hierarchical Communities 基于竞选捐款和模糊等级社区的立法投票预测
Scott Wahl, John W. Sheppard, Elizabeth A. Shanahan
An important aspect of social networks is the discovery and partitioning of the complex graphs into dense sub-networks referred to as communities. The goal of such partitioning is to find groups who have similar attributes or behaviors. In the realm of politics, it is possible to group individuals with similar political behavior by analyzing campaign finance records. In this paper we use fuzzy hierarchical spectral clustering to find communities with campaign finance networks. Multiple experiments were performed using varying edge weighting, number and type of communities, as well as analyzing multiple different years of voting data. The results show that using the full hierarchy of community assignments for legislators is highly predictive of voting behavior in the US House of Representatives and Senate.
社交网络的一个重要方面是发现复杂图并将其划分为称为社区的密集子网络。这种划分的目标是找到具有相似属性或行为的组。在政治领域,通过分析竞选财务记录,可以将具有相似政治行为的个人分组。在本文中,我们使用模糊层次光谱聚类来寻找具有竞选资金网络的社区。使用不同的边缘权重、社区数量和类型进行了多次实验,并分析了多个不同年份的投票数据。结果表明,使用社区分配的完整层次结构对美国众议院和参议院的投票行为具有很高的预测性。
{"title":"Legislative Vote Prediction using Campaign Donations and Fuzzy Hierarchical Communities","authors":"Scott Wahl, John W. Sheppard, Elizabeth A. Shanahan","doi":"10.1109/ICMLA.2019.00129","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00129","url":null,"abstract":"An important aspect of social networks is the discovery and partitioning of the complex graphs into dense sub-networks referred to as communities. The goal of such partitioning is to find groups who have similar attributes or behaviors. In the realm of politics, it is possible to group individuals with similar political behavior by analyzing campaign finance records. In this paper we use fuzzy hierarchical spectral clustering to find communities with campaign finance networks. Multiple experiments were performed using varying edge weighting, number and type of communities, as well as analyzing multiple different years of voting data. The results show that using the full hierarchy of community assignments for legislators is highly predictive of voting behavior in the US House of Representatives and Senate.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128112129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Generation of Pedestrian Pose Structures using Generative Adversarial Networks 使用生成对抗网络生成行人姿态结构
James Spooner, Madeline Cheah, V. Palade, S. Kanarachos, A. Daneshkhah
The safety of vulnerable road users is of paramount importance as transport moves towards fully automated driving. The richness of real-world data required for testing autonomous vehicles is limited, and furthermore, the available data does not have a fair representation of different scenarios and rare events. This work presents a novel approach for the generation of human pose structures, specifically the type of pose structures that would appear to be in pedestrian scenarios. The results show that the generated pedestrian structures are indistinguishable from the ground truth pose structures when classified using a suitably trained classifier. The paper demonstrates that the Generative Adversarial Network architecture can be used to create realistic new training samples, and, in future, new pedestrian events.
随着交通运输向全自动驾驶方向发展,弱势道路使用者的安全至关重要。测试自动驾驶汽车所需的真实世界数据的丰富性是有限的,此外,可用的数据不能公平地代表不同的场景和罕见事件。这项工作提出了一种新的方法来生成人体姿势结构,特别是在行人场景中出现的姿势结构类型。结果表明,当使用适当训练的分类器进行分类时,生成的行人结构与地面真位姿结构无法区分。本文证明了生成对抗网络架构可用于创建逼真的新训练样本,并在未来创建新的行人事件。
{"title":"Generation of Pedestrian Pose Structures using Generative Adversarial Networks","authors":"James Spooner, Madeline Cheah, V. Palade, S. Kanarachos, A. Daneshkhah","doi":"10.1109/ICMLA.2019.00269","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00269","url":null,"abstract":"The safety of vulnerable road users is of paramount importance as transport moves towards fully automated driving. The richness of real-world data required for testing autonomous vehicles is limited, and furthermore, the available data does not have a fair representation of different scenarios and rare events. This work presents a novel approach for the generation of human pose structures, specifically the type of pose structures that would appear to be in pedestrian scenarios. The results show that the generated pedestrian structures are indistinguishable from the ground truth pose structures when classified using a suitably trained classifier. The paper demonstrates that the Generative Adversarial Network architecture can be used to create realistic new training samples, and, in future, new pedestrian events.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133338470","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Deep Structural Model for Analyzing Correlated Multivariate Time Series 一种分析相关多元时间序列的深层结构模型
Changwei Hu, Yifan Hu, Sungyong Seo
Multivariate time series are routinely encountered in real-world applications, and in many cases, these time series are strongly correlated. In this paper, we present a deep learning structural time series model which can (i) handle correlated multivariate time series input, and (ii) forecast the targeted temporal sequence by explicitly learning/extracting the trend, seasonality, and event components. The trend is learned via a 1D and 2D temporal CNN and LSTM hierarchical neural net. The CNN-LSTM architecture can (i) seamlessly leverage the dependency among multiple correlated time series in a natural way, (ii) extract the weighted differencing feature for better trend learning, and (iii) memorize the long-term sequential pattern. The seasonality component is approximated via a non-liner function of a set of Fourier terms, and the event components are learned by a simple linear function of regressor encoding the event dates. We compare our model with several state-of-the-art methods through a comprehensive set of experiments on a variety of time series data sets, such as forecasts of Amazon AWS Simple Storage Service (S3) and Elastic Compute Cloud (EC2) billings, and the closing prices for corporate stocks in the same category.
在实际应用程序中经常遇到多元时间序列,并且在许多情况下,这些时间序列是强相关的。在本文中,我们提出了一个深度学习结构时间序列模型,它可以(i)处理相关的多变量时间序列输入,(ii)通过明确地学习/提取趋势、季节性和事件成分来预测目标时间序列。通过一维和二维时间CNN和LSTM分层神经网络学习趋势。CNN-LSTM架构可以(i)以自然的方式无缝地利用多个相关时间序列之间的依赖关系,(ii)提取加权差分特征以更好地进行趋势学习,(iii)记忆长期序列模式。季节性成分通过一组傅立叶项的非线性函数来近似,事件成分通过一个简单的线性回归函数来学习,回归函数编码事件日期。我们通过对各种时间序列数据集的综合实验,将我们的模型与几种最先进的方法进行比较,例如对亚马逊AWS简单存储服务(S3)和弹性计算云(EC2)账单的预测,以及同一类别公司股票的收盘价。
{"title":"A Deep Structural Model for Analyzing Correlated Multivariate Time Series","authors":"Changwei Hu, Yifan Hu, Sungyong Seo","doi":"10.1109/ICMLA.2019.00020","DOIUrl":"https://doi.org/10.1109/ICMLA.2019.00020","url":null,"abstract":"Multivariate time series are routinely encountered in real-world applications, and in many cases, these time series are strongly correlated. In this paper, we present a deep learning structural time series model which can (i) handle correlated multivariate time series input, and (ii) forecast the targeted temporal sequence by explicitly learning/extracting the trend, seasonality, and event components. The trend is learned via a 1D and 2D temporal CNN and LSTM hierarchical neural net. The CNN-LSTM architecture can (i) seamlessly leverage the dependency among multiple correlated time series in a natural way, (ii) extract the weighted differencing feature for better trend learning, and (iii) memorize the long-term sequential pattern. The seasonality component is approximated via a non-liner function of a set of Fourier terms, and the event components are learned by a simple linear function of regressor encoding the event dates. We compare our model with several state-of-the-art methods through a comprehensive set of experiments on a variety of time series data sets, such as forecasts of Amazon AWS Simple Storage Service (S3) and Elastic Compute Cloud (EC2) billings, and the closing prices for corporate stocks in the same category.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133714751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1