首页 > 最新文献

2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
Score-based Image-to-Image Regression with Synchronized Diffusion 基于分数的图像到图像的同步扩散回归
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00056
Hao Xin, M. Zhu
Image-to-image regression is an important computer vision task. In this paper, we propose a novel image-to-image regression model following the recent trend in generative modeling that employs Stochastic Differential Equations (SDEs) and score matching. We first apply diffusion processes to regression data using designed SDEs, and then perform inference by gradually reversing the processes. In particular, our method uses synchronized diffusion, which simultaneously applies diffusion to both input and response images to stabilize diffusion and subsequent parameter learning. Furthermore, based on the Expectation-Maximization (EM) algorithm, we develop an effective algorithm for prediction. We implement a conditional U-Net architecture with pre-trained DenseNet encoder for our proposed model and refer to it as DenseSocre. Our new model is able to generate diverse outcomes for image colorization, and the proposed prediction algorithm is able to achieve close to state-of-art performance on high-resolution monocular depth estimation.
图像到图像的回归是一项重要的计算机视觉任务。在本文中,我们提出了一种新的图像到图像回归模型,该模型采用随机微分方程(SDEs)和分数匹配,顺应了生成模型的最新趋势。我们首先使用设计的SDEs将扩散过程应用于回归数据,然后通过逐渐逆转过程来进行推理。特别是,我们的方法使用同步扩散,它同时对输入和响应图像应用扩散,以稳定扩散和随后的参数学习。在期望最大化算法的基础上,提出了一种有效的预测算法。我们为我们提出的模型实现了一个条件U-Net架构和预训练的DenseNet编码器,并将其称为DenseSocre。我们的新模型能够生成不同的图像着色结果,并且所提出的预测算法能够在高分辨率单目深度估计上实现接近最先进的性能。
{"title":"Score-based Image-to-Image Regression with Synchronized Diffusion","authors":"Hao Xin, M. Zhu","doi":"10.1109/ICMLA55696.2022.00056","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00056","url":null,"abstract":"Image-to-image regression is an important computer vision task. In this paper, we propose a novel image-to-image regression model following the recent trend in generative modeling that employs Stochastic Differential Equations (SDEs) and score matching. We first apply diffusion processes to regression data using designed SDEs, and then perform inference by gradually reversing the processes. In particular, our method uses synchronized diffusion, which simultaneously applies diffusion to both input and response images to stabilize diffusion and subsequent parameter learning. Furthermore, based on the Expectation-Maximization (EM) algorithm, we develop an effective algorithm for prediction. We implement a conditional U-Net architecture with pre-trained DenseNet encoder for our proposed model and refer to it as DenseSocre. Our new model is able to generate diverse outcomes for image colorization, and the proposed prediction algorithm is able to achieve close to state-of-art performance on high-resolution monocular depth estimation.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"219 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122447426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intent based Multimodal Speech and Gesture Fusion for Human-Robot Communication in Assembly Situation 基于意图的多模态语音和手势融合在装配情境下的人机通信
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00127
Sheuli Paul, Michael Sintek, Veton Këpuska, M. Silaghi, Liam Robertson
Understanding the intent is an essential step for maintaining effective communications. This essential feature is used in communications for assembling, patrolling, and surveillance. A fused and interactive multimodal system for human-robot communication, used in assembly applications, is presented in this paper. Communication is multimodal. Having the options of multiple communication modes such as gestures, text, symbols, graphics, images, and speech increase the chance of effective communication. The intent is the main component that we are aiming to model, specifically in human machine dialogues. For this, we extract the intents from spoken dialogues and fuse the intent with any detected matching gesture that is used in interaction with the robot. The main components of the presented system are: (1) a speech recognizer system using Kaldi, (2) a deep-learning based Dual Intent and Entity Transformer (DIET) based classifier for intent and entity extraction, (3) a hand gesture recognition system, and (4) a dynamic fusion model for speech and gesture based communication. These are evaluated on contextual assembly situation using a simulated interactive robot.
理解意图是保持有效沟通的必要步骤。这一基本功能用于通信,用于组装,巡逻和监视。本文提出了一种用于装配应用的融合交互式多模态人机通信系统。交流是多模式的。拥有多种通信模式的选项,如手势、文本、符号、图形、图像和语音,增加了有效通信的机会。意图是我们要建模的主要组件,特别是在人机对话中。为此,我们从口语对话中提取意图,并将意图融合到与机器人交互中使用的任何检测到的匹配手势中。该系统的主要组成部分是:(1)使用Kaldi的语音识别系统;(2)基于深度学习的基于意图和实体提取的双重意图和实体转换(DIET)分类器;(3)手势识别系统;(4)基于语音和手势通信的动态融合模型。使用模拟的交互式机器人对上下文装配情况进行评估。
{"title":"Intent based Multimodal Speech and Gesture Fusion for Human-Robot Communication in Assembly Situation","authors":"Sheuli Paul, Michael Sintek, Veton Këpuska, M. Silaghi, Liam Robertson","doi":"10.1109/ICMLA55696.2022.00127","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00127","url":null,"abstract":"Understanding the intent is an essential step for maintaining effective communications. This essential feature is used in communications for assembling, patrolling, and surveillance. A fused and interactive multimodal system for human-robot communication, used in assembly applications, is presented in this paper. Communication is multimodal. Having the options of multiple communication modes such as gestures, text, symbols, graphics, images, and speech increase the chance of effective communication. The intent is the main component that we are aiming to model, specifically in human machine dialogues. For this, we extract the intents from spoken dialogues and fuse the intent with any detected matching gesture that is used in interaction with the robot. The main components of the presented system are: (1) a speech recognizer system using Kaldi, (2) a deep-learning based Dual Intent and Entity Transformer (DIET) based classifier for intent and entity extraction, (3) a hand gesture recognition system, and (4) a dynamic fusion model for speech and gesture based communication. These are evaluated on contextual assembly situation using a simulated interactive robot.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128508919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Continuous Human Activity Recognition using Radar Imagery and Dynamic Time Warping 基于雷达图像和动态时间翘曲的连续人体活动识别
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00076
Ruchita Mehta, V. Palade, S. Sharifzadeh, Bo Tan, Yordanka Karayaneva
Remote Human Activity Recognition (HAR) in a private residential area has a beneficial influence on the elderly population's life, since this group of people require regular monitoring of health conditions. This paper addresses the problem of continuous detection of daily human activities using mm-wave Doppler radar. Unlike most previous research, this work records the data in terms of continuous series of activities rather than individual activities. These series of activities are similar to real-life activity patterns. The Dynamic Time Warping (DTW) algorithm is used for the detection of human activities in the recorded time series of data and compared to other time-series classification methods. DTW requires less amount of labelled data. The input for DTW was provided using three strategies, and the obtained results were compared against each other. The first approach uses the pixel-level data of frames (named UnSup-PLevel). In the other two strategies, a Convolutional Variational Autoencoder (CVAE) is used to extract Un-Supervised Encoded features (UnSup-EnLevel) and Supervised Encoded features (Sup-EnLevel) from the series of Doppler frames. Results demonstrates the superiority of the Sup-EnLevel features over UnSup-EnLevel and UnSup-PLevel strategies. However, the performance of the UnSup-PLevel strategy worked surprisingly well without using annotations.
私人住宅区的远程人类活动识别(HAR)对老年人的生活有有益的影响,因为这群人需要定期监测健康状况。本文研究了利用毫米波多普勒雷达对人类日常活动进行连续探测的问题。与之前的大多数研究不同,这项工作记录了连续系列活动的数据,而不是单个活动。这一系列的活动与现实生活中的活动模式相似。动态时间翘曲(Dynamic Time Warping, DTW)算法用于在记录的时间序列数据中检测人类活动,并与其他时间序列分类方法进行比较。DTW需要较少的标记数据。使用三种策略提供了DTW的输入,并对所获得的结果进行了比较。第一种方法使用帧的像素级数据(称为unsup -level)。在另外两种策略中,使用卷积变分自编码器(CVAE)从多普勒帧序列中提取无监督编码特征(UnSup-EnLevel)和有监督编码特征(Sup-EnLevel)。结果表明,supp - enlevel特征优于unsupp - enlevel和unsupp - level策略。然而,unsup - level策略在不使用注释的情况下表现得出奇地好。
{"title":"Continuous Human Activity Recognition using Radar Imagery and Dynamic Time Warping","authors":"Ruchita Mehta, V. Palade, S. Sharifzadeh, Bo Tan, Yordanka Karayaneva","doi":"10.1109/ICMLA55696.2022.00076","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00076","url":null,"abstract":"Remote Human Activity Recognition (HAR) in a private residential area has a beneficial influence on the elderly population's life, since this group of people require regular monitoring of health conditions. This paper addresses the problem of continuous detection of daily human activities using mm-wave Doppler radar. Unlike most previous research, this work records the data in terms of continuous series of activities rather than individual activities. These series of activities are similar to real-life activity patterns. The Dynamic Time Warping (DTW) algorithm is used for the detection of human activities in the recorded time series of data and compared to other time-series classification methods. DTW requires less amount of labelled data. The input for DTW was provided using three strategies, and the obtained results were compared against each other. The first approach uses the pixel-level data of frames (named UnSup-PLevel). In the other two strategies, a Convolutional Variational Autoencoder (CVAE) is used to extract Un-Supervised Encoded features (UnSup-EnLevel) and Supervised Encoded features (Sup-EnLevel) from the series of Doppler frames. Results demonstrates the superiority of the Sup-EnLevel features over UnSup-EnLevel and UnSup-PLevel strategies. However, the performance of the UnSup-PLevel strategy worked surprisingly well without using annotations.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128187091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Balancing Similarity-Contrast in Unsupervised Representation Learning: Evaluation with Reinforcement Learning 无监督表示学习中相似性-对比的平衡:强化学习评价
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00273
Menore Tekeba Mengistu, Getachew Alemu, P. Chevaillier, P. D. Loor
In this paper, we provided an unsupervised contrastive representation learning method which uses contrastive views in which both spatial and temporal similarity-contrast is balanced. The balanced views are created by taking pixels from the anchor sample and any randomly selected negative sample and balancing the ratio of number of pixels taken from the anchor and the negative. Then these balanced views are paired with the anchor to create the positive contrastive views and all other samples paired with the anchor are taken as negative contrastive views. We made the evaluation using reinforcement learning tasks on Atari games and Deep Mind Control suites (DMControl). Our evaluations on 26 Atari games and six DMControl tasks show that the proposed method is superior in learning spatio-temporally evolving factors of the environment by capturing the relevant task controlling generative factors from the agents’ raw observations.
在本文中,我们提供了一种无监督的对比表示学习方法,该方法使用对比视图,其中空间和时间的相似性-对比度是平衡的。平衡视图是通过从锚点样本和任何随机选择的负样本中获取像素,并平衡从锚点和负样本中获取的像素数量的比例来创建的。然后将这些平衡的视图与锚配对以创建正对比视图,而与锚配对的所有其他样本都被视为负对比视图。我们使用Atari游戏和Deep Mind Control套件(DMControl)上的强化学习任务进行评估。我们对26个Atari游戏和6个DMControl任务的评估表明,该方法通过从智能体的原始观察中捕获相关的任务控制生成因素,在学习环境的时空演变因素方面具有优势。
{"title":"Balancing Similarity-Contrast in Unsupervised Representation Learning: Evaluation with Reinforcement Learning","authors":"Menore Tekeba Mengistu, Getachew Alemu, P. Chevaillier, P. D. Loor","doi":"10.1109/ICMLA55696.2022.00273","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00273","url":null,"abstract":"In this paper, we provided an unsupervised contrastive representation learning method which uses contrastive views in which both spatial and temporal similarity-contrast is balanced. The balanced views are created by taking pixels from the anchor sample and any randomly selected negative sample and balancing the ratio of number of pixels taken from the anchor and the negative. Then these balanced views are paired with the anchor to create the positive contrastive views and all other samples paired with the anchor are taken as negative contrastive views. We made the evaluation using reinforcement learning tasks on Atari games and Deep Mind Control suites (DMControl). Our evaluations on 26 Atari games and six DMControl tasks show that the proposed method is superior in learning spatio-temporally evolving factors of the environment by capturing the relevant task controlling generative factors from the agents’ raw observations.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130664195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Transparent Neural Networks and Wearable Inertial Sensors to Generate Physiologically-Relevant Insights for Gait 使用透明神经网络和可穿戴惯性传感器生成步态的生理相关洞察
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00204
Lin Zhou, Eric Fischer, C. M. Brahms, U. Granacher, B. Arnrich
Neural networks have been successfully applied to a wide range of human motion analysis topics in combination with wearable sensor data. However, their computation process is not readily comprehensible. Alternatively, many of the model interpretation efforts do not provide physiologically-relevant insights, thus still limiting their use in clinical settings. In this work, we take gait modifications under fatigue and cognitive task performance as a use case to present how in-depth investigations of neural networks can be performed using wearable sensor data. We collected walking data from 16 young healthy individuals in unfatigued and fatigued states and under single- (walking only) and dual-task (walking while concurrently performing a cognitive task) conditions using inertial measurement units. Convolutional neural networks were able to identify both fatigue and dual-task gait patterns with high classification accuracy. To interpret the model, the importance of each time step in the input time series was visualized using Layer-wise Relevance Propagation. The visualization revealed highly individualized gait changes among participants, as well as changes at precise time steps of the input signal that allow further investigations to infer potential underlying mechanisms. Our methods enable in-depth analysis of human movement using transparent neural networks with data collected from unobtrusive, mobile wearable sensors.
结合可穿戴传感器数据,神经网络已经成功地应用于广泛的人体运动分析主题。然而,它们的计算过程并不容易理解。另外,许多模型解释工作不能提供生理学相关的见解,因此仍然限制了它们在临床环境中的应用。在这项工作中,我们以疲劳和认知任务表现下的步态改变为用例,展示了如何使用可穿戴传感器数据进行神经网络的深入研究。我们使用惯性测量单元收集了16名年轻健康个体在非疲劳和疲劳状态下、单任务(仅行走)和双任务(行走同时执行认知任务)条件下的行走数据。卷积神经网络能够以较高的分类准确率识别疲劳和双任务步态模式。为了解释该模型,使用分层相关传播将输入时间序列中每个时间步骤的重要性可视化。可视化显示了参与者之间高度个性化的步态变化,以及输入信号精确时间步长的变化,从而允许进一步研究推断潜在的潜在机制。我们的方法利用透明的神经网络和从不显眼的移动可穿戴传感器收集的数据,对人体运动进行深入分析。
{"title":"Using Transparent Neural Networks and Wearable Inertial Sensors to Generate Physiologically-Relevant Insights for Gait","authors":"Lin Zhou, Eric Fischer, C. M. Brahms, U. Granacher, B. Arnrich","doi":"10.1109/ICMLA55696.2022.00204","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00204","url":null,"abstract":"Neural networks have been successfully applied to a wide range of human motion analysis topics in combination with wearable sensor data. However, their computation process is not readily comprehensible. Alternatively, many of the model interpretation efforts do not provide physiologically-relevant insights, thus still limiting their use in clinical settings. In this work, we take gait modifications under fatigue and cognitive task performance as a use case to present how in-depth investigations of neural networks can be performed using wearable sensor data. We collected walking data from 16 young healthy individuals in unfatigued and fatigued states and under single- (walking only) and dual-task (walking while concurrently performing a cognitive task) conditions using inertial measurement units. Convolutional neural networks were able to identify both fatigue and dual-task gait patterns with high classification accuracy. To interpret the model, the importance of each time step in the input time series was visualized using Layer-wise Relevance Propagation. The visualization revealed highly individualized gait changes among participants, as well as changes at precise time steps of the input signal that allow further investigations to infer potential underlying mechanisms. Our methods enable in-depth analysis of human movement using transparent neural networks with data collected from unobtrusive, mobile wearable sensors.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"18 2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130997003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Variational Inference via Rényi Upper-Lower Bound Optimization 基于r<s:1>上下限优化的变分推理
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00136
Dana Oshri Zalman, S. Fine
Variational inference provides a way to approximate probability densities. It does so by optimizing an upper or a lower bound on the likelihood of the observed data (the evidence). The classic variational inference approach suggests to maximize the Evidence Lower BOund (ELBO). Recent proposals suggest to optimize the variational Rényi bound (VR) and χ upper bound. However, these estimates are either biased or difficult to approximate, due to a high variance.In this paper we introduce a new upper bound (termed VRLU) which is based on the existing variational Rényi bound. In contrast to the existing VR bound, the Monte Carlo (MC) approximation of the VRLU bound is unbiased. Furthermore, we devise a (sandwiched) upper-lower bound variational inference method (termed VRS) to jointly optimize the upper and lower bounds. We present a set of experiments, designed to evaluate the new VRLU bound, and to compare the VRS method with the classic VAE and the VR methods over a set of digit recognition tasks. The experiments and results demonstrate the VRLU bound advantage, and the wide applicability of the VRS method.
变分推理提供了一种近似概率密度的方法。它通过优化观测数据(证据)可能性的上限或下限来实现这一点。经典的变分推理方法建议最大化证据下限(ELBO)。最近的建议是优化变分r边界(VR)和χ上界。然而,由于方差很大,这些估计要么是有偏差的,要么是难以近似的。本文在已有变分rsamunyi界的基础上,引入了一个新的上界(VRLU)。与现有的虚拟现实界相比,VRLU界的蒙特卡罗(MC)近似是无偏的。此外,我们设计了一种(夹在中间的)上下界变分推理方法(VRS)来联合优化上界和下界。我们提出了一组实验,旨在评估新的VRLU边界,并在一组数字识别任务中将VRS方法与经典VAE和VR方法进行比较。实验和结果证明了VRLU绑定的优势,以及VRS方法的广泛适用性。
{"title":"Variational Inference via Rényi Upper-Lower Bound Optimization","authors":"Dana Oshri Zalman, S. Fine","doi":"10.1109/ICMLA55696.2022.00136","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00136","url":null,"abstract":"Variational inference provides a way to approximate probability densities. It does so by optimizing an upper or a lower bound on the likelihood of the observed data (the evidence). The classic variational inference approach suggests to maximize the Evidence Lower BOund (ELBO). Recent proposals suggest to optimize the variational Rényi bound (VR) and χ upper bound. However, these estimates are either biased or difficult to approximate, due to a high variance.In this paper we introduce a new upper bound (termed VRLU) which is based on the existing variational Rényi bound. In contrast to the existing VR bound, the Monte Carlo (MC) approximation of the VRLU bound is unbiased. Furthermore, we devise a (sandwiched) upper-lower bound variational inference method (termed VRS) to jointly optimize the upper and lower bounds. We present a set of experiments, designed to evaluate the new VRLU bound, and to compare the VRS method with the classic VAE and the VR methods over a set of digit recognition tasks. The experiments and results demonstrate the VRLU bound advantage, and the wide applicability of the VRS method.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131105412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adversarial Attacks on Deep Temporal Point Process 深度时间点过程的对抗性攻击
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.10102767
Samira Khorshidi, Bao Wang, G. Mohler
Temporal point processes have many applications, from crime forecasting to modeling earthquake aftershocks sequences. Due to the flexibility and expressiveness of deep learning, neural network-based approaches have recently shown promise for modeling point process intensities. However, there is a lack of research on the robustness of such models in regards to adversarial attacks and natural shocks to systems. Precisely, while neural point processes may outperform simpler parametric models on in-sample tests, how these models perform when encountering adversarial examples or sharp non-stationary trends remains unknown. Current work proposes several white-box and blackbox adversarial attacks against temporal point processes modeled by deep neural networks. Extensive experiments confirm that predictive performance and parametric modeling of neural point processes are vulnerable to adversarial attacks. Additionally, we evaluate the vulnerability and performance of these models in the presence of non-stationary abrupt changes, using the crimes dataset, during the Covid-19 pandemic, as an example.
时间点过程有许多应用,从犯罪预测到地震余震序列建模。由于深度学习的灵活性和表达性,基于神经网络的方法最近显示出对点过程强度建模的希望。然而,对于这些模型在对抗性攻击和系统自然冲击方面的鲁棒性,缺乏研究。确切地说,虽然神经点过程在样本内测试中可能优于更简单的参数模型,但这些模型在遇到对抗性示例或急剧非平稳趋势时的表现如何仍然未知。目前的工作提出了几种针对深度神经网络建模的时间点过程的白盒和黑盒对抗性攻击。大量的实验证实,神经点过程的预测性能和参数化建模容易受到对抗性攻击。此外,我们以Covid-19大流行期间的犯罪数据集为例,评估了这些模型在非平稳突变存在下的脆弱性和性能。
{"title":"Adversarial Attacks on Deep Temporal Point Process","authors":"Samira Khorshidi, Bao Wang, G. Mohler","doi":"10.1109/ICMLA55696.2022.10102767","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.10102767","url":null,"abstract":"Temporal point processes have many applications, from crime forecasting to modeling earthquake aftershocks sequences. Due to the flexibility and expressiveness of deep learning, neural network-based approaches have recently shown promise for modeling point process intensities. However, there is a lack of research on the robustness of such models in regards to adversarial attacks and natural shocks to systems. Precisely, while neural point processes may outperform simpler parametric models on in-sample tests, how these models perform when encountering adversarial examples or sharp non-stationary trends remains unknown. Current work proposes several white-box and blackbox adversarial attacks against temporal point processes modeled by deep neural networks. Extensive experiments confirm that predictive performance and parametric modeling of neural point processes are vulnerable to adversarial attacks. Additionally, we evaluate the vulnerability and performance of these models in the presence of non-stationary abrupt changes, using the crimes dataset, during the Covid-19 pandemic, as an example.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129246937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Hawkes Process Multi-armed Bandits for Search and Rescue 霍克斯过程多武装土匪搜索和救援
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00046
Wen-Hao Chiang, G. Mohler
We propose a novel framework for integrating Hawkes processes with multi-armed bandit algorithms to solve spatio-temporal event forecasting and detection problems when data may be undersampled or spatially biased. In particular, we introduce an upper confidence bound algorithm using Bayesian spatial Hawkes process estimation for balancing the trade-off between exploiting geographic regions where data has been collected and exploring geographic regions where data is unobserved. We first validate our model using simulated data. We then apply it to the problem of disaster search and rescue using calls for service data from hurricane Harvey in 2017 and the problem of detection and clearance of improvised explosive devices (IEDs) using IED attack records in Iraq. Our model outperforms state-of-the-art baseline spatial MAB algorithms in terms of cumulative reward and several other ranking evaluation metrics.
我们提出了一个新的框架,将Hawkes过程与多臂强盗算法相结合,以解决数据可能采样不足或空间偏差时的时空事件预测和检测问题。特别地,我们引入了一种使用贝叶斯空间霍克斯过程估计的上置信度界算法,以平衡利用已收集数据的地理区域和探索未观测到数据的地理区域之间的权衡。我们首先使用模拟数据验证我们的模型。然后,我们使用2017年哈维飓风的服务数据呼叫将其应用于灾难搜索和救援问题,以及使用伊拉克简易爆炸装置袭击记录检测和清除简易爆炸装置(IED)的问题。我们的模型在累积奖励和其他几个排名评估指标方面优于最先进的基线空间MAB算法。
{"title":"Hawkes Process Multi-armed Bandits for Search and Rescue","authors":"Wen-Hao Chiang, G. Mohler","doi":"10.1109/ICMLA55696.2022.00046","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00046","url":null,"abstract":"We propose a novel framework for integrating Hawkes processes with multi-armed bandit algorithms to solve spatio-temporal event forecasting and detection problems when data may be undersampled or spatially biased. In particular, we introduce an upper confidence bound algorithm using Bayesian spatial Hawkes process estimation for balancing the trade-off between exploiting geographic regions where data has been collected and exploring geographic regions where data is unobserved. We first validate our model using simulated data. We then apply it to the problem of disaster search and rescue using calls for service data from hurricane Harvey in 2017 and the problem of detection and clearance of improvised explosive devices (IEDs) using IED attack records in Iraq. Our model outperforms state-of-the-art baseline spatial MAB algorithms in terms of cumulative reward and several other ranking evaluation metrics.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128672750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertainty Prediction for Facial Action Units Recognition under Degraded Conditions 退化条件下面部动作单元识别的不确定性预测
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00069
Junya Saito, Sachihiro Youoku, Ryosuke Kawamura, A. Uchida, Kentaro Murase, Xiaoyue Mi
Facial action units (AUs) represent muscular activities, and their recognition from facial images can capture various psychological states, such as people’s interests as consumers and mental health states. However, degradation of conditions, such as occlusions by hand, often occurs and affects the accuracy of AUs recognition in the real world. Most existing studies on degraded conditions have adopted the approach using additional training images and advanced structures of neural networks to improve the robustness of AUs recognition from a degraded facial image. However, such an approach cannot deal with cases in which evidence of the AUs is completely or almost invisible. Therefore, we propose a novel method to address the degraded conditions by predicting the uncertainties of the AUs recognition caused by them. Our method interpolates the high-uncertainty data using surrounding data to reduce the influence of the degraded conditions, and visualizes the conditions causing the uncertainties to handle cases where the conditions are very poor and need to be improved. In the evaluation experiments, the public datasets BP4D+ and DISFA were modified to degrade them for testing. By evaluating the modified test data, we demonstrated that the maximum improvement with our method was 12% for BP4D+ and 17% for DISFA, and that our method can prevent the decrease in accuracy owing to degraded conditions. We also presented some visualization examples which demonstrate that our method can reasonably predict the conditions and uncertainties.
面部动作单位(AUs)代表肌肉活动,从面部图像中识别它们可以捕捉到各种心理状态,如人们作为消费者的兴趣和心理健康状态。然而,在现实世界中,经常发生诸如手遮挡等条件的退化,并影响AUs识别的准确性。大多数关于退化条件的现有研究都采用了使用附加训练图像和高级神经网络结构的方法来提高退化面部图像的AUs识别的鲁棒性。然而,这种方法无法处理证据完全或几乎看不见的案件。因此,我们提出了一种新的方法,通过预测由它们引起的AUs识别的不确定性来解决退化条件。我们的方法利用周围数据对高不确定性数据进行插值,以减少退化条件的影响,并将导致不确定性的条件可视化,以处理条件非常差且需要改进的情况。在评价实验中,对公共数据集BP4D+和DISFA进行修改,使其降级以供测试。通过对改进后的测试数据进行评估,我们证明了我们的方法对BP4D+的最大改进是12%,对DISFA的最大改进是17%,并且我们的方法可以防止由于条件退化而导致的精度下降。并给出了一些可视化实例,表明该方法可以合理地预测条件和不确定性。
{"title":"Uncertainty Prediction for Facial Action Units Recognition under Degraded Conditions","authors":"Junya Saito, Sachihiro Youoku, Ryosuke Kawamura, A. Uchida, Kentaro Murase, Xiaoyue Mi","doi":"10.1109/ICMLA55696.2022.00069","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00069","url":null,"abstract":"Facial action units (AUs) represent muscular activities, and their recognition from facial images can capture various psychological states, such as people’s interests as consumers and mental health states. However, degradation of conditions, such as occlusions by hand, often occurs and affects the accuracy of AUs recognition in the real world. Most existing studies on degraded conditions have adopted the approach using additional training images and advanced structures of neural networks to improve the robustness of AUs recognition from a degraded facial image. However, such an approach cannot deal with cases in which evidence of the AUs is completely or almost invisible. Therefore, we propose a novel method to address the degraded conditions by predicting the uncertainties of the AUs recognition caused by them. Our method interpolates the high-uncertainty data using surrounding data to reduce the influence of the degraded conditions, and visualizes the conditions causing the uncertainties to handle cases where the conditions are very poor and need to be improved. In the evaluation experiments, the public datasets BP4D+ and DISFA were modified to degrade them for testing. By evaluating the modified test data, we demonstrated that the maximum improvement with our method was 12% for BP4D+ and 17% for DISFA, and that our method can prevent the decrease in accuracy owing to degraded conditions. We also presented some visualization examples which demonstrate that our method can reasonably predict the conditions and uncertainties.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"2016 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127349866","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Reduction Method Comparison Towards Explainability and Efficiency in Cybersecurity Intrusion Detection Systems 面向网络安全入侵检测系统可解释性和效率的特征约简方法比较
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00211
Adam Lehavi, S. Kim
In the realm of cybersecurity, intrusion detection systems (IDS) detect and prevent attacks based on collected computer and network data. In recent research, IDS models have been constructed using machine learning (ML) and deep learning (DL) methods such as Random Forest (RF) and deep neural networks (DNN). Feature selection (FS) can be used to construct faster, more interpretable, and more accurate models. We look at three different FS techniques; RF information gain (RF-IG), correlation feature selection using the Bat Algorithm (CFS-BA), and CFS using the Aquila Optimizer (CFS-AO). Our results show CFS-BA to be the most efficient of the FS methods, building in 55% of the time of the best RF-IG model while achieving 99.99% of its accuracy. This reinforces prior contributions attesting to CFS-BA’s accuracy while building upon the relationship between subset size, CFS score, and RF-IG score in final results.
在网络安全领域,入侵检测系统(IDS)基于收集的计算机和网络数据来检测和预防攻击。在最近的研究中,IDS模型已经使用机器学习(ML)和深度学习(DL)方法,如随机森林(RF)和深度神经网络(DNN)来构建。特征选择(FS)可用于构建更快、更可解释和更准确的模型。我们来看三种不同的FS技术;RF信息增益(RF- ig),使用Bat算法(CFS- ba)进行相关特征选择,使用Aquila优化器(CFS- ao)进行CFS。我们的研究结果表明,CFS-BA是最有效的FS方法,构建最佳RF-IG模型的时间为55%,准确率为99.99%。这加强了先前证明CFS- ba准确性的贡献,同时建立在最终结果中子集大小、CFS评分和RF-IG评分之间的关系上。
{"title":"Feature Reduction Method Comparison Towards Explainability and Efficiency in Cybersecurity Intrusion Detection Systems","authors":"Adam Lehavi, S. Kim","doi":"10.1109/ICMLA55696.2022.00211","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00211","url":null,"abstract":"In the realm of cybersecurity, intrusion detection systems (IDS) detect and prevent attacks based on collected computer and network data. In recent research, IDS models have been constructed using machine learning (ML) and deep learning (DL) methods such as Random Forest (RF) and deep neural networks (DNN). Feature selection (FS) can be used to construct faster, more interpretable, and more accurate models. We look at three different FS techniques; RF information gain (RF-IG), correlation feature selection using the Bat Algorithm (CFS-BA), and CFS using the Aquila Optimizer (CFS-AO). Our results show CFS-BA to be the most efficient of the FS methods, building in 55% of the time of the best RF-IG model while achieving 99.99% of its accuracy. This reinforces prior contributions attesting to CFS-BA’s accuracy while building upon the relationship between subset size, CFS score, and RF-IG score in final results.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126637594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1