首页 > 最新文献

2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
Contactless Low Power Air-Writing Based on FMCW Radar Networks Using Spiking Neural Networks 基于脉冲神经网络的FMCW雷达网络非接触式低功耗空写
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00155
Muhammad Arsalan, Tao Zheng, Avik Santra, V. Issakov
Contactless detection of hand gestures with radar has gained a lot of attention as an intuitive form of human-computer interface. In this paper, we propose an air-writing system, writing of linguistic characters or words in free space by hand gesture movements using a network of milli-meter wave radars. Most of the works reported in the literature are based on deep learning approaches, which in some cases can involve prohibitively large computational/energy costs making them undesirable for edge IoT devices, where energy efficiency is the prime concern. We propose a highly energy-efficient air-writing system using spiking neural networks, where the trajectory of the character created by fine range estimates together with trilateration from a network of radars are recognized and classified by a spiking neural network (SNN). The proposed system achieves a similar level of classification accuracy (98.6%) compared to the state-of-the-art deep learning methods for 15 characters containing 10 alphabets (A to J) and 5 numerals (1 to 5). Additionally, the proposed SNN model is of 3.7 MB in size making it memory efficient in terms of storage. We demonstrated the proposed method in real-time using a network of 60-GHz frequency-modulated continuous wave radar chipset.
利用雷达进行非接触式手势检测作为一种直观的人机界面形式受到了广泛关注。在本文中,我们提出了一种空气书写系统,使用毫米波雷达网络通过手势运动在自由空间中书写语言字符或单词。文献中报道的大多数工作都是基于深度学习方法的,在某些情况下,深度学习方法可能涉及过高的计算/能源成本,这使得它们不适合边缘物联网设备,其中能源效率是主要关注的问题。我们提出了一种使用尖峰神经网络的高能效空气书写系统,其中由雷达网络的精细距离估计和三边测量创建的字符轨迹由尖峰神经网络(SNN)识别和分类。与最先进的深度学习方法相比,所提出的系统在包含10个字母(a到J)和5个数字(1到5)的15个字符上实现了相似的分类准确率(98.6%)。此外,所提出的SNN模型的大小为3.7 MB,使其在存储方面具有内存效率。我们使用60 ghz调频连续波雷达芯片组网络实时演示了所提出的方法。
{"title":"Contactless Low Power Air-Writing Based on FMCW Radar Networks Using Spiking Neural Networks","authors":"Muhammad Arsalan, Tao Zheng, Avik Santra, V. Issakov","doi":"10.1109/ICMLA55696.2022.00155","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00155","url":null,"abstract":"Contactless detection of hand gestures with radar has gained a lot of attention as an intuitive form of human-computer interface. In this paper, we propose an air-writing system, writing of linguistic characters or words in free space by hand gesture movements using a network of milli-meter wave radars. Most of the works reported in the literature are based on deep learning approaches, which in some cases can involve prohibitively large computational/energy costs making them undesirable for edge IoT devices, where energy efficiency is the prime concern. We propose a highly energy-efficient air-writing system using spiking neural networks, where the trajectory of the character created by fine range estimates together with trilateration from a network of radars are recognized and classified by a spiking neural network (SNN). The proposed system achieves a similar level of classification accuracy (98.6%) compared to the state-of-the-art deep learning methods for 15 characters containing 10 alphabets (A to J) and 5 numerals (1 to 5). Additionally, the proposed SNN model is of 3.7 MB in size making it memory efficient in terms of storage. We demonstrated the proposed method in real-time using a network of 60-GHz frequency-modulated continuous wave radar chipset.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121791344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cost-effective Models for Detecting Depression from Speech 从语音中检测抑郁的成本效益模型
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00259
Mashrura Tasnim, Jekaterina Novikova
Depression is the most common psychological disorder and is considered as a leading cause of disability and suicide worldwide. An automated system capable of detecting signs of depression in human speech can contribute to ensuring timely and effective mental health care for individuals suffering from the disorder. Developing such automated system requires accurate machine learning models, capable of capturing signs of depression. However, state-of-the-art models based on deep acoustic representations require abundant data, meticulous selection of features, and rigorous training; the procedure involves enormous computational resources. In this work, we explore the effectiveness of two different acoustic feature groups-conventional hand-curated and deep representation features, for predicting the severity of depression from speech. We explore the relevance of possible contributing factors to the models’ performance, including gender of the individual, severity of the disorder, content and length of speech. Our findings suggest that models trained on conventional acoustic features perform equally well or better than the ones trained on deep representation features at significantly lower computational cost, irrespective of other factors, e.g. content and length of speech, gender of the speaker and severity of the disorder. This makes such models a better fit for deployment where availability of computational resources is restricted, such as real time depression monitoring applications in smart devices.
抑郁症是最常见的心理障碍,被认为是世界范围内导致残疾和自杀的主要原因。一个能够在人类语言中检测抑郁迹象的自动化系统,有助于确保对患有这种疾病的个体进行及时有效的精神卫生保健。开发这样的自动化系统需要精确的机器学习模型,能够捕捉抑郁症的迹象。然而,基于深度声学表示的最先进的模型需要丰富的数据、细致的特征选择和严格的训练;这个过程涉及到大量的计算资源。在这项工作中,我们探索了两种不同的声学特征组——传统的手工绘制特征和深度表征特征——在预测语音抑郁严重程度方面的有效性。我们探讨了影响模型表现的可能因素的相关性,包括个体的性别、障碍的严重程度、演讲的内容和长度。我们的研究结果表明,无论其他因素,如演讲内容和长度、说话者的性别和障碍的严重程度,用传统声学特征训练的模型的表现与用深度表征特征训练的模型一样好,甚至更好,而且计算成本显著降低。这使得这种模型更适合于计算资源可用性受限的部署,例如智能设备中的实时抑郁监测应用程序。
{"title":"Cost-effective Models for Detecting Depression from Speech","authors":"Mashrura Tasnim, Jekaterina Novikova","doi":"10.1109/ICMLA55696.2022.00259","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00259","url":null,"abstract":"Depression is the most common psychological disorder and is considered as a leading cause of disability and suicide worldwide. An automated system capable of detecting signs of depression in human speech can contribute to ensuring timely and effective mental health care for individuals suffering from the disorder. Developing such automated system requires accurate machine learning models, capable of capturing signs of depression. However, state-of-the-art models based on deep acoustic representations require abundant data, meticulous selection of features, and rigorous training; the procedure involves enormous computational resources. In this work, we explore the effectiveness of two different acoustic feature groups-conventional hand-curated and deep representation features, for predicting the severity of depression from speech. We explore the relevance of possible contributing factors to the models’ performance, including gender of the individual, severity of the disorder, content and length of speech. Our findings suggest that models trained on conventional acoustic features perform equally well or better than the ones trained on deep representation features at significantly lower computational cost, irrespective of other factors, e.g. content and length of speech, gender of the speaker and severity of the disorder. This makes such models a better fit for deployment where availability of computational resources is restricted, such as real time depression monitoring applications in smart devices.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122888918","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the Explicit Modelling of Bias in Machine Learning Classifiers: A Deep Multi-label ConvNet Approach * 探索机器学习分类器中偏差的显式建模:一种深度多标签卷积神经网络方法*
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00277
Mashael Al-Luhaybi, S. Swift, S. Counsell, A. Tucker
This paper addresses the problem that many machine learning classifiers make decisions based on data that are biased and can therefore result in prejudiced decisions. For example, in education (which this paper focuses on) a student may be rejected from a course based on historical decisions in the data that only exist due to historical biases in society or due to the skewed sampling of the data. Other approaches to dealing with bias in data include resampling methods (to counter imbalanced samples) and dimensionality reduction (to focus only on relevant features to the classification task). In this paper, we explore issues of modelling bias explicitly so that we can identify the types of bias and whether they are accounting for inflated predictive accuracies. In particular, we compare graphical model approaches to building classifiers, that are transparent in how they make decisions, with two forms of Deep Multi-label Convolutional Neural Networks to investigate if models can be built that maximise accuracy and minimise bias. We carry out this comparison on student entry and performance data from a higher educational institution.
本文解决了许多机器学习分类器基于有偏见的数据做出决策的问题,因此可能导致有偏见的决策。例如,在教育(本文所关注的)中,一个学生可能会因为数据中的历史决策而被拒绝上一门课程,这些决策只存在于社会中的历史偏见或由于数据的抽样偏差。处理数据偏差的其他方法包括重新采样方法(以对抗不平衡的样本)和降维方法(仅关注与分类任务相关的特征)。在本文中,我们明确地探讨了建模偏差的问题,以便我们可以识别偏差的类型以及它们是否会导致预测精度过高。特别是,我们将图形模型方法与两种形式的深度多标签卷积神经网络进行比较,以构建分类器,这些分类器在决策过程中是透明的,以研究是否可以构建出最大化准确性和最小化偏差的模型。我们对一所高等教育机构的学生入学和表现数据进行了比较。
{"title":"Exploring the Explicit Modelling of Bias in Machine Learning Classifiers: A Deep Multi-label ConvNet Approach *","authors":"Mashael Al-Luhaybi, S. Swift, S. Counsell, A. Tucker","doi":"10.1109/ICMLA55696.2022.00277","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00277","url":null,"abstract":"This paper addresses the problem that many machine learning classifiers make decisions based on data that are biased and can therefore result in prejudiced decisions. For example, in education (which this paper focuses on) a student may be rejected from a course based on historical decisions in the data that only exist due to historical biases in society or due to the skewed sampling of the data. Other approaches to dealing with bias in data include resampling methods (to counter imbalanced samples) and dimensionality reduction (to focus only on relevant features to the classification task). In this paper, we explore issues of modelling bias explicitly so that we can identify the types of bias and whether they are accounting for inflated predictive accuracies. In particular, we compare graphical model approaches to building classifiers, that are transparent in how they make decisions, with two forms of Deep Multi-label Convolutional Neural Networks to investigate if models can be built that maximise accuracy and minimise bias. We carry out this comparison on student entry and performance data from a higher educational institution.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"82 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129808166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adversarial Attacks on Deep Temporal Point Process 深度时间点过程的对抗性攻击
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.10102767
Samira Khorshidi, Bao Wang, G. Mohler
Temporal point processes have many applications, from crime forecasting to modeling earthquake aftershocks sequences. Due to the flexibility and expressiveness of deep learning, neural network-based approaches have recently shown promise for modeling point process intensities. However, there is a lack of research on the robustness of such models in regards to adversarial attacks and natural shocks to systems. Precisely, while neural point processes may outperform simpler parametric models on in-sample tests, how these models perform when encountering adversarial examples or sharp non-stationary trends remains unknown. Current work proposes several white-box and blackbox adversarial attacks against temporal point processes modeled by deep neural networks. Extensive experiments confirm that predictive performance and parametric modeling of neural point processes are vulnerable to adversarial attacks. Additionally, we evaluate the vulnerability and performance of these models in the presence of non-stationary abrupt changes, using the crimes dataset, during the Covid-19 pandemic, as an example.
时间点过程有许多应用,从犯罪预测到地震余震序列建模。由于深度学习的灵活性和表达性,基于神经网络的方法最近显示出对点过程强度建模的希望。然而,对于这些模型在对抗性攻击和系统自然冲击方面的鲁棒性,缺乏研究。确切地说,虽然神经点过程在样本内测试中可能优于更简单的参数模型,但这些模型在遇到对抗性示例或急剧非平稳趋势时的表现如何仍然未知。目前的工作提出了几种针对深度神经网络建模的时间点过程的白盒和黑盒对抗性攻击。大量的实验证实,神经点过程的预测性能和参数化建模容易受到对抗性攻击。此外,我们以Covid-19大流行期间的犯罪数据集为例,评估了这些模型在非平稳突变存在下的脆弱性和性能。
{"title":"Adversarial Attacks on Deep Temporal Point Process","authors":"Samira Khorshidi, Bao Wang, G. Mohler","doi":"10.1109/ICMLA55696.2022.10102767","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.10102767","url":null,"abstract":"Temporal point processes have many applications, from crime forecasting to modeling earthquake aftershocks sequences. Due to the flexibility and expressiveness of deep learning, neural network-based approaches have recently shown promise for modeling point process intensities. However, there is a lack of research on the robustness of such models in regards to adversarial attacks and natural shocks to systems. Precisely, while neural point processes may outperform simpler parametric models on in-sample tests, how these models perform when encountering adversarial examples or sharp non-stationary trends remains unknown. Current work proposes several white-box and blackbox adversarial attacks against temporal point processes modeled by deep neural networks. Extensive experiments confirm that predictive performance and parametric modeling of neural point processes are vulnerable to adversarial attacks. Additionally, we evaluate the vulnerability and performance of these models in the presence of non-stationary abrupt changes, using the crimes dataset, during the Covid-19 pandemic, as an example.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129246937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Intent based Multimodal Speech and Gesture Fusion for Human-Robot Communication in Assembly Situation 基于意图的多模态语音和手势融合在装配情境下的人机通信
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00127
Sheuli Paul, Michael Sintek, Veton Këpuska, M. Silaghi, Liam Robertson
Understanding the intent is an essential step for maintaining effective communications. This essential feature is used in communications for assembling, patrolling, and surveillance. A fused and interactive multimodal system for human-robot communication, used in assembly applications, is presented in this paper. Communication is multimodal. Having the options of multiple communication modes such as gestures, text, symbols, graphics, images, and speech increase the chance of effective communication. The intent is the main component that we are aiming to model, specifically in human machine dialogues. For this, we extract the intents from spoken dialogues and fuse the intent with any detected matching gesture that is used in interaction with the robot. The main components of the presented system are: (1) a speech recognizer system using Kaldi, (2) a deep-learning based Dual Intent and Entity Transformer (DIET) based classifier for intent and entity extraction, (3) a hand gesture recognition system, and (4) a dynamic fusion model for speech and gesture based communication. These are evaluated on contextual assembly situation using a simulated interactive robot.
理解意图是保持有效沟通的必要步骤。这一基本功能用于通信,用于组装,巡逻和监视。本文提出了一种用于装配应用的融合交互式多模态人机通信系统。交流是多模式的。拥有多种通信模式的选项,如手势、文本、符号、图形、图像和语音,增加了有效通信的机会。意图是我们要建模的主要组件,特别是在人机对话中。为此,我们从口语对话中提取意图,并将意图融合到与机器人交互中使用的任何检测到的匹配手势中。该系统的主要组成部分是:(1)使用Kaldi的语音识别系统;(2)基于深度学习的基于意图和实体提取的双重意图和实体转换(DIET)分类器;(3)手势识别系统;(4)基于语音和手势通信的动态融合模型。使用模拟的交互式机器人对上下文装配情况进行评估。
{"title":"Intent based Multimodal Speech and Gesture Fusion for Human-Robot Communication in Assembly Situation","authors":"Sheuli Paul, Michael Sintek, Veton Këpuska, M. Silaghi, Liam Robertson","doi":"10.1109/ICMLA55696.2022.00127","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00127","url":null,"abstract":"Understanding the intent is an essential step for maintaining effective communications. This essential feature is used in communications for assembling, patrolling, and surveillance. A fused and interactive multimodal system for human-robot communication, used in assembly applications, is presented in this paper. Communication is multimodal. Having the options of multiple communication modes such as gestures, text, symbols, graphics, images, and speech increase the chance of effective communication. The intent is the main component that we are aiming to model, specifically in human machine dialogues. For this, we extract the intents from spoken dialogues and fuse the intent with any detected matching gesture that is used in interaction with the robot. The main components of the presented system are: (1) a speech recognizer system using Kaldi, (2) a deep-learning based Dual Intent and Entity Transformer (DIET) based classifier for intent and entity extraction, (3) a hand gesture recognition system, and (4) a dynamic fusion model for speech and gesture based communication. These are evaluated on contextual assembly situation using a simulated interactive robot.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128508919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Source Domain Selection for Cross-House Human Activity Recognition with Ambient Sensors 基于环境传感器的跨屋人体活动识别的源域选择
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00126
Hao Niu, H. Ung, Shinya Wada
Human activity recognition using ambient sensors has become particularly important due to social demands of applications in smart homes. To address the problem of labeling sensing data for every individual house, cross-house human activity recognition is proposed to use available labeled houses (source domains) to train recognition models for applying to unlabeled houses (target domains). In this paper, we propose a method of source domain selection for cross-house human activity recognition. We first improve the method for representing semantic relationships of sensors. To select the best similar source houses for a target house, we then propose a method for calculating similarity score between two houses. Using 19 houses of the CASAS dataset, we evaluate the recognition performance in target houses using models trained by several similar source houses, randomly selected houses, dissimilar source houses, and all source houses without selection. Experimental results illustrate that the average accuracy of models trained from the small number of the best similar houses achieve the best performance, and thus they confirm the effectiveness of our proposed method.
由于智能家居应用的社会需求,使用环境传感器进行人类活动识别变得尤为重要。为了解决每个单独房屋的标记传感数据问题,提出了跨房屋人类活动识别,利用可用的标记房屋(源域)来训练识别模型,以应用于未标记房屋(目标域)。本文提出了一种基于源域选择的跨屋人体活动识别方法。我们首先改进了传感器语义关系的表示方法。为了为目标房屋选择最佳的相似源房屋,我们提出了一种计算两间房屋之间相似性得分的方法。使用CASAS数据集的19个房屋,我们使用几个相似的源房屋、随机选择的房屋、不相似的源房屋和未选择的所有源房屋训练的模型来评估目标房屋的识别性能。实验结果表明,由少量最佳相似房屋训练的模型平均准确率达到最佳,从而验证了本文方法的有效性。
{"title":"Source Domain Selection for Cross-House Human Activity Recognition with Ambient Sensors","authors":"Hao Niu, H. Ung, Shinya Wada","doi":"10.1109/ICMLA55696.2022.00126","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00126","url":null,"abstract":"Human activity recognition using ambient sensors has become particularly important due to social demands of applications in smart homes. To address the problem of labeling sensing data for every individual house, cross-house human activity recognition is proposed to use available labeled houses (source domains) to train recognition models for applying to unlabeled houses (target domains). In this paper, we propose a method of source domain selection for cross-house human activity recognition. We first improve the method for representing semantic relationships of sensors. To select the best similar source houses for a target house, we then propose a method for calculating similarity score between two houses. Using 19 houses of the CASAS dataset, we evaluate the recognition performance in target houses using models trained by several similar source houses, randomly selected houses, dissimilar source houses, and all source houses without selection. Experimental results illustrate that the average accuracy of models trained from the small number of the best similar houses achieve the best performance, and thus they confirm the effectiveness of our proposed method.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124655825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variational Inference via Rényi Upper-Lower Bound Optimization 基于r<s:1>上下限优化的变分推理
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00136
Dana Oshri Zalman, S. Fine
Variational inference provides a way to approximate probability densities. It does so by optimizing an upper or a lower bound on the likelihood of the observed data (the evidence). The classic variational inference approach suggests to maximize the Evidence Lower BOund (ELBO). Recent proposals suggest to optimize the variational Rényi bound (VR) and χ upper bound. However, these estimates are either biased or difficult to approximate, due to a high variance.In this paper we introduce a new upper bound (termed VRLU) which is based on the existing variational Rényi bound. In contrast to the existing VR bound, the Monte Carlo (MC) approximation of the VRLU bound is unbiased. Furthermore, we devise a (sandwiched) upper-lower bound variational inference method (termed VRS) to jointly optimize the upper and lower bounds. We present a set of experiments, designed to evaluate the new VRLU bound, and to compare the VRS method with the classic VAE and the VR methods over a set of digit recognition tasks. The experiments and results demonstrate the VRLU bound advantage, and the wide applicability of the VRS method.
变分推理提供了一种近似概率密度的方法。它通过优化观测数据(证据)可能性的上限或下限来实现这一点。经典的变分推理方法建议最大化证据下限(ELBO)。最近的建议是优化变分r边界(VR)和χ上界。然而,由于方差很大,这些估计要么是有偏差的,要么是难以近似的。本文在已有变分rsamunyi界的基础上,引入了一个新的上界(VRLU)。与现有的虚拟现实界相比,VRLU界的蒙特卡罗(MC)近似是无偏的。此外,我们设计了一种(夹在中间的)上下界变分推理方法(VRS)来联合优化上界和下界。我们提出了一组实验,旨在评估新的VRLU边界,并在一组数字识别任务中将VRS方法与经典VAE和VR方法进行比较。实验和结果证明了VRLU绑定的优势,以及VRS方法的广泛适用性。
{"title":"Variational Inference via Rényi Upper-Lower Bound Optimization","authors":"Dana Oshri Zalman, S. Fine","doi":"10.1109/ICMLA55696.2022.00136","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00136","url":null,"abstract":"Variational inference provides a way to approximate probability densities. It does so by optimizing an upper or a lower bound on the likelihood of the observed data (the evidence). The classic variational inference approach suggests to maximize the Evidence Lower BOund (ELBO). Recent proposals suggest to optimize the variational Rényi bound (VR) and χ upper bound. However, these estimates are either biased or difficult to approximate, due to a high variance.In this paper we introduce a new upper bound (termed VRLU) which is based on the existing variational Rényi bound. In contrast to the existing VR bound, the Monte Carlo (MC) approximation of the VRLU bound is unbiased. Furthermore, we devise a (sandwiched) upper-lower bound variational inference method (termed VRS) to jointly optimize the upper and lower bounds. We present a set of experiments, designed to evaluate the new VRLU bound, and to compare the VRS method with the classic VAE and the VR methods over a set of digit recognition tasks. The experiments and results demonstrate the VRLU bound advantage, and the wide applicability of the VRS method.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131105412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hawkes Process Multi-armed Bandits for Search and Rescue 霍克斯过程多武装土匪搜索和救援
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00046
Wen-Hao Chiang, G. Mohler
We propose a novel framework for integrating Hawkes processes with multi-armed bandit algorithms to solve spatio-temporal event forecasting and detection problems when data may be undersampled or spatially biased. In particular, we introduce an upper confidence bound algorithm using Bayesian spatial Hawkes process estimation for balancing the trade-off between exploiting geographic regions where data has been collected and exploring geographic regions where data is unobserved. We first validate our model using simulated data. We then apply it to the problem of disaster search and rescue using calls for service data from hurricane Harvey in 2017 and the problem of detection and clearance of improvised explosive devices (IEDs) using IED attack records in Iraq. Our model outperforms state-of-the-art baseline spatial MAB algorithms in terms of cumulative reward and several other ranking evaluation metrics.
我们提出了一个新的框架,将Hawkes过程与多臂强盗算法相结合,以解决数据可能采样不足或空间偏差时的时空事件预测和检测问题。特别地,我们引入了一种使用贝叶斯空间霍克斯过程估计的上置信度界算法,以平衡利用已收集数据的地理区域和探索未观测到数据的地理区域之间的权衡。我们首先使用模拟数据验证我们的模型。然后,我们使用2017年哈维飓风的服务数据呼叫将其应用于灾难搜索和救援问题,以及使用伊拉克简易爆炸装置袭击记录检测和清除简易爆炸装置(IED)的问题。我们的模型在累积奖励和其他几个排名评估指标方面优于最先进的基线空间MAB算法。
{"title":"Hawkes Process Multi-armed Bandits for Search and Rescue","authors":"Wen-Hao Chiang, G. Mohler","doi":"10.1109/ICMLA55696.2022.00046","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00046","url":null,"abstract":"We propose a novel framework for integrating Hawkes processes with multi-armed bandit algorithms to solve spatio-temporal event forecasting and detection problems when data may be undersampled or spatially biased. In particular, we introduce an upper confidence bound algorithm using Bayesian spatial Hawkes process estimation for balancing the trade-off between exploiting geographic regions where data has been collected and exploring geographic regions where data is unobserved. We first validate our model using simulated data. We then apply it to the problem of disaster search and rescue using calls for service data from hurricane Harvey in 2017 and the problem of detection and clearance of improvised explosive devices (IEDs) using IED attack records in Iraq. Our model outperforms state-of-the-art baseline spatial MAB algorithms in terms of cumulative reward and several other ranking evaluation metrics.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128672750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Score-based Image-to-Image Regression with Synchronized Diffusion 基于分数的图像到图像的同步扩散回归
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00056
Hao Xin, M. Zhu
Image-to-image regression is an important computer vision task. In this paper, we propose a novel image-to-image regression model following the recent trend in generative modeling that employs Stochastic Differential Equations (SDEs) and score matching. We first apply diffusion processes to regression data using designed SDEs, and then perform inference by gradually reversing the processes. In particular, our method uses synchronized diffusion, which simultaneously applies diffusion to both input and response images to stabilize diffusion and subsequent parameter learning. Furthermore, based on the Expectation-Maximization (EM) algorithm, we develop an effective algorithm for prediction. We implement a conditional U-Net architecture with pre-trained DenseNet encoder for our proposed model and refer to it as DenseSocre. Our new model is able to generate diverse outcomes for image colorization, and the proposed prediction algorithm is able to achieve close to state-of-art performance on high-resolution monocular depth estimation.
图像到图像的回归是一项重要的计算机视觉任务。在本文中,我们提出了一种新的图像到图像回归模型,该模型采用随机微分方程(SDEs)和分数匹配,顺应了生成模型的最新趋势。我们首先使用设计的SDEs将扩散过程应用于回归数据,然后通过逐渐逆转过程来进行推理。特别是,我们的方法使用同步扩散,它同时对输入和响应图像应用扩散,以稳定扩散和随后的参数学习。在期望最大化算法的基础上,提出了一种有效的预测算法。我们为我们提出的模型实现了一个条件U-Net架构和预训练的DenseNet编码器,并将其称为DenseSocre。我们的新模型能够生成不同的图像着色结果,并且所提出的预测算法能够在高分辨率单目深度估计上实现接近最先进的性能。
{"title":"Score-based Image-to-Image Regression with Synchronized Diffusion","authors":"Hao Xin, M. Zhu","doi":"10.1109/ICMLA55696.2022.00056","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00056","url":null,"abstract":"Image-to-image regression is an important computer vision task. In this paper, we propose a novel image-to-image regression model following the recent trend in generative modeling that employs Stochastic Differential Equations (SDEs) and score matching. We first apply diffusion processes to regression data using designed SDEs, and then perform inference by gradually reversing the processes. In particular, our method uses synchronized diffusion, which simultaneously applies diffusion to both input and response images to stabilize diffusion and subsequent parameter learning. Furthermore, based on the Expectation-Maximization (EM) algorithm, we develop an effective algorithm for prediction. We implement a conditional U-Net architecture with pre-trained DenseNet encoder for our proposed model and refer to it as DenseSocre. Our new model is able to generate diverse outcomes for image colorization, and the proposed prediction algorithm is able to achieve close to state-of-art performance on high-resolution monocular depth estimation.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"219 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122447426","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Feature Reduction Method Comparison Towards Explainability and Efficiency in Cybersecurity Intrusion Detection Systems 面向网络安全入侵检测系统可解释性和效率的特征约简方法比较
Pub Date : 2022-12-01 DOI: 10.1109/ICMLA55696.2022.00211
Adam Lehavi, S. Kim
In the realm of cybersecurity, intrusion detection systems (IDS) detect and prevent attacks based on collected computer and network data. In recent research, IDS models have been constructed using machine learning (ML) and deep learning (DL) methods such as Random Forest (RF) and deep neural networks (DNN). Feature selection (FS) can be used to construct faster, more interpretable, and more accurate models. We look at three different FS techniques; RF information gain (RF-IG), correlation feature selection using the Bat Algorithm (CFS-BA), and CFS using the Aquila Optimizer (CFS-AO). Our results show CFS-BA to be the most efficient of the FS methods, building in 55% of the time of the best RF-IG model while achieving 99.99% of its accuracy. This reinforces prior contributions attesting to CFS-BA’s accuracy while building upon the relationship between subset size, CFS score, and RF-IG score in final results.
在网络安全领域,入侵检测系统(IDS)基于收集的计算机和网络数据来检测和预防攻击。在最近的研究中,IDS模型已经使用机器学习(ML)和深度学习(DL)方法,如随机森林(RF)和深度神经网络(DNN)来构建。特征选择(FS)可用于构建更快、更可解释和更准确的模型。我们来看三种不同的FS技术;RF信息增益(RF- ig),使用Bat算法(CFS- ba)进行相关特征选择,使用Aquila优化器(CFS- ao)进行CFS。我们的研究结果表明,CFS-BA是最有效的FS方法,构建最佳RF-IG模型的时间为55%,准确率为99.99%。这加强了先前证明CFS- ba准确性的贡献,同时建立在最终结果中子集大小、CFS评分和RF-IG评分之间的关系上。
{"title":"Feature Reduction Method Comparison Towards Explainability and Efficiency in Cybersecurity Intrusion Detection Systems","authors":"Adam Lehavi, S. Kim","doi":"10.1109/ICMLA55696.2022.00211","DOIUrl":"https://doi.org/10.1109/ICMLA55696.2022.00211","url":null,"abstract":"In the realm of cybersecurity, intrusion detection systems (IDS) detect and prevent attacks based on collected computer and network data. In recent research, IDS models have been constructed using machine learning (ML) and deep learning (DL) methods such as Random Forest (RF) and deep neural networks (DNN). Feature selection (FS) can be used to construct faster, more interpretable, and more accurate models. We look at three different FS techniques; RF information gain (RF-IG), correlation feature selection using the Bat Algorithm (CFS-BA), and CFS using the Aquila Optimizer (CFS-AO). Our results show CFS-BA to be the most efficient of the FS methods, building in 55% of the time of the best RF-IG model while achieving 99.99% of its accuracy. This reinforces prior contributions attesting to CFS-BA’s accuracy while building upon the relationship between subset size, CFS score, and RF-IG score in final results.","PeriodicalId":128160,"journal":{"name":"2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126637594","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1