2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文中文

Building Action Sets in a Deep Reinforcement Learner 在深度强化学习器中构建动作集

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00081

Yongzhao Wang, Arunesh Sinha, Sky CH-Wang, Michael P. Wellman

In many policy-learning applications, the agent may execute a set of actions at each decision stage. Choosing among an exponential number of alternatives poses a computational challenge, and even representing actions naturally expressed as sets can be a tricky design problem. Building upon prior approaches that employ deep neural networks and iterative construction of action sets, we introduce a reward-shaping approach to apportion reward to each atomic action based on its marginal contribution within an action set, thereby providing useful feedback for learning to build these sets. We demonstrate our method in two environments where action spaces are combinatorial. Experiments reveal that our method significantly accelerates and stabilizes policy learning with combinatorial actions.

在许多策略学习应用程序中，代理可能在每个决策阶段执行一组操作。在指数级的备选方案中进行选择是一项计算挑战，甚至将自然表达为集合的动作也可能是一个棘手的设计问题。在先前使用深度神经网络和动作集迭代构建的方法的基础上，我们引入了一种奖励塑造方法，根据每个原子动作在动作集中的边际贡献来分配奖励，从而为学习构建这些集合提供有用的反馈。我们在两个动作空间是组合的环境中演示了我们的方法。实验表明，我们的方法可以显著地加速和稳定组合行为下的策略学习。

引用次数: 1

Learn to Trace Odors: Autonomous Odor Source Localization via Deep Learning Methods 学习追踪气味:通过深度学习方法自主定位气味源

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00230

Lingxiao Wang, S. Pang, Jinlong Li

Autonomous odor source localization (OSL) has been viewed as a challenging task due to the nature of turbulent airflows and the resulting odor plume characteristics. Here we present an olfactory-based navigation algorithm via deep learning (DL) methods, which navigates a mobile robot to find an odor source without explicating specific search algorithms. Two types of deep neural networks (DNNs), namely traditional feedforward and convolutional neural networks (FNN and CNN), are proposed to generate robot velocity commands on x and y directions based on onboard sensor measurements. Training data is obtained by applying the traditional olfactory-based navigation algorithms, including moth-inspired and Bayesian-inference methods, in thousands of simulated OSL trials. After the supervised training, DNN models are validated in OSL tests with varying search conditions. Experiment results show that given the same training data, CNN is more effective than FNN, and by training with a fused data set, the proposed CNN achieves a comparable search performance with the Bayesian-inference method while requires less computational time.

由于湍流气流的性质和产生的气味羽流特征，自主气味源定位(OSL)一直被视为一项具有挑战性的任务。在这里，我们提出了一种基于嗅觉的导航算法，该算法通过深度学习(DL)方法，在不说明特定搜索算法的情况下，导航移动机器人找到气味源。提出了两种深度神经网络(dnn)，即传统的前馈神经网络和卷积神经网络(FNN和CNN)，根据机载传感器的测量结果生成机器人在x和y方向上的速度命令。在数千次模拟的OSL试验中，应用传统的基于嗅觉的导航算法(包括飞蛾启发和贝叶斯推理方法)获得训练数据。经过监督训练后，DNN模型在不同搜索条件下的OSL测试中得到验证。实验结果表明，在相同的训练数据下，CNN比FNN更有效，并且通过融合数据集的训练，该CNN的搜索性能与贝叶斯推理方法相当，而计算时间更少。

{"title":"Learn to Trace Odors: Autonomous Odor Source Localization via Deep Learning Methods","authors":"Lingxiao Wang, S. Pang, Jinlong Li","doi":"10.1109/ICMLA52953.2021.00230","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00230","url":null,"abstract":"Autonomous odor source localization (OSL) has been viewed as a challenging task due to the nature of turbulent airflows and the resulting odor plume characteristics. Here we present an olfactory-based navigation algorithm via deep learning (DL) methods, which navigates a mobile robot to find an odor source without explicating specific search algorithms. Two types of deep neural networks (DNNs), namely traditional feedforward and convolutional neural networks (FNN and CNN), are proposed to generate robot velocity commands on x and y directions based on onboard sensor measurements. Training data is obtained by applying the traditional olfactory-based navigation algorithms, including moth-inspired and Bayesian-inference methods, in thousands of simulated OSL trials. After the supervised training, DNN models are validated in OSL tests with varying search conditions. Experiment results show that given the same training data, CNN is more effective than FNN, and by training with a fused data set, the proposed CNN achieves a comparable search performance with the Bayesian-inference method while requires less computational time.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"36 1","pages":"1429-1436"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85823074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Super Resolution with Sparse Gradient-Guided Attention for Suppressing Structural Distortion 基于稀疏梯度引导注意力的超分辨率结构畸变抑制

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00146

Geonhak Song, Tien-Dung Nguyen, J. Bum, Hwijong Yi, C. Son, Hyunseung Choo

Generative adversarial network (GAN)-based methods recover perceptually pleasant details in super resolution (SR), but they pertain to structural distortions. Recent study alleviates such structural distortions by attaching a gradient branch to the generator. However, this method compromises the perceptual details. In this paper, we propose a sparse gradient-guided attention generative adversarial network (SGAGAN), which incorporates a modified residual-in-residual sparse block (MRRSB) in the gradient branch and gradient-guided self-attention (GSA) to suppress structural distortions. Compared to the most frequently used block in GAN-based SR methods, i.e., residual-in-residual dense block (RRDB), MRRSB reduces computational cost and avoids gradient redundancy. In addition, GSA emphasizes the highly correlated features in the generator by guiding sparse gradient. It captures the semantic information by connecting the global interdependencies of the sparse gradient features in the gradient branch and the features in the SR branch. Experimental results show that SGAGAN relieves the structural distortions and generates more realistic images compared to state-of-the-art SR methods. Qualitative and quantitative evaluations in the ablation study show that combining GSA and MRRSB together has a better perceptual quality than combining self-attention alone.

基于生成对抗网络(GAN)的方法在超分辨率(SR)中恢复感知愉悦的细节，但它们适用于结构扭曲。最近的研究通过在发电机上附加一个梯度支路来减轻这种结构扭曲。然而，这种方法损害了感知细节。本文提出了一种稀疏梯度引导注意力生成对抗网络(SGAGAN)，该网络在梯度分支中引入了改进的残差稀疏块(MRRSB)和梯度引导自注意(GSA)来抑制结构扭曲。与基于gan的SR方法中最常用的块即残差密集块(RRDB)相比，MRRSB降低了计算成本并避免了梯度冗余。此外，GSA通过引导稀疏梯度来强调生成器中高度相关的特征。它通过连接梯度分支中的稀疏梯度特征和SR分支中的特征的全局相互依赖关系来捕获语义信息。实验结果表明，与目前最先进的SR方法相比，sagan减轻了结构扭曲，生成的图像更真实。消融研究的定性和定量评价表明，GSA和MRRSB联合使用比单独使用自我注意具有更好的感知质量。

{"title":"Super Resolution with Sparse Gradient-Guided Attention for Suppressing Structural Distortion","authors":"Geonhak Song, Tien-Dung Nguyen, J. Bum, Hwijong Yi, C. Son, Hyunseung Choo","doi":"10.1109/ICMLA52953.2021.00146","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00146","url":null,"abstract":"Generative adversarial network (GAN)-based methods recover perceptually pleasant details in super resolution (SR), but they pertain to structural distortions. Recent study alleviates such structural distortions by attaching a gradient branch to the generator. However, this method compromises the perceptual details. In this paper, we propose a sparse gradient-guided attention generative adversarial network (SGAGAN), which incorporates a modified residual-in-residual sparse block (MRRSB) in the gradient branch and gradient-guided self-attention (GSA) to suppress structural distortions. Compared to the most frequently used block in GAN-based SR methods, i.e., residual-in-residual dense block (RRDB), MRRSB reduces computational cost and avoids gradient redundancy. In addition, GSA emphasizes the highly correlated features in the generator by guiding sparse gradient. It captures the semantic information by connecting the global interdependencies of the sparse gradient features in the gradient branch and the features in the SR branch. Experimental results show that SGAGAN relieves the structural distortions and generates more realistic images compared to state-of-the-art SR methods. Qualitative and quantitative evaluations in the ablation study show that combining GSA and MRRSB together has a better perceptual quality than combining self-attention alone.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"104 1","pages":"885-890"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88163013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00013

R. Zuech, John T. Hancock, T. Khoshgoftaar

We introduce the novel concept of feature popularity with three different web attacks and big data from the CSE-CIC-IDS2018 dataset: Brute Force, SQL Injection, and XSS web attacks. Feature popularity is based upon ensemble Feature Selection Techniques (FSTs) and allows us to more easily understand common important features between different cyberattacks, for two main reasons. First, feature popularity lists can be generated to provide an easy comprehension of important features across different attacks. Second, the Jaccard similarity metric can provide a quantitative score for how similar feature subsets are between different attacks. Both of these approaches not only provide more explainable and easier-to-understand models, but they can also reduce the complexity of implementing models in real-world systems. Four supervised learning-based FSTs are used to generate feature subsets for each of our three different web attack datasets, and then our feature popularity frameworks are applied. For these three web attacks, the XSS and SQL Injection feature subsets are the most similar per the Jaccard similarity. The most popular features across all three web attacks are: Flow_Bytes_s, Flow_IAT_Max, and Flow_Packets_s. While this introductory study is only a simple example using only three web attacks, this feature popularity concept can be easily extended, allowing an automated framework to more easily determine the most popular features across a very large number of attacks and features.

我们通过CSE-CIC-IDS2018数据集中的三种不同的web攻击和大数据引入了特征流行度的新概念:暴力破解、SQL注入和XSS web攻击。特征流行度基于集成特征选择技术(FSTs)，使我们能够更容易地理解不同网络攻击之间的共同重要特征，主要有两个原因。首先，可以生成特性流行度列表，以便轻松理解不同攻击之间的重要特性。其次，Jaccard相似度度量可以为不同攻击之间的特征子集的相似程度提供定量评分。这两种方法不仅提供了更易于解释和更易于理解的模型，而且还可以减少在实际系统中实现模型的复杂性。使用四个基于监督学习的fst为我们的三个不同的web攻击数据集生成特征子集，然后应用我们的特征流行框架。对于这三种web攻击，XSS和SQL注入的特征子集按照Jaccard相似性是最相似的。这三种网络攻击中最受欢迎的特性是:Flow_Bytes_s、Flow_IAT_Max和Flow_Packets_s。虽然这个介绍性的研究只是一个简单的例子，只使用了三种web攻击，但这个功能流行度的概念可以很容易地扩展，允许一个自动化的框架更容易地在大量的攻击和功能中确定最流行的功能。

{"title":"Feature Popularity Between Different Web Attacks with Supervised Feature Selection Rankers","authors":"R. Zuech, John T. Hancock, T. Khoshgoftaar","doi":"10.1109/ICMLA52953.2021.00013","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00013","url":null,"abstract":"We introduce the novel concept of feature popularity with three different web attacks and big data from the CSE-CIC-IDS2018 dataset: Brute Force, SQL Injection, and XSS web attacks. Feature popularity is based upon ensemble Feature Selection Techniques (FSTs) and allows us to more easily understand common important features between different cyberattacks, for two main reasons. First, feature popularity lists can be generated to provide an easy comprehension of important features across different attacks. Second, the Jaccard similarity metric can provide a quantitative score for how similar feature subsets are between different attacks. Both of these approaches not only provide more explainable and easier-to-understand models, but they can also reduce the complexity of implementing models in real-world systems. Four supervised learning-based FSTs are used to generate feature subsets for each of our three different web attack datasets, and then our feature popularity frameworks are applied. For these three web attacks, the XSS and SQL Injection feature subsets are the most similar per the Jaccard similarity. The most popular features across all three web attacks are: Flow_Bytes_s, Flow_IAT_Max, and Flow_Packets_s. While this introductory study is only a simple example using only three web attacks, this feature popularity concept can be easily extended, allowing an automated framework to more easily determine the most popular features across a very large number of attacks and features.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"94 1","pages":"30-37"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88192387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

EMU: Early Mental Health Uncovering Framework and Dataset EMU:早期心理健康揭示框架和数据集

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00213

M. L. Tlachac, E. Toto, Joshua Lovering, Rimsha Kayastha, Nina Taurich, E. Rundensteiner

Mental illnesses are often undiagnosed, demonstrating need for an effective unbiased alternative to traditional screening surveys. For this we propose our Early Mental Health Uncovering (EMU) framework that supports near instantaneous mental illness screening with non-intrusive active and passive modalities. We designed, deployed, and evaluated the EMU app to passively collect retrospective digital phenotype data and actively collect short voice recordings. Additionally, the EMU app also administered depression and anxiety screening surveys to produce depression and anxiety screening labels for the data. Notably, more than twice as many participants elected to share scripted audio recordings than any passive modality. We then study the effectiveness of machine learning models trained with the active modalities. Using scripted audio, EMU screens for depression with F1=0.746, anxiety with F1=0.667, and suicidal ideation with F1=0.706. Using unscripted audio, EMU screens for depression with F1=0.691, anxiety with F1=0.636, and suicidal ideation with F1=0.667. Jitter is an important feature for screening with scripted audio, while Mel-Frequency Cepstral Coefficient is an important feature for screening with unscripted audio. Further, the frequency of help-related words carried a strong signal for suicidal ideation screening with unscripted audio transcripts. This research results in a deeper understanding of the selection of modalities and corresponding features for mobile screening. The EMU dataset will be made available to public domain, representing valuable data resource for the community to further advance universal mental illness screening research.

精神疾病往往没有得到诊断，这表明需要一种有效、公正的方法来替代传统的筛查调查。为此，我们提出了我们的早期心理健康发现(EMU)框架，该框架支持近乎即时的非侵入性主动和被动模式的精神疾病筛查。我们设计、部署并评估了EMU应用程序，以被动地收集回顾性数字表型数据，并主动收集短录音。此外，EMU应用程序还管理抑郁和焦虑筛查调查，为数据生成抑郁和焦虑筛查标签。值得注意的是，选择分享脚本录音的参与者是任何被动语态的两倍多。然后，我们研究了用主动模态训练的机器学习模型的有效性。使用脚本音频，EMU对F1=0.746的抑郁、F1=0.667的焦虑和F1=0.706的自杀意念进行筛查。使用无脚本音频，EMU筛选F1=0.691的抑郁、F1=0.636的焦虑和F1=0.667的自杀意念。抖动是筛选脚本音频的重要特征，而Mel-Frequency倒谱系数是筛选非脚本音频的重要特征。此外，与帮助相关的单词的频率对无脚本音频文本的自杀意念筛查具有强烈的信号。本研究对移动筛查的模式选择和特征有了更深入的了解。EMU数据集将向公众开放，为社会进一步推进普遍精神疾病筛查研究提供宝贵的数据资源。

{"title":"EMU: Early Mental Health Uncovering Framework and Dataset","authors":"M. L. Tlachac, E. Toto, Joshua Lovering, Rimsha Kayastha, Nina Taurich, E. Rundensteiner","doi":"10.1109/ICMLA52953.2021.00213","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00213","url":null,"abstract":"Mental illnesses are often undiagnosed, demonstrating need for an effective unbiased alternative to traditional screening surveys. For this we propose our Early Mental Health Uncovering (EMU) framework that supports near instantaneous mental illness screening with non-intrusive active and passive modalities. We designed, deployed, and evaluated the EMU app to passively collect retrospective digital phenotype data and actively collect short voice recordings. Additionally, the EMU app also administered depression and anxiety screening surveys to produce depression and anxiety screening labels for the data. Notably, more than twice as many participants elected to share scripted audio recordings than any passive modality. We then study the effectiveness of machine learning models trained with the active modalities. Using scripted audio, EMU screens for depression with F1=0.746, anxiety with F1=0.667, and suicidal ideation with F1=0.706. Using unscripted audio, EMU screens for depression with F1=0.691, anxiety with F1=0.636, and suicidal ideation with F1=0.667. Jitter is an important feature for screening with scripted audio, while Mel-Frequency Cepstral Coefficient is an important feature for screening with unscripted audio. Further, the frequency of help-related words carried a strong signal for suicidal ideation screening with unscripted audio transcripts. This research results in a deeper understanding of the selection of modalities and corresponding features for mobile screening. The EMU dataset will be made available to public domain, representing valuable data resource for the community to further advance universal mental illness screening research.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"20 1","pages":"1311-1318"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86258299","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 11

Identifying Catheter and Line Position in Chest X-Rays Using GANs 使用gan识别胸部x光中导管和线的位置

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00027

Milan Aryal, Nasim Yahyasoltani

Catheter is a thin tube that is inserted into patients body to provide fluids or medication. The placement of catheter in the chest is very important and if placed wrongly can be life threatening. Radiologists utilize X-ray images of the chest to determine the correctness of placement of catheter. In the time of global pandemic, when the hospitals are crowded with the patients, radiologists might not be able to manually observe all the X-rays. In this situation, an automatic method to identify catheter in the X-ray images would be of great help. In this paper, a novel method to automatically detect the presence and position of the catheter using X-ray images is developed. The proposed algorithm deploys generative adversarial network (GAN) to synthesize the catheter in X-ray images. Transfer learning is then used to classify the catheter and its correct placement. Octave convolution instead of vanilla convolution is utilized to improve the efficiency of deep learning method for classification. Through data augmentation different transformation of images are generated to make the model more robust to noisy images.

导管是一种插入病人体内提供液体或药物的细管。导管在胸部的放置是非常重要的，如果放置错误可能会危及生命。放射科医生利用胸部x光片来确定导管放置的正确性。在全球大流行时期，医院里挤满了病人，放射科医生可能无法手动观察所有的x光片。在这种情况下，在x射线图像中自动识别导管的方法将大有帮助。本文提出了一种利用x射线图像自动检测导管存在和位置的新方法。该算法利用生成对抗网络(GAN)在x射线图像中合成导管。然后使用迁移学习对导管进行分类和正确放置。利用八度卷积代替普通卷积来提高深度学习分类方法的效率。通过数据增强，生成图像的不同变换，使模型对噪声图像具有更强的鲁棒性。

{"title":"Identifying Catheter and Line Position in Chest X-Rays Using GANs","authors":"Milan Aryal, Nasim Yahyasoltani","doi":"10.1109/ICMLA52953.2021.00027","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00027","url":null,"abstract":"Catheter is a thin tube that is inserted into patients body to provide fluids or medication. The placement of catheter in the chest is very important and if placed wrongly can be life threatening. Radiologists utilize X-ray images of the chest to determine the correctness of placement of catheter. In the time of global pandemic, when the hospitals are crowded with the patients, radiologists might not be able to manually observe all the X-rays. In this situation, an automatic method to identify catheter in the X-ray images would be of great help. In this paper, a novel method to automatically detect the presence and position of the catheter using X-ray images is developed. The proposed algorithm deploys generative adversarial network (GAN) to synthesize the catheter in X-ray images. Transfer learning is then used to classify the catheter and its correct placement. Octave convolution instead of vanilla convolution is utilized to improve the efficiency of deep learning method for classification. Through data augmentation different transformation of images are generated to make the model more robust to noisy images.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"6 1","pages":"122-127"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82948878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Graph-Based Spatial Cross-Validation Approach for Assessing Models Learned with Selected Features to Understand Election Results 一种基于图的空间交叉验证方法，用于评估使用选定特征学习的模型以理解选举结果

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00150

Tiago Pinho da Silva, A. R. Parmezan, Gustavo E. A. P. A. Batista

Elections are complex activities fundamental to any democracy. The contextualized analysis of election data allows us to understand electoral behavior and the factors that influence it. Multidisciplinary studies have been prioritized the predictive modeling of electoral features from thousands of explanatory features, considering geographic and spatial aspects inherent to the data. When building a model for such a purpose, it must be rigorously evaluated to understand its prediction error in future test cases. Although cross-validation is a widely used procedure for this task, it leads to optimistic results because the spatial independence between test and training data is not ensured in the resampling. On the other hand, alternatives to deal with spatial dependence may fall into a pessimistic scenario by assuming total spatial independence between the test and training sets regardless of the size of the first one, increasing the probability of overfitting. This paper addresses these issues by proposing a graph-based spatial cross-validation approach to assess models learned with selected features from spatially contextualized electoral datasets. Our approach takes advantage of the spatial graph structure provided by the lattice-type spatial objects to define a local training set to each test fold. We generate the local training sets by removing spatially close data that are highly correlated and irrelevant distant data that may interfere with error estimates. Experiments involving the second round of the 2018 Brazilian presidential election demonstrate that our approach contributes to the fair evaluation of models by enabling more realistic and local modeling.

选举是复杂的活动，是任何民主制度的基础。对选举数据的情境化分析使我们能够理解选举行为及其影响因素。考虑到数据固有的地理和空间方面，多学科研究已优先考虑从数千个解释特征对选举特征进行预测建模。当为这样的目的构建模型时，必须严格地评估它，以了解它在未来测试用例中的预测误差。虽然交叉验证是该任务中广泛使用的一种方法，但由于在重采样中不能保证测试数据和训练数据之间的空间独立性，因此结果并不乐观。另一方面，处理空间依赖性的替代方案可能会陷入悲观的情况，即假设测试集和训练集之间的空间完全独立，而不管第一个集的大小，从而增加了过拟合的概率。本文通过提出基于图的空间交叉验证方法来解决这些问题，以评估从空间上下文化选举数据集中选择特征学习的模型。我们的方法利用格子型空间对象提供的空间图结构为每个测试折叠定义一个局部训练集。我们通过去除可能干扰误差估计的高度相关和不相关的远程数据的空间接近数据来生成局部训练集。涉及2018年巴西总统选举第二轮的实验表明，我们的方法通过实现更现实和局部的建模，有助于对模型进行公平评估。

{"title":"A Graph-Based Spatial Cross-Validation Approach for Assessing Models Learned with Selected Features to Understand Election Results","authors":"Tiago Pinho da Silva, A. R. Parmezan, Gustavo E. A. P. A. Batista","doi":"10.1109/ICMLA52953.2021.00150","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00150","url":null,"abstract":"Elections are complex activities fundamental to any democracy. The contextualized analysis of election data allows us to understand electoral behavior and the factors that influence it. Multidisciplinary studies have been prioritized the predictive modeling of electoral features from thousands of explanatory features, considering geographic and spatial aspects inherent to the data. When building a model for such a purpose, it must be rigorously evaluated to understand its prediction error in future test cases. Although cross-validation is a widely used procedure for this task, it leads to optimistic results because the spatial independence between test and training data is not ensured in the resampling. On the other hand, alternatives to deal with spatial dependence may fall into a pessimistic scenario by assuming total spatial independence between the test and training sets regardless of the size of the first one, increasing the probability of overfitting. This paper addresses these issues by proposing a graph-based spatial cross-validation approach to assess models learned with selected features from spatially contextualized electoral datasets. Our approach takes advantage of the spatial graph structure provided by the lattice-type spatial objects to define a local training set to each test fold. We generate the local training sets by removing spatially close data that are highly correlated and irrelevant distant data that may interfere with error estimates. Experiments involving the second round of the 2018 Brazilian presidential election demonstrate that our approach contributes to the fair evaluation of models by enabling more realistic and local modeling.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"81 1","pages":"909-915"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88088855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Game Character Facial Animation Using Actor Video Corpus and Recurrent Neural Networks 基于演员视频语料库和递归神经网络的游戏角色面部动画

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00113

Sheldon Schiffer

Creating photorealistic facial animation for game characters is a labor-intensive process that gives authorial primacy to animators. This research presents an experimental autonomous animation controller based on an emotion model that uses a team of embedded recurrent neural networks (RNNs). The design is a novel alternative method that can elevate an actor’s contribution to game character design. This research presents the first results of combining a facial emotion neural network model with a workflow that incorporates actor preparation methods and the training of auto-regressive bi-directional RNNs with long short-term memory (LSTM) cells. The predicted emotion vectors triggered by player facial stimuli strongly resemble a performing actor for a game character with accuracies over 80% for targeted emotion labels and show accuracy near or above a high baseline standard.

为游戏角色创造逼真的面部动画是一个劳动密集型的过程，赋予动画师以作者的首要地位。本研究提出了一种基于情感模型的实验性自主动画控制器，该模型使用了一组嵌入式递归神经网络(rnn)。这种设计是一种新颖的替代方法，可以提升演员对游戏角色设计的贡献。本研究提出了将面部情绪神经网络模型与工作流相结合的第一个结果，该工作流结合了演员准备方法和具有长短期记忆(LSTM)细胞的自回归双向rnn的训练。由玩家面部刺激触发的预测情感向量非常类似于游戏角色的表演演员，目标情感标签的准确率超过80%，并且显示出接近或高于高基线标准的准确性。

引用次数: 1

Pruned Genetic-NAS on GPU Accelerator Platforms with Chaos-on-Edge Hyperparameters 带有边缘混沌超参数的GPU加速器平台上的剪枝遗传nas

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00158

Anand Ravishankar, S. Natarajan, A. B. Malakreddy

Deep Neural Networks (DNNs) is an extremely attractive subset of computational models due to their remarkable ability to provide promising results for a wide variety of problems. However, the performance delivered by DNNs often overshadows the work done before training the network, which includes Network Architecture Search (NAS) and its suitability concerning the task. This paper presents a modified Genetic-NAS framework designed to prevent network stagnation and reduce training loss. The network hyperparameters are initialized in a “Chaos on Edge” region, preventing premature convergence through reverse biases. The Genetic-NAS and parameter space exploration process is co-evolved by applying genetic operators and subjugating them to layer-wise competition. The inherent parallelism offered by both the neural network and its genetic extension is exploited by deploying the model on a GPU which improves the throughput. the GPU device provides an acceleration of 8.4x with 92.9% of the workload placed on the GPU device for the text-based datasets. On average, the task of classifying an image-based dataset takes 3 GPU hours.

深度神经网络(dnn)是一个非常有吸引力的计算模型子集，因为它们具有为各种各样的问题提供有希望的结果的卓越能力。然而，深度神经网络提供的性能往往掩盖了训练网络之前所做的工作，其中包括网络架构搜索(NAS)及其对任务的适用性。本文提出了一种改进的遗传- nas框架，旨在防止网络停滞和减少训练损失。网络超参数在“混沌边缘”区域初始化，防止通过反向偏差过早收敛。遗传- nas和参数空间探索过程是通过应用遗传算子并使它们服从于分层竞争而共同进化的。通过在GPU上部署该模型，利用神经网络及其遗传扩展所提供的固有并行性，提高了吞吐量。GPU设备提供8.4倍的加速，92.9%的工作负载放在GPU设备上用于基于文本的数据集。平均而言，对基于图像的数据集进行分类的任务需要3个GPU小时。

{"title":"Pruned Genetic-NAS on GPU Accelerator Platforms with Chaos-on-Edge Hyperparameters","authors":"Anand Ravishankar, S. Natarajan, A. B. Malakreddy","doi":"10.1109/ICMLA52953.2021.00158","DOIUrl":"https://doi.org/10.1109/ICMLA52953.2021.00158","url":null,"abstract":"Deep Neural Networks (DNNs) is an extremely attractive subset of computational models due to their remarkable ability to provide promising results for a wide variety of problems. However, the performance delivered by DNNs often overshadows the work done before training the network, which includes Network Architecture Search (NAS) and its suitability concerning the task. This paper presents a modified Genetic-NAS framework designed to prevent network stagnation and reduce training loss. The network hyperparameters are initialized in a “Chaos on Edge” region, preventing premature convergence through reverse biases. The Genetic-NAS and parameter space exploration process is co-evolved by applying genetic operators and subjugating them to layer-wise competition. The inherent parallelism offered by both the neural network and its genetic extension is exploited by deploying the model on a GPU which improves the throughput. the GPU device provides an acceleration of 8.4x with 92.9% of the workload placed on the GPU device for the text-based datasets. On average, the task of classifying an image-based dataset takes 3 GPU hours.","PeriodicalId":6750,"journal":{"name":"2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"28 1","pages":"958-963"},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84154561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Discrete Latent Variables Discovery and Structure Learning in Mixed Bayesian Networks 混合贝叶斯网络中的离散潜变量发现与结构学习

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

Pub Date : 2021-12-01 DOI: 10.1109/ICMLA52953.2021.00046

Aviv Peled, S. Fine

Latent variables pose a challenge for accurate modelling, experimental design, and inference, since they may cause non-adjustable bias in the estimation of effects. While most of the research regarding latent variables revolves around accounting for their presence and learning how they interact with other variables in the experiment, their bare existence is assumed to be deduced based on domain expertise. In this work we focus on the discovery of such latent variables, utilizing statistical hypothesis testing methods and Bayesian Networks learning. Specifically, we present a novel method for detecting discrete latent factors which affect continuous observed outcomes, in mixed discrete/continuous observed data, and device a structure learning algorithm that adds the detected latent factors to a fully observed Bayesian Network. Finally, we demonstrate the utility of our method with a set of experiments, in both controlled and real-life settings, one of which is a prediction for the outcome of COVID-19 test results.

潜在变量对准确建模、实验设计和推断提出了挑战，因为它们可能在效果估计中引起不可调节的偏差。虽然大多数关于潜在变量的研究都是围绕着它们的存在和学习它们如何与实验中的其他变量相互作用展开的，但它们的存在被认为是基于领域专业知识推断出来的。在这项工作中，我们专注于发现这些潜在变量，利用统计假设检验方法和贝叶斯网络学习。具体来说，我们提出了一种新的方法来检测影响连续观测结果的离散潜在因素，在混合的离散/连续观测数据中，并设计了一种结构学习算法，将检测到的潜在因素添加到完全观察的贝叶斯网络中。最后，我们在控制和现实环境中进行了一组实验，其中一个是对COVID-19检测结果的预测，以此来证明我们的方法的实用性。

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀