Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.00056
Hao Xin, M. Zhu
Image-to-image regression is an important computer vision task. In this paper, we propose a novel image-to-image regression model following the recent trend in generative modeling that employs Stochastic Differential Equations (SDEs) and score matching. We first apply diffusion processes to regression data using designed SDEs, and then perform inference by gradually reversing the processes. In particular, our method uses synchronized diffusion, which simultaneously applies diffusion to both input and response images to stabilize diffusion and subsequent parameter learning. Furthermore, based on the Expectation-Maximization (EM) algorithm, we develop an effective algorithm for prediction. We implement a conditional U-Net architecture with a pre-trained DenseNet encoder for our proposed model and refer to it as DenseScore. Our new model is able to generate diverse outcomes for image colorization, and the proposed prediction algorithm achieves close to state-of-the-art performance on high-resolution monocular depth estimation.
Title: Score-based Image-to-Image Regression with Synchronized Diffusion. Published in: 2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA).
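To make the synchronized-diffusion idea concrete, the sketch below applies the same noise level to an input/response image pair at every step of a discretized variance-preserving schedule, so both images are corrupted in lockstep. This is a hedged illustration, not the paper's implementation; the function name and schedule parameters (`beta_min`, `beta_max`, `n_steps`) are assumptions.

```python
import numpy as np

def synchronized_diffuse(x, y, t, n_steps=1000, beta_min=0.1, beta_max=20.0, rng=None):
    """Jointly diffuse an input image x and response image y to time step t
    using one shared variance-preserving noise schedule (simplified sketch)."""
    rng = np.random.default_rng() if rng is None else rng
    betas = np.clip(np.linspace(beta_min / n_steps, beta_max / n_steps, n_steps),
                    0.0, 0.999)
    alpha_bar = np.cumprod(1.0 - betas)[t]       # cumulative signal retention
    scale, sigma = np.sqrt(alpha_bar), np.sqrt(1.0 - alpha_bar)
    # The same (scale, sigma) pair is applied to both images, keeping their
    # corruption levels synchronized at every step.
    x_t = scale * x + sigma * rng.standard_normal(x.shape)
    y_t = scale * y + sigma * rng.standard_normal(y.shape)
    return x_t, y_t
```

Reverse-time inference would then denoise both images jointly with a learned score model, which this sketch does not cover.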
Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.00127
Sheuli Paul, Michael Sintek, Veton Këpuska, M. Silaghi, Liam Robertson
Understanding intent is an essential step in maintaining effective communication. This capability is used in communications for assembly, patrolling, and surveillance. A fused and interactive multimodal system for human-robot communication, used in assembly applications, is presented in this paper. Communication is multimodal. Having multiple communication modes such as gestures, text, symbols, graphics, images, and speech increases the chance of effective communication. Intent is the main component we aim to model, specifically in human-machine dialogues. For this, we extract intents from spoken dialogues and fuse each intent with any detected matching gesture used in interaction with the robot. The main components of the presented system are: (1) a speech recognition system using Kaldi, (2) a deep-learning-based Dual Intent and Entity Transformer (DIET) classifier for intent and entity extraction, (3) a hand gesture recognition system, and (4) a dynamic fusion model for speech- and gesture-based communication. These are evaluated in a contextual assembly situation using a simulated interactive robot.
Title: Intent based Multimodal Speech and Gesture Fusion for Human-Robot Communication in Assembly Situation.
Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.00076
Ruchita Mehta, V. Palade, S. Sharifzadeh, Bo Tan, Yordanka Karayaneva
Remote Human Activity Recognition (HAR) in private residential areas has a beneficial influence on the lives of the elderly, since this population requires regular monitoring of health conditions. This paper addresses the problem of continuous detection of daily human activities using mm-wave Doppler radar. Unlike most previous research, this work records the data as continuous series of activities rather than individual activities. Such series of activities resemble real-life activity patterns. The Dynamic Time Warping (DTW) algorithm is used to detect human activities in the recorded time series and is compared to other time-series classification methods. DTW requires less labelled data. The input for DTW was provided using three strategies, and the obtained results were compared against each other. The first approach uses the pixel-level data of frames (named UnSup-PLevel). In the other two strategies, a Convolutional Variational Autoencoder (CVAE) is used to extract Unsupervised Encoded features (UnSup-EnLevel) and Supervised Encoded features (Sup-EnLevel) from the series of Doppler frames. The results demonstrate the superiority of the Sup-EnLevel features over the UnSup-EnLevel and UnSup-PLevel strategies. However, the UnSup-PLevel strategy performed surprisingly well without using annotations.
Title: Continuous Human Activity Recognition using Radar Imagery and Dynamic Time Warping.
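For reference, the DTW distance used above can be computed with the standard dynamic-programming recurrence. This minimal sketch for 1-D sequences illustrates the algorithm only; it is not the authors' pipeline, which operates on radar frame features.

```python
import numpy as np

def dtw_distance(a, b):
    """Dynamic Time Warping distance between two 1-D sequences, allowing
    non-linear alignment of time steps via the classic DP recurrence."""
    n, m = len(a), len(b)
    D = np.full((n + 1, m + 1), np.inf)
    D[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = abs(a[i - 1] - b[j - 1])
            # best of: insertion, deletion, match
            D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
    return D[n, m]
```

Because DTW compares a query directly against labelled templates, a nearest-neighbour classifier built on it needs only a handful of labelled series, which is the property the abstract highlights.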
Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.00273
Menore Tekeba Mengistu, Getachew Alemu, P. Chevaillier, P. D. Loor
In this paper, we provide an unsupervised contrastive representation learning method that uses contrastive views in which both spatial and temporal similarity-contrast are balanced. The balanced views are created by taking pixels from the anchor sample and a randomly selected negative sample, balancing the ratio of pixels taken from each. These balanced views are then paired with the anchor to create the positive contrastive views, and all other samples paired with the anchor are taken as negative contrastive views. We evaluate the method on reinforcement learning tasks from Atari games and the DeepMind Control suite (DMControl). Our evaluations on 26 Atari games and six DMControl tasks show that the proposed method is superior at learning spatio-temporally evolving factors of the environment, capturing the relevant task-controlling generative factors from the agents' raw observations.
Title: Balancing Similarity-Contrast in Unsupervised Representation Learning: Evaluation with Reinforcement Learning.
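The pixel-mixing construction described above can be sketched as follows. The random per-pixel mask and the `ratio` parameter are illustrative assumptions about how the anchor/negative pixel balance might be implemented, not the authors' exact procedure.

```python
import numpy as np

def balanced_view(anchor, negative, ratio=0.5, rng=None):
    """Build a contrastive view by taking roughly a `ratio` fraction of
    pixels from the anchor and the remainder from a negative sample."""
    rng = np.random.default_rng() if rng is None else rng
    mask = rng.random(anchor.shape) < ratio   # True -> keep anchor pixel
    return np.where(mask, anchor, negative)
```

Varying `ratio` moves the view along the similarity-contrast spectrum: near 1.0 it is almost the anchor (strongly positive), near 0.0 it is almost the negative.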
Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.00204
Lin Zhou, Eric Fischer, C. M. Brahms, U. Granacher, B. Arnrich
Neural networks have been successfully applied to a wide range of human motion analysis topics in combination with wearable sensor data. However, their computation process is not readily comprehensible. Moreover, many model interpretation efforts do not provide physiologically relevant insights, which still limits their use in clinical settings. In this work, we take gait modifications under fatigue and cognitive task performance as a use case to show how in-depth investigations of neural networks can be performed using wearable sensor data. We collected walking data from 16 young healthy individuals in unfatigued and fatigued states and under single-task (walking only) and dual-task (walking while concurrently performing a cognitive task) conditions using inertial measurement units. Convolutional neural networks were able to identify both fatigue and dual-task gait patterns with high classification accuracy. To interpret the model, the importance of each time step in the input time series was visualized using Layer-wise Relevance Propagation. The visualization revealed highly individualized gait changes among participants, as well as changes at precise time steps of the input signal, enabling further investigation of potential underlying mechanisms. Our methods enable in-depth analysis of human movement using transparent neural networks with data collected from unobtrusive, mobile wearable sensors.
Title: Using Transparent Neural Networks and Wearable Inertial Sensors to Generate Physiologically-Relevant Insights for Gait.
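Layer-wise Relevance Propagation, used above to score time-step importance, redistributes a layer's output relevance back to its inputs in proportion to their contributions, so that total relevance is conserved. A minimal epsilon-rule sketch for a single bias-free linear layer (an illustration of the rule, not the authors' implementation):

```python
import numpy as np

def lrp_linear(a, W, R_out, eps=1e-6):
    """Epsilon-rule LRP for one linear layer z = a @ W: redistribute the
    output relevance R_out to the inputs in proportion to a_i * W[i, j].
    (eps stabilizes the division; z must not be exactly zero.)"""
    z = a @ W                        # pre-activations, shape (out,)
    s = R_out / (z + eps * np.sign(z))
    return a * (W @ s)               # input relevance, shape (in,)
```

Applied layer by layer from the classifier output down to the input sequence, this yields a per-time-step relevance signal like the one visualized in the paper.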
Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.00136
Dana Oshri Zalman, S. Fine
Variational inference provides a way to approximate probability densities. It does so by optimizing an upper or a lower bound on the likelihood of the observed data (the evidence). The classic variational inference approach maximizes the Evidence Lower BOund (ELBO). Recent proposals optimize the variational Rényi bound (VR) and the χ upper bound. However, these estimates are either biased or difficult to approximate due to high variance. In this paper, we introduce a new upper bound (termed VRLU), which is based on the existing variational Rényi bound. In contrast to the existing VR bound, the Monte Carlo (MC) approximation of the VRLU bound is unbiased. Furthermore, we devise a (sandwiched) upper-lower bound variational inference method (termed VRS) to jointly optimize the upper and lower bounds. We present a set of experiments designed to evaluate the new VRLU bound and to compare the VRS method with the classic VAE and VR methods on a set of digit recognition tasks. The experiments and results demonstrate the advantage of the VRLU bound and the wide applicability of the VRS method.
Title: Variational Inference via Rényi Upper-Lower Bound Optimization.
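To make the bias issue concrete: the VR bound L_α = (1/(1−α)) log E_q[w^(1−α)], with w = p(x,z)/q(z|x), takes the log of a sample mean, so its MC estimate is biased. One way to obtain an unbiased upper estimate in the spirit of VRLU is to replace the log with the linear bound log(u) ≤ u − 1 (valid as an upper bound when α < 1; negative α makes L_α an upper bound on the evidence). The sketch below illustrates both estimators under those assumptions; it is not the paper's code.

```python
import numpy as np

def vr_bound_mc(log_w, alpha):
    """Biased MC estimate of the variational Renyi bound
    L_alpha = 1/(1-alpha) * log E_q[w^(1-alpha)],
    from samples log_w = log p(x,z) - log q(z|x)."""
    s = (1.0 - alpha) * np.asarray(log_w)
    m = s.max()                                   # log-mean-exp for stability
    return (m + np.log(np.mean(np.exp(s - m)))) / (1.0 - alpha)

def vrlu_style_bound_mc(log_w, alpha):
    """Unbiased MC estimate of the linearized upper bound obtained from the
    VR bound via log(u) <= u - 1 (assumes alpha < 1)."""
    assert alpha < 1.0
    w_pow = np.exp((1.0 - alpha) * np.asarray(log_w))
    return (np.mean(w_pow) - 1.0) / (1.0 - alpha)  # mean is inside, no log
```

Because the expectation appears without a surrounding log, the second estimator's sample mean is unbiased for the bound it targets, at the cost of sitting above the VR bound.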
Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.10102767
Samira Khorshidi, Bao Wang, G. Mohler
Temporal point processes have many applications, from crime forecasting to modeling earthquake aftershock sequences. Due to the flexibility and expressiveness of deep learning, neural network-based approaches have recently shown promise for modeling point process intensities. However, there is a lack of research on the robustness of such models with regard to adversarial attacks and natural shocks to systems. Specifically, while neural point processes may outperform simpler parametric models on in-sample tests, how these models perform when encountering adversarial examples or sharp non-stationary trends remains unknown. The current work proposes several white-box and black-box adversarial attacks against temporal point processes modeled by deep neural networks. Extensive experiments confirm that the predictive performance and parametric modeling of neural point processes are vulnerable to adversarial attacks. Additionally, we evaluate the vulnerability and performance of these models in the presence of non-stationary abrupt changes, using a crimes dataset during the Covid-19 pandemic as an example.
Title: Adversarial Attacks on Deep Temporal Point Process.
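For context, the classic parametric baseline that neural point processes are compared against is the Hawkes process, whose conditional intensity λ(t) = μ + α Σ_{t_i < t} exp(−β(t − t_i)) rises after each event and decays exponentially. A minimal univariate sketch (the parameter values are illustrative, not from the paper):

```python
import numpy as np

def hawkes_intensity(t, history, mu=0.1, alpha=0.5, beta=1.0):
    """Conditional intensity of a univariate exponential Hawkes process:
    baseline rate mu plus exponentially decaying excitation from each
    past event time in `history` that occurred strictly before t."""
    history = np.asarray(history, dtype=float)
    past = history[history < t]
    return mu + alpha * np.sum(np.exp(-beta * (t - past)))
```

An adversarial attack in this setting perturbs the event history so that a model's intensity (and hence its forecasts) degrades, which is what the proposed white-box and black-box attacks probe for neural intensities.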
Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.00046
Wen-Hao Chiang, G. Mohler
We propose a novel framework for integrating Hawkes processes with multi-armed bandit algorithms to solve spatio-temporal event forecasting and detection problems when data may be undersampled or spatially biased. In particular, we introduce an upper confidence bound algorithm using Bayesian spatial Hawkes process estimation for balancing the trade-off between exploiting geographic regions where data has been collected and exploring geographic regions where data is unobserved. We first validate our model using simulated data. We then apply it to the problem of disaster search and rescue using calls-for-service data from Hurricane Harvey in 2017, and to the problem of detection and clearance of improvised explosive devices (IEDs) using IED attack records in Iraq. Our model outperforms state-of-the-art baseline spatial MAB algorithms in terms of cumulative reward and several other ranking evaluation metrics.
Title: Hawkes Process Multi-armed Bandits for Search and Rescue.
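The explore/exploit trade-off described above follows the upper-confidence-bound pattern: each geographic cell is an arm scored by its mean observed reward plus a bonus that grows for rarely visited cells. A generic UCB1-style selector over discretized cells (a sketch only; the paper's variant scores arms with Bayesian spatial Hawkes process estimates rather than raw means):

```python
import numpy as np

def ucb_select(counts, rewards, t, c=2.0):
    """Pick the cell maximizing mean reward + exploration bonus.
    `counts` and `rewards` are per-cell visit counts and cumulative rewards;
    unvisited cells (count 0) are always tried first."""
    counts = np.asarray(counts, dtype=float)
    rewards = np.asarray(rewards, dtype=float)
    means = np.divide(rewards, counts, out=np.zeros_like(counts), where=counts > 0)
    bonus = np.sqrt(c * np.log(max(t, 1)) / np.maximum(counts, 1.0))
    scores = np.where(counts == 0, np.inf, means + bonus)
    return int(np.argmax(scores))
```

In the search-and-rescue setting, "reward" would be events found in a cell, so the bonus term drives teams toward regions where the event rate is still uncertain.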
Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.00069
Junya Saito, Sachihiro Youoku, Ryosuke Kawamura, A. Uchida, Kentaro Murase, Xiaoyue Mi
Facial action units (AUs) represent muscular activities, and their recognition from facial images can capture various psychological states, such as people’s interests as consumers and mental health states. However, degraded conditions, such as occlusion by a hand, often occur in the real world and affect the accuracy of AU recognition. Most existing studies on degraded conditions have used additional training images and advanced neural network structures to improve the robustness of AU recognition from a degraded facial image. However, such an approach cannot handle cases in which evidence of the AUs is completely or almost invisible. Therefore, we propose a novel method that addresses degraded conditions by predicting the recognition uncertainties they cause. Our method interpolates high-uncertainty data using surrounding data to reduce the influence of the degraded conditions, and visualizes the conditions causing the uncertainties in order to handle cases where conditions are very poor and need to be improved. In the evaluation experiments, the public datasets BP4D+ and DISFA were modified to degrade them for testing. On the modified test data, we demonstrated that the maximum improvement with our method was 12% for BP4D+ and 17% for DISFA, and that our method can prevent the decrease in accuracy caused by degraded conditions. We also present visualization examples demonstrating that our method can reasonably predict the conditions and uncertainties.
Title: Uncertainty Prediction for Facial Action Units Recognition under Degraded Conditions.
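The uncertainty-based interpolation step can be sketched as follows, assuming per-frame AU predictions and uncertainty scores over a video sequence; the threshold and the choice of linear interpolation are illustrative assumptions, not the authors' exact scheme.

```python
import numpy as np

def interpolate_uncertain(preds, uncert, thresh=0.5):
    """Replace predictions whose uncertainty exceeds `thresh` by linear
    interpolation between the nearest confident frames in the sequence.
    Assumes at least one frame is below the threshold."""
    preds = np.asarray(preds, dtype=float)
    good = np.asarray(uncert) <= thresh          # confident frames
    idx = np.arange(len(preds))
    return np.interp(idx, idx[good], preds[good])
```

A spurious prediction caused by, say, a brief hand occlusion is thus smoothed over using the surrounding confident frames instead of being trusted directly.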
Pub Date: 2022-12-01 | DOI: 10.1109/ICMLA55696.2022.00211
Adam Lehavi, S. Kim
In the realm of cybersecurity, intrusion detection systems (IDS) detect and prevent attacks based on collected computer and network data. In recent research, IDS models have been constructed using machine learning (ML) and deep learning (DL) methods such as Random Forest (RF) and deep neural networks (DNN). Feature selection (FS) can be used to construct faster, more interpretable, and more accurate models. We look at three different FS techniques: RF information gain (RF-IG), correlation feature selection using the Bat Algorithm (CFS-BA), and CFS using the Aquila Optimizer (CFS-AO). Our results show CFS-BA to be the most efficient of the FS methods, building in 55% of the time of the best RF-IG model while achieving 99.99% of its accuracy. This reinforces prior contributions attesting to CFS-BA’s accuracy while building upon the relationship between subset size, CFS score, and RF-IG score in the final results.
Title: Feature Reduction Method Comparison Towards Explainability and Efficiency in Cybersecurity Intrusion Detection Systems.
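For reference, the CFS criterion that the Bat Algorithm and Aquila Optimizer search over rewards subsets whose features correlate strongly with the class but weakly with each other. A minimal sketch of the standard merit score using Pearson correlations (the original CFS uses symmetrical uncertainty for discrete features; Pearson is a simplifying assumption here):

```python
import numpy as np

def cfs_merit(X, y, subset):
    """CFS merit of a feature subset:
    merit = k * mean|r_cf| / sqrt(k + k*(k-1) * mean|r_ff|),
    where r_cf are feature-class and r_ff feature-feature correlations."""
    k = len(subset)
    r_cf = np.mean([abs(np.corrcoef(X[:, j], y)[0, 1]) for j in subset])
    if k == 1:
        return r_cf
    r_ff = np.mean([abs(np.corrcoef(X[:, i], X[:, j])[0, 1])
                    for a, i in enumerate(subset) for j in subset[a + 1:]])
    return k * r_cf / np.sqrt(k + k * (k - 1) * r_ff)
```

A metaheuristic such as the Bat Algorithm then searches the space of subsets for the one maximizing this merit, which is far cheaper than retraining a classifier per candidate subset.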