首页 > 最新文献

2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)最新文献

英文 中文
Understading Image Restoration Convolutional Neural Networks with Network Inversion 用网络反演理解图像恢复卷积神经网络
Églen Protas, José Douglas Bratti, J. O. Gaya, Paulo L. J. Drews-Jr, S. Botelho
In recent years, Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance in many image restoration applications. The knowledge of how these models work, however, is still limited. While there have been many attempts at better understanding the inner working of CNNs, they have mostly been applied to classification networks. Because of this, most existing CNN visualization techniques may be inadequate to the study of image restoration architectures. In the paper, we present network inversion, a new method developed specifically to help in the understanding of image restoration Convolutional Neural Networks. We apply our method to underwater image restoration and dehazing CNNs, showing how it can help in the understanding and improvement of these models.
近年来,卷积神经网络(cnn)在许多图像恢复应用中取得了最先进的性能。然而,关于这些模型如何工作的知识仍然有限。虽然有很多人试图更好地理解cnn的内部工作原理,但它们大多被应用于分类网络。正因为如此,大多数现有的CNN可视化技术可能不足以研究图像恢复架构。在本文中,我们提出了网络反演,这是一种专门用于帮助理解卷积神经网络图像恢复的新方法。我们将我们的方法应用于水下图像恢复和去雾cnn,展示了它如何帮助理解和改进这些模型。
{"title":"Understading Image Restoration Convolutional Neural Networks with Network Inversion","authors":"Églen Protas, José Douglas Bratti, J. O. Gaya, Paulo L. J. Drews-Jr, S. Botelho","doi":"10.1109/ICMLA.2017.0-156","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-156","url":null,"abstract":"In recent years, Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance in many image restoration applications. The knowledge of how these models work, however, is still limited. While there have been many attempts at better understanding the inner working of CNNs, they have mostly been applied to classification networks. Because of this, most existing CNN visualization techniques may be inadequate to the study of image restoration architectures. In the paper, we present network inversion, a new method developed specifically to help in the understanding of image restoration Convolutional Neural Networks. We apply our method to underwater image restoration and dehazing CNNs, showing how it can help in the understanding and improvement of these models.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"92 1","pages":"215-220"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77499551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A Review of Deep Learning Methods Applied on Load Forecasting 深度学习方法在负荷预测中的应用综述
Abdulaziz Almalaq, G. Edwards
The utility industry has invested widely in smart grid (SG) over the past decade. They considered it the future electrical grid while the information and electricity are delivered in two-way flow. SG has many Artificial Intelligence (AI) applications such as Artificial Neural Network (ANN), Machine Learning (ML) and Deep Learning (DL). Recently, DL has been a hot topic for AI applications in many fields such as time series load forecasting. This paper introduces the common algorithms of DL in the literature applied to load forecasting problems in the SG and power systems. The intention of this survey is to explore the different applications of DL that are used in the power systems and smart grid load forecasting. In addition, it compares the accuracy results RMSE and MAE for the reviewed applications and shows the use of convolutional neural network CNN with k-means algorithm had a great percentage of reduction in terms of RMSE.
在过去的十年里,公用事业行业对智能电网进行了广泛的投资。他们认为这是未来的电网,信息和电力以双向流动的方式传递。SG有许多人工智能(AI)应用,如人工神经网络(ANN)、机器学习(ML)和深度学习(DL)。近年来,深度学习已成为人工智能在时间序列负荷预测等领域应用的热点。本文介绍了文献中常用的深度学习算法在SG和电力系统负荷预测中的应用。本调查的目的是探讨深度学习在电力系统和智能电网负荷预测中的不同应用。此外,比较了所审查应用的RMSE和MAE的准确性结果,并表明使用卷积神经网络CNN与k-means算法在RMSE方面有很大的百分比降低。
{"title":"A Review of Deep Learning Methods Applied on Load Forecasting","authors":"Abdulaziz Almalaq, G. Edwards","doi":"10.1109/ICMLA.2017.0-110","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-110","url":null,"abstract":"The utility industry has invested widely in smart grid (SG) over the past decade. They considered it the future electrical grid while the information and electricity are delivered in two-way flow. SG has many Artificial Intelligence (AI) applications such as Artificial Neural Network (ANN), Machine Learning (ML) and Deep Learning (DL). Recently, DL has been a hot topic for AI applications in many fields such as time series load forecasting. This paper introduces the common algorithms of DL in the literature applied to load forecasting problems in the SG and power systems. The intention of this survey is to explore the different applications of DL that are used in the power systems and smart grid load forecasting. In addition, it compares the accuracy results RMSE and MAE for the reviewed applications and shows the use of convolutional neural network CNN with k-means algorithm had a great percentage of reduction in terms of RMSE.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"19 1","pages":"511-516"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84510595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 151
On the Impacts of Noise from Group-Based Label Collection for Visual Classification 基于组的标签收集噪声对视觉分类的影响
Maggie B. Wigness, Steven Gutstein
State of the art visual classification continues to improve, particularly with the use of deep learning and millions of labeled images. However, the effort required to label training sets of this size has led to semi-supervised approaches that collect partially noisy labeled data with less effort. Label noise has been shown to degrade supervised learning, but these analyses focus on noise from erroneous label assignment of data instances. Group-based labeling reduces workload by assigning a single label to a group of images simultaneously, which introduces label noise with structure dependent on all training instances. This work investigates the impact of group-based label noise on classifier learning, and discusses how and why this differs from instance-based label noise. We also discuss label noise modeling designed to provide more robust classification given noisy training instances, and evaluate the generalization of these techniques to group-based noise.
视觉分类的技术水平在不断提高,特别是随着深度学习和数百万标记图像的使用。然而,标记这种规模的训练集所需的努力导致了半监督方法,这种方法以较少的努力收集部分有噪声的标记数据。标签噪声已被证明会降低监督学习,但这些分析主要集中在数据实例错误标签分配的噪声上。基于组的标记通过同时为一组图像分配单个标签来减少工作量,该方法引入了结构依赖于所有训练实例的标签噪声。这项工作调查了基于组的标签噪声对分类器学习的影响,并讨论了它与基于实例的标签噪声的不同之处。我们还讨论了标签噪声建模,旨在为给定的噪声训练实例提供更稳健的分类,并评估了这些技术在基于组的噪声中的泛化性。
{"title":"On the Impacts of Noise from Group-Based Label Collection for Visual Classification","authors":"Maggie B. Wigness, Steven Gutstein","doi":"10.1109/ICMLA.2017.0-174","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-174","url":null,"abstract":"State of the art visual classification continues to improve, particularly with the use of deep learning and millions of labeled images. However, the effort required to label training sets of this size has led to semi-supervised approaches that collect partially noisy labeled data with less effort. Label noise has been shown to degrade supervised learning, but these analyses focus on noise from erroneous label assignment of data instances. Group-based labeling reduces workload by assigning a single label to a group of images simultaneously, which introduces label noise with structure dependent on all training instances. This work investigates the impact of group-based label noise on classifier learning, and discusses how and why this differs from instance-based label noise. We also discuss label noise modeling designed to provide more robust classification given noisy training instances, and evaluate the generalization of these techniques to group-based noise.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"16 1","pages":"84-91"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76611206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Thyme: Improving Smartphone Prompt Timing Through Activity Awareness 百里香:通过活动感知改善智能手机提示时间
S. Aminikhanghahi, Ramin Fallahzadeh, M. Sawyer, D. Cook, L. Holder
Smartphone prompts and notifications are popular because they provide users with timely and important information. However, they can also be an annoyance if they pop up at inopportune times and interrupt important tasks. In this paper, we introduce Thyme, an intelligent notification front end that uses activity recognition and machine learning to identify the best times to prompt smartphone users. We evaluate the performance of an activity-aware prompting approach based on 47 participants with fixed time and Thyme-based prompts. Our results show that responsiveness improves from 12.8% to 93.2% using this intelligent approach to the timing of smartphone-based prompts.
智能手机提示和通知很受欢迎,因为它们为用户提供了及时而重要的信息。然而,如果它们在不合适的时间出现并打断重要的任务,它们也会令人烦恼。在本文中,我们介绍了Thyme,这是一个智能通知前端,它使用活动识别和机器学习来识别提示智能手机用户的最佳时间。我们评估了基于47个参与者的活动感知提示方法的性能,这些参与者具有固定的时间和基于百里香的提示。我们的研究结果表明,使用这种智能方法对基于智能手机的提示进行计时,响应性从12.8%提高到93.2%。
{"title":"Thyme: Improving Smartphone Prompt Timing Through Activity Awareness","authors":"S. Aminikhanghahi, Ramin Fallahzadeh, M. Sawyer, D. Cook, L. Holder","doi":"10.1109/ICMLA.2017.0-141","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-141","url":null,"abstract":"Smartphone prompts and notifications are popular because they provide users with timely and important information. However, they can also be an annoyance if they pop up at inopportune times and interrupt important tasks. In this paper, we introduce Thyme, an intelligent notification front end that uses activity recognition and machine learning to identify the best times to prompt smartphone users. We evaluate the performance of an activity-aware prompting approach based on 47 participants with fixed time and Thyme-based prompts. Our results show that responsiveness improves from 12.8% to 93.2% using this intelligent approach to the timing of smartphone-based prompts.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"87 1","pages":"315-322"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78262409","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A Mask-Based Post Processing Approach for Improving the Quality and Intelligibility of Deep Neural Network Enhanced Speech 一种基于掩码的提高深度神经网络增强语音质量和可理解性的后处理方法
B. O. Odelowo, David V. Anderson
In this paper, we propose a method for post-processing of deep neural network (DNN) enhanced speech. The method, which is simple and does not require additional training or expansion of the feature or target vectors, can be viewed as a mask-based approach in which a noisy speech signal is processed by a time-frequency (T-F) weighting derived from the noise-free spectral estimate of a DNN. A series of experiments and statistical analyses of results are carried out to compare the performance of the proposed approach to a baseline DNN enhancement system that features no post processing. Objective tests show that the proposed approach always improves both speech quality and intelligibility, and it outperforms a corresponding baseline system in both matched and mismatched noise conditions. Analysis of the enhanced speech shows that post processing reduces severe amplification distortions in the magnitude spectrum of the enhanced speech at the cost of a slight increase in severe attenuation distortions.
本文提出了一种深度神经网络增强语音的后处理方法。该方法简单,不需要额外的训练或扩展特征或目标向量,可以视为一种基于掩模的方法,其中含噪语音信号通过DNN的无噪声谱估计得出的时频(T-F)加权来处理。进行了一系列实验和结果的统计分析,以比较所提出的方法与无后处理的基线深度神经网络增强系统的性能。客观测试表明,该方法在噪声匹配和不匹配条件下都能提高语音质量和可理解性,并且优于相应的基线系统。对增强语音的分析表明,后处理减少了增强语音幅度谱中的严重放大失真,代价是严重衰减失真略有增加。
{"title":"A Mask-Based Post Processing Approach for Improving the Quality and Intelligibility of Deep Neural Network Enhanced Speech","authors":"B. O. Odelowo, David V. Anderson","doi":"10.1109/ICMLA.2017.00014","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00014","url":null,"abstract":"In this paper, we propose a method for post-processing of deep neural network (DNN) enhanced speech. The method, which is simple and does not require additional training or expansion of the feature or target vectors, can be viewed as a mask-based approach in which a noisy speech signal is processed by a time-frequency (T-F) weighting derived from the noise-free spectral estimate of a DNN. A series of experiments and statistical analyses of results are carried out to compare the performance of the proposed approach to a baseline DNN enhancement system that features no post processing. Objective tests show that the proposed approach always improves both speech quality and intelligibility, and it outperforms a corresponding baseline system in both matched and mismatched noise conditions. Analysis of the enhanced speech shows that post processing reduces severe amplification distortions in the magnitude spectrum of the enhanced speech at the cost of a slight increase in severe attenuation distortions.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"35 1","pages":"1134-1138"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81362497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
DeepPositioning: Intelligent Fusion of Pervasive Magnetic Field and WiFi Fingerprinting for Smartphone Indoor Localization via Deep Learning 深度定位:通过深度学习将普适磁场与WiFi指纹智能融合,实现智能手机室内定位
Wei Zhang, Rahul Sengupta, John Fodero, Xiaolin Li
Since WiFi has been pervasively available indoor, most smartphone indoor localization systems are based on WiFi fingerprinting although they only give coarse-grained location estimation. In this paper, we propose a novel deep learning-based indoor fingerprinting system (called DeepPositioning), combining Received Signal Strength Indicator (RSSI) of WiFi and pervasive magnetic field to obtain richer fingerprinting. DeepPositioning includes an offline learning phase and an online serving phase. In the offline learning phase, deep learning is utilized to automatically extract rich intrinsic features from a large number of multi-class fingerprints collected using mobile phones. Experimental results demonstrate that deep learning models with the intelligent fusion of pervasive WiFi and magnetic field data can effectively improve smartphone indoor localization compared to existing approaches based on WiFi only.
由于WiFi在室内的普及,大多数智能手机的室内定位系统都是基于WiFi指纹的,尽管它们只能给出粗粒度的位置估计。本文提出了一种新的基于深度学习的室内指纹识别系统(称为DeepPositioning),将WiFi的接收信号强度指标(RSSI)与普射磁场相结合,以获得更丰富的指纹识别。深度定位包括离线学习阶段和在线服务阶段。在离线学习阶段,利用深度学习从手机采集的大量多类指纹中自动提取丰富的内在特征。实验结果表明,与仅基于WiFi的现有方法相比,将无处不在的WiFi和磁场数据智能融合的深度学习模型可以有效地提高智能手机室内定位。
{"title":"DeepPositioning: Intelligent Fusion of Pervasive Magnetic Field and WiFi Fingerprinting for Smartphone Indoor Localization via Deep Learning","authors":"Wei Zhang, Rahul Sengupta, John Fodero, Xiaolin Li","doi":"10.1109/ICMLA.2017.0-185","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-185","url":null,"abstract":"Since WiFi has been pervasively available indoor, most smartphone indoor localization systems are based on WiFi fingerprinting although they only give coarse-grained location estimation. In this paper, we propose a novel deep learning-based indoor fingerprinting system (called DeepPositioning), combining Received Signal Strength Indicator (RSSI) of WiFi and pervasive magnetic field to obtain richer fingerprinting. DeepPositioning includes an offline learning phase and an online serving phase. In the offline learning phase, deep learning is utilized to automatically extract rich intrinsic features from a large number of multi-class fingerprints collected using mobile phones. Experimental results demonstrate that deep learning models with the intelligent fusion of pervasive WiFi and magnetic field data can effectively improve smartphone indoor localization compared to existing approaches based on WiFi only.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"134 1","pages":"7-13"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81652312","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Automated Patent Classification Using Word Embedding 使用词嵌入的自动专利分类
Mattyws F. Grawe, C. A. Martins, Andreia Gentil Bonfante
Patent classification is the task of assign a special code to a patent, where the assigned code is used to group patents with similar subject into a same category. This paper presents a patent categorization method based on word embedding and long short term memory network to classify patents down to the subgroup IPC level. The experimental results indicate that our classification method achieve 63% accuracy at the subgroup level.
专利分类是为专利分配特殊代码的任务,分配的代码用于将具有相似主题的专利分组到同一类别中。本文提出了一种基于词嵌入和长短期记忆网络的专利分类方法,将专利分类精确到子组IPC级别。实验结果表明,我们的分类方法在子组水平上达到了63%的准确率。
{"title":"Automated Patent Classification Using Word Embedding","authors":"Mattyws F. Grawe, C. A. Martins, Andreia Gentil Bonfante","doi":"10.1109/ICMLA.2017.0-127","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-127","url":null,"abstract":"Patent classification is the task of assign a special code to a patent, where the assigned code is used to group patents with similar subject into a same category. This paper presents a patent categorization method based on word embedding and long short term memory network to classify patents down to the subgroup IPC level. The experimental results indicate that our classification method achieve 63% accuracy at the subgroup level.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"29 1","pages":"408-411"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87142957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 42
Prediction of Power Grid Failure Using Neural Network Learning 基于神经网络学习的电网故障预测
Carmen Haseltine, E. Eman
Power Grid failures have the potential to drastically affect the population be it a localized outage or a large-scale blackout. Pre-event planning currently consists of preparation for all scenarios and some enthusiastic prognoses, leading to most resources spreading thin. Focus on a specific area of concern typically follows large scale power grid failures as post event analysis and does not include an overall analysis. In this study, a neural network is used to conduct “pre-event” analysis of a power grid to determine if it is susceptible to failure. This research study demonstrates that overall “pre-event” analysis can be beneficial with the use of a machine learning agent. The agent can also be used to determine areas that need the most attention. Future work with larger number of constraints and additional machine learning algorithms will be explored to further improve power grid analysis and performance.
无论是局部停电还是大规模停电,电网故障都有可能对人口造成巨大影响。目前的事前规划包括对所有情况的准备和一些热情的预测,导致大多数资源分散。关注某一特定领域通常是大规模电网故障后的事后分析,而不包括整体分析。在本研究中,使用神经网络对电网进行“事前”分析,以确定电网是否容易发生故障。这项研究表明,使用机器学习代理,整体的“事前”分析是有益的。代理还可以用来确定最需要关注的领域。未来将探索更多约束和额外机器学习算法的工作,以进一步改善电网分析和性能。
{"title":"Prediction of Power Grid Failure Using Neural Network Learning","authors":"Carmen Haseltine, E. Eman","doi":"10.1109/ICMLA.2017.0-111","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.0-111","url":null,"abstract":"Power Grid failures have the potential to drastically affect the population be it a localized outage or a large-scale blackout. Pre-event planning currently consists of preparation for all scenarios and some enthusiastic prognoses, leading to most resources spreading thin. Focus on a specific area of concern typically follows large scale power grid failures as post event analysis and does not include an overall analysis. In this study, a neural network is used to conduct “pre-event” analysis of a power grid to determine if it is susceptible to failure. This research study demonstrates that overall “pre-event” analysis can be beneficial with the use of a machine learning agent. The agent can also be used to determine areas that need the most attention. Future work with larger number of constraints and additional machine learning algorithms will be explored to further improve power grid analysis and performance.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"22 9 1","pages":"505-510"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88253900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Evaluation of Microgesture Recognition Using a Smartwatch 基于智能手表的微手势识别评估
Sonu Agarwal, Sanjay Ghosh
Gesture based interaction and its recognition has been an area of active research with the growing popularity of wearables. We here propose an approach to detect fine-grained finger and palm motions using inertial sensors in a commercial smartwatch. A user specific SVM based classifier is developed for 7 microgestures with a classification accuracy of 94.4%. We extend this to a user adaptive model by including a few representative instances of a new user and achieve a classification accuracy of 91.7%. Further, we are able to differentiate between variations of a microgesture using three fundamental building blocks - distance, speed and orientation. A novel regression based approach is presented to predict the distance parameter. The idea is demonstrated on a swipe gesture with an error of 14%.
随着可穿戴设备的日益普及,基于手势的交互及其识别已经成为一个活跃的研究领域。我们在这里提出了一种方法来检测细粒度的手指和手掌运动使用惯性传感器在商业智能手表。针对7种微手势,开发了基于用户支持向量机的分类器,分类准确率达到94.4%。我们将其扩展到用户自适应模型,包括一些新用户的代表性实例,并实现了91.7%的分类精度。此外,我们能够通过三个基本的构建块来区分微手势的变化——距离、速度和方向。提出了一种新的基于回归的距离参数预测方法。这个想法在滑动手势上得到了验证,误差为14%。
{"title":"Evaluation of Microgesture Recognition Using a Smartwatch","authors":"Sonu Agarwal, Sanjay Ghosh","doi":"10.1109/ICMLA.2017.00-24","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-24","url":null,"abstract":"Gesture based interaction and its recognition has been an area of active research with the growing popularity of wearables. We here propose an approach to detect fine-grained finger and palm motions using inertial sensors in a commercial smartwatch. A user specific SVM based classifier is developed for 7 microgestures with a classification accuracy of 94.4%. We extend this to a user adaptive model by including a few representative instances of a new user and achieve a classification accuracy of 91.7%. Further, we are able to differentiate between variations of a microgesture using three fundamental building blocks - distance, speed and orientation. A novel regression based approach is presented to predict the distance parameter. The idea is demonstrated on a swipe gesture with an error of 14%.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"2012 1","pages":"986-991"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82605410","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Deep Uncertainty Interpretation in Dyadic Human Activity Prediction 二元人类活动预测中的深度不确定性解释
M. Ziaeefard, R. Bergevin, Jean-François Lalonde
We propose a deep learning framework to analyse the uncertainty associated with dyadic human activities at a small temporal granularity. Such time-slice analysis is able to infer human behaviours from short-term observations. Instead of classifying time-slices into k classes of activities, we report to what degree of certainty each activity is occurring from definitely not occurring to definitely occurring. To this end, we extract CNN-based unary probabilities and pairwise relations between body joints. The unary term gives cues on the local appearance while the pairwise term captures the contextual relations between the parts. We extract the features from each frame in a timeslice and examine different temporal aggregation schemes to generate a descriptor for the whole time-slice. Evaluations on the TAP dataset which is well-suited for time-slice activity analysis demonstrate the effectiveness of our approach for the task of uncertainty analysis in activity prediction.
我们提出了一个深度学习框架来分析与小时间粒度的二元人类活动相关的不确定性。这种时间片分析能够从短期观察中推断人类的行为。我们不是将时间片划分为k类活动,而是报告每个活动发生的确定性程度,从确定不发生到确定发生。为此,我们提取了基于cnn的一元概率和人体关节之间的成对关系。一元术语提供局部外观的线索,而成对术语捕获部分之间的上下文关系。我们从时间片的每一帧中提取特征,并研究不同的时间聚合方案来生成整个时间片的描述符。对适合于时间片活度分析的TAP数据集的评估表明,我们的方法对于活度预测中的不确定性分析任务是有效的。
{"title":"Deep Uncertainty Interpretation in Dyadic Human Activity Prediction","authors":"M. Ziaeefard, R. Bergevin, Jean-François Lalonde","doi":"10.1109/ICMLA.2017.00-55","DOIUrl":"https://doi.org/10.1109/ICMLA.2017.00-55","url":null,"abstract":"We propose a deep learning framework to analyse the uncertainty associated with dyadic human activities at a small temporal granularity. Such time-slice analysis is able to infer human behaviours from short-term observations. Instead of classifying time-slices into k classes of activities, we report to what degree of certainty each activity is occurring from definitely not occurring to definitely occurring. To this end, we extract CNN-based unary probabilities and pairwise relations between body joints. The unary term gives cues on the local appearance while the pairwise term captures the contextual relations between the parts. We extract the features from each frame in a timeslice and examine different temporal aggregation schemes to generate a descriptor for the whole time-slice. Evaluations on the TAP dataset which is well-suited for time-slice activity analysis demonstrate the effectiveness of our approach for the task of uncertainty analysis in activity prediction.","PeriodicalId":6636,"journal":{"name":"2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)","volume":"37 1","pages":"822-825"},"PeriodicalIF":0.0,"publicationDate":"2017-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82764983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1