Deep-Learning-Assisted Single-Pixel Imaging for Gesture Recognition in Consideration of Privacy

Naoya Mukojima, Masaki Yasugi, Y. Mizutani, T. Yasui, Hirotsugu Yamamoto
{"title":"Deep-Learning-Assisted Single-Pixel Imaging for Gesture Recognition in Consideration of Privacy","authors":"Naoya Mukojima, Masaki Yasugi, Y. Mizutani, T. Yasui, Hirotsugu Yamamoto","doi":"10.1587/transele.2021dii0002","DOIUrl":null,"url":null,"abstract":"SUMMARY We have utilized single-pixel imaging and deep-learning to solve the privacy-preserving problem in gesture recognition for interactive display. Silhouette images of hand gestures were acquired by use of a display panel as an illumination. Reconstructions of gesture images have been performed by numerical experiments on single-pixel imaging by changing the number of illumination mask patterns. For the training and the image restoration with deep learning, we prepared reconstructed data with 250 and 500 illuminations as datasets. For each of the 250 and 500 illuminations, we prepared 9000 datasets in which original images and reconstructed data were paired. Of these data, 8500 data were used for training a neural network (6800 data for training and 1700 data for validation), and 500 data were used to evaluate the accuracy of image restoration. Our neural network, based on U-net, was able to restore images close to the original images even from reconstructed data with greatly reduced number of illuminations, which is 1/40 of the single-pixel imaging without deep learning. Compared restoration accuracy between cases using shadowgraph (black on white background) and negative-positive reversed images (white on black background) as silhouette image, the accuracy of the restored image was lower for negative-positive-reversed images when the number of illuminations was small. Moreover, we found that the restoration accuracy decreased in the order of rock, scissor, and paper. Shadowgraph is suitable for gesture silhouette, and it is necessary to prepare training data and construct neural networks, to avoid the restoration accuracy between gestures when further reducing the number of illuminations.","PeriodicalId":13259,"journal":{"name":"IEICE Trans. Electron.","volume":"735 1","pages":"79-85"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEICE Trans. Electron.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1587/transele.2021dii0002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

SUMMARY We have utilized single-pixel imaging and deep-learning to solve the privacy-preserving problem in gesture recognition for interactive display. Silhouette images of hand gestures were acquired by use of a display panel as an illumination. Reconstructions of gesture images have been performed by numerical experiments on single-pixel imaging by changing the number of illumination mask patterns. For the training and the image restoration with deep learning, we prepared reconstructed data with 250 and 500 illuminations as datasets. For each of the 250 and 500 illuminations, we prepared 9000 datasets in which original images and reconstructed data were paired. Of these data, 8500 data were used for training a neural network (6800 data for training and 1700 data for validation), and 500 data were used to evaluate the accuracy of image restoration. Our neural network, based on U-net, was able to restore images close to the original images even from reconstructed data with greatly reduced number of illuminations, which is 1/40 of the single-pixel imaging without deep learning. Compared restoration accuracy between cases using shadowgraph (black on white background) and negative-positive reversed images (white on black background) as silhouette image, the accuracy of the restored image was lower for negative-positive-reversed images when the number of illuminations was small. Moreover, we found that the restoration accuracy decreased in the order of rock, scissor, and paper. Shadowgraph is suitable for gesture silhouette, and it is necessary to prepare training data and construct neural networks, to avoid the restoration accuracy between gestures when further reducing the number of illuminations.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
考虑隐私的深度学习辅助单像素手势识别
我们利用单像素成像和深度学习来解决交互式显示手势识别中的隐私保护问题。手势的轮廓图像是通过使用显示面板作为照明来获得的。通过改变照明掩模模式的数目,在单像素成像上进行了手势图像的重建实验。为了训练和深度学习图像恢复,我们准备了250和500照度的重建数据作为数据集。对于250和500照明中的每一个,我们准备了9000个数据集,其中原始图像和重建数据配对。其中8500个数据用于训练神经网络(6800个数据用于训练,1700个数据用于验证),500个数据用于评估图像恢复的准确性。我们基于U-net的神经网络,即使在光照数量大大减少的情况下,也能从重建的数据中恢复接近原始图像的图像,这是没有深度学习的单像素成像的1/40。对比阴影图(白色背景上的黑色)和正负反转图(黑色背景上的白色)作为剪影图像的恢复精度,光照数较少时正负反转图像的恢复精度较低。此外,我们发现恢复精度依次为岩石、剪刀和纸。阴影图适合于手势剪影,需要准备训练数据和构建神经网络,以避免在进一步减少光照数量时手势之间的恢复精度降低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Vapor Deposition of Fluoropolymer Thin Films for Antireflection Coating Fundamental Study on Grasping Growth State of Paddy Rice Using Quad-Polarimetric SAR Data Millimeter-Wave Single-Pixel Imaging Using Electrically-Switchable Liquid-Crystal Mask Approaches to High Performance Terahertz-Waves Emitting Devices Utilizing Single Crystals of High Temperature Superconductor Bi2Sr2CaCu2O8+δ Design and Analysis of Si/CaF2 Near-Infrared (λ∼1.7µm) DFB Quantum Cascade Laser for Silicon Photonics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1