首页 > 最新文献

Journal of Information Hiding and Multimedia Signal Processing最新文献

英文 中文
Improving the Accuracy of Under-Fog Driving Assistance System 提高雾下驾驶辅助系统的准确性
Q3 Computer Science Pub Date : 2020-07-28 DOI: 10.4236/jsip.2020.112002
B. Kerim
Driving in fog condition is dangerous. Fog causes poor visibility on roads leading to road traffic accident (RTA). RTA in Albaha is common because of its rough terrain, in addition to the climate that is mainly rainy and foggy. The rain season in Albaha region begins in October to February characterized by rainfall and fog. Many studies have reported the adverse effects of the rain on RTA which results in an increased rate of crashes. On the other hand, Albaha region is not supported by a proper intelligent transportation system and infrastructure. Thus, a Driver Assistance System (DAS) that requires minimum infrastructure is needed. A DAS under fog called No_Collision has been developed by a researcher in Albaha University. This paper discusses an implementation of adaptive Kalman Filter by utilizing Fuzzy logic system with the aim to improve the accuracy of position and velocity prediction of the No_Collision system. The experiment results show a promising adaptive system that reduces the error percentage of the prediction up to 56.58%.
在大雾天气开车很危险。雾造成道路能见度低,导致道路交通意外。RTA在Albaha很常见,因为它的地形崎岖,除了气候主要是多雨和多雾。Albaha地区的雨季开始于10月至2月,以降雨和雾为特征。许多研究报告了降雨对RTA的不利影响,导致撞车率增加。另一方面,Albaha地区没有适当的智能交通系统和基础设施的支持。因此,需要一个对基础设施要求最低的驾驶辅助系统(DAS)。阿尔巴哈大学的一名研究人员开发了一种名为“No_Collision”的雾下DAS。本文讨论了利用模糊逻辑系统实现自适应卡尔曼滤波,以提高无碰撞系统的位置和速度预测精度。实验结果表明,该自适应系统可将预测错误率降低到56.58%。
{"title":"Improving the Accuracy of Under-Fog Driving Assistance System","authors":"B. Kerim","doi":"10.4236/jsip.2020.112002","DOIUrl":"https://doi.org/10.4236/jsip.2020.112002","url":null,"abstract":"Driving in fog condition is dangerous. Fog \u0000causes poor visibility on roads leading to road traffic accident (RTA). RTA in \u0000Albaha is common because of its rough terrain, in addition to the climate that \u0000is mainly rainy and foggy. The rain season in Albaha region begins in October \u0000to February characterized by rainfall and fog. Many studies have reported the \u0000adverse effects of the rain on RTA which results in an increased rate of \u0000crashes. On the other hand, Albaha region is not supported by a proper \u0000intelligent transportation system and infrastructure. Thus, a Driver Assistance \u0000System (DAS) that requires minimum infrastructure is needed. A DAS under fog \u0000called No_Collision has been developed by a researcher in Albaha University. \u0000This paper discusses an implementation of adaptive Kalman Filter by utilizing \u0000Fuzzy logic system with the aim to improve the accuracy of position and \u0000velocity prediction of the No_Collision system. The experiment results show a \u0000promising adaptive system that reduces the error percentage of the prediction \u0000up to 56.58%.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"21 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82620064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of 3D Projection Profilometry in the High Speed Impaction Surface Deformation Measurement Research 三维投影轮廓法在高速碰撞面变形测量中的应用研究
Q3 Computer Science Pub Date : 2020-01-01 DOI: 10.4236/jsip.2020.114006
Eryi Hu
In order to study the strength of the composite material plate problems, need to adopt a nondestructive testing method to obtain the specimen surface under the effect of high-speed impact regularity of shape. The projection profilometry was used to measure the surface profile or the full field deformation. Furtherly, by using the Fourier transform algorithm, there is only one frame of captured image which is needed in the measurement, so that it can be introduced into the high speed impaction procedure measurement. An experimental system, which was contained with an impact setup and the projection profilometry measurement part, was constructed for the impaction action characteristic research. The metallic impact object can be launched by a gas gun or a spin fan, respectively. The detected object is manufactured by composite materials. In order to increase the surface deformation measurement accuracy, the calibration method and the error was discussed with different calibration specimen. And then, the proposed profilometry measurement method is proved by the gas gun and spin fan projectile test. The surface deformation of the manufactured composite plates and fan case are measured in the impaction procedure. So that the impact action details can be described much more clearly than the traditional video monitoring method.
为了研究复合材料板的强度问题,需要采用无损检测的方法来获取试样表面在高速冲击作用下的规律形状。投影轮廓术用于测量表面轮廓或全场变形。此外,利用傅里叶变换算法,在测量中只需要捕获一帧图像,从而可以将其引入到高速碰撞过程测量中。构建了一个包含冲击装置和投影轮廓测量部分的冲击作用特性实验系统。金属撞击物体可以分别由气枪或旋转风扇发射。被检测物体由复合材料制造。为了提高表面变形测量精度,讨论了不同标定试样的标定方法和标定误差。然后,通过气枪和自旋扇弹丸试验验证了所提出的轮廓测量方法。在冲击过程中测量了所制造的复合材料板和风机壳的表面变形。与传统的视频监控方法相比,可以更清晰地描述撞击动作的细节。
{"title":"Application of 3D Projection Profilometry in the High Speed Impaction Surface Deformation Measurement Research","authors":"Eryi Hu","doi":"10.4236/jsip.2020.114006","DOIUrl":"https://doi.org/10.4236/jsip.2020.114006","url":null,"abstract":"In order to study the strength of the composite material plate problems, need to adopt a nondestructive testing method to obtain the specimen surface under the effect of high-speed impact regularity of shape. The projection profilometry was used to measure the surface profile or the full field deformation. Furtherly, by using the Fourier transform algorithm, there is only one frame of captured image which is needed in the measurement, so that it can be introduced into the high speed impaction procedure measurement. An experimental system, which was contained with an impact setup and the projection profilometry measurement part, was constructed for the impaction action characteristic research. The metallic impact object can be launched by a gas gun or a spin fan, respectively. The detected object is manufactured by composite materials. In order to increase the surface deformation measurement accuracy, the calibration method and the error was discussed with different calibration specimen. And then, the proposed profilometry measurement method is proved by the gas gun and spin fan projectile test. The surface deformation of the manufactured composite plates and fan case are measured in the impaction procedure. So that the impact action details can be described much more clearly than the traditional video monitoring method.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"27 1","pages":"103-115"},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80808637","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha 基于多语言深度神经网络建模方法的低资源语言自动语音识别系统研究,查哈
Q3 Computer Science Pub Date : 2020-01-01 DOI: 10.4236/jsip.2020.111001
Tessfu Geteye Fantaye, Junqing Yu, Tulu Tilahun Hailu
Automatic speech recognition (ASR) is vital for very low-resource languages for mitigating the extinction trouble. Chaha is one of the low-resource languages, which suffers from the problem of resource insufficiency and some of its phonological, morphological, and orthographic features challenge the development and initiatives in the area of ASR. By considering these challenges, this study is the first endeavor, which analyzed the characteristics of the language, prepared speech corpus, and developed different ASR systems. A small 3-hour read speech corpus was prepared and transcribed. Different basic and rounded phone unit-based speech recognizers were explored using multilingual deep neural network (DNN) modeling methods. The experimental results demonstrated that all the basic phone and rounded phone unit-based multilingual models outperformed the corresponding unilingual models with the relative performance improvements of 5.47% to 19.87% and 5.74% to 16.77%, respectively. The rounded phone unit-based multilingual models outperformed the equivalent basic phone unit-based models with relative performance improvements of 0.95% to 4.98%. Overall, we discovered that multilingual DNN modeling methods are profoundly effective to develop Chaha speech recognizers. Both the basic and rounded phone acoustic units are convenient to build Chaha ASR system. However, the rounded phone unit-based models are superior in performance and faster in recognition speed over the corresponding basic phone unit-based models. Hence, the rounded phone units are the most suitable acoustic units to develop Chaha ASR systems.
语音自动识别对于资源非常少的语言来说,是一种非常重要的技术。查哈语是一种资源不足的语言,其语音、词形和正字法的一些特点对ASR领域的发展和倡议提出了挑战。考虑到这些挑战,本研究是第一次尝试,分析了语言的特点,准备了语音语料库,并开发了不同的ASR系统。准备并转录了一个小的3小时阅读语音语料库。采用多语言深度神经网络(DNN)建模方法,探索了基于电话单元的基本和圆形语音识别方法。实验结果表明,基于基本电话和圆形电话单元的多语言模型均优于相应的单语言模型,相对性能分别提高了5.47% ~ 19.87%和5.74% ~ 16.77%。基于圆形电话单元的多语言模型表现优于等效的基于基本电话单元的模型,相对性能提高了0.95%至4.98%。总的来说,我们发现多语言DNN建模方法对开发Chaha语音识别器非常有效。基本型和圆形电话声学单元都便于构建查哈ASR系统。然而,基于圆形手机单元的模型在性能上更优越,识别速度也比相应的基于基本手机单元的模型更快。因此,圆形电话单元是最适合开发查哈ASR系统的声学单元。
{"title":"Investigation of Automatic Speech Recognition Systems via the Multilingual Deep Neural Network Modeling Methods for a Very Low-Resource Language, Chaha","authors":"Tessfu Geteye Fantaye, Junqing Yu, Tulu Tilahun Hailu","doi":"10.4236/jsip.2020.111001","DOIUrl":"https://doi.org/10.4236/jsip.2020.111001","url":null,"abstract":"Automatic speech recognition (ASR) is vital for very low-resource languages for mitigating the extinction trouble. Chaha is one of the low-resource languages, which suffers from the problem of resource insufficiency and some of its phonological, morphological, and orthographic features challenge the development and initiatives in the area of ASR. By considering these challenges, this study is the first endeavor, which analyzed the characteristics of the language, prepared speech corpus, and developed different ASR systems. A small 3-hour read speech corpus was prepared and transcribed. Different basic and rounded phone unit-based speech recognizers were explored using multilingual deep neural network (DNN) modeling methods. The experimental results demonstrated that all the basic phone and rounded phone unit-based multilingual models outperformed the corresponding unilingual models with the relative performance improvements of 5.47% to 19.87% and 5.74% to 16.77%, respectively. The rounded phone unit-based multilingual models outperformed the equivalent basic phone unit-based models with relative performance improvements of 0.95% to 4.98%. Overall, we discovered that multilingual DNN modeling methods are profoundly effective to develop Chaha speech recognizers. Both the basic and rounded phone acoustic units are convenient to build Chaha ASR system. However, the rounded phone unit-based models are superior in performance and faster in recognition speed over the corresponding basic phone unit-based models. Hence, the rounded phone units are the most suitable acoustic units to develop Chaha ASR systems.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"54 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83851324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Shadow Detection Method Based on HMRF with Soft Edges for High-Resolution Remote-Sensing Images 基于软边缘HMRF的高分辨率遥感图像阴影检测方法
Q3 Computer Science Pub Date : 2019-11-29 DOI: 10.4236/jsip.2019.104011
Wenying Ge
Shadow detection is a crucial task in high-resolution remote-sensing image processing. Various shadow detection methods have been explored during the last decades. These methods did improve the detection accuracy but are still not robust enough to get satisfactory results for failing to extract enough information from the original images. To take full advantage of various features of shadows, a new method combining edges information with the spectral and spatial information is proposed in this paper. As known, edge is one of the most important characteristics in the high-resolution remote-sensing images. Unfortunately, in shadow detection, it is a high-risk strategy to determine whether a pixel is the edge or not strictly because intensity values on shadow boundaries are always between those in shadow and non-shadow areas. Therefore, a soft edge description model is developed to describe the degree of each pixel belonging to the edges or not. Sequentially, the soft edge description is incorporating to a fuzzy clustering procedure based on HMRF (Hidden Markov Random Fields), in which more appropriate spatial contextual information can be used. More concretely, it consists of two components: the soft edge description model and an iterative shadow detection algorithm. Experiments on several remote sensing images have shown that the proposed method can obtain more accurate shadow detection results.
阴影检测是高分辨率遥感图像处理中的一项重要任务。在过去的几十年里,人们探索了各种阴影检测方法。这些方法虽然提高了检测精度,但由于没有从原始图像中提取足够的信息,鲁棒性仍然不够理想。为了充分利用阴影的各种特征,本文提出了一种将边缘信息与光谱信息和空间信息相结合的新方法。众所周知,边缘是高分辨率遥感图像中最重要的特征之一。不幸的是,在阴影检测中,严格判断一个像素是否是边缘是一种高风险的策略,因为阴影边界上的强度值总是在阴影区域和非阴影区域之间。因此,开发了一种软边缘描述模型来描述每个像素属于边缘或不属于边缘的程度。然后,将软边缘描述与基于隐马尔可夫随机场的模糊聚类过程相结合,从而使用更合适的空间上下文信息。具体来说,它由两个部分组成:软边缘描述模型和迭代阴影检测算法。在多幅遥感图像上的实验表明,该方法可以获得更准确的阴影检测结果。
{"title":"Shadow Detection Method Based on HMRF with Soft Edges for High-Resolution Remote-Sensing Images","authors":"Wenying Ge","doi":"10.4236/jsip.2019.104011","DOIUrl":"https://doi.org/10.4236/jsip.2019.104011","url":null,"abstract":"Shadow detection is a crucial task in high-resolution remote-sensing image processing. Various shadow detection methods have been explored during the last decades. These methods did improve the detection accuracy but are still not robust enough to get satisfactory results for failing to extract enough information from the original images. To take full advantage of various features of shadows, a new method combining edges information with the spectral and spatial information is proposed in this paper. As known, edge is one of the most important characteristics in the high-resolution remote-sensing images. Unfortunately, in shadow detection, it is a high-risk strategy to determine whether a pixel is the edge or not strictly because intensity values on shadow boundaries are always between those in shadow and non-shadow areas. Therefore, a soft edge description model is developed to describe the degree of each pixel belonging to the edges or not. Sequentially, the soft edge description is incorporating to a fuzzy clustering procedure based on HMRF (Hidden Markov Random Fields), in which more appropriate spatial contextual information can be used. More concretely, it consists of two components: the soft edge description model and an iterative shadow detection algorithm. Experiments on several remote sensing images have shown that the proposed method can obtain more accurate shadow detection results.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"46 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80589352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Perceptually Lossless Compression for Mastcam Multispectral Images: A Comparative Study 桅杆凸轮多光谱图像感知无损压缩的比较研究
Q3 Computer Science Pub Date : 2019-11-28 DOI: 10.4236/jsip.2019.104009
C. Kwan, Jude Larkin
The two mast cameras, Mastcams, onboard Mars rover Curiosity are multispectral imagers with nine bands in each. Currently, the images are compressed losslessly using JPEG, which can achieve only two to three times of compression. We present a comparative study of four approaches to compressing multispectral Mastcam images. The first approach is to divide the nine bands into three groups with each group having three bands. Since the multispectral bands have strong correlation, we treat the three groups of images as video frames. We call this approach the Video approach. The second approach is to compress each group separately and we call it the split band (SB) approach. The third one is to apply a two-step approach in which the first step uses principal component analysis (PCA) to compress a nine-band image cube to six bands and a second step compresses the six PCA bands using conventional codecs. The fourth one is to apply PCA only. In addition, we also present subjective and objective assessment results for compressing RGB images because RGB images have been used for stereo and disparity map generation. Five well-known compression codecs, including JPEG, JPEG-2000 (J2K), X264, X265, and Daala in the literature, have been applied and compared in each approach. The performance of different algorithms was assessed using four well-known performance metrics. Two are conventional and another two are known to have good correlation with human perception. Extensive experiments using actual Mastcam images have been performed to demonstrate the various approaches. We observed that perceptually lossless compression can be achieved at 10:1 compression ratio. In particular, the performance gain of the SB approach with Daala is at least 5 dBs in terms peak signal-to-noise ratio (PSNR) at 10:1 compression ratio over that of JPEG. Subjective comparisons also corroborated with the objective metrics in that perceptually lossless compression can be achieved even at 20 to 1 compression.
好奇号火星探测器上的两个桅杆照相机是多光谱成像仪,每个都有九个波段。目前使用JPEG对图像进行无损压缩,只能实现2 ~ 3倍的压缩。我们提出了四种方法来压缩多光谱Mastcam图像的比较研究。第一种方法是将九个波段分成三组,每组有三个波段。由于多光谱波段具有很强的相关性,我们将三组图像作为视频帧处理。我们称这种方法为视频方法。第二种方法是分别压缩每个组,我们称之为分割带(SB)方法。第三步是采用两步方法,其中第一步使用主成分分析(PCA)将九波段图像立方体压缩为六个波段,第二步使用传统编解码器压缩六个PCA波段。第四种方法是只应用PCA。此外,由于RGB图像已被用于立体和视差图的生成,我们还提供了压缩RGB图像的主观和客观评估结果。在每种方法中应用并比较了五种知名的压缩编解码器,包括文献中的JPEG、JPEG-2000 (J2K)、X264、X265和Daala。使用四个众所周知的性能指标来评估不同算法的性能。两种是传统的,另外两种已知与人类感知有很好的相关性。利用Mastcam的实际图像进行了大量实验,以演示各种方法。我们观察到在10:1的压缩比下可以实现感知无损压缩。特别是,使用Daala的SB方法在10:1压缩比下的峰值信噪比(PSNR)比JPEG的性能增益至少为5 db。主观比较也证实了客观指标,即即使在20比1的压缩下也可以实现感知无损压缩。
{"title":"Perceptually Lossless Compression for Mastcam Multispectral Images: A Comparative Study","authors":"C. Kwan, Jude Larkin","doi":"10.4236/jsip.2019.104009","DOIUrl":"https://doi.org/10.4236/jsip.2019.104009","url":null,"abstract":"The two mast cameras, Mastcams, onboard Mars rover Curiosity are multispectral imagers with nine bands in each. Currently, the images are compressed losslessly using JPEG, which can achieve only two to three times of compression. We present a comparative study of four approaches to compressing multispectral Mastcam images. The first approach is to divide the nine bands into three groups with each group having three bands. Since the multispectral bands have strong correlation, we treat the three groups of images as video frames. We call this approach the Video approach. The second approach is to compress each group separately and we call it the split band (SB) approach. The third one is to apply a two-step approach in which the first step uses principal component analysis (PCA) to compress a nine-band image cube to six bands and a second step compresses the six PCA bands using conventional codecs. The fourth one is to apply PCA only. In addition, we also present subjective and objective assessment results for compressing RGB images because RGB images have been used for stereo and disparity map generation. Five well-known compression codecs, including JPEG, JPEG-2000 (J2K), X264, X265, and Daala in the literature, have been applied and compared in each approach. The performance of different algorithms was assessed using four well-known performance metrics. Two are conventional and another two are known to have good correlation with human perception. Extensive experiments using actual Mastcam images have been performed to demonstrate the various approaches. We observed that perceptually lossless compression can be achieved at 10:1 compression ratio. In particular, the performance gain of the SB approach with Daala is at least 5 dBs in terms peak signal-to-noise ratio (PSNR) at 10:1 compression ratio over that of JPEG. Subjective comparisons also corroborated with the objective metrics in that perceptually lossless compression can be achieved even at 20 to 1 compression.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86489407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Deep Learning Based Target Tracking and Classification for Infrared Videos Using Compressive Measurements 基于压缩测量的红外视频深度学习目标跟踪与分类
Q3 Computer Science Pub Date : 2019-11-13 DOI: 10.4236/jsip.2019.104010
C. Kwan, Bryan Chou, Jonathan Yang, T. Tran
Although compressive measurements save data storage and bandwidth usage, they are difficult to be used directly for target tracking and classification without pixel reconstruction. This is because the Gaussian random matrix destroys the target location information in the original video frames. This paper summarizes our research effort on target tracking and classification directly in the compressive measurement domain. We focus on one particular type of compressive measurement using pixel subsampling. That is, original pixels in video frames are randomly subsampled. Even in such a special compressive sensing setting, conventional trackers do not work in a satisfactory manner. We propose a deep learning approach that integrates YOLO (You Only Look Once) and ResNet (residual network) for multiple target tracking and classification. YOLO is used for multiple target tracking and ResNet is for target classification. Extensive experiments using short wave infrared (SWIR), mid-wave infrared (MWIR), and long-wave infrared (LWIR) videos demonstrated the efficacy of the proposed approach even though the training data are very scarce.
压缩测量虽然节省了数据存储和带宽使用,但如果不进行像素重建,则难以直接用于目标跟踪和分类。这是因为高斯随机矩阵破坏了原始视频帧中的目标位置信息。本文总结了压缩测量领域中目标跟踪与分类的研究成果。我们专注于使用像素子采样的一种特殊类型的压缩测量。也就是说,视频帧中的原始像素被随机抽样。即使在这种特殊的压缩感知设置中,传统的跟踪器也不能以令人满意的方式工作。我们提出了一种集成YOLO (You Only Look Once)和ResNet(残差网络)的深度学习方法,用于多目标跟踪和分类。YOLO用于多目标跟踪,ResNet用于目标分类。大量使用短波红外(SWIR)、中波红外(MWIR)和长波红外(LWIR)视频的实验证明了该方法的有效性,尽管训练数据非常稀缺。
{"title":"Deep Learning Based Target Tracking and Classification for Infrared Videos Using Compressive Measurements","authors":"C. Kwan, Bryan Chou, Jonathan Yang, T. Tran","doi":"10.4236/jsip.2019.104010","DOIUrl":"https://doi.org/10.4236/jsip.2019.104010","url":null,"abstract":"Although compressive measurements save data storage \u0000and bandwidth usage, they are difficult to be used directly for target tracking \u0000and classification without pixel reconstruction. This is because the Gaussian \u0000random matrix destroys the target location information in the original video \u0000frames. This paper summarizes our research effort on target tracking and \u0000classification directly in the compressive measurement domain. We focus on one \u0000particular type of compressive measurement using pixel subsampling. That is, \u0000original pixels in video frames are randomly subsampled. Even in such a special \u0000compressive sensing setting, conventional trackers do not work in a \u0000satisfactory manner. We propose a deep learning approach that integrates YOLO \u0000(You Only Look Once) and ResNet (residual network) for multiple target tracking \u0000and classification. YOLO is used for multiple target tracking and ResNet is for \u0000target classification. Extensive experiments using short wave infrared (SWIR), \u0000mid-wave infrared (MWIR), and long-wave infrared (LWIR) videos demonstrated the \u0000efficacy of the proposed approach even though the training data are very \u0000scarce.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"32 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78890066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
An Adaptive EMD Technique for Induction Motor Fault Detection 自适应EMD技术在感应电机故障检测中的应用
Q3 Computer Science Pub Date : 2019-11-13 DOI: 10.4236/jsip.2019.104008
Manzar Mahmud, Wilson Q. Wang
Reliable induction motor (IM) fault detection techniques are very useful in industries to diagnose IM defects and improve operational performance. An adaptive empirical mode decomposition (EMD) technology is proposed in this paper for rotor bar fault detection in IMs. As the characteristic fault frequency will change with operating conditions related to load and speed, the proposed adaptive EMD technique correlates fault features over different frequency bands and intrinsic mode function (IMF) sidebands. The adaptive EMD technique uses the first IMF to detect the fault type and the second IMF as an indicator to predict the fault severity. It can overcome the problems of the sensitivity of sideband frequencies related to the speed and load oscillations. The effectiveness of the proposed adaptive EMD technique is verified by experimental tests under different motor conditions.
可靠的感应电机故障检测技术在工业中诊断感应电机缺陷和提高运行性能是非常有用的。提出了一种基于自适应经验模态分解(EMD)的转子棒故障检测方法。由于故障特征频率会随负载和转速等工况变化,本文提出的自适应EMD技术将不同频带和IMF边带上的故障特征关联起来。自适应EMD技术使用第一个IMF来检测故障类型,使用第二个IMF作为预测故障严重程度的指标。它可以克服与速度和负载振荡有关的边带频率的灵敏度问题。通过不同运动条件下的实验验证了所提出的自适应EMD技术的有效性。
{"title":"An Adaptive EMD Technique for Induction Motor Fault Detection","authors":"Manzar Mahmud, Wilson Q. Wang","doi":"10.4236/jsip.2019.104008","DOIUrl":"https://doi.org/10.4236/jsip.2019.104008","url":null,"abstract":"Reliable induction motor (IM) fault detection techniques are very useful in industries to diagnose IM defects and improve operational performance. An adaptive empirical mode decomposition (EMD) technology is proposed in this paper for rotor bar fault detection in IMs. As the characteristic fault frequency will change with operating conditions related to load and speed, the proposed adaptive EMD technique correlates fault features over different frequency bands and intrinsic mode function (IMF) sidebands. The adaptive EMD technique uses the first IMF to detect the fault type and the second IMF as an indicator to predict the fault severity. It can overcome the problems of the sensitivity of sideband frequencies related to the speed and load oscillations. The effectiveness of the proposed adaptive EMD technique is verified by experimental tests under different motor conditions.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79725448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
New Results in Perceptually Lossless Compression of Hyperspectral Images 高光谱图像感知无损压缩的新成果
Q3 Computer Science Pub Date : 2019-08-26 DOI: 10.4236/JSIP.2019.103007
C. Kwan, Jude Larkin
Hyperspectral images (HSI) have hundreds of bands, which impose heavy burden on data storage and transmission bandwidth. Quite a few compression techniques have been explored for HSI in the past decades. One high performing technique is the combination of principal component analysis (PCA) and JPEG-2000 (J2K). However, since there are several new compression codecs developed after J2K in the past 15 years, it is worthwhile to revisit this research area and investigate if there are better techniques for HSI compression. In this paper, we present some new results in HSI compression. We aim at perceptually lossless compression of HSI. Perceptually lossless means that the decompressed HSI data cube has a performance metric near 40 dBs in terms of peak-signal-to-noise ratio (PSNR) or human visual system (HVS) based metrics. The key idea is to compare several combinations of PCA and video/ image codecs. Three representative HSI data cubes were used in our studies. Four video/image codecs, including J2K, X264, X265, and Daala, have been investigated and four performance metrics were used in our comparative studies. Moreover, some alternative techniques such as video, split band, and PCA only approaches were also compared. It was observed that the combination of PCA and X264 yielded the best performance in terms of compression performance and computational complexity. In some cases, the PCA + X264 combination achieved more than 3 dBs than the PCA + J2K combination.
高光谱图像有数百个波段,这给数据存储和传输带宽带来了沉重的负担。在过去的几十年里,相当多的压缩技术已经被探索用于恒生指数。一种高性能的技术是主成分分析(PCA)和JPEG-2000 (J2K)的结合。然而,由于在过去15年中J2K之后开发了几种新的压缩编解码器,因此有必要重新审视这个研究领域,并调查是否有更好的HSI压缩技术。在本文中,我们提出了一些关于HSI压缩的新结果。我们的目标是感知无损压缩的HSI。感知无损意味着解压后的HSI数据立方体在峰值信噪比(PSNR)或基于人类视觉系统(HVS)的指标方面具有接近40 db的性能指标。关键思想是比较PCA和视频/图像编解码器的几种组合。我们的研究中使用了三个具有代表性的HSI数据集。我们研究了四种视频/图像编解码器,包括J2K、X264、X265和Daala,并在我们的比较研究中使用了四种性能指标。此外,还比较了一些替代技术,如视频、分割带和仅PCA的方法。从压缩性能和计算复杂度方面来看,PCA和X264的组合产生了最好的性能。在某些情况下,PCA + X264组合比PCA + J2K组合实现了超过3个db。
{"title":"New Results in Perceptually Lossless Compression of Hyperspectral Images","authors":"C. Kwan, Jude Larkin","doi":"10.4236/JSIP.2019.103007","DOIUrl":"https://doi.org/10.4236/JSIP.2019.103007","url":null,"abstract":"Hyperspectral images (HSI) have hundreds of bands, which impose heavy \u0000burden on data storage and transmission bandwidth. Quite a few compression \u0000techniques have been explored for HSI in the past decades. One high performing \u0000technique is the combination of principal component analysis (PCA) \u0000and JPEG-2000 (J2K). However, since there are several new compression codecs \u0000developed after J2K in the past 15 years, it is worthwhile to revisit this research \u0000area and investigate if there are better techniques for HSI compression. \u0000In this paper, we present some new results in HSI compression. We aim at \u0000perceptually lossless compression of HSI. Perceptually lossless means that the \u0000decompressed HSI data cube has a performance metric near 40 dBs in terms of \u0000peak-signal-to-noise ratio (PSNR) or human visual system (HVS) based metrics. \u0000The key idea is to compare several combinations of PCA and video/ \u0000image codecs. Three representative HSI data cubes were used in our studies. \u0000Four video/image codecs, including J2K, X264, X265, and Daala, have \u0000been investigated and four performance metrics were used in our comparative \u0000studies. Moreover, some alternative techniques such as video, split band, and \u0000PCA only approaches were also compared. It was observed that the combination \u0000of PCA and X264 yielded the best performance in terms of compression \u0000performance and computational complexity. In some cases, the PCA + X264 \u0000combination achieved more than 3 dBs than the PCA + J2K combination.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74762913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Adaptive Lossy Image Compression Based on Singular Value Decomposition 基于奇异值分解的有损图像自适应压缩
Q3 Computer Science Pub Date : 2019-08-08 DOI: 10.4236/JSIP.2019.103005
M. R. Souza, H. Pedrini
Image compression techniques aim to reduce redundant information in order to allow data storage and transmission in an efficient way. In this work, we propose and analyze a lossy image compression method based on the singular value decomposition using an optimal choice of eigenvalues and an adaptive mechanism for block partitioning. Experiments are conducted on several images to demonstrate the effectiveness of the proposed compression method in comparison with the direct application of the singular value decomposition.
图像压缩技术的目的是减少冗余信息,以便有效地存储和传输数据。在这项工作中,我们提出并分析了一种基于奇异值分解的有损图像压缩方法,该方法使用了特征值的最优选择和块划分的自适应机制。在多幅图像上进行了实验,并与直接应用奇异值分解的方法进行了比较,验证了所提出的压缩方法的有效性。
{"title":"Adaptive Lossy Image Compression Based on Singular Value Decomposition","authors":"M. R. Souza, H. Pedrini","doi":"10.4236/JSIP.2019.103005","DOIUrl":"https://doi.org/10.4236/JSIP.2019.103005","url":null,"abstract":"Image compression techniques aim to reduce redundant information in order to allow data storage and transmission in an efficient way. In this work, we propose and analyze a lossy image compression method based on the singular value decomposition using an optimal choice of eigenvalues and an adaptive mechanism for block partitioning. Experiments are conducted on several images to demonstrate the effectiveness of the proposed compression method in comparison with the direct application of the singular value decomposition.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"98 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79140126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Target Tracking and Classification Using Compressive Measurements of MWIR and LWIR Coded Aperture Cameras 基于MWIR和LWIR编码孔径相机压缩测量的目标跟踪与分类
Q3 Computer Science Pub Date : 2019-08-08 DOI: 10.4236/JSIP.2019.103006
C. Kwan, Bryan Chou, Jonathan Yang, Akshay Rangamani, T. Tran, Jack Zhang, R. Etienne-Cummings
Pixel-wise Code Exposure (PCE) camera is one type of compressive sensing camera that has low power consumption and high compression ratio. Moreover, a PCE camera can control individual pixel exposure time that can enable high dynamic range. Conventional approaches of using PCE camera involve a time consuming and lossy process to reconstruct the original frames and then use those frames for target tracking and classification. In this paper, we present a deep learning approach that directly performs target tracking and classification in the compressive measurement domain without any frame reconstruction. Our approach has two parts: tracking and classification. The tracking has been done using YOLO (You Only Look Once) and the classification is achieved using Residual Network (ResNet). Extensive experiments using mid-wave infrared (MWIR) and long-wave infrared (LWIR) videos demonstrated the efficacy of our proposed approach.
PCE (Pixel-wise Code Exposure)相机是一种低功耗、高压缩比的压缩感知相机。此外,PCE相机可以控制能够实现高动态范围的单个像素曝光时间。利用PCE相机的传统方法需要对原始帧进行重建,然后利用这些帧进行目标跟踪和分类,这是一个耗时且有损的过程。在本文中,我们提出了一种深度学习方法,该方法直接在压缩测量域中进行目标跟踪和分类,而不需要任何帧重构。我们的方法有两个部分:跟踪和分类。使用YOLO(你只看一次)完成跟踪,使用残余网络(ResNet)实现分类。利用中波红外(MWIR)和长波红外(LWIR)视频进行的大量实验证明了我们提出的方法的有效性。
{"title":"Target Tracking and Classification Using Compressive Measurements of MWIR and LWIR Coded Aperture Cameras","authors":"C. Kwan, Bryan Chou, Jonathan Yang, Akshay Rangamani, T. Tran, Jack Zhang, R. Etienne-Cummings","doi":"10.4236/JSIP.2019.103006","DOIUrl":"https://doi.org/10.4236/JSIP.2019.103006","url":null,"abstract":"Pixel-wise Code Exposure (PCE) camera is one type of compressive sensing \u0000camera that has low power consumption and high compression ratio. Moreover, \u0000a PCE camera can control individual pixel exposure time that can enable \u0000high dynamic range. Conventional approaches of using PCE camera involve \u0000a time consuming and lossy process to reconstruct the original frames \u0000and then use those frames for target tracking and classification. In this paper, \u0000we present a deep learning approach that directly performs target tracking \u0000and classification in the compressive measurement domain without any \u0000frame reconstruction. Our approach has two parts: tracking and classification. \u0000The tracking has been done using YOLO (You Only Look Once) and the \u0000classification is achieved using Residual Network (ResNet). Extensive experiments \u0000using mid-wave infrared (MWIR) and long-wave infrared (LWIR) \u0000videos demonstrated the efficacy of our proposed approach.","PeriodicalId":38474,"journal":{"name":"Journal of Information Hiding and Multimedia Signal Processing","volume":"231 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76109071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
期刊
Journal of Information Hiding and Multimedia Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1