首页 > 最新文献

Signal and image processing : an international journal最新文献

英文 中文
Omni-Modeler: Rapid Adaptive Visual Recognition with Dynamic Learning Omni-Modeler:动态学习的快速自适应视觉识别
Pub Date : 2023-10-27 DOI: 10.5121/sipij.2023.14501
Michael Karnes, Alper Yilmaz
Deep neural network (DNN) image classification has grown rapidly as a general pattern detection tool for an extremely diverse set of applications; yet dataset accessibility remains a major limiting factor for many applications. This paper presents a novel dynamic learning approach to leverage pretrained knowledge to novel image spaces in the effort to extend the algorithm knowledge domain and reduce dataset collection requirements. The proposed Omni-Modeler generates a dynamic knowledge set by reshaping known concepts to create dynamic representation models of unknown concepts. The Omni-Modeler embeds images with a pretrained DNN and formulates compressed language encoder. The language encoded feature space is then used to rapidly generate a dynamic dictionary of concept appearance models. The results of this study demonstrate the Omni-Modeler capability to rapidly adapt across a range of image types enabling the usage of dynamically learning image classification with limited data availability.
深度神经网络(DNN)图像分类已经迅速发展成为一种通用的模式检测工具,用于极其多样化的应用;然而,数据集可访问性仍然是许多应用程序的主要限制因素。本文提出了一种新的动态学习方法,将预训练的知识应用于新的图像空间,以扩展算法的知识域并减少数据集收集需求。提出的Omni-Modeler通过重塑已知概念来创建未知概念的动态表示模型,从而生成动态知识集。Omni-Modeler使用预训练的DNN嵌入图像,并制定压缩语言编码器。然后使用语言编码的特征空间快速生成概念外观模型的动态字典。本研究的结果表明,Omni-Modeler能够快速适应一系列图像类型,从而在有限的数据可用性下使用动态学习图像分类。
{"title":"Omni-Modeler: Rapid Adaptive Visual Recognition with Dynamic Learning","authors":"Michael Karnes, Alper Yilmaz","doi":"10.5121/sipij.2023.14501","DOIUrl":"https://doi.org/10.5121/sipij.2023.14501","url":null,"abstract":"Deep neural network (DNN) image classification has grown rapidly as a general pattern detection tool for an extremely diverse set of applications; yet dataset accessibility remains a major limiting factor for many applications. This paper presents a novel dynamic learning approach to leverage pretrained knowledge to novel image spaces in the effort to extend the algorithm knowledge domain and reduce dataset collection requirements. The proposed Omni-Modeler generates a dynamic knowledge set by reshaping known concepts to create dynamic representation models of unknown concepts. The Omni-Modeler embeds images with a pretrained DNN and formulates compressed language encoder. The language encoded feature space is then used to rapidly generate a dynamic dictionary of concept appearance models. The results of this study demonstrate the Omni-Modeler capability to rapidly adapt across a range of image types enabling the usage of dynamically learning image classification with limited data availability.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"197 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136319809","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparative Study of Machine Learning Algorithms for EEG Signal Classification 脑电信号分类的机器学习算法比较研究
Pub Date : 2021-12-31 DOI: 10.5121/sipij.2021.12603
Anam Hashmi, B. Khan, Omar Farooq
In this paper, different machine learning algorithms such as Linear Discriminant Analysis, Support vector machine (SVM), Multi-layer perceptron, Random forest, K-nearest neighbour, and Autoencoder with SVM have been compared. This comparison was conducted to seek a robust method that would produce good classification accuracy. To this end, a robust method of classifying raw Electroencephalography (EEG) signals associated with imagined movement of the right hand and relaxation state, namely Autoencoder with SVM has been proposed. The EEG dataset used in this research was created by the University of Tubingen, Germany. The best classification accuracy achieved was 70.4% with SVM through feature engineering. However, our prosed method of autoencoder in combination with SVM produced a similar accuracy of 65% without using any feature engineering technique. This research shows that this system of classification of motor movements can be used in a Brain-Computer Interface system (BCI) to mentally control a robotic device or an exoskeleton.
本文比较了线性判别分析、支持向量机(SVM)、多层感知器、随机森林、k近邻、自动编码器等不同的机器学习算法。进行这种比较是为了寻求一种能够产生良好分类精度的鲁棒方法。为此,提出了一种对与想象中的右手运动和放松状态相关的原始脑电图信号进行鲁棒分类的方法,即基于支持向量机的自动编码器。这项研究中使用的脑电图数据集是由德国蒂宾根大学创建的。通过特征工程实现SVM的最佳分类准确率为70.4%。然而,我们提出的自动编码器与支持向量机相结合的方法在不使用任何特征工程技术的情况下产生了相似的65%的精度。这项研究表明,这种运动分类系统可以用于脑机接口系统(BCI),以对机器人设备或外骨骼进行精神控制。
{"title":"A Comparative Study of Machine Learning Algorithms for EEG Signal Classification","authors":"Anam Hashmi, B. Khan, Omar Farooq","doi":"10.5121/sipij.2021.12603","DOIUrl":"https://doi.org/10.5121/sipij.2021.12603","url":null,"abstract":"In this paper, different machine learning algorithms such as Linear Discriminant Analysis, Support vector machine (SVM), Multi-layer perceptron, Random forest, K-nearest neighbour, and Autoencoder with SVM have been compared. This comparison was conducted to seek a robust method that would produce good classification accuracy. To this end, a robust method of classifying raw Electroencephalography (EEG) signals associated with imagined movement of the right hand and relaxation state, namely Autoencoder with SVM has been proposed. The EEG dataset used in this research was created by the University of Tubingen, Germany. The best classification accuracy achieved was 70.4% with SVM through feature engineering. However, our prosed method of autoencoder in combination with SVM produced a similar accuracy of 65% without using any feature engineering technique. This research shows that this system of classification of motor movements can be used in a Brain-Computer Interface system (BCI) to mentally control a robotic device or an exoskeleton.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"10 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89486316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Combining of Narrative News and VR Games: Comparison of Various Forms of News Games 叙事新闻与VR游戏的结合:各种新闻游戏形式的比较
Pub Date : 2021-10-31 DOI: 10.5121/sipij.2021.12501
Xiaohan Feng, Makoto Murakami
The information explosion makes it easier to ignore information that requires social attention, and news games can make that information stand out. There is also considerable research that shows that people are more likely to remember narrative content. Virtual environments can also increase the amount of information a person can recall. If these elements are blended together, it may help people remember important information. This research aims to provide directional results for researchers interested in combining VR and narrative, enumerating the advantages and limitations of using text or non-text plot prompts in news games. It also provides hints for the use of virtual environments as learning platforms in news games. The research method is to first derive a theoretical derivation, then create a sample of news games, and then compare the experimental data of the sample to prove the theory. The research compares the survey data of a VR game that presents a story in non-text format (Group VR), a game that presents the story in non-text format (Group NVR), a VR game that presents the story in text (Group VRIT), and a game that presents the story in text (Group NVRIT) will be compared and analyzed. This paper describes the experiment. The results of the experiment show that among the four groups, the means that can make subjects remember the most information is a VR news game with a storyline. And there is a positive correlation between subjects' experience and confidence in recognizing memories, and empathy is positively correlated with the correctness of memories. In addition, the effects of "VR," "experience," and "presenting a story from text or video" on the percentage of correct answers differed depending on the type of question.
信息爆炸使得人们更容易忽略那些需要社会关注的信息,而新闻游戏可以让这些信息脱颖而出。也有相当多的研究表明,人们更有可能记住叙事内容。虚拟环境还可以增加一个人能回忆起的信息量。如果这些元素混合在一起,可能有助于人们记住重要信息。本研究旨在为有意将VR与叙事相结合的研究者提供方向性结果,列举在新闻游戏中使用文本或非文本情节提示的优势和局限性。同时也为新闻游戏中虚拟环境作为学习平台的使用提供了提示。研究方法是先进行理论推导,然后创建一个新闻游戏样本,然后比较样本的实验数据来证明理论。本研究将对以非文本形式呈现故事的VR游戏(Group VR)、以非文本形式呈现故事的VR游戏(Group NVR)、以文本形式呈现故事的VR游戏(Group VRIT)、以文本形式呈现故事的游戏(Group NVRIT)的调查数据进行对比分析。本文对实验进行了描述。实验结果表明,在四组中,最能让被试记住信息的手段是带有故事情节的VR新闻游戏。被试的经验与记忆识别的自信呈显著正相关,共情与记忆的正确性呈显著正相关。此外,“VR”、“体验”、“从文本或视频中呈现故事”对正确率的影响根据问题的类型而有所不同。
{"title":"Combining of Narrative News and VR Games: Comparison of Various Forms of News Games","authors":"Xiaohan Feng, Makoto Murakami","doi":"10.5121/sipij.2021.12501","DOIUrl":"https://doi.org/10.5121/sipij.2021.12501","url":null,"abstract":"The information explosion makes it easier to ignore information that requires social attention, and news games can make that information stand out. There is also considerable research that shows that people are more likely to remember narrative content. Virtual environments can also increase the amount of information a person can recall. If these elements are blended together, it may help people remember important information. This research aims to provide directional results for researchers interested in combining VR and narrative, enumerating the advantages and limitations of using text or non-text plot prompts in news games. It also provides hints for the use of virtual environments as learning platforms in news games. The research method is to first derive a theoretical derivation, then create a sample of news games, and then compare the experimental data of the sample to prove the theory. The research compares the survey data of a VR game that presents a story in non-text format (Group VR), a game that presents the story in non-text format (Group NVR), a VR game that presents the story in text (Group VRIT), and a game that presents the story in text (Group NVRIT) will be compared and analyzed. This paper describes the experiment. The results of the experiment show that among the four groups, the means that can make subjects remember the most information is a VR news game with a storyline. And there is a positive correlation between subjects' experience and confidence in recognizing memories, and empathy is positively correlated with the correctness of memories. In addition, the effects of \"VR,\" \"experience,\" and \"presenting a story from text or video\" on the percentage of correct answers differed depending on the type of question.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"781 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80847468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mixed Spectra for Stable Signals from Discrete Observations 离散观测稳定信号的混合光谱
Pub Date : 2021-10-31 DOI: 10.5121/sipij.2021.12502
R. Sabre
This paper concerns the continuous-time stable alpha symmetric processes which are inivitable in the modeling of certain signals with indefinitely increasing variance. Particularly the case where the spectral measurement is mixed: sum of a continuous measurement and a discrete measurement. Our goal is to estimate the spectral density of the continuous part by observing the signal in a discrete way. For that, we propose a method which consists in sampling the signal at periodic instants. We use Jackson's polynomial kernel to build a periodogram which we then smooth by two spectral windows taking into account the width of the interval where the spectral density is non-zero. Thus, we bypass the phenomenon of aliasing often encountered in the case of estimation from discrete observations of a continuous time process.
本文研究了在对方差无限增大的信号进行建模时不可避免的连续时间稳定α对称过程。特别是在光谱测量是混合的情况下:连续测量和离散测量的总和。我们的目标是通过以离散的方式观察信号来估计连续部分的谱密度。为此,我们提出了一种在周期时刻对信号进行采样的方法。我们使用Jackson的多项式核来构建一个周期图,然后考虑到谱密度非零的区间的宽度,我们通过两个谱窗来平滑该周期图。因此,我们绕过了在连续时间过程的离散观测估计中经常遇到的混叠现象。
{"title":"Mixed Spectra for Stable Signals from Discrete Observations","authors":"R. Sabre","doi":"10.5121/sipij.2021.12502","DOIUrl":"https://doi.org/10.5121/sipij.2021.12502","url":null,"abstract":"This paper concerns the continuous-time stable alpha symmetric processes which are inivitable in the modeling of certain signals with indefinitely increasing variance. Particularly the case where the spectral measurement is mixed: sum of a continuous measurement and a discrete measurement. Our goal is to estimate the spectral density of the continuous part by observing the signal in a discrete way. For that, we propose a method which consists in sampling the signal at periodic instants. We use Jackson's polynomial kernel to build a periodogram which we then smooth by two spectral windows taking into account the width of the interval where the spectral density is non-zero. Thus, we bypass the phenomenon of aliasing often encountered in the case of estimation from discrete observations of a continuous time process.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"43 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82737930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Fractional Order Butterworth Filter for Fetal Electrocardiographic Signal Feature Extraction 分数阶Butterworth滤波器用于胎儿心电图信号特征提取
Pub Date : 2021-10-31 DOI: 10.5121/sipij.2021.12503
Hadi Mohsen Alkanfery, Ibrahim Mustafa Mehedi
The non-invasive Fetal Electrocardiogram (FECG) signal has become a significant method for monitoring the fetus's physiological conditions, extracted from the Abdominal Electrocardiogram (AECG) during pregnancy. The current techniques are limited during delivery for detecting and analyzing fECG. The non - intrusive fECG recorded from the mother's abdomen is contaminated by a variety of noise sources, can be a more challenging task for removing the maternal ECG. These contaminated noises have become a major challenge during the extraction of fetal ECG is managed by uni-modal technique. In this research, a new method based on the combination of Wavelet Transform (WT) and Fast Independent Component Analysis (FICA) algorithm approach to extract fECG from AECG recordings of the pregnant woman is proposed. Initially, preprocessing of a signal is done by applying a Fractional Order Butterworth Filter (FBWF). To select the Direct ECG signal which is characterized as a reference signal and the abdominal signal which is characterized as an input signal to the WT, the cross-correlation technique is used to find the signal with greater similarity among the available four abdominal signals. The model performance of the proposed method shows the most frequent similarity of fetal heartbeat rate present in the database can be evaluated through MAE and MAPE is 0.6 and 0.041209 respectively. Thus the proposed methodology of de-noising and separation of fECG signals will act as the predominant one and assist in understanding the nature of the delivery on further analysis.
无创胎儿心电图(FECG)信号从妊娠期腹部心电图(AECG)中提取,已成为监测胎儿生理状况的重要方法。目前的技术在生产过程中检测和分析fECG受到限制。从母亲腹部记录的非侵入性fECG受到各种噪声源的污染,对于去除母亲的心电图来说是一项更具挑战性的任务。这些噪声污染已成为单峰技术处理胎儿心电信号提取过程中的一大难题。本研究提出了一种基于小波变换(WT)和快速独立分量分析(FICA)算法相结合的从孕妇AECG记录中提取feg的新方法。最初,信号的预处理是通过应用分数阶巴特沃斯滤波器(FBWF)来完成的。选取作为参考信号的直接心电信号和作为小波变换输入信号的腹部信号,利用互相关技术在可选的4个腹部信号中寻找相似度较大的信号。该方法的模型性能表明,通过MAE和MAPE可以评估数据库中胎儿心跳频率的最常见相似性分别为0.6和0.041209。因此,建议的降噪和分离fECG信号的方法将作为主要方法,并有助于进一步分析了解交付的性质。
{"title":"Fractional Order Butterworth Filter for Fetal Electrocardiographic Signal Feature Extraction","authors":"Hadi Mohsen Alkanfery, Ibrahim Mustafa Mehedi","doi":"10.5121/sipij.2021.12503","DOIUrl":"https://doi.org/10.5121/sipij.2021.12503","url":null,"abstract":"The non-invasive Fetal Electrocardiogram (FECG) signal has become a significant method for monitoring the fetus's physiological conditions, extracted from the Abdominal Electrocardiogram (AECG) during pregnancy. The current techniques are limited during delivery for detecting and analyzing fECG. The non - intrusive fECG recorded from the mother's abdomen is contaminated by a variety of noise sources, can be a more challenging task for removing the maternal ECG. These contaminated noises have become a major challenge during the extraction of fetal ECG is managed by uni-modal technique. In this research, a new method based on the combination of Wavelet Transform (WT) and Fast Independent Component Analysis (FICA) algorithm approach to extract fECG from AECG recordings of the pregnant woman is proposed. Initially, preprocessing of a signal is done by applying a Fractional Order Butterworth Filter (FBWF). To select the Direct ECG signal which is characterized as a reference signal and the abdominal signal which is characterized as an input signal to the WT, the cross-correlation technique is used to find the signal with greater similarity among the available four abdominal signals. The model performance of the proposed method shows the most frequent similarity of fetal heartbeat rate present in the database can be evaluated through MAE and MAPE is 0.6 and 0.041209 respectively. Thus the proposed methodology of de-noising and separation of fECG signals will act as the predominant one and assist in understanding the nature of the delivery on further analysis.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"34 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87954681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
General Purpose Image Tampering Detection using Convolutional Neural Network and Local Optimal Oriented Pattern (LOOP) 基于卷积神经网络和局部最优定向模式(LOOP)的通用图像篡改检测
Pub Date : 2021-04-30 DOI: 10.5121/SIPIJ.2021.12202
Ali Ahmad Aminu, N. N. Agwu
Digital image tampering detection has been an active area of research in recent times due to the ease with which digital image can be modified to convey false or misleading information. To address this problem, several studies have proposed forensics algorithms for digital image tampering detection. While these approaches have shown remarkable improvement, most of them only focused on detecting a specific type of image tampering. The limitation of these approaches is that new forensic method must be designed for each new manipulation approach that is developed. Consequently, there is a need to develop methods capable of detecting multiple tampering operations. In this paper, we proposed a novel general purpose image tampering scheme based on CNNs and Local Optimal Oriented Pattern (LOOP) which is capable of detecting five types of image tampering in both binary and multiclass scenarios. Unlike the existing deep learning techniques which used constrained pre-processing layers to suppress the effect of image content in order to capture image tampering traces, our method uses LOOP features, which can effectively subdue the effect image content, thus, allowing the proposed CNNs to capture the needed features to distinguish among different types of image tampering. Through a number of detailed experiments, our results demonstrate that the proposed general purpose image tampering method can achieve high detection accuracies in individual and multiclass image tampering detections respectively and a comparative analysis of our results with the existing state of the arts reveals that the proposed model is more robust than most of the exiting methods.
数字图像篡改检测近年来一直是一个活跃的研究领域,因为数字图像很容易被修改以传递虚假或误导性信息。为了解决这个问题,一些研究提出了用于数字图像篡改检测的取证算法。虽然这些方法已经显示出显著的改进,但大多数方法只专注于检测特定类型的图像篡改。这些方法的局限性是,必须为每一种新的操作方法设计新的取证方法。因此,有必要开发能够检测多重篡改操作的方法。本文提出了一种基于cnn和局部最优定向模式(LOOP)的通用图像篡改方案,该方案能够检测二值和多类场景下的五种图像篡改。与现有的深度学习技术使用约束预处理层来抑制图像内容的影响以捕获图像篡改痕迹不同,我们的方法使用LOOP特征,可以有效地抑制效果图像内容,从而使所提出的cnn能够捕获所需的特征来区分不同类型的图像篡改。通过大量详细的实验,我们的结果表明,我们提出的通用图像篡改方法在单个和多类图像篡改检测中分别可以达到较高的检测精度,并且我们的结果与现有技术的比较分析表明,我们提出的模型比大多数现有方法更具鲁棒性。
{"title":"General Purpose Image Tampering Detection using Convolutional Neural Network and Local Optimal Oriented Pattern (LOOP)","authors":"Ali Ahmad Aminu, N. N. Agwu","doi":"10.5121/SIPIJ.2021.12202","DOIUrl":"https://doi.org/10.5121/SIPIJ.2021.12202","url":null,"abstract":"Digital image tampering detection has been an active area of research in recent times due to the ease with which digital image can be modified to convey false or misleading information. To address this problem, several studies have proposed forensics algorithms for digital image tampering detection. While these approaches have shown remarkable improvement, most of them only focused on detecting a specific type of image tampering. The limitation of these approaches is that new forensic method must be designed for each new manipulation approach that is developed. Consequently, there is a need to develop methods capable of detecting multiple tampering operations. In this paper, we proposed a novel general purpose image tampering scheme based on CNNs and Local Optimal Oriented Pattern (LOOP) which is capable of detecting five types of image tampering in both binary and multiclass scenarios. Unlike the existing deep learning techniques which used constrained pre-processing layers to suppress the effect of image content in order to capture image tampering traces, our method uses LOOP features, which can effectively subdue the effect image content, thus, allowing the proposed CNNs to capture the needed features to distinguish among different types of image tampering. Through a number of detailed experiments, our results demonstrate that the proposed general purpose image tampering method can achieve high detection accuracies in individual and multiclass image tampering detections respectively and a comparative analysis of our results with the existing state of the arts reveals that the proposed model is more robust than most of the exiting methods.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"60 1","pages":"13-32"},"PeriodicalIF":0.0,"publicationDate":"2021-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87525407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Target Detection and Classification Performance Enhancement using Super-Resolution Infrared Videos 利用超分辨率红外视频增强目标检测和分类性能
Pub Date : 2021-04-30 DOI: 10.5121/SIPIJ.2021.12203
C. Kwan, David Gribben, Bence Budavari
Long range infrared videos such as the Defense Systems Information Analysis Center (DSIAC) videos usually do not have high resolution. In recent years, there are significant advancement in video super-resolution algorithms. Here, we summarize our study on the use of super-resolution videos for target detection and classification. We observed that super-resolution videos can significantly improve the detection and classification performance. For example, for 3000 m range videos, we were able to improve the average precision of target detection from 11% (without super-resolution) to 44% (with 4x super-resolution) and the overall accuracy of target classification from 10% (without super-resolution) to 44% (with 2x superresolution).
远程红外视频,如国防系统信息分析中心(DSIAC)视频通常不具有高分辨率。近年来,视频超分辨率算法取得了重大进展。本文对超分辨率视频用于目标检测和分类的研究进行了综述。我们观察到,超分辨率视频可以显著提高检测和分类性能。例如,对于3000米范围的视频,我们能够将目标检测的平均精度从11%(无超分辨率)提高到44%(有4倍超分辨率),目标分类的总体精度从10%(无超分辨率)提高到44%(有2倍超分辨率)。
{"title":"Target Detection and Classification Performance Enhancement using Super-Resolution Infrared Videos","authors":"C. Kwan, David Gribben, Bence Budavari","doi":"10.5121/SIPIJ.2021.12203","DOIUrl":"https://doi.org/10.5121/SIPIJ.2021.12203","url":null,"abstract":"Long range infrared videos such as the Defense Systems Information Analysis Center (DSIAC) videos usually do not have high resolution. In recent years, there are significant advancement in video super-resolution algorithms. Here, we summarize our study on the use of super-resolution videos for target detection and classification. We observed that super-resolution videos can significantly improve the detection and classification performance. For example, for 3000 m range videos, we were able to improve the average precision of target detection from 11% (without super-resolution) to 44% (with 4x super-resolution) and the overall accuracy of target classification from 10% (without super-resolution) to 44% (with 2x superresolution).","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"21 1","pages":"33-45"},"PeriodicalIF":0.0,"publicationDate":"2021-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82220640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Research on Noise Reduction and Enhancement Algorithm of Girth Weld Image 环焊缝图像的降噪增强算法研究
Pub Date : 2021-02-28 DOI: 10.5121/SIPIJ.2021.12102
Xiang-Song Zhang, Wei-Xin Gao, Shihuan Zhu
In order to eliminate the salt pepper and Gaussian mixed noise in X-ray weld image, the extreme value characteristics of salt and pepper noise are used to separate the mixed noise, and the non local mean filtering algorithm is used to denoise it. Because the smoothness of the exponential weighted kernel function is too large, it is easy to cause the image details fuzzy, so the cosine coefficient based on the function is adopted. An improved non local mean image denoising algorithm is designed by using weighted Gaussian kernel function. The experimental results show that the new algorithm reduces the noise and retains the details of the original image, and the peak signal-to-noise ratio is increased by 1.5 dB. An adaptive salt and pepper noise elimination algorithm is proposed, which can automatically adjust the filtering window to identify the noise probability. Firstly, the median filter is applied to the image, and the filtering results are compared with the pre filtering results to get the noise points. Then the weighted average of the middle three groups of data under each filtering window is used to estimate the image noise probability. Before filtering, the obvious noise points are removed by threshold method, and then the central pixel is estimated by the reciprocal square of the distance from the center pixel of the window. Finally, according to Takagi Sugeno (T-S) fuzzy rules, the output estimates of different models are fused by using noise probability. Experimental results show that the algorithm has the ability of automatic noise estimation and adaptive window adjustment. After filtering, the standard mean square deviation can be reduced by more than 20%, and the speed can be increased more than twice. In the enhancement part, a nonlinear image enhancement method is proposed, which can adjust the parameters adaptively and enhance the weld area automatically instead of the background area. The enhancement effect achieves the best personal visual effect. Compared with the traditional method, the enhancement effect is better and more in line with the needs of industrial field.
为了消除x射线焊缝图像中的盐胡椒和高斯混合噪声,利用盐胡椒噪声的极值特征对混合噪声进行分离,并采用非局部均值滤波算法对混合噪声进行去噪。由于指数加权核函数的平滑度过大,容易造成图像细节模糊,因此采用基于该函数的余弦系数。采用加权高斯核函数,设计了一种改进的非局部均值图像去噪算法。实验结果表明,新算法在降低噪声的同时保留了原始图像的细节,峰值信噪比提高了1.5 dB。提出了一种自适应椒盐噪声消除算法,该算法可以自动调整滤波窗口来识别噪声的概率。首先对图像进行中值滤波,并将滤波结果与预滤波结果进行比较,得到噪声点;然后利用每个滤波窗口下中间三组数据的加权平均来估计图像噪声的概率。滤波前先用阈值法去除明显的噪声点,然后用距离窗口中心像素点距离的倒数平方估计中心像素点。最后,根据Takagi Sugeno (T-S)模糊规则,利用噪声概率对不同模型的输出估计进行融合。实验结果表明,该算法具有自动噪声估计和自适应窗口调整的能力。滤波后标准均方差可降低20%以上,速度可提高2倍以上。在增强部分,提出了一种非线性图像增强方法,该方法可以自适应调整参数,自动增强焊缝区域而不是背景区域。增强效果达到最佳的个人视觉效果。与传统方法相比,增强效果更好,更符合工业现场的需要。
{"title":"Research on Noise Reduction and Enhancement Algorithm of Girth Weld Image","authors":"Xiang-Song Zhang, Wei-Xin Gao, Shihuan Zhu","doi":"10.5121/SIPIJ.2021.12102","DOIUrl":"https://doi.org/10.5121/SIPIJ.2021.12102","url":null,"abstract":"In order to eliminate the salt pepper and Gaussian mixed noise in X-ray weld image, the extreme value characteristics of salt and pepper noise are used to separate the mixed noise, and the non local mean filtering algorithm is used to denoise it. Because the smoothness of the exponential weighted kernel function is too large, it is easy to cause the image details fuzzy, so the cosine coefficient based on the function is adopted. An improved non local mean image denoising algorithm is designed by using weighted Gaussian kernel function. The experimental results show that the new algorithm reduces the noise and retains the details of the original image, and the peak signal-to-noise ratio is increased by 1.5 dB. An adaptive salt and pepper noise elimination algorithm is proposed, which can automatically adjust the filtering window to identify the noise probability. Firstly, the median filter is applied to the image, and the filtering results are compared with the pre filtering results to get the noise points. Then the weighted average of the middle three groups of data under each filtering window is used to estimate the image noise probability. Before filtering, the obvious noise points are removed by threshold method, and then the central pixel is estimated by the reciprocal square of the distance from the center pixel of the window. Finally, according to Takagi Sugeno (T-S) fuzzy rules, the output estimates of different models are fused by using noise probability. Experimental results show that the algorithm has the ability of automatic noise estimation and adaptive window adjustment. After filtering, the standard mean square deviation can be reduced by more than 20%, and the speed can be increased more than twice. In the enhancement part, a nonlinear image enhancement method is proposed, which can adjust the parameters adaptively and enhance the weld area automatically instead of the background area. The enhancement effect achieves the best personal visual effect. Compared with the traditional method, the enhancement effect is better and more in line with the needs of industrial field.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"49 1","pages":"9-21"},"PeriodicalIF":0.0,"publicationDate":"2021-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72835839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of Convolutional Neural Network In LAWN Measurement 卷积神经网络在草坪草坪测量中的应用
Pub Date : 2021-02-28 DOI: 10.5121/SIPIJ.2021.12101
J. Wilkins, M. Nguyen, B. Rahmani
Lawn area measurement is an application of image processing and deep learning. Researchers used hierarchical networks, segmented images, and other methods to measure the lawn area. Methods’ effectiveness and accuracy varies. In this project, deep learning method, specifically Convolutional neural network, was applied to measure the lawn area. We used Keras and TensorFlow in Python to develop a model that was trained on the dataset of houses then tuned the parameters with GridSearchCV in ScikitLearn (a machine learning library in Python) to estimate the lawn area. Convolutional neural network or shortly CNN shows high accuracy (94 -97%). We may conclude that deep learning method, especially CNN, could be a good method with a high state-of-art accuracy.
草坪面积测量是图像处理和深度学习的一个应用。研究人员使用分层网络、分割图像和其他方法来测量草坪面积。方法的有效性和准确性各不相同。在这个项目中,我们使用深度学习方法,特别是卷积神经网络来测量草坪面积。我们使用Python中的Keras和TensorFlow来开发一个模型,该模型在房屋数据集上进行训练,然后使用ScikitLearn (Python中的机器学习库)中的GridSearchCV调整参数来估计草坪面积。卷积神经网络或简称CNN显示出较高的准确率(94 -97%)。我们可以得出结论,深度学习方法,特别是CNN,可能是一种具有高精确度的好方法。
{"title":"Application of Convolutional Neural Network In LAWN Measurement","authors":"J. Wilkins, M. Nguyen, B. Rahmani","doi":"10.5121/SIPIJ.2021.12101","DOIUrl":"https://doi.org/10.5121/SIPIJ.2021.12101","url":null,"abstract":"Lawn area measurement is an application of image processing and deep learning. Researchers used hierarchical networks, segmented images, and other methods to measure the lawn area. Methods’ effectiveness and accuracy varies. In this project, deep learning method, specifically Convolutional neural network, was applied to measure the lawn area. We used Keras and TensorFlow in Python to develop a model that was trained on the dataset of houses then tuned the parameters with GridSearchCV in ScikitLearn (a machine learning library in Python) to estimate the lawn area. Convolutional neural network or shortly CNN shows high accuracy (94 -97%). We may conclude that deep learning method, especially CNN, could be a good method with a high state-of-art accuracy.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"2 1","pages":"1-8"},"PeriodicalIF":0.0,"publicationDate":"2021-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89694845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Role of Hybrid Level Set in Fetal Contour Extraction 混合水平集在胎儿轮廓提取中的作用
Pub Date : 2021-02-28 DOI: 10.5121/SIPIJ.2021.12104
Rachana Jaiswal, S. Satarkar
Image processing technologies may be employed for quicker and accurate diagnosis in analysis and feature extraction of medical images. Here, existing level set algorithm is modified and it is employed for extracting contour of fetus in an image. In traditional approach, fetal parameters are extracted manually from ultrasound images. An automatic technique is highly desirable to obtain fetal biometric measurements due to some problems in traditional approach such as lack of consistency and accuracy. The proposed approach utilizes global & local region information for fetal contour extraction from ultrasonic images. The main goal of this research is to develop a new methodology to aid the analysis and feature extraction.
在医学图像的分析和特征提取中,图像处理技术可以用于更快、更准确的诊断。本文对现有的水平集算法进行了改进,并将其用于提取图像中的胎儿轮廓。传统的方法是人工从超声图像中提取胎儿参数。由于传统的胎儿生物特征测量方法存在一致性和准确性不高的问题,自动化技术是实现胎儿生物特征测量的迫切需要。该方法利用全局和局部信息从超声图像中提取胎儿轮廓。本研究的主要目标是开发一种新的方法来辅助分析和特征提取。
{"title":"Role of Hybrid Level Set in Fetal Contour Extraction","authors":"Rachana Jaiswal, S. Satarkar","doi":"10.5121/SIPIJ.2021.12104","DOIUrl":"https://doi.org/10.5121/SIPIJ.2021.12104","url":null,"abstract":"Image processing technologies may be employed for quicker and accurate diagnosis in analysis and feature extraction of medical images. Here, existing level set algorithm is modified and it is employed for extracting contour of fetus in an image. In traditional approach, fetal parameters are extracted manually from ultrasound images. An automatic technique is highly desirable to obtain fetal biometric measurements due to some problems in traditional approach such as lack of consistency and accuracy. The proposed approach utilizes global & local region information for fetal contour extraction from ultrasonic images. The main goal of this research is to develop a new methodology to aid the analysis and feature extraction.","PeriodicalId":90726,"journal":{"name":"Signal and image processing : an international journal","volume":"17 9","pages":"39-52"},"PeriodicalIF":0.0,"publicationDate":"2021-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72371788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Signal and image processing : an international journal
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1