首页 > 最新文献

2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)最新文献

英文 中文
Short-term Power Prediction using ANN 基于人工神经网络的短期电力预测
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576813
G. Perveen, P. Anand, Amod Kumar
An accurate prediction of solar energy becomes imperative for the planning and optimization of solar-based energy systems. The present research involves the implementation of Artificial Neural Network (ANN) models employing a cascade forward backpropagation algorithm for predicting short-term PV power using meteorological parameters based on distinct weather conditions. Prediction of solar energy during clear weather is easily done; however, the challenge lies in prediction under cloudy weather conditions. Therefore, the present work involves the prediction of power in solar PV systems for clear, hazy, partly and fully cloudy weather in composite climatic zone. Models are developed by simulating in MATLAB platform and for validating the accuracy of the results, statistical evaluation indices are used. The model can be used easily for predicting power for the preliminary design of solar-based applications.
准确的太阳能预测对太阳能能源系统的规划和优化至关重要。目前的研究涉及采用级联前向反向传播算法的人工神经网络(ANN)模型的实现,该模型使用基于不同天气条件的气象参数预测短期光伏功率。天气晴朗时的太阳能预测很容易做到;然而,挑战在于多云天气条件下的预测。因此,本研究涉及到复合气气带晴朗、朦胧、部分和完全多云天气下太阳能光伏发电系统功率的预测。在MATLAB平台上通过仿真建立了模型,并采用统计评价指标验证了结果的准确性。该模型可方便地为太阳能应用的初步设计预测功率。
{"title":"Short-term Power Prediction using ANN","authors":"G. Perveen, P. Anand, Amod Kumar","doi":"10.1109/ICSIPA52582.2021.9576813","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576813","url":null,"abstract":"An accurate prediction of solar energy becomes imperative for the planning and optimization of solar-based energy systems. The present research involves the implementation of Artificial Neural Network (ANN) models employing a cascade forward backpropagation algorithm for predicting short-term PV power using meteorological parameters based on distinct weather conditions. Prediction of solar energy during clear weather is easily done; however, the challenge lies in prediction under cloudy weather conditions. Therefore, the present work involves the prediction of power in solar PV systems for clear, hazy, partly and fully cloudy weather in composite climatic zone. Models are developed by simulating in MATLAB platform and for validating the accuracy of the results, statistical evaluation indices are used. The model can be used easily for predicting power for the preliminary design of solar-based applications.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126507019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Class 1 and Class 2 Underwater Image Enhancement and Restoration Under Turbidity Conditions 浑浊条件下的水下图像增强与恢复
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576782
M. K. Awang, Halimatun Saidah Aminuddin, Nurul Kamilah Mat Kamil, K. Mustafa
Poor visibility in underwater images is commonly attributed to the presence of impurities and the absorbed light being scattered while travelling through impure water. In this paper, image enhancement and restoration techniques are applied to TURBID image datasets. The TURBID dataset consists of three different types of underwater image conditions where blue solution, milk solution, or chlorophyll solution is added to water. The images undergo Histogram equalization (HE) and are filtered with a Wiener filter for image enhancement and image restoration, respectively. HE as the chosen enhancement method proved that it could enhance the image quality as the water surface can be seen clearer after enhancement by visual inspection. Three Wiener filter classes are chosen as the restoration method to reduce the Mean Square Error (MSE) value and to get high Peak Signal-to-Noise-Ratio (PSNR) with desired SNR value. Finally, these two image processing techniques, enhancement, and restoration are combined and the image quantitative values are compared to show that the image performance can be improved with combined enhancement and restoration techniques. It is found that Class 1 Wiener Filter with Enhance then Restore (ER) method has a value of 0 for MSE which is the lowest compared to other studied methods and has infinity values for PSNR and SNR.
水下图像能见度差通常归因于杂质的存在和吸收的光在穿过不纯净的水时散射。本文将图像增强和恢复技术应用于浑浊图像数据集。TURBID数据集由三种不同类型的水下图像条件组成,其中蓝色溶液,牛奶溶液或叶绿素溶液被添加到水中。图像经过直方图均衡化(HE),并分别用维纳滤波器进行图像增强和图像恢复。实验证明,通过目测,增强后的水面更加清晰,可以提高图像质量。选择三种维纳滤波器作为恢复方法,以降低均方误差(MSE)值,并获得符合期望信噪比值的峰值信噪比(PSNR)。最后,将增强和恢复这两种图像处理技术结合起来,并对图像的定量值进行了比较,表明增强和恢复相结合可以提高图像的性能。研究发现,采用增强后恢复(ER)方法的1类维纳滤波器的MSE值为0,与其他研究方法相比最低,PSNR和SNR值为无穷大。
{"title":"Class 1 and Class 2 Underwater Image Enhancement and Restoration Under Turbidity Conditions","authors":"M. K. Awang, Halimatun Saidah Aminuddin, Nurul Kamilah Mat Kamil, K. Mustafa","doi":"10.1109/ICSIPA52582.2021.9576782","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576782","url":null,"abstract":"Poor visibility in underwater images is commonly attributed to the presence of impurities and the absorbed light being scattered while travelling through impure water. In this paper, image enhancement and restoration techniques are applied to TURBID image datasets. The TURBID dataset consists of three different types of underwater image conditions where blue solution, milk solution, or chlorophyll solution is added to water. The images undergo Histogram equalization (HE) and are filtered with a Wiener filter for image enhancement and image restoration, respectively. HE as the chosen enhancement method proved that it could enhance the image quality as the water surface can be seen clearer after enhancement by visual inspection. Three Wiener filter classes are chosen as the restoration method to reduce the Mean Square Error (MSE) value and to get high Peak Signal-to-Noise-Ratio (PSNR) with desired SNR value. Finally, these two image processing techniques, enhancement, and restoration are combined and the image quantitative values are compared to show that the image performance can be improved with combined enhancement and restoration techniques. It is found that Class 1 Wiener Filter with Enhance then Restore (ER) method has a value of 0 for MSE which is the lowest compared to other studied methods and has infinity values for PSNR and SNR.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126522558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Modeling of COVID-19 Disease with Impact of Lockdown in Pakistan & Malaysia 巴基斯坦和马来西亚封锁对COVID-19疾病影响的动态建模
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576795
G. M. Abro, S. Zulkifli, V. Asirvadam, Nirbhay Mathur, Rahul Kumar, Vipin Kumar Oad
Being researchers, it is an utmost responsibility to provide insight on social issues thus, this work addresses the dynamic modeling of first and most contagious disease named as COVID-19 caused by coronavirus. The first case of COVID-19 appeared in Pakistan was on 26th February 2020 and in Malaysia on 27th February 2020; both patients had foreign travel history. In the paper, the number of total affected cases and total deaths in both countries, are quite the same up till 12th April 2020 but the frequency of new cases per day and recovery rate are different from one another. The movement control approach had also been imposed on 18th March 2020 by both countries. Keeping these facts and figures, the paper proposes a mathematical model based on Lotka-Volterra equations and provides numerical solution of differential equations using the suspectable, exposed, infected, and recovered people data to estimate future consequences and address the difference in the growth rate of COVID-19 patients before and after locked down to reduce the spread further by taking pro-active approaches i.e., social distancing and being quarantined for the essential time frame.
作为研究人员,对社会问题提供见解是最大的责任,因此,这项工作解决了由冠状病毒引起的第一个也是最具传染性的疾病COVID-19的动态建模。巴基斯坦和马来西亚分别于2020年2月26日和27日出现第一例COVID-19病例;两例患者均有国外旅行史。在这篇论文中,截至2020年4月12日,两国的总感染病例数和总死亡人数基本相同,但每天新增病例的频率和康复率彼此不同。两国也于2020年3月18日实施了行动控制措施。根据这些事实和数据,本文提出了基于Lotka-Volterra方程的数学模型,并使用疑似、暴露、感染和康复人员数据提供微分方程的数值解,以估计未来的后果,并解决封锁前后COVID-19患者增长率的差异,通过采取积极主动的方法,如保持社会距离和隔离必要的时间框架,进一步减少传播。
{"title":"Dynamic Modeling of COVID-19 Disease with Impact of Lockdown in Pakistan & Malaysia","authors":"G. M. Abro, S. Zulkifli, V. Asirvadam, Nirbhay Mathur, Rahul Kumar, Vipin Kumar Oad","doi":"10.1109/ICSIPA52582.2021.9576795","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576795","url":null,"abstract":"Being researchers, it is an utmost responsibility to provide insight on social issues thus, this work addresses the dynamic modeling of first and most contagious disease named as COVID-19 caused by coronavirus. The first case of COVID-19 appeared in Pakistan was on 26th February 2020 and in Malaysia on 27th February 2020; both patients had foreign travel history. In the paper, the number of total affected cases and total deaths in both countries, are quite the same up till 12th April 2020 but the frequency of new cases per day and recovery rate are different from one another. The movement control approach had also been imposed on 18th March 2020 by both countries. Keeping these facts and figures, the paper proposes a mathematical model based on Lotka-Volterra equations and provides numerical solution of differential equations using the suspectable, exposed, infected, and recovered people data to estimate future consequences and address the difference in the growth rate of COVID-19 patients before and after locked down to reduce the spread further by taking pro-active approaches i.e., social distancing and being quarantined for the essential time frame.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115828576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Modeling, Controlling & Stabilization of An Underactuated Air-Cushion Vehicle (ACV) 欠驱动气垫车(ACV)的建模、控制与镇定
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576785
G. M. Abro, S. Zulkifli, V. Asirvadam, Z. Ali, Nirbhay Mathur, Rahul Kumar
Underactuated systems are very difficult to control and stabilize due to fewer number of control inputs as compared to degrees of freedom (DOF). Thus, this research manuscript presents a comparative analysis of two major control schemes for an underactuated air cushion vehicle (ACV) commonly known as hovercraft. By studying the translational and angular dynamics of proposed underactuated mechatronic system, the mathematical model had been derived using Newton Euler formalism. The validity and effectiveness of proportional integrated differentiator (PID) control design is compared with the Fuzzy based PID (F-PID) scheme. Thus, with provided simulation results, paper concludes that the proposed algorithm of fuzzy based PID (F-PID) is better solution for achieving robust transient and steady state performances than simple PID control scheme even in the availability of bounded uncertainties with quick convergence rate.
与自由度(DOF)相比,欠驱动系统由于控制输入数量较少而很难控制和稳定。因此,本研究手稿提出了一个欠驱动气垫飞行器(ACV)通常被称为气垫船的两种主要控制方案的比较分析。通过研究欠驱动机电系统的平动动力学和角动力学,利用牛顿-欧拉公式推导了欠驱动机电系统的数学模型。比较了比例积分微分器(PID)控制设计与基于模糊的PID (F-PID)控制方案的有效性。因此,通过提供的仿真结果,本文得出结论,即使在有界不确定性的情况下,本文提出的基于模糊PID (F-PID)算法比简单PID控制方案更能实现鲁棒的暂态和稳态性能,收敛速度快。
{"title":"Modeling, Controlling & Stabilization of An Underactuated Air-Cushion Vehicle (ACV)","authors":"G. M. Abro, S. Zulkifli, V. Asirvadam, Z. Ali, Nirbhay Mathur, Rahul Kumar","doi":"10.1109/ICSIPA52582.2021.9576785","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576785","url":null,"abstract":"Underactuated systems are very difficult to control and stabilize due to fewer number of control inputs as compared to degrees of freedom (DOF). Thus, this research manuscript presents a comparative analysis of two major control schemes for an underactuated air cushion vehicle (ACV) commonly known as hovercraft. By studying the translational and angular dynamics of proposed underactuated mechatronic system, the mathematical model had been derived using Newton Euler formalism. The validity and effectiveness of proportional integrated differentiator (PID) control design is compared with the Fuzzy based PID (F-PID) scheme. Thus, with provided simulation results, paper concludes that the proposed algorithm of fuzzy based PID (F-PID) is better solution for achieving robust transient and steady state performances than simple PID control scheme even in the availability of bounded uncertainties with quick convergence rate.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134061072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Affect Recognition Using Dynamic Characteristics of Motion 利用运动的动态特性影响识别
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576772
Saba Baloch, S. Abu-Bakar, M. Mokji, Saima Waseem
Recently, there has been a lot of work on affect recognition which is a relatively new research field. Here the detection and recognition of multiple emotions such as anger, joy, fear, etc. are being carried out. Furthermore, the research on affective computing is being conducted using various modalities, such as; facial expressions, bodily expressions, and speech, with facial expressions being the most researched modality. However, many researchers have lately highlighted the significance of bodily expressions in affect detection. In this paper, we have considered bodily expressions for recognizing affect and proposed a modified approach using dynamic characteristics of motion. The experiments were performed in MATLAB using the MPIIEmo dataset and results were compared with the existing research.
情感识别是一个相对较新的研究领域,近年来已有大量的研究工作。在这里,检测和识别多种情绪,如愤怒、喜悦、恐惧等正在进行。此外,情感计算的研究正在使用各种方式进行,例如;面部表情,身体表情和语言,其中面部表情是研究最多的形式。然而,许多研究人员最近强调了身体表情在情感检测中的重要性。在本文中,我们考虑了身体表达来识别情感,并提出了一种使用运动动态特征的改进方法。利用MPIIEmo数据集在MATLAB中进行了实验,并与已有研究结果进行了比较。
{"title":"Affect Recognition Using Dynamic Characteristics of Motion","authors":"Saba Baloch, S. Abu-Bakar, M. Mokji, Saima Waseem","doi":"10.1109/ICSIPA52582.2021.9576772","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576772","url":null,"abstract":"Recently, there has been a lot of work on affect recognition which is a relatively new research field. Here the detection and recognition of multiple emotions such as anger, joy, fear, etc. are being carried out. Furthermore, the research on affective computing is being conducted using various modalities, such as; facial expressions, bodily expressions, and speech, with facial expressions being the most researched modality. However, many researchers have lately highlighted the significance of bodily expressions in affect detection. In this paper, we have considered bodily expressions for recognizing affect and proposed a modified approach using dynamic characteristics of motion. The experiments were performed in MATLAB using the MPIIEmo dataset and results were compared with the existing research.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130933527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Comparative Study of In-Domain vs Cross-Domain Learning for Porn Cartoon Classification 色情漫画分类的领域内与跨领域学习比较研究
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576769
Nouar Aldahoul, H. A. Karim, A. Wazir, Mhd Adel Momo, Mohd Haris Lye Abdullah
Detection of adult contents such as pornography, sex, and nudity has been investigated extensively in the literature. Recently, content moderator is a significant component for social platforms to be integrated in their software applications and services. Cartoon content moderator is a specific kind of moderators that should be highly accurate to reduce the classification error and increase the model’s sensitivity to adult contents. This paper aims to compare the models pre-trained on natural adult images and called cross-domain learning models with ones pre-trained on cartoon images and called in-domain learning models for adult content detection in cartoons. The paper utilized pre-trained convolutional neural networks such as ResNet and EfficientNet to extract features that were applied to support vector machine for porn/normal classification. It was found that in-domain models outperformed cross-domain model in terms of performance metrics to improve the accuracy by 13 %, recall by 2 %, precision by 18 %, F1 score by 14 %, false negative rate by 2 %, and false positive rate by 16 %.
检测成人内容,如色情,性和裸体已被广泛的研究在文献中。最近,内容版主成为社交平台集成在其软件应用和服务中的重要组成部分。卡通内容版主是一种特定的版主,需要具有较高的准确率,以减少分类误差,提高模型对成人内容的敏感性。本文旨在比较基于自然成人图像的预训练模型(称为跨领域学习模型)与基于卡通图像的预训练模型(称为域内学习模型)在卡通成人内容检测中的应用。本文利用预训练的卷积神经网络(如ResNet和effentnet)提取特征,应用于支持向量机进行色情/正常分类。研究发现,在性能指标方面,域内模型优于跨域模型,准确率提高了13%,召回率提高了2%,精确度提高了18%,F1分数提高了14%,假阴性率提高了2%,假阳性率提高了16%。
{"title":"A Comparative Study of In-Domain vs Cross-Domain Learning for Porn Cartoon Classification","authors":"Nouar Aldahoul, H. A. Karim, A. Wazir, Mhd Adel Momo, Mohd Haris Lye Abdullah","doi":"10.1109/ICSIPA52582.2021.9576769","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576769","url":null,"abstract":"Detection of adult contents such as pornography, sex, and nudity has been investigated extensively in the literature. Recently, content moderator is a significant component for social platforms to be integrated in their software applications and services. Cartoon content moderator is a specific kind of moderators that should be highly accurate to reduce the classification error and increase the model’s sensitivity to adult contents. This paper aims to compare the models pre-trained on natural adult images and called cross-domain learning models with ones pre-trained on cartoon images and called in-domain learning models for adult content detection in cartoons. The paper utilized pre-trained convolutional neural networks such as ResNet and EfficientNet to extract features that were applied to support vector machine for porn/normal classification. It was found that in-domain models outperformed cross-domain model in terms of performance metrics to improve the accuracy by 13 %, recall by 2 %, precision by 18 %, F1 score by 14 %, false negative rate by 2 %, and false positive rate by 16 %.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121450818","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Training Convolutional Neural Networks to Detect Waste in Train Carriages 训练卷积神经网络来检测火车车厢中的废物
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576771
Nathan Western, X. Kong, Mustafa Erden
This research constitutes a systematic investigation of the effect of image view on Convolutional Neural Networks (CNNs) when trained to detect waste in train carriages. Additionally, this research identifies neural network architecture and training conditions for use in an automated train cleaning robot. Specifically, we investigate the relationship between the size of the CNN training dataset, whether these images are taken from a view sympathetic to the CNN application, and the effectiveness of the trained networks. Three datasets were constructed specifically for this research; a large dataset of 58,300 studio images of waste in a variety of conditions, a smaller dataset of 4,515 images taken of actual waste items on trains, and a dataset of 7,290 images of actual waste on trains used to test the CNNs. The images taken on trains were captured from the perspective of a hypothetical cleaning robot that would use these networks. Additionally, we provide a comparison of MobileNetV2, ShuffleNet, and SqueezeNet CNNs based on their suitability for implementation in an automated train cleaning system, and the optimum conditions to do so. Training with a smaller dataset of images taken from a “robot-eye view” resulted in an average increase in classification accuracy of 10.5%, with the largest increase being 26%, when compared to training with a larger dataset of images of waste items in various poses. ShuffleNet was identified as the optimally performing CNN for waste detection, achieving an accuracy of 88.61% when trained with a small dataset of images sympathetic to the end use. MobileNetV2 was found to perform optimally with a larger dataset of training images, even if these are less specific to the application of the network.
本研究系统地研究了卷积神经网络(cnn)在训练后检测火车车厢中的废物时,图像视图对其的影响。此外,本研究确定了用于自动列车清洁机器人的神经网络架构和训练条件。具体来说,我们研究了CNN训练数据集的大小、这些图像是否取自与CNN应用相一致的视图以及训练网络的有效性之间的关系。本研究专门构建了三个数据集;一个大型数据集包含58300张不同条件下的垃圾照片,一个较小的数据集包含4515张火车上的实际垃圾照片,一个数据集包含7290张火车上的实际垃圾照片,用于测试cnn。在火车上拍摄的图像是从一个假想的清洁机器人的角度拍摄的,这个机器人将使用这些网络。此外,我们根据MobileNetV2、ShuffleNet和SqueezeNet cnn在自动列车清洁系统中的适用性和最佳条件,对它们进行了比较。与使用各种姿势的废物图像的更大数据集进行训练相比,使用从“机器眼视图”拍摄的较小图像数据集进行训练导致分类准确率平均提高10.5%,最大增幅为26%。ShuffleNet被认为是用于垃圾检测的性能最佳的CNN,当使用与最终用途一致的小图像数据集进行训练时,准确率达到了88.61%。MobileNetV2被发现在更大的训练图像数据集上表现最佳,即使这些数据集对网络应用的特异性较低。
{"title":"Training Convolutional Neural Networks to Detect Waste in Train Carriages","authors":"Nathan Western, X. Kong, Mustafa Erden","doi":"10.1109/ICSIPA52582.2021.9576771","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576771","url":null,"abstract":"This research constitutes a systematic investigation of the effect of image view on Convolutional Neural Networks (CNNs) when trained to detect waste in train carriages. Additionally, this research identifies neural network architecture and training conditions for use in an automated train cleaning robot. Specifically, we investigate the relationship between the size of the CNN training dataset, whether these images are taken from a view sympathetic to the CNN application, and the effectiveness of the trained networks. Three datasets were constructed specifically for this research; a large dataset of 58,300 studio images of waste in a variety of conditions, a smaller dataset of 4,515 images taken of actual waste items on trains, and a dataset of 7,290 images of actual waste on trains used to test the CNNs. The images taken on trains were captured from the perspective of a hypothetical cleaning robot that would use these networks. Additionally, we provide a comparison of MobileNetV2, ShuffleNet, and SqueezeNet CNNs based on their suitability for implementation in an automated train cleaning system, and the optimum conditions to do so. Training with a smaller dataset of images taken from a “robot-eye view” resulted in an average increase in classification accuracy of 10.5%, with the largest increase being 26%, when compared to training with a larger dataset of images of waste items in various poses. ShuffleNet was identified as the optimally performing CNN for waste detection, achieving an accuracy of 88.61% when trained with a small dataset of images sympathetic to the end use. MobileNetV2 was found to perform optimally with a larger dataset of training images, even if these are less specific to the application of the network.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122500183","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local File Inclusion Vulnerability Scanner with Tor Proxy 本地文件包含漏洞扫描器与Tor代理
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576783
Ku Ahmad Haziq Hezret Bin Che Ku Mohd Sahidi, Muhammad Azizi Mohd Ariffin, Muhammad Izzad Ramli, Z. Kasiran
Web applications have made communication and services for users extremely simple because of the user-friendly interface, global accessibility, and ease of management. However, careless web application design and implementation are crucial to a security compromise that is incredibly troubling both to the user and web administrators. The weakness in Local File Inclusion (LFI) currently exists in many web applications that result in remote code execution in a host server. Hence, detecting the vulnerability of LFI is becoming extremely important to the web owner in taking effective risk mitigation action. Meanwhile, the current vulnerability scanner that is available nowadays focuses more on SQL injection and cross site scripting but fewer over Local File Inclusion vulnerability. Other than that, users cannot observe what sort of sensitive file or data could be obtained by an attacker and maintain the anonymity of the user because current Vulnerability scanner on the market does not integrate with TOR network out-of-the-box. This project proposed an automated system for the identification of LFI vulnerabilities with obscure for web applications. Therefore, the objective of this project is to develop a system that can detect LFI vulnerabilities within the web application and while still able to maintain user anonymity across the network by covering the source IP address of the scanner using the Tor network and simulates how a real-world hacker attacks web application using LFI vulnerability. Furthermore, there are six phases involved in the methodology to complete this project: information gathering, requirement analysis, system design, development, testing, and documentation. Lastly for documentation, is to make a report about Local File Inclusion Vulnerability Scanner with Tor Onion Router Proxy. From the result testing, it indicates that the project can identify any local file inclusion vulnerabilities that exist over the web application while also having the advantage to observe the point of view of an attacker capable of hiding the scanner source of IP address.
由于用户友好的界面、全局可访问性和易于管理,Web应用程序使用户的通信和服务变得极其简单。然而,粗心的web应用程序设计和实现对于用户和web管理员来说都是非常麻烦的安全问题至关重要。本地文件包含(LFI)的弱点目前存在于许多web应用程序中,导致在主机服务器上远程执行代码。因此,检测LFI的脆弱性对于网站所有者采取有效的风险缓解行动变得极其重要。同时,目前可用的漏洞扫描器更多地关注SQL注入和跨站脚本,而较少关注本地文件包含漏洞。除此之外,用户无法观察到攻击者可以获得什么样的敏感文件或数据,并保持用户的匿名性,因为目前市场上的漏洞扫描器没有与TOR网络集成。本课题提出了一种用于web应用程序模糊的LFI漏洞自动识别系统。因此,该项目的目标是开发一个系统,可以检测web应用程序中的LFI漏洞,同时仍然能够通过使用Tor网络覆盖扫描仪的源IP地址来保持网络中的用户匿名性,并模拟真实世界的黑客如何使用LFI漏洞攻击web应用程序。此外,完成这个项目的方法包括六个阶段:信息收集、需求分析、系统设计、开发、测试和文档。最后的文档,是做一个关于本地文件包含漏洞扫描器与Tor洋葱路由器代理的报告。从结果测试来看,它表明该项目可以识别存在于web应用程序上的任何本地文件包含漏洞,同时也具有观察攻击者能够隐藏扫描程序IP地址源的优势。
{"title":"Local File Inclusion Vulnerability Scanner with Tor Proxy","authors":"Ku Ahmad Haziq Hezret Bin Che Ku Mohd Sahidi, Muhammad Azizi Mohd Ariffin, Muhammad Izzad Ramli, Z. Kasiran","doi":"10.1109/ICSIPA52582.2021.9576783","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576783","url":null,"abstract":"Web applications have made communication and services for users extremely simple because of the user-friendly interface, global accessibility, and ease of management. However, careless web application design and implementation are crucial to a security compromise that is incredibly troubling both to the user and web administrators. The weakness in Local File Inclusion (LFI) currently exists in many web applications that result in remote code execution in a host server. Hence, detecting the vulnerability of LFI is becoming extremely important to the web owner in taking effective risk mitigation action. Meanwhile, the current vulnerability scanner that is available nowadays focuses more on SQL injection and cross site scripting but fewer over Local File Inclusion vulnerability. Other than that, users cannot observe what sort of sensitive file or data could be obtained by an attacker and maintain the anonymity of the user because current Vulnerability scanner on the market does not integrate with TOR network out-of-the-box. This project proposed an automated system for the identification of LFI vulnerabilities with obscure for web applications. Therefore, the objective of this project is to develop a system that can detect LFI vulnerabilities within the web application and while still able to maintain user anonymity across the network by covering the source IP address of the scanner using the Tor network and simulates how a real-world hacker attacks web application using LFI vulnerability. Furthermore, there are six phases involved in the methodology to complete this project: information gathering, requirement analysis, system design, development, testing, and documentation. Lastly for documentation, is to make a report about Local File Inclusion Vulnerability Scanner with Tor Onion Router Proxy. From the result testing, it indicates that the project can identify any local file inclusion vulnerabilities that exist over the web application while also having the advantage to observe the point of view of an attacker capable of hiding the scanner source of IP address.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121536047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Active Stereo Matching Benchmark for 3D Reconstruction using Multi-view Depths 基于多视角深度的三维重建的主动立体匹配基准
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576787
M. Jang, Seongmin Lee, Jiwoo Kang, Sanghoon Lee
With the advance of 3D entertainment, 3D reconstruction has been widely researched. Recently, for the 3D reconstruction, multi-view depth images are generally used due to the wide availability of commercial RGBD sensors. The depth image can be directly acquired from the specific sensor or estimated from the stereo images by using a stereo matching algorithm. The performance of the depth estimation using a specific sensor is only dependent on the sensor performance. However, since the stereo matching method is dependent on stereo matching accuracy, a more accurate depth can be obtained from the high accuracy stereo matching method. Therefore, we focus on the stereo matching method for estimating the depth image. In this paper, we present the benchmark on the active stereo matching method for 3D reconstruction. Through the quantitative and qualitative benchmarks, we analyze and visualize the depth estimation and 3D reconstruction results. By presenting the active stereo matching benchmark, we provide guidance for 3D reconstruction using multi-view depths.
随着3D娱乐的发展,三维重建得到了广泛的研究。目前,由于商用RGBD传感器的广泛应用,对于三维重建,通常采用多视角深度图像。深度图像可以直接从特定传感器获取,也可以使用立体匹配算法从立体图像中估计。使用特定传感器的深度估计的性能仅取决于传感器的性能。然而,由于立体匹配方法依赖于立体匹配精度,因此高精度立体匹配方法可以获得更精确的深度。因此,我们重点研究了深度图像估计的立体匹配方法。本文提出了一种用于三维重建的主动立体匹配方法的基准。通过定量和定性的基准测试,对深度估计和三维重建结果进行分析和可视化。通过提出主动立体匹配基准,为多视角深度三维重建提供指导。
{"title":"Active Stereo Matching Benchmark for 3D Reconstruction using Multi-view Depths","authors":"M. Jang, Seongmin Lee, Jiwoo Kang, Sanghoon Lee","doi":"10.1109/ICSIPA52582.2021.9576787","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576787","url":null,"abstract":"With the advance of 3D entertainment, 3D reconstruction has been widely researched. Recently, for the 3D reconstruction, multi-view depth images are generally used due to the wide availability of commercial RGBD sensors. The depth image can be directly acquired from the specific sensor or estimated from the stereo images by using a stereo matching algorithm. The performance of the depth estimation using a specific sensor is only dependent on the sensor performance. However, since the stereo matching method is dependent on stereo matching accuracy, a more accurate depth can be obtained from the high accuracy stereo matching method. Therefore, we focus on the stereo matching method for estimating the depth image. In this paper, we present the benchmark on the active stereo matching method for 3D reconstruction. Through the quantitative and qualitative benchmarks, we analyze and visualize the depth estimation and 3D reconstruction results. By presenting the active stereo matching benchmark, we provide guidance for 3D reconstruction using multi-view depths.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128178377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An Insight Into the Rise Time of Exponential Smoothing for Speech Enhancement Methods 语音增强方法中指数平滑上升时间的研究
Pub Date : 2021-09-13 DOI: 10.1109/ICSIPA52582.2021.9576801
S. Low
Exponential smoothing is a widely used averaging function to estimate the speech and noise statistics. However, the setting of the smoothing constant has been inconsistent or to a certain extent arbitrarily set. This paper aims to fill the gap by formulating the smoothing constant as a function of rise time to better reflect the variability of the signal to be smoothed. Experimental results with real world noise reveal that the performance is very sensitive to the rise time of the short term averaging function, whilst less so for the longer term averaging function. The results provide a guideline for speech enhancement methods to set the smoothing constant for the estimation of speech and noise statistics.
指数平滑是一种广泛应用于语音和噪声统计估计的平均函数。然而,平滑常数的设置一直不一致或在一定程度上任意设置。本文旨在通过将平滑常数表述为上升时间的函数来填补这一空白,以更好地反映待平滑信号的可变性。实际噪声条件下的实验结果表明,该算法对短期平均函数的上升时间非常敏感,而对长期平均函数的上升时间则不太敏感。研究结果为语音增强方法设置平滑常数以估计语音和噪声统计量提供了指导。
{"title":"An Insight Into the Rise Time of Exponential Smoothing for Speech Enhancement Methods","authors":"S. Low","doi":"10.1109/ICSIPA52582.2021.9576801","DOIUrl":"https://doi.org/10.1109/ICSIPA52582.2021.9576801","url":null,"abstract":"Exponential smoothing is a widely used averaging function to estimate the speech and noise statistics. However, the setting of the smoothing constant has been inconsistent or to a certain extent arbitrarily set. This paper aims to fill the gap by formulating the smoothing constant as a function of rise time to better reflect the variability of the signal to be smoothed. Experimental results with real world noise reveal that the performance is very sensitive to the rise time of the short term averaging function, whilst less so for the longer term averaging function. The results provide a guideline for speech enhancement methods to set the smoothing constant for the estimation of speech and noise statistics.","PeriodicalId":326688,"journal":{"name":"2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115448311","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2021 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1