首页 > 最新文献

Optical Memory and Neural Networks最新文献

英文 中文
ASE-UNet: An Orange Fruit Segmentation Model in an Agricultural Environment Based on Deep Learning ASE-UNet:基于深度学习的农业环境中橙色水果分割模型
IF 1 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/S1060992X23040045
Changgeng Yu, Dashi Lin, Chaowen He

Fruit picking robot requires a powerful vision system that can accurately identify the fruit on the tree. Accurate segmentation of orange fruit in orchards is challenging because of the complex environments due to the overlapping of fruits and occlusions from foliage. In this work, we proposed an image segmentation model called ASE-UNet based on the U-Net architecture, which can achieve accurate segmentation of oranges in complex environments. Firstly, the backbone network structure is improved to reduce the down-sampling rate of orange fruit images, thereby retaining more spatial detail information. Secondly, we introduced the Shape Feature Extraction Module (SFEM), which at enhancing the ability of the model to distinguish between the fruits and backgrounds, such as branches and leaves, by extracting shape and outline information from the orange fruit target. Finally, an attention mechanism was utilized to suppress background channel feature interference in the skip connection and improve the fusion of high-layer and low-layer features. We evaluate the proposed model on the orange fruit images dataset collected in the agricultural environment. The results showed that ASE-UNet achieves IoU, Precision, Recall, and F1-scores of 90.03, 96.10, 93.45, and 94.75%, respectively, which outperform other semantic segmentation methods, such as U-Net, PSPNet, and DeepLabv3+. The proposed method effectively solves the problem of low accuracy fruit segmentation models in the agricultural environment and provides technical support for fruit picking robots.

水果采摘机器人需要一个能够准确识别树上水果的强大视觉系统。由于果实重叠和树叶遮挡导致环境复杂,因此在果园中准确分割橙色果实具有挑战性。在这项工作中,我们提出了一种基于 U-Net 架构的图像分割模型 ASE-UNet,可以在复杂环境中实现橘子的精确分割。首先,改进了骨干网络结构,降低了橙子图像的下采样率,从而保留了更多的空间细节信息。其次,我们引入了形状特征提取模块(SFEM),通过提取橙果目标的形状和轮廓信息,增强模型区分水果和背景(如树枝和树叶)的能力。最后,利用注意力机制来抑制跳转连接中的背景通道特征干扰,并改进高层和低层特征的融合。我们在农业环境中收集的橙果图像数据集上对所提出的模型进行了评估。结果表明,ASE-UNet 的 IoU、Precision、Recall 和 F1 分数分别达到 90.03、96.10、93.45 和 94.75%,优于 U-Net、PSPNet 和 DeepLabv3+ 等其他语义分割方法。该方法有效解决了农业环境中水果分割模型准确率低的问题,为水果采摘机器人提供了技术支持。
{"title":"ASE-UNet: An Orange Fruit Segmentation Model in an Agricultural Environment Based on Deep Learning","authors":"Changgeng Yu,&nbsp;Dashi Lin,&nbsp;Chaowen He","doi":"10.3103/S1060992X23040045","DOIUrl":"10.3103/S1060992X23040045","url":null,"abstract":"<p>Fruit picking robot requires a powerful vision system that can accurately identify the fruit on the tree. Accurate segmentation of orange fruit in orchards is challenging because of the complex environments due to the overlapping of fruits and occlusions from foliage. In this work, we proposed an image segmentation model called ASE-UNet based on the U-Net architecture, which can achieve accurate segmentation of oranges in complex environments. Firstly, the backbone network structure is improved to reduce the down-sampling rate of orange fruit images, thereby retaining more spatial detail information. Secondly, we introduced the Shape Feature Extraction Module (SFEM), which at enhancing the ability of the model to distinguish between the fruits and backgrounds, such as branches and leaves, by extracting shape and outline information from the orange fruit target. Finally, an attention mechanism was utilized to suppress background channel feature interference in the skip connection and improve the fusion of high-layer and low-layer features. We evaluate the proposed model on the orange fruit images dataset collected in the agricultural environment. The results showed that ASE-UNet achieves IoU, Precision, Recall, and <i>F</i><sub>1</sub>-scores of 90.03, 96.10, 93.45, and 94.75%, respectively, which outperform other semantic segmentation methods, such as U-Net, PSPNet, and DeepLabv3+. The proposed method effectively solves the problem of low accuracy fruit segmentation models in the agricultural environment and provides technical support for fruit picking robots.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"32 4","pages":"247 - 257"},"PeriodicalIF":1.0,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138988825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of the Luminescent Carbon Nanoparticles for Optical Diagnostics of Structure-Inhomogeneous Objects at the Micro- and Nanoscales 发光碳纳米粒子在微米和纳米尺度结构非均质物体光学诊断中的应用
IF 1 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/S1060992X23040069
O. Angelsky, A. Bekshaev, C. Zenkova, D. Ivanskyi, P. Maksymyak, V. Kryvetsky, Zhebo Chen

The paper offers a short review of the recent works associated with the use of luminescent carbon nanoparticles for the studies of structurally inhomogeneous optical fields carrying a diagnostic information on inhomogeneous material objects. Methods for obtaining nanoparticles with various specially assigned optical and electrical properties, necessary for research and diagnostic tasks, are analyzed. It is shown that the light-induced motion of nanoparticles suspended in the optical field enable detection and localization of the points of intensity minima and phase singularities. Optically-driven nanoparticles can serve as highly-sensitive probes of the object surface inhomogeneities, realizing a contactless version of the atomic-force profilometry. In many cases, the use of nanoparticles makes it possible to circumvent the spatial-resolution limitations of optical systems dictated by the classical wave-optics concepts (Rayleigh limit).

本文简要回顾了近期与使用发光碳纳米粒子研究结构不均匀光场有关的工作,这些光场携带着对不均匀材料物体的诊断信息。分析了获得研究和诊断任务所需的具有各种特殊光学和电学特性的纳米粒子的方法。研究表明,悬浮在光场中的纳米粒子在光的诱导下运动,可以检测和定位强度最小点和相位奇异点。光驱动纳米粒子可以作为物体表面不均匀性的高灵敏度探针,实现非接触式原子力轮廓测量。在许多情况下,使用纳米粒子可以规避经典波光学概念(瑞利极限)对光学系统空间分辨率的限制。
{"title":"Application of the Luminescent Carbon Nanoparticles for Optical Diagnostics of Structure-Inhomogeneous Objects at the Micro- and Nanoscales","authors":"O. Angelsky,&nbsp;A. Bekshaev,&nbsp;C. Zenkova,&nbsp;D. Ivanskyi,&nbsp;P. Maksymyak,&nbsp;V. Kryvetsky,&nbsp;Zhebo Chen","doi":"10.3103/S1060992X23040069","DOIUrl":"10.3103/S1060992X23040069","url":null,"abstract":"<p>The paper offers a short review of the recent works associated with the use of luminescent carbon nanoparticles for the studies of structurally inhomogeneous optical fields carrying a diagnostic information on inhomogeneous material objects. Methods for obtaining nanoparticles with various specially assigned optical and electrical properties, necessary for research and diagnostic tasks, are analyzed. It is shown that the light-induced motion of nanoparticles suspended in the optical field enable detection and localization of the points of intensity minima and phase singularities. Optically-driven nanoparticles can serve as highly-sensitive probes of the object surface inhomogeneities, realizing a contactless version of the atomic-force profilometry. In many cases, the use of nanoparticles makes it possible to circumvent the spatial-resolution limitations of optical systems dictated by the classical wave-optics concepts (Rayleigh limit).</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"32 4","pages":"258 - 274"},"PeriodicalIF":1.0,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139015709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of the Luminescent Carbon Nanoparticles for Optical Diagnostics of Structure-Inhomogeneous Objects at the Micro- and Nanoscales 发光碳纳米粒子在微米和纳米尺度结构非均质物体光学诊断中的应用
IF 0.9 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/s1060992x23040069
O. Angelsky, A. Bekshaev, C. Zenkova, D. Ivanskyi, P. Maksymyak, V. Kryvetsky, Zhebo Chen

Abstract

The paper offers a short review of the recent works associated with the use of luminescent carbon nanoparticles for the studies of structurally inhomogeneous optical fields carrying a diagnostic information on inhomogeneous material objects. Methods for obtaining nanoparticles with various specially assigned optical and electrical properties, necessary for research and diagnostic tasks, are analyzed. It is shown that the light-induced motion of nanoparticles suspended in the optical field enable detection and localization of the points of intensity minima and phase singularities. Optically-driven nanoparticles can serve as highly-sensitive probes of the object surface inhomogeneities, realizing a contactless version of the atomic-force profilometry. In many cases, the use of nanoparticles makes it possible to circumvent the spatial-resolution limitations of optical systems dictated by the classical wave-optics concepts (Rayleigh limit).

摘要 本文简要回顾了近期与使用发光碳纳米粒子研究结构不均匀光场有关的工作,这些光场携带着对不均匀材料物体的诊断信息。分析了获得研究和诊断任务所需的具有各种特殊光学和电学特性的纳米粒子的方法。研究表明,悬浮在光场中的纳米粒子在光的诱导下运动,可以检测和定位强度最小点和相位奇异点。光驱动纳米粒子可以作为物体表面不均匀性的高灵敏度探针,实现非接触式原子力轮廓测量。在许多情况下,使用纳米粒子可以规避经典波光学概念(瑞利极限)对光学系统空间分辨率的限制。
{"title":"Application of the Luminescent Carbon Nanoparticles for Optical Diagnostics of Structure-Inhomogeneous Objects at the Micro- and Nanoscales","authors":"O. Angelsky, A. Bekshaev, C. Zenkova, D. Ivanskyi, P. Maksymyak, V. Kryvetsky, Zhebo Chen","doi":"10.3103/s1060992x23040069","DOIUrl":"https://doi.org/10.3103/s1060992x23040069","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>The paper offers a short review of the recent works associated with the use of luminescent carbon nanoparticles for the studies of structurally inhomogeneous optical fields carrying a diagnostic information on inhomogeneous material objects. Methods for obtaining nanoparticles with various specially assigned optical and electrical properties, necessary for research and diagnostic tasks, are analyzed. It is shown that the light-induced motion of nanoparticles suspended in the optical field enable detection and localization of the points of intensity minima and phase singularities. Optically-driven nanoparticles can serve as highly-sensitive probes of the object surface inhomogeneities, realizing a contactless version of the atomic-force profilometry. In many cases, the use of nanoparticles makes it possible to circumvent the spatial-resolution limitations of optical systems dictated by the classical wave-optics concepts (Rayleigh limit).</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"40 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139029220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Augmentation and Fine Tuning of Convolutional Neural Network during Training for Person Re-Identification in Video Surveillance Systems 用于视频监控系统中人员再识别的卷积神经网络在训练过程中的数据增强和微调
IF 1 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/S1060992X23040124
S. Ye, R. Bohush, H. Chen, S. Ihnatsyeva, S. V. Ablameyko

A new image set, augmentation method and fine in-learning adjustment of convolutional neural networks (CNN) are proposed to increase the accuracy of CNN-based person re-identification. Unlike other known sets, we have used many video frames from external and internal surveillance systems shot at all seasons of the year to make up our PolReID1077 set of person images. The PolReID1077-forming samples are subjected to the cyclic shift, chroma subsampling, and replacement of a fragment by a reduced copy of another sample to get a wider range of images. The learning set generating technique is used to train a CNN. The training is carried out in two stages. The first stage is pre-training using the augmented data. At the second stage the original images are used to carry out fine-tuning of CNN weight coefficients to reduce in-learning losses and increase re-identification efficiency. The approach doesn’t allow the CNN to remember learning sets and decreases the chances of overfitting. Different augmentation methods, data sets and learning techniques are used in the experiments.

我们提出了一种新的图像集、增强方法和卷积神经网络(CNN)的精细内学习调整,以提高基于 CNN 的人物再识别准确率。与其他已知图像集不同的是,我们使用了来自外部和内部监控系统在一年四季拍摄的许多视频帧来构成我们的 PolReID1077 人物图像集。PolReID1077 形成的样本要经过循环移位、色度子采样,并用另一个样本的缩小副本替换一个片段,以获得范围更广的图像。学习集生成技术用于训练 CNN。训练分两个阶段进行。第一阶段是使用增强数据进行预训练。第二阶段使用原始图像对 CNN 权重系数进行微调,以减少学习中的损失,提高重新识别效率。这种方法不会让 CNN 记住学习集,降低了过度拟合的几率。实验中使用了不同的增强方法、数据集和学习技术。
{"title":"Data Augmentation and Fine Tuning of Convolutional Neural Network during Training for Person Re-Identification in Video Surveillance Systems","authors":"S. Ye,&nbsp;R. Bohush,&nbsp;H. Chen,&nbsp;S. Ihnatsyeva,&nbsp;S. V. Ablameyko","doi":"10.3103/S1060992X23040124","DOIUrl":"10.3103/S1060992X23040124","url":null,"abstract":"<p>A new image set, augmentation method and fine in-learning adjustment of convolutional neural networks (CNN) are proposed to increase the accuracy of CNN-based person re-identification. Unlike other known sets, we have used many video frames from external and internal surveillance systems shot at all seasons of the year to make up our PolReID1077 set of person images. The PolReID1077-forming samples are subjected to the cyclic shift, chroma subsampling, and replacement of a fragment by a reduced copy of another sample to get a wider range of images. The learning set generating technique is used to train a CNN. The training is carried out in two stages. The first stage is pre-training using the augmented data. At the second stage the original images are used to carry out fine-tuning of CNN weight coefficients to reduce in-learning losses and increase re-identification efficiency. The approach doesn’t allow the CNN to remember learning sets and decreases the chances of overfitting. Different augmentation methods, data sets and learning techniques are used in the experiments.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"32 4","pages":"233 - 246"},"PeriodicalIF":1.0,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139015710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Review on Pest Detection and Classification in Agricultural Environments Using Image-Based Deep Learning Models and Its Challenges 基于图像的深度学习模型在农业环境中的害虫检测和分类及其挑战综述
IF 1 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/S1060992X23040112
P. Venkatasaichandrakanth, M. Iyapparaja

Agronomic pests cause agriculture to incur financial losses because they diminish production, which lowers revenue. Pest control, essential to lowering these losses, involves identifying and eliminating this risk. Since it enables management to take place, identification is the fundamental component of control. Utilizing the pest’s traits, visual identification is done. These characteristics differ between animals and are intrinsic. Since identification is so difficult, specialists in the field handle most of the work, which concentrates the information. Researchers have developed various techniques for predicting crop diseases using images of infected leaves. While progress has been made in identifying plant diseases using different models and methods, new advancements and discussions still offer room for improvement. Technology can significantly improve global crop production, and large datasets can be used to train models and approaches that uncover new and improved methods for detecting plant diseases and addressing low-yield issues. The effectiveness of machine learning and deep learning for identifying and categorizing pests has been confirmed by prior research. This paper thoroughly examines and critically evaluates the many strategies and methodologies used to classify and detect pests or insects using deep learning. The paper examines the benefits and drawbacks of various methodologies and considers potential problems with insect detection via image processing. The paper concludes by providing an analysis and outlook on the future direction of pest detection and classification using deep learning on plants like peanuts.

农艺害虫会造成农业经济损失,因为它们会降低产量,减少收入。害虫控制是降低这些损失的关键,包括识别和消除这种风险。要进行管理,识别是控制的基本要素。利用害虫的特征进行目视识别。这些特征因动物而异,是内在的。由于识别难度很大,因此大部分工作都由现场的专家来完成,这样可以集中信息。研究人员已开发出各种技术,利用受感染叶片的图像预测作物病害。虽然利用不同的模型和方法在识别植物病害方面取得了进展,但新的进步和讨论仍然提供了改进的空间。技术可以极大地提高全球作物产量,大量数据集可用来训练模型和方法,从而发现新的改良方法来检测植物病害和解决低产问题。机器学习和深度学习在识别害虫并对其进行分类方面的有效性已被先前的研究证实。本文深入研究并批判性评估了利用深度学习对害虫或昆虫进行分类和检测的多种策略和方法。本文研究了各种方法的优点和缺点,并考虑了通过图像处理进行昆虫检测的潜在问题。最后,本文对使用深度学习对花生等植物进行害虫检测和分类的未来方向进行了分析和展望。
{"title":"Review on Pest Detection and Classification in Agricultural Environments Using Image-Based Deep Learning Models and Its Challenges","authors":"P. Venkatasaichandrakanth,&nbsp;M. Iyapparaja","doi":"10.3103/S1060992X23040112","DOIUrl":"10.3103/S1060992X23040112","url":null,"abstract":"<p>Agronomic pests cause agriculture to incur financial losses because they diminish production, which lowers revenue. Pest control, essential to lowering these losses, involves identifying and eliminating this risk. Since it enables management to take place, identification is the fundamental component of control. Utilizing the pest’s traits, visual identification is done. These characteristics differ between animals and are intrinsic. Since identification is so difficult, specialists in the field handle most of the work, which concentrates the information. Researchers have developed various techniques for predicting crop diseases using images of infected leaves. While progress has been made in identifying plant diseases using different models and methods, new advancements and discussions still offer room for improvement. Technology can significantly improve global crop production, and large datasets can be used to train models and approaches that uncover new and improved methods for detecting plant diseases and addressing low-yield issues. The effectiveness of machine learning and deep learning for identifying and categorizing pests has been confirmed by prior research. This paper thoroughly examines and critically evaluates the many strategies and methodologies used to classify and detect pests or insects using deep learning. The paper examines the benefits and drawbacks of various methodologies and considers potential problems with insect detection via image processing. The paper concludes by providing an analysis and outlook on the future direction of pest detection and classification using deep learning on plants like peanuts.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"32 4","pages":"295 - 309"},"PeriodicalIF":1.0,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139013597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Plant Foliage Disease Diagnosis Using Light-Weight Efficient Sequential CNN Model 利用轻量高效序列 CNN 模型诊断植物叶面病害
IF 1 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/S1060992X23040100
Raj Kumar, Anuradha Chug, Amit Prakash Singh

The Precise and prompt identification of plant pathogens is essential to keep agricultural losses as low as possible. In recent time, deep convolution neural networks have seen an exponential growth in their use in phytopathology due to its capacity for rapid and precise disease identification. However, deep convolutional neural network needs a lot of processing power because of its intricate structure consisting of a large stack of layers and millions of trainable parameters which makes them inedquate for light computing devices. In this article, authors have introduced a novel light-weight sequential CNN architecture for the diagnosis of leaf diseases. The suggested CNN approach contains fewer layers and around 70% less attributes than pre-trained CNN-based approaches. For the experiments and performance evaluation, authors have chosen a benchmark public dataset consisting of 7012 images of tomato and potato leaves affected with early and late blight diseases. The performance of the proposed architecture is compared against three recent priorly trained CNN architectures such as ResNet-50, VGG-16 and MobileNet-V2. The average accuracy percentage reported by the proposed architecture is 98.02 and the time consumed in training is also much better than the existing priorly trained CNN architectures. The experimental findings clearly demonstrate that the suggested approach outperforms the recent existing trained CNN approaches and has a very less number of layers and parameters which significantly reduces the amount of computing resources and time to train the model which could be a better choice for mobile-based real-time plant disease diagnosis applications.

要尽可能减少农业损失,就必须准确、迅速地识别植物病原体。近来,深度卷积神经网络在植物病理学领域的应用呈指数级增长,这得益于其快速、精确的病害识别能力。然而,深度卷积神经网络需要大量的处理能力,因为其复杂的结构包括大量的层堆和数百万个可训练参数,这使其不适合轻型计算设备。在本文中,作者介绍了一种新型轻量级顺序 CNN 架构,用于诊断叶片疾病。与基于预训练的 CNN 方法相比,建议的 CNN 方法包含的层数更少,属性也减少了约 70%。在实验和性能评估中,作者选择了一个基准公共数据集,该数据集由 7012 张受早疫病和晚疫病影响的番茄和马铃薯叶片图像组成。所提架构的性能与三种最新的预先训练过的 CNN 架构(如 ResNet-50、VGG-16 和 MobileNet-V2)进行了比较。所提架构的平均准确率为 98.02%,而训练所消耗的时间也大大优于现有的事先训练过的 CNN 架构。实验结果清楚地表明,所建议的方法优于最近现有的经过训练的 CNN 方法,而且层数和参数都非常少,大大减少了计算资源和训练模型的时间,是基于移动的实时植物病害诊断应用的更好选择。
{"title":"Plant Foliage Disease Diagnosis Using Light-Weight Efficient Sequential CNN Model","authors":"Raj Kumar,&nbsp;Anuradha Chug,&nbsp;Amit Prakash Singh","doi":"10.3103/S1060992X23040100","DOIUrl":"10.3103/S1060992X23040100","url":null,"abstract":"<p>The Precise and prompt identification of plant pathogens is essential to keep agricultural losses as low as possible. In recent time, deep convolution neural networks have seen an exponential growth in their use in phytopathology due to its capacity for rapid and precise disease identification. However, deep convolutional neural network needs a lot of processing power because of its intricate structure consisting of a large stack of layers and millions of trainable parameters which makes them inedquate for light computing devices. In this article, authors have introduced a novel light-weight sequential CNN architecture for the diagnosis of leaf diseases. The suggested CNN approach contains fewer layers and around 70% less attributes than pre-trained CNN-based approaches. For the experiments and performance evaluation, authors have chosen a benchmark public dataset consisting of 7012 images of tomato and potato leaves affected with early and late blight diseases. The performance of the proposed architecture is compared against three recent priorly trained CNN architectures such as ResNet-50, VGG-16 and MobileNet-V2. The average accuracy percentage reported by the proposed architecture is 98.02 and the time consumed in training is also much better than the existing priorly trained CNN architectures. The experimental findings clearly demonstrate that the suggested approach outperforms the recent existing trained CNN approaches and has a very less number of layers and parameters which significantly reduces the amount of computing resources and time to train the model which could be a better choice for mobile-based real-time plant disease diagnosis applications.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"32 4","pages":"331 - 345"},"PeriodicalIF":1.0,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139023049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Augmentation and Fine Tuning of Convolutional Neural Network during Training for Person Re-Identification in Video Surveillance Systems 用于视频监控系统中人员再识别的卷积神经网络在训练过程中的数据增强和微调
IF 0.9 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/s1060992x23040124
S. Ye, R. Bohush, H. Chen, S. Ihnatsyeva, S. V. Ablameyko

Abstract

A new image set, augmentation method and fine in-learning adjustment of convolutional neural networks (CNN) are proposed to increase the accuracy of CNN-based person re-identification. Unlike other known sets, we have used many video frames from external and internal surveillance systems shot at all seasons of the year to make up our PolReID1077 set of person images. The PolReID1077-forming samples are subjected to the cyclic shift, chroma subsampling, and replacement of a fragment by a reduced copy of another sample to get a wider range of images. The learning set generating technique is used to train a CNN. The training is carried out in two stages. The first stage is pre-training using the augmented data. At the second stage the original images are used to carry out fine-tuning of CNN weight coefficients to reduce in-learning losses and increase re-identification efficiency. The approach doesn’t allow the CNN to remember learning sets and decreases the chances of overfitting. Different augmentation methods, data sets and learning techniques are used in the experiments.

摘要 我们提出了一种新的图像集、增强方法和卷积神经网络(CNN)的精细内学习调整,以提高基于 CNN 的人物再识别的准确性。与其他已知图像集不同的是,我们使用了许多来自外部和内部监控系统的一年四季拍摄的视频帧来组成我们的 PolReID1077 人物图像集。PolReID1077 形成的样本要经过循环移位、色度子采样,并用另一个样本的缩小副本替换一个片段,以获得范围更广的图像。学习集生成技术用于训练 CNN。训练分两个阶段进行。第一阶段是使用增强数据进行预训练。第二阶段使用原始图像对 CNN 权重系数进行微调,以减少学习中的损失,提高重新识别效率。这种方法不会让 CNN 记住学习集,降低了过度拟合的几率。实验中使用了不同的增强方法、数据集和学习技术。
{"title":"Data Augmentation and Fine Tuning of Convolutional Neural Network during Training for Person Re-Identification in Video Surveillance Systems","authors":"S. Ye, R. Bohush, H. Chen, S. Ihnatsyeva, S. V. Ablameyko","doi":"10.3103/s1060992x23040124","DOIUrl":"https://doi.org/10.3103/s1060992x23040124","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>A new image set, augmentation method and fine in-learning adjustment of convolutional neural networks (CNN) are proposed to increase the accuracy of CNN-based person re-identification. Unlike other known sets, we have used many video frames from external and internal surveillance systems shot at all seasons of the year to make up our PolReID1077 set of person images. The PolReID1077-forming samples are subjected to the cyclic shift, chroma subsampling, and replacement of a fragment by a reduced copy of another sample to get a wider range of images. The learning set generating technique is used to train a CNN. The training is carried out in two stages. The first stage is pre-training using the augmented data. At the second stage the original images are used to carry out fine-tuning of CNN weight coefficients to reduce in-learning losses and increase re-identification efficiency. The approach doesn’t allow the CNN to remember learning sets and decreases the chances of overfitting. Different augmentation methods, data sets and learning techniques are used in the experiments.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"120 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139029213","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the Efficiency of Using U-Net, Erf-Net and DeepLabV3 Architectures in Inverse Lithography-based 90-nm Photomask Generation 研究在基于反向光刻技术的 90 纳米光掩膜生成中使用 U-Net、Erf-Net 和 DeepLabV3 架构的效率
IF 1 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/S1060992X23040094
I. M. Karandashev, G. S. Teplov, A. A. Karmanov, V. V. Keremet, A. V. Kuzovkov

The paper deals with the inverse problem of computational lithography. We turn to deep neural network algorithms to compute photomask topologies. The chief goal of the research is to understand how efficient the neural net architectures such as U-net, Erf-Net and Deep Lab v.3, as well as built-in Calibre Workbench algorithms, can be in tackling inverse lithography problems. Specially generated and marked data sets are used to train the artificial neural nets. Calibre EDA software is used to generate haphazard patterns for a 90 nm transistor gate mask. The accuracy and speed parameters are used for the comparison. The edge placement error (EPE) and intersection over union (IOU) are used as metrics. The use of the neural nets allows two orders of magnitude reduction of the mask computation time, with accuracy keeping to 92% for the IOU metric.

摘要 本文涉及计算光刻的逆问题。我们采用深度神经网络算法来计算光掩膜拓扑结构。研究的主要目标是了解 U-net、Erf-Net 和 Deep Lab v.3 等神经网络架构以及 Calibre Workbench 内置算法在处理反向光刻问题时的效率。专门生成和标记的数据集用于训练人工神经网络。Calibre EDA 软件用于生成 90 纳米晶体管栅极掩模的杂乱图案。准确度和速度参数用于比较。边缘放置误差 (EPE) 和交集大于联合 (IOU) 被用作衡量标准。使用神经网络可将掩膜计算时间缩短两个数量级,IOU 指标的准确率保持在 92%。
{"title":"Investigating the Efficiency of Using U-Net, Erf-Net and DeepLabV3 Architectures in Inverse Lithography-based 90-nm Photomask Generation","authors":"I. M. Karandashev,&nbsp;G. S. Teplov,&nbsp;A. A. Karmanov,&nbsp;V. V. Keremet,&nbsp;A. V. Kuzovkov","doi":"10.3103/S1060992X23040094","DOIUrl":"10.3103/S1060992X23040094","url":null,"abstract":"<p>The paper deals with the inverse problem of computational lithography. We turn to deep neural network algorithms to compute photomask topologies. The chief goal of the research is to understand how efficient the neural net architectures such as U-net, Erf-Net and Deep Lab v.3, as well as built-in Calibre Workbench algorithms, can be in tackling inverse lithography problems. Specially generated and marked data sets are used to train the artificial neural nets. Calibre EDA software is used to generate haphazard patterns for a 90 nm transistor gate mask. The accuracy and speed parameters are used for the comparison. The edge placement error (EPE) and intersection over union (IOU) are used as metrics. The use of the neural nets allows two orders of magnitude reduction of the mask computation time, with accuracy keeping to 92% for the IOU metric.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"32 4","pages":"219 - 225"},"PeriodicalIF":1.0,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.3103/S1060992X23040094.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139029348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Plant Foliage Disease Diagnosis Using Light-Weight Efficient Sequential CNN Model 利用轻量高效序列 CNN 模型诊断植物叶面病害
IF 0.9 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/s1060992x23040100
Raj Kumar, Anuradha Chug, Amit Prakash Singh

Abstract

The Precise and prompt identification of plant pathogens is essential to keep agricultural losses as low as possible. In recent time, deep convolution neural networks have seen an exponential growth in their use in phytopathology due to its capacity for rapid and precise disease identification. However, deep convolutional neural network needs a lot of processing power because of its intricate structure consisting of a large stack of layers and millions of trainable parameters which makes them inedquate for light computing devices. In this article, authors have introduced a novel light-weight sequential CNN architecture for the diagnosis of leaf diseases. The suggested CNN approach contains fewer layers and around 70% less attributes than pre-trained CNN-based approaches. For the experiments and performance evaluation, authors have chosen a benchmark public dataset consisting of 7012 images of tomato and potato leaves affected with early and late blight diseases. The performance of the proposed architecture is compared against three recent priorly trained CNN architectures such as ResNet-50, VGG-16 and MobileNet-V2. The average accuracy percentage reported by the proposed architecture is 98.02 and the time consumed in training is also much better than the existing priorly trained CNN architectures. The experimental findings clearly demonstrate that the suggested approach outperforms the recent existing trained CNN approaches and has a very less number of layers and parameters which significantly reduces the amount of computing resources and time to train the model which could be a better choice for mobile-based real-time plant disease diagnosis applications.

摘要要尽可能减少农业损失,就必须准确、迅速地识别植物病原体。近年来,深度卷积神经网络在植物病理学领域的应用呈指数级增长,因为它具有快速、精确识别病害的能力。然而,深度卷积神经网络需要大量的处理能力,因为其复杂的结构包括大量的层堆和数百万个可训练参数,这使其不适合轻型计算设备。在本文中,作者介绍了一种新型轻量级顺序 CNN 架构,用于诊断叶片疾病。与基于预训练的 CNN 方法相比,建议的 CNN 方法包含的层数更少,属性也减少了约 70%。在实验和性能评估中,作者选择了一个基准公共数据集,该数据集由 7012 张受早疫病和晚疫病影响的番茄和马铃薯叶片图像组成。所提架构的性能与三种最新的预先训练过的 CNN 架构(如 ResNet-50、VGG-16 和 MobileNet-V2)进行了比较。所提架构的平均准确率为 98.02%,而训练所消耗的时间也大大优于现有的事先训练过的 CNN 架构。实验结果清楚地表明,所建议的方法优于最近现有的经过训练的 CNN 方法,而且层数和参数都非常少,大大减少了计算资源和训练模型的时间,是基于移动的实时植物病害诊断应用的更好选择。
{"title":"Plant Foliage Disease Diagnosis Using Light-Weight Efficient Sequential CNN Model","authors":"Raj Kumar, Anuradha Chug, Amit Prakash Singh","doi":"10.3103/s1060992x23040100","DOIUrl":"https://doi.org/10.3103/s1060992x23040100","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>The Precise and prompt identification of plant pathogens is essential to keep agricultural losses as low as possible. In recent time, deep convolution neural networks have seen an exponential growth in their use in phytopathology due to its capacity for rapid and precise disease identification. However, deep convolutional neural network needs a lot of processing power because of its intricate structure consisting of a large stack of layers and millions of trainable parameters which makes them inedquate for light computing devices. In this article, authors have introduced a novel light-weight sequential CNN architecture for the diagnosis of leaf diseases. The suggested CNN approach contains fewer layers and around 70% less attributes than pre-trained CNN-based approaches. For the experiments and performance evaluation, authors have chosen a benchmark public dataset consisting of 7012 images of tomato and potato leaves affected with early and late blight diseases. The performance of the proposed architecture is compared against three recent priorly trained CNN architectures such as ResNet-50, VGG-16 and MobileNet-V2. The average accuracy percentage reported by the proposed architecture is 98.02 and the time consumed in training is also much better than the existing priorly trained CNN architectures. The experimental findings clearly demonstrate that the suggested approach outperforms the recent existing trained CNN approaches and has a very less number of layers and parameters which significantly reduces the amount of computing resources and time to train the model which could be a better choice for mobile-based real-time plant disease diagnosis applications.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"22 1","pages":""},"PeriodicalIF":0.9,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139029382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of Prediction Models for Vulnerable Road User Accident Severity 开发易受伤害道路使用者事故严重程度预测模型
IF 1 Q4 OPTICS Pub Date : 2023-12-22 DOI: 10.3103/S1060992X23040082
Saurabh Jaglan, Sunita Kumari, Praveen Aggarwal

Road traffic accidents are considered a significant problem which ruins the life of many people and also causes major economic losses. So, this issue is considered a hot research topic, and many researchers all over the world are focusing on developing a solution to this most challenging problem. Traditionally the accident spots are detected by means of transportation experts, and following that, some of the statistical models such as linear and nonlinear regression were used for accident severity prediction. However, these traditional approaches do not have the capability to analyze the relationship between the influential factor and accident severity. To address this issue, an Artificial Neural Network (ANN) classifier based vulnerable accident prediction model is proposed in this current research. Initially, the past accident data over the past period of years is collected from a specified area. The acquired data consists of a variable factor related to road infrastructure, weather condition, area of the accident, type of injury and driving characteristics. Then, to standardize the raw input data, min-max normalization is used as a pre-processing technique. The pre-processed is sent for the feature selection process in which essential features are selected by correlating the variable factor with accident severity prediction. Following that, the dimension of the features is reduced using Latent Sematic Index (LSI). Finally, the reduced features are fetched into the ANN classifier for predicting the severity of accidents such as low, medium and high. Simulation analysis of the proposed accident prediction model is carried out by evaluating some of the performance metrics for three datasets. Accuracy, error, specificity, recall and precision attained for the proposed model using dataset 1 is 96.3, 0.03, 98 and 98%. Through this proposed vulnerable accident prediction model, the severity of accidents can be analyzed effectively, and road safety levels can be improved.

道路交通事故被认为是一个重大问题,它毁掉了许多人的生活,也造成了重大的经济损失。因此,这个问题被认为是一个热门研究课题,世界各地的许多研究人员都在集中精力为这个最具挑战性的问题开发解决方案。传统上,事故点是通过交通专家来检测的,之后,一些统计模型(如线性和非线性回归)被用于事故严重性预测。然而,这些传统方法无法分析影响因素与事故严重性之间的关系。针对这一问题,本研究提出了一种基于人工神经网络(ANN)分类器的易损事故预测模型。首先,从指定区域收集过去几年的事故数据。获取的数据包括与道路基础设施、天气状况、事故发生区域、伤害类型和驾驶特征相关的可变因素。然后,为了使原始输入数据标准化,使用了最小-最大归一化作为预处理技术。预处理后的数据将被送往特征选择过程,在此过程中,通过将可变因素与事故严重性预测相关联来选择基本特征。然后,使用潜在语义索引(LSI)降低特征的维度。最后,将缩减后的特征提取到 ANN 分类器中,用于预测事故的严重程度,如低、中和高。通过评估三个数据集的一些性能指标,对所提出的事故预测模型进行了仿真分析。在数据集 1 中,所提模型的准确率、误差、特异性、召回率和精确率分别为 96.3%、0.03%、98% 和 98%。通过所提出的易损事故预测模型,可以有效地分析事故的严重程度,提高道路安全水平。
{"title":"Development of Prediction Models for Vulnerable Road User Accident Severity","authors":"Saurabh Jaglan,&nbsp;Sunita Kumari,&nbsp;Praveen Aggarwal","doi":"10.3103/S1060992X23040082","DOIUrl":"10.3103/S1060992X23040082","url":null,"abstract":"<p>Road traffic accidents are considered a significant problem which ruins the life of many people and also causes major economic losses. So, this issue is considered a hot research topic, and many researchers all over the world are focusing on developing a solution to this most challenging problem. Traditionally the accident spots are detected by means of transportation experts, and following that, some of the statistical models such as linear and nonlinear regression were used for accident severity prediction. However, these traditional approaches do not have the capability to analyze the relationship between the influential factor and accident severity. To address this issue, an Artificial Neural Network (ANN) classifier based vulnerable accident prediction model is proposed in this current research. Initially, the past accident data over the past period of years is collected from a specified area. The acquired data consists of a variable factor related to road infrastructure, weather condition, area of the accident, type of injury and driving characteristics. Then, to standardize the raw input data, min-max normalization is used as a pre-processing technique. The pre-processed is sent for the feature selection process in which essential features are selected by correlating the variable factor with accident severity prediction. Following that, the dimension of the features is reduced using Latent Sematic Index (LSI). Finally, the reduced features are fetched into the ANN classifier for predicting the severity of accidents such as low, medium and high. Simulation analysis of the proposed accident prediction model is carried out by evaluating some of the performance metrics for three datasets. Accuracy, error, specificity, recall and precision attained for the proposed model using dataset 1 is 96.3, 0.03, 98 and 98%. Through this proposed vulnerable accident prediction model, the severity of accidents can be analyzed effectively, and road safety levels can be improved.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"32 4","pages":"346 - 363"},"PeriodicalIF":1.0,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139013537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Optical Memory and Neural Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1