首页 > 最新文献

Optical Memory and Neural Networks最新文献

英文 中文
Deep Attention-Multistage GAN with Sparse Dense Fusion R-CNN for High-Resolution and Object Detection in Surveillance System 深度注意-多阶段GAN与稀疏密集融合R-CNN用于监控系统的高分辨率和目标检测
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X2560003X
Anu Yadav,  Ela Kumar

Video surveillance utilization has developed significantly in sectors such as traffic monitoring, private institution protection, and landmark protection. Identifying an object in a captured surveillance image is difficult because of the poor excellence of the images. The quality of Low-Resolution (LR) images can be enhanced using an image Super-Resolution (SR) reconstruction method. Sophisticated Deep learning methods have been utilized in order to attain state-of-the-art performance in SR. Nevertheless, these techniques are typically prone to losing essential information and perform poorly on complex computations. To overcome these challenges, this research develop a deep Attention-Multistage Generative Adversarial Network (DA-MGAN) for image super-resolution and integrate SDR (Sparse Dense fusion R-CNN) with Non-Maximum Suppression (NMS) to enhance object detection accuracy in surveillance images. DA-MGAN is used to generate high-resolution surveillance images by utilizing attention mechanisms for improved feature extraction and integrating a multistage GAN that progressively enhances the image quality at each stage. After image resolution, Sparse Dense Fusion R-CNN (SDR) is used for object detection in super-resolved images to improve feature extraction through Sparse Dense Fusion. The R-CNN leverages these enhanced features to accurately detect and segment objects at the pixel level. Subsequently, Non-Maximum Suppression (NMS) was applied to improve localization by eliminating overlapping bounding boxes and minimizing false positives. This integrated method boosts overall detection precision and reliability in real time surveillance scenarios. The proposed model achieves a Super Resolution Error Rate (SRER) of 0.19%, a Bit Error Rate (BER) of 0.125%, a Packet Error Rate (PER) of 0.0990%, and a Deep End-to-End Image Metric (DEEIM) of 0.04963%, showcasing its superior performance. In contrast with existing methodologies, these results highlight the effectiveness of the suggested approach in reducing error rates and enhancing image quality metrics. As a result, these methods are ideally suited for real-time applications, particularly in high-resolution scenarios and object detection within surveillance systems.

视频监控在交通监控、私人机构保护和地标保护等领域的应用得到了显著发展。由于捕获的监控图像质量不佳,很难识别目标。采用图像超分辨率(SR)重建方法可以提高低分辨率(LR)图像的质量。为了在sr中获得最先进的性能,已经使用了复杂的深度学习方法。然而,这些技术通常容易丢失基本信息,并且在复杂的计算中表现不佳。为了克服这些挑战,本研究开发了一种用于图像超分辨率的深度注意-多阶段生成对抗网络(DA-MGAN),并将SDR(稀疏密集融合R-CNN)与非最大抑制(NMS)相结合,以提高监视图像中的目标检测精度。DA-MGAN通过利用注意力机制来改进特征提取,并集成多阶段GAN来逐步提高每阶段的图像质量,从而生成高分辨率的监控图像。图像分辨率完成后,利用SDR (Sparse Dense Fusion R-CNN)对超分辨率图像进行目标检测,通过Sparse Dense Fusion改进特征提取。R-CNN利用这些增强的功能来准确地检测和分割像素级的对象。随后,采用非最大抑制(NMS)方法,通过消除重叠的边界框和减少误报来改进定位。这种集成方法在实时监控场景中提高了整体检测精度和可靠性。该模型的超分辨错误率(SRER)为0.19%,误码率(BER)为0.125%,包错误率(PER)为0.0990%,深度端到端图像度量(DEEIM)为0.04963%,显示了其优越的性能。与现有的方法相比,这些结果突出了所建议的方法在降低错误率和提高图像质量指标方面的有效性。因此,这些方法非常适合实时应用,特别是在监控系统中的高分辨率场景和目标检测中。
{"title":"Deep Attention-Multistage GAN with Sparse Dense Fusion R-CNN for High-Resolution and Object Detection in Surveillance System","authors":"Anu Yadav,&nbsp; Ela Kumar","doi":"10.3103/S1060992X2560003X","DOIUrl":"10.3103/S1060992X2560003X","url":null,"abstract":"<p>Video surveillance utilization has developed significantly in sectors such as traffic monitoring, private institution protection, and landmark protection. Identifying an object in a captured surveillance image is difficult because of the poor excellence of the images. The quality of Low-Resolution (LR) images can be enhanced using an image Super-Resolution (SR) reconstruction method. Sophisticated Deep learning methods have been utilized in order to attain state-of-the-art performance in SR. Nevertheless, these techniques are typically prone to losing essential information and perform poorly on complex computations. To overcome these challenges, this research develop a deep Attention-Multistage Generative Adversarial Network (DA-MGAN) for image super-resolution and integrate SDR (Sparse Dense fusion R-CNN) with Non-Maximum Suppression (NMS) to enhance object detection accuracy in surveillance images. DA-MGAN is used to generate high-resolution surveillance images by utilizing attention mechanisms for improved feature extraction and integrating a multistage GAN that progressively enhances the image quality at each stage. After image resolution, Sparse Dense Fusion R-CNN (SDR) is used for object detection in super-resolved images to improve feature extraction through Sparse Dense Fusion. The R-CNN leverages these enhanced features to accurately detect and segment objects at the pixel level. Subsequently, Non-Maximum Suppression (NMS) was applied to improve localization by eliminating overlapping bounding boxes and minimizing false positives. This integrated method boosts overall detection precision and reliability in real time surveillance scenarios. The proposed model achieves a Super Resolution Error Rate (SRER) of 0.19%, a Bit Error Rate (BER) of 0.125%, a Packet Error Rate (PER) of 0.0990%, and a Deep End-to-End Image Metric (DEEIM) of 0.04963%, showcasing its superior performance. In contrast with existing methodologies, these results highlight the effectiveness of the suggested approach in reducing error rates and enhancing image quality metrics. As a result, these methods are ideally suited for real-time applications, particularly in high-resolution scenarios and object detection within surveillance systems.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"566 - 580"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unsupervised Background Estimation Using a Neural Integrator 基于神经积分器的无监督背景估计
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X25600156
Shiva Kamkar, Hamid Abrishami Moghaddam, Wolfram Erlhagen

Background estimation is an important part of many computer vision applications. However, it is a challenging task due to illumination changes, camouflage, occlusion, dynamic background, rain or snow fall, and shadows. In this paper, we propose a method to predict the background of videos recorded by fixed cameras. The proposed algorithm is unsupervised and online. It takes inspiration from the processing mechanisms of neural integrator circuits in recurrently connected networks. The neural activities of three distinct integrators, each responsible for processing a color channel in L * a * b color space, are updated according to the recent changes of the scene covering both spatial and temporal aspects. The maxima of the evolving activity distributions in color space are used to predict the background color value of each pixel. Evaluation results demonstrate that the proposed method outperforms several recent competitors on the Scene Background Initialization (SBI) and LASIESTA datasets, based on mean squared error (MSE) metrics.

背景估计是许多计算机视觉应用的重要组成部分。然而,由于光照变化、伪装、遮挡、动态背景、雨雪降落和阴影,这是一项具有挑战性的任务。本文提出了一种预测固定摄像机拍摄的视频背景的方法。该算法是无监督的在线算法。它的灵感来自于递归连接网络中神经积分器电路的处理机制。三个不同的积分器的神经活动,每个负责处理L * a * b颜色空间中的颜色通道,根据场景的最近变化进行更新,包括空间和时间方面。利用颜色空间中不断变化的活动分布的最大值来预测每个像素的背景颜色值。评估结果表明,基于均方误差(MSE)指标,该方法在场景背景初始化(SBI)和LASIESTA数据集上优于最近的几个竞争对手。
{"title":"Unsupervised Background Estimation Using a Neural Integrator","authors":"Shiva Kamkar,&nbsp;Hamid Abrishami Moghaddam,&nbsp;Wolfram Erlhagen","doi":"10.3103/S1060992X25600156","DOIUrl":"10.3103/S1060992X25600156","url":null,"abstract":"<p>Background estimation is an important part of many computer vision applications. However, it is a challenging task due to illumination changes, camouflage, occlusion, dynamic background, rain or snow fall, and shadows. In this paper, we propose a method to predict the background of videos recorded by fixed cameras. The proposed algorithm is unsupervised and online. It takes inspiration from the processing mechanisms of neural integrator circuits in recurrently connected networks. The neural activities of three distinct integrators, each responsible for processing a color channel in <i>L</i> * <i>a</i> * <i>b</i> color space, are updated according to the recent changes of the scene covering both spatial and temporal aspects. The maxima of the evolving activity distributions in color space are used to predict the background color value of each pixel. Evaluation results demonstrate that the proposed method outperforms several recent competitors on the Scene Background Initialization (SBI) and LASIESTA datasets, based on mean squared error (MSE) metrics.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"581 - 591"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent Recommendation of Ideological and Political Course Content Based on the BPNN Algorithm Improved by Attention Mechanism 基于注意力机制改进的BPNN算法的思想政治课内容智能推荐
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X25601484
Yang Yuan,  Lixia Li

This paper improved the back-propagation neural network (BPNN) algorithm for recommending ideological and political courses by a squeeze-and-excitation network (SEnet) and a multi-head attention mechanism. Simulation experiments were conducted to compare the improved algorithm with two other recommendation algorithms, followed by ablation experiments. Moreover, the effectiveness of the recommendation algorithm was tested in actual teaching of ideological and political courses. The results demonstrated that the improved BPNN algorithm outperformed others and the SEnet and the multi-head attention mechanism significantly enhanced the accuracy of recommendations. The algorithm effectively improved students’ performance in ideological and political courses and was satisfied by the majority of students.

本文采用挤压激励网络(SEnet)和多头注意机制对反向传播神经网络(BPNN)思想政治课推荐算法进行了改进。通过仿真实验将改进算法与其他两种推荐算法进行比较,并进行烧蚀实验。并在实际的思想政治课教学中验证了推荐算法的有效性。结果表明,改进的BPNN算法优于其他算法,SEnet和多头注意机制显著提高了推荐的准确性。该算法有效地提高了学生在思想政治课上的学习成绩,得到了广大学生的满意。
{"title":"Intelligent Recommendation of Ideological and Political Course Content Based on the BPNN Algorithm Improved by Attention Mechanism","authors":"Yang Yuan,&nbsp; Lixia Li","doi":"10.3103/S1060992X25601484","DOIUrl":"10.3103/S1060992X25601484","url":null,"abstract":"<p>This paper improved the back-propagation neural network (BPNN) algorithm for recommending ideological and political courses by a squeeze-and-excitation network (SEnet) and a multi-head attention mechanism. Simulation experiments were conducted to compare the improved algorithm with two other recommendation algorithms, followed by ablation experiments. Moreover, the effectiveness of the recommendation algorithm was tested in actual teaching of ideological and political courses. The results demonstrated that the improved BPNN algorithm outperformed others and the SEnet and the multi-head attention mechanism significantly enhanced the accuracy of recommendations. The algorithm effectively improved students’ performance in ideological and political courses and was satisfied by the majority of students.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"485 - 491"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Silicon Microring Resonator-Based All-Optical Reversible 2 : 1 Multiplexer: Numerical Analysis 基于硅微环谐振器的全光可逆2:1多路复用器:数值分析
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X25700237
Sankarapuram Siva Saravana Kumar, Jakusani Shirisha, Sultan Mahaboob Basha, Kalimuddin Mondal, Vankadari Nagaraju, Alagar Raja, Manjur Hossain

Future computing systems will likely require reversible logic gates since, in the best of circumstances; they are known to provide zero power dissipation. Reversible logic gates also have the advantage of minimizing quantum costs and unused outputs. In this work, reversible 2 : 1 multiplexer has been implemented using all-optical microring resonator. Reversible multiplexer is simulated in MATLAB at 260 Gbps. Some performance indicating factors such as “extinction ratio”, “contrast ratio”, “relative eye opening”, etc are analyzed and developed. The chosen optimal parameters can be validated practically.

未来的计算系统可能需要可逆逻辑门,因为在最好的情况下;众所周知,它们提供零功耗。可逆逻辑门还具有最小化量子成本和未使用输出的优点。本文采用全光微环谐振器实现了可逆的2:1多路复用器。在MATLAB中对260 Gbps的可逆复用器进行了仿真。对“消光比”、“对比度”、“相对开眼度”等性能指标进行了分析和开发。所选择的最优参数可以在实际中得到验证。
{"title":"Silicon Microring Resonator-Based All-Optical Reversible 2 : 1 Multiplexer: Numerical Analysis","authors":"Sankarapuram Siva Saravana Kumar,&nbsp;Jakusani Shirisha,&nbsp;Sultan Mahaboob Basha,&nbsp;Kalimuddin Mondal,&nbsp;Vankadari Nagaraju,&nbsp;Alagar Raja,&nbsp;Manjur Hossain","doi":"10.3103/S1060992X25700237","DOIUrl":"10.3103/S1060992X25700237","url":null,"abstract":"<p>Future computing systems will likely require reversible logic gates since, in the best of circumstances; they are known to provide zero power dissipation. Reversible logic gates also have the advantage of minimizing quantum costs and unused outputs. In this work, reversible 2 : 1 multiplexer has been implemented using all-optical microring resonator. Reversible multiplexer is simulated in MATLAB at 260 Gbps. Some performance indicating factors such as “extinction ratio”, “contrast ratio”, “relative eye opening”, etc are analyzed and developed. The chosen optimal parameters can be validated practically.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"492 - 501"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of Training Results of a Convolutional Neural Network with Computed Weights and Random Weight Initialization 计算权值与随机权值初始化卷积神经网络训练结果的比较
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X2560034X
P. Sh. Geidarov

This paper presents a description of an improved algorithm for computing the weights of a convolutional neural network, as well as the results of comparative experiments on training a convolutional neural network on a handwritten digit dataset MNIST using both precomputed and randomly initialized weights. The experimental results demonstrate the advantages of preliminary analytical computation of neural network weight values compared to random weight initialization. The weights of the convolutional neural network were computed using only 10 digit images, randomly selected from the dataset MNIST. Experiments showed that the time spent on the analytical computation of weight values was negligible. The testing results on the test dataset with the computed but yet untrained neural network showed an accuracy of more than 50% in correctly recognizing the images from the test dataset MNIST. The results of numerous training experiments on the same convolutional neural network, using both computed and random weights, showed that training with precomputed weights yields better results and requires less training time.

本文介绍了一种改进的卷积神经网络权值计算算法,以及在手写数字数据集MNIST上使用预计算权值和随机初始化权值训练卷积神经网络的对比实验结果。实验结果表明,与随机权值初始化相比,神经网络权值的初步解析计算具有优势。卷积神经网络的权重仅使用从数据集MNIST中随机选择的10位数图像来计算。实验表明,在权重值的解析计算上所花费的时间可以忽略不计。使用经过计算但未经训练的神经网络在测试数据集上的测试结果表明,在正确识别来自测试数据集MNIST的图像方面,准确率超过50%。在同一个卷积神经网络上,使用计算权值和随机权值的大量训练实验结果表明,使用预先计算权值的训练效果更好,所需的训练时间更少。
{"title":"Comparison of Training Results of a Convolutional Neural Network with Computed Weights and Random Weight Initialization","authors":"P. Sh. Geidarov","doi":"10.3103/S1060992X2560034X","DOIUrl":"10.3103/S1060992X2560034X","url":null,"abstract":"<p>This paper presents a description of an improved algorithm for computing the weights of a convolutional neural network, as well as the results of comparative experiments on training a convolutional neural network on a handwritten digit dataset MNIST using both precomputed and randomly initialized weights. The experimental results demonstrate the advantages of preliminary analytical computation of neural network weight values compared to random weight initialization. The weights of the convolutional neural network were computed using only 10 digit images, randomly selected from the dataset MNIST. Experiments showed that the time spent on the analytical computation of weight values was negligible. The testing results on the test dataset with the computed but yet untrained neural network showed an accuracy of more than 50% in correctly recognizing the images from the test dataset MNIST. The results of numerous training experiments on the same convolutional neural network, using both computed and random weights, showed that training with precomputed weights yields better results and requires less training time.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"502 - 511"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving the Discretization Step of Multidimensional Digital Arrays through Self-Tuned Extrapolation 利用自调谐外推改进多维数字阵列的离散化步骤
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X25700183
M. V. Gashnikov

Adaptive extrapolation-based algorithms for upscaling multidimensional digital arrays are investigated. For each array element an atomic extrpolator is chosen from a set of computationally simple atomic extrapolators. The choise relies on the local variation ratio in different directions. The adaptation suggests the efficient automatic selection of the local variation ratio limit at which one atomic extrapolator is replaced by another. The computational efficiency of the self-adjustment algorithm is determined by the use of preceeding (neighboring) values of the extrapolator accuracy factor in calculating the local variation ratio limit. Higher accuracy of the extrapolator is due to the use of the downscaled version of the input multidimensional data array for adjusting the extrapolator. The higher efficiency of adaptive extrapolation when scaling multidimensional data arrays has been experimentally proven.

研究了基于自适应外推的多维数字阵列升级算法。对于每个数组元素,从一组计算简单的原子外推器中选择一个原子外推器。选择依赖于不同方向的局部变化率。这种自适应表明了局部变异比极限的有效自动选择,一个原子外推器被另一个原子外推器取代。自调整算法的计算效率取决于外推器精度因子在计算局部变化率极限时的前(邻)值。外推器的更高精度是由于使用了输入多维数据数组的缩小版本来调整外推器。实验证明,自适应外推在缩放多维数据阵列时具有较高的效率。
{"title":"Improving the Discretization Step of Multidimensional Digital Arrays through Self-Tuned Extrapolation","authors":"M. V. Gashnikov","doi":"10.3103/S1060992X25700183","DOIUrl":"10.3103/S1060992X25700183","url":null,"abstract":"<p>Adaptive extrapolation-based algorithms for upscaling multidimensional digital arrays are investigated. For each array element an atomic extrpolator is chosen from a set of computationally simple atomic extrapolators. The choise relies on the local variation ratio in different directions. The adaptation suggests the efficient automatic selection of the local variation ratio limit at which one atomic extrapolator is replaced by another. The computational efficiency of the self-adjustment algorithm is determined by the use of preceeding (neighboring) values of the extrapolator accuracy factor in calculating the local variation ratio limit. Higher accuracy of the extrapolator is due to the use of the downscaled version of the input multidimensional data array for adjusting the extrapolator. The higher efficiency of adaptive extrapolation when scaling multidimensional data arrays has been experimentally proven.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"471 - 475"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LungNet: A Novel Deep Learning-Based Model for Lung Disease Detection LungNet:一种新的基于深度学习的肺部疾病检测模型
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X25600478
Prithwijit Mukherjee,  Anisha Halder Roy

Worldwide, lung disease is a serious health problem, affecting a large percentage of the global population. Accurate diagnosis of lung diseases can be challenging as many of these conditions present with similar symptoms. The goal of this research is to design a robust deep learning-based technique capable of accurately detecting four types of lung diseases, such as COVID-19, bacterial pneumonia, viral pneumonia, and mycoplasma pneumonia, using computed tomography (CT) images. In this study, a publicly available CT image dataset is used for designing the lung disease detection system. First, CT images are preprocessed to enhance their quality, then a GAN (Generative Adversarial Network)-based augmentation technique is used to expand the dataset, doubling its size. A hybrid deep learning model named LungNet, consisting of a Convolutional Neural Network (CNN) module with a dual attention mechanism, a multi-head attention module, and the proposed RPLSTM (revamped potent Long Short-Term Memory) network module, is designed for lung disease detection. The CNN module extracts useful features from CT images. The multi-head attention module helps to focus on the most significant features extracted by the CNN module. Lastly, the proposed RPLSTM module is used to diagnose lung disease effectively. The designed LungNet model achieves an average detection accuracy of 99.30%. The key innovations of this research are: (1) the design of a novel deep learning architecture called LungNet for different lung disease detection, (2) the utilization of channel attention and spatial attention in the CNN module of the proposed model for robust feature extraction, (3) employing a multi-head attention layer in the designed model to enhance its efficacy, (4) proposing an advanced architecture of potent long short-term memory (PLSTM) called RPLSTM and utilizing it for lung disease detection, and (5) utilization of GAN to increase the dataset size and thus solve the dataset scarcity problem.

在世界范围内,肺病是一个严重的健康问题,影响着全球很大一部分人口。肺部疾病的准确诊断可能具有挑战性,因为许多这些疾病都有类似的症状。本研究的目标是设计一种基于深度学习的鲁棒技术,能够使用计算机断层扫描(CT)图像准确检测四种类型的肺部疾病,如COVID-19,细菌性肺炎,病毒性肺炎和支原体肺炎。在本研究中,使用公开可用的CT图像数据集设计肺部疾病检测系统。首先,对CT图像进行预处理以提高其质量,然后使用基于GAN(生成对抗网络)的增强技术扩展数据集,使其大小增加一倍。设计了一种用于肺部疾病检测的混合深度学习模型LungNet,该模型由具有双注意机制的卷积神经网络(CNN)模块、多头注意模块和改进的有效长短期记忆(RPLSTM)网络模块组成。CNN模块从CT图像中提取有用的特征。多头关注模块有助于关注CNN模块提取的最重要的特征。最后,将提出的RPLSTM模块用于肺部疾病的有效诊断。所设计的LungNet模型平均检测准确率达到99.30%。本研究的主要创新点有:(1)设计了一种新的深度学习架构LungNet,用于不同肺部疾病的检测;(2)在模型的CNN模块中利用通道注意和空间注意进行鲁棒特征提取;(3)在设计的模型中使用多头注意层来增强其有效性;(4)提出了一种先进的强效长短期记忆(PLSTM)架构RPLSTM,并将其用于肺部疾病的检测。(5)利用GAN增加数据集大小,从而解决数据集稀缺性问题。
{"title":"LungNet: A Novel Deep Learning-Based Model for Lung Disease Detection","authors":"Prithwijit Mukherjee,&nbsp; Anisha Halder Roy","doi":"10.3103/S1060992X25600478","DOIUrl":"10.3103/S1060992X25600478","url":null,"abstract":"<p>Worldwide, lung disease is a serious health problem, affecting a large percentage of the global population. Accurate diagnosis of lung diseases can be challenging as many of these conditions present with similar symptoms. The goal of this research is to design a robust deep learning-based technique capable of accurately detecting four types of lung diseases, such as COVID-19, bacterial pneumonia, viral pneumonia, and mycoplasma pneumonia, using computed tomography (CT) images. In this study, a publicly available CT image dataset is used for designing the lung disease detection system. First, CT images are preprocessed to enhance their quality, then a GAN (Generative Adversarial Network)-based augmentation technique is used to expand the dataset, doubling its size. A hybrid deep learning model named LungNet, consisting of a Convolutional Neural Network (CNN) module with a dual attention mechanism, a multi-head attention module, and the proposed RPLSTM (revamped potent Long Short-Term Memory) network module, is designed for lung disease detection. The CNN module extracts useful features from CT images. The multi-head attention module helps to focus on the most significant features extracted by the CNN module. Lastly, the proposed RPLSTM module is used to diagnose lung disease effectively. The designed LungNet model achieves an average detection accuracy of 99.30%. The key innovations of this research are: (1) the design of a novel deep learning architecture called LungNet for different lung disease detection, (2) the utilization of channel attention and spatial attention in the CNN module of the proposed model for robust feature extraction, (3) employing a multi-head attention layer in the designed model to enhance its efficacy, (4) proposing an advanced architecture of potent long short-term memory (PLSTM) called RPLSTM and utilizing it for lung disease detection, and (5) utilization of GAN to increase the dataset size and thus solve the dataset scarcity problem.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"592 - 611"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive Traffic Signal Control with Soft Actor-Critic: A Phase Duration Optimization Approach 基于软行为者评价的自适应交通信号控制:一种相位持续时间优化方法
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X25600648
Anton Agafonov, Alexander Yumaganov

Efficient traffic signal control plays a critical role in reducing urban congestion and improving transportation efficiency. This paper presents an adaptive traffic signal control approach based on reinforcement learning that employs the Soft Actor-Critic (SAC) algorithm to optimize traffic signal phase durations. By focusing on phase duration optimization rather than discrete phase selection, the proposed method ensures a more predictable and adaptive traffic management system. Unlike traditional methods that select discrete signal phases, our approach continuously adjusts phase duration based on real-time traffic conditions, providing a more flexible and responsive control strategy. The proposed framework uses connected vehicle data, including position, speed, and acceleration, to predict vehicle arrival times at intersections. These predictions, combined with aggregated traffic characteristics such as queue length and waiting time, form the state representation for the reinforcement learning model. The SAC algorithm is then used to determine optimal phase durations. We evaluated the proposed approach using the SUMO traffic simulator in three different urban scenarios: a single intersection, a three-intersection arterial road, and a small road network. Experimental results demonstrate that the proposed method outperforms baseline approaches, including Deep Q-Network and a heuristic-based method, in terms of average travel time, time loss, and waiting time. Specifically, the SAC-based algorithm achieves reductions of up to 1.5% in average travel time and up to 13% in average waiting time across various simulation scenarios compared to the baseline methods. Furthermore, training convergence analysis and visualizations confirm the stability and effectiveness of the learned policy.

有效的交通信号控制对缓解城市拥堵、提高交通效率具有重要作用。本文提出了一种基于强化学习的自适应交通信号控制方法,该方法采用软行为-评价(SAC)算法来优化交通信号相位持续时间。该方法侧重于相位持续优化而非离散相位选择,保证了交通管理系统具有更强的可预测性和适应性。与选择离散信号相位的传统方法不同,我们的方法根据实时交通状况连续调整相位持续时间,提供更灵活和响应更快的控制策略。提出的框架使用连接的车辆数据,包括位置、速度和加速度,来预测车辆到达十字路口的时间。这些预测,结合聚合的交通特征,如队列长度和等待时间,形成了强化学习模型的状态表示。然后使用SAC算法确定最优相位持续时间。我们使用SUMO交通模拟器在三种不同的城市场景中评估了所提出的方法:单交叉口、三交叉口主干道和小型道路网络。实验结果表明,该方法在平均旅行时间、时间损失和等待时间方面优于基准方法,包括Deep Q-Network和基于启发式的方法。具体来说,与基线方法相比,基于sac的算法在各种模拟场景中平均行驶时间减少了1.5%,平均等待时间减少了13%。此外,训练收敛分析和可视化验证了学习策略的稳定性和有效性。
{"title":"Adaptive Traffic Signal Control with Soft Actor-Critic: A Phase Duration Optimization Approach","authors":"Anton Agafonov,&nbsp;Alexander Yumaganov","doi":"10.3103/S1060992X25600648","DOIUrl":"10.3103/S1060992X25600648","url":null,"abstract":"<p>Efficient traffic signal control plays a critical role in reducing urban congestion and improving transportation efficiency. This paper presents an adaptive traffic signal control approach based on reinforcement learning that employs the Soft Actor-Critic (SAC) algorithm to optimize traffic signal phase durations. By focusing on phase duration optimization rather than discrete phase selection, the proposed method ensures a more predictable and adaptive traffic management system. Unlike traditional methods that select discrete signal phases, our approach continuously adjusts phase duration based on real-time traffic conditions, providing a more flexible and responsive control strategy. The proposed framework uses connected vehicle data, including position, speed, and acceleration, to predict vehicle arrival times at intersections. These predictions, combined with aggregated traffic characteristics such as queue length and waiting time, form the state representation for the reinforcement learning model. The SAC algorithm is then used to determine optimal phase durations. We evaluated the proposed approach using the SUMO traffic simulator in three different urban scenarios: a single intersection, a three-intersection arterial road, and a small road network. Experimental results demonstrate that the proposed method outperforms baseline approaches, including Deep Q-Network and a heuristic-based method, in terms of average travel time, time loss, and waiting time. Specifically, the SAC-based algorithm achieves reductions of up to 1.5% in average travel time and up to 13% in average waiting time across various simulation scenarios compared to the baseline methods. Furthermore, training convergence analysis and visualizations confirm the stability and effectiveness of the learned policy.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"553 - 565"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on Price Fluctuations in International Trade Process of Agricultural Products with a Machine Learning Model 基于机器学习模型的农产品国际贸易过程价格波动研究
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X25601289
Chunchao Pang

In this study, a combination of a gated recurrent unit (GRU) model and a Transformer model was used to predict soybean prices in the agricultural sector. Simulation experiments were conducted. The soybean price prediction algorithm was initially compared with two other algorithms, random forest (RF) and back-propagation neural network (BPNN). Then, ablation experiments were performed on the proposed prediction algorithm. The importance of the feature indicators used in predicting soybean prices was tested. The results indicated that, compared to the RF and BPNN algorithms, the GRU-Transformer model demonstrated a superior performance. Additionally, the results of the ablation experiments revealed that both GRU and Transformer models significantly contributed to the accuracy of soybean price prediction. Moreover, the importance of feature indicators such as soybean imports, soybean exports, soybean oil prices, exchange rates, and soybean meal prices was found to be high.

在本研究中,结合门控循环单元(GRU)模型和变压器模型来预测农业部门的大豆价格。进行了仿真实验。首先将大豆价格预测算法与随机森林(RF)和反向传播神经网络(BPNN)两种算法进行了比较。然后对提出的预测算法进行了烧蚀实验。检验了特征指标在预测大豆价格中的重要性。结果表明,与RF和BPNN算法相比,GRU-Transformer模型表现出优越的性能。此外,烧蚀实验结果表明,GRU和Transformer模型对大豆价格预测的准确性都有显著贡献。此外,大豆进口、大豆出口、豆油价格、汇率、豆粕价格等特征指标的重要性也很高。
{"title":"Research on Price Fluctuations in International Trade Process of Agricultural Products with a Machine Learning Model","authors":"Chunchao Pang","doi":"10.3103/S1060992X25601289","DOIUrl":"10.3103/S1060992X25601289","url":null,"abstract":"<p>In this study, a combination of a gated recurrent unit (GRU) model and a Transformer model was used to predict soybean prices in the agricultural sector. Simulation experiments were conducted. The soybean price prediction algorithm was initially compared with two other algorithms, random forest (RF) and back-propagation neural network (BPNN). Then, ablation experiments were performed on the proposed prediction algorithm. The importance of the feature indicators used in predicting soybean prices was tested. The results indicated that, compared to the RF and BPNN algorithms, the GRU-Transformer model demonstrated a superior performance. Additionally, the results of the ablation experiments revealed that both GRU and Transformer models significantly contributed to the accuracy of soybean price prediction. Moreover, the importance of feature indicators such as soybean imports, soybean exports, soybean oil prices, exchange rates, and soybean meal prices was found to be high.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"546 - 552"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808678","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Feature Extraction Model for Emotion Recognition Using EEG Signals and Deep Learning Approaches 基于脑电信号和深度学习的情感识别混合特征提取模型
IF 0.8 Q4 OPTICS Pub Date : 2025-12-24 DOI: 10.3103/S1060992X2560154X
R. Mathumitha, A. Maryposonia

Emotions reflect the mental state of a person. Individual changes in physiological, physical, mental and behavioral factors reflect different types of emotions. Studies on emotion recognition always have attention among researchers. Signal processing and feature handling techniques are developed for accurate emotions recognition from the biological brain signals. Through electroencephalography (EEG) channels, physiological signals are obtained and the essential features are extracted for analysis. However, the detection or recognition performance of traditional methods provides room for improvement due to poor accuracy or improper feature handling performances. The EEG signals for emotion recognition are predicted from two data sources that are preprocessed through filtering method for reducing artifacts of the EEG signals. Then, the preprocessed signals are given to the feature extraction phase such as Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM). The CNN model is applied for the extraction of statistical related features and the LSTM model is applied to extract non-linear related features. These features are fused at the final stage of the feature extraction phase and ResNet152 model is implemented in this paper for classifying types of emotions in the EEG signals according to the extracted features. The comprehensive analyses are performed through different performance evaluation measures and the proposed model attained better performances of 0.9867 and 0.9646 from accuracy and mathew’s correlation coefficient respectively. From this experimental validation, the proposed model achieved better outcome than other compared existing approaches.

情绪反映了一个人的精神状态。个体的生理、生理、心理和行为因素的变化反映了不同类型的情绪。情绪识别的研究一直受到研究者的关注。信号处理和特征处理技术是为了从生物大脑信号中准确识别情绪而发展起来的。通过脑电图通道获取生理信号,提取基本特征进行分析。然而,传统方法的检测或识别性能由于精度不高或特征处理性能不佳而存在改进空间。从两个数据源中预测用于情绪识别的脑电信号,并对两个数据源进行预处理,以减少脑电信号的伪影。然后,将预处理后的信号输入到卷积神经网络(CNN)、长短期记忆(LSTM)等特征提取阶段。采用CNN模型提取统计相关特征,采用LSTM模型提取非线性相关特征。在特征提取的最后阶段对这些特征进行融合,并实现ResNet152模型,根据提取的特征对脑电信号中的情绪类型进行分类。通过不同的性能评价指标进行综合分析,所提模型的精度和马修相关系数分别达到0.9867和0.9646。从实验验证来看,所提出的模型取得了较好的效果。
{"title":"Hybrid Feature Extraction Model for Emotion Recognition Using EEG Signals and Deep Learning Approaches","authors":"R. Mathumitha,&nbsp;A. Maryposonia","doi":"10.3103/S1060992X2560154X","DOIUrl":"10.3103/S1060992X2560154X","url":null,"abstract":"<p>Emotions reflect the mental state of a person. Individual changes in physiological, physical, mental and behavioral factors reflect different types of emotions. Studies on emotion recognition always have attention among researchers. Signal processing and feature handling techniques are developed for accurate emotions recognition from the biological brain signals. Through electroencephalography (EEG) channels, physiological signals are obtained and the essential features are extracted for analysis. However, the detection or recognition performance of traditional methods provides room for improvement due to poor accuracy or improper feature handling performances. The EEG signals for emotion recognition are predicted from two data sources that are preprocessed through filtering method for reducing artifacts of the EEG signals. Then, the preprocessed signals are given to the feature extraction phase such as Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM). The CNN model is applied for the extraction of statistical related features and the LSTM model is applied to extract non-linear related features. These features are fused at the final stage of the feature extraction phase and ResNet152 model is implemented in this paper for classifying types of emotions in the EEG signals according to the extracted features. The comprehensive analyses are performed through different performance evaluation measures and the proposed model attained better performances of 0.9867 and 0.9646 from accuracy and mathew’s correlation coefficient respectively. From this experimental validation, the proposed model achieved better outcome than other compared existing approaches.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 4","pages":"512 - 527"},"PeriodicalIF":0.8,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145808713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Optical Memory and Neural Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1