首页 > 最新文献

Optical Memory and Neural Networks最新文献

英文 中文
Decoding EEG Data with Deep Learning for Intelligence Quotient Assessment 基于深度学习的脑电数据解码与智商评估
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X24601921
Prithwijit Mukherjee,  Anisha Halder Roy

Intelligence quotient (IQ) serves as a statistical gauge for evaluating an individual’s cognitive prowess. Measuring IQ is a formidable undertaking, mainly due to the intricate intricacies of the human brain’s composition. Presently, the assessment of human intelligence relies solely on conventional paper-based psychometric tests. However, these approaches suffer from inherent discrepancies arising from the diversity of test formats and language barriers. The primary objective of this study is to introduce an innovative, deep learning-driven methodology for IQ measurement using Electroencephalogram (EEG) signals. In this investigation, EEG signals are captured from participants during an IQ assessment session. Subsequently, participants' IQ levels are categorized into six distinct tiers, encompassing extremely low IQ, borderline IQ, low average IQ, high average IQ, superior IQ, and very superior IQ, based on their test results. An attention mechanism-based Convolution Neural Network-modified tanh Long-Short-term-Memory (CNN-MTLSTM) model has been meticulously devised for adeptly classifying individuals into the aforementioned IQ categories by using EEG signals. A layer named 'input enhancement layer' is proposed and incorporated in CNN-MTLSTM for enhancing its prediction accuracy. Notably, a CNN is harnessed to automate the process of extracting important information from the extracted EEG features. A new model, i.e., MTLSTM, is proposed, which works as a classifier. The paper’s contributions encompass proposing the novel MTLSTM architecture and leveraging attention mechanism to enhance the classification accuracy of the CNN-MTLSTM model. The innovative CNN-MTLSTM model, incorporating an attention mechanism within the MTLSTM network, attains a remarkable average accuracy of 97.41% in assessing a person’s IQ level.

智商(IQ)是评估个人认知能力的统计指标。测量智商是一项艰巨的任务,主要是由于人类大脑的组成错综复杂。目前,人类智力的评估完全依赖于传统的纸质心理测试。然而,由于测试形式的多样性和语言障碍,这些方法存在固有的差异。本研究的主要目的是介绍一种创新的、深度学习驱动的方法,用于使用脑电图(EEG)信号进行智商测量。在这项研究中,在智商评估过程中,从参与者身上捕获脑电图信号。随后,根据测试结果,参与者的智商水平被分为六个不同的等级,包括极低智商、边缘智商、低平均智商、高平均智商、高智商和超高智商。本文精心设计了一个基于注意机制的卷积神经网络修正长短期记忆(CNN-MTLSTM)模型,利用脑电图信号熟练地将个体划分为上述智商类别。为了提高CNN-MTLSTM的预测精度,提出了一层“输入增强层”并将其加入到CNN-MTLSTM中。值得注意的是,利用CNN从提取的EEG特征中自动提取重要信息。提出了一种新的分类器模型MTLSTM。本文的贡献包括提出新的MTLSTM架构和利用注意力机制来提高CNN-MTLSTM模型的分类精度。创新的CNN-MTLSTM模型在MTLSTM网络中加入了注意机制,在评估一个人的智商水平时达到了97.41%的平均准确率。
{"title":"Decoding EEG Data with Deep Learning for Intelligence Quotient Assessment","authors":"Prithwijit Mukherjee,&nbsp; Anisha Halder Roy","doi":"10.3103/S1060992X24601921","DOIUrl":"10.3103/S1060992X24601921","url":null,"abstract":"<p>Intelligence quotient (IQ) serves as a statistical gauge for evaluating an individual’s cognitive prowess. Measuring IQ is a formidable undertaking, mainly due to the intricate intricacies of the human brain’s composition. Presently, the assessment of human intelligence relies solely on conventional paper-based psychometric tests. However, these approaches suffer from inherent discrepancies arising from the diversity of test formats and language barriers. The primary objective of this study is to introduce an innovative, deep learning-driven methodology for IQ measurement using Electroencephalogram (EEG) signals. In this investigation, EEG signals are captured from participants during an IQ assessment session. Subsequently, participants' IQ levels are categorized into six distinct tiers, encompassing extremely low IQ, borderline IQ, low average IQ, high average IQ, superior IQ, and very superior IQ, based on their test results. An attention mechanism-based Convolution Neural Network-modified tanh Long-Short-term-Memory (CNN-MTLSTM) model has been meticulously devised for adeptly classifying individuals into the aforementioned IQ categories by using EEG signals. A layer named 'input enhancement layer' is proposed and incorporated in CNN-MTLSTM for enhancing its prediction accuracy. Notably, a CNN is harnessed to automate the process of extracting important information from the extracted EEG features. A new model, i.e., MTLSTM, is proposed, which works as a classifier. The paper’s contributions encompass proposing the novel MTLSTM architecture and leveraging attention mechanism to enhance the classification accuracy of the CNN-MTLSTM model. The innovative CNN-MTLSTM model, incorporating an attention mechanism within the MTLSTM network, attains a remarkable average accuracy of 97.41% in assessing a person’s IQ level.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"441 - 456"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Open-Vocabulary Indoor Object Grounding with 3D Hierarchical Scene Graph 基于三维分层场景图的开放词汇室内物体接地
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25600673
S. Linok, G. Naumov

We propose OVIGo-3DHSG method—Open-Vocabulary Indoor Grounding of objects using 3D Hierarchical Scene Graph. OVIGo-3DHSG represents an extensive indoor environment over a Hierarchical Scene Graph derived from sequences of RGB-D frames utilizing a set of open-vocabulary foundation models and sensor data processing. The hierarchical representation explicitly models spatial relations across floors, rooms, locations, and objects. To effectively address complex queries involving spatial reference to other objects, we integrate the hierarchical scene graph with a Large Language Model for multistep reasoning. This integration leverages inter-layer (e.g., room-to-object) and intra-layer (e.g., object-to-object) connections, enhancing spatial contextual understanding. We investigate the semantic and geometry accuracy of hierarchical representation on Habitat Matterport 3D Semantic multi-floor scenes. Our approach demonstrates efficient scene comprehension and robust object grounding compared to existing methods. Overall OVIGo-3DHSG demonstrates strong potential for applications requiring spatial reasoning and understanding of indoor environments. Related materials can be found at https://github.com/linukc/OVIGo-3DHSG.

我们提出了OVIGo-3DHSG方法——基于三维层次场景图的开放词汇室内物体接地。OVIGo-3DHSG利用一组开放词汇基础模型和传感器数据处理,在RGB-D帧序列派生的分层场景图上表示广泛的室内环境。分层表示显式地对跨楼层、房间、位置和对象的空间关系进行建模。为了有效地处理涉及到其他对象的空间引用的复杂查询,我们将分层场景图与用于多步推理的大型语言模型相结合。这种集成利用了层间(例如,房间到对象)和层内(例如,对象到对象)的连接,增强了空间上下文的理解。研究了Habitat Matterport三维语义多层场景的分层表示的语义和几何精度。与现有方法相比,我们的方法展示了高效的场景理解和鲁棒的对象基础。总体而言,OVIGo-3DHSG在需要空间推理和室内环境理解的应用中显示出强大的潜力。相关资料可在https://github.com/linukc/OVIGo-3DHSG找到。
{"title":"Open-Vocabulary Indoor Object Grounding with 3D Hierarchical Scene Graph","authors":"S. Linok,&nbsp;G. Naumov","doi":"10.3103/S1060992X25600673","DOIUrl":"10.3103/S1060992X25600673","url":null,"abstract":"<p>We propose <b>OVIGo-3DHSG</b> method—<b>O</b>pen-<b>V</b>ocabulary <b>I</b>ndoor <b>G</b>rounding of <b>o</b>bjects using <b>3D</b> <b>H</b>ierarchical <b>S</b>cene <b>G</b>raph. OVIGo-3DHSG represents an extensive indoor environment over a Hierarchical Scene Graph derived from sequences of RGB-D frames utilizing a set of open-vocabulary foundation models and sensor data processing. The hierarchical representation explicitly models spatial relations across floors, rooms, locations, and objects. To effectively address complex queries involving spatial reference to other objects, we integrate the hierarchical scene graph with a Large Language Model for multistep reasoning. This integration leverages inter-layer (e.g., room-to-object) and intra-layer (e.g., object-to-object) connections, enhancing spatial contextual understanding. We investigate the semantic and geometry accuracy of hierarchical representation on Habitat Matterport 3D Semantic multi-floor scenes. Our approach demonstrates efficient scene comprehension and robust object grounding compared to existing methods. Overall OVIGo-3DHSG demonstrates strong potential for applications requiring spatial reasoning and understanding of indoor environments. Related materials can be found at https://github.com/linukc/OVIGo-3DHSG.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"323 - 333"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AS-ODB: Multivariate Attention Supervised Learning Based Optimized DBN Approach for Cloud Workload Prediction 基于多变量注意监督学习的优化DBN云工作负载预测方法
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700122
G. M. Kiran, A. Aparna Rajesh, D. Basavesha

Attainable on demand cloud computing makes it feasible to access a centralized shared pool of computing resources. Accurate estimation of cloud workload is necessary for optimal performance and effective use of cloud computing resources. Because cloud workloads are dynamic and unpredictable, this is a problematic problem. In this case, deep learning can provide reliable foundations for workload prediction in data centres when trained appropriately. In the proposed model, efficient workload prediction is executed out using novel deep learning. Efficient management of these hyperparameters may significantly improve the neural network model’s performance. Using the data centre’s workload traces at many consecutive time steps, the suggested approach is shown to be able to estimate Central Processing Unit (CPU) utilization. Collects raw data retrieved from the storage, including the number and type of requests, virtual machine (VMs) costs, and resource usage. Discover patterns and oscillations in the workload trace by preprocessing the data to increase the prediction efficacy of this model. During data pre-processing, the KCR approach, min max normalization, and data cleaning are used to select the important properties from raw data samples, eliminate noise, and normalize them. After that, a sliding window is used for deep learning processing to convert multivariate data into time series with supervised learning. Next, utilize a deep belief network based on green anaconda optimization (GrA-DBN) to attain precise workload forecasting. Comparing the suggested methodology with existing models, experimental results show that it provides a better trade-off between accuracy and training time. The suggested method provides higher performance, with an execution time of 28.5 s and an accuracy rate of 93.60%. According to the simulation results, the GrA-DBN workload prediction method performs better than other algorithms.

可实现的随需应变云计算使得访问集中的共享计算资源池成为可能。准确估计云工作负载对于优化性能和有效使用云计算资源是必要的。因为云工作负载是动态的和不可预测的,所以这是一个有问题的问题。在这种情况下,经过适当的训练,深度学习可以为数据中心的工作负载预测提供可靠的基础。在该模型中,利用新颖的深度学习实现了高效的工作负荷预测。对这些超参数的有效管理可以显著提高神经网络模型的性能。使用数据中心在许多连续时间步长的工作负载跟踪,所建议的方法能够估计中央处理单元(Central Processing Unit, CPU)的利用率。收集从存储检索到的原始数据,包括请求的数量和类型、虚拟机(vm)成本和资源使用情况。通过对数据进行预处理,发现工作负载跟踪中的模式和振荡,从而提高该模型的预测效率。在数据预处理过程中,使用KCR方法、最小最大归一化和数据清洗从原始数据样本中选择重要属性,消除噪声并进行归一化。之后,使用滑动窗口进行深度学习处理,通过监督学习将多变量数据转换为时间序列。其次,利用基于绿蟒蛇优化的深度信念网络(GrA-DBN)实现精确的工作量预测。实验结果表明,该方法能较好地平衡训练时间和准确率。该方法具有较高的性能,执行时间为28.5 s,准确率为93.60%。仿真结果表明,GrA-DBN工作负载预测方法的性能优于其他算法。
{"title":"AS-ODB: Multivariate Attention Supervised Learning Based Optimized DBN Approach for Cloud Workload Prediction","authors":"G. M. Kiran,&nbsp;A. Aparna Rajesh,&nbsp;D. Basavesha","doi":"10.3103/S1060992X25700122","DOIUrl":"10.3103/S1060992X25700122","url":null,"abstract":"<p>Attainable on demand cloud computing makes it feasible to access a centralized shared pool of computing resources. Accurate estimation of cloud workload is necessary for optimal performance and effective use of cloud computing resources. Because cloud workloads are dynamic and unpredictable, this is a problematic problem. In this case, deep learning can provide reliable foundations for workload prediction in data centres when trained appropriately. In the proposed model, efficient workload prediction is executed out using novel deep learning. Efficient management of these hyperparameters may significantly improve the neural network model’s performance. Using the data centre’s workload traces at many consecutive time steps, the suggested approach is shown to be able to estimate Central Processing Unit (CPU) utilization. Collects raw data retrieved from the storage, including the number and type of requests, virtual machine (VMs) costs, and resource usage. Discover patterns and oscillations in the workload trace by preprocessing the data to increase the prediction efficacy of this model. During data pre-processing, the KCR approach, min max normalization, and data cleaning are used to select the important properties from raw data samples, eliminate noise, and normalize them. After that, a sliding window is used for deep learning processing to convert multivariate data into time series with supervised learning. Next, utilize a deep belief network based on green anaconda optimization (GrA-DBN) to attain precise workload forecasting. Comparing the suggested methodology with existing models, experimental results show that it provides a better trade-off between accuracy and training time. The suggested method provides higher performance, with an execution time of 28.5 s and an accuracy rate of 93.60%. According to the simulation results, the GrA-DBN workload prediction method performs better than other algorithms.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"389 - 401"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073823","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
M3DMap: Object-Aware Multimodal 3D Mapping for Dynamic Environments M3DMap:动态环境的对象感知多模态3D映射
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700092
D. A. Yudin

3D mapping in dynamic environments poses a challenge for modern researchers in robotics and autonomous transportation. There are no universal representations for dynamic 3D scenes that incorporate multimodal data such as images, point clouds, and text. This article takes a step toward solving this problem. It proposes a taxonomy of methods for constructing multimodal 3D maps, classifying contemporary approaches based on scene types and representations, learning methods, and practical applications. Using this taxonomy, a brief structured analysis of recent methods is provided. The article also describes an original modular method called M3DMap, designed for object-aware construction of multimodal 3D maps for both static and dynamic scenes. It consists of several interconnected components: a neural multimodal object segmentation and tracking module; an odometry estimation module, including trainable algorithms; a module for 3D map construction and updating with various implementations depending on the desired scene representation; and a multimodal data retrieval module. The article highlights original implementations of these modules and their advantages in solving various practical tasks, from 3D object grounding to mobile manipulation. Additionally, it presents theoretical propositions demonstrating the positive effect of using multimodal data and modern foundational models in 3D mapping methods. Details of the taxonomy and method implementation are available at https://yuddim.github.io/M3DMap.

动态环境下的三维映射对现代机器人和自动运输研究人员提出了挑战。对于包含多模态数据(如图像、点云和文本)的动态3D场景,没有通用的表示。本文为解决这个问题迈出了一步。它提出了构建多模态3D地图的方法分类,根据场景类型和表示、学习方法和实际应用对当代方法进行分类。使用这种分类法,对最近的方法进行了简要的结构化分析。本文还介绍了一种名为M3DMap的原始模块化方法,用于静态和动态场景的多模态3D地图的对象感知构建。它由几个相互关联的组件组成:一个神经多模态目标分割和跟踪模块;里程计估计模块,包括可训练算法;一个用于3D地图构建和更新的模块,根据所需的场景表示使用各种实现;以及一个多模态数据检索模块。本文重点介绍了这些模块的原始实现及其在解决各种实际任务中的优势,从3D对象接地到移动操作。此外,本文还提出了一些理论命题,证明了在三维制图方法中使用多模态数据和现代基础模型的积极作用。分类法和方法实现的详细信息可在https://yuddim.github.io/M3DMap上获得。
{"title":"M3DMap: Object-Aware Multimodal 3D Mapping for Dynamic Environments","authors":"D. A. Yudin","doi":"10.3103/S1060992X25700092","DOIUrl":"10.3103/S1060992X25700092","url":null,"abstract":"<p>3D mapping in dynamic environments poses a challenge for modern researchers in robotics and autonomous transportation. There are no universal representations for dynamic 3D scenes that incorporate multimodal data such as images, point clouds, and text. This article takes a step toward solving this problem. It proposes a taxonomy of methods for constructing multimodal 3D maps, classifying contemporary approaches based on scene types and representations, learning methods, and practical applications. Using this taxonomy, a brief structured analysis of recent methods is provided. The article also describes an original modular method called M3DMap, designed for object-aware construction of multimodal 3D maps for both static and dynamic scenes. It consists of several interconnected components: a neural multimodal object segmentation and tracking module; an odometry estimation module, including trainable algorithms; a module for 3D map construction and updating with various implementations depending on the desired scene representation; and a multimodal data retrieval module. The article highlights original implementations of these modules and their advantages in solving various practical tasks, from 3D object grounding to mobile manipulation. Additionally, it presents theoretical propositions demonstrating the positive effect of using multimodal data and modern foundational models in 3D mapping methods. Details of the taxonomy and method implementation are available at https://yuddim.github.io/M3DMap.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"285 - 312"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erratum to: Consumer Behavior Analysis in Social Networking Big Data Using Correlated Extreme Learning 使用相关极限学习的社交网络大数据中的消费者行为分析的勘误
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X2570016X
M. Arumugam, C. Jayanthi
{"title":"Erratum to: Consumer Behavior Analysis in Social Networking Big Data Using Correlated Extreme Learning","authors":"M. Arumugam,&nbsp;C. Jayanthi","doi":"10.3103/S1060992X2570016X","DOIUrl":"10.3103/S1060992X2570016X","url":null,"abstract":"","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"470 - 470"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ElevNav: Large Language Model-Guided Robot Navigation via 3D Scene Graphs in Elevator Environments ElevNav:电梯环境中基于3D场景图的大型语言模型引导机器人导航
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700109
Huzhenyu Zhang

Cross-floor robotic navigation has become an increasingly critical capability for autonomous systems operating in multi-floor buildings. While 3D scene graphs have demonstrated promise for representing hierarchical spatial relationships, current approaches predominantly address cross-floor navigation by stairs, overlooking the practical challenges of elevator-mediated navigation in modern buildings. This paper presents ElevNav, a novel framework that bridges this gap through two key innovations: (1) automatic construction of semantically-rich 3D scene graphs from RGB-D sequences with estimated camera trajectories, and (2) task decomposition using large language models to translate natural language commands into executable action sequences. Our method addresses elevator interaction through specialized action primitives such as pressing buttons, entering and exiting the elevator, and moving toward target objects. We evaluate ElevNav in complex simulated environments built using Isaac Sim, demonstrating robust performance in multi-floor navigation scenarios. To facilitate further research, we release a new dataset containing elevator environments with corresponding scene graph representations, addressing a critical gap in existing 3D navigation benchmarks, which is open-sourced at: https://github.com/zhanghuzhenyu/elevnav.

跨楼层机器人导航已经成为在多层建筑中运行的自主系统越来越重要的能力。虽然3D场景图已经证明了表示分层空间关系的希望,但目前的方法主要是通过楼梯解决跨层导航,忽略了现代建筑中电梯导航的实际挑战。本文介绍了ElevNav,这是一个通过两个关键创新来弥合这一差距的新框架:(1)根据估计的摄像机轨迹从RGB-D序列自动构建语义丰富的3D场景图,以及(2)使用大型语言模型将自然语言命令转换为可执行的动作序列的任务分解。我们的方法通过特殊的动作原语(如按按钮、进入和退出电梯以及向目标对象移动)来处理电梯交互。我们在使用Isaac Sim构建的复杂模拟环境中评估了ElevNav,展示了在多层导航场景中的稳健性能。为了促进进一步的研究,我们发布了一个新的数据集,其中包含具有相应场景图表示的电梯环境,解决了现有3D导航基准的关键差距,该数据集是开源的:https://github.com/zhanghuzhenyu/elevnav。
{"title":"ElevNav: Large Language Model-Guided Robot Navigation via 3D Scene Graphs in Elevator Environments","authors":"Huzhenyu Zhang","doi":"10.3103/S1060992X25700109","DOIUrl":"10.3103/S1060992X25700109","url":null,"abstract":"<p>Cross-floor robotic navigation has become an increasingly critical capability for autonomous systems operating in multi-floor buildings. While 3D scene graphs have demonstrated promise for representing hierarchical spatial relationships, current approaches predominantly address cross-floor navigation by stairs, overlooking the practical challenges of elevator-mediated navigation in modern buildings. This paper presents <b>ElevNav</b>, a novel framework that bridges this gap through two key innovations: (1) automatic construction of semantically-rich 3D scene graphs from RGB-D sequences with estimated camera trajectories, and (2) task decomposition using large language models to translate natural language commands into executable action sequences. Our method addresses elevator interaction through specialized action primitives such as pressing buttons, entering and exiting the elevator, and moving toward target objects. We evaluate ElevNav in complex simulated environments built using Isaac Sim, demonstrating robust performance in multi-floor navigation scenarios. To facilitate further research, we release a new dataset containing elevator environments with corresponding scene graph representations, addressing a critical gap in existing 3D navigation benchmarks, which is open-sourced at: https://github.com/zhanghuzhenyu/elevnav.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"313 - 322"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Skin Disease Diagnosis Using Optimized nnU-Net Segmentation and Hybrid E-Cap Net with UFO-Net 基于优化nnU-Net分割和混合E-Cap网与ufo网的皮肤病诊断
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700134
Y. Lins Joy, S. Jerine

Skin diseases are among the most frequent and pervasive conditions affecting individuals all over the world. The two primary causes of skin cancer are climate change and global warming. If skin conditions are not identified and treated promptly, they may become fatal. Advanced ML and DL approaches on skin diseases often face limitations such as insufficient data diversity, high variability in imaging quality, and challenges in accurately distinguishing between similar-looking conditions. These drawbacks can lead to reduce diagnostic accuracy and generalizability of the models. To overcome the aforementioned challenges, an improved segmentation and hybrid deep learning approach is used to identify numerous kinds of skin disease. Initially, raw images for input are collected from the skin disease image dataset. The collected image is pre-processed with resizing and a Hierarchical Noise Deinterlace Net (HNDN) to remove noise. The pre-processed images are then segmented into different parts or regions using the no new U-Network (nnU-Net). Here, the Marine Predator Algorithm (MPA) is used to choose the nnU-Net learning rate, and batch size optimally. Then, the segmented image is subjected to a hybrid Efficient-capsule network (E-cap Net) and Unified force operation network (UFO-Net) classifier predicting several types of skin disease. An analysis of proposed method’s simulation results indicates that it achieves 97.49% accuracy, 90.06% precision, and 98.56% selectivity. Thus, the proposed method is a most effective method for predicting the multi-type skin disease.

皮肤病是影响全世界个人的最常见和最普遍的疾病之一。皮肤癌的两个主要原因是气候变化和全球变暖。如果没有及时发现和治疗皮肤病,它们可能会致命。皮肤疾病的高级ML和DL方法通常面临诸如数据多样性不足,成像质量高度可变性以及准确区分相似情况的挑战等限制。这些缺点会降低模型的诊断准确性和通用性。为了克服上述挑战,使用改进的分割和混合深度学习方法来识别多种皮肤疾病。首先,从皮肤病图像数据集中收集用于输入的原始图像。对采集到的图像进行预处理,通过调整大小和分层噪声去隔行网络(HNDN)去除噪声。然后使用无新u网络(nnU-Net)将预处理后的图像分割成不同的部分或区域。本文采用海洋捕食者算法(Marine Predator Algorithm, MPA)对nnU-Net学习率和批处理大小进行优化选择。然后,将分割后的图像进行高效胶囊网络(E-cap Net)和统一力操作网络(UFO-Net)混合分类器预测几种类型的皮肤病。仿真结果表明,该方法的准确率为97.49%,精密度为90.06%,选择性为98.56%。因此,该方法是预测多类型皮肤病最有效的方法。
{"title":"Efficient Skin Disease Diagnosis Using Optimized nnU-Net Segmentation and Hybrid E-Cap Net with UFO-Net","authors":"Y. Lins Joy,&nbsp;S. Jerine","doi":"10.3103/S1060992X25700134","DOIUrl":"10.3103/S1060992X25700134","url":null,"abstract":"<p>Skin diseases are among the most frequent and pervasive conditions affecting individuals all over the world. The two primary causes of skin cancer are climate change and global warming. If skin conditions are not identified and treated promptly, they may become fatal. Advanced ML and DL approaches on skin diseases often face limitations such as insufficient data diversity, high variability in imaging quality, and challenges in accurately distinguishing between similar-looking conditions. These drawbacks can lead to reduce diagnostic accuracy and generalizability of the models. To overcome the aforementioned challenges, an improved segmentation and hybrid deep learning approach is used to identify numerous kinds of skin disease. Initially, raw images for input are collected from the skin disease image dataset. The collected image is pre-processed with resizing and a Hierarchical Noise Deinterlace Net (HNDN) to remove noise. The pre-processed images are then segmented into different parts or regions using the no new U-Network (nnU-Net). Here, the Marine Predator Algorithm (MPA) is used to choose the nnU-Net learning rate, and batch size optimally. Then, the segmented image is subjected to a hybrid Efficient-capsule network (E-cap Net) and Unified force operation network (UFO-Net) classifier predicting several types of skin disease. An analysis of proposed method’s simulation results indicates that it achieves 97.49% accuracy, 90.06% precision, and 98.56% selectivity. Thus, the proposed method is a most effective method for predicting the multi-type skin disease.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"402 - 417"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learner Cognitive Feature Model for Learning Resource Personalizing Recommendation 面向学习资源个性化推荐的学习者认知特征模型
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X24600460
Yongheng Chen,  Chunyan Yin

In this paper, we propose a novel learner cognitive feature model for personalized guidance and push (LCFLM) that traces the evolution of learners’ knowledge proficiency based on their exercising logs in online learning systems. Specifically, we introduce the exercise-aware dependency hierarchical graph of exercise dependency and pattern dependency that can establish a model of exercise dependency relationships. Additionally, we propose the implementation of a forget gating mechanism, which combines the forgetting features with the knowledge state features to predict a student’s learning performance. The experimental results clearly demonstrate that LCFLM achieves the new state-of-the-art performance, exhibiting an improvement of at least 3% in both AUC and ACC. Furthermore, the LCFLM model has the ability to autonomously uncover the fundamental concepts underlying exercises and provides a visual representation of a student’s evolving knowledge state.

在本文中,我们提出了一种新的学习者认知特征模型,用于个性化指导和推送(LCFLM),该模型基于学习者在在线学习系统中的练习日志来跟踪学习者知识熟练程度的演变。具体来说,我们引入了运动依赖的运动感知依赖层次图和模式依赖,可以建立运动依赖关系模型。此外,我们提出了遗忘门控机制的实现,该机制将遗忘特征与知识状态特征相结合,以预测学生的学习表现。实验结果清楚地表明,LCFLM达到了新的最先进的性能,在AUC和ACC方面都有至少3%的提高。此外,LCFLM模型具有自主发现练习背后的基本概念的能力,并提供学生不断发展的知识状态的可视化表示。
{"title":"Learner Cognitive Feature Model for Learning Resource Personalizing Recommendation","authors":"Yongheng Chen,&nbsp; Chunyan Yin","doi":"10.3103/S1060992X24600460","DOIUrl":"10.3103/S1060992X24600460","url":null,"abstract":"<p>In this paper, we propose a novel learner cognitive feature model for personalized guidance and push (LCFLM) that traces the evolution of learners’ knowledge proficiency based on their exercising logs in online learning systems. Specifically, we introduce the exercise-aware dependency hierarchical graph of exercise dependency and pattern dependency that can establish a model of exercise dependency relationships. Additionally, we propose the implementation of a forget gating mechanism, which combines the forgetting features with the knowledge state features to predict a student’s learning performance. The experimental results clearly demonstrate that LCFLM achieves the new state-of-the-art performance, exhibiting an improvement of at least 3% in both AUC and ACC. Furthermore, the LCFLM model has the ability to autonomously uncover the fundamental concepts underlying exercises and provides a visual representation of a student’s evolving knowledge state.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"457 - 469"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reputation-Based Byzantine Fault Tolerance and ElGamal Cryptography with Deep Belief Network on Smart Contract for Secure Blockchain 基于信誉的拜占庭容错和基于深度信任网络的安全区块链智能合约ElGamal加密
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700110
V. Devi, P. Amudha

Blockchain is a secure, decentralized ledger system that records transactions in immutable blocks. A smart contract is a self-executing piece of code on the blockchain that automatically enforces agreements when specific conditions are met. Additionally, once deployed, smart contracts are immutable, making it difficult to fix bugs or vulnerabilities without affecting the entire blockchain. Using machine learning and deep learning techniques, vulnerabilities in smart contract code have been effectively identified. The trained net model is tampered with since the algorithms' learning is not safe. Therefore, a fully homomorphic deep learning algorithm has been developed to detect vulnerabilities in smart contract systems for blockchain in order to safeguard user data. Initially, user data is stored on the blockchain based on a consensus algorithm that evaluates the operations of each node using a reputation model. Reputation-based Byzantine Fault Tolerance (RBFT) enhances security by assessing users' reputations to prevent malicious behaviour and ensure fault tolerance. Reputation values, ranging from 0 to 1, are crucial for establishing trust and reliability in the network. To further optimize RBFT performance, the Secretary Bird Optimization Algorithm is employed. Smart contract data is derived from source code, including the control flow graph and operation code. XLNet and Bi-LSTM are used to extract features from the control flow graph and operation code, which are then trained and tested using ElGamal cryptography with a Deep Belief Network to improve vulnerability detection and enhance security in blockchain-based smart contract systems. The proposed approach provides 98.40% accuracy, 95.40% positive predictive value (PPV), and 98.80% selectivity. This proposed approach enhances blockchain-based smart contract systems by improving vulnerability detection and ensuring robust encryption of sensitive data through advanced reputation models and cryptographic techniques.

区块链是一个安全的、去中心化的账本系统,它将交易记录在不可变的区块中。智能合约是区块链上的一段自动执行的代码,当满足特定条件时,它会自动执行协议。此外,一旦部署,智能合约是不可变的,因此很难在不影响整个区块链的情况下修复错误或漏洞。利用机器学习和深度学习技术,智能合约代码中的漏洞已被有效识别。由于算法的学习不安全,训练后的网络模型会被篡改。因此,开发了一种完全同态深度学习算法来检测区块链智能合约系统中的漏洞,以保护用户数据。最初,用户数据基于共识算法存储在区块链上,该算法使用信誉模型评估每个节点的操作。基于声誉的拜占庭容错(Byzantine Fault Tolerance, RBFT)通过评估用户的声誉来提高安全性,防止恶意行为,保证容错能力。信誉值(从0到1)对于在网络中建立信任和可靠性至关重要。为了进一步优化RBFT性能,采用秘书鸟优化算法。智能合约数据来源于源代码,包括控制流程图和操作代码。XLNet和Bi-LSTM用于从控制流图和操作代码中提取特征,然后使用ElGamal加密技术与深度信念网络进行训练和测试,以改进漏洞检测并增强基于区块链的智能合约系统的安全性。该方法准确率为98.40%,阳性预测值(PPV)为95.40%,选择性为98.80%。该方法通过改进漏洞检测并通过先进的声誉模型和加密技术确保敏感数据的健壮加密,增强了基于区块链的智能合约系统。
{"title":"Reputation-Based Byzantine Fault Tolerance and ElGamal Cryptography with Deep Belief Network on Smart Contract for Secure Blockchain","authors":"V. Devi,&nbsp;P. Amudha","doi":"10.3103/S1060992X25700110","DOIUrl":"10.3103/S1060992X25700110","url":null,"abstract":"<p>Blockchain is a secure, decentralized ledger system that records transactions in immutable blocks. A smart contract is a self-executing piece of code on the blockchain that automatically enforces agreements when specific conditions are met. Additionally, once deployed, smart contracts are immutable, making it difficult to fix bugs or vulnerabilities without affecting the entire blockchain. Using machine learning and deep learning techniques, vulnerabilities in smart contract code have been effectively identified. The trained net model is tampered with since the algorithms' learning is not safe. Therefore, a fully homomorphic deep learning algorithm has been developed to detect vulnerabilities in smart contract systems for blockchain in order to safeguard user data. Initially, user data is stored on the blockchain based on a consensus algorithm that evaluates the operations of each node using a reputation model. Reputation-based Byzantine Fault Tolerance (RBFT) enhances security by assessing users' reputations to prevent malicious behaviour and ensure fault tolerance. Reputation values, ranging from 0 to 1, are crucial for establishing trust and reliability in the network. To further optimize RBFT performance, the Secretary Bird Optimization Algorithm is employed. Smart contract data is derived from source code, including the control flow graph and operation code. XLNet and Bi-LSTM are used to extract features from the control flow graph and operation code, which are then trained and tested using ElGamal cryptography with a Deep Belief Network to improve vulnerability detection and enhance security in blockchain-based smart contract systems. The proposed approach provides 98.40% accuracy, 95.40% positive predictive value (PPV), and 98.80% selectivity. This proposed approach enhances blockchain-based smart contract systems by improving vulnerability detection and ensuring robust encryption of sensitive data through advanced reputation models and cryptographic techniques.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"371 - 388"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigation of the Effect of the Formation of Subwavelength Microcylinders in the Process of Pulsed Laser Action on the Cr/ZrO2 Bilayer 脉冲激光作用于Cr/ZrO2双分子层过程中亚波长微柱形成影响的研究
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700158
S. D. Poletayev, D. A. Savelyev, G. V. Uspleniev

The effect of the formation of microcylinders during laser treatment (λ = 532 nm) of the surface of a chromium–zirconium dioxide bilayer in a pulsed low-frequency mode is studied. An unusual formation of microcylinders was observed, which was explained by the effect of micro wrinkles. It was found that in this way it is possible to form a quasi-periodic matrix of microcylinders, the size of which is on the order of the diffraction limit, which is approximately 6 times smaller than the effective diameter of the laser spot. It is shown that the matrix of elements can have a fill factor of about 0.5, with a period up to 2.5 times smaller than the diameter of the laser spot. Numerical simulation of diffraction of Gaussian beams and optical vortices with circular polarization on arrays of subwavelength microcylinders has shown that a decrease in the diameter of microcylinders leads to a decrease in the size of the focal spot and light needle for a Gaussian beam, and an increase in height leads to the formation of the main intensity peaks inside the element for both the Gaussian beam and the Laguerre-Gauss mode. Based on the simulation results, a focusing meander matrix of microcylinders with a minimum element size of about 350 nm and a height of 0.21 λ was made.

研究了脉冲低频激光(λ = 532 nm)处理氧化铬-氧化锆双层膜表面时微柱形成的影响。观察到一种不寻常的微柱形成,这可以用微皱的影响来解释。结果表明,用这种方法可以形成微柱的准周期矩阵,其尺寸在衍射极限数量级,比激光光斑的有效直径约小6倍。结果表明,元素矩阵的填充系数约为0.5,周期比激光光斑直径小2.5倍。对高斯光束在亚波长微柱阵列上的衍射和圆偏振光涡的数值模拟表明,微柱直径的减小导致高斯光束的焦点光斑和光针的尺寸减小,高度的增加导致高斯光束和拉盖尔-高斯模式在元件内部形成主强度峰。基于仿真结果,制备了最小单元尺寸约为350 nm,高度为0.21 λ的微柱聚焦曲流矩阵。
{"title":"Investigation of the Effect of the Formation of Subwavelength Microcylinders in the Process of Pulsed Laser Action on the Cr/ZrO2 Bilayer","authors":"S. D. Poletayev,&nbsp;D. A. Savelyev,&nbsp;G. V. Uspleniev","doi":"10.3103/S1060992X25700158","DOIUrl":"10.3103/S1060992X25700158","url":null,"abstract":"<p>The effect of the formation of microcylinders during laser treatment (λ = 532 nm) of the surface of a chromium–zirconium dioxide bilayer in a pulsed low-frequency mode is studied. An unusual formation of microcylinders was observed, which was explained by the effect of micro wrinkles. It was found that in this way it is possible to form a quasi-periodic matrix of microcylinders, the size of which is on the order of the diffraction limit, which is approximately 6 times smaller than the effective diameter of the laser spot. It is shown that the matrix of elements can have a fill factor of about 0.5, with a period up to 2.5 times smaller than the diameter of the laser spot. Numerical simulation of diffraction of Gaussian beams and optical vortices with circular polarization on arrays of subwavelength microcylinders has shown that a decrease in the diameter of microcylinders leads to a decrease in the size of the focal spot and light needle for a Gaussian beam, and an increase in height leads to the formation of the main intensity peaks inside the element for both the Gaussian beam and the Laguerre-Gauss mode. Based on the simulation results, a focusing meander matrix of microcylinders with a minimum element size of about 350 nm and a height of 0.21 λ was made.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"418 - 427"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Optical Memory and Neural Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1