首页 > 最新文献

Optical Memory and Neural Networks最新文献

英文 中文
M3DMap: Object-Aware Multimodal 3D Mapping for Dynamic Environments M3DMap:动态环境的对象感知多模态3D映射
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700092
D. A. Yudin

3D mapping in dynamic environments poses a challenge for modern researchers in robotics and autonomous transportation. There are no universal representations for dynamic 3D scenes that incorporate multimodal data such as images, point clouds, and text. This article takes a step toward solving this problem. It proposes a taxonomy of methods for constructing multimodal 3D maps, classifying contemporary approaches based on scene types and representations, learning methods, and practical applications. Using this taxonomy, a brief structured analysis of recent methods is provided. The article also describes an original modular method called M3DMap, designed for object-aware construction of multimodal 3D maps for both static and dynamic scenes. It consists of several interconnected components: a neural multimodal object segmentation and tracking module; an odometry estimation module, including trainable algorithms; a module for 3D map construction and updating with various implementations depending on the desired scene representation; and a multimodal data retrieval module. The article highlights original implementations of these modules and their advantages in solving various practical tasks, from 3D object grounding to mobile manipulation. Additionally, it presents theoretical propositions demonstrating the positive effect of using multimodal data and modern foundational models in 3D mapping methods. Details of the taxonomy and method implementation are available at https://yuddim.github.io/M3DMap.

动态环境下的三维映射对现代机器人和自动运输研究人员提出了挑战。对于包含多模态数据(如图像、点云和文本)的动态3D场景,没有通用的表示。本文为解决这个问题迈出了一步。它提出了构建多模态3D地图的方法分类,根据场景类型和表示、学习方法和实际应用对当代方法进行分类。使用这种分类法,对最近的方法进行了简要的结构化分析。本文还介绍了一种名为M3DMap的原始模块化方法,用于静态和动态场景的多模态3D地图的对象感知构建。它由几个相互关联的组件组成:一个神经多模态目标分割和跟踪模块;里程计估计模块,包括可训练算法;一个用于3D地图构建和更新的模块,根据所需的场景表示使用各种实现;以及一个多模态数据检索模块。本文重点介绍了这些模块的原始实现及其在解决各种实际任务中的优势,从3D对象接地到移动操作。此外,本文还提出了一些理论命题,证明了在三维制图方法中使用多模态数据和现代基础模型的积极作用。分类法和方法实现的详细信息可在https://yuddim.github.io/M3DMap上获得。
{"title":"M3DMap: Object-Aware Multimodal 3D Mapping for Dynamic Environments","authors":"D. A. Yudin","doi":"10.3103/S1060992X25700092","DOIUrl":"10.3103/S1060992X25700092","url":null,"abstract":"<p>3D mapping in dynamic environments poses a challenge for modern researchers in robotics and autonomous transportation. There are no universal representations for dynamic 3D scenes that incorporate multimodal data such as images, point clouds, and text. This article takes a step toward solving this problem. It proposes a taxonomy of methods for constructing multimodal 3D maps, classifying contemporary approaches based on scene types and representations, learning methods, and practical applications. Using this taxonomy, a brief structured analysis of recent methods is provided. The article also describes an original modular method called M3DMap, designed for object-aware construction of multimodal 3D maps for both static and dynamic scenes. It consists of several interconnected components: a neural multimodal object segmentation and tracking module; an odometry estimation module, including trainable algorithms; a module for 3D map construction and updating with various implementations depending on the desired scene representation; and a multimodal data retrieval module. The article highlights original implementations of these modules and their advantages in solving various practical tasks, from 3D object grounding to mobile manipulation. Additionally, it presents theoretical propositions demonstrating the positive effect of using multimodal data and modern foundational models in 3D mapping methods. Details of the taxonomy and method implementation are available at https://yuddim.github.io/M3DMap.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"285 - 312"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Erratum to: Consumer Behavior Analysis in Social Networking Big Data Using Correlated Extreme Learning 使用相关极限学习的社交网络大数据中的消费者行为分析的勘误
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X2570016X
M. Arumugam, C. Jayanthi
{"title":"Erratum to: Consumer Behavior Analysis in Social Networking Big Data Using Correlated Extreme Learning","authors":"M. Arumugam,&nbsp;C. Jayanthi","doi":"10.3103/S1060992X2570016X","DOIUrl":"10.3103/S1060992X2570016X","url":null,"abstract":"","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"470 - 470"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ElevNav: Large Language Model-Guided Robot Navigation via 3D Scene Graphs in Elevator Environments ElevNav:电梯环境中基于3D场景图的大型语言模型引导机器人导航
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700109
Huzhenyu Zhang

Cross-floor robotic navigation has become an increasingly critical capability for autonomous systems operating in multi-floor buildings. While 3D scene graphs have demonstrated promise for representing hierarchical spatial relationships, current approaches predominantly address cross-floor navigation by stairs, overlooking the practical challenges of elevator-mediated navigation in modern buildings. This paper presents ElevNav, a novel framework that bridges this gap through two key innovations: (1) automatic construction of semantically-rich 3D scene graphs from RGB-D sequences with estimated camera trajectories, and (2) task decomposition using large language models to translate natural language commands into executable action sequences. Our method addresses elevator interaction through specialized action primitives such as pressing buttons, entering and exiting the elevator, and moving toward target objects. We evaluate ElevNav in complex simulated environments built using Isaac Sim, demonstrating robust performance in multi-floor navigation scenarios. To facilitate further research, we release a new dataset containing elevator environments with corresponding scene graph representations, addressing a critical gap in existing 3D navigation benchmarks, which is open-sourced at: https://github.com/zhanghuzhenyu/elevnav.

跨楼层机器人导航已经成为在多层建筑中运行的自主系统越来越重要的能力。虽然3D场景图已经证明了表示分层空间关系的希望,但目前的方法主要是通过楼梯解决跨层导航,忽略了现代建筑中电梯导航的实际挑战。本文介绍了ElevNav,这是一个通过两个关键创新来弥合这一差距的新框架:(1)根据估计的摄像机轨迹从RGB-D序列自动构建语义丰富的3D场景图,以及(2)使用大型语言模型将自然语言命令转换为可执行的动作序列的任务分解。我们的方法通过特殊的动作原语(如按按钮、进入和退出电梯以及向目标对象移动)来处理电梯交互。我们在使用Isaac Sim构建的复杂模拟环境中评估了ElevNav,展示了在多层导航场景中的稳健性能。为了促进进一步的研究,我们发布了一个新的数据集,其中包含具有相应场景图表示的电梯环境,解决了现有3D导航基准的关键差距,该数据集是开源的:https://github.com/zhanghuzhenyu/elevnav。
{"title":"ElevNav: Large Language Model-Guided Robot Navigation via 3D Scene Graphs in Elevator Environments","authors":"Huzhenyu Zhang","doi":"10.3103/S1060992X25700109","DOIUrl":"10.3103/S1060992X25700109","url":null,"abstract":"<p>Cross-floor robotic navigation has become an increasingly critical capability for autonomous systems operating in multi-floor buildings. While 3D scene graphs have demonstrated promise for representing hierarchical spatial relationships, current approaches predominantly address cross-floor navigation by stairs, overlooking the practical challenges of elevator-mediated navigation in modern buildings. This paper presents <b>ElevNav</b>, a novel framework that bridges this gap through two key innovations: (1) automatic construction of semantically-rich 3D scene graphs from RGB-D sequences with estimated camera trajectories, and (2) task decomposition using large language models to translate natural language commands into executable action sequences. Our method addresses elevator interaction through specialized action primitives such as pressing buttons, entering and exiting the elevator, and moving toward target objects. We evaluate ElevNav in complex simulated environments built using Isaac Sim, demonstrating robust performance in multi-floor navigation scenarios. To facilitate further research, we release a new dataset containing elevator environments with corresponding scene graph representations, addressing a critical gap in existing 3D navigation benchmarks, which is open-sourced at: https://github.com/zhanghuzhenyu/elevnav.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"313 - 322"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Skin Disease Diagnosis Using Optimized nnU-Net Segmentation and Hybrid E-Cap Net with UFO-Net 基于优化nnU-Net分割和混合E-Cap网与ufo网的皮肤病诊断
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700134
Y. Lins Joy, S. Jerine

Skin diseases are among the most frequent and pervasive conditions affecting individuals all over the world. The two primary causes of skin cancer are climate change and global warming. If skin conditions are not identified and treated promptly, they may become fatal. Advanced ML and DL approaches on skin diseases often face limitations such as insufficient data diversity, high variability in imaging quality, and challenges in accurately distinguishing between similar-looking conditions. These drawbacks can lead to reduce diagnostic accuracy and generalizability of the models. To overcome the aforementioned challenges, an improved segmentation and hybrid deep learning approach is used to identify numerous kinds of skin disease. Initially, raw images for input are collected from the skin disease image dataset. The collected image is pre-processed with resizing and a Hierarchical Noise Deinterlace Net (HNDN) to remove noise. The pre-processed images are then segmented into different parts or regions using the no new U-Network (nnU-Net). Here, the Marine Predator Algorithm (MPA) is used to choose the nnU-Net learning rate, and batch size optimally. Then, the segmented image is subjected to a hybrid Efficient-capsule network (E-cap Net) and Unified force operation network (UFO-Net) classifier predicting several types of skin disease. An analysis of proposed method’s simulation results indicates that it achieves 97.49% accuracy, 90.06% precision, and 98.56% selectivity. Thus, the proposed method is a most effective method for predicting the multi-type skin disease.

皮肤病是影响全世界个人的最常见和最普遍的疾病之一。皮肤癌的两个主要原因是气候变化和全球变暖。如果没有及时发现和治疗皮肤病,它们可能会致命。皮肤疾病的高级ML和DL方法通常面临诸如数据多样性不足,成像质量高度可变性以及准确区分相似情况的挑战等限制。这些缺点会降低模型的诊断准确性和通用性。为了克服上述挑战,使用改进的分割和混合深度学习方法来识别多种皮肤疾病。首先,从皮肤病图像数据集中收集用于输入的原始图像。对采集到的图像进行预处理,通过调整大小和分层噪声去隔行网络(HNDN)去除噪声。然后使用无新u网络(nnU-Net)将预处理后的图像分割成不同的部分或区域。本文采用海洋捕食者算法(Marine Predator Algorithm, MPA)对nnU-Net学习率和批处理大小进行优化选择。然后,将分割后的图像进行高效胶囊网络(E-cap Net)和统一力操作网络(UFO-Net)混合分类器预测几种类型的皮肤病。仿真结果表明,该方法的准确率为97.49%,精密度为90.06%,选择性为98.56%。因此,该方法是预测多类型皮肤病最有效的方法。
{"title":"Efficient Skin Disease Diagnosis Using Optimized nnU-Net Segmentation and Hybrid E-Cap Net with UFO-Net","authors":"Y. Lins Joy,&nbsp;S. Jerine","doi":"10.3103/S1060992X25700134","DOIUrl":"10.3103/S1060992X25700134","url":null,"abstract":"<p>Skin diseases are among the most frequent and pervasive conditions affecting individuals all over the world. The two primary causes of skin cancer are climate change and global warming. If skin conditions are not identified and treated promptly, they may become fatal. Advanced ML and DL approaches on skin diseases often face limitations such as insufficient data diversity, high variability in imaging quality, and challenges in accurately distinguishing between similar-looking conditions. These drawbacks can lead to reduce diagnostic accuracy and generalizability of the models. To overcome the aforementioned challenges, an improved segmentation and hybrid deep learning approach is used to identify numerous kinds of skin disease. Initially, raw images for input are collected from the skin disease image dataset. The collected image is pre-processed with resizing and a Hierarchical Noise Deinterlace Net (HNDN) to remove noise. The pre-processed images are then segmented into different parts or regions using the no new U-Network (nnU-Net). Here, the Marine Predator Algorithm (MPA) is used to choose the nnU-Net learning rate, and batch size optimally. Then, the segmented image is subjected to a hybrid Efficient-capsule network (E-cap Net) and Unified force operation network (UFO-Net) classifier predicting several types of skin disease. An analysis of proposed method’s simulation results indicates that it achieves 97.49% accuracy, 90.06% precision, and 98.56% selectivity. Thus, the proposed method is a most effective method for predicting the multi-type skin disease.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"402 - 417"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Learner Cognitive Feature Model for Learning Resource Personalizing Recommendation 面向学习资源个性化推荐的学习者认知特征模型
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X24600460
Yongheng Chen,  Chunyan Yin

In this paper, we propose a novel learner cognitive feature model for personalized guidance and push (LCFLM) that traces the evolution of learners’ knowledge proficiency based on their exercising logs in online learning systems. Specifically, we introduce the exercise-aware dependency hierarchical graph of exercise dependency and pattern dependency that can establish a model of exercise dependency relationships. Additionally, we propose the implementation of a forget gating mechanism, which combines the forgetting features with the knowledge state features to predict a student’s learning performance. The experimental results clearly demonstrate that LCFLM achieves the new state-of-the-art performance, exhibiting an improvement of at least 3% in both AUC and ACC. Furthermore, the LCFLM model has the ability to autonomously uncover the fundamental concepts underlying exercises and provides a visual representation of a student’s evolving knowledge state.

在本文中,我们提出了一种新的学习者认知特征模型,用于个性化指导和推送(LCFLM),该模型基于学习者在在线学习系统中的练习日志来跟踪学习者知识熟练程度的演变。具体来说,我们引入了运动依赖的运动感知依赖层次图和模式依赖,可以建立运动依赖关系模型。此外,我们提出了遗忘门控机制的实现,该机制将遗忘特征与知识状态特征相结合,以预测学生的学习表现。实验结果清楚地表明,LCFLM达到了新的最先进的性能,在AUC和ACC方面都有至少3%的提高。此外,LCFLM模型具有自主发现练习背后的基本概念的能力,并提供学生不断发展的知识状态的可视化表示。
{"title":"Learner Cognitive Feature Model for Learning Resource Personalizing Recommendation","authors":"Yongheng Chen,&nbsp; Chunyan Yin","doi":"10.3103/S1060992X24600460","DOIUrl":"10.3103/S1060992X24600460","url":null,"abstract":"<p>In this paper, we propose a novel learner cognitive feature model for personalized guidance and push (LCFLM) that traces the evolution of learners’ knowledge proficiency based on their exercising logs in online learning systems. Specifically, we introduce the exercise-aware dependency hierarchical graph of exercise dependency and pattern dependency that can establish a model of exercise dependency relationships. Additionally, we propose the implementation of a forget gating mechanism, which combines the forgetting features with the knowledge state features to predict a student’s learning performance. The experimental results clearly demonstrate that LCFLM achieves the new state-of-the-art performance, exhibiting an improvement of at least 3% in both AUC and ACC. Furthermore, the LCFLM model has the ability to autonomously uncover the fundamental concepts underlying exercises and provides a visual representation of a student’s evolving knowledge state.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"457 - 469"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reputation-Based Byzantine Fault Tolerance and ElGamal Cryptography with Deep Belief Network on Smart Contract for Secure Blockchain 基于信誉的拜占庭容错和基于深度信任网络的安全区块链智能合约ElGamal加密
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700110
V. Devi, P. Amudha

Blockchain is a secure, decentralized ledger system that records transactions in immutable blocks. A smart contract is a self-executing piece of code on the blockchain that automatically enforces agreements when specific conditions are met. Additionally, once deployed, smart contracts are immutable, making it difficult to fix bugs or vulnerabilities without affecting the entire blockchain. Using machine learning and deep learning techniques, vulnerabilities in smart contract code have been effectively identified. The trained net model is tampered with since the algorithms' learning is not safe. Therefore, a fully homomorphic deep learning algorithm has been developed to detect vulnerabilities in smart contract systems for blockchain in order to safeguard user data. Initially, user data is stored on the blockchain based on a consensus algorithm that evaluates the operations of each node using a reputation model. Reputation-based Byzantine Fault Tolerance (RBFT) enhances security by assessing users' reputations to prevent malicious behaviour and ensure fault tolerance. Reputation values, ranging from 0 to 1, are crucial for establishing trust and reliability in the network. To further optimize RBFT performance, the Secretary Bird Optimization Algorithm is employed. Smart contract data is derived from source code, including the control flow graph and operation code. XLNet and Bi-LSTM are used to extract features from the control flow graph and operation code, which are then trained and tested using ElGamal cryptography with a Deep Belief Network to improve vulnerability detection and enhance security in blockchain-based smart contract systems. The proposed approach provides 98.40% accuracy, 95.40% positive predictive value (PPV), and 98.80% selectivity. This proposed approach enhances blockchain-based smart contract systems by improving vulnerability detection and ensuring robust encryption of sensitive data through advanced reputation models and cryptographic techniques.

区块链是一个安全的、去中心化的账本系统,它将交易记录在不可变的区块中。智能合约是区块链上的一段自动执行的代码,当满足特定条件时,它会自动执行协议。此外,一旦部署,智能合约是不可变的,因此很难在不影响整个区块链的情况下修复错误或漏洞。利用机器学习和深度学习技术,智能合约代码中的漏洞已被有效识别。由于算法的学习不安全,训练后的网络模型会被篡改。因此,开发了一种完全同态深度学习算法来检测区块链智能合约系统中的漏洞,以保护用户数据。最初,用户数据基于共识算法存储在区块链上,该算法使用信誉模型评估每个节点的操作。基于声誉的拜占庭容错(Byzantine Fault Tolerance, RBFT)通过评估用户的声誉来提高安全性,防止恶意行为,保证容错能力。信誉值(从0到1)对于在网络中建立信任和可靠性至关重要。为了进一步优化RBFT性能,采用秘书鸟优化算法。智能合约数据来源于源代码,包括控制流程图和操作代码。XLNet和Bi-LSTM用于从控制流图和操作代码中提取特征,然后使用ElGamal加密技术与深度信念网络进行训练和测试,以改进漏洞检测并增强基于区块链的智能合约系统的安全性。该方法准确率为98.40%,阳性预测值(PPV)为95.40%,选择性为98.80%。该方法通过改进漏洞检测并通过先进的声誉模型和加密技术确保敏感数据的健壮加密,增强了基于区块链的智能合约系统。
{"title":"Reputation-Based Byzantine Fault Tolerance and ElGamal Cryptography with Deep Belief Network on Smart Contract for Secure Blockchain","authors":"V. Devi,&nbsp;P. Amudha","doi":"10.3103/S1060992X25700110","DOIUrl":"10.3103/S1060992X25700110","url":null,"abstract":"<p>Blockchain is a secure, decentralized ledger system that records transactions in immutable blocks. A smart contract is a self-executing piece of code on the blockchain that automatically enforces agreements when specific conditions are met. Additionally, once deployed, smart contracts are immutable, making it difficult to fix bugs or vulnerabilities without affecting the entire blockchain. Using machine learning and deep learning techniques, vulnerabilities in smart contract code have been effectively identified. The trained net model is tampered with since the algorithms' learning is not safe. Therefore, a fully homomorphic deep learning algorithm has been developed to detect vulnerabilities in smart contract systems for blockchain in order to safeguard user data. Initially, user data is stored on the blockchain based on a consensus algorithm that evaluates the operations of each node using a reputation model. Reputation-based Byzantine Fault Tolerance (RBFT) enhances security by assessing users' reputations to prevent malicious behaviour and ensure fault tolerance. Reputation values, ranging from 0 to 1, are crucial for establishing trust and reliability in the network. To further optimize RBFT performance, the Secretary Bird Optimization Algorithm is employed. Smart contract data is derived from source code, including the control flow graph and operation code. XLNet and Bi-LSTM are used to extract features from the control flow graph and operation code, which are then trained and tested using ElGamal cryptography with a Deep Belief Network to improve vulnerability detection and enhance security in blockchain-based smart contract systems. The proposed approach provides 98.40% accuracy, 95.40% positive predictive value (PPV), and 98.80% selectivity. This proposed approach enhances blockchain-based smart contract systems by improving vulnerability detection and ensuring robust encryption of sensitive data through advanced reputation models and cryptographic techniques.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"371 - 388"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigation of the Effect of the Formation of Subwavelength Microcylinders in the Process of Pulsed Laser Action on the Cr/ZrO2 Bilayer 脉冲激光作用于Cr/ZrO2双分子层过程中亚波长微柱形成影响的研究
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700158
S. D. Poletayev, D. A. Savelyev, G. V. Uspleniev

The effect of the formation of microcylinders during laser treatment (λ = 532 nm) of the surface of a chromium–zirconium dioxide bilayer in a pulsed low-frequency mode is studied. An unusual formation of microcylinders was observed, which was explained by the effect of micro wrinkles. It was found that in this way it is possible to form a quasi-periodic matrix of microcylinders, the size of which is on the order of the diffraction limit, which is approximately 6 times smaller than the effective diameter of the laser spot. It is shown that the matrix of elements can have a fill factor of about 0.5, with a period up to 2.5 times smaller than the diameter of the laser spot. Numerical simulation of diffraction of Gaussian beams and optical vortices with circular polarization on arrays of subwavelength microcylinders has shown that a decrease in the diameter of microcylinders leads to a decrease in the size of the focal spot and light needle for a Gaussian beam, and an increase in height leads to the formation of the main intensity peaks inside the element for both the Gaussian beam and the Laguerre-Gauss mode. Based on the simulation results, a focusing meander matrix of microcylinders with a minimum element size of about 350 nm and a height of 0.21 λ was made.

研究了脉冲低频激光(λ = 532 nm)处理氧化铬-氧化锆双层膜表面时微柱形成的影响。观察到一种不寻常的微柱形成,这可以用微皱的影响来解释。结果表明,用这种方法可以形成微柱的准周期矩阵,其尺寸在衍射极限数量级,比激光光斑的有效直径约小6倍。结果表明,元素矩阵的填充系数约为0.5,周期比激光光斑直径小2.5倍。对高斯光束在亚波长微柱阵列上的衍射和圆偏振光涡的数值模拟表明,微柱直径的减小导致高斯光束的焦点光斑和光针的尺寸减小,高度的增加导致高斯光束和拉盖尔-高斯模式在元件内部形成主强度峰。基于仿真结果,制备了最小单元尺寸约为350 nm,高度为0.21 λ的微柱聚焦曲流矩阵。
{"title":"Investigation of the Effect of the Formation of Subwavelength Microcylinders in the Process of Pulsed Laser Action on the Cr/ZrO2 Bilayer","authors":"S. D. Poletayev,&nbsp;D. A. Savelyev,&nbsp;G. V. Uspleniev","doi":"10.3103/S1060992X25700158","DOIUrl":"10.3103/S1060992X25700158","url":null,"abstract":"<p>The effect of the formation of microcylinders during laser treatment (λ = 532 nm) of the surface of a chromium–zirconium dioxide bilayer in a pulsed low-frequency mode is studied. An unusual formation of microcylinders was observed, which was explained by the effect of micro wrinkles. It was found that in this way it is possible to form a quasi-periodic matrix of microcylinders, the size of which is on the order of the diffraction limit, which is approximately 6 times smaller than the effective diameter of the laser spot. It is shown that the matrix of elements can have a fill factor of about 0.5, with a period up to 2.5 times smaller than the diameter of the laser spot. Numerical simulation of diffraction of Gaussian beams and optical vortices with circular polarization on arrays of subwavelength microcylinders has shown that a decrease in the diameter of microcylinders leads to a decrease in the size of the focal spot and light needle for a Gaussian beam, and an increase in height leads to the formation of the main intensity peaks inside the element for both the Gaussian beam and the Laguerre-Gauss mode. Based on the simulation results, a focusing meander matrix of microcylinders with a minimum element size of about 350 nm and a height of 0.21 λ was made.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"418 - 427"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073618","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient Neural Network Method for Rumour Detection over Social Media 社交媒体谣言检测的高效神经网络方法
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X24601775
Manya Gidwani,  Ashwini Rao

Social media rumours significantly challenge societal discourse, demanding effective detection mechanisms. Existing automated rumour detection methods primarily rely on topological data, yet computational complexity and managing large datasets remain formidable obstacles. This study proposes a novel neural network approach utilising graphical structures to address these challenges and enhance rumour detection efficiency. This study suggests a novel neural network approach to improve rumour detection efficiency using graphical structures from the PHEME dataset. The strategy aims to improve classifier performance by transforming tweeting graphs into distinct binary trees, enabling the learning of structural information’s propagation and dispersion. This makes it possible to build meta-tree paths that record and capture local structural information. The model learns global structural representations using BERT on these pathways. The approach also incorporates user relationships and content associations utilizing a bidirectional graph convolutional network encoder to learn node-level representations. The final node-level representation is synthesised by combining user and content embeddings. A fusion approach combines the structural and node-level representations, passing through a fully connected layer and a Softmax layer for rumour detection. This proposed model outperforms the existing models, with an accuracy of over 93% without cross-validation and more than 95% with cross-validation. Experimental validation demonstrates the effectiveness of the suggested approach in rumour detection over social media, offering a promising solution to mitigate the impact of misinformation and rumours in online discourse.

社交媒体谣言极大地挑战了社会话语,需要有效的检测机制。现有的自动化谣言检测方法主要依赖于拓扑数据,但计算复杂性和管理大型数据集仍然是巨大的障碍。本研究提出了一种新的神经网络方法,利用图形结构来解决这些挑战并提高谣言检测效率。本研究提出了一种新的神经网络方法,利用来自PHEME数据集的图形结构来提高谣言检测效率。该策略旨在通过将tweet图转换为不同的二叉树,从而学习结构信息的传播和分散,从而提高分类器的性能。这使得构建记录和捕获局部结构信息的元树路径成为可能。该模型在这些路径上使用BERT学习全局结构表示。该方法还结合了用户关系和内容关联,利用双向图卷积网络编码器来学习节点级表示。最后的节点级表示通过结合用户嵌入和内容嵌入来合成。融合方法结合了结构和节点级表示,通过完全连接层和Softmax层进行谣言检测。该模型优于现有模型,未经交叉验证的准确率超过93%,交叉验证的准确率超过95%。实验验证证明了所建议的方法在社交媒体谣言检测中的有效性,为减轻在线话语中的错误信息和谣言的影响提供了一个有希望的解决方案。
{"title":"Efficient Neural Network Method for Rumour Detection over Social Media","authors":"Manya Gidwani,&nbsp; Ashwini Rao","doi":"10.3103/S1060992X24601775","DOIUrl":"10.3103/S1060992X24601775","url":null,"abstract":"<p>Social media rumours significantly challenge societal discourse, demanding effective detection mechanisms. Existing automated rumour detection methods primarily rely on topological data, yet computational complexity and managing large datasets remain formidable obstacles. This study proposes a novel neural network approach utilising graphical structures to address these challenges and enhance rumour detection efficiency. This study suggests a novel neural network approach to improve rumour detection efficiency using graphical structures from the PHEME dataset. The strategy aims to improve classifier performance by transforming tweeting graphs into distinct binary trees, enabling the learning of structural information’s propagation and dispersion. This makes it possible to build meta-tree paths that record and capture local structural information. The model learns global structural representations using BERT on these pathways. The approach also incorporates user relationships and content associations utilizing a bidirectional graph convolutional network encoder to learn node-level representations. The final node-level representation is synthesised by combining user and content embeddings. A fusion approach combines the structural and node-level representations, passing through a fully connected layer and a Softmax layer for rumour detection. This proposed model outperforms the existing models, with an accuracy of over 93% without cross-validation and more than 95% with cross-validation. Experimental validation demonstrates the effectiveness of the suggested approach in rumour detection over social media, offering a promising solution to mitigate the impact of misinformation and rumours in online discourse.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"428 - 440"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on the Solution of the Problem of Detecting Key Points of an Object from a Single Image Using Deep Neural Networks 基于深度神经网络的单幅图像中目标关键点检测问题的研究
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X24601957
G. Algashev, V. Kuzina, A. Kupriyanov

This paper addresses the problem of object keypoint detection from a single image using modern machine learning methods. Keypoint detection has been extensively studied for human pose estimation, and thus, the study compares deep convolutional neural networks that effectively solve this task. Given the challenge of adapting methods to different object types, special attention is paid to automating the preparation of training data. A novel approach is presented, which includes generating datasets based on 3D models, automatically annotating keypoints, and capturing images of objects from various angles, scales, backgrounds, and lighting conditions. The study investigates which modern deep neural networks are the most effective for keypoint detection and explores the applicability of models trained on synthetic data to real-world scenarios.

本文使用现代机器学习方法解决了从单个图像中检测目标关键点的问题。关键点检测已被广泛研究用于人体姿态估计,因此,本研究比较了有效解决该任务的深度卷积神经网络。考虑到使方法适应不同对象类型的挑战,特别关注训练数据的自动化准备。提出了一种基于三维模型生成数据集、自动标注关键点以及从不同角度、尺度、背景和光照条件下捕获物体图像的新方法。该研究调查了哪些现代深度神经网络对关键点检测最有效,并探索了在合成数据上训练的模型对现实世界场景的适用性。
{"title":"Research on the Solution of the Problem of Detecting Key Points of an Object from a Single Image Using Deep Neural Networks","authors":"G. Algashev,&nbsp;V. Kuzina,&nbsp;A. Kupriyanov","doi":"10.3103/S1060992X24601957","DOIUrl":"10.3103/S1060992X24601957","url":null,"abstract":"<p>This paper addresses the problem of object keypoint detection from a single image using modern machine learning methods. Keypoint detection has been extensively studied for human pose estimation, and thus, the study compares deep convolutional neural networks that effectively solve this task. Given the challenge of adapting methods to different object types, special attention is paid to automating the preparation of training data. A novel approach is presented, which includes generating datasets based on 3D models, automatically annotating keypoints, and capturing images of objects from various angles, scales, backgrounds, and lighting conditions. The study investigates which modern deep neural networks are the most effective for keypoint detection and explores the applicability of models trained on synthetic data to real-world scenarios.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"364 - 370"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073787","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Сonfluent Regions Packing of Coarsened Errors for Iterative Approximation of Multispectral Images Сonfluent多光谱图像迭代逼近中粗化误差的区域填充
IF 0.8 Q4 OPTICS Pub Date : 2025-09-17 DOI: 10.3103/S1060992X25700171
M. V. Gashnikov

The paper investigates an iterative approximation-based method of compressing discrete multispectral images. Less thinned multispectral images are used to approximate more thinned ones, the degree of thinning decreasing in an iterrative fashion. When a set of thinned multispectral images is used, the data redundency is eliminated by using nonredundant nested covers consisted of specially reduced thinned images. Approximation errors are rounded and stored. The paper considers an algorithm of detection and effective representation of degenerate subsets of rounded iterative approximation errors. The algotithm allows more efficient representation of rounded error subsets and higher data compression ratios. The computational experiment confirms a considerable increase in efficiency of the iterative approximation-based method of discrete multispectral data compression.

研究了一种基于迭代逼近的离散多光谱图像压缩方法。稀化程度较低的多光谱图像被用来近似稀化程度较高的多光谱图像,稀化程度以迭代的方式递减。当使用一组稀疏的多光谱图像时,通过使用由特别减少的稀疏图像组成的非冗余嵌套覆盖来消除数据冗余。近似误差被四舍五入并存储。本文研究了一种舍入迭代逼近误差退化子集的检测和有效表示算法。该算法允许更有效地表示舍入误差子集和更高的数据压缩比。计算实验证实了基于迭代逼近的离散多光谱数据压缩方法的效率有较大提高。
{"title":"Сonfluent Regions Packing of Coarsened Errors for Iterative Approximation of Multispectral Images","authors":"M. V. Gashnikov","doi":"10.3103/S1060992X25700171","DOIUrl":"10.3103/S1060992X25700171","url":null,"abstract":"<p>The paper investigates an iterative approximation-based method of compressing discrete multispectral images. Less thinned multispectral images are used to approximate more thinned ones, the degree of thinning decreasing in an iterrative fashion. When a set of thinned multispectral images is used, the data redundency is eliminated by using nonredundant nested covers consisted of specially reduced thinned images. Approximation errors are rounded and stored. The paper considers an algorithm of detection and effective representation of degenerate subsets of rounded iterative approximation errors. The algotithm allows more efficient representation of rounded error subsets and higher data compression ratios. The computational experiment confirms a considerable increase in efficiency of the iterative approximation-based method of discrete multispectral data compression.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 3","pages":"358 - 363"},"PeriodicalIF":0.8,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145073829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Optical Memory and Neural Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1