首页 > 最新文献

Optical Memory and Neural Networks最新文献

英文 中文
Consumer Behavior Analysis in Social Networking Big Data Using Correlated Extreme Learning 基于关联极限学习的社交网络大数据消费者行为分析
IF 1 Q4 OPTICS Pub Date : 2025-04-16 DOI: 10.3103/S1060992X24700875
M. Arumugam, C. Jayanthi

Scrutiny of consumer tweets posted on social media is found to be indispensable for numerous business applications. In this manner, the model of big data analytics is applied in processing data and analyzes it to predict consumer behavioral patterns on social media. Different machine learning algorithms have gathered consumer data to analysis consumer behavior. Conventional methods are unable to discover extreme hidden patterns and require to be enhanced to produce more accurate behavioral patterns. In this work a hybrid method called, proposed Bouldin Correlation Clustering and Gradient Extreme Learning Machine (BCC-GELM) method to perform the consumer behavior analysis in social network with big data. The BCC-GELM method in hybrid model split into two modules. At first, Davis-Bouldin Index-based Correlation Clustering selects clusters with most edges within clusters as positive (i.e., similar information) while most edges between clusters as negative (i.e., dissimilar information), therefore minimizing the error rate. Consumer previous behavioral characteristics and twitter messages are analyzed by means of focal points (i.e., cluster center) via Davis-Bouldin Index. Subsequently, Stochastic Gradient Descent Extreme Learning Machine yields good results by considering distribution of tweets, therefore paving way for predicting consumer behavioral patterns in an optimal manner. The performance of BCC-GELM method is evaluated using experimental analysis and comparison is also made with traditional consumer behavioral pattern methods. The findings demonstrate that BCC-GELM method performs well than the traditional consumer behavioral pattern methods in terms of 9% of clustering accuracy, 45 and 54% of clustering time using without and with preprocessing (percent), 23% of clustering overhead and 46% of error rate.

人们发现,对社交媒体上发布的消费者推文进行审查,对于许多商业应用来说是必不可少的。通过这种方式,运用大数据分析模型对数据进行处理和分析,从而预测消费者在社交媒体上的行为模式。不同的机器学习算法收集消费者数据来分析消费者行为。传统的方法无法发现极端的隐藏模式,需要改进以产生更准确的行为模式。本文提出了一种名为Bouldin相关聚类和梯度极限学习机(BCC-GELM)的混合方法,用于利用大数据进行社交网络中的消费者行为分析。混合模型中的BCC-GELM方法分为两个模块。首先,基于Davis-Bouldin指数的相关聚类选择聚类内大多数边为正(即信息相似)而聚类之间大多数边为负(即信息不相似)的聚类,从而使错误率最小化。通过Davis-Bouldin指数,利用焦点(即聚类中心)分析消费者之前的行为特征和twitter信息。随后,随机梯度下降极限学习机通过考虑推文的分布得到了很好的结果,从而为以最优方式预测消费者的行为模式铺平了道路。通过实验分析对BCC-GELM方法的性能进行了评价,并与传统的消费者行为模式方法进行了比较。结果表明,BCC-GELM方法比传统的消费者行为模式方法在以下方面表现得更好:9%的聚类准确率,45%和54%的聚类时间,23%的聚类开销和46%的错误率。
{"title":"Consumer Behavior Analysis in Social Networking Big Data Using Correlated Extreme Learning","authors":"M. Arumugam,&nbsp;C. Jayanthi","doi":"10.3103/S1060992X24700875","DOIUrl":"10.3103/S1060992X24700875","url":null,"abstract":"<p>Scrutiny of consumer tweets posted on social media is found to be indispensable for numerous business applications. In this manner, the model of big data analytics is applied in processing data and analyzes it to predict consumer behavioral patterns on social media. Different machine learning algorithms have gathered consumer data to analysis consumer behavior. Conventional methods are unable to discover extreme hidden patterns and require to be enhanced to produce more accurate behavioral patterns. In this work a hybrid method called, proposed Bouldin Correlation Clustering and Gradient Extreme Learning Machine (BCC-GELM) method to perform the consumer behavior analysis in social network with big data. The BCC-GELM method in hybrid model split into two modules. At first, Davis-Bouldin Index-based Correlation Clustering selects clusters with most edges within clusters as positive (i.e., similar information) while most edges between clusters as negative (i.e., dissimilar information), therefore minimizing the error rate. Consumer previous behavioral characteristics and twitter messages are analyzed by means of focal points (i.e., cluster center) via Davis-Bouldin Index. Subsequently, Stochastic Gradient Descent Extreme Learning Machine yields good results by considering distribution of tweets, therefore paving way for predicting consumer behavioral patterns in an optimal manner. The performance of BCC-GELM method is evaluated using experimental analysis and comparison is also made with traditional consumer behavioral pattern methods. The findings demonstrate that BCC-GELM method performs well than the traditional consumer behavioral pattern methods in terms of 9% of clustering accuracy, 45 and 54% of clustering time using without and with preprocessing (percent), 23% of clustering overhead and 46% of error rate.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 1","pages":"1 - 17"},"PeriodicalIF":1.0,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143840428","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Monte Carlo Model for Describing Photon Interactions with Biological Tissue in New Approaches of Polarization-Sensitive Optical Coherence Tomography 偏振敏感光学相干层析成像新方法中描述光子与生物组织相互作用的蒙特卡罗模型
IF 1 Q4 OPTICS Pub Date : 2025-04-16 DOI: 10.3103/S1060992X24602045
O. V. Angelsky, C. Yu. Zenkova, D. I. Ivanskyi, Yu. Ursuliak

This work presents results from using a Monte Carlo model to describe photon interactions with a scattering and absorbing medium, exemplified by the eye cornea in polarization-sensitive optical coherence tomography (PS-OCT) approaches. The interaction of an incident photon packet with a weakly scattering birefringent object was analyzed using the meridian plane Monte Carlo approach, which made it possible to take into account the depolarization of radiation during interaction with the scattering centers of the eye corneal epithelium and to increase the signal-to-noise ratio of object information. The dynamic and geometric phase reconstruction in a modified Mach-Zehnder interferometer scheme allows to obtain data of collagen fibers orientation non-invasive, to restore lost information of the birefringent object structure. The result of this reconstruction is a complete picture of the stromal structure with an accuracy that surpasses current levels achieved with existing PS-OCT systems.

这项工作展示了使用蒙特卡洛模型描述光子与散射和吸收介质相互作用的结果,以偏振敏感光学相干断层扫描(PS-OCT)方法中的眼角膜为例。利用子午平面蒙特卡洛方法分析了入射光子包与弱散射双折射物体之间的相互作用,从而考虑到了辐射在与眼睛角膜上皮散射中心相互作用过程中的去极化现象,并提高了物体信息的信噪比。在改进的马赫-泽恩德干涉仪方案中进行动态和几何相位重建,可以获得胶原纤维定向的非侵入性数据,恢复丢失的双折射物体结构信息。这种重建的结果是完整的基质结构图,其精确度超过了现有 PS-OCT 系统的现有水平。
{"title":"Monte Carlo Model for Describing Photon Interactions with Biological Tissue in New Approaches of Polarization-Sensitive Optical Coherence Tomography","authors":"O. V. Angelsky,&nbsp;C. Yu. Zenkova,&nbsp;D. I. Ivanskyi,&nbsp;Yu. Ursuliak","doi":"10.3103/S1060992X24602045","DOIUrl":"10.3103/S1060992X24602045","url":null,"abstract":"<p>This work presents results from using a Monte Carlo model to describe photon interactions with a scattering and absorbing medium, exemplified by the eye cornea in polarization-sensitive optical coherence tomography (PS-OCT) approaches. The interaction of an incident photon packet with a weakly scattering birefringent object was analyzed using the meridian plane Monte Carlo approach, which made it possible to take into account the depolarization of radiation during interaction with the scattering centers of the eye corneal epithelium and to increase the signal-to-noise ratio of object information. The dynamic and geometric phase reconstruction in a modified Mach-Zehnder interferometer scheme allows to obtain data of collagen fibers orientation non-invasive, to restore lost information of the birefringent object structure. The result of this reconstruction is a complete picture of the stromal structure with an accuracy that surpasses current levels achieved with existing PS-OCT systems.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"34 1","pages":"30 - 48"},"PeriodicalIF":1.0,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143840449","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Magnetic Field-Controlled Phase Transitions in Antiferromagnetic Structures 反铁磁结构中的磁场控制相变
IF 1 Q4 OPTICS Pub Date : 2025-02-03 DOI: 10.3103/S1060992X24700486
V. I. Egorov, B. V. Kryzhanovsky

The properties of an antiferromagnetic substance are investigated in the presence of a magnetic field. Analytical expressions are obtained in terms of the mean-field approximation. An external magnetic field is shown to be non-destructive to the phase transition in the antiferromagnetic substance. It only changes critical exponents and shifts the critical point. This allows us to control the critical properties of the system. The number of critical points can vary from one (the second-order phase transition) to four (two first-order phase transitions and two second-order phase transitions). It is shown that variations in the magnetic field magnitude can raise the critical temperature by three-odd times in materials with strong antiferromagnetic interactions. A Monte Carlo simulation carried out for a three-dimensional lattice with a finite interaction radius substantiates that the action of an external field brings about a shift in the temperature of the transition. The simulation results agree well with the analytical expressions of the mean field theory.

研究了磁场作用下反铁磁物质的性质。用平均场近似得到了解析表达式。外磁场对反铁磁性物质的相变无破坏作用。它只是改变临界指数和移动临界点。这使我们能够控制系统的关键特性。临界点的数目可以从一个(二阶相变)到四个(两个一阶相变和两个二阶相变)不等。结果表明,在具有强反铁磁相互作用的材料中,磁场大小的变化可使临界温度提高3倍以上。对具有有限相互作用半径的三维晶格进行了蒙特卡罗模拟,证实了外场的作用会引起相变温度的变化。模拟结果与平均场理论的解析表达式吻合较好。
{"title":"Magnetic Field-Controlled Phase Transitions in Antiferromagnetic Structures","authors":"V. I. Egorov,&nbsp;B. V. Kryzhanovsky","doi":"10.3103/S1060992X24700486","DOIUrl":"10.3103/S1060992X24700486","url":null,"abstract":"<p>The properties of an antiferromagnetic substance are investigated in the presence of a magnetic field. Analytical expressions are obtained in terms of the mean-field approximation. An external magnetic field is shown to be non-destructive to the phase transition in the antiferromagnetic substance. It only changes critical exponents and shifts the critical point. This allows us to control the critical properties of the system. The number of critical points can vary from one (the second-order phase transition) to four (two first-order phase transitions and two second-order phase transitions). It is shown that variations in the magnetic field magnitude can raise the critical temperature by three-odd times in materials with strong antiferromagnetic interactions. A Monte Carlo simulation carried out for a three-dimensional lattice with a finite interaction radius substantiates that the action of an external field brings about a shift in the temperature of the transition. The simulation results agree well with the analytical expressions of the mean field theory.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"33 4","pages":"401 - 410"},"PeriodicalIF":1.0,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143108125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimized Jordan Neural Network and Bandwidth Aware Routing Protocol for Congestion Prediction and Avoidance in IOT for Effective Communication 优化的Jordan神经网络和带宽感知路由协议用于物联网中有效通信的拥塞预测和避免
IF 1 Q4 OPTICS Pub Date : 2025-02-03 DOI: 10.3103/S1060992X24700838
Mallavalli Raghavendra Suma, Bhosale Rajkumar Shankarrao, Adapa Gopi, Nilesh U. Sambhe, Laxmikant Umate

Development of 5G internet in today’s trend leads to the evaluation of many IOT devices. The information is transmitted by a network in IOT to store the data in the cloud. Due to the wide usage of IOT devices by people, congestion may occurs in IOT networks, which delays the information or sometimes resulting in data loss despite the implementation of congestion control methods. So many machine learning and congestion control protocols are used to predict and avoid congestion in IOT network. But these existing systems consist of drawbacks such as accuracy drop for prediction, packet loss and time delay. Hence, the Bandwidth Aware Routing Strategy (BARS) protocol using Jordan Neural Network (JNN) was developed to predict and avoid congestion in the network. Initially, the IOT nodes are deployed and the data are collected and preprocessed using a sigmoidal function and Extreme Learning machine to improve the quality of the original data. Then extract the features from the pre-processed data using Locality Preserving Projection (LPP). After that, Jordan Neural Network is used for congestion prediction and pine cone optimization is used to tune the hyper parameters such as learning rate and batch size which is utilized to improve the classifier performance. Then, BARS protocol is used to avoid the congestion present in the IOT network. According to the experimental approach, the proposed techniques achieves 95.45% of Accuracy, 95.71% of Precision, 95.39% of F1-Scorce and 95.02 of specificity. Thus, the congestion and avoidance of Information in the IOT network is processed in high efficiency by using this proposed approach.

5G互联网的发展在当今的趋势下,导致了许多物联网设备的评估。信息通过物联网中的网络传输,将数据存储在云中。由于人们对物联网设备的广泛使用,物联网网络中可能会出现拥塞,即使实施拥塞控制方法,也会导致信息延迟,有时甚至导致数据丢失。因此,许多机器学习和拥塞控制协议被用于预测和避免物联网网络中的拥塞。但是这些现有的系统存在预测精度下降、丢包和时间延迟等缺点。为此,提出了利用约旦神经网络(JNN)来预测和避免网络拥塞的带宽感知路由策略(Bandwidth - Aware Routing Strategy, BARS)协议。首先,部署物联网节点,使用s型函数和极限学习机对数据进行收集和预处理,以提高原始数据的质量。然后利用局域保持投影(Locality Preserving Projection, LPP)从预处理数据中提取特征。然后利用Jordan神经网络进行拥塞预测,利用松果优化对学习率、批处理大小等超参数进行调整,提高分类器的性能。然后,使用BARS协议来避免物联网网络中存在的拥塞。根据实验方法,所提出的技术达到95.45%的准确度、95.71%的精密度、95.39%的F1-Scorce和95.02的特异性。因此,采用该方法可以高效地处理物联网网络中的拥塞和信息回避问题。
{"title":"Optimized Jordan Neural Network and Bandwidth Aware Routing Protocol for Congestion Prediction and Avoidance in IOT for Effective Communication","authors":"Mallavalli Raghavendra Suma,&nbsp;Bhosale Rajkumar Shankarrao,&nbsp;Adapa Gopi,&nbsp;Nilesh U. Sambhe,&nbsp;Laxmikant Umate","doi":"10.3103/S1060992X24700838","DOIUrl":"10.3103/S1060992X24700838","url":null,"abstract":"<p>Development of 5G internet in today’s trend leads to the evaluation of many IOT devices. The information is transmitted by a network in IOT to store the data in the cloud. Due to the wide usage of IOT devices by people, congestion may occurs in IOT networks, which delays the information or sometimes resulting in data loss despite the implementation of congestion control methods. So many machine learning and congestion control protocols are used to predict and avoid congestion in IOT network. But these existing systems consist of drawbacks such as accuracy drop for prediction, packet loss and time delay. Hence, the Bandwidth Aware Routing Strategy (BARS) protocol using Jordan Neural Network (JNN) was developed to predict and avoid congestion in the network. Initially, the IOT nodes are deployed and the data are collected and preprocessed using a sigmoidal function and Extreme Learning machine to improve the quality of the original data. Then extract the features from the pre-processed data using Locality Preserving Projection (LPP). After that, Jordan Neural Network is used for congestion prediction and pine cone optimization is used to tune the hyper parameters such as learning rate and batch size which is utilized to improve the classifier performance. Then, BARS protocol is used to avoid the congestion present in the IOT network. According to the experimental approach, the proposed techniques achieves 95.45% of Accuracy, 95.71% of Precision, 95.39% of F1-Scorce and 95.02 of specificity. Thus, the congestion and avoidance of Information in the IOT network is processed in high efficiency by using this proposed approach.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"33 4","pages":"429 - 446"},"PeriodicalIF":1.0,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143108221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced Personality Prediction Using Knowledge Distillation with BERT: A Focus on MBTI 基于BERT的知识精馏增强人格预测:以MBTI为中心
IF 1 Q4 OPTICS Pub Date : 2025-02-03 DOI: 10.3103/S1060992X2470084X
Suman A. Patil, Shivleela Patil, Vijayalaxmi V. Tadkal

A person’s personality comprises a range of behaviours, attitudes, and emotional patterns that shift throughout time due to ecological and biological influences. Personality prediction from the MBTI dataset poses computational efficiency, memory utilisation, and class imbalance challenges. This study proposes a novel approach leveraging Knowledge Distillation-based BERT to address these challenges. The process involves three stages: pre-processing, feature extraction, and classification. Initially, data is cleaned by removing irrelevant characters and URLs, followed by tokenisation and conversion to lowercase for consistency. The padding ensures uniform input size for DistilBERT, with attention masks aiding focus on relevant tokens. DistilBERT extracts contextual embeddings, enhanced by segment and positional embeddings, capturing semantic meaning via multi-head self-attention. A fully connected layer with GELU activation and batch normalisation mitigates overfitting, followed by a classification layer with Sparsemax activation, addressing the class imbalance. Fine-tuning pre-trained DistilBERT maximises detection accuracy while excluding irrelevant learning objectives. Dynamic masking during inference replaces static masking, and the Radam optimiser optimises hyperparameters for improved convergence. Our approach offers a robust solution that achieves 93% accuracy and 95% F1-score for accurate personality prediction while mitigating computational complexities and class imbalance issues.

一个人的个性包括一系列的行为、态度和情感模式,这些行为、态度和情感模式由于生态和生物的影响而随着时间的推移而变化。来自MBTI数据集的人格预测提出了计算效率、内存利用率和类别不平衡的挑战。本研究提出了一种利用基于知识蒸馏的BERT来解决这些挑战的新方法。该过程包括预处理、特征提取和分类三个阶段。最初,通过删除不相关的字符和url来清理数据,然后进行标记化并将其转换为小写字母以保持一致性。填充确保了对DistilBERT的统一输入大小,注意掩码帮助关注相关的标记。蒸馏器提取上下文嵌入,通过片段和位置嵌入增强,通过多头自注意捕获语义。一个具有GELU激活和批归一化的完全连接层减轻了过拟合,然后是一个具有Sparsemax激活的分类层,解决了类不平衡问题。微调预训练蒸馏器最大限度地提高检测精度,同时排除不相关的学习目标。在推理过程中的动态掩蔽取代了静态掩蔽,并且Radam优化器优化了超参数以提高收敛性。我们的方法提供了一个强大的解决方案,在减少计算复杂性和职业不平衡问题的同时,实现了93%的准确率和95%的f1分数的准确人格预测。
{"title":"Enhanced Personality Prediction Using Knowledge Distillation with BERT: A Focus on MBTI","authors":"Suman A. Patil,&nbsp;Shivleela Patil,&nbsp;Vijayalaxmi V. Tadkal","doi":"10.3103/S1060992X2470084X","DOIUrl":"10.3103/S1060992X2470084X","url":null,"abstract":"<p>A person’s personality comprises a range of behaviours, attitudes, and emotional patterns that shift throughout time due to ecological and biological influences. Personality prediction from the MBTI dataset poses computational efficiency, memory utilisation, and class imbalance challenges. This study proposes a novel approach leveraging Knowledge Distillation-based BERT to address these challenges. The process involves three stages: pre-processing, feature extraction, and classification. Initially, data is cleaned by removing irrelevant characters and URLs, followed by tokenisation and conversion to lowercase for consistency. The padding ensures uniform input size for DistilBERT, with attention masks aiding focus on relevant tokens. DistilBERT extracts contextual embeddings, enhanced by segment and positional embeddings, capturing semantic meaning via multi-head self-attention. A fully connected layer with GELU activation and batch normalisation mitigates overfitting, followed by a classification layer with Sparsemax activation, addressing the class imbalance. Fine-tuning pre-trained DistilBERT maximises detection accuracy while excluding irrelevant learning objectives. Dynamic masking during inference replaces static masking, and the Radam optimiser optimises hyperparameters for improved convergence. Our approach offers a robust solution that achieves 93% accuracy and 95% F1-score for accurate personality prediction while mitigating computational complexities and class imbalance issues.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"33 4","pages":"455 - 465"},"PeriodicalIF":1.0,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143108211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sign Language Video Generation from Text Using Generative Adversarial Networks 使用生成对抗网络从文本生成手语视频
IF 1 Q4 OPTICS Pub Date : 2025-02-03 DOI: 10.3103/S1060992X24700851
R. Sreemathy, Param Chordiya, Soumya Khurana, Mousami Turuk

This work presents a technique developed by utilizing Generative Adversarial Networks (GANs) to generate Sign Language videos. Sign Language is the main mode of communication for people in the hearing impaired community. The process of teaching sign language is difficult as there are not a lot of tools available for this purpose. Generative artificial intelligence can be very helpful for this task as it is able to learn from the limited data and is able to generate various images and videos. In this work, Conditional GANs (cGANs) were employed to generate videos for Indian Sign Language (ISL) based on a text input. It is found that the results obtained from cGANs exhibit superior quality and control based on the performance metrics such as SSIM, FID and MSE values. The effectiveness of the cGANs in generating accurate and visually appealing sign language videos highlights their potential for teaching sign language and improving sign language communication systems.

这项工作提出了一种利用生成对抗网络(GANs)来生成手语视频的技术。手语是听障人群的主要交流方式。手语教学的过程是困难的,因为没有很多工具可用于此目的。生成式人工智能对于这项任务非常有帮助,因为它能够从有限的数据中学习,并能够生成各种图像和视频。在这项工作中,使用条件gan (cgan)基于文本输入为印度手语(ISL)生成视频。研究发现,基于SSIM、FID和MSE值等性能指标,cgan获得的结果具有优越的质量和可控性。cgan在制作准确且视觉上吸引人的手语视频方面的有效性凸显了它们在手语教学和改进手语交流系统方面的潜力。
{"title":"Sign Language Video Generation from Text Using Generative Adversarial Networks","authors":"R. Sreemathy,&nbsp;Param Chordiya,&nbsp;Soumya Khurana,&nbsp;Mousami Turuk","doi":"10.3103/S1060992X24700851","DOIUrl":"10.3103/S1060992X24700851","url":null,"abstract":"<p>This work presents a technique developed by utilizing Generative Adversarial Networks (GANs) to generate Sign Language videos. Sign Language is the main mode of communication for people in the hearing impaired community. The process of teaching sign language is difficult as there are not a lot of tools available for this purpose. Generative artificial intelligence can be very helpful for this task as it is able to learn from the limited data and is able to generate various images and videos. In this work, Conditional GANs (cGANs) were employed to generate videos for Indian Sign Language (ISL) based on a text input. It is found that the results obtained from cGANs exhibit superior quality and control based on the performance metrics such as SSIM, FID and MSE values. The effectiveness of the cGANs in generating accurate and visually appealing sign language videos highlights their potential for teaching sign language and improving sign language communication systems.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"33 4","pages":"466 - 476"},"PeriodicalIF":1.0,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143108289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advanced Attention-Based Pre-Trained Transfer Learning Model for Accurate Brain Tumor Detection and Classification from MRI Images 基于注意力的高级预训练迁移学习模型用于MRI图像中脑肿瘤的准确检测和分类
IF 1 Q4 OPTICS Pub Date : 2025-02-03 DOI: 10.3103/S1060992X24700863
A. Priya, V. Vasudevan

Brain tumor identification using MRI images involves the detailed examination of brain tissues to detect and characterize tumors. Conventional ML and DL algorithms sometimes encounter difficulties due to a lack of labelled data, resulting in inferior performance and poor generalization. To address these issues, this study introduces an Advanced Attention-based Pre-trained Transfer Learning (TL) model that enhances accuracy and resilience in identifying and categorizing brain tumors using MRI images. The methodology starts with pre-processing, which includes image scaling and noise reduction with an adaptive median filter. After pre-processing, the images are fed into a CNN-based framework called Pre-trained Attention-fused Image SpectraNet. This framework comprises of five convolutional layers, after which Rectified Linear Unit (ReLU) activation and pooling layers are added to learn progressively more complex features. A novel self-attention layer is implemented to capture deep features that reveal aberrant tissue patterns, hence increasing model interpretability and accuracy. A globally average pooling layer is employed to reduce computational complexity, and it is accompanied by a fully connected layer with batch normalization to assure stability and convergence during training. The last layer uses softmax to categorize normal, pituitary, glioma, and meningioma. Utilizing the Adam optimizer, the suggested approach enhances performance, yielding excellent metrics such as 98.33% accuracy, 98.35% precision, 98.28% recall, and a 98.31% F1-score. These measures show considerable increases over existing ML and DL methods, demonstrating the system’s ability to improve brain tumor detection accuracy. The advancement of these treatments has significant implications for medical professionals who specialize in the timely identification of brain tumors.

使用核磁共振成像识别脑肿瘤涉及对脑组织的详细检查,以检测和表征肿瘤。传统的ML和DL算法有时会因为缺乏标记数据而遇到困难,从而导致性能差和泛化差。为了解决这些问题,本研究引入了一种基于高级注意的预训练迁移学习(TL)模型,该模型可以提高使用MRI图像识别和分类脑肿瘤的准确性和弹性。该方法从预处理开始,其中包括图像缩放和自适应中值滤波器的降噪。经过预处理后,这些图像被输入到一个基于cnn的框架中,这个框架被称为“预训练的注意力融合图像频谱网”。该框架由5个卷积层组成,之后加入ReLU激活层和池化层,逐步学习更复杂的特征。实现了一种新的自注意层来捕获揭示异常组织模式的深层特征,从而提高了模型的可解释性和准确性。该方法采用全局平均池化层来降低计算复杂度,同时采用全连接层进行批归一化处理,以保证训练过程中的稳定性和收敛性。最后一层使用softmax对正常、垂体、胶质瘤和脑膜瘤进行分类。利用Adam优化器,建议的方法提高了性能,产生了出色的指标,如98.33%的准确率、98.35%的精度、98.28%的召回率和98.31%的f1分数。与现有的ML和DL方法相比,这些措施显示出相当大的提高,证明了该系统提高脑肿瘤检测准确性的能力。这些治疗方法的进步对及时识别脑肿瘤的医学专业人员具有重要意义。
{"title":"Advanced Attention-Based Pre-Trained Transfer Learning Model for Accurate Brain Tumor Detection and Classification from MRI Images","authors":"A. Priya,&nbsp;V. Vasudevan","doi":"10.3103/S1060992X24700863","DOIUrl":"10.3103/S1060992X24700863","url":null,"abstract":"<p>Brain tumor identification using MRI images involves the detailed examination of brain tissues to detect and characterize tumors. Conventional ML and DL algorithms sometimes encounter difficulties due to a lack of labelled data, resulting in inferior performance and poor generalization. To address these issues, this study introduces an Advanced Attention-based Pre-trained Transfer Learning (TL) model that enhances accuracy and resilience in identifying and categorizing brain tumors using MRI images. The methodology starts with pre-processing, which includes image scaling and noise reduction with an adaptive median filter. After pre-processing, the images are fed into a CNN-based framework called Pre-trained Attention-fused Image SpectraNet. This framework comprises of five convolutional layers, after which Rectified Linear Unit (ReLU) activation and pooling layers are added to learn progressively more complex features. A novel self-attention layer is implemented to capture deep features that reveal aberrant tissue patterns, hence increasing model interpretability and accuracy. A globally average pooling layer is employed to reduce computational complexity, and it is accompanied by a fully connected layer with batch normalization to assure stability and convergence during training. The last layer uses softmax to categorize normal, pituitary, glioma, and meningioma. Utilizing the Adam optimizer, the suggested approach enhances performance, yielding excellent metrics such as 98.33% accuracy, 98.35% precision, 98.28% recall, and a 98.31% F1-score. These measures show considerable increases over existing ML and DL methods, demonstrating the system’s ability to improve brain tumor detection accuracy. The advancement of these treatments has significant implications for medical professionals who specialize in the timely identification of brain tumors.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"33 4","pages":"477 - 491"},"PeriodicalIF":1.0,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143108288","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IFDRF: Advancing Anomaly Detection with a Hybrid Machine Learning Model IFDRF:利用混合机器学习模型推进异常检测
IF 1 Q4 OPTICS Pub Date : 2025-02-03 DOI: 10.3103/S1060992X24700474
Hariharan Ramesh, Faridoddin Shariaty, Sanjiban Sekhar Roy

Anomaly detection is the identification of aberrations in the dataset using statistical methods or machine learning algorithms. It is widely performed using unsupervised learning algorithms because labelling the data manually can be expensive. While unsupervised anomaly detection is sufficient for data cleaning, this is not the case in real-world applications, where accuracy is of the utmost importance. For example, it would be unacceptable to misdiagnose someone as not having breast cancer and not provide them with treatment because our model failed to recognize it as an anomaly. In this paper, we propose an optimized model—IFDRF (Isolation Forest, DBSCAN, and Random Forest) that has incorporated feedback (corrections) into the unsupervised detection model. IFDRF is a novel hybrid model combining an unsupervised learning model at the first layer followed by a clustering model at the second layer and a supervised learning model at the end. The proposed model tunes the unsupervised learning model followed by a model fitting with the help of the feedback mechanism. It obviates the need to label the entire dataset and thus increases the scope of anomaly detection applications. We have compared our proposed model to the existing state-of-the-art anomaly detection baseline models to show its efficacy. The proposed model performed significantly ((P{text{-value}} < 2.2 times {{10}^{{ - 16}}})) better than the other algorithms, with an AUC score of 0.875.

异常检测是使用统计方法或机器学习算法识别数据集中的异常。它广泛使用无监督学习算法来执行,因为手动标记数据可能会很昂贵。虽然无监督的异常检测对于数据清理来说已经足够了,但在真实的应用程序中并非如此,因为准确性是最重要的。例如,由于我们的模型未能将其识别为异常,因此误诊某人没有患乳腺癌而不为其提供治疗是不可接受的。在本文中,我们提出了一个优化模型- ifdrf(隔离森林,DBSCAN和随机森林),它将反馈(修正)纳入无监督检测模型。IFDRF是一种新颖的混合模型,第一层是无监督学习模型,第二层是聚类模型,最后是监督学习模型。该模型首先调整无监督学习模型,然后利用反馈机制进行模型拟合。它避免了标记整个数据集的需要,从而增加了异常检测应用的范围。我们将我们提出的模型与现有的最先进的异常检测基线模型进行了比较,以显示其有效性。该模型((P{text{-value}} < 2.2 times {{10}^{{ - 16}}}))显著优于其他算法,AUC得分为0.875。
{"title":"IFDRF: Advancing Anomaly Detection with a Hybrid Machine Learning Model","authors":"Hariharan Ramesh,&nbsp;Faridoddin Shariaty,&nbsp;Sanjiban Sekhar Roy","doi":"10.3103/S1060992X24700474","DOIUrl":"10.3103/S1060992X24700474","url":null,"abstract":"<p>Anomaly detection is the identification of aberrations in the dataset using statistical methods or machine learning algorithms. It is widely performed using unsupervised learning algorithms because labelling the data manually can be expensive. While unsupervised anomaly detection is sufficient for data cleaning, this is not the case in real-world applications, where accuracy is of the utmost importance. For example, it would be unacceptable to misdiagnose someone as not having breast cancer and not provide them with treatment because our model failed to recognize it as an anomaly. In this paper, we propose an optimized model—IFDRF (Isolation Forest, DBSCAN, and Random Forest) that has incorporated feedback (corrections) into the unsupervised detection model. IFDRF is a novel hybrid model combining an unsupervised learning model at the first layer followed by a clustering model at the second layer and a supervised learning model at the end. The proposed model tunes the unsupervised learning model followed by a model fitting with the help of the feedback mechanism. It obviates the need to label the entire dataset and thus increases the scope of anomaly detection applications. We have compared our proposed model to the existing state-of-the-art anomaly detection baseline models to show its efficacy. The proposed model performed significantly (<span>(P{text{-value}} &lt; 2.2 times {{10}^{{ - 16}}})</span>) better than the other algorithms, with an AUC score of 0.875.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"33 4","pages":"385 - 400"},"PeriodicalIF":1.0,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143108094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Tracking and Computation of Characteristics of the Movement of People in Groups on Video Using Convolutional Neural Networks 基于卷积神经网络的视频人群运动特征跟踪与计算
IF 1 Q4 OPTICS Pub Date : 2025-02-03 DOI: 10.3103/S1060992X24700802
Huafeng Chen, A. Krytsky, Shiping Ye, Rykhard Bohush, S. Ablameyko

This paper proposes an approach for tracking the behavior of people in a group on video by using convolutional neural networks. At the beginning, definitions of group movement of people are given, and features for accompaniment are defined that can be used to analyze people’s behavior. Next, an algorithm is proposed for calculating the distance between people in video, which includes three stages: detection and tracking of objects, coordinate transformation, calculation of the distance between people and detection of distance violations. The results of experimental studies and comparison with known algorithms are presented, which confirms the effectiveness of the algorithm.

本文提出了一种基于卷积神经网络的视频群体行为跟踪方法。首先给出了人的群体运动的定义,并定义了陪伴的特征,这些特征可以用来分析人的行为。接下来,提出了一种视频中人与人之间距离的计算算法,该算法包括三个阶段:物体的检测与跟踪、坐标变换、人与人之间距离的计算和距离违规的检测。给出了实验研究结果,并与已知算法进行了比较,验证了算法的有效性。
{"title":"Tracking and Computation of Characteristics of the Movement of People in Groups on Video Using Convolutional Neural Networks","authors":"Huafeng Chen,&nbsp;A. Krytsky,&nbsp;Shiping Ye,&nbsp;Rykhard Bohush,&nbsp;S. Ablameyko","doi":"10.3103/S1060992X24700802","DOIUrl":"10.3103/S1060992X24700802","url":null,"abstract":"<p>This paper proposes an approach for tracking the behavior of people in a group on video by using convolutional neural networks. At the beginning, definitions of group movement of people are given, and features for accompaniment are defined that can be used to analyze people’s behavior. Next, an algorithm is proposed for calculating the distance between people in video, which includes three stages: detection and tracking of objects, coordinate transformation, calculation of the distance between people and detection of distance violations. The results of experimental studies and comparison with known algorithms are presented, which confirms the effectiveness of the algorithm.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"33 4","pages":"373 - 384"},"PeriodicalIF":1.0,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143108095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Network Model for Cardiac Image Segmentation Using MRI Images 基于MRI图像分割心脏图像的混合网络模型
IF 1 Q4 OPTICS Pub Date : 2025-02-03 DOI: 10.3103/S1060992X24700498
A. Rasmi

Cardiac magnetic resonance imaging (MRI) commonly yields numerous images per scan, and manually delineating structures from these images is a laborious and time-intensive task. The automation of this process is highly desirable as it would enable the generation of crucial clinical measurements like ejection fraction and stroke volume. However, due to variations in scanning settings and patient characteristics, automated segmentation faces several challenges that lead to a high degree of variability in picture statistics and quality. Our study presents a neural network approach that utilizes the UNet and ResNet-50 architectures to efficiently partition the left and right ventricles' endocardial and epicardial boundaries. The Dice metric is used as the loss function in our strategy to maximize the trainable parameters in the network. Additionally, in the neural network’s predicted binary picture, we employed a preprocessing step to save just the segmentation labels' most connected component. Using datasets from the Multi-Vendor & Multi-Disease Cardiac Image Segmentation Challenge, the suggested method was learned. The test set of 160 that had been reserved for testing was used by the challenge organizers to evaluate the approach.

心脏磁共振成像(MRI)通常每次扫描产生大量图像,从这些图像中手动描绘结构是一项费力且耗时的任务。这一过程的自动化是非常可取的,因为它可以产生关键的临床测量,如射血分数和中风体积。然而,由于扫描设置和患者特征的变化,自动分割面临着一些挑战,导致图像统计和质量的高度变化。我们的研究提出了一种神经网络方法,利用UNet和ResNet-50架构有效地划分左心室和右心室的心内膜和心外膜边界。在我们的策略中,Dice度量被用作损失函数,以最大化网络中的可训练参数。此外,在神经网络预测的二值图像中,我们采用预处理步骤只保存分割标签中连接最紧密的部分。使用来自Multi-Vendor &;多疾病心脏图像分割挑战,学习了建议的方法。为测试保留的160个测试集被挑战组织者用来评估这种方法。
{"title":"Hybrid Network Model for Cardiac Image Segmentation Using MRI Images","authors":"A. Rasmi","doi":"10.3103/S1060992X24700498","DOIUrl":"10.3103/S1060992X24700498","url":null,"abstract":"<p>Cardiac magnetic resonance imaging (MRI) commonly yields numerous images per scan, and manually delineating structures from these images is a laborious and time-intensive task. The automation of this process is highly desirable as it would enable the generation of crucial clinical measurements like ejection fraction and stroke volume. However, due to variations in scanning settings and patient characteristics, automated segmentation faces several challenges that lead to a high degree of variability in picture statistics and quality. Our study presents a neural network approach that utilizes the UNet and ResNet-50 architectures to efficiently partition the left and right ventricles' endocardial and epicardial boundaries. The Dice metric is used as the loss function in our strategy to maximize the trainable parameters in the network. Additionally, in the neural network’s predicted binary picture, we employed a preprocessing step to save just the segmentation labels' most connected component. Using datasets from the Multi-Vendor &amp; Multi-Disease Cardiac Image Segmentation Challenge, the suggested method was learned. The test set of 160 that had been reserved for testing was used by the challenge organizers to evaluate the approach.</p>","PeriodicalId":721,"journal":{"name":"Optical Memory and Neural Networks","volume":"33 4","pages":"447 - 454"},"PeriodicalIF":1.0,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143108112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Optical Memory and Neural Networks
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1