首页 > 最新文献

Neural Computing and Applications最新文献

英文 中文
Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models
Pub Date : 2024-09-18 DOI: 10.1007/s00521-024-10139-9
Fabio Merizzi, Andrea Asperti, Stefano Colamonaco

The Copernicus Regional Reanalysis for Europe, CERRA, is a high-resolution regional reanalysis dataset for the European domain. In recent years, it has shown significant utility across various climate-related tasks, ranging from forecasting and climate change research to renewable energy prediction, resource management, air quality risk assessment, and the forecasting of rare events, among others. Unfortunately, the availability of CERRA is lagging 2 years behind the current date, due to constraints in acquiring the requisite external data and the intensive computational demands inherent in its generation. As a solution, this paper introduces a novel method using diffusion models to approximate CERRA downscaling in a data-driven manner, without additional informations. By leveraging the lower resolution ERA5 dataset, which provides boundary conditions for CERRA, we approach this as a super-resolution task. Focusing on wind speed around Italy, our model, trained on existing CERRA data, shows promising results, closely mirroring the original CERRA. Validation with in-situ observations further confirms the model’s accuracy in approximating ground measurements.

{"title":"Wind speed super-resolution and validation: from ERA5 to CERRA via diffusion models","authors":"Fabio Merizzi, Andrea Asperti, Stefano Colamonaco","doi":"10.1007/s00521-024-10139-9","DOIUrl":"https://doi.org/10.1007/s00521-024-10139-9","url":null,"abstract":"<p>The Copernicus Regional Reanalysis for Europe, CERRA, is a high-resolution regional reanalysis dataset for the European domain. In recent years, it has shown significant utility across various climate-related tasks, ranging from forecasting and climate change research to renewable energy prediction, resource management, air quality risk assessment, and the forecasting of rare events, among others. Unfortunately, the availability of CERRA is lagging 2 years behind the current date, due to constraints in acquiring the requisite external data and the intensive computational demands inherent in its generation. As a solution, this paper introduces a novel method using diffusion models to approximate CERRA downscaling in a data-driven manner, without additional informations. By leveraging the lower resolution ERA5 dataset, which provides boundary conditions for CERRA, we approach this as a super-resolution task. Focusing on wind speed around Italy, our model, trained on existing CERRA data, shows promising results, closely mirroring the original CERRA. Validation with in-situ observations further confirms the model’s accuracy in approximating ground measurements.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fine-tuning adaptive stochastic optimizers: determining the optimal hyperparameter $$epsilon$$ via gradient magnitude histogram analysis
Pub Date : 2024-09-18 DOI: 10.1007/s00521-024-10302-2
Gustavo Silva, Paul Rodriguez

Stochastic optimizers play a crucial role in the successful training of deep neural network models. To achieve optimal model performance, designers must carefully select both model and optimizer hyperparameters. However, this process is frequently demanding in terms of computational resources and processing time. While it is a well-established practice to tune the entire set of optimizer hyperparameters for peak performance, there is still a lack of clarity regarding the individual influence of hyperparameters mislabeled as “low priority”, including the safeguard factor (epsilon) and decay rate (beta), in leading adaptive stochastic optimizers like the Adam optimizer. In this manuscript, we introduce a new framework based on the empirical probability density function of the loss’ gradient magnitude, termed as the “gradient magnitude histogram”, for a thorough analysis of adaptive stochastic optimizers and the safeguard hyperparameter (epsilon). This framework reveals and justifies valuable relationships and dependencies among hyperparameters in connection to optimal performance across diverse tasks, such as classification, language modeling and machine translation. Furthermore, we propose a novel algorithm using gradient magnitude histograms to automatically estimate a refined and accurate search space for the optimal safeguard hyperparameter (epsilon), surpassing the conventional trial-and-error methodology by establishing a worst-case search space that is two times narrower.

{"title":"Fine-tuning adaptive stochastic optimizers: determining the optimal hyperparameter $$epsilon$$ via gradient magnitude histogram analysis","authors":"Gustavo Silva, Paul Rodriguez","doi":"10.1007/s00521-024-10302-2","DOIUrl":"https://doi.org/10.1007/s00521-024-10302-2","url":null,"abstract":"<p>Stochastic optimizers play a crucial role in the successful training of deep neural network models. To achieve optimal model performance, designers must carefully select both model and optimizer hyperparameters. However, this process is frequently demanding in terms of computational resources and processing time. While it is a well-established practice to tune the entire set of optimizer hyperparameters for peak performance, there is still a lack of clarity regarding the individual influence of hyperparameters mislabeled as “low priority”, including the safeguard factor <span>(epsilon)</span> and decay rate <span>(beta)</span>, in leading adaptive stochastic optimizers like the Adam optimizer. In this manuscript, we introduce a new framework based on the empirical probability density function of the loss’ gradient magnitude, termed as the “gradient magnitude histogram”, for a thorough analysis of adaptive stochastic optimizers and the safeguard hyperparameter <span>(epsilon)</span>. This framework reveals and justifies valuable relationships and dependencies among hyperparameters in connection to optimal performance across diverse tasks, such as classification, language modeling and machine translation. Furthermore, we propose a novel algorithm using gradient magnitude histograms to automatically estimate a refined and accurate search space for the optimal safeguard hyperparameter <span>(epsilon)</span>, surpassing the conventional trial-and-error methodology by establishing a worst-case search space that is two times narrower.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Augmented electric eel foraging optimization algorithm for feature selection with high-dimensional biological and medical diagnosis
Pub Date : 2024-09-18 DOI: 10.1007/s00521-024-10288-x
Mohammed Azmi Al-Betar, Malik Sh. Braik, Elfadil A. Mohamed, Mohammed A. Awadallah, Mohamed Nasor

This paper explores the importance of the electric eel foraging optimization (EEFO) algorithm in addressing feature selection (FS) problems, with the aim of ameliorating the practical benefit of FS in real-world applications. The use of EEFO to solve FS problems props our goal of providing clean and useful datasets that provide robust effectiveness for use in classification and clustering tasks. High-dimensional feature selection problems (HFSPs) are more common nowadays yet intricate where they contain a large number of features. Hence, the vast number of features in them should be carefully selected in order to determine the optimal subset of features. As the basic EEFO algorithm experiences premature convergence, there is a need to enhance its global and local search capabilities when applied in the field of FS. In order to tackle such issues, a binary augmented EEFO (BAEEFO) algorithm was developed and proposed for HFSPs. The following strategies were integrated into the mathematical model of the original EEFO algorithm to create BAEEFO: (1) resting behavior with nonlinear coefficient; (2) weight coefficient and confidence effect in the hunting process; (3) spiral search strategy; and (4) Gaussian mutation and random perturbations when the algorithm update is stagnant. Experimental findings confirm the effectiveness of the proposed BAEEFO method on 23 HFSPs gathered from the UCI repository, recording up to a 10% accuracy increment over the basic BEEFO algorithm. In most test cases, BAEEFO outperformed its competitors in classification accuracy rates and outperformed BEEFO in 90% of the datasets used. Thereby, BAEEFO has demonstrated strong competitiveness in terms of fitness scores and classification accuracy. When compared to its competitors, BAEEFO produced superior reduction rates with the fewest number of features selected. The findings in this research underscore the critical need for FS to combat the curse of dimensionality concerns and find highly useful features in data mining applications such as classification. The use of a new meta-heuristic algorithm incorporated with efficient search strategies in solving HFSPs represents a step forward in using this algorithm to solve other practical real-world problems in a variety of domains.

{"title":"Augmented electric eel foraging optimization algorithm for feature selection with high-dimensional biological and medical diagnosis","authors":"Mohammed Azmi Al-Betar, Malik Sh. Braik, Elfadil A. Mohamed, Mohammed A. Awadallah, Mohamed Nasor","doi":"10.1007/s00521-024-10288-x","DOIUrl":"https://doi.org/10.1007/s00521-024-10288-x","url":null,"abstract":"<p>This paper explores the importance of the electric eel foraging optimization (EEFO) algorithm in addressing feature selection (FS) problems, with the aim of ameliorating the practical benefit of FS in real-world applications. The use of EEFO to solve FS problems props our goal of providing clean and useful datasets that provide robust effectiveness for use in classification and clustering tasks. High-dimensional feature selection problems (HFSPs) are more common nowadays yet intricate where they contain a large number of features. Hence, the vast number of features in them should be carefully selected in order to determine the optimal subset of features. As the basic EEFO algorithm experiences premature convergence, there is a need to enhance its global and local search capabilities when applied in the field of FS. In order to tackle such issues, a binary augmented EEFO (BAEEFO) algorithm was developed and proposed for HFSPs. The following strategies were integrated into the mathematical model of the original EEFO algorithm to create BAEEFO: (1) resting behavior with nonlinear coefficient; (2) weight coefficient and confidence effect in the hunting process; (3) spiral search strategy; and (4) Gaussian mutation and random perturbations when the algorithm update is stagnant. Experimental findings confirm the effectiveness of the proposed BAEEFO method on 23 HFSPs gathered from the UCI repository, recording up to a 10% accuracy increment over the basic BEEFO algorithm. In most test cases, BAEEFO outperformed its competitors in classification accuracy rates and outperformed BEEFO in 90% of the datasets used. Thereby, BAEEFO has demonstrated strong competitiveness in terms of fitness scores and classification accuracy. When compared to its competitors, BAEEFO produced superior reduction rates with the fewest number of features selected. The findings in this research underscore the critical need for FS to combat the curse of dimensionality concerns and find highly useful features in data mining applications such as classification. The use of a new meta-heuristic algorithm incorporated with efficient search strategies in solving HFSPs represents a step forward in using this algorithm to solve other practical real-world problems in a variety of domains.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PLD-Det: plant leaf disease detection in real time using an end-to-end neural network approach based on improved YOLOv7
Pub Date : 2024-09-17 DOI: 10.1007/s00521-024-10409-6
Md Humaion Kabir Mehedi, Nafisa Nawer, Shafi Ahmed, Md Shakiful Islam Khan, Khan Md Hasib, M. F. Mridha, Md. Golam Rabiul Alam, Thanh Thi Nguyen

In order to maintain sustainable agriculture, it is vital to monitor plant health. Since all species of plants are prone to characteristic diseases, it necessitates regular surveillance to search for any symptoms, which is utterly challenging and time-consuming. Besides, farmers may struggle to identify the type of plant disease and its potential symptoms. Hence, the interest in research like image-based computer-aided automated plant leaf disease detection by analyzing the early symptoms has increased enormously. However, limitations in the plant leaf image database, for instance, unfitting backgrounds, blurry images, and so on, sometimes cause underprivileged feature extraction, misclassification, and overfitting issues in existing models. As a result, we have proposed a real-time plant leaf disease detection architecture incorporating proposed PLD-Det model, which is based on improved YOLOv7 with the intention of assisting farmers while reducing the issues in existing models. The architecture has been trained on the widely used PlantVillage dataset, which resulted in an accuracy of 98.53%. Furthermore, SHapley Additive exPlanations (SHAP) values have been analyzed as a unified measure of feature significance. According to the experimental findings, the proposed PLD-Det model, which is an improved YOLOv7 architecture, outperformed the original YOLOv7 model in test accuracy by approximately 4%.

{"title":"PLD-Det: plant leaf disease detection in real time using an end-to-end neural network approach based on improved YOLOv7","authors":"Md Humaion Kabir Mehedi, Nafisa Nawer, Shafi Ahmed, Md Shakiful Islam Khan, Khan Md Hasib, M. F. Mridha, Md. Golam Rabiul Alam, Thanh Thi Nguyen","doi":"10.1007/s00521-024-10409-6","DOIUrl":"https://doi.org/10.1007/s00521-024-10409-6","url":null,"abstract":"<p>In order to maintain sustainable agriculture, it is vital to monitor plant health. Since all species of plants are prone to characteristic diseases, it necessitates regular surveillance to search for any symptoms, which is utterly challenging and time-consuming. Besides, farmers may struggle to identify the type of plant disease and its potential symptoms. Hence, the interest in research like image-based computer-aided automated plant leaf disease detection by analyzing the early symptoms has increased enormously. However, limitations in the plant leaf image database, for instance, unfitting backgrounds, blurry images, and so on, sometimes cause underprivileged feature extraction, misclassification, and overfitting issues in existing models. As a result, we have proposed a real-time plant leaf disease detection architecture incorporating proposed PLD-Det model, which is based on improved YOLOv7 with the intention of assisting farmers while reducing the issues in existing models. The architecture has been trained on the widely used PlantVillage dataset, which resulted in an accuracy of 98.53%. Furthermore, SHapley Additive exPlanations (SHAP) values have been analyzed as a unified measure of feature significance. According to the experimental findings, the proposed PLD-Det model, which is an improved YOLOv7 architecture, outperformed the original YOLOv7 model in test accuracy by approximately 4%.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268763","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A systematic review of vision transformers and convolutional neural networks for Alzheimer’s disease classification using 3D MRI images
Pub Date : 2024-09-17 DOI: 10.1007/s00521-024-10420-x
Mario Alejandro Bravo-Ortiz, Sergio Alejandro Holguin-Garcia, Sebastián Quiñones-Arredondo, Alejandro Mora-Rubio, Ernesto Guevara-Navarro, Harold Brayan Arteaga-Arteaga, Gonzalo A. Ruz, Reinel Tabares-Soto

Alzheimer’s disease (AD) is a progressive neurodegenerative disorder that mainly affects memory and other cognitive functions, such as thinking, reasoning, and the ability to carry out daily activities. It is considered the most common form of dementia in older adults, but it can appear as early as the age of 25. Although the disease has no cure, treatment can be more effective if diagnosed early. In diagnosing AD, changes in the brain’s morphology are identified macroscopically, which is why deep learning models, such as convolutional neural networks (CNN) or vision transformers (ViT), excel in this task. We followed the Systematic Literature Review process, applying stages of the review protocol from it, which aims to detect the need for a review. Then, search equations were formulated and executed in several literature databases. Relevant publications were scanned and used to extract evidence to answer research questions. Several CNN and ViT approaches have already been tested on problems related to brain image analysis for disease detection. A total of 722 articles were found in the selected databases. Still, a series of filters were performed to decrease the number to 44 articles, focusing specifically on brain image analysis with CNN and ViT methods. Deep learning methods are effective for disease diagnosis, and the surge in research activity underscores its importance. However, the lack of access to repositories may introduce bias into the information. Full access demonstrates transparency and facilitates collaborative work in research.

{"title":"A systematic review of vision transformers and convolutional neural networks for Alzheimer’s disease classification using 3D MRI images","authors":"Mario Alejandro Bravo-Ortiz, Sergio Alejandro Holguin-Garcia, Sebastián Quiñones-Arredondo, Alejandro Mora-Rubio, Ernesto Guevara-Navarro, Harold Brayan Arteaga-Arteaga, Gonzalo A. Ruz, Reinel Tabares-Soto","doi":"10.1007/s00521-024-10420-x","DOIUrl":"https://doi.org/10.1007/s00521-024-10420-x","url":null,"abstract":"<p>Alzheimer’s disease (AD) is a progressive neurodegenerative disorder that mainly affects memory and other cognitive functions, such as thinking, reasoning, and the ability to carry out daily activities. It is considered the most common form of dementia in older adults, but it can appear as early as the age of 25. Although the disease has no cure, treatment can be more effective if diagnosed early. In diagnosing AD, changes in the brain’s morphology are identified macroscopically, which is why deep learning models, such as convolutional neural networks (CNN) or vision transformers (ViT), excel in this task. We followed the Systematic Literature Review process, applying stages of the review protocol from it, which aims to detect the need for a review. Then, search equations were formulated and executed in several literature databases. Relevant publications were scanned and used to extract evidence to answer research questions. Several CNN and ViT approaches have already been tested on problems related to brain image analysis for disease detection. A total of 722 articles were found in the selected databases. Still, a series of filters were performed to decrease the number to 44 articles, focusing specifically on brain image analysis with CNN and ViT methods. Deep learning methods are effective for disease diagnosis, and the surge in research activity underscores its importance. However, the lack of access to repositories may introduce bias into the information. Full access demonstrates transparency and facilitates collaborative work in research.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep convolutional neural networks for age and gender estimation using an imbalanced dataset of human face images
Pub Date : 2024-09-17 DOI: 10.1007/s00521-024-10390-0
İsmail Akgül

Automatic age and gender estimation provides an important information to analyze real-world applications such as human–machine interaction, system access, activity recognition, and consumer profile detection. While it is easy to estimate a person’s gender from human facial images, estimating their age is difficult. In such previous challenging studies, traditional convolutional neural network (CNN) methods have been used for age and gender estimation. With the development of deep convolutional neural network (DCNN) architectures, more successful results have been obtained than traditional CNN methods. In this study, two state-of-the-art DCNN models have been developed in the field of artificial intelligence (AI) to make age and gender estimation on an imbalanced dataset of human face images. Firstly, a new model called fast description network (FINet) was developed, which has a parametrically changeable structure. Secondly, the number of parameters has been reduced by using the layer reduction approach in InceptionV3 and NASNetLarge DCNN model structures, and a second model named inception Nasnet fast identify network (INFINet) was developed by concatenating these models and the FINet model as a triple. FINet and INFINet models developed for age and gender estimation were compared with many other state-of-the-art DCNN models in AI. The most successful accuracy results in terms of both age and gender were obtained with the INFINet model (age: 61.22%, gender: 80.95% in the FG-NET dataset, age: 72.00%, gender: 90.50% in the UTKFace dataset). The results obtained in age and gender estimation with the INFINet model are much more effective than other recent state-of-the-art works. In addition, the FINet model, which has a much smaller number of parameters than the compared models, showed a classification performance that can compete with state-of-the-art methods for age and gender estimation.

{"title":"Deep convolutional neural networks for age and gender estimation using an imbalanced dataset of human face images","authors":"İsmail Akgül","doi":"10.1007/s00521-024-10390-0","DOIUrl":"https://doi.org/10.1007/s00521-024-10390-0","url":null,"abstract":"<p>Automatic age and gender estimation provides an important information to analyze real-world applications such as human–machine interaction, system access, activity recognition, and consumer profile detection. While it is easy to estimate a person’s gender from human facial images, estimating their age is difficult. In such previous challenging studies, traditional convolutional neural network (CNN) methods have been used for age and gender estimation. With the development of deep convolutional neural network (DCNN) architectures, more successful results have been obtained than traditional CNN methods. In this study, two state-of-the-art DCNN models have been developed in the field of artificial intelligence (AI) to make age and gender estimation on an imbalanced dataset of human face images. Firstly, a new model called fast description network (FINet) was developed, which has a parametrically changeable structure. Secondly, the number of parameters has been reduced by using the layer reduction approach in InceptionV3 and NASNetLarge DCNN model structures, and a second model named inception Nasnet fast identify network (INFINet) was developed by concatenating these models and the FINet model as a triple. FINet and INFINet models developed for age and gender estimation were compared with many other state-of-the-art DCNN models in AI. The most successful accuracy results in terms of both age and gender were obtained with the INFINet model (age: 61.22%, gender: 80.95% in the FG-NET dataset, age: 72.00%, gender: 90.50% in the UTKFace dataset). The results obtained in age and gender estimation with the INFINet model are much more effective than other recent state-of-the-art works. In addition, the FINet model, which has a much smaller number of parameters than the compared models, showed a classification performance that can compete with state-of-the-art methods for age and gender estimation.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging graph-based learning for credit card fraud detection: a comparative study of classical, deep learning and graph-based approaches
Pub Date : 2024-09-17 DOI: 10.1007/s00521-024-10397-7
Sunisha Harish, Chirag Lakhanpal, Amir Hossein Jafari

Credit card fraud results in staggering financial losses amounting to billions of dollars annually, impacting both merchants and consumers. In light of the escalating prevalence of digital crime and online fraud, it is important for organizations to implement robust and advanced technology to efficiently detect fraud and mitigate the issue. Contemporary solutions heavily rely on classical machine learning (ML) and deep learning (DL) methods to handle such tasks. While these methods have been effective in many aspects of fraud detection, they may not always be sufficient for credit card fraud detection as they aren’t adaptable to detect complex relationships when it comes to transactions. Fraudsters, for example, might set up many coordinated accounts to avoid triggering limitations on individual accounts. In the context of fraud detection, the ability of Graph Neural Networks (GNN’s) to aggregate information contained within the local neighbourhood of a transaction enables them to identify larger patterns that may be missed by just looking at a single transaction. In this research, we conduct a thorough analysis to evaluate the effectiveness of GNNs in improving fraud detection over classical ML and DL methods. We first build an heterogeneous graph architecture with the source, transaction, and destination as our nodes. Next, we leverage Relational Graph Convolutional Network (RGCN) to learn the representations of nodes in our graph and perform node classification on the transaction node. Our experimental results demonstrate that GNN’s outperform classical ML and DL methods.

{"title":"Leveraging graph-based learning for credit card fraud detection: a comparative study of classical, deep learning and graph-based approaches","authors":"Sunisha Harish, Chirag Lakhanpal, Amir Hossein Jafari","doi":"10.1007/s00521-024-10397-7","DOIUrl":"https://doi.org/10.1007/s00521-024-10397-7","url":null,"abstract":"<p>Credit card fraud results in staggering financial losses amounting to billions of dollars annually, impacting both merchants and consumers. In light of the escalating prevalence of digital crime and online fraud, it is important for organizations to implement robust and advanced technology to efficiently detect fraud and mitigate the issue. Contemporary solutions heavily rely on classical machine learning (ML) and deep learning (DL) methods to handle such tasks. While these methods have been effective in many aspects of fraud detection, they may not always be sufficient for credit card fraud detection as they aren’t adaptable to detect complex relationships when it comes to transactions. Fraudsters, for example, might set up many coordinated accounts to avoid triggering limitations on individual accounts. In the context of fraud detection, the ability of Graph Neural Networks (GNN’s) to aggregate information contained within the local neighbourhood of a transaction enables them to identify larger patterns that may be missed by just looking at a single transaction. In this research, we conduct a thorough analysis to evaluate the effectiveness of GNNs in improving fraud detection over classical ML and DL methods. We first build an heterogeneous graph architecture with the source, transaction, and destination as our nodes. Next, we leverage Relational Graph Convolutional Network (RGCN) to learn the representations of nodes in our graph and perform node classification on the transaction node. Our experimental results demonstrate that GNN’s outperform classical ML and DL methods.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A knowledge-enhanced interest segment division attention network for click-through rate prediction
Pub Date : 2024-09-17 DOI: 10.1007/s00521-024-10330-y
Zhanghui Liu, Shijie Chen, Yuzhong Chen, Jieyang Su, Jiayuan Zhong, Chen Dong

Click-through rate (CTR) prediction aims to estimate the probability of a user clicking on a particular item, making it one of the core tasks in various recommendation platforms. In such systems, user behavior data are crucial for capturing user interests, which has garnered significant attention from both academia and industry, leading to the development of various user behavior modeling methods. However, existing models still face unresolved issues, as they fail to capture the complex diversity of user interests at the semantic level, refine user interests effectively, and uncover users’ potential interests. To address these challenges, we propose a novel model called knowledge-enhanced Interest segment division attention network (KISDAN), which can effectively and comprehensively model user interests. Specifically, to leverage the semantic information within user behavior sequences, we employ the structure of a knowledge graph to divide user behavior sequence into multiple interest segments. To provide a comprehensive representation of user interests, we further categorize user interests into strong and weak interests. By leveraging both the knowledge graph and the item co-occurrence graph, we explore users’ potential interests from two perspectives. This methodology allows KISDAN to better understand the diversity of user interests. Finally, we extensively evaluate KISDAN on three benchmark datasets, and the experimental results consistently demonstrate that the KISDAN model outperforms state-of-the-art models across various evaluation metrics, which validates the effectiveness and superiority of KISDAN.

点击率(CTR)预测旨在估算用户点击特定项目的概率,是各种推荐平台的核心任务之一。在此类系统中,用户行为数据对于捕捉用户兴趣至关重要,这引起了学术界和工业界的极大关注,并导致了各种用户行为建模方法的发展。然而,现有模型仍面临着一些尚未解决的问题,如无法在语义层面捕捉用户兴趣的复杂多样性、无法有效提炼用户兴趣以及挖掘用户的潜在兴趣。为了应对这些挑战,我们提出了一种名为 "知识增强兴趣段划分注意力网络(KISDAN)"的新型模型,它可以有效、全面地建立用户兴趣模型。具体来说,为了充分利用用户行为序列中的语义信息,我们采用知识图谱的结构将用户行为序列划分为多个兴趣段。为了全面呈现用户兴趣,我们进一步将用户兴趣分为强兴趣和弱兴趣。通过利用知识图谱和项目共现图谱,我们从两个角度探索用户的潜在兴趣。这种方法使 KISDAN 能够更好地理解用户兴趣的多样性。最后,我们在三个基准数据集上对 KISDAN 进行了广泛评估,实验结果一致表明,KISDAN 模型在各种评估指标上都优于最先进的模型,从而验证了 KISDAN 的有效性和优越性。
{"title":"A knowledge-enhanced interest segment division attention network for click-through rate prediction","authors":"Zhanghui Liu, Shijie Chen, Yuzhong Chen, Jieyang Su, Jiayuan Zhong, Chen Dong","doi":"10.1007/s00521-024-10330-y","DOIUrl":"https://doi.org/10.1007/s00521-024-10330-y","url":null,"abstract":"<p>Click-through rate (CTR) prediction aims to estimate the probability of a user clicking on a particular item, making it one of the core tasks in various recommendation platforms. In such systems, user behavior data are crucial for capturing user interests, which has garnered significant attention from both academia and industry, leading to the development of various user behavior modeling methods. However, existing models still face unresolved issues, as they fail to capture the complex diversity of user interests at the semantic level, refine user interests effectively, and uncover users’ potential interests. To address these challenges, we propose a novel model called knowledge-enhanced Interest segment division attention network (KISDAN), which can effectively and comprehensively model user interests. Specifically, to leverage the semantic information within user behavior sequences, we employ the structure of a knowledge graph to divide user behavior sequence into multiple interest segments. To provide a comprehensive representation of user interests, we further categorize user interests into strong and weak interests. By leveraging both the knowledge graph and the item co-occurrence graph, we explore users’ potential interests from two perspectives. This methodology allows KISDAN to better understand the diversity of user interests. Finally, we extensively evaluate KISDAN on three benchmark datasets, and the experimental results consistently demonstrate that the KISDAN model outperforms state-of-the-art models across various evaluation metrics, which validates the effectiveness and superiority of KISDAN.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local part attention for image stylization with text prompt
Pub Date : 2024-09-17 DOI: 10.1007/s00521-024-10394-w
Quoc-Truong Truong, Vinh-Tiep Nguyen, Lan-Phuong Nguyen, Hung-Phu Cao, Duc-Tuan Luu

Prompt-based portrait image style transfer aims at translating an input content image to a desired style described by text without a style image. In many practical situations, users may not only attend to the entire portrait image but also the local parts (e.g., eyes, lips, and hair). To address such applications, we propose a new framework that enables style transfer on specific regions described by a text description of the desired style. Specifically, we incorporate semantic segmentation to identify the intended area without requiring edit masks from the user while utilizing a pre-trained CLIP-based model for stylizing. Besides, we propose a text-to-patch matching loss by randomly dividing the stylized image into smaller patches to ensure the consistent quality of the result. To comprehensively evaluate the proposed method, we use several metrics, such as FID, SSIM, and PSNR on a dataset consisting of portraits from the CelebAMask-HQ dataset and style descriptions of other related works. Extensive experimental results demonstrate that our framework outperforms other state-of-the-art methods in terms of both stylization quality and inference time.

{"title":"Local part attention for image stylization with text prompt","authors":"Quoc-Truong Truong, Vinh-Tiep Nguyen, Lan-Phuong Nguyen, Hung-Phu Cao, Duc-Tuan Luu","doi":"10.1007/s00521-024-10394-w","DOIUrl":"https://doi.org/10.1007/s00521-024-10394-w","url":null,"abstract":"<p>Prompt-based portrait image style transfer aims at translating an input content image to a desired style described by text without a style image. In many practical situations, users may not only attend to the entire portrait image but also the local parts (e.g., eyes, lips, and hair). To address such applications, we propose a new framework that enables style transfer on specific regions described by a text description of the desired style. Specifically, we incorporate semantic segmentation to identify the intended area without requiring edit masks from the user while utilizing a pre-trained CLIP-based model for stylizing. Besides, we propose a text-to-patch matching loss by randomly dividing the stylized image into smaller patches to ensure the consistent quality of the result. To comprehensively evaluate the proposed method, we use several metrics, such as FID, SSIM, and PSNR on a dataset consisting of portraits from the CelebAMask-HQ dataset and style descriptions of other related works. Extensive experimental results demonstrate that our framework outperforms other state-of-the-art methods in terms of both stylization quality and inference time.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142251110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of machine learning techniques for diagnosing Alzheimer’s disease using imaging modalities 利用成像模式诊断阿尔茨海默病的机器学习技术综述
Pub Date : 2024-09-17 DOI: 10.1007/s00521-024-10399-5
Nand Kishore, Neelam Goel

Alzheimer's disease is a progressive form of dementia. Dementia is a broad term for conditions that impair memory, thinking, and behaviour. Brain traumas or disorders can cause dementia. It is estimated that 60–80% of dementia cases around the world are caused by Alzheimer’s disease, an incurable neurodegenerative disorder. Although Alzheimer's disease research has increased in recent years, early diagnosis is challenging due to the complicated brain structure and functions associated with this disease. It is difficult for doctors to identify Alzheimer's disease in its early stages as there are still no biomarkers to be precise in early detection. In the area of medical imaging, deep learning is becoming increasingly popular and successful. There is no single best approach for the detection of Alzheimer's disease. In comparison with conventional machine learning methods, the deep learning models detect Alzheimer's disease more precisely and effectively. In this review paper, various machine learning-based techniques utilized for the classification of Alzheimer's disease through different imaging modalities are discussed. In addition, a comprehensive and detailed analysis of the various image processing procedures along with corresponding classification performance and feature extraction techniques have been meticulously compiled and presented. The investigation of computer-aided image analysis has demonstrated significant potential in the early detection of cognitive changes in individuals experiencing mild cognitive impairment. Machine learning can provide valuable insights into the cognitive status of patients, enabling healthcare professionals to intervene and provide timely treatment. This review may lead to a reliable method for recognizing and predicting Alzheimer's disease.

阿尔茨海默病是一种进行性痴呆。痴呆症是对损害记忆、思维和行为的病症的统称。脑部创伤或失调可导致痴呆症。据估计,全世界 60-80% 的痴呆症病例是由阿尔茨海默病引起的,这是一种无法治愈的神经退行性疾病。虽然近年来对阿尔茨海默病的研究有所增多,但由于这种疾病的大脑结构和功能复杂,早期诊断具有挑战性。由于目前还没有生物标志物可以精确地进行早期检测,因此医生很难在阿尔茨海默病的早期阶段进行识别。在医学成像领域,深度学习正变得越来越流行和成功。目前还没有一种检测阿尔茨海默病的最佳方法。与传统的机器学习方法相比,深度学习模型能更精确、更有效地检测阿尔茨海默病。在这篇综述论文中,讨论了通过不同成像模式对阿尔茨海默病进行分类的各种基于机器学习的技术。此外,还对各种图像处理程序以及相应的分类性能和特征提取技术进行了全面细致的分析和介绍。计算机辅助图像分析研究在早期检测轻度认知障碍患者的认知变化方面显示出巨大的潜力。机器学习可以为了解患者的认知状况提供有价值的见解,使医疗专业人员能够及时干预和提供治疗。这项研究可能会开发出一种识别和预测阿尔茨海默病的可靠方法。
{"title":"A review of machine learning techniques for diagnosing Alzheimer’s disease using imaging modalities","authors":"Nand Kishore, Neelam Goel","doi":"10.1007/s00521-024-10399-5","DOIUrl":"https://doi.org/10.1007/s00521-024-10399-5","url":null,"abstract":"<p>Alzheimer's disease is a progressive form of dementia. Dementia is a broad term for conditions that impair memory, thinking, and behaviour. Brain traumas or disorders can cause dementia. It is estimated that 60–80% of dementia cases around the world are caused by Alzheimer’s disease, an incurable neurodegenerative disorder. Although Alzheimer's disease research has increased in recent years, early diagnosis is challenging due to the complicated brain structure and functions associated with this disease. It is difficult for doctors to identify Alzheimer's disease in its early stages as there are still no biomarkers to be precise in early detection. In the area of medical imaging, deep learning is becoming increasingly popular and successful. There is no single best approach for the detection of Alzheimer's disease. In comparison with conventional machine learning methods, the deep learning models detect Alzheimer's disease more precisely and effectively. In this review paper, various machine learning-based techniques utilized for the classification of Alzheimer's disease through different imaging modalities are discussed. In addition, a comprehensive and detailed analysis of the various image processing procedures along with corresponding classification performance and feature extraction techniques have been meticulously compiled and presented. The investigation of computer-aided image analysis has demonstrated significant potential in the early detection of cognitive changes in individuals experiencing mild cognitive impairment. Machine learning can provide valuable insights into the cognitive status of patients, enabling healthcare professionals to intervene and provide timely treatment. This review may lead to a reliable method for recognizing and predicting Alzheimer's disease.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142268766","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Neural Computing and Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1