Pub Date : 2025-06-21DOI: 10.1016/j.iswa.2025.200549
Mohanad Deif , Hani Attar , Mohammad Aljaidi , Ayoub Alsarhan , Dimah Al-Fraihat , Ahmed Solyman
Developing implants using biodegradable materials eliminates the need for secondary surgery, improves both mechanical and biological properties, and enhances biocompatibility. This study proposes a machine learning approach based on Bayesian optimization (BO) and an eXtreme Gradient Boosting (XGBoost) algorithm to design a biodegradable Zinc (Zn) alloy and forecast percentage of elements in the Zn alloy for bone implants. The dataset employed in this study comprised 1182 samples of Zn alloys obtained from supplementary articles from Google Scholar and the mat web database. For forecasting the mechanical parameters Yield Stress (YS), Ductility, and Ultimate Tensile Strength (UTS), the suggested method got maximum R2 values of 0.85, 0.87, and 0.81 demonstrating its exceptional predictive capacity. In addition, the model created a Zn biodegradable alloy with UTS of 363.55 Mpa, YS of 318.93 Mpa, and Ductility of 14 %, which are regarded as good mechanical characteristics meet bone implant criteria. The BO-XGBoost model can expedite the production of the proper alloy for several medical applications, saving time, money, and effort.
{"title":"Machine learning alloying design of biodegradable zinc alloy for bone implants using XGBoost and Bayesian optimization","authors":"Mohanad Deif , Hani Attar , Mohammad Aljaidi , Ayoub Alsarhan , Dimah Al-Fraihat , Ahmed Solyman","doi":"10.1016/j.iswa.2025.200549","DOIUrl":"10.1016/j.iswa.2025.200549","url":null,"abstract":"<div><div>Developing implants using biodegradable materials eliminates the need for secondary surgery, improves both mechanical and biological properties, and enhances biocompatibility. This study proposes a machine learning approach based on Bayesian optimization (BO) and an eXtreme Gradient Boosting (XGBoost) algorithm to design a biodegradable Zinc (Zn) alloy and forecast percentage of elements in the Zn alloy for bone implants. The dataset employed in this study comprised 1182 samples of Zn alloys obtained from supplementary articles from Google Scholar and the mat web database. For forecasting the mechanical parameters Yield Stress (YS), Ductility, and Ultimate Tensile Strength (UTS), the suggested method got maximum R<sup>2</sup> values of 0.85, 0.87, and 0.81 demonstrating its exceptional predictive capacity. In addition, the model created a Zn biodegradable alloy with UTS of 363.55 Mpa, YS of 318.93 Mpa, and Ductility of 14 %, which are regarded as good mechanical characteristics meet bone implant criteria. The BO-XGBoost model can expedite the production of the proper alloy for several medical applications, saving time, money, and effort.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"27 ","pages":"Article 200549"},"PeriodicalIF":0.0,"publicationDate":"2025-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144490496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Nowadays, cybersecurity is a major worldwide problem. Intrusion detection systems (IDS) help guarantee network security by detecting malicious entries from legitimate entries in network traffic data. IDS has considerable potential for detecting dynamic cyber threats, identifying abnormalities, and identifying malicious conduct within the network. In this paper, we propose Machine Learning (ML) models with an emphasis on the Synthetic Minority Over-sampling Technique (SMOTE) with Iterative Partitioning Filter (IPF) for class imbalance and the Whale Optimization Algorithm (WOA) for feature selection. Class imbalance often results in poorly constructed ML models prioritizing the majority class. In addition, the absence of feature selection can lead to higher computational complexity without impacting performance accuracy. This study uses Bagging, AdaBoost, Extreme Gradient Boosting (XGBoost) and Extra Trees Classifier as classification models. The two widely used datasets to assess the proposed method are NLS-KDD and UNSW-NB15. The K-Fold cross-validation technique trains this model to minimize potential overfitting. These models are evaluated based on performance metrics such as accuracy, precision, recall, and F1-score. The experimental results demonstrate that the Extra Trees Classifier significantly outperforms the baseline models and achieves accuracy values of 99.9% for the NSL-KDD dataset and 97% for the UNSW-NB 15 dataset and outperforms all evaluation measures compared to baseline models for multi-classification of the IDS.
当今,网络安全是一个重大的世界性问题。入侵检测系统(IDS)通过检测网络流量数据中合法条目中的恶意条目,保障网络安全。IDS在检测动态网络威胁、识别异常和识别网络中的恶意行为方面具有相当大的潜力。在本文中,我们提出了机器学习(ML)模型,重点是基于迭代划分过滤器(IPF)的合成少数过采样技术(SMOTE)和鲸鱼优化算法(WOA)的特征选择。类不平衡通常会导致构造不良的ML模型优先考虑大多数类。此外,缺少特征选择可能导致更高的计算复杂度,而不会影响性能准确性。本研究使用Bagging、AdaBoost、Extreme Gradient Boosting (XGBoost)和Extra Trees Classifier作为分类模型。两个广泛使用的数据集是NLS-KDD和UNSW-NB15。K-Fold交叉验证技术训练该模型以最小化潜在的过拟合。这些模型是基于诸如准确性、精度、召回率和f1分数等性能指标进行评估的。实验结果表明,Extra Trees分类器显著优于基线模型,在NSL-KDD数据集和UNSW-NB 15数据集的准确率分别达到99.9%和97%,并且在IDS多分类方面优于基线模型的所有评估指标。
{"title":"Anomaly-based intrusion detection system based on SMOTE-IPF, Whale Optimization Algorithm, and ensemble learning","authors":"Tibebu Bekele Shana , Neetu Kumari , Mayank Agarwal , Samrat Mondal , Upaka Rathnayake","doi":"10.1016/j.iswa.2025.200543","DOIUrl":"10.1016/j.iswa.2025.200543","url":null,"abstract":"<div><div>Nowadays, cybersecurity is a major worldwide problem. Intrusion detection systems (IDS) help guarantee network security by detecting malicious entries from legitimate entries in network traffic data. IDS has considerable potential for detecting dynamic cyber threats, identifying abnormalities, and identifying malicious conduct within the network. In this paper, we propose Machine Learning (ML) models with an emphasis on the Synthetic Minority Over-sampling Technique (SMOTE) with Iterative Partitioning Filter (IPF) for class imbalance and the Whale Optimization Algorithm (WOA) for feature selection. Class imbalance often results in poorly constructed ML models prioritizing the majority class. In addition, the absence of feature selection can lead to higher computational complexity without impacting performance accuracy. This study uses Bagging, AdaBoost, Extreme Gradient Boosting (XGBoost) and Extra Trees Classifier as classification models. The two widely used datasets to assess the proposed method are NLS-KDD and UNSW-NB15. The K-Fold cross-validation technique trains this model to minimize potential overfitting. These models are evaluated based on performance metrics such as accuracy, precision, recall, and F1-score. The experimental results demonstrate that the Extra Trees Classifier significantly outperforms the baseline models and achieves accuracy values of 99.9% for the NSL-KDD dataset and 97% for the UNSW-NB 15 dataset and outperforms all evaluation measures compared to baseline models for multi-classification of the IDS.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"27 ","pages":"Article 200543"},"PeriodicalIF":0.0,"publicationDate":"2025-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144338636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-14DOI: 10.1016/j.iswa.2025.200539
Guang Xv , Xingchen Wu
Long-form sports videos present unique challenges for temporally localizing relevant segments described by text queries. In this paper, a novel two-stage method is proposed for text snippet localization in long sports videos, combining efficient retrieval with fine-grained refinement. First, an improved video encoding pipeline with a caching mechanism and a video-centric sampling strategy has been designed to efficiently process long videos. Then, a self-supervised proposal generation module is designed that leverages temporal consistency to generate candidate segments (proposals) with pseudo labels, reducing reliance on exhaustive manual annotation. Our model is trained in two stages: a segment-level discrimination stage that learns to identify short video snippets relevant to a query, followed by an instance-level completeness stage that ensures the entire event described by the query is accurately captured. To effectively fuse visual and textual information, a cross-modal fusion strategy is adopted that combines late fusion for scalable coarse retrieval with targeted cross-modal attention for precise alignment. Experiments on sports video datasets demonstrate that our method outperforms state-of-the-art baselines in both accuracy and efficiency.
{"title":"Temporal event localization in sports videos via self-supervised proposal generation and cross-modal fusion","authors":"Guang Xv , Xingchen Wu","doi":"10.1016/j.iswa.2025.200539","DOIUrl":"10.1016/j.iswa.2025.200539","url":null,"abstract":"<div><div>Long-form sports videos present unique challenges for temporally localizing relevant segments described by text queries. In this paper, a novel two-stage method is proposed for text snippet localization in long sports videos, combining efficient retrieval with fine-grained refinement. First, an improved video encoding pipeline with a caching mechanism and a video-centric sampling strategy has been designed to efficiently process long videos. Then, a self-supervised proposal generation module is designed that leverages temporal consistency to generate candidate segments (proposals) with pseudo labels, reducing reliance on exhaustive manual annotation. Our model is trained in two stages: a segment-level discrimination stage that learns to identify short video snippets relevant to a query, followed by an instance-level completeness stage that ensures the entire event described by the query is accurately captured. To effectively fuse visual and textual information, a cross-modal fusion strategy is adopted that combines late fusion for scalable coarse retrieval with targeted cross-modal attention for precise alignment. Experiments on sports video datasets demonstrate that our method outperforms state-of-the-art baselines in both accuracy and efficiency.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"27 ","pages":"Article 200539"},"PeriodicalIF":0.0,"publicationDate":"2025-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144514039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-13DOI: 10.1016/j.iswa.2025.200544
Francesco Colace , Giuseppe D’Aniello , Massimo De Santo , Rosario Gaeta , Gabriel Zuchtriegel
The safeguard of cultural heritage (CH) is one of the most of interest issues for all the countries, like Italy, known for their thousand-year history. Cultural properties have to be maintained regularly and effectively so that the condition of such properties remains good at all times. Human operators have always been the ones in charge of monitoring and maintaining these properties, with domain experts capable of understanding when and how the maintenance has to be done. In our paper, we define a CH asset as a Cyber–Physical–Social System. We designed and proposed a prototype of a Situation-aware Cyber–Physical–Social System (CPSS) for Cultural Heritage, capable of supporting the human operator situation awareness. The CPSS is a Machine Learning (ML) and expert based system equipped with modules for capturing information, which are then processed with ML techniques to identify asset maintenance issues, understanding how they will evolve, and what are the priorities in the maintenance activity to be performed. We propose three case studies relating respectively to: four structures in the archaeological site of Pompeii, three in the archaeological site of Paestum, and three related to the area the archaeological site of the Colosseum, in Rome, for the safeguarding of which the system uses vulnerability indexes, calculated using prior knowledge related to these structures, maintenance issues detected from aerial photos using a YoloV7 detection model, and context space theory with weather and anthropogenic flow data. We showed how it was possible to identify critical and dangerous situations for these zones, with vulnerability indexes capable of mitigating damaged and dangerous areas to be left in that state with the advent of adverse weather phenomena, which indeed from the photos appeared damaged and flooded.
{"title":"Situation-aware Cyber–Physical–Social System for Cultural Heritage","authors":"Francesco Colace , Giuseppe D’Aniello , Massimo De Santo , Rosario Gaeta , Gabriel Zuchtriegel","doi":"10.1016/j.iswa.2025.200544","DOIUrl":"10.1016/j.iswa.2025.200544","url":null,"abstract":"<div><div>The safeguard of cultural heritage (CH) is one of the most of interest issues for all the countries, like Italy, known for their thousand-year history. Cultural properties have to be maintained regularly and effectively so that the condition of such properties remains good at all times. Human operators have always been the ones in charge of monitoring and maintaining these properties, with domain experts capable of understanding when and how the maintenance has to be done. In our paper, we define a CH asset as a Cyber–Physical–Social System. We designed and proposed a prototype of a Situation-aware Cyber–Physical–Social System (CPSS) for Cultural Heritage, capable of supporting the human operator situation awareness. The CPSS is a Machine Learning (ML) and expert based system equipped with modules for capturing information, which are then processed with ML techniques to identify asset maintenance issues, understanding how they will evolve, and what are the priorities in the maintenance activity to be performed. We propose three case studies relating respectively to: four structures in the archaeological site of Pompeii, three in the archaeological site of Paestum, and three related to the area the archaeological site of the Colosseum, in Rome, for the safeguarding of which the system uses vulnerability indexes, calculated using prior knowledge related to these structures, maintenance issues detected from aerial photos using a YoloV7 detection model, and context space theory with weather and anthropogenic flow data. We showed how it was possible to identify critical and dangerous situations for these zones, with vulnerability indexes capable of mitigating damaged and dangerous areas to be left in that state with the advent of adverse weather phenomena, which indeed from the photos appeared damaged and flooded.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"27 ","pages":"Article 200544"},"PeriodicalIF":0.0,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144298090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The rapid advancement of deep learning (DL) technologies has significantly transformed the domain of aerial image analysis. This systematic review explores the forefront of deep learning architectures specifically designed for the processing and analysis of aerial imagery. It offers a comprehensive examination of updated models, such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Generative Adversarial Networks (GANs), and Transformers, highlighting their unique contributions and comparative effectiveness in aerial image analysis. This review critically compares these architectures through an extensive literature survey, focusing on their impact on enhancing accuracy, computational efficiency, and overall performance in critical aerial imaging tasks, such as classification, object detection, and semantic segmentation. Additionally, it sheds light on the innovative architectural improvements that have been crucial in overcoming traditional challenges associated with aerial image processing, such as handling high-resolution data, managing diverse and changing landscapes, and ensuring real-time analysis capabilities. By synthesizing current findings and identifying prevailing trends, this review not only charts the progress in the field but also outlines future research directions, emphasizing the need for more adaptable, robust, and efficient deep-learning solutions to meet the growing demands of aerial image analysis.
{"title":"Advancements and challenges of deep learning architectures for aerial image analysis: A systematic review","authors":"Hashibul Ahsan Shoaib , Hadiur Rahman Nabil , Md Anisur Rahman , Md Mohsin Kabir , M.F. Mridha , Jungpil Shin","doi":"10.1016/j.iswa.2025.200537","DOIUrl":"10.1016/j.iswa.2025.200537","url":null,"abstract":"<div><div>The rapid advancement of deep learning (DL) technologies has significantly transformed the domain of aerial image analysis. This systematic review explores the forefront of deep learning architectures specifically designed for the processing and analysis of aerial imagery. It offers a comprehensive examination of updated models, such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Generative Adversarial Networks (GANs), and Transformers, highlighting their unique contributions and comparative effectiveness in aerial image analysis. This review critically compares these architectures through an extensive literature survey, focusing on their impact on enhancing accuracy, computational efficiency, and overall performance in critical aerial imaging tasks, such as classification, object detection, and semantic segmentation. Additionally, it sheds light on the innovative architectural improvements that have been crucial in overcoming traditional challenges associated with aerial image processing, such as handling high-resolution data, managing diverse and changing landscapes, and ensuring real-time analysis capabilities. By synthesizing current findings and identifying prevailing trends, this review not only charts the progress in the field but also outlines future research directions, emphasizing the need for more adaptable, robust, and efficient deep-learning solutions to meet the growing demands of aerial image analysis.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"27 ","pages":"Article 200537"},"PeriodicalIF":0.0,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144272578","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Neuro-symbolic AI represents the convergence of two principal paradigms in artificial intelligence: neural networks, which are efficient in data-driven learning, and symbolic reasoning, which offers explainability and logical inference. This hybrid methodology combines the adaptability of neural networks with symbolic AI's interpretability and formal reasoning abilities, which provide a practical framework for advanced cognitive systems. This paper analyzes the present condition of neuro-symbolic AI, emphasizing essential techniques that combine reasoning and learning. We explore models such as Logic Tensor Networks, Differentiable Logic Programs, and Neural Theorem Provers. The study analyzes their impact on the advancement of cognitive systems in natural language processing, robotics, and decision-making. The paper examines the challenges faced by neuro-symbolic AI, such as scalability, integration with multimodal data, and maintaining interpretability without compromising efficiency. By evaluating the strengths and weaknesses of many methodologies, we comprehensively understand the field's development and its potential to revolutionize intelligent systems. In addition, we identify emerging research areas, including the incorporation of ethical frameworks and the development of adaptive dynamic neuro-symbolic systems that respond in real-time. This review aims to guide future research by providing insights into the potential of neuro-symbolic AI to influence the development of the next generation of intelligent, explainable, and adaptive systems.
{"title":"A review of neuro-symbolic AI integrating reasoning and learning for advanced cognitive systems","authors":"Uzma Nawaz , Mufti Anees-ur-Rahaman , Zubair Saeed","doi":"10.1016/j.iswa.2025.200541","DOIUrl":"10.1016/j.iswa.2025.200541","url":null,"abstract":"<div><div>Neuro-symbolic AI represents the convergence of two principal paradigms in artificial intelligence: neural networks, which are efficient in data-driven learning, and symbolic reasoning, which offers explainability and logical inference. This hybrid methodology combines the adaptability of neural networks with symbolic AI's interpretability and formal reasoning abilities, which provide a practical framework for advanced cognitive systems. This paper analyzes the present condition of neuro-symbolic AI, emphasizing essential techniques that combine reasoning and learning. We explore models such as Logic Tensor Networks, Differentiable Logic Programs, and Neural Theorem Provers. The study analyzes their impact on the advancement of cognitive systems in natural language processing, robotics, and decision-making. The paper examines the challenges faced by neuro-symbolic AI, such as scalability, integration with multimodal data, and maintaining interpretability without compromising efficiency. By evaluating the strengths and weaknesses of many methodologies, we comprehensively understand the field's development and its potential to revolutionize intelligent systems. In addition, we identify emerging research areas, including the incorporation of ethical frameworks and the development of adaptive dynamic neuro-symbolic systems that respond in real-time. This review aims to guide future research by providing insights into the potential of neuro-symbolic AI to influence the development of the next generation of intelligent, explainable, and adaptive systems.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"26 ","pages":"Article 200541"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144196102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01DOI: 10.1016/j.iswa.2025.200535
Jersson X. Leon-Medina , Diego A. Tibaduiza , Núria Parés , Francesc Pozo
The industrial development, the advances in sensor technology and the processing of large amounts of data, have enabled the training and testing of artificial intelligence models that reproduce, with high accuracy, the behavior of some variables of interest. With the consolidation of the big data era and the proliferation of sensors that can acquire information directly from various components of a wind turbine (WT), a digital twin (DT) allows to close the gap between the physical and the digital worlds. It combines historical data, sensor readings, machine learning and physics-based modeling to replicate the behavior of the physical component accurately. This DT can simulate the performance and behavior of the physical object under different conditions and situations, allowing for predicting failures in WT components and determining their remaining useful life. This review describes the existing literature related to the use of DTs and their developments for WT applications and their components in onshore and offshore applications. This review explores various types of DTs and their approaches, aiming to cover different methods of data processing and concepts related to each DT framework. In addition, it identifies insights from various studies and reviews, particularly focusing on the components of WTs.
{"title":"Digital twin technology in wind turbine components: A review","authors":"Jersson X. Leon-Medina , Diego A. Tibaduiza , Núria Parés , Francesc Pozo","doi":"10.1016/j.iswa.2025.200535","DOIUrl":"10.1016/j.iswa.2025.200535","url":null,"abstract":"<div><div>The industrial development, the advances in sensor technology and the processing of large amounts of data, have enabled the training and testing of artificial intelligence models that reproduce, with high accuracy, the behavior of some variables of interest. With the consolidation of the big data era and the proliferation of sensors that can acquire information directly from various components of a wind turbine (WT), a digital twin (DT) allows to close the gap between the physical and the digital worlds. It combines historical data, sensor readings, machine learning and physics-based modeling to replicate the behavior of the physical component accurately. This DT can simulate the performance and behavior of the physical object under different conditions and situations, allowing for predicting failures in WT components and determining their remaining useful life. This review describes the existing literature related to the use of DTs and their developments for WT applications and their components in onshore and offshore applications. This review explores various types of DTs and their approaches, aiming to cover different methods of data processing and concepts related to each DT framework. In addition, it identifies insights from various studies and reviews, particularly focusing on the components of WTs.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"26 ","pages":"Article 200535"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144190364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01DOI: 10.1016/j.iswa.2025.200532
Francesco Zito , El-Ghazali Talbi , Claudia Cavallaro , Vincenzo Cutello , Mario Pavone
The present work explores the application of Automated Machine Learning techniques, particularly on the optimization of Artificial Neural Networks through hyperparameter tuning. Artificial Neural Networks are widely used across various fields, however building and optimizing them presents significant challenges. By employing an effective hyperparameter tuning, shallow neural networks might become competitive with their deeper counterparts, which in turn makes them more suitable for low-power consumption applications. In our work, we highlight the importance of Hyperparameter Optimization in enhancing neural network performance. We examine various metaheuristic algorithms employed and, in particular, their effectiveness in improving model performance across diverse applications. Despite significant advancements in this area, a comprehensive comparison of these algorithms across different deep learning architectures remains lacking. This work aims to fill this gap by systematically evaluating the performance of metaheuristic algorithms in optimizing hyperparameters and discussing advanced techniques such as parallel computing to adapt metaheuristic algorithms for use in hyperparameter optimization with high-dimensional hyperparameter search space.
{"title":"Metaheuristics in automated machine learning: Strategies for optimization","authors":"Francesco Zito , El-Ghazali Talbi , Claudia Cavallaro , Vincenzo Cutello , Mario Pavone","doi":"10.1016/j.iswa.2025.200532","DOIUrl":"10.1016/j.iswa.2025.200532","url":null,"abstract":"<div><div>The present work explores the application of Automated Machine Learning techniques, particularly on the optimization of Artificial Neural Networks through hyperparameter tuning. Artificial Neural Networks are widely used across various fields, however building and optimizing them presents significant challenges. By employing an effective hyperparameter tuning, shallow neural networks might become competitive with their deeper counterparts, which in turn makes them more suitable for low-power consumption applications. In our work, we highlight the importance of Hyperparameter Optimization in enhancing neural network performance. We examine various metaheuristic algorithms employed and, in particular, their effectiveness in improving model performance across diverse applications. Despite significant advancements in this area, a comprehensive comparison of these algorithms across different deep learning architectures remains lacking. This work aims to fill this gap by systematically evaluating the performance of metaheuristic algorithms in optimizing hyperparameters and discussing advanced techniques such as parallel computing to adapt metaheuristic algorithms for use in hyperparameter optimization with high-dimensional hyperparameter search space.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"26 ","pages":"Article 200532"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144185956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-05-23DOI: 10.1016/j.iswa.2025.200538
Chi Kien Ha, Hoanh Nguyen, Vu Duc Van
Accurate and efficient ship detection in synthetic aperture radar (SAR) imagery remains a challenging task due to speckle noise, scale variations, and the low contrast of small vessels. In this work, we present YOLO-SR, an enhanced version of YOLOv10 tailored for SAR ship detection, introducing four key innovations: Balanced Detail Fusion (BDF), C2f‐MSDR, DySample, and the Focaler-SIoU loss. Our BDF module adaptively merges shallow, fine‐grained features with deeper semantic features, preventing subtle ship signatures from being overshadowed by irrelevant clutter. Concurrently, C2f‐MSDR replaces standard bottleneck layers with multi-scale dilation residual blocks, expanding the receptive field to handle wide variations in ship size. To improve spatial resolution and retain boundary details, we incorporate DySample, a data-driven upsampling strategy that counteracts the artifacts of naive interpolation. Finally, Focaler-SIoU refines bounding-box regression by integrating distance, orientation, shape, and a focal-like reweighting, thereby emphasizing difficult, small, or partially occluded ships. Experimental results on SAR ship detection datasets confirm that YOLO-SR outperforms state-of-the-art methods in both precision and recall, while retaining competitive inference speeds. These advances offer a robust framework for real-time maritime surveillance, enhancing the detection of both small and large ships under challenging SAR conditions.
{"title":"YOLO-SR: An optimized convolutional architecture for robust ship detection in SAR Imagery","authors":"Chi Kien Ha, Hoanh Nguyen, Vu Duc Van","doi":"10.1016/j.iswa.2025.200538","DOIUrl":"10.1016/j.iswa.2025.200538","url":null,"abstract":"<div><div>Accurate and efficient ship detection in synthetic aperture radar (SAR) imagery remains a challenging task due to speckle noise, scale variations, and the low contrast of small vessels. In this work, we present YOLO-SR, an enhanced version of YOLOv10 tailored for SAR ship detection, introducing four key innovations: Balanced Detail Fusion (BDF), C2f‐MSDR, DySample, and the Focaler-SIoU loss. Our BDF module adaptively merges shallow, fine‐grained features with deeper semantic features, preventing subtle ship signatures from being overshadowed by irrelevant clutter. Concurrently, C2f‐MSDR replaces standard bottleneck layers with multi-scale dilation residual blocks, expanding the receptive field to handle wide variations in ship size. To improve spatial resolution and retain boundary details, we incorporate DySample, a data-driven upsampling strategy that counteracts the artifacts of naive interpolation. Finally, Focaler-SIoU refines bounding-box regression by integrating distance, orientation, shape, and a focal-like reweighting, thereby emphasizing difficult, small, or partially occluded ships. Experimental results on SAR ship detection datasets confirm that YOLO-SR outperforms state-of-the-art methods in both precision and recall, while retaining competitive inference speeds. These advances offer a robust framework for real-time maritime surveillance, enhancing the detection of both small and large ships under challenging SAR conditions.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"26 ","pages":"Article 200538"},"PeriodicalIF":0.0,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144169914","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-05-18DOI: 10.1016/j.iswa.2025.200533
Shiji Yang, Xuezhong Xiao
To improve the accuracy of pedestrian trajectory prediction, the graph - based pedestrian trajectory modeling method in the pedestrian trajectory prediction scenario is effective. Thus, a pedestrian trajectory prediction model based on a self - supervised spatiotemporal graph network is proposed. Firstly, in the process of spatiotemporal graph modeling, this model introduces hop interaction instead of node interaction to update node features, which greatly reduces the times of graph convolution operations, alleviates the problem of feature smoothing, and greatly improves the accuracy of prediction. Secondly, a unique self-supervised module is added to the model to mine commonalities between pedestrian’s multi-trajectories through self-supervised to further improve the accuracy of prediction. The experiment uses ETH and UCY public datasets to train and evaluate model performance. The experimental results demonstrate that this model exhibits enhancements in both ADE and FDE metrics when compared to the SOTA model, with an average prediction error reduction of 15 % and 10 %, respectively. In scenes with dense pedestrians such as the UNIV dataset, the prediction errors are reduced by 25 % and 22 %.
{"title":"Pedestrian trajectory prediction model based on self-supervised spatiotemporal graph network","authors":"Shiji Yang, Xuezhong Xiao","doi":"10.1016/j.iswa.2025.200533","DOIUrl":"10.1016/j.iswa.2025.200533","url":null,"abstract":"<div><div>To improve the accuracy of pedestrian trajectory prediction, the graph - based pedestrian trajectory modeling method in the pedestrian trajectory prediction scenario is effective. Thus, a pedestrian trajectory prediction model based on a self - supervised spatiotemporal graph network is proposed. Firstly, in the process of spatiotemporal graph modeling, this model introduces hop interaction instead of node interaction to update node features, which greatly reduces the times of graph convolution operations, alleviates the problem of feature smoothing, and greatly improves the accuracy of prediction. Secondly, a unique self-supervised module is added to the model to mine commonalities between pedestrian’s multi-trajectories through self-supervised to further improve the accuracy of prediction. The experiment uses ETH and UCY public datasets to train and evaluate model performance. The experimental results demonstrate that this model exhibits enhancements in both ADE and FDE metrics when compared to the SOTA model, with an average prediction error reduction of 15 % and 10 %, respectively. In scenes with dense pedestrians such as the UNIV dataset, the prediction errors are reduced by 25 % and 22 %.</div></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"26 ","pages":"Article 200533"},"PeriodicalIF":0.0,"publicationDate":"2025-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144124236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}