Facial expressions are often considered the primary indicators of emotions. However, it is challenging to detect genuine emotions because they can be controlled. Many studies on emotion recognition have been conducted actively in recent years. In this study, we designed a convolutional neural network (CNN) model and proposed an algorithm that combines the analysis of bio-signals with facial expression templates to effectively predict emotional states. We utilized the EfficientNet-B0 architecture for network design and validation, known for achieving maximum performance with minimal parameters. The accuracy for emotion recognition using facial expression images alone was 74%, while the accuracy for emotion recognition combining biological signals reached 88.2%. These results demonstrate that integrating these two types of data leads to significantly improved accuracy. By combining the image and bio-signals captured in facial expressions, our model offers a more comprehensive and accurate understanding of emotional states.
{"title":"The Novel EfficientNet Architecture-Based System and Algorithm to Predict Complex Human Emotions","authors":"Mavlonbek Khomidov, Jong-Ha Lee","doi":"10.3390/a17070285","DOIUrl":"https://doi.org/10.3390/a17070285","url":null,"abstract":"Facial expressions are often considered the primary indicators of emotions. However, it is challenging to detect genuine emotions because they can be controlled. Many studies on emotion recognition have been conducted actively in recent years. In this study, we designed a convolutional neural network (CNN) model and proposed an algorithm that combines the analysis of bio-signals with facial expression templates to effectively predict emotional states. We utilized the EfficientNet-B0 architecture for network design and validation, known for achieving maximum performance with minimal parameters. The accuracy for emotion recognition using facial expression images alone was 74%, while the accuracy for emotion recognition combining biological signals reached 88.2%. These results demonstrate that integrating these two types of data leads to significantly improved accuracy. By combining the image and bio-signals captured in facial expressions, our model offers a more comprehensive and accurate understanding of emotional states.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141710527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This work introduces an unsupervised framework for video anomaly detection, leveraging a hybrid deep learning model that combines a vision transformer (ViT) with a convolutional spatiotemporal relationship (STR) attention block. The proposed model addresses the challenges of anomaly detection in video surveillance by capturing both local and global relationships within video frames, a task that traditional convolutional neural networks (CNNs) often struggle with due to their localized field of view. We have utilized a pre-trained ViT as an encoder for feature extraction, which is then processed by the STR attention block to enhance the detection of spatiotemporal relationships among objects in videos. The novelty of this work is utilizing the ViT with the STR attention to detect video anomalies effectively in large and heterogeneous datasets, an important thing given the diverse environments and scenarios encountered in real-world surveillance. The framework was evaluated on three benchmark datasets, i.e., the UCSD-Ped2, CHUCK Avenue, and ShanghaiTech. This demonstrates the model’s superior performance in detecting anomalies compared to state-of-the-art methods, showcasing its potential to significantly enhance automated video surveillance systems by achieving area under the receiver operating characteristic curve (AUC ROC) values of 95.6, 86.8, and 82.1. To show the effectiveness of the proposed framework in detecting anomalies in extra-large datasets, we trained the model on a subset of the huge contemporary CHAD dataset that contains over 1 million frames, achieving AUC ROC values of 71.8 and 64.2 for CHAD-Cam 1 and CHAD-Cam 2, respectively, which outperforms the state-of-the-art techniques.
{"title":"Enhancing Video Anomaly Detection Using a Transformer Spatiotemporal Attention Unsupervised Framework for Large Datasets","authors":"Mohamed H. Habeb, May Salama, Lamiaa A. Elrefaei","doi":"10.3390/a17070286","DOIUrl":"https://doi.org/10.3390/a17070286","url":null,"abstract":"This work introduces an unsupervised framework for video anomaly detection, leveraging a hybrid deep learning model that combines a vision transformer (ViT) with a convolutional spatiotemporal relationship (STR) attention block. The proposed model addresses the challenges of anomaly detection in video surveillance by capturing both local and global relationships within video frames, a task that traditional convolutional neural networks (CNNs) often struggle with due to their localized field of view. We have utilized a pre-trained ViT as an encoder for feature extraction, which is then processed by the STR attention block to enhance the detection of spatiotemporal relationships among objects in videos. The novelty of this work is utilizing the ViT with the STR attention to detect video anomalies effectively in large and heterogeneous datasets, an important thing given the diverse environments and scenarios encountered in real-world surveillance. The framework was evaluated on three benchmark datasets, i.e., the UCSD-Ped2, CHUCK Avenue, and ShanghaiTech. This demonstrates the model’s superior performance in detecting anomalies compared to state-of-the-art methods, showcasing its potential to significantly enhance automated video surveillance systems by achieving area under the receiver operating characteristic curve (AUC ROC) values of 95.6, 86.8, and 82.1. To show the effectiveness of the proposed framework in detecting anomalies in extra-large datasets, we trained the model on a subset of the huge contemporary CHAD dataset that contains over 1 million frames, achieving AUC ROC values of 71.8 and 64.2 for CHAD-Cam 1 and CHAD-Cam 2, respectively, which outperforms the state-of-the-art techniques.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":1.8,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141713920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mohamed Limam El hairach, Amal Tmiri, I. Bellamine
This study introduces a novel algorithm tailored for the precise detection of lower outliers (i.e., data points at the lower tail) in univariate datasets, which is particularly suited for scenarios with a single cluster and similar data distribution. The approach leverages a combination of transformative techniques and advanced filtration methods to efficiently segregate anomalies from normal values. Notably, the algorithm emphasizes high-precision outlier detection, ensuring minimal false positives, and requires only a few parameters for configuration. Its unsupervised nature enables robust outlier filtering without the need for extensive manual intervention. To validate its efficacy, the algorithm is rigorously tested using real-world data obtained from photovoltaic (PV) module strings with similar DC capacities, containing various outliers. The results demonstrate the algorithm’s capability to accurately identify lower outliers while maintaining computational efficiency and reliability in practical applications.
{"title":"Univariate Outlier Detection: Precision-Driven Algorithm for Single-Cluster Scenarios","authors":"Mohamed Limam El hairach, Amal Tmiri, I. Bellamine","doi":"10.3390/a17060259","DOIUrl":"https://doi.org/10.3390/a17060259","url":null,"abstract":"This study introduces a novel algorithm tailored for the precise detection of lower outliers (i.e., data points at the lower tail) in univariate datasets, which is particularly suited for scenarios with a single cluster and similar data distribution. The approach leverages a combination of transformative techniques and advanced filtration methods to efficiently segregate anomalies from normal values. Notably, the algorithm emphasizes high-precision outlier detection, ensuring minimal false positives, and requires only a few parameters for configuration. Its unsupervised nature enables robust outlier filtering without the need for extensive manual intervention. To validate its efficacy, the algorithm is rigorously tested using real-world data obtained from photovoltaic (PV) module strings with similar DC capacities, containing various outliers. The results demonstrate the algorithm’s capability to accurately identify lower outliers while maintaining computational efficiency and reliability in practical applications.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141339133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Saiya Li, Jinhe Su, Guoqing Jiang, Ziyu Huang, Xiaorong Zhang
Three-dimensional reconstruction from point clouds is an important research topic in computer vision and computer graphics. However, the discrete nature, sparsity, and noise of the original point cloud contribute to the results of 3D surface generation based on global features often appearing jagged and lacking details, making it difficult to describe shape details accurately. We address the challenge of generating smooth and detailed 3D surfaces from point clouds. We propose an adaptive octree partitioning method to divide the global shape into local regions of different scales. An iterative loop method based on GRU is then used to extract features from local voxels and learn local smoothness and global shape priors. Finally, a moving least-squares approach is employed to generate the 3D surface. Experiments demonstrate that our method outperforms existing methods on benchmark datasets (ShapeNet dataset, ABC dataset, and Famous dataset). Ablation studies confirm the effectiveness of the adaptive octree partitioning and GRU modules.
从点云进行三维重建是计算机视觉和计算机图形学的一个重要研究课题。然而,由于原始点云的离散性、稀疏性和噪声,基于全局特征生成的三维曲面往往显得参差不齐,缺乏细节,难以准确描述形状细节。我们要解决的难题是如何从点云生成平滑、细致的三维曲面。我们提出了一种自适应八叉树分割方法,将全局形状划分为不同尺度的局部区域。然后,使用基于 GRU 的迭代循环方法从局部体素中提取特征,并学习局部平滑度和全局形状先验。最后,采用移动最小二乘法生成三维曲面。实验证明,在基准数据集(ShapeNet 数据集、ABC 数据集和 Famous 数据集)上,我们的方法优于现有方法。消融研究证实了自适应八叉树分割和 GRU 模块的有效性。
{"title":"3D Reconstruction Based on Iterative Optimization of Moving Least-Squares Function","authors":"Saiya Li, Jinhe Su, Guoqing Jiang, Ziyu Huang, Xiaorong Zhang","doi":"10.3390/a17060263","DOIUrl":"https://doi.org/10.3390/a17060263","url":null,"abstract":"Three-dimensional reconstruction from point clouds is an important research topic in computer vision and computer graphics. However, the discrete nature, sparsity, and noise of the original point cloud contribute to the results of 3D surface generation based on global features often appearing jagged and lacking details, making it difficult to describe shape details accurately. We address the challenge of generating smooth and detailed 3D surfaces from point clouds. We propose an adaptive octree partitioning method to divide the global shape into local regions of different scales. An iterative loop method based on GRU is then used to extract features from local voxels and learn local smoothness and global shape priors. Finally, a moving least-squares approach is employed to generate the 3D surface. Experiments demonstrate that our method outperforms existing methods on benchmark datasets (ShapeNet dataset, ABC dataset, and Famous dataset). Ablation studies confirm the effectiveness of the adaptive octree partitioning and GRU modules.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141344382","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Osval A. Montesinos-López, Arvinth Sivakumar, Gloria Isabel Huerta Prado, Josafhat Salinas-Ruiz, Afolabi Agbona, Axel Efraín Ortiz Reyes, Khalid Alnowibet, Rodomiro Ortiz, Abelardo Montesinos-López, José Crossa
Genomic selection (GS) is a groundbreaking statistical machine learning method for advancing plant and animal breeding. Nonetheless, its practical implementation remains challenging due to numerous factors affecting its predictive performance. This research explores the potential of data augmentation to enhance prediction accuracy across entire datasets and specifically within the top 20% of the testing set. Our findings indicate that, overall, the data augmentation method (method A), when compared to the conventional model (method C) and assessed using Mean Arctangent Absolute Prediction Error (MAAPE) and normalized root mean square error (NRMSE), did not improve the prediction accuracy for the unobserved cultivars. However, significant improvements in prediction accuracy (evidenced by reduced prediction error) were observed when data augmentation was applied exclusively to the top 20% of the testing set. Specifically, reductions in MAAPE_20 and NRMSE_20 by 52.86% and 41.05%, respectively, were noted across various datasets. Further investigation is needed to refine data augmentation techniques for effective use in genomic prediction.
{"title":"Exploring Data Augmentation Algorithm to Improve Genomic Prediction of Top-Ranking Cultivars","authors":"Osval A. Montesinos-López, Arvinth Sivakumar, Gloria Isabel Huerta Prado, Josafhat Salinas-Ruiz, Afolabi Agbona, Axel Efraín Ortiz Reyes, Khalid Alnowibet, Rodomiro Ortiz, Abelardo Montesinos-López, José Crossa","doi":"10.3390/a17060260","DOIUrl":"https://doi.org/10.3390/a17060260","url":null,"abstract":"Genomic selection (GS) is a groundbreaking statistical machine learning method for advancing plant and animal breeding. Nonetheless, its practical implementation remains challenging due to numerous factors affecting its predictive performance. This research explores the potential of data augmentation to enhance prediction accuracy across entire datasets and specifically within the top 20% of the testing set. Our findings indicate that, overall, the data augmentation method (method A), when compared to the conventional model (method C) and assessed using Mean Arctangent Absolute Prediction Error (MAAPE) and normalized root mean square error (NRMSE), did not improve the prediction accuracy for the unobserved cultivars. However, significant improvements in prediction accuracy (evidenced by reduced prediction error) were observed when data augmentation was applied exclusively to the top 20% of the testing set. Specifically, reductions in MAAPE_20 and NRMSE_20 by 52.86% and 41.05%, respectively, were noted across various datasets. Further investigation is needed to refine data augmentation techniques for effective use in genomic prediction.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141344898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Md. Shahidur Rahman, Sowrav Chowdhury, Mirza Rasheduzzaman, A. B. M. S. U. Doulah
Respiratory Inductance Plethysmography (RIP) is a non-invasive method for the measurement of respiratory rates and lung volumes. Accurate detection of respiratory rates and volumes is crucial for the diagnosis and monitoring of prognosis of lung diseases, for which spirometry is classically used in clinical applications. RIP has been studied as an alternative to spirometry and shown promising results. Moreover, RIP data can be analyzed through machine learning (ML)-based approaches for some other purposes, i.e., detection of apneas, work of breathing (WoB) measurement, and recognition of human activity based on breathing patterns. The goal of this study is to provide an in-depth systematic review of the scope of usage of RIP and current RIP device developments, as well as to evaluate the performance, usability, and reliability of ML-based data analysis techniques within its designated scope while adhering to the PRISMA guidelines. This work also identifies research gaps in the field and highlights the potential scope for future work. The IEEE Explore, Springer, PLoS One, Science Direct, and Google Scholar databases were examined, and 40 publications were included in this work through a structured screening and quality assessment procedure. Studies with conclusive experimentation on RIP published between 2012 and 2023 were included, while unvalidated studies were excluded. The findings indicate that RIP is an effective method to a certain extent for testing and monitoring respiratory functions, though its accuracy is lacking in some settings. However, RIP possesses some advantages over spirometry due to its non-invasive nature and functionality for both stationary and ambulatory uses. RIP also demonstrates its capabilities in ML-based applications, such as detection of breathing asynchrony, classification of apnea, identification of sleep stage, and human activity recognition (HAR). It is our conclusion that, though RIP is not yet ready to replace spirometry and other established methods, it can provide crucial insights into subjects’ condition associated to respiratory illnesses. The implementation of artificial intelligence (AI) could play a potential role in improving the overall effectiveness of RIP, as suggested in some of the selected studies.
呼吸电感胸廓成像(RIP)是一种测量呼吸频率和肺活量的无创方法。准确检测呼吸频率和肺活量对于诊断和监测肺部疾病的预后至关重要,肺活量测定法通常用于临床应用。RIP 作为肺活量测定法的替代方法已被研究,并显示出良好的效果。此外,RIP 数据还可以通过基于机器学习(ML)的方法进行分析,用于其他一些目的,如检测呼吸暂停、测量呼吸功(WoB)以及根据呼吸模式识别人类活动。本研究的目的是对 RIP 的使用范围和当前 RIP 设备的发展情况进行深入系统的回顾,并在指定范围内评估基于 ML 的数据分析技术的性能、可用性和可靠性,同时遵守 PRISMA 准则。这项工作还确定了该领域的研究空白,并强调了未来工作的潜在范围。本研究对 IEEE Explore、Springer、PLoS One、Science Direct 和 Google Scholar 数据库进行了研究,并通过结构化筛选和质量评估程序将 40 篇出版物纳入本研究。收录了 2012 年至 2023 年间发表的关于 RIP 的确凿实验研究,同时排除了未经验证的研究。研究结果表明,RIP 在一定程度上是测试和监测呼吸功能的有效方法,但在某些情况下其准确性不足。不过,RIP 与肺活量测定法相比具有一些优势,因为它是非侵入性的,而且在固定和非卧床环境下均可使用。RIP 还展示了其在基于 ML 的应用中的能力,如检测呼吸不同步、呼吸暂停分类、睡眠阶段识别和人类活动识别 (HAR)。我们的结论是,虽然 RIP 还不能取代肺活量测定法和其他成熟的方法,但它能为了解受试者与呼吸系统疾病相关的状况提供重要信息。人工智能(AI)的应用可在提高 RIP 的整体有效性方面发挥潜在作用,正如一些选定研究中所建议的那样。
{"title":"Artificial Intelligence-Based Algorithms and Healthcare Applications of Respiratory Inductance Plethysmography: A Systematic Review","authors":"Md. Shahidur Rahman, Sowrav Chowdhury, Mirza Rasheduzzaman, A. B. M. S. U. Doulah","doi":"10.3390/a17060261","DOIUrl":"https://doi.org/10.3390/a17060261","url":null,"abstract":"Respiratory Inductance Plethysmography (RIP) is a non-invasive method for the measurement of respiratory rates and lung volumes. Accurate detection of respiratory rates and volumes is crucial for the diagnosis and monitoring of prognosis of lung diseases, for which spirometry is classically used in clinical applications. RIP has been studied as an alternative to spirometry and shown promising results. Moreover, RIP data can be analyzed through machine learning (ML)-based approaches for some other purposes, i.e., detection of apneas, work of breathing (WoB) measurement, and recognition of human activity based on breathing patterns. The goal of this study is to provide an in-depth systematic review of the scope of usage of RIP and current RIP device developments, as well as to evaluate the performance, usability, and reliability of ML-based data analysis techniques within its designated scope while adhering to the PRISMA guidelines. This work also identifies research gaps in the field and highlights the potential scope for future work. The IEEE Explore, Springer, PLoS One, Science Direct, and Google Scholar databases were examined, and 40 publications were included in this work through a structured screening and quality assessment procedure. Studies with conclusive experimentation on RIP published between 2012 and 2023 were included, while unvalidated studies were excluded. The findings indicate that RIP is an effective method to a certain extent for testing and monitoring respiratory functions, though its accuracy is lacking in some settings. However, RIP possesses some advantages over spirometry due to its non-invasive nature and functionality for both stationary and ambulatory uses. RIP also demonstrates its capabilities in ML-based applications, such as detection of breathing asynchrony, classification of apnea, identification of sleep stage, and human activity recognition (HAR). It is our conclusion that, though RIP is not yet ready to replace spirometry and other established methods, it can provide crucial insights into subjects’ condition associated to respiratory illnesses. The implementation of artificial intelligence (AI) could play a potential role in improving the overall effectiveness of RIP, as suggested in some of the selected studies.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141343614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
With the rapid advancement of urban intelligence, there is an increasingly urgent demand for technological innovation in traffic management. License plate recognition technology can achieve high accuracy under ideal conditions but faces significant challenges in complex traffic environments and adverse weather conditions. To address these challenges, we propose the enhanced attention network and decoding for license plate recognition model (EAND-LPRM). This model leverages an encoder to extract features from image sequences and employs a self-attention mechanism to focus on critical feature information, enhancing its capability to handle complex traffic scenarios such as rainy weather and license plate distortion. We have curated and utilized publicly available datasets that closely reflect real-world scenarios, ensuring transparency and reproducibility. Experimental evaluations conducted on these datasets, which include various complex scenarios, demonstrate that the EAND-LPRM model achieves an accuracy of 94%, representing a 6% improvement over traditional license plate recognition algorithms. The main contributions of this research include the development of a novel attention-mechanism-based architecture, comprehensive evaluation on multiple datasets, and substantial performance improvements under diverse and challenging conditions. This study provides a practical solution for automatic license plate recognition systems in dynamic and unpredictable environments.
{"title":"EAND-LPRM: Enhanced Attention Network and Decoding for Efficient License Plate Recognition under Complex Conditions","authors":"Shijuan Chen, Zongmei Li, Xiaofeng Du, Qin Nie","doi":"10.3390/a17060262","DOIUrl":"https://doi.org/10.3390/a17060262","url":null,"abstract":"With the rapid advancement of urban intelligence, there is an increasingly urgent demand for technological innovation in traffic management. License plate recognition technology can achieve high accuracy under ideal conditions but faces significant challenges in complex traffic environments and adverse weather conditions. To address these challenges, we propose the enhanced attention network and decoding for license plate recognition model (EAND-LPRM). This model leverages an encoder to extract features from image sequences and employs a self-attention mechanism to focus on critical feature information, enhancing its capability to handle complex traffic scenarios such as rainy weather and license plate distortion. We have curated and utilized publicly available datasets that closely reflect real-world scenarios, ensuring transparency and reproducibility. Experimental evaluations conducted on these datasets, which include various complex scenarios, demonstrate that the EAND-LPRM model achieves an accuracy of 94%, representing a 6% improvement over traditional license plate recognition algorithms. The main contributions of this research include the development of a novel attention-mechanism-based architecture, comprehensive evaluation on multiple datasets, and substantial performance improvements under diverse and challenging conditions. This study provides a practical solution for automatic license plate recognition systems in dynamic and unpredictable environments.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-06-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141342259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Yue Sun, Zhaohong Huang, Guorong Cai, Jinhe Su, Zheng Gong
Breast tumor segmentation of ultrasound images provides valuable tumor information for early detection and diagnosis. However, speckle noise and blurred boundaries in breast ultrasound images present challenges for tumor segmentation, especially for malignant tumors with irregular shapes. Recent vision transformers have shown promising performance in handling the variation through global context modeling. Nevertheless, they are often dominated by features of large patterns and lack the ability to recognize negative information in ultrasound images, which leads to the loss of breast tumor details (e.g., boundaries and small objects). In this paper, we propose a novel noise suppression and boundary refinement network, NSBR-Net, to simultaneously alleviate speckle noise interference and blurred boundary problems of breast tumor segmentation. Specifically, we propose two innovative designs, namely, the Noise Suppression Module (NSM) and the Boundary Refinement Module (BRM). The NSM filters noise information from the coarse-grained feature maps, while the BRM progressively refines the boundaries of significant lesion objects. Our method demonstrates superior accuracy over state-of-the-art deep learning models, achieving significant improvements of 3.67% on Dataset B and 2.30% on the BUSI dataset in mDice for testing malignant tumors.
{"title":"NSBR-Net: A Novel Noise Suppression and Boundary Refinement Network for Breast Tumor Segmentation in Ultrasound Images","authors":"Yue Sun, Zhaohong Huang, Guorong Cai, Jinhe Su, Zheng Gong","doi":"10.3390/a17060257","DOIUrl":"https://doi.org/10.3390/a17060257","url":null,"abstract":"Breast tumor segmentation of ultrasound images provides valuable tumor information for early detection and diagnosis. However, speckle noise and blurred boundaries in breast ultrasound images present challenges for tumor segmentation, especially for malignant tumors with irregular shapes. Recent vision transformers have shown promising performance in handling the variation through global context modeling. Nevertheless, they are often dominated by features of large patterns and lack the ability to recognize negative information in ultrasound images, which leads to the loss of breast tumor details (e.g., boundaries and small objects). In this paper, we propose a novel noise suppression and boundary refinement network, NSBR-Net, to simultaneously alleviate speckle noise interference and blurred boundary problems of breast tumor segmentation. Specifically, we propose two innovative designs, namely, the Noise Suppression Module (NSM) and the Boundary Refinement Module (BRM). The NSM filters noise information from the coarse-grained feature maps, while the BRM progressively refines the boundaries of significant lesion objects. Our method demonstrates superior accuracy over state-of-the-art deep learning models, achieving significant improvements of 3.67% on Dataset B and 2.30% on the BUSI dataset in mDice for testing malignant tumors.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141354148","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Vikas Mittal, Kanta Prasad Sharma, Narmadha Thangarasu, Udandarao Sarat, Ahmad O. Hourani, Rohit Salgotra
Circular antenna arrays (CAAs) find extensive utility in a range of cutting-edge communication applications such as 5G networks, the Internet of Things (IoT), and advanced beamforming technologies. In the realm of antenna design, the side lobes levels (SLL) in the radiation pattern hold significant importance within communication systems. This is primarily due to its role in mitigating signal interference across the entire radiation pattern’s side lobes. In order to suppress the subsidiary lobe, achieve the required primary lobe orientation, and improve directivity, an optimization problem is used in this work. This paper introduces a method aimed at enhancing the radiation pattern of CAA by minimizing its SLL using a Hybrid Sooty Tern Naked Mole-Rat Algorithm (STNMRA). The simulation results show that the hybrid optimization method significantly reduces side lobes while maintaining reasonable directivity compared to the uniform array and other competitive metaheuristics.
{"title":"Synthesis of Circular Antenna Arrays for Achieving Lower Side Lobe Level and Higher Directivity Using Hybrid Optimization Algorithm","authors":"Vikas Mittal, Kanta Prasad Sharma, Narmadha Thangarasu, Udandarao Sarat, Ahmad O. Hourani, Rohit Salgotra","doi":"10.3390/a17060256","DOIUrl":"https://doi.org/10.3390/a17060256","url":null,"abstract":"Circular antenna arrays (CAAs) find extensive utility in a range of cutting-edge communication applications such as 5G networks, the Internet of Things (IoT), and advanced beamforming technologies. In the realm of antenna design, the side lobes levels (SLL) in the radiation pattern hold significant importance within communication systems. This is primarily due to its role in mitigating signal interference across the entire radiation pattern’s side lobes. In order to suppress the subsidiary lobe, achieve the required primary lobe orientation, and improve directivity, an optimization problem is used in this work. This paper introduces a method aimed at enhancing the radiation pattern of CAA by minimizing its SLL using a Hybrid Sooty Tern Naked Mole-Rat Algorithm (STNMRA). The simulation results show that the hybrid optimization method significantly reduces side lobes while maintaining reasonable directivity compared to the uniform array and other competitive metaheuristics.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141357340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Currently, the problem of information overload, a term popularized by Alvin Toffler in his book Future Shock [1], is more present than ever due to the rapid development of the Internet [...]
{"title":"Guest Editorial for the Special Issue “New Trends in Algorithms for Intelligent Recommendation Systems”","authors":"Edward Rolando Núñez-Valdéz, Vicente García-Díaz","doi":"10.3390/a17060255","DOIUrl":"https://doi.org/10.3390/a17060255","url":null,"abstract":"Currently, the problem of information overload, a term popularized by Alvin Toffler in his book Future Shock [1], is more present than ever due to the rapid development of the Internet [...]","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141363605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}