Algorithms最新文献_第5页

A Case-Study Comparison of Machine Learning Approaches for Predicting Student’s Dropout from Multiple Online Educational Entities 预测多个在线教育实体学生辍学情况的机器学习方法案例研究比较

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-03 DOI: 10.3390/a16120554

José Manuel Porras, J. Lara, Cristóbal Romero, Sebastián Ventura

Predicting student dropout is a crucial task in online education. Traditionally, each educational entity (institution, university, faculty, department, etc.) creates and uses its own prediction model starting from its own data. However, that approach is not always feasible or advisable and may depend on the availability of data, local infrastructure, and resources. In those cases, there are various machine learning approaches for sharing data and/or models between educational entities, using a classical centralized machine learning approach or other more advanced approaches such as transfer learning or federated learning. In this paper, we used data from three different LMS Moodle servers representing homogeneous different-sized educational entities. We tested the performance of the different machine learning approaches for the problem of predicting student dropout with multiple educational entities involved. We used a deep learning algorithm as a predictive classifier method. Our preliminary findings provide useful information on the benefits and drawbacks of each approach, as well as suggestions for enhancing performance when there are multiple institutions. In our case, repurposed transfer learning, stacked transfer learning, and centralized approaches produced similar or better results than the locally trained models for most of the entities.

预测学生辍学是在线教育的一项重要任务。传统上，每个教育实体(机构、大学、学院、部门等)都会根据自己的数据创建并使用自己的预测模型。然而，这种方法并不总是可行或可取的，并且可能取决于数据、本地基础设施和资源的可用性。在这些情况下，有各种机器学习方法用于在教育实体之间共享数据和/或模型，使用经典的集中式机器学习方法或其他更高级的方法，如迁移学习或联邦学习。在本文中，我们使用了来自三个不同的LMS Moodle服务器的数据，这些服务器代表了同构的不同大小的教育实体。我们测试了不同机器学习方法在预测涉及多个教育实体的学生退学问题上的性能。我们使用深度学习算法作为预测分类器方法。我们的初步研究结果提供了关于每种方法的优缺点的有用信息，以及在有多个机构时提高性能的建议。在我们的案例中，与大多数实体的本地训练模型相比，重新定位迁移学习、堆叠迁移学习和集中式方法产生了类似或更好的结果。

{"title":"A Case-Study Comparison of Machine Learning Approaches for Predicting Student’s Dropout from Multiple Online Educational Entities","authors":"José Manuel Porras, J. Lara, Cristóbal Romero, Sebastián Ventura","doi":"10.3390/a16120554","DOIUrl":"https://doi.org/10.3390/a16120554","url":null,"abstract":"Predicting student dropout is a crucial task in online education. Traditionally, each educational entity (institution, university, faculty, department, etc.) creates and uses its own prediction model starting from its own data. However, that approach is not always feasible or advisable and may depend on the availability of data, local infrastructure, and resources. In those cases, there are various machine learning approaches for sharing data and/or models between educational entities, using a classical centralized machine learning approach or other more advanced approaches such as transfer learning or federated learning. In this paper, we used data from three different LMS Moodle servers representing homogeneous different-sized educational entities. We tested the performance of the different machine learning approaches for the problem of predicting student dropout with multiple educational entities involved. We used a deep learning algorithm as a predictive classifier method. Our preliminary findings provide useful information on the benefits and drawbacks of each approach, as well as suggestions for enhancing performance when there are multiple institutions. In our case, repurposed transfer learning, stacked transfer learning, and centralized approaches produced similar or better results than the locally trained models for most of the entities.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"45 19","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138605469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Lightweight Graph Neural Network Algorithm for Action Recognition Based on Self-Distillation 基于自发散的轻量级图神经网络动作识别算法

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-01 DOI: 10.3390/a16120552

Miao Feng, Jean Meunier

Recognizing human actions can help in numerous ways, such as health monitoring, intelligent surveillance, virtual reality and human–computer interaction. A quick and accurate detection algorithm is required for daily real-time detection. This paper first proposes to generate a lightweight graph neural network by self-distillation for human action recognition tasks. The lightweight graph neural network was evaluated on the NTU-RGB+D dataset. The results demonstrate that, with competitive accuracy, the heavyweight graph neural network can be compressed by up to 80%. Furthermore, the learned representations have denser clusters, estimated by the Davies–Bouldin index, the Dunn index and silhouette coefficients. The ideal input data and algorithm capacity are also discussed.

识别人类行为可以在许多方面提供帮助，例如健康监测、智能监视、虚拟现实和人机交互。为了实现日常的实时检测，需要一种快速准确的检测算法。本文首先提出了一种基于自蒸馏的轻量图神经网络的人体动作识别方法。在NTU-RGB+D数据集上对轻量级图神经网络进行了评价。结果表明，在具有竞争精度的情况下，重量级图神经网络可以被压缩高达80%。此外，通过davis - bouldin指数、Dunn指数和剪影系数估计，学习到的表示具有更密集的聚类。讨论了理想的输入数据和算法容量。

引用次数: 0

Automatic Segmentation of Histological Images of Mouse Brains 小鼠大脑组织学图像的自动分割

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-01 DOI: 10.3390/a16120553

Juan Cisneros, Alain Lalande, Binnaz Yalcin, Fabrice Meriaudeau, Stephan Collins

Using a high-throughput neuroanatomical screen of histological brain sections developed in collaboration with the International Mouse Phenotyping Consortium, we previously reported a list of 198 genes whose inactivation leads to neuroanatomical phenotypes. To achieve this milestone, tens of thousands of hours of manual image segmentation were necessary. The present work involved developing a full pipeline to automate the application of deep learning methods for the automated segmentation of 24 anatomical regions used in the aforementioned screen. The dataset includes 2000 annotated parasagittal slides (24,000 × 14,000 pixels). Our approach consists of three main parts: the conversion of images (.ROI to .PNG), the training of the deep learning approach on the compressed images (512 × 256 and 2048 × 1024 pixels of the deep learning approach) to extract the regions of interest using either the U-Net or Attention U-Net architectures, and finally the transformation of the identified regions (.PNG to .ROI), enabling visualization and editing within the Fiji/ImageJ 1.54 software environment. With an image resolution of 2048 × 1024, the Attention U-Net provided the best results with an overall Dice Similarity Coefficient (DSC) of 0.90 ± 0.01 for all 24 regions. Using one command line, the end-user is now able to pre-analyze images automatically, then runs the existing analytical pipeline made of ImageJ macros to validate the automatically generated regions of interest resulting. Even for regions with low DSC, expert neuroanatomists rarely correct the results. We estimate a time savings of 6 to 10 times.

利用与国际小鼠表型联盟合作开发的脑组织组织的高通量神经解剖学筛选，我们先前报道了198个基因的失活导致神经解剖学表型的列表。为了实现这一里程碑，需要成千上万小时的人工图像分割。目前的工作涉及开发一个完整的流水线，以自动应用深度学习方法对上述屏幕中使用的24个解剖区域进行自动分割。该数据集包括2000张带注释的副矢状面幻灯片(24000 × 14000像素)。我们的方法包括三个主要部分:图像的转换(;ROI到。png)，对压缩图像(深度学习方法的512 × 256和2048 × 1024像素)进行深度学习方法的训练，使用U-Net或Attention U-Net架构提取感兴趣的区域，最后将识别的区域(. png到。ROI)进行转换，从而在Fiji/ImageJ 1.54软件环境中实现可视化和编辑。在图像分辨率为2048 × 1024的情况下，Attention U-Net在24个区域的整体Dice Similarity Coefficient (DSC)为0.90±0.01，呈现出最佳效果。使用一个命令行，最终用户现在能够自动预分析图像，然后运行由ImageJ宏组成的现有分析管道来验证自动生成的感兴趣的区域。即使对于低DSC的区域，专家神经解剖学家也很少纠正结果。我们估计可以节省6到10倍的时间。

{"title":"Automatic Segmentation of Histological Images of Mouse Brains","authors":"Juan Cisneros, Alain Lalande, Binnaz Yalcin, Fabrice Meriaudeau, Stephan Collins","doi":"10.3390/a16120553","DOIUrl":"https://doi.org/10.3390/a16120553","url":null,"abstract":"Using a high-throughput neuroanatomical screen of histological brain sections developed in collaboration with the International Mouse Phenotyping Consortium, we previously reported a list of 198 genes whose inactivation leads to neuroanatomical phenotypes. To achieve this milestone, tens of thousands of hours of manual image segmentation were necessary. The present work involved developing a full pipeline to automate the application of deep learning methods for the automated segmentation of 24 anatomical regions used in the aforementioned screen. The dataset includes 2000 annotated parasagittal slides (24,000 × 14,000 pixels). Our approach consists of three main parts: the conversion of images (.ROI to .PNG), the training of the deep learning approach on the compressed images (512 × 256 and 2048 × 1024 pixels of the deep learning approach) to extract the regions of interest using either the U-Net or Attention U-Net architectures, and finally the transformation of the identified regions (.PNG to .ROI), enabling visualization and editing within the Fiji/ImageJ 1.54 software environment. With an image resolution of 2048 × 1024, the Attention U-Net provided the best results with an overall Dice Similarity Coefficient (DSC) of 0.90 ± 0.01 for all 24 regions. Using one command line, the end-user is now able to pre-analyze images automatically, then runs the existing analytical pipeline made of ImageJ macros to validate the automatically generated regions of interest resulting. Even for regions with low DSC, expert neuroanatomists rarely correct the results. We estimate a time savings of 6 to 10 times.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"5 3","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138627079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Predicting the Impact of Data Poisoning Attacks in Blockchain-Enabled Supply Chain Networks 预测区块链供应链网络中数据中毒攻击的影响

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-11-29 DOI: 10.3390/a16120549

Usman Javed Butt, Osama Hussien, Krison Hasanaj, Khaled Shaalan, Bilal Hassan, Haider al-Khateeb

As computer networks become increasingly important in various domains, the need for secure and reliable networks becomes more pressing, particularly in the context of blockchain-enabled supply chain networks. One way to ensure network security is by using intrusion detection systems (IDSs), which are specialised devices that detect anomalies and attacks in the network. However, these systems are vulnerable to data poisoning attacks, such as label and distance-based flipping, which can undermine their effectiveness within blockchain-enabled supply chain networks. In this research paper, we investigate the effect of these attacks on a network intrusion detection system using several machine learning models, including logistic regression, random forest, SVC, and XGB Classifier, and evaluate each model via their F1 Score, confusion matrix, and accuracy. We run each model three times: once without any attack, once with random label flipping with a randomness of 20%, and once with distance-based label flipping attacks with a distance threshold of 0.5. Additionally, this research tests an eight-layer neural network using accuracy metrics and a classification report library. The primary goal of this research is to provide insights into the effect of data poisoning attacks on machine learning models within the context of blockchain-enabled supply chain networks. By doing so, we aim to contribute to developing more robust intrusion detection systems tailored to the specific challenges of securing blockchain-based supply chain networks.

随着计算机网络在各个领域变得越来越重要，对安全可靠网络的需求也变得更加迫切，特别是在区块链支持的供应链网络中。确保网络安全的一种方法是使用入侵检测系统（IDS），这是一种检测网络异常和攻击的专用设备。然而，这些系统很容易受到数据中毒攻击，如基于标签和距离的翻转，这会削弱它们在区块链供应链网络中的有效性。在本研究论文中，我们使用几种机器学习模型（包括逻辑回归、随机森林、SVC 和 XGB 分类器）研究了这些攻击对网络入侵检测系统的影响，并通过 F1 分数、混淆矩阵和准确率对每个模型进行了评估。我们将每个模型运行三次：一次不带任何攻击，一次是随机标签翻转（随机性为 20%），一次是基于距离的标签翻转攻击（距离阈值为 0.5）。此外，本研究还使用准确度指标和分类报告库测试了八层神经网络。本研究的主要目标是深入了解在区块链支持的供应链网络中，数据中毒攻击对机器学习模型的影响。通过这样做，我们旨在为开发更强大的入侵检测系统做出贡献，该系统是针对确保基于区块链的供应链网络安全所面临的具体挑战而量身定制的。

{"title":"Predicting the Impact of Data Poisoning Attacks in Blockchain-Enabled Supply Chain Networks","authors":"Usman Javed Butt, Osama Hussien, Krison Hasanaj, Khaled Shaalan, Bilal Hassan, Haider al-Khateeb","doi":"10.3390/a16120549","DOIUrl":"https://doi.org/10.3390/a16120549","url":null,"abstract":"As computer networks become increasingly important in various domains, the need for secure and reliable networks becomes more pressing, particularly in the context of blockchain-enabled supply chain networks. One way to ensure network security is by using intrusion detection systems (IDSs), which are specialised devices that detect anomalies and attacks in the network. However, these systems are vulnerable to data poisoning attacks, such as label and distance-based flipping, which can undermine their effectiveness within blockchain-enabled supply chain networks. In this research paper, we investigate the effect of these attacks on a network intrusion detection system using several machine learning models, including logistic regression, random forest, SVC, and XGB Classifier, and evaluate each model via their F1 Score, confusion matrix, and accuracy. We run each model three times: once without any attack, once with random label flipping with a randomness of 20%, and once with distance-based label flipping attacks with a distance threshold of 0.5. Additionally, this research tests an eight-layer neural network using accuracy metrics and a classification report library. The primary goal of this research is to provide insights into the effect of data poisoning attacks on machine learning models within the context of blockchain-enabled supply chain networks. By doing so, we aim to contribute to developing more robust intrusion detection systems tailored to the specific challenges of securing blockchain-based supply chain networks.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"52 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139209509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

OrthoDETR: A Streamlined Transformer-Based Approach for Precision Detection of Orthopedic Medical Devices OrthoDETR：基于简化变压器的矫形医疗器械精密检测方法

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-11-29 DOI: 10.3390/a16120550

Xiaobo Zhang, Huashun Li, Jingzhao Li, Xuehai Zhou

The rapid and accurate detection of orthopedic medical devices is pivotal in enhancing health care delivery, particularly by improving workflow efficiency. Despite advancements in medical imaging technology, current detection models often fail to meet the unique requirements of orthopedic device detection. To address this gap, we introduce OrthoDETR, a Transformer-based object detection model specifically designed and optimized for orthopedic medical devices. OrthoDETR is an evolution of the DETR (Detection Transformer) model, with several key modifications to better serve orthopedic applications. We replace the ResNet backbone with the MLP-Mixer, improve the multi-head self-attention mechanism, and refine the loss function for more accurate detections. In our comparative study, OrthoDETR outperformed other models, achieving an AP50 score of 0.897, an AP50:95 score of 0.864, an AR50:95 score of 0.895, and a frame per second (FPS) rate of 26. This represents a significant improvement over the DETR model, which achieved an AP50 score of 0.852, an AP50:95 score of 0.842, an AR50:95 score of 0.862, and an FPS rate of 20. OrthoDETR not only accelerates the detection process but also maintains an acceptable performance trade-off. The real-world impact of this model is substantial. By facilitating the precise and quick detection of orthopedic devices, OrthoDETR can potentially revolutionize the management of orthopedic workflows, improving patient care, and enhancing the efficiency of healthcare systems. This paper underlines the significance of specialized object detection models in orthopedics and sets the stage for further research in this direction.

快速、准确地检测骨科医疗器械对于提高医疗服务质量，尤其是提高工作流程效率至关重要。尽管医学成像技术不断进步，但目前的检测模型往往无法满足骨科设备检测的独特要求。为了弥补这一缺陷，我们推出了 OrthoDETR，这是一种基于变换器的物体检测模型，专为骨科医疗设备而设计和优化。OrthoDETR 是 DETR（Detection Transformer，检测变换器）模型的进化版，为更好地服务于骨科应用进行了几处关键修改。我们用 MLP-Mixer 代替了 ResNet 主干网，改进了多头自注意机制，并完善了损失函数，以实现更精确的检测。在比较研究中，OrthoDETR 的表现优于其他模型，AP50 得分为 0.897，AP50:95 得分为 0.864，AR50:95 得分为 0.895，每秒帧数 (FPS) 率为 26。与 DETR 模型相比，该模型的 AP50 得分为 0.852，AP50:95 得分为 0.842，AR50:95 得分为 0.862，每秒帧数 (FPS) 为 20。OrthoDETR 不仅加速了检测过程，还保持了可接受的性能权衡。该模型对现实世界的影响是巨大的。通过促进骨科设备的精确快速检测，OrthoDETR 有可能彻底改变骨科工作流程的管理，改善患者护理，提高医疗保健系统的效率。本文强调了矫形外科中专业物体检测模型的重要性，并为这一方向的进一步研究奠定了基础。

{"title":"OrthoDETR: A Streamlined Transformer-Based Approach for Precision Detection of Orthopedic Medical Devices","authors":"Xiaobo Zhang, Huashun Li, Jingzhao Li, Xuehai Zhou","doi":"10.3390/a16120550","DOIUrl":"https://doi.org/10.3390/a16120550","url":null,"abstract":"The rapid and accurate detection of orthopedic medical devices is pivotal in enhancing health care delivery, particularly by improving workflow efficiency. Despite advancements in medical imaging technology, current detection models often fail to meet the unique requirements of orthopedic device detection. To address this gap, we introduce OrthoDETR, a Transformer-based object detection model specifically designed and optimized for orthopedic medical devices. OrthoDETR is an evolution of the DETR (Detection Transformer) model, with several key modifications to better serve orthopedic applications. We replace the ResNet backbone with the MLP-Mixer, improve the multi-head self-attention mechanism, and refine the loss function for more accurate detections. In our comparative study, OrthoDETR outperformed other models, achieving an AP50 score of 0.897, an AP50:95 score of 0.864, an AR50:95 score of 0.895, and a frame per second (FPS) rate of 26. This represents a significant improvement over the DETR model, which achieved an AP50 score of 0.852, an AP50:95 score of 0.842, an AR50:95 score of 0.862, and an FPS rate of 20. OrthoDETR not only accelerates the detection process but also maintains an acceptable performance trade-off. The real-world impact of this model is substantial. By facilitating the precise and quick detection of orthopedic devices, OrthoDETR can potentially revolutionize the management of orthopedic workflows, improving patient care, and enhancing the efficiency of healthcare systems. This paper underlines the significance of specialized object detection models in orthopedics and sets the stage for further research in this direction.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"1 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139213500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimizing Physics-Informed Neural Network in Dynamic System Simulation and Learning of Parameters 在动态系统仿真和参数学习中优化物理信息神经网络

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-11-28 DOI: 10.3390/a16120547

Ebenezer O. Oluwasakin, Abdul Q. M. Khaliq

Artificial neural networks have changed many fields by giving scientists a strong way to model complex phenomena. They are also becoming increasingly useful for solving various difficult scientific problems. Still, people keep trying to find faster and more accurate ways to simulate dynamic systems. This research explores the transformative capabilities of physics-informed neural networks, a specialized subset of artificial neural networks, in modeling complex dynamical systems with enhanced speed and accuracy. These networks incorporate known physical laws into the learning process, ensuring predictions remain consistent with fundamental principles, which is crucial when dealing with scientific phenomena. This study focuses on optimizing the application of this specialized network for simultaneous system dynamics simulations and learning time-varying parameters, particularly when the number of unknowns in the system matches the number of undetermined parameters. Additionally, we explore scenarios with a mismatch between parameters and equations, optimizing network architecture to enhance convergence speed, computational efficiency, and accuracy in learning the time-varying parameter. Our approach enhances the algorithm’s performance and accuracy, ensuring optimal use of computational resources and yielding more precise results. Extensive experiments are conducted on four different dynamical systems: first-order irreversible chain reactions, biomass transfer, the Brusselsator model, and the Lotka-Volterra model, using synthetically generated data to validate our approach. Additionally, we apply our method to the susceptible-infected-recovered model, utilizing real-world COVID-19 data to learn the time-varying parameters of the pandemic’s spread. A comprehensive comparison between the performance of our approach and fully connected deep neural networks is presented, evaluating both accuracy and computational efficiency in parameter identification and system dynamics capture. The results demonstrate that the physics-informed neural networks outperform fully connected deep neural networks in performance, especially with increased network depth, making them ideal for real-time complex system modeling. This underscores the physics-informed neural network’s effectiveness in scientific modeling in scenarios with balanced unknowns and parameters. Furthermore, it provides a fast, accurate, and efficient alternative for analyzing dynamic systems.

人工神经网络改变了许多领域，为科学家们提供了建立复杂现象模型的有力方法。它们在解决各种棘手的科学问题方面也变得越来越有用。尽管如此，人们仍在不断努力寻找更快、更准确的方法来模拟动态系统。这项研究探索了物理信息神经网络（人工神经网络的一个专门子集）在以更快的速度和更高的精度模拟复杂动态系统方面的变革能力。这些网络将已知物理定律纳入学习过程，确保预测结果与基本原理保持一致，这在处理科学现象时至关重要。本研究的重点是优化这种专用网络在同步系统动力学模拟和时变参数学习中的应用，尤其是当系统中未知数的数量与未确定参数的数量相匹配时。此外，我们还探索了参数与方程不匹配的情况，优化了网络结构，以提高收敛速度、计算效率和学习时变参数的准确性。我们的方法提高了算法的性能和准确性，确保了计算资源的最佳利用，并产生了更精确的结果。我们在四个不同的动力系统上进行了广泛的实验：一阶不可逆链式反应、生物质转移、布鲁塞尔托模型和洛特卡-沃尔特拉模型，并使用合成生成的数据来验证我们的方法。此外，我们还将我们的方法应用于易感-感染-恢复模型，利用真实世界的 COVID-19 数据来学习大流行病传播的时变参数。报告全面比较了我们的方法和全连接深度神经网络的性能，评估了参数识别和系统动力学捕捉的准确性和计算效率。结果表明，物理信息神经网络的性能优于全连接深度神经网络，尤其是随着网络深度的增加，使其成为实时复杂系统建模的理想选择。这凸显了物理信息神经网络在平衡未知数和参数的情况下进行科学建模的有效性。此外，它还为动态系统分析提供了快速、准确和高效的替代方案。

{"title":"Optimizing Physics-Informed Neural Network in Dynamic System Simulation and Learning of Parameters","authors":"Ebenezer O. Oluwasakin, Abdul Q. M. Khaliq","doi":"10.3390/a16120547","DOIUrl":"https://doi.org/10.3390/a16120547","url":null,"abstract":"Artificial neural networks have changed many fields by giving scientists a strong way to model complex phenomena. They are also becoming increasingly useful for solving various difficult scientific problems. Still, people keep trying to find faster and more accurate ways to simulate dynamic systems. This research explores the transformative capabilities of physics-informed neural networks, a specialized subset of artificial neural networks, in modeling complex dynamical systems with enhanced speed and accuracy. These networks incorporate known physical laws into the learning process, ensuring predictions remain consistent with fundamental principles, which is crucial when dealing with scientific phenomena. This study focuses on optimizing the application of this specialized network for simultaneous system dynamics simulations and learning time-varying parameters, particularly when the number of unknowns in the system matches the number of undetermined parameters. Additionally, we explore scenarios with a mismatch between parameters and equations, optimizing network architecture to enhance convergence speed, computational efficiency, and accuracy in learning the time-varying parameter. Our approach enhances the algorithm’s performance and accuracy, ensuring optimal use of computational resources and yielding more precise results. Extensive experiments are conducted on four different dynamical systems: first-order irreversible chain reactions, biomass transfer, the Brusselsator model, and the Lotka-Volterra model, using synthetically generated data to validate our approach. Additionally, we apply our method to the susceptible-infected-recovered model, utilizing real-world COVID-19 data to learn the time-varying parameters of the pandemic’s spread. A comprehensive comparison between the performance of our approach and fully connected deep neural networks is presented, evaluating both accuracy and computational efficiency in parameter identification and system dynamics capture. The results demonstrate that the physics-informed neural networks outperform fully connected deep neural networks in performance, especially with increased network depth, making them ideal for real-time complex system modeling. This underscores the physics-informed neural network’s effectiveness in scientific modeling in scenarios with balanced unknowns and parameters. Furthermore, it provides a fast, accurate, and efficient alternative for analyzing dynamic systems.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"93 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139216705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Special Issue on “Algorithms for Biomedical Image Analysis and Processing” 生物医学图像分析与处理算法 "特刊

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-11-28 DOI: 10.3390/a16120544

L. Antonelli, Lucia Maddalena

Biomedical imaging is a broad field concerning image capture for diagnostic and therapeutic purposes [...]

生物医学成像是一个广泛的领域，涉及用于诊断和治疗目的的图像捕捉 [...]

引用次数: 0

An Efficient Optimized DenseNet Model for Aspect-Based Multi-Label Classification 基于方面的多标签分类的高效优化密集网络模型

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-11-28 DOI: 10.3390/a16120548

N. Ayub, Tayyaba, Saddam Hussain, Syed Sajid Ullah, Jawaid Iqbal

Sentiment analysis holds great importance within the domain of natural language processing as it examines both the expressed and underlying emotions conveyed through review content. Furthermore, researchers have discovered that relying solely on the overall sentiment derived from the textual content is inadequate. Consequently, sentiment analysis was developed to extract nuanced expressions from textual information. One of the challenges in this field is effectively extracting emotional elements using multi-label data that covers various aspects. This article presents a novel approach called the Ensemble of DenseNet based on Aquila Optimizer (EDAO). EDAO is specifically designed to enhance the precision and diversity of multi-label learners. Unlike traditional multi-label methods, EDAO strongly emphasizes improving model diversity and accuracy in multi-label scenarios. To evaluate the effectiveness of our approach, we conducted experiments on seven distinct datasets, including emotions, hotels, movies, proteins, automobiles, medical, news, and birds. Our initial strategy involves establishing a preprocessing mechanism to obtain precise and refined data. Subsequently, we used the Vader tool with Bag of Words (BoW) for feature extraction. In the third stage, we created word associations using the word2vec method. The improved data were also used to train and test the DenseNet model, which was fine-tuned using the Aquila Optimizer (AO). On the news, emotion, auto, bird, movie, hotel, protein, and medical datasets, utilizing the aspect-based multi-labeling technique, we achieved accuracy rates of 95%, 97%, and 96%, respectively, with DenseNet-AO. Our proposed model demonstrates that EDAO outperforms other standard methods across various multi-label datasets with different dimensions. The implemented strategy has been rigorously validated through experimental results, showcasing its effectiveness compared to existing benchmark approaches.

情感分析在自然语言处理领域具有重要意义，因为它既能研究评论内容所表达的情感，也能研究评论内容所隐含的情感。此外，研究人员还发现，仅仅依靠从文本内容中获得的整体情感是不够的。因此，情感分析应运而生，目的是从文本信息中提取细微的表达。该领域面临的挑战之一是如何利用涵盖各个方面的多标签数据有效地提取情感元素。本文介绍了一种名为基于 Aquila 优化器的密集网络集合（EDAO）的新方法。EDAO 专为提高多标签学习器的精确度和多样性而设计。与传统的多标签方法不同，EDAO 着重强调在多标签场景中提高模型的多样性和准确性。为了评估我们方法的有效性，我们在七个不同的数据集上进行了实验，包括情感、酒店、电影、蛋白质、汽车、医疗、新闻和鸟类。我们的初始策略包括建立一个预处理机制，以获得精确而精细的数据。随后，我们使用带有词袋（BoW）的 Vader 工具进行特征提取。在第三阶段，我们使用 word2vec 方法创建词关联。改进后的数据还用于训练和测试 DenseNet 模型，并使用 Aquila 优化器 (AO) 对其进行了微调。在新闻、情感、汽车、鸟类、电影、酒店、蛋白质和医疗数据集上，利用基于方面的多重标记技术，我们使用 DenseNet-AO 实现的准确率分别为 95%、97% 和 96%。我们提出的模型表明，在不同维度的多标签数据集上，EDAO 的表现优于其他标准方法。我们通过实验结果严格验证了所实施的策略，并展示了它与现有基准方法相比的有效性。

{"title":"An Efficient Optimized DenseNet Model for Aspect-Based Multi-Label Classification","authors":"N. Ayub, Tayyaba, Saddam Hussain, Syed Sajid Ullah, Jawaid Iqbal","doi":"10.3390/a16120548","DOIUrl":"https://doi.org/10.3390/a16120548","url":null,"abstract":"Sentiment analysis holds great importance within the domain of natural language processing as it examines both the expressed and underlying emotions conveyed through review content. Furthermore, researchers have discovered that relying solely on the overall sentiment derived from the textual content is inadequate. Consequently, sentiment analysis was developed to extract nuanced expressions from textual information. One of the challenges in this field is effectively extracting emotional elements using multi-label data that covers various aspects. This article presents a novel approach called the Ensemble of DenseNet based on Aquila Optimizer (EDAO). EDAO is specifically designed to enhance the precision and diversity of multi-label learners. Unlike traditional multi-label methods, EDAO strongly emphasizes improving model diversity and accuracy in multi-label scenarios. To evaluate the effectiveness of our approach, we conducted experiments on seven distinct datasets, including emotions, hotels, movies, proteins, automobiles, medical, news, and birds. Our initial strategy involves establishing a preprocessing mechanism to obtain precise and refined data. Subsequently, we used the Vader tool with Bag of Words (BoW) for feature extraction. In the third stage, we created word associations using the word2vec method. The improved data were also used to train and test the DenseNet model, which was fine-tuned using the Aquila Optimizer (AO). On the news, emotion, auto, bird, movie, hotel, protein, and medical datasets, utilizing the aspect-based multi-labeling technique, we achieved accuracy rates of 95%, 97%, and 96%, respectively, with DenseNet-AO. Our proposed model demonstrates that EDAO outperforms other standard methods across various multi-label datasets with different dimensions. The implemented strategy has been rigorously validated through experimental results, showcasing its effectiveness compared to existing benchmark approaches.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"55 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139224015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Wind Turbine Predictive Fault Diagnostics Based on a Novel Long Short-Term Memory Model 基于新型长短期记忆模型的风力涡轮机预测性故障诊断技术

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-11-28 DOI: 10.3390/a16120546

Shuo Zhang, Emma Robinson, Malabika Basu

The operation and maintenance (O&M) issues of offshore wind turbines (WTs) are more challenging because of the harsh operational environment and hard accessibility. As sudden component failures within WTs bring about durable downtimes and significant revenue losses, condition monitoring and predictive fault diagnostic approaches must be developed to detect faults before they occur, thus preventing durable downtimes and costly unplanned maintenance. Based primarily on supervisory control and data acquisition (SCADA) data, thirty-three weighty features from operational data are extracted, and eight specific faults are categorised for fault predictions from status information. By providing a model-agnostic vector representation for time, Time2Vec (T2V), into Long Short-Term Memory (LSTM), this paper develops a novel deep-learning neural network model, T2V-LSTM, conducting multi-level fault predictions. The classification steps allow fault diagnosis from 10 to 210 min prior to faults. The results show that T2V-LSTM can successfully predict over 84.97% of faults and outperform LSTM and other counterparts in both overall and individual fault predictions due to its topmost recall scores in most multistep-ahead cases performed. Thus, the proposed T2V-LSTM can correctly diagnose more faults and upgrade the predictive performances based on vanilla LSTM in terms of accuracy, recall scores, and F-scores.

海上风力涡轮机（WTs）的运行和维护（O&M）问题因其恶劣的运行环境和难以接近而更具挑战性。由于风力涡轮机内的突发性部件故障会造成长期停机和重大收入损失，因此必须开发状态监测和预测性故障诊断方法，以便在故障发生前检测出故障，从而防止出现长期停机和代价高昂的计划外维护。主要基于监控和数据采集（SCADA）数据，从运行数据中提取了 33 个重要特征，并根据状态信息对 8 个特定故障进行了分类，以便进行故障预测。通过在长短时记忆（LSTM）中提供与模型无关的时间向量表示法 Time2Vec (T2V)，本文开发了一种新型深度学习神经网络模型 T2V-LSTM，用于进行多级故障预测。分类步骤可在故障发生前 10 至 210 分钟内进行故障诊断。结果表明，T2V-LSTM 可以成功预测 84.97% 以上的故障，并且在大多数多步骤先行案例中，T2V-LSTM 的召回分数最高，因此在整体和单个故障预测方面均优于 LSTM 和其他同类产品。因此，所提出的 T2V-LSTM 可以正确诊断更多的故障，并在准确率、召回分数和 F 分数方面提升了基于 vanilla LSTM 的预测性能。

{"title":"Wind Turbine Predictive Fault Diagnostics Based on a Novel Long Short-Term Memory Model","authors":"Shuo Zhang, Emma Robinson, Malabika Basu","doi":"10.3390/a16120546","DOIUrl":"https://doi.org/10.3390/a16120546","url":null,"abstract":"The operation and maintenance (O&M) issues of offshore wind turbines (WTs) are more challenging because of the harsh operational environment and hard accessibility. As sudden component failures within WTs bring about durable downtimes and significant revenue losses, condition monitoring and predictive fault diagnostic approaches must be developed to detect faults before they occur, thus preventing durable downtimes and costly unplanned maintenance. Based primarily on supervisory control and data acquisition (SCADA) data, thirty-three weighty features from operational data are extracted, and eight specific faults are categorised for fault predictions from status information. By providing a model-agnostic vector representation for time, Time2Vec (T2V), into Long Short-Term Memory (LSTM), this paper develops a novel deep-learning neural network model, T2V-LSTM, conducting multi-level fault predictions. The classification steps allow fault diagnosis from 10 to 210 min prior to faults. The results show that T2V-LSTM can successfully predict over 84.97% of faults and outperform LSTM and other counterparts in both overall and individual fault predictions due to its topmost recall scores in most multistep-ahead cases performed. Thus, the proposed T2V-LSTM can correctly diagnose more faults and upgrade the predictive performances based on vanilla LSTM in terms of accuracy, recall scores, and F-scores.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"16 3 1","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139227221","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Measuring the Performance of Ant Colony Optimization Algorithms for the Dynamic Traveling Salesman Problem 衡量动态旅行推销员问题蚁群优化算法的性能

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-11-28 DOI: 10.3390/a16120545

Michalis Mavrovouniotis, Maria N. Anastasiadou, D. Hadjimitsis

Ant colony optimization (ACO) has proven its adaptation capabilities on optimization problems with dynamic environments. In this work, the dynamic traveling salesman problem (DTSP) is used as the base problem to generate dynamic test cases. Two types of dynamic changes for the DTSP are considered: (1) node changes and (2) weight changes. In the experiments, ACO algorithms are systematically compared in different DTSP test cases. Statistical tests are performed using the arithmetic mean and standard deviation of ACO algorithms, which is the standard method of comparing ACO algorithms. To complement the comparisons, the quantiles of the distribution are also used to measure the peak-, average-, and bad-case performance of ACO algorithms. The experimental results demonstrate some advantages of using quantiles for evaluating the performance of ACO algorithms in some DTSP test cases.

蚁群优化（ACO）已经证明了其在动态环境优化问题上的适应能力。在这项工作中，动态旅行推销员问题（DTSP）被用作生成动态测试案例的基础问题。DTSP 考虑了两种动态变化：(1) 节点变化和 (2) 权重变化。在实验中，ACO 算法在不同的 DTSP 测试用例中进行了系统比较。统计测试使用 ACO 算法的算术平均数和标准差进行，这是比较 ACO 算法的标准方法。为了补充比较，还使用了分布的定量值来衡量 ACO 算法的峰值、平均值和坏情况性能。实验结果表明，在一些 DTSP 测试案例中，使用量化值评估 ACO 算法的性能具有一定的优势。

引用次数: 0