Algorithms最新文献_第4页

Vision-Based Concrete-Crack Detection on Railway Sleepers Using Dense U-Net Model 使用密集 U-Net 模型对铁路枕木上的混凝土裂缝进行基于视觉的检测

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-15 DOI: 10.3390/a16120568

M. Khan, Seong-Hoon Kee, A. Nahid

Crack inspection in railway sleepers is crucial for ensuring rail safety and avoiding deadly accidents. Traditional methods for detecting cracks on railway sleepers are very time-consuming and lack efficiency. Therefore, nowadays, researchers are paying attention to vision-based algorithms, especially Deep Learning algorithms. In this work, we adopted the U-net for the first time for detecting cracks on a railway sleeper and proposed a modified U-net architecture named Dense U-net for segmenting the cracks. In the Dense U-net structure, we established several short connections between the encoder and decoder blocks, which enabled the architecture to obtain better pixel information flow. Thus, the model extracted the necessary information in more detail to predict the cracks. We collected images from railway sleepers, processed them in a dataset, and finally trained the model with the images. The model achieved an overall F1-score, precision, Recall, and IoU of 86.5%, 88.53%, 84.63%, and 76.31%, respectively. We compared our suggested model with the original U-net, and the results demonstrate that our model performed better than the U-net in both quantitative and qualitative results. Moreover, we considered the necessity of crack severity analysis and measured a few parameters of the cracks. The engineers must know the severity of the cracks to have an idea about the most severe locations and take the necessary steps to repair the badly affected sleepers.

铁路枕木的裂缝检测对于确保铁路安全和避免致命事故至关重要。传统的铁轨枕木裂缝检测方法耗时长、效率低。因此，如今研究人员开始关注基于视觉的算法，尤其是深度学习算法。在这项工作中，我们首次采用了 U-net 来检测铁路枕木上的裂缝，并提出了一种名为 Dense U-net 的改进型 U-net 结构来分割裂缝。在 Dense U-net 结构中，我们在编码器和解码器块之间建立了多个短连接，这使得该结构能够获得更好的像素信息流。这样，模型就能更详细地提取必要的信息来预测裂缝。我们收集了铁轨枕木的图像，并将其处理为数据集，最后利用这些图像对模型进行了训练。模型的总体 F1 分数、精确度、召回率和 IoU 分别达到了 86.5%、88.53%、84.63% 和 76.31%。我们将建议的模型与原始 U-net 进行了比较，结果表明我们的模型在定量和定性结果上都优于 U-net。此外，我们还考虑了裂缝严重性分析的必要性，并测量了裂缝的一些参数。工程师必须知道裂缝的严重程度，以便了解最严重的位置，并采取必要措施修复受严重影响的枕木。

{"title":"Vision-Based Concrete-Crack Detection on Railway Sleepers Using Dense U-Net Model","authors":"M. Khan, Seong-Hoon Kee, A. Nahid","doi":"10.3390/a16120568","DOIUrl":"https://doi.org/10.3390/a16120568","url":null,"abstract":"Crack inspection in railway sleepers is crucial for ensuring rail safety and avoiding deadly accidents. Traditional methods for detecting cracks on railway sleepers are very time-consuming and lack efficiency. Therefore, nowadays, researchers are paying attention to vision-based algorithms, especially Deep Learning algorithms. In this work, we adopted the U-net for the first time for detecting cracks on a railway sleeper and proposed a modified U-net architecture named Dense U-net for segmenting the cracks. In the Dense U-net structure, we established several short connections between the encoder and decoder blocks, which enabled the architecture to obtain better pixel information flow. Thus, the model extracted the necessary information in more detail to predict the cracks. We collected images from railway sleepers, processed them in a dataset, and finally trained the model with the images. The model achieved an overall F1-score, precision, Recall, and IoU of 86.5%, 88.53%, 84.63%, and 76.31%, respectively. We compared our suggested model with the original U-net, and the results demonstrate that our model performed better than the U-net in both quantitative and qualitative results. Moreover, we considered the necessity of crack severity analysis and measured a few parameters of the cracks. The engineers must know the severity of the cracks to have an idea about the most severe locations and take the necessary steps to repair the badly affected sleepers.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"14 17","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139001298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deep Learning Based on EfficientNet for Multiorgan Segmentation of Thoracic Structures on a 0.35 T MR-Linac Radiation Therapy System 基于 EfficientNet 的深度学习在 0.35 T MR-Linac 放射治疗系统上对胸腔结构进行多器官分割

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-12 DOI: 10.3390/a16120564

Mohammed Chekroun, Youssef Mourchid, Igor Bessières, Alain Lalande

The advent of the 0.35 T MR-Linac (MRIdian, ViewRay) system in radiation therapy allows precise tumor targeting for moving lesions. However, the lack of an automatic volume segmentation function in the MR-Linac’s treatment planning system poses a challenge. In this paper, we propose a deep-learning-based multiorgan segmentation approach for the thoracic region, using EfficientNet as the backbone for the network architecture. The objectives of this approach include accurate segmentation of critical organs, such as the left and right lungs, the heart, the spinal cord, and the esophagus, essential for minimizing radiation toxicity during external radiation therapy. Our proposed approach, when evaluated on an internal dataset comprising 81 patients, demonstrated superior performance compared to other state-of-the-art methods. Specifically, the results for our approach with a 2.5D strategy were as follows: a dice similarity coefficient (DSC) of 0.820 ± 0.041, an intersection over union (IoU) of 0.725 ± 0.052, and a 3D Hausdorff distance (HD) of 10.353 ± 4.974 mm. Notably, the 2.5D strategy surpassed the 2D strategy in all three metrics, exhibiting higher DSC and IoU values, as well as lower HD values. This improvement strongly suggests that our proposed approach with the 2.5D strategy may hold promise in achieving more precise and accurate segmentations when compared to the conventional 2D strategy. Our work has practical implications in the improvement of treatment planning precision, aligning with the evolution of medical imaging and innovative strategies for multiorgan segmentation tasks.

0.35 T MR-Linac（MRIdian，ViewRay）系统在放射治疗中的出现，使移动病灶的肿瘤靶向更加精确。然而，MR-Linac 的治疗计划系统缺乏自动体积分割功能，这给我们带来了挑战。在本文中，我们提出了一种基于深度学习的胸腔区域多器官分割方法，使用 EfficientNet 作为网络架构的骨干。该方法的目标包括准确分割关键器官，如左肺和右肺、心脏、脊髓和食道，这对于在体外放射治疗过程中最大限度地减少辐射毒性至关重要。我们提出的方法在由 81 名患者组成的内部数据集上进行了评估，与其他最先进的方法相比，表现出了卓越的性能。具体来说，我们采用 2.5D 策略的方法的结果如下：骰子相似系数 (DSC) 为 0.820 ± 0.041，交集大于联合 (IoU) 为 0.725 ± 0.052，三维豪斯多夫距离 (HD) 为 10.353 ± 4.974 毫米。值得注意的是，2.5D 策略在所有三个指标上都超过了 2D 策略，表现出更高的 DSC 值和 IoU 值，以及更低的 HD 值。这一改进有力地表明，与传统的 2D 策略相比，我们提出的 2.5D 策略有望实现更精确、更准确的分割。我们的工作对提高治疗规划的精确度具有实际意义，符合医学成像的发展和多器官分割任务的创新策略。

{"title":"Deep Learning Based on EfficientNet for Multiorgan Segmentation of Thoracic Structures on a 0.35 T MR-Linac Radiation Therapy System","authors":"Mohammed Chekroun, Youssef Mourchid, Igor Bessières, Alain Lalande","doi":"10.3390/a16120564","DOIUrl":"https://doi.org/10.3390/a16120564","url":null,"abstract":"The advent of the 0.35 T MR-Linac (MRIdian, ViewRay) system in radiation therapy allows precise tumor targeting for moving lesions. However, the lack of an automatic volume segmentation function in the MR-Linac’s treatment planning system poses a challenge. In this paper, we propose a deep-learning-based multiorgan segmentation approach for the thoracic region, using EfficientNet as the backbone for the network architecture. The objectives of this approach include accurate segmentation of critical organs, such as the left and right lungs, the heart, the spinal cord, and the esophagus, essential for minimizing radiation toxicity during external radiation therapy. Our proposed approach, when evaluated on an internal dataset comprising 81 patients, demonstrated superior performance compared to other state-of-the-art methods. Specifically, the results for our approach with a 2.5D strategy were as follows: a dice similarity coefficient (DSC) of 0.820 ± 0.041, an intersection over union (IoU) of 0.725 ± 0.052, and a 3D Hausdorff distance (HD) of 10.353 ± 4.974 mm. Notably, the 2.5D strategy surpassed the 2D strategy in all three metrics, exhibiting higher DSC and IoU values, as well as lower HD values. This improvement strongly suggests that our proposed approach with the 2.5D strategy may hold promise in achieving more precise and accurate segmentations when compared to the conventional 2D strategy. Our work has practical implications in the improvement of treatment planning precision, aligning with the evolution of medical imaging and innovative strategies for multiorgan segmentation tasks.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"10 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139009640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Robustness of Single- and Dual-Energy Deep-Learning-Based Scatter Correction Models on Simulated and Real Chest X-rays 基于单能量和双能量深度学习的散射校正模型在模拟和真实胸部 X 射线上的鲁棒性

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-12 DOI: 10.3390/a16120565

Clara Freijo, Joaquin L. Herraiz, F. Arias-Valcayo, Paula Ibáñez, Gabriela Moreno, A. Villa-Abaunza, José Manuel Udías

Chest X-rays (CXRs) represent the first tool globally employed to detect cardiopulmonary pathologies. These acquisitions are highly affected by scattered photons due to the large field of view required. Scatter in CXRs introduces background in the images, which reduces their contrast. We developed three deep-learning-based models to estimate and correct scatter contribution to CXRs. We used a Monte Carlo (MC) ray-tracing model to simulate CXRs from human models obtained from CT scans using different configurations (depending on the availability of dual-energy acquisitions). The simulated CXRs contained the separated contribution of direct and scattered X-rays in the detector. These simulated datasets were then used as the reference for the supervised training of several NNs. Three NN models (single and dual energy) were trained with the MultiResUNet architecture. The performance of the NN models was evaluated on CXRs obtained, with an MC code, from chest CT scans of patients affected by COVID-19. The results show that the NN models were able to estimate and correct the scatter contribution to CXRs with an error of <5%, being robust to variations in the simulation setup and improving contrast in soft tissue. The single-energy model was tested on real CXRs, providing robust estimations of the scatter-corrected CXRs.

胸部 X 射线（CXR）是全球用于检测心肺病变的第一种工具。由于需要较大的视野，这些采集受到散射光子的影响很大。CXR 中的散射会在图像中引入背景，从而降低图像的对比度。我们开发了三种基于深度学习的模型来估计和纠正 CXR 的散射。我们使用蒙特卡洛（Monte Carlo，MC）射线追踪模型模拟从使用不同配置（取决于是否有双能量采集）的 CT 扫描中获得的人体模型的 CXR。模拟的 CXR 包含探测器中直接 X 射线和散射 X 射线的分离贡献。然后，这些模拟数据集被用作多个 NN 的监督训练参考。使用 MultiResUNet 架构训练了三个 NN 模型（单能量和双能量）。使用 MC 代码对 COVID-19 患者胸部 CT 扫描获得的 CXR 对 NN 模型的性能进行了评估。结果表明，NN 模型能够估算和纠正 CXR 的散射贡献，误差小于 5%，对模拟设置的变化具有鲁棒性，并能改善软组织的对比度。单能量模型在真实 CXR 上进行了测试，对散射校正后的 CXR 进行了稳健的估计。

{"title":"Robustness of Single- and Dual-Energy Deep-Learning-Based Scatter Correction Models on Simulated and Real Chest X-rays","authors":"Clara Freijo, Joaquin L. Herraiz, F. Arias-Valcayo, Paula Ibáñez, Gabriela Moreno, A. Villa-Abaunza, José Manuel Udías","doi":"10.3390/a16120565","DOIUrl":"https://doi.org/10.3390/a16120565","url":null,"abstract":"Chest X-rays (CXRs) represent the first tool globally employed to detect cardiopulmonary pathologies. These acquisitions are highly affected by scattered photons due to the large field of view required. Scatter in CXRs introduces background in the images, which reduces their contrast. We developed three deep-learning-based models to estimate and correct scatter contribution to CXRs. We used a Monte Carlo (MC) ray-tracing model to simulate CXRs from human models obtained from CT scans using different configurations (depending on the availability of dual-energy acquisitions). The simulated CXRs contained the separated contribution of direct and scattered X-rays in the detector. These simulated datasets were then used as the reference for the supervised training of several NNs. Three NN models (single and dual energy) were trained with the MultiResUNet architecture. The performance of the NN models was evaluated on CXRs obtained, with an MC code, from chest CT scans of patients affected by COVID-19. The results show that the NN models were able to estimate and correct the scatter contribution to CXRs with an error of <5%, being robust to variations in the simulation setup and improving contrast in soft tissue. The single-energy model was tested on real CXRs, providing robust estimations of the scatter-corrected CXRs.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"51 7","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139007147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Predicting Pedestrian Trajectories with Deep Adversarial Networks Considering Motion and Spatial Information 利用考虑运动和空间信息的深度对抗网络预测行人轨迹

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-12 DOI: 10.3390/a16120566

Liming Lao, Dangkui Du, Pengzhan Chen

This paper proposes a novel prediction model termed the social and spatial attentive generative adversarial network (SSA-GAN). The SSA-GAN framework utilizes a generative approach, where the generator employs social attention mechanisms to accurately model social interactions among pedestrians. Unlike previous methodologies, our model utilizes comprehensive motion features as query vectors, significantly enhancing predictive performance. Additionally, spatial attention is integrated to encapsulate the interactions between pedestrians and their spatial context through semantic spatial features. Moreover, we present a novel approach for generating simulated multi-trajectory datasets using the CARLA simulator. This method circumvents the limitations inherent in existing public datasets such as UCY and ETH, particularly when evaluating multi-trajectory metrics. Our experimental findings substantiate the efficacy of the proposed SSA-GAN model in capturing the nuances of pedestrian interactions and providing accurate multimodal trajectory predictions.

本文提出了一种新颖的预测模型，称为社会和空间注意力生成对抗网络（SSA-GAN）。SSA-GAN 框架采用生成式方法，生成器利用社会关注机制来准确模拟行人之间的社会互动。与以往的方法不同，我们的模型利用综合运动特征作为查询向量，大大提高了预测性能。此外，我们还整合了空间注意力，通过语义空间特征来概括行人之间的互动及其空间环境。此外，我们还提出了一种利用 CARLA 模拟器生成模拟多轨迹数据集的新方法。这种方法规避了 UCY 和 ETH 等现有公共数据集固有的局限性，尤其是在评估多轨迹指标时。我们的实验结果证明了所提出的 SSA-GAN 模型在捕捉行人互动的细微差别和提供准确的多模态轨迹预测方面的功效。

引用次数: 0

On the Influence of Data Imbalance on Supervised Gaussian Mixture Models 论数据失衡对有监督高斯混合模型的影响

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-11 DOI: 10.3390/a16120563

Luca Scrucca

Imbalanced data present a pervasive challenge in many real-world applications of statistical and machine learning, where the instances of one class significantly outnumber those of the other. This paper examines the impact of class imbalance on the performance of Gaussian mixture models in classification tasks and establishes the need for a strategy to reduce the adverse effects of imbalanced data on the accuracy and reliability of classification outcomes. We explore various strategies to address this problem, including cost-sensitive learning, threshold adjustments, and sampling-based techniques. Through extensive experiments on synthetic and real-world datasets, we evaluate the effectiveness of these methods. Our findings emphasize the need for effective mitigation strategies for class imbalance in supervised Gaussian mixtures, offering valuable insights for practitioners and researchers in improving classification outcomes.

在统计和机器学习的许多实际应用中，不平衡数据是一个普遍存在的挑战，在这种情况下，一类实例的数量明显多于另一类实例。本文研究了类不平衡对高斯混合模型在分类任务中的性能的影响，并确定需要一种策略来减少不平衡数据对分类结果的准确性和可靠性的不利影响。我们探索了解决这一问题的各种策略，包括成本敏感学习、阈值调整和基于采样的技术。通过在合成数据集和真实数据集上进行大量实验，我们评估了这些方法的有效性。我们的研究结果强调了在有监督的高斯混合物中有效缓解类不平衡策略的必要性，为从业人员和研究人员改进分类结果提供了宝贵的见解。

引用次数: 0

Time-Dependent Unavailability Exploration of Interconnected Urban Power Grid and Communication Network 互联城市电网和通信网络随时间变化的不可用性探索

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-10 DOI: 10.3390/a16120561

Matěj Vrtal, R. Fujdiak, Jan Benedikt, P. Praks, R. Briš, Michal Ptacek, Petr Toman

This paper presents a time-dependent reliability analysis created for a critical energy infrastructure use case, which consists of an interconnected urban power grid and a communication network. By utilizing expert knowledge from the energy and communication sectors and integrating the renewal theory of multi-component systems, a representative reliability model of this interconnected energy infrastructure, based on real network located in the Czech Republic, is established. This model assumes reparable and non-reparable components and captures the topology of the interconnected infrastructure and reliability characteristics of both the power grid and the communication network. Moreover, a time-dependent reliability assessment of the interconnected system is provided. One of the significant outputs of this research is the identification of the critical components of the interconnected network and their interdependencies by the directed acyclic graph. Numerical results indicate that the original design has an unacceptable large unavailability. Thus, to improve the reliability of the interconnected system, a slightly modified design, in which only a limited number of components in the system are modified to keep the additional costs of the improved design limited, is proposed. Consequently, numerical results indicate reducing the unavailability of the improved interconnected system in comparison with the initial reliability design. The proposed unavailability exploration strategy is general and can bring a valuable reliability improvement in the power and communication sectors.

本文介绍了一种针对关键能源基础设施使用案例的随时间变化的可靠性分析，该使用案例由相互连接的城市电网和通信网络组成。通过利用能源和通信领域的专家知识，并结合多组件系统的更新理论，以捷克共和国的实际网络为基础，建立了该互联能源基础设施的代表性可靠性模型。该模型假定了可修复和不可修复的组件，并捕捉到了互联基础设施的拓扑结构以及电网和通信网络的可靠性特征。此外，还对互联系统的可靠性进行了随时间变化的评估。这项研究的重要成果之一是通过有向无环图确定了互联网络的关键组件及其相互依存关系。数值结果表明，原始设计具有不可接受的高不可用性。因此，为了提高互联系统的可靠性，提出了一种略有改动的设计，即只对系统中有限的几个组件进行改动，以限制改进设计所带来的额外成本。因此，数值结果表明，与最初的可靠性设计相比，改进后互联系统的不可用性有所降低。所提出的不可用性探索策略具有普遍性，可为电力和通信领域带来有价值的可靠性改进。

{"title":"Time-Dependent Unavailability Exploration of Interconnected Urban Power Grid and Communication Network","authors":"Matěj Vrtal, R. Fujdiak, Jan Benedikt, P. Praks, R. Briš, Michal Ptacek, Petr Toman","doi":"10.3390/a16120561","DOIUrl":"https://doi.org/10.3390/a16120561","url":null,"abstract":"This paper presents a time-dependent reliability analysis created for a critical energy infrastructure use case, which consists of an interconnected urban power grid and a communication network. By utilizing expert knowledge from the energy and communication sectors and integrating the renewal theory of multi-component systems, a representative reliability model of this interconnected energy infrastructure, based on real network located in the Czech Republic, is established. This model assumes reparable and non-reparable components and captures the topology of the interconnected infrastructure and reliability characteristics of both the power grid and the communication network. Moreover, a time-dependent reliability assessment of the interconnected system is provided. One of the significant outputs of this research is the identification of the critical components of the interconnected network and their interdependencies by the directed acyclic graph. Numerical results indicate that the original design has an unacceptable large unavailability. Thus, to improve the reliability of the interconnected system, a slightly modified design, in which only a limited number of components in the system are modified to keep the additional costs of the improved design limited, is proposed. Consequently, numerical results indicate reducing the unavailability of the improved interconnected system in comparison with the initial reliability design. The proposed unavailability exploration strategy is general and can bring a valuable reliability improvement in the power and communication sectors.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"918 ","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138982694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Blood Cell Revolution: Unveiling 11 Distinct Types with ‘Naturalize’ Augmentation 血细胞革命：利用 "归化 "增强技术揭示 11 种不同类型的血细胞

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-10 DOI: 10.3390/a16120562

Mohamad Abou Ali, F. Dornaika, Ignacio Arganda-Carreras

Artificial intelligence (AI) has emerged as a cutting-edge tool, simultaneously accelerating, securing, and enhancing the diagnosis and treatment of patients. An exemplification of this capability is evident in the analysis of peripheral blood smears (PBS). In university medical centers, hematologists routinely examine hundreds of PBS slides daily to validate or correct outcomes produced by advanced hematology analyzers assessing samples from potentially problematic patients. This process may logically lead to erroneous PBC readings, posing risks to patient health. AI functions as a transformative tool, significantly improving the accuracy and precision of readings and diagnoses. This study reshapes the parameters of blood cell classification, harnessing the capabilities of AI and broadening the scope from 5 to 11 specific blood cell categories with the challenging 11-class PBC dataset. This transformation facilitates a more profound exploration of blood cell diversity, surpassing prior constraints in medical image analysis. Our approach combines state-of-the-art deep learning techniques, including pre-trained ConvNets, ViTb16 models, and custom CNN architectures. We employ transfer learning, fine-tuning, and ensemble strategies, such as CBAM and Averaging ensembles, to achieve unprecedented accuracy and interpretability. Our fully fine-tuned EfficientNetV2 B0 model sets a new standard, with a macro-average precision, recall, and F1-score of 91%, 90%, and 90%, respectively, and an average accuracy of 93%. This breakthrough underscores the transformative potential of 11-class blood cell classification for more precise medical diagnoses. Moreover, our groundbreaking “Naturalize” augmentation technique produces remarkable results. The 2K-PBC dataset generated with “Naturalize” boasts a macro-average precision, recall, and F1-score of 97%, along with an average accuracy of 96% when leveraging the fully fine-tuned EfficientNetV2 B0 model. This innovation not only elevates classification performance but also addresses data scarcity and bias in medical deep learning. Our research marks a paradigm shift in blood cell classification, enabling more nuanced and insightful medical analyses. The “Naturalize” technique’s impact extends beyond blood cell classification, emphasizing the vital role of diverse and comprehensive datasets in advancing healthcare applications through deep learning.

人工智能（AI）已成为一种尖端工具，可同时加快、确保和加强对病人的诊断和治疗。外周血涂片（PBS）分析就是这种能力的一个例证。在大学医疗中心，血液学专家每天都要例行检查数百张 PBS 切片，以验证或纠正先进的血液分析仪在评估可能有问题的病人样本时产生的结果。从逻辑上讲，这一过程可能会导致 PBC 读数错误，给患者健康带来风险。人工智能是一种变革性工具，可显著提高读数和诊断的准确性和精确度。这项研究重塑了血细胞分类的参数，利用人工智能的能力，通过具有挑战性的 11 类 PBC 数据集，将特定血细胞类别从 5 类扩大到 11 类。这种转变有助于对血细胞多样性进行更深入的探索，超越了医学图像分析中以往的限制。我们的方法结合了最先进的深度学习技术，包括预训练 ConvNets、ViTb16 模型和定制 CNN 架构。我们采用迁移学习、微调和集合策略（如 CBAM 和平均集合）来实现前所未有的准确性和可解释性。我们的完全微调 EfficientNetV2 B0 模型树立了新的标准，其宏观平均精度、召回率和 F1 分数分别达到 91%、90% 和 90%，平均准确率达到 93%。这一突破凸显了 11 级血细胞分类在更精确医疗诊断方面的变革潜力。此外，我们开创性的 "Naturalize "增强技术也取得了显著效果。利用 "Naturalize "技术生成的 2K-PBC 数据集的宏观平均精确度、召回率和 F1 分数均达到 97%，而利用经过全面微调的 EfficientNetV2 B0 模型生成的数据集的平均精确度则达到 96%。这一创新不仅提高了分类性能，还解决了医学深度学习中的数据稀缺和偏差问题。我们的研究标志着血细胞分类领域的范式转变，使医学分析更加细致入微、更具洞察力。Naturalize "技术的影响超越了血细胞分类，强调了多样化和全面的数据集在通过深度学习推进医疗应用方面的重要作用。

{"title":"Blood Cell Revolution: Unveiling 11 Distinct Types with ‘Naturalize’ Augmentation","authors":"Mohamad Abou Ali, F. Dornaika, Ignacio Arganda-Carreras","doi":"10.3390/a16120562","DOIUrl":"https://doi.org/10.3390/a16120562","url":null,"abstract":"Artificial intelligence (AI) has emerged as a cutting-edge tool, simultaneously accelerating, securing, and enhancing the diagnosis and treatment of patients. An exemplification of this capability is evident in the analysis of peripheral blood smears (PBS). In university medical centers, hematologists routinely examine hundreds of PBS slides daily to validate or correct outcomes produced by advanced hematology analyzers assessing samples from potentially problematic patients. This process may logically lead to erroneous PBC readings, posing risks to patient health. AI functions as a transformative tool, significantly improving the accuracy and precision of readings and diagnoses. This study reshapes the parameters of blood cell classification, harnessing the capabilities of AI and broadening the scope from 5 to 11 specific blood cell categories with the challenging 11-class PBC dataset. This transformation facilitates a more profound exploration of blood cell diversity, surpassing prior constraints in medical image analysis. Our approach combines state-of-the-art deep learning techniques, including pre-trained ConvNets, ViTb16 models, and custom CNN architectures. We employ transfer learning, fine-tuning, and ensemble strategies, such as CBAM and Averaging ensembles, to achieve unprecedented accuracy and interpretability. Our fully fine-tuned EfficientNetV2 B0 model sets a new standard, with a macro-average precision, recall, and F1-score of 91%, 90%, and 90%, respectively, and an average accuracy of 93%. This breakthrough underscores the transformative potential of 11-class blood cell classification for more precise medical diagnoses. Moreover, our groundbreaking “Naturalize” augmentation technique produces remarkable results. The 2K-PBC dataset generated with “Naturalize” boasts a macro-average precision, recall, and F1-score of 97%, along with an average accuracy of 96% when leveraging the fully fine-tuned EfficientNetV2 B0 model. This innovation not only elevates classification performance but also addresses data scarcity and bias in medical deep learning. Our research marks a paradigm shift in blood cell classification, enabling more nuanced and insightful medical analyses. The “Naturalize” technique’s impact extends beyond blood cell classification, emphasizing the vital role of diverse and comprehensive datasets in advancing healthcare applications through deep learning.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"1001 ","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138982798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Stereo 3D Object Detection Using a Feature Attention Module 使用特征注意模块进行立体 3D 物体检测

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-07 DOI: 10.3390/a16120560

Kexin Zhao, Rui Jiang, Jun He

Stereo 3D object detection remains a crucial challenge within the realm of 3D vision. In the pursuit of enhancing stereo 3D object detection, feature fusion has emerged as a potent strategy. However, the design of the feature fusion module and the determination of pivotal features in this fusion process remain critical. This paper proposes a novel feature attention module tailored for stereo 3D object detection. Serving as a pivotal element for feature fusion, this module not only discerns feature importance but also facilitates informed enhancements based on its conclusions. This study delved into the various facets aided by the feature attention module. Firstly, a interpretability analysis was conducted concerning the function of the image segmentation methods. Secondly, we explored the augmentation of the feature fusion module through a category reweighting strategy. Lastly, we investigated global feature fusion methods and model compression strategies. The models devised through our proposed design underwent an effective analysis, yielding commendable performance, especially in small object detection within the pedestrian category.

立体三维目标检测仍然是三维视觉领域的一个关键挑战。在追求增强立体三维目标检测，特征融合已成为一种有效的策略。然而，特征融合模块的设计和融合过程中关键特征的确定仍然至关重要。提出了一种针对立体三维目标检测的特征注意模块。作为特征融合的关键元素，该模块不仅可以识别特征的重要性，还可以根据其结论促进知情增强。本研究在特征注意模块的帮助下深入研究了各个方面。首先，对图像分割方法的功能进行了可解释性分析。其次，我们探索了通过类别重加权策略增强特征融合模块。最后，研究了全局特征融合方法和模型压缩策略。通过我们提出的设计设计的模型进行了有效的分析，产生了值得称赞的性能，特别是在行人类别的小物体检测方面。

引用次数: 0

Construction of Two-Derivative Runge–Kutta Methods of Order Six 构建六阶二衍 Runge-Kutta 方法

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-06 DOI: 10.3390/a16120558

Z. Kalogiratou, T. Monovasilis

Two-Derivative Runge–Kutta methods have been proposed by Chan and Tsai in 2010 and order conditions up to the fifth order are given. In this work, for the first time, we derive order conditions for order six. Simplifying assumptions that reduce the number of order conditions are also given. The procedure for constructing sixth-order methods is presented. A specific method is derived in order to illustrate the procedure; this method is of the sixth algebraic order with a reduced phase-lag and amplification error. For numerical comparison, five well-known test problems have been solved using a seventh-order Two-Derivative Runge–Kutta method developed by Chan and Tsai and several Runge–Kutta methods of orders 6 and 8. Diagrams of the maximum absolute error vs. computation time show the efficiency of the new method.

Chan和Tsai在2010年提出了二阶龙格-库塔方法，并给出了五阶以下的阶条件。在这项工作中，我们首次导出了六阶方程的有序条件。简化假设，减少订购条件的数量也给出了。给出了构造六阶方法的步骤。为了说明该过程，推导了一个具体的方法;该方法是六阶代数阶，具有较低的相位滞后和放大误差。为了进行数值比较，用Chan和Tsai开发的七阶二阶龙格-库塔方法和几种6阶和8阶龙格-库塔方法解决了五个著名的测试问题。最大绝对误差与计算时间的关系图表明了新方法的有效性。

引用次数: 0

An Efficient Closed-Form Formula for Evaluating r-Flip Moves in Quadratic Unconstrained Binary Optimization 在二次无约束二元优化中评估 r 翻转移动的高效闭式公式

IF 2.3 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Algorithms

Pub Date : 2023-12-05 DOI: 10.3390/a16120557

B. Alidaee, Haibo Wang, L. Sua

Quadratic unconstrained binary optimization (QUBO) is a classic NP-hard problem with an enormous number of applications. Local search strategy (LSS) is one of the most fundamental algorithmic concepts and has been successfully applied to a wide range of hard combinatorial optimization problems. One LSS that has gained the attention of researchers is the r-flip (also known as r-Opt) strategy. Given a binary solution with n variables, the r-flip strategy “flips” r binary variables to obtain a new solution if the changes improve the objective function. The main purpose of this paper is to develop several results for the implementation of r-flip moves in QUBO, including a necessary and sufficient condition that when a 1-flip search reaches local optimality, the number of candidates for implementation of the r-flip moves can be reduced significantly. The results of the substantial computational experiments are reported to compare an r-flip strategy-embedded algorithm and a multiple start tabu search algorithm on a set of benchmark instances and three very-large-scale QUBO instances. The r-flip strategy implemented within the algorithm makes the algorithm very efficient, leading to very high-quality solutions within a short CPU time.

二次型无约束二元优化(QUBO)是一个典型的NP-hard问题，具有大量的应用。局部搜索策略(LSS)是最基本的算法概念之一，已成功地应用于各种复杂的组合优化问题。一种获得研究人员关注的LSS是r-flip(也称为r-Opt)策略。给定一个有n个变量的二元解，r-flip策略“翻转”r个二元变量以获得一个新的解，如果这些变化改善了目标函数。本文的主要目的是开发几个在QUBO中实现r-flip移动的结果，包括当1-flip搜索达到局部最优时，实现r-flip移动的候选数可以显着减少的充分必要条件。在一组基准实例和三个非常大规模的QUBO实例上进行了大量的计算实验，比较了嵌入r-flip策略的算法和多开始禁忌搜索算法。算法中实现的r-flip策略使得算法非常高效，在较短的CPU时间内产生非常高质量的解决方案。

{"title":"An Efficient Closed-Form Formula for Evaluating r-Flip Moves in Quadratic Unconstrained Binary Optimization","authors":"B. Alidaee, Haibo Wang, L. Sua","doi":"10.3390/a16120557","DOIUrl":"https://doi.org/10.3390/a16120557","url":null,"abstract":"Quadratic unconstrained binary optimization (QUBO) is a classic NP-hard problem with an enormous number of applications. Local search strategy (LSS) is one of the most fundamental algorithmic concepts and has been successfully applied to a wide range of hard combinatorial optimization problems. One LSS that has gained the attention of researchers is the r-flip (also known as r-Opt) strategy. Given a binary solution with n variables, the r-flip strategy “flips” r binary variables to obtain a new solution if the changes improve the objective function. The main purpose of this paper is to develop several results for the implementation of r-flip moves in QUBO, including a necessary and sufficient condition that when a 1-flip search reaches local optimality, the number of candidates for implementation of the r-flip moves can be reduced significantly. The results of the substantial computational experiments are reported to compare an r-flip strategy-embedded algorithm and a multiple start tabu search algorithm on a set of benchmark instances and three very-large-scale QUBO instances. The r-flip strategy implemented within the algorithm makes the algorithm very efficient, leading to very high-quality solutions within a short CPU time.","PeriodicalId":7636,"journal":{"name":"Algorithms","volume":"121 11","pages":""},"PeriodicalIF":2.3,"publicationDate":"2023-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138599511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0