首页 > 最新文献

Journal of Imaging最新文献

英文 中文
Accelerating Point Cloud Computation via Memory in Embedded Structured Light Cameras. 基于内存的嵌入式结构光相机加速点云计算。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-21 DOI: 10.3390/jimaging12020091
Yanan Zhang, Shikang Meng, Shijie Wang, Yaheng Ren

Embedded structured light cameras have been widely applied in various fields. However, due to constraints such as insufficient computing resources, it remains difficult to achieve high-speed structured light point cloud computation. To address this issue, this study proposes a memory-driven computational framework for accelerating point cloud computation. Specifically, the point cloud computation process is precomputed as much as possible and stored in memory in the form of parameters, thereby significantly reducing the computational load during actual point cloud computation. The framework is instantiated in two forms: a low-memory method that minimizes memory footprint at the expense of point cloud stability, and a high-memory method that preserves the nonlinear phase-distance relation via an extensive lookup table. Experimental evaluations demonstrate that the proposed methods achieve comparable accuracy to the conventional method while delivering substantial speedups, and data-format optimizations further reduce required bandwidth. This framework offers a generalizable paradigm for optimizing structured light pipelines, paving the way for enhanced real-time 3D sensing in embedded applications.

嵌入式结构光摄像机已广泛应用于各个领域。然而,由于计算资源不足等限制,高速结构光点云计算仍然难以实现。为了解决这个问题,本研究提出了一个内存驱动的计算框架来加速点云计算。具体来说,点云计算过程尽可能地预先计算,并以参数的形式存储在内存中,从而大大减少了实际点云计算时的计算负荷。该框架以两种形式实例化:以牺牲点云稳定性为代价最小化内存占用的低内存方法,以及通过广泛查找表保留非线性相位距离关系的高内存方法。实验评估表明,所提出的方法在提供大量加速的同时达到了与传统方法相当的精度,并且数据格式优化进一步降低了所需的带宽。该框架为优化结构光管道提供了一个可推广的范例,为嵌入式应用中增强的实时3D传感铺平了道路。
{"title":"Accelerating Point Cloud Computation via Memory in Embedded Structured Light Cameras.","authors":"Yanan Zhang, Shikang Meng, Shijie Wang, Yaheng Ren","doi":"10.3390/jimaging12020091","DOIUrl":"10.3390/jimaging12020091","url":null,"abstract":"<p><p>Embedded structured light cameras have been widely applied in various fields. However, due to constraints such as insufficient computing resources, it remains difficult to achieve high-speed structured light point cloud computation. To address this issue, this study proposes a memory-driven computational framework for accelerating point cloud computation. Specifically, the point cloud computation process is precomputed as much as possible and stored in memory in the form of parameters, thereby significantly reducing the computational load during actual point cloud computation. The framework is instantiated in two forms: a low-memory method that minimizes memory footprint at the expense of point cloud stability, and a high-memory method that preserves the nonlinear phase-distance relation via an extensive lookup table. Experimental evaluations demonstrate that the proposed methods achieve comparable accuracy to the conventional method while delivering substantial speedups, and data-format optimizations further reduce required bandwidth. This framework offers a generalizable paradigm for optimizing structured light pipelines, paving the way for enhanced real-time 3D sensing in embedded applications.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12941603/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of the Surrounding Rock Based on Image Processing Analysis and Transfer Learning. 基于图像处理分析和迁移学习的围岩分类。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-19 DOI: 10.3390/jimaging12020089
Yanyun Fan, Jiaqi Zhu, Hua Luo, Yaxi Shen, Shuanglong Wang, Xiaoning Liu, Dong Li, Chuhan Deng

Currently, standardized classification methods of surrounding rock are relatively insufficient. The classification of surrounding rock mainly relies on the subjective judgment of technicians, leading to diverse evaluation results. This study focuses on the feature extraction and classification methods of surrounding rock images in a certain tunnel of the Central Yunnan Water Diversion Project by using image processing analysis and transfer learning. Rich surrounding rock images and the water conservancy tunnel data are collected, and then the surrounding rock is classified relatively accurately according to the code and expert guidance. By introducing the fractal theory, the complexity and irregularity of the spatial distribution of weak layers and joints on the surrounding rock surface are revealed effectively. Based on the analysis of changes in fractal dimension characteristic values, a classification method for surrounding rock based on the fractal theory is proposed. Combined with the quantified parameters of surrounding rock images and the strength data collected by rebound meters, a method for correcting the surrounding rock strength based on image analysis is proposed, which can effectively solve the error caused by the uneven distribution of rock masses in the traditional rebound meter strength values. After correction, more accurate strength characteristics can be obtained, which is conducive to the standardized classification of the surrounding rock. After studying the recognition of tunnel surrounding rock images with transfer learning, a model is constructed to achieve rapid classification of tunnel surrounding rock. This research provides support for the standardized classification of tunnel surrounding rock.

目前,规范的围岩分类方法相对不足。围岩的分类主要依靠技术人员的主观判断,导致评价结果参差不齐。本研究主要针对滇中引水工程某隧道围岩图像进行图像处理分析和迁移学习的特征提取与分类方法。采集丰富的围岩图像和水利隧道数据,根据规范和专家指导对围岩进行相对准确的分类。通过引入分形理论,有效地揭示了围岩表面软弱层和节理空间分布的复杂性和不规则性。在分析分形维数特征值变化的基础上,提出了一种基于分形理论的围岩分类方法。结合围岩图像的量化参数和回弹仪采集的强度数据,提出了一种基于图像分析的围岩强度校正方法,有效解决了传统回弹仪强度值因岩体分布不均匀造成的误差。校正后可以得到更准确的强度特征,有利于对围岩进行标准化分类。在研究了隧道围岩图像的迁移学习识别方法的基础上,建立了隧道围岩快速分类模型。该研究为隧道围岩的标准化分类提供了依据。
{"title":"Classification of the Surrounding Rock Based on Image Processing Analysis and Transfer Learning.","authors":"Yanyun Fan, Jiaqi Zhu, Hua Luo, Yaxi Shen, Shuanglong Wang, Xiaoning Liu, Dong Li, Chuhan Deng","doi":"10.3390/jimaging12020089","DOIUrl":"10.3390/jimaging12020089","url":null,"abstract":"<p><p>Currently, standardized classification methods of surrounding rock are relatively insufficient. The classification of surrounding rock mainly relies on the subjective judgment of technicians, leading to diverse evaluation results. This study focuses on the feature extraction and classification methods of surrounding rock images in a certain tunnel of the Central Yunnan Water Diversion Project by using image processing analysis and transfer learning. Rich surrounding rock images and the water conservancy tunnel data are collected, and then the surrounding rock is classified relatively accurately according to the code and expert guidance. By introducing the fractal theory, the complexity and irregularity of the spatial distribution of weak layers and joints on the surrounding rock surface are revealed effectively. Based on the analysis of changes in fractal dimension characteristic values, a classification method for surrounding rock based on the fractal theory is proposed. Combined with the quantified parameters of surrounding rock images and the strength data collected by rebound meters, a method for correcting the surrounding rock strength based on image analysis is proposed, which can effectively solve the error caused by the uneven distribution of rock masses in the traditional rebound meter strength values. After correction, more accurate strength characteristics can be obtained, which is conducive to the standardized classification of the surrounding rock. After studying the recognition of tunnel surrounding rock images with transfer learning, a model is constructed to achieve rapid classification of tunnel surrounding rock. This research provides support for the standardized classification of tunnel surrounding rock.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12941806/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291244","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MDF2Former: Multi-Scale Dual-Domain Feature Fusion Transformer for Hyperspectral Image Classification of Bacteria in Murine Wounds. MDF2Former:用于小鼠伤口细菌高光谱图像分类的多尺度双域特征融合变压器。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-19 DOI: 10.3390/jimaging12020090
Decheng Wu, Wendan Liu, Rui Li, Xudong Fu, Lin Tao, Yinli Tian, Anqiang Zhang, Zhen Wang, Hao Tang

Bacterial wound infection poses a major challenge in trauma care and can lead to severe complications such as sepsis and organ failure. Therefore, rapid and accurate identification of the pathogen, along with targeted intervention, is of vital importance for improving treatment outcomes and reducing risks. However, current detection methods are still constrained by procedural complexity and long processing times. In this study, a hyperspectral imaging (HSI) acquisition system for bacterial analysis and a multi-scale dual-domain feature fusion transformer (MDF2Former) were developed for classifying wound bacteria. MDF2Former integrates three modules: a multi-scale feature enhancement and fusion module that generates tokens with multi-scale discriminative representations, a spatial-spectral dual-branch attention module that strengthens joint feature modeling, and a frequency and spatial-spectral domain encoding module that captures global and local interactions among tokens through a hierarchical stacking structure, thereby enabling more efficient feature learning. Extensive experiments on our self-constructed HSI dataset of typical wound bacteria demonstrate that MDF2Former achieved outstanding performance across five metrics: Accuracy (91.94%), Precision (92.26%), Recall (91.94%), F1-score (92.01%), and Kappa coefficient (90.73%), surpassing all comparative models. These results have verified the effectiveness of combining HSI with deep learning for bacterial identification, and have highlighted its potential in assisting in the identification of bacterial species and making personalized treatment decisions for wound infections.

细菌性伤口感染是创伤护理的主要挑战,可导致严重的并发症,如败血症和器官衰竭。因此,快速准确地识别病原体,并进行有针对性的干预,对于改善治疗效果和降低风险至关重要。然而,目前的检测方法仍然受到程序复杂性和处理时间长的限制。本研究开发了用于细菌分析的高光谱成像(HSI)采集系统和用于伤口细菌分类的多尺度双域特征融合变压器(MDF2Former)。MDF2Former集成了三个模块:一个多尺度特征增强与融合模块,生成具有多尺度判别表征的token;一个空间-频谱双分支关注模块,加强联合特征建模;一个频率和空间-频谱域编码模块,通过分层堆叠结构捕获token之间的全局和局部交互,从而实现更高效的特征学习。在我们自建的典型伤口细菌HSI数据集上进行的大量实验表明,MDF2Former在准确率(91.94%)、精确度(92.26%)、召回率(91.94%)、f1评分(92.01%)和Kappa系数(90.73%)五个指标上都取得了出色的表现,超过了所有的比较模型。这些结果验证了将HSI与深度学习相结合用于细菌鉴定的有效性,并强调了其在协助鉴定细菌种类和制定伤口感染个性化治疗决策方面的潜力。
{"title":"MDF2Former: Multi-Scale Dual-Domain Feature Fusion Transformer for Hyperspectral Image Classification of Bacteria in Murine Wounds.","authors":"Decheng Wu, Wendan Liu, Rui Li, Xudong Fu, Lin Tao, Yinli Tian, Anqiang Zhang, Zhen Wang, Hao Tang","doi":"10.3390/jimaging12020090","DOIUrl":"10.3390/jimaging12020090","url":null,"abstract":"<p><p>Bacterial wound infection poses a major challenge in trauma care and can lead to severe complications such as sepsis and organ failure. Therefore, rapid and accurate identification of the pathogen, along with targeted intervention, is of vital importance for improving treatment outcomes and reducing risks. However, current detection methods are still constrained by procedural complexity and long processing times. In this study, a hyperspectral imaging (HSI) acquisition system for bacterial analysis and a multi-scale dual-domain feature fusion transformer (MDF2Former) were developed for classifying wound bacteria. MDF2Former integrates three modules: a multi-scale feature enhancement and fusion module that generates tokens with multi-scale discriminative representations, a spatial-spectral dual-branch attention module that strengthens joint feature modeling, and a frequency and spatial-spectral domain encoding module that captures global and local interactions among tokens through a hierarchical stacking structure, thereby enabling more efficient feature learning. Extensive experiments on our self-constructed HSI dataset of typical wound bacteria demonstrate that MDF2Former achieved outstanding performance across five metrics: Accuracy (91.94%), Precision (92.26%), Recall (91.94%), F1-score (92.01%), and Kappa coefficient (90.73%), surpassing all comparative models. These results have verified the effectiveness of combining HSI with deep learning for bacterial identification, and have highlighted its potential in assisting in the identification of bacterial species and making personalized treatment decisions for wound infections.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12942589/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291277","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Compactness Quantitative Metrics for Wrist Bone on Conventional Radiography in Rheumatoid Arthritis: A Clinical Evaluation Study. 类风湿性关节炎常规x线摄影腕骨自动紧致度定量指标:临床评价研究。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-18 DOI: 10.3390/jimaging12020087
Jiajing Zhou, Junmu Peng, Haolin Wang, Hiroshi Kataoka, Masaya Mukai, Tunlada Wiriyanukhroh, Tamotsu Kamishima

Rheumatoid arthritis (RA) frequently affects the joints of the hands, with joint space narrowing (JSN) representing an important early marker of structural damage. The semi-quantitative Sharp/van der Heijde (SvdH) scoring system is widely used in clinical practice but is inherently subjective and susceptible to observer variability. Moreover, the complex anatomy of the wrist and substantial overlap of carpal bones pose challenges for automated quantitative assessment of wrist JSN on routine radiographs. This study aimed to introduce a novel quantitative assessment perspective and to clinically validate an automated, compactness-related quantification framework for evaluating wrist JSN in RA. This study initially enrolled 51 patients with RA. After excluding one case with severe carpal fusion that precluded anatomical differentiation, 50 patients (44 females and 6 males) were included in the final analysis. The cohort had a mean age of 61 years (range: 21-82), a median symptom duration of 9 years (IQR: 1-32), and a median follow-up interval for bilateral hand radiographs of 1.06 years (IQR: 0.82-1.30). To quantify global wrist JSN, 10 compactness-related metrics were computed based on the spatial distribution of bone centroids extracted from carpal segmentation masks. These metrics were validated against the wrist JSN subscore of the SvdH score (SvdH-JSN_wrist) and the total Sharp score (TSS) as gold standards. Several distance-based metrics among the compactness-related metrics showed significant negative correlations with the wrist joint space narrowing subscore of the Sharp/van der Heijde score (SvdH-JSN_wrist). Specifically, mean-pairwise-distance (MPD), root-mean-square-radius (RMSR), and median-radius (R50) showed moderate to strong correlations (r = -0.52 to -0.63, all p≤0.0001) that were consistent at BL and FU. Correlations with TSS were weaker overall, with only R50 and its normalized form showing stable negative correlations (r = -0.40 to -0.43, p < 0.01). Longitudinal analyses showed limited correlations between metric changes and clinical score changes. The proposed automated compactness quantification framework enables objective and reliable assessment of wrist JSN on standard radiographs and complements conventional scoring systems by supporting automated and standardized evaluation of RA-related wrist structural changes.

类风湿性关节炎(RA)经常影响手部关节,关节间隙狭窄(JSN)是结构损伤的重要早期标志。半定量Sharp/van der Heijde (SvdH)评分系统广泛应用于临床实践,但其固有的主观性和易受观察者可变性的影响。此外,腕部复杂的解剖结构和腕骨的大量重叠为常规x线片腕部JSN的自动定量评估带来了挑战。本研究旨在引入一种新的定量评估视角,并在临床验证一种用于评估RA腕部JSN的自动化、紧凑性相关量化框架。这项研究最初招募了51名RA患者。在排除1例严重腕骨融合妨碍解剖分化后,50例患者(女性44例,男性6例)被纳入最终分析。该队列的平均年龄为61岁(范围:21-82岁),症状持续时间中位数为9年(IQR: 1-32),双侧手部x线片随访时间中位数为1.06年(IQR: 0.82-1.30)。为了量化全局腕部JSN,基于从腕关节分割掩模中提取的骨质心的空间分布,计算10个紧致度相关指标。这些指标根据SvdH评分(SvdH- jsn_wrist)的手腕JSN子评分和总夏普评分(TSS)作为金标准进行验证。在紧致度相关指标中,几个基于距离的指标与Sharp/van der Heijde评分(SvdH-JSN_wrist)的腕关节间隙缩小亚评分呈显著负相关。具体而言,平均两两距离(MPD)、均方根半径(RMSR)和中位半径(R50)显示出中等至强的相关性(r = -0.52至-0.63,均p≤0.0001),与BL和FU一致。与TSS的相关性总体较弱,只有R50与其归一化形式呈稳定的负相关(r = -0.40 ~ -0.43, p < 0.01)。纵向分析显示度量变化与临床评分变化之间的相关性有限。提出的自动化紧凑度量化框架能够客观可靠地评估标准x线片腕部JSN,并通过支持ra相关腕部结构变化的自动化和标准化评估,补充了传统评分系统。
{"title":"Automated Compactness Quantitative Metrics for Wrist Bone on Conventional Radiography in Rheumatoid Arthritis: A Clinical Evaluation Study.","authors":"Jiajing Zhou, Junmu Peng, Haolin Wang, Hiroshi Kataoka, Masaya Mukai, Tunlada Wiriyanukhroh, Tamotsu Kamishima","doi":"10.3390/jimaging12020087","DOIUrl":"10.3390/jimaging12020087","url":null,"abstract":"<p><p>Rheumatoid arthritis (RA) frequently affects the joints of the hands, with joint space narrowing (JSN) representing an important early marker of structural damage. The semi-quantitative Sharp/van der Heijde (SvdH) scoring system is widely used in clinical practice but is inherently subjective and susceptible to observer variability. Moreover, the complex anatomy of the wrist and substantial overlap of carpal bones pose challenges for automated quantitative assessment of wrist JSN on routine radiographs. This study aimed to introduce a novel quantitative assessment perspective and to clinically validate an automated, compactness-related quantification framework for evaluating wrist JSN in RA. This study initially enrolled 51 patients with RA. After excluding one case with severe carpal fusion that precluded anatomical differentiation, 50 patients (44 females and 6 males) were included in the final analysis. The cohort had a mean age of 61 years (range: 21-82), a median symptom duration of 9 years (IQR: 1-32), and a median follow-up interval for bilateral hand radiographs of 1.06 years (IQR: 0.82-1.30). To quantify global wrist JSN, 10 compactness-related metrics were computed based on the spatial distribution of bone centroids extracted from carpal segmentation masks. These metrics were validated against the wrist JSN subscore of the SvdH score (SvdH-JSN_wrist) and the total Sharp score (TSS) as gold standards. Several distance-based metrics among the compactness-related metrics showed significant negative correlations with the wrist joint space narrowing subscore of the Sharp/van der Heijde score (SvdH-JSN_wrist). Specifically, mean-pairwise-distance (MPD), root-mean-square-radius (RMSR), and median-radius (R50) showed moderate to strong correlations (r = -0.52 to -0.63, all p≤0.0001) that were consistent at BL and FU. Correlations with TSS were weaker overall, with only R50 and its normalized form showing stable negative correlations (r = -0.40 to -0.43, <i>p</i> < 0.01). Longitudinal analyses showed limited correlations between metric changes and clinical score changes. The proposed automated compactness quantification framework enables objective and reliable assessment of wrist JSN on standard radiographs and complements conventional scoring systems by supporting automated and standardized evaluation of RA-related wrist structural changes.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12941718/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291279","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of Biological Images and Quantitative Monitoring Using Deep Learning and Computer Vision. 使用深度学习和计算机视觉的生物图像分析和定量监测。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-18 DOI: 10.3390/jimaging12020088
Aaron Gálvez-Salido, Francisca Robles, Rodrigo J Gonçalves, Roberto de la Herrán, Carmelo Ruiz Rejón, Rafael Navajas-Pérez

Automated biological counting is essential for scaling wildlife monitoring and biodiversity assessments, as manual processing currently limits analytical effort and scalability. This review evaluates the integration of deep learning and computer vision across diverse acquisition platforms, including camera traps, unmanned aerial vehicles (UAVs), and remote sensing. Methodological paradigms ranging from Convolutional Neural Networks (CNNs) and one-stage detectors like You Only Look Once (YOLO) to recent transformer-based architectures and hybrid models are examined. The literature shows that these methods consistently achieve high accuracy-often exceeding 95%-across various taxa, including insect pests, aquatic organisms, terrestrial vegetation, and forest ecosystems. However, persistent challenges such as object occlusion, cryptic species differentiation, and the scarcity of high-quality, labeled datasets continue to hinder fully automated workflows. We conclude that while automated counting has fundamentally increased data throughput, future advancements must focus on enhancing model generalization through self-supervised learning and improved data augmentation techniques. These developments are critical for transitioning from experimental models to robust, operational tools for global ecological monitoring and conservation efforts.

自动生物计数对于扩大野生动物监测和生物多样性评估至关重要,因为人工处理目前限制了分析工作和可扩展性。这篇综述评估了深度学习和计算机视觉在不同采集平台上的集成,包括相机陷阱、无人机(uav)和遥感。从卷积神经网络(cnn)和一级检测器(如You Only Look Once (YOLO))到最近基于变压器的架构和混合模型的方法范例进行了研究。文献表明,这些方法在包括害虫、水生生物、陆地植被和森林生态系统在内的各种分类群中始终保持着很高的准确性,通常超过95%。然而,诸如物体遮挡、隐物种分化以及高质量标记数据集的稀缺性等持续存在的挑战继续阻碍完全自动化的工作流程。我们的结论是,虽然自动计数从根本上提高了数据吞吐量,但未来的进步必须集中在通过自监督学习和改进的数据增强技术来增强模型泛化。这些发展对于从实验模型转变为全球生态监测和保护工作的强大、可操作工具至关重要。
{"title":"Analysis of Biological Images and Quantitative Monitoring Using Deep Learning and Computer Vision.","authors":"Aaron Gálvez-Salido, Francisca Robles, Rodrigo J Gonçalves, Roberto de la Herrán, Carmelo Ruiz Rejón, Rafael Navajas-Pérez","doi":"10.3390/jimaging12020088","DOIUrl":"10.3390/jimaging12020088","url":null,"abstract":"<p><p>Automated biological counting is essential for scaling wildlife monitoring and biodiversity assessments, as manual processing currently limits analytical effort and scalability. This review evaluates the integration of deep learning and computer vision across diverse acquisition platforms, including camera traps, unmanned aerial vehicles (UAVs), and remote sensing. Methodological paradigms ranging from Convolutional Neural Networks (CNNs) and one-stage detectors like You Only Look Once (YOLO) to recent transformer-based architectures and hybrid models are examined. The literature shows that these methods consistently achieve high accuracy-often exceeding 95%-across various taxa, including insect pests, aquatic organisms, terrestrial vegetation, and forest ecosystems. However, persistent challenges such as object occlusion, cryptic species differentiation, and the scarcity of high-quality, labeled datasets continue to hinder fully automated workflows. We conclude that while automated counting has fundamentally increased data throughput, future advancements must focus on enhancing model generalization through self-supervised learning and improved data augmentation techniques. These developments are critical for transitioning from experimental models to robust, operational tools for global ecological monitoring and conservation efforts.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12941886/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Print Quality Assessment of QR Code Elements Achieved by the Digital Thermal Transfer Process. 数字热转印工艺实现的二维码元素打印质量评估。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-18 DOI: 10.3390/jimaging12020086
Igor Majnarić, Marija Jelkić, Marko Morić, Krunoslav Hajdek

The new European Regulation (EU) 2025/40 includes provisions on modern packaging and packaging waste. It defines the use of image QR codes on packaging (items 71 and 161) and in personal documents, making line barcodes a thing of the past. The definition of a QR code is precisely specified in ISO/IEC 18004:2024. However, their implementation in printing systems is not specified and remains an important factor for their future application. Digital foil printing is a completely new hybrid printing process for applying information to highly precise applications such as QR codes, security printing, and packaging printing. The technique is characterized by a combination of two printing techniques: drop-on-demand UV inkjet followed by thermal transfer of black foil. Using a matte-coated printing substrate (Garda Matt, 300 g/m2), Konica Minolta KM1024 LHE Inkjet head settings, and a transfer temperature of 100 °C, the size of the square printing elements in QR codes plays a decisive role in the quality of the decoded information. The aim of this work is to investigate the possibility of realizing the basic elements of the QR code image (the profile of square elements and the success of realizing a precisely defined surface) with a variation in the thickness of the UV varnish coating (7, 14 and 21 µm), realized using the MGI JETvarnish 3DS digital machine. The most commonly used rectangular elements with a surface area of 0.01 cm2 were tested: 0.06 cm2, 0.25 cm2, 1 cm2, 4 cm2, and 16 cm2. The results showed that the imprint quality is uneven for the smallest elements (square elements with base lengths of 0.1 cm and 0.25 cm). The effect is especially visible with a minimum UV varnish application of 7 μm (1 drop). By increasing the amount of UV varnish and the application thickness to 14 μm (2 drops) and 21 μm (3 drops), respectively, a significantly more stable, even reproduction of the achromatic image is achieved. The highest technical precision was achieved with a UV varnish thickness of 21 μm.

新的欧洲法规(EU) 2025/40包括关于现代包装和包装废弃物的规定。它规定了在包装(第71和161项)和个人文件中使用图像QR码,使直线条形码成为过去。QR码的定义在ISO/IEC 18004:2024中有精确的规定。然而,它们在印刷系统中的实现并没有具体说明,这仍然是它们未来应用的一个重要因素。数字箔印刷是一种全新的混合印刷工艺,用于将信息应用于高度精确的应用,如QR码,安全印刷和包装印刷。该技术的特点是结合了两种印刷技术:按需UV喷墨,然后是黑色箔的热转印。使用哑光涂层印刷基材(Garda Matt, 300 g/m2),柯尼卡美能达KM1024 LHE喷墨头设置,转移温度为100°C, QR码中方形印刷元素的大小对解码信息的质量起着决定性作用。这项工作的目的是研究利用MGI JETvarnish 3DS数字机器实现UV清漆涂层厚度(7,14和21 μ m)的变化来实现QR码图像的基本元素(正方形元素的轮廓和成功实现精确定义的表面)的可能性。测试了表面积为0.01 cm2的最常用矩形单元:0.06 cm2、0.25 cm2、1 cm2、4 cm2和16 cm2。结果表明,最小单元(基长为0.1 cm和0.25 cm的方形单元)的压印质量不均匀;当UV清漆最小用量为7 μm(1滴)时,效果尤其明显。通过将UV清漆的用量和应用厚度分别增加到14 μm(2滴)和21 μm(3滴),可以实现更加稳定,均匀的消色差图像再现。当UV清漆厚度为21 μm时,达到了最高的技术精度。
{"title":"Print Quality Assessment of QR Code Elements Achieved by the Digital Thermal Transfer Process.","authors":"Igor Majnarić, Marija Jelkić, Marko Morić, Krunoslav Hajdek","doi":"10.3390/jimaging12020086","DOIUrl":"10.3390/jimaging12020086","url":null,"abstract":"<p><p>The new European Regulation (EU) 2025/40 includes provisions on modern packaging and packaging waste. It defines the use of image QR codes on packaging (items 71 and 161) and in personal documents, making line barcodes a thing of the past. The definition of a QR code is precisely specified in ISO/IEC 18004:2024. However, their implementation in printing systems is not specified and remains an important factor for their future application. Digital foil printing is a completely new hybrid printing process for applying information to highly precise applications such as QR codes, security printing, and packaging printing. The technique is characterized by a combination of two printing techniques: drop-on-demand UV inkjet followed by thermal transfer of black foil. Using a matte-coated printing substrate (Garda Matt, 300 g/m<sup>2</sup>), Konica Minolta KM1024 LHE Inkjet head settings, and a transfer temperature of 100 °C, the size of the square printing elements in QR codes plays a decisive role in the quality of the decoded information. The aim of this work is to investigate the possibility of realizing the basic elements of the QR code image (the profile of square elements and the success of realizing a precisely defined surface) with a variation in the thickness of the UV varnish coating (7, 14 and 21 µm), realized using the MGI JETvarnish 3DS digital machine. The most commonly used rectangular elements with a surface area of 0.01 cm<sup>2</sup> were tested: 0.06 cm<sup>2</sup>, 0.25 cm<sup>2</sup>, 1 cm<sup>2</sup>, 4 cm<sup>2</sup>, and 16 cm<sup>2</sup>. The results showed that the imprint quality is uneven for the smallest elements (square elements with base lengths of 0.1 cm and 0.25 cm). The effect is especially visible with a minimum UV varnish application of 7 μm (1 drop). By increasing the amount of UV varnish and the application thickness to 14 μm (2 drops) and 21 μm (3 drops), respectively, a significantly more stable, even reproduction of the achromatic image is achieved. The highest technical precision was achieved with a UV varnish thickness of 21 μm.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12942380/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SREF: Semantics-Refined Feature Extraction for Long-Term Visual Localization. 面向长期视觉定位的语义精炼特征提取。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-18 DOI: 10.3390/jimaging12020085
Danfeng Wu, Kaifeng Zhu, Heng Shi, Fenfen Zhou, Minchi Kuang

Accurate and robust visual localization under changing environments remains a fundamental challenge in autonomous driving and mobile robotics. Traditional handcrafted features often degrade under long-term illumination and viewpoint variations, while recent CNN-based methods, although more robust, typically rely on coarse semantic cues and remain vulnerable to dynamic objects. In this paper, we propose a fine-grained semantics-guided feature extraction framework that adaptively selects stable keypoints while suppressing dynamic disturbances. A fine-grained semantic refinement module subdivides coarse semantic categories into stability-homogeneous sub-classes, and a dual-attention mechanism enhances local repeatability and semantic consistency. By integrating physical priors with self-supervised clustering, the proposed framework learns discriminative and reliable feature representations. Extensive experiments on the Aachen and RobotCar-Seasons benchmarks demonstrate that the proposed approach achieves state-of-the-art accuracy and robustness while maintaining real-time efficiency, effectively bridging coarse semantic guidance with fine-grained stability estimation. Quantitatively, our method achieves strong localization performance on Aachen (up to 88.1% at night under the (0.2°,0.25 m) threshold) and on RobotCar-Seasons (up to 57.2%/28.4% under the same threshold for day/night), demonstrating improved robustness to seasonal and illumination changes.

在不断变化的环境下,准确和鲁棒的视觉定位仍然是自动驾驶和移动机器人的基本挑战。传统的手工特征在长期光照和视点变化下往往会退化,而最近基于cnn的方法虽然更健壮,但通常依赖于粗糙的语义线索,并且容易受到动态对象的影响。在本文中,我们提出了一种细粒度语义引导的特征提取框架,该框架可以自适应地选择稳定的关键点,同时抑制动态干扰。细粒度语义细化模块将粗语义类别细分为稳定同构的子类,双注意机制增强了局部可重复性和语义一致性。通过将物理先验与自监督聚类相结合,该框架学习了判别和可靠的特征表示。在Aachen和RobotCar-Seasons基准测试上的大量实验表明,所提出的方法在保持实时效率的同时,实现了最先进的准确性和鲁棒性,有效地将粗语义指导与细粒度稳定性估计相结合。定量地,我们的方法在Aachen(在夜间(0.2°,0.25 m)阈值下达到了88.1%)和RobotCar-Seasons(在白天/夜晚相同阈值下达到57.2%/28.4%)上取得了很强的定位性能,证明了对季节和光照变化的鲁棒性提高。
{"title":"SREF: Semantics-Refined Feature Extraction for Long-Term Visual Localization.","authors":"Danfeng Wu, Kaifeng Zhu, Heng Shi, Fenfen Zhou, Minchi Kuang","doi":"10.3390/jimaging12020085","DOIUrl":"10.3390/jimaging12020085","url":null,"abstract":"<p><p>Accurate and robust visual localization under changing environments remains a fundamental challenge in autonomous driving and mobile robotics. Traditional handcrafted features often degrade under long-term illumination and viewpoint variations, while recent CNN-based methods, although more robust, typically rely on coarse semantic cues and remain vulnerable to dynamic objects. In this paper, we propose a fine-grained semantics-guided feature extraction framework that adaptively selects stable keypoints while suppressing dynamic disturbances. A fine-grained semantic refinement module subdivides coarse semantic categories into stability-homogeneous sub-classes, and a dual-attention mechanism enhances local repeatability and semantic consistency. By integrating physical priors with self-supervised clustering, the proposed framework learns discriminative and reliable feature representations. Extensive experiments on the Aachen and RobotCar-Seasons benchmarks demonstrate that the proposed approach achieves state-of-the-art accuracy and robustness while maintaining real-time efficiency, effectively bridging coarse semantic guidance with fine-grained stability estimation. Quantitatively, our method achieves strong localization performance on Aachen (up to 88.1% at night under the (0.2°,0.25 m) threshold) and on RobotCar-Seasons (up to 57.2%/28.4% under the same threshold for day/night), demonstrating improved robustness to seasonal and illumination changes.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12941875/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LEGS: Visual Localization Enhanced by 3D Gaussian Splatting. LEGS:通过3D高斯飞溅增强视觉定位。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-16 DOI: 10.3390/jimaging12020084
Daewoon Kim, I-Gil Kim

Accurate six-degree-of-freedom (6-DoF) visual localization is a fundamental component for modern mapping and navigation. While recent data-centric approaches have leveraged Novel View Synthesis (NVS) to augment training datasets, these methods typically rely on uniform grid-based sampling of virtual cameras. Such naive placement often yields redundant or weakly informative views, failing to effectively bridge the gap between sparse, unordered captures and dense scene geometry. To address these challenges, we present LEGS (Visual Localization Enhanced by 3D Gaussian Splatting), a trajectory-agnostic synthetic-view augmentation framework. LEGS constructs a joint set of 6-DoF camera pose proposals by integrating a coarse 3D lattice with the Structure-from-Motion (SfM) camera graph, followed by a visibility-aware, coverage-driven selection strategy. By utilizing 3D Gaussian Splatting (3DGS), our framework enables high-throughput, scene-specific synthesis within practical computational budgets. Experiments on standard benchmarks and an in-house dataset demonstrate that LEGS consistently improves pose accuracy and robustness, particularly in scenarios characterized by sparse sampling and co-located viewpoints.

精确的六自由度视觉定位是现代测绘和导航的基本组成部分。虽然最近以数据为中心的方法利用新颖视图合成(NVS)来增强训练数据集,但这些方法通常依赖于基于统一网格的虚拟相机采样。这种天真的放置通常会产生冗余或弱信息视图,无法有效地弥合稀疏,无序捕获和密集场景几何之间的差距。为了解决这些挑战,我们提出了LEGS (3D高斯飞溅增强视觉定位),这是一个与轨迹无关的合成视图增强框架。LEGS通过将粗糙的3D晶格与运动结构(SfM)相机图相结合,构建了一组联合的6自由度相机姿态建议,然后采用可见性感知、覆盖驱动的选择策略。通过利用3D高斯喷溅(3DGS),我们的框架可以在实际的计算预算内实现高吞吐量,特定场景的合成。在标准基准测试和内部数据集上的实验表明,LEGS持续提高姿态精度和鲁棒性,特别是在稀疏采样和共定位视点的场景中。
{"title":"LEGS: Visual Localization Enhanced by 3D Gaussian Splatting.","authors":"Daewoon Kim, I-Gil Kim","doi":"10.3390/jimaging12020084","DOIUrl":"10.3390/jimaging12020084","url":null,"abstract":"<p><p>Accurate six-degree-of-freedom (6-DoF) visual localization is a fundamental component for modern mapping and navigation. While recent data-centric approaches have leveraged Novel View Synthesis (NVS) to augment training datasets, these methods typically rely on uniform grid-based sampling of virtual cameras. Such naive placement often yields redundant or weakly informative views, failing to effectively bridge the gap between sparse, unordered captures and dense scene geometry. To address these challenges, we present LEGS (Visual <b>L</b>ocalization <b>E</b>nhanced by 3D <b>G</b>aussian <b>S</b>platting), a trajectory-agnostic synthetic-view augmentation framework. LEGS constructs a joint set of 6-DoF camera pose proposals by integrating a coarse 3D lattice with the Structure-from-Motion (SfM) camera graph, followed by a visibility-aware, coverage-driven selection strategy. By utilizing 3D Gaussian Splatting (3DGS), our framework enables high-throughput, scene-specific synthesis within practical computational budgets. Experiments on standard benchmarks and an in-house dataset demonstrate that LEGS consistently improves pose accuracy and robustness, particularly in scenarios characterized by sparse sampling and co-located viewpoints.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12941419/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research Progress on the Application of Radiomics and Deep Learning in Liver Fibrosis. 放射组学和深度学习在肝纤维化中的应用研究进展。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-15 DOI: 10.3390/jimaging12020082
Yi Dang, Wenjing Li, Zhao Liu, Junqiang Lei

Liver fibrosis (LF) represents a crucial intermediate stage in the pathological progression from chronic liver disease to cirrhosis and hepatocellular carcinoma. Early and accurate diagnosis is of vital importance for the intervention treatment of diseases and the improvement of prognosis. Traditional liver biopsy, long regarded as the diagnostic gold standard, remains associated with several notable limitations such as invasiveness, sampling errors and inter-observer variability. Lately, as artificial intelligence (AI) technology progresses swiftly, radiomics and deep learning (DL) have risen to prominence as non-invasive diagnostic instruments, showing significant potential in the LF diagnostic evaluation. This review summarizes the latest advancements in radiomics and DL for LF diagnosis, staging, prognosis prediction and etiological differentiation. It also analyzes the application value of multimodal imaging modalities, including magnetic resonance imaging (MRI), computed tomography (CT) and ultrasound in this field. Despite ongoing challenges in model generalization and standardization, improved model interpretability, technological integration and multimodal fusion, the continuous advancement of radiomics and DL technologies holds promise for AI-driven imaging analysis strategies. These approaches aim to integrate multiple clinical monitoring methods, overcome obstacles in the early LF diagnosis and treatment and provide new perspectives for precision medicine of this disease.

肝纤维化(LF)是慢性肝病到肝硬化和肝细胞癌病理进展的关键中间阶段。早期准确的诊断对疾病的干预治疗和改善预后至关重要。传统的肝活检,长期以来被认为是诊断的金标准,仍然存在一些显着的局限性,如侵入性,采样误差和观察者之间的可变性。最近,随着人工智能(AI)技术的迅速发展,放射组学和深度学习(DL)作为非侵入性诊断工具已经崭露头角,在LF诊断评估中显示出巨大的潜力。本文综述了放射组学和DL在LF诊断、分期、预后预测和病因鉴别方面的最新进展。分析了磁共振成像(MRI)、计算机断层扫描(CT)和超声等多模态成像技术在该领域的应用价值。尽管在模型泛化和标准化、改进的模型可解释性、技术集成和多模态融合方面仍存在挑战,但放射组学和DL技术的不断进步为人工智能驱动的成像分析策略带来了希望。这些方法旨在整合多种临床监测方法,克服LF早期诊断和治疗的障碍,为该病的精准医学提供新的视角。
{"title":"Research Progress on the Application of Radiomics and Deep Learning in Liver Fibrosis.","authors":"Yi Dang, Wenjing Li, Zhao Liu, Junqiang Lei","doi":"10.3390/jimaging12020082","DOIUrl":"10.3390/jimaging12020082","url":null,"abstract":"<p><p>Liver fibrosis (LF) represents a crucial intermediate stage in the pathological progression from chronic liver disease to cirrhosis and hepatocellular carcinoma. Early and accurate diagnosis is of vital importance for the intervention treatment of diseases and the improvement of prognosis. Traditional liver biopsy, long regarded as the diagnostic gold standard, remains associated with several notable limitations such as invasiveness, sampling errors and inter-observer variability. Lately, as artificial intelligence (AI) technology progresses swiftly, radiomics and deep learning (DL) have risen to prominence as non-invasive diagnostic instruments, showing significant potential in the LF diagnostic evaluation. This review summarizes the latest advancements in radiomics and DL for LF diagnosis, staging, prognosis prediction and etiological differentiation. It also analyzes the application value of multimodal imaging modalities, including magnetic resonance imaging (MRI), computed tomography (CT) and ultrasound in this field. Despite ongoing challenges in model generalization and standardization, improved model interpretability, technological integration and multimodal fusion, the continuous advancement of radiomics and DL technologies holds promise for AI-driven imaging analysis strategies. These approaches aim to integrate multiple clinical monitoring methods, overcome obstacles in the early LF diagnosis and treatment and provide new perspectives for precision medicine of this disease.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12941878/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
3D Road Defect Mapping via Differentiable Neural Rendering and Multi-Frame Semantic Fusion in Bird's-Eye-View Space. 基于可微分神经渲染和多帧语义融合的鸟瞰空间三维道路缺陷映射。
IF 2.7 Q3 IMAGING SCIENCE & PHOTOGRAPHIC TECHNOLOGY Pub Date : 2026-02-15 DOI: 10.3390/jimaging12020083
Hongjia Xing, Feng Yang

Road defect detection is essential for traffic safety and infrastructure maintenance. Excising automated methods based on 2D image analysis lack spatial context and cannot provide accurate 3D localization required for maintenance planning. We propose a novel framework for road defect mapping from monocular video sequences by integrating differentiable Bird's-Eye-View (BEV) mesh representation, semantic filtering, and multi-frame temporal fusion. Our differentiable mesh-based BEV representation enables efficient scene reconstruction from sparse observations through MLP-based optimization. The semantic filtering strategy leverages road surface segmentation to eliminate off-road false positives, reducing detection errors by 33.7%. Multi-frame fusion with ray-casting projection and exponential moving average update accumulates defect observations across frames while maintaining 3D geometric consistency. Experimental results demonstrate that our framework produces geometrically consistent BEV defect maps with superior accuracy compared to single-frame 2D methods, effectively handling occlusions, motion blur, and varying illumination conditions.

道路缺陷检测对交通安全和基础设施维护至关重要。基于二维图像分析的自动化方法缺乏空间背景,无法为维护计划提供准确的三维定位。本文提出了一种基于可微鸟瞰(BEV)网格表示、语义滤波和多帧时间融合的单眼视频序列道路缺陷映射新框架。我们基于微微网格的BEV表示,通过基于mlp的优化,可以从稀疏观测中高效地重建场景。语义过滤策略利用路面分割来消除越野误报,将检测误差降低了33.7%。多帧融合与光线投射投影和指数移动平均更新累积的缺陷观察帧,同时保持三维几何一致性。实验结果表明,与单帧2D方法相比,我们的框架产生几何上一致的BEV缺陷图,具有更高的精度,有效地处理遮挡、运动模糊和不同的光照条件。
{"title":"3D Road Defect Mapping via Differentiable Neural Rendering and Multi-Frame Semantic Fusion in Bird's-Eye-View Space.","authors":"Hongjia Xing, Feng Yang","doi":"10.3390/jimaging12020083","DOIUrl":"10.3390/jimaging12020083","url":null,"abstract":"<p><p>Road defect detection is essential for traffic safety and infrastructure maintenance. Excising automated methods based on 2D image analysis lack spatial context and cannot provide accurate 3D localization required for maintenance planning. We propose a novel framework for road defect mapping from monocular video sequences by integrating differentiable Bird's-Eye-View (BEV) mesh representation, semantic filtering, and multi-frame temporal fusion. Our differentiable mesh-based BEV representation enables efficient scene reconstruction from sparse observations through MLP-based optimization. The semantic filtering strategy leverages road surface segmentation to eliminate off-road false positives, reducing detection errors by 33.7%. Multi-frame fusion with ray-casting projection and exponential moving average update accumulates defect observations across frames while maintaining 3D geometric consistency. Experimental results demonstrate that our framework produces geometrically consistent BEV defect maps with superior accuracy compared to single-frame 2D methods, effectively handling occlusions, motion blur, and varying illumination conditions.</p>","PeriodicalId":37035,"journal":{"name":"Journal of Imaging","volume":"12 2","pages":""},"PeriodicalIF":2.7,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12941438/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147291255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Imaging
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1