首页 > 最新文献

Infrared Physics & Technology最新文献

英文 中文
Beyond preprocessing and directional bias: Transformer models for robust and efficient cross-instrument NIR calibration in wheat flour analysis 超越预处理和方向偏差:在小麦粉分析中稳健和高效的跨仪器近红外校准变压器模型
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-02-06 DOI: 10.1016/j.infrared.2026.106448
Jing Liang, Hailong Feng, Yu Xue, Mingyue Huang, Bin Wang, Xiaoxuan Xu, Jing Xu
Cross-instrument variability remains a key barrier to the scalable application of near-infrared (NIR) spectroscopy in agri-food quality monitoring. This study introduces two Transformer-based calibration transfer models, Transpec and TPDS, designed to enhance spectral alignment across different instruments.
By combining global attention with localized spectral modeling, the proposed methods reduce reliance on extensive preprocessing and large paired transfer sets. Compared with classical techniques, Transpec and TPDS achieve higher predictive consistency across forward and backward transfers and demonstrate strong performance across multiple flour quality indicators. Their robustness and computational efficiency highlight their potential for real-time deployment in industrial multi-instrument environments. This work establishes a scalable framework for cross-device NIR modeling and contributes to the development of intelligent quality control systems in agricultural processing.
跨仪器可变性仍然是近红外(NIR)光谱在农业食品质量监测中可扩展应用的关键障碍。本研究介绍了两种基于变压器的校准传递模型,Transpec和TPDS,旨在增强不同仪器之间的光谱校准。通过将全局关注与局部光谱建模相结合,所提出的方法减少了对大量预处理和大型配对转移集的依赖。与传统技术相比,Transpec和TPDS在正向和反向转移上具有更高的预测一致性,并且在多个面粉质量指标上表现出较强的性能。它们的鲁棒性和计算效率突出了它们在工业多仪器环境中实时部署的潜力。这项工作为跨设备近红外建模建立了一个可扩展的框架,并有助于农业加工中智能质量控制系统的发展。
{"title":"Beyond preprocessing and directional bias: Transformer models for robust and efficient cross-instrument NIR calibration in wheat flour analysis","authors":"Jing Liang,&nbsp;Hailong Feng,&nbsp;Yu Xue,&nbsp;Mingyue Huang,&nbsp;Bin Wang,&nbsp;Xiaoxuan Xu,&nbsp;Jing Xu","doi":"10.1016/j.infrared.2026.106448","DOIUrl":"10.1016/j.infrared.2026.106448","url":null,"abstract":"<div><div>Cross-instrument variability remains a key barrier to the scalable application of near-infrared (NIR) spectroscopy in agri-food quality monitoring. This study introduces two Transformer-based calibration transfer models, Transpec and TPDS, designed to enhance spectral alignment across different instruments.</div><div>By combining global attention with localized spectral modeling, the proposed methods reduce reliance on extensive preprocessing and large paired transfer sets. Compared with classical techniques, Transpec and TPDS achieve higher predictive consistency across forward and backward transfers and demonstrate strong performance across multiple flour quality indicators. Their robustness and computational efficiency highlight their potential for real-time deployment in industrial multi-instrument environments. This work establishes a scalable framework for cross-device NIR modeling and contributes to the development of intelligent quality control systems in agricultural processing.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"155 ","pages":"Article 106448"},"PeriodicalIF":3.4,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146172808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Standard spatial distribution of facial skin temperature: A preliminary study 面部皮肤温度的标准空间分布:初步研究
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-02-06 DOI: 10.1016/j.infrared.2026.106437
Kent Nagumo , Kosuke Oiwa , Akio Nozawa
This study presents a preliminary, exploratory longitudinal analysis of facial skin temperature (FST) spatial distributions using facial thermal imagery (FTI) acquired over six months from summer to winter. We aimed to quantify the typical spatial distribution of FST and assess intra- and inter-individual variability under controlled laboratory conditions. As an initial step toward defining a reference distribution, we compared FST spatial patterns measured during baseline sessions with those obtained under an experimentally induced, non-clinically validated abnormal condition. The results showed relatively small inter-individual variability in FST spatial distributions within the sampled population, suggesting a consistent pattern across participants. In contrast, the abnormal condition produced measurable deviations from the baseline pattern, particularly when distributions were expressed using Z-score normalization. Because this study did not include clinical validation, external control groups, or real-world testing, the findings should be interpreted as suggestive rather than definitive. Future work should include clinical trials and broader participant cohorts to validate the proposed reference distribution, evaluate additional confounders (e.g., circadian effects and environmental variability), and test robustness in real-world settings to support translational applications such as health monitoring.
本研究利用夏季至冬季6个月的面部热图像(FTI)对面部皮肤温度(FST)的空间分布进行了初步的探索性纵向分析。我们旨在量化FST的典型空间分布,并在受控的实验室条件下评估个体内部和个体间的变异。作为确定参考分布的第一步,我们将基线期间测量的FST空间模式与实验诱导的非临床验证的异常情况下获得的FST空间模式进行了比较。结果显示,样本群体中FST空间分布的个体间差异相对较小,表明参与者之间的模式一致。相反,异常情况与基线模式产生了可测量的偏差,特别是当使用z分数归一化表示分布时。由于本研究未包括临床验证、外部对照组或实际测试,因此研究结果应被解释为暗示性而非确定性。未来的工作应包括临床试验和更广泛的参与者队列,以验证建议的参考分布,评估其他混杂因素(例如,昼夜节律效应和环境变异性),并在现实环境中测试稳健性,以支持健康监测等转化应用。
{"title":"Standard spatial distribution of facial skin temperature: A preliminary study","authors":"Kent Nagumo ,&nbsp;Kosuke Oiwa ,&nbsp;Akio Nozawa","doi":"10.1016/j.infrared.2026.106437","DOIUrl":"10.1016/j.infrared.2026.106437","url":null,"abstract":"<div><div>This study presents a preliminary, exploratory longitudinal analysis of facial skin temperature (FST) spatial distributions using facial thermal imagery (FTI) acquired over six months from summer to winter. We aimed to quantify the typical spatial distribution of FST and assess intra- and inter-individual variability under controlled laboratory conditions. As an initial step toward defining a reference distribution, we compared FST spatial patterns measured during baseline sessions with those obtained under an experimentally induced, non-clinically validated abnormal condition. The results showed relatively small inter-individual variability in FST spatial distributions within the sampled population, suggesting a consistent pattern across participants. In contrast, the abnormal condition produced measurable deviations from the baseline pattern, particularly when distributions were expressed using Z-score normalization. Because this study did not include clinical validation, external control groups, or real-world testing, the findings should be interpreted as suggestive rather than definitive. Future work should include clinical trials and broader participant cohorts to validate the proposed reference distribution, evaluate additional confounders (e.g., circadian effects and environmental variability), and test robustness in real-world settings to support translational applications such as health monitoring.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"155 ","pages":"Article 106437"},"PeriodicalIF":3.4,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146135652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Memory-Driven Wavelet Network for lightweight infrared image super-resolution 轻量红外图像超分辨率的记忆驱动小波网络
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-02-05 DOI: 10.1016/j.infrared.2026.106450
Tao Jin, Jianyu Huang
Infrared imaging finds extensive applications in security surveillance, remote sensing, interstellar exploration, and other fields where visible light imaging is limited. However, due to sensor limitations and atmospheric interference, infrared images often suffer from low resolution, severe noise, and poor contrast. To address these challenges, we propose a Memory-Driven Wavelet Network (MDWN) for lightweight infrared image super-resolution. First, we design a Parallel Wavelet Feature Extractor (PWFE) that decomposes input features into multiscale frequency components via wavelet transform, constructing dual path representations that capture complementary low and high frequency details under distinct receptive fields. Second, we propose a Memory-Driven Feature Integration Block (MDFIB), which incorporates a hierarchical memory bank with a progressively increasing number of learnable tokens across network stages. This design enables shallow layers to capture local structural priors, while deeper layers model global semantic representations. The memory tokens act as anchors for cross-region attention, effectively fusing fine-grained local details with long-range contextual information, without resorting to computationally expensive dense pairwise attention. Extensive experiments on multiple infrared benchmark datasets demonstrate that our Memory-Driven Wavelet Network (MDWN) achieves state-of-the-art performance with significantly fewer parameters and lower computational overhead.
红外成像在安全监控、遥感、星际探测等可见光成像受限的领域有着广泛的应用。然而,由于传感器的限制和大气的干扰,红外图像往往遭受低分辨率,严重的噪声,和差的对比度。为了解决这些挑战,我们提出了一种记忆驱动的小波网络(MDWN),用于轻量红外图像的超分辨率。首先,我们设计了一个并行小波特征提取器(PWFE),通过小波变换将输入特征分解为多尺度频率分量,构建双路径表示,在不同的接受场下捕获互补的低频和高频细节。其次,我们提出了一个内存驱动的特征集成块(MDFIB),它结合了一个分层记忆库,在网络阶段中逐步增加可学习令牌的数量。这种设计使浅层能够捕获局部结构先验,而深层则建模全局语义表示。内存标记充当跨区域注意的锚点,有效地将细粒度的局部细节与远程上下文信息融合在一起,而不需要使用计算代价高昂的密集成对注意。在多个红外基准数据集上的大量实验表明,我们的记忆驱动小波网络(MDWN)以更少的参数和更低的计算开销实现了最先进的性能。
{"title":"Memory-Driven Wavelet Network for lightweight infrared image super-resolution","authors":"Tao Jin,&nbsp;Jianyu Huang","doi":"10.1016/j.infrared.2026.106450","DOIUrl":"10.1016/j.infrared.2026.106450","url":null,"abstract":"<div><div>Infrared imaging finds extensive applications in security surveillance, remote sensing, interstellar exploration, and other fields where visible light imaging is limited. However, due to sensor limitations and atmospheric interference, infrared images often suffer from low resolution, severe noise, and poor contrast. To address these challenges, we propose a Memory-Driven Wavelet Network (MDWN) for lightweight infrared image super-resolution. First, we design a Parallel Wavelet Feature Extractor (PWFE) that decomposes input features into multiscale frequency components via wavelet transform, constructing dual path representations that capture complementary low and high frequency details under distinct receptive fields. Second, we propose a Memory-Driven Feature Integration Block (MDFIB), which incorporates a hierarchical memory bank with a progressively increasing number of learnable tokens across network stages. This design enables shallow layers to capture local structural priors, while deeper layers model global semantic representations. The memory tokens act as anchors for cross-region attention, effectively fusing fine-grained local details with long-range contextual information, without resorting to computationally expensive dense pairwise attention. Extensive experiments on multiple infrared benchmark datasets demonstrate that our Memory-Driven Wavelet Network (MDWN) achieves state-of-the-art performance with significantly fewer parameters and lower computational overhead.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"155 ","pages":"Article 106450"},"PeriodicalIF":3.4,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146172884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Randomness enhancement of noise-like pulses in nonlinear polarization rotation fiber cavity 非线性偏振旋转光纤腔中类噪声脉冲的随机增强
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-02-05 DOI: 10.1016/j.infrared.2026.106446
Yujun Chen, Peng Cai, Yujia Li, Hang Ming, Minnan Wu, Lei Chen, Ligang Huang, Lei Gao, Tao Zhu
Noise-like pulses (NLPs) with intense randomness play a crucial role in low-coherence spectroscopic measurements, supercontinuum generation, random laser imaging, and chaotic laser sensing. However, systematic analysis addressing both the spectral intensity randomness and the polarization characteristics of NLPs have not yet been explored. Here, we use the dispersive Fourier transform technique and the high-speed wavelength-resolved polarization measurement technique to investigate the spectral randomness and polarization characteristics of NLPs generated by nonlinear polarization rotation mode-locking in a net-normal dispersive cavity with varying lengths of highly nonlinear fiber (HNLF). Through experimental and statistical methods (correlation, Pearson coefficient, mutual information), we demonstrated that HNLF enhances spectral intensity randomness in NLPs. The incorporation of HNLF not only increases the spectral correlation decay rate but also reduces the full width at half maximum of the Pearson coefficient curve corresponding to the peak wavelength from 0.38 nm to below 0.1 nm. Accompanied by the polarization filtering effects of nonlinear polarization rotation, the wavelength-resolved states of polarization (SOP) for the NLPs exhibit partial randomness on the Poincaré sphere. We quantify polarization distribution characteristics and randomness using the relative distance (r) of SOP projection points and approximate entropy (ApEn). HNLF enhances both polarization distribution range and randomness, increasing r_max from 2 to 3 and improving ApEn values for polar/azimuthal angles at different wavelengths. The simulation result is consistent with experiments. Our work provides a systematic routine for investigating the randomness in NLPs, and also offers a new approach for generating low-cost, highly random ultrafast light sources.
具有强随机性的类噪声脉冲在低相干光谱测量、超连续谱产生、随机激光成像和混沌激光传感等领域发挥着重要作用。然而,对于nlp的光谱强度随机性和偏振特性,目前还没有系统的分析。本文利用色散傅立叶变换技术和高速波长分辨偏振测量技术,研究了在不同长度的高度非线性光纤(HNLF)的净法向色散腔中,非线性偏振旋转锁模产生的nlp的光谱随机性和偏振特性。通过实验和统计方法(相关、Pearson系数、互信息),我们证明了HNLF增强了nlp的光谱强度随机性。HNLF的加入不仅提高了光谱相关衰减率,而且使峰值波长对应的Pearson系数曲线的半最大值全宽度从0.38 nm减小到0.1 nm以下。伴随着非线性偏振旋转的偏振滤波效应,nlp的波长分辨偏振态(SOP)在poincarcarcars球上表现出部分随机性。我们使用SOP投影点的相对距离(r)和近似熵(ApEn)来量化偏振分布特征和随机性。HNLF增强了偏振分布的范围和随机性,使r_max从2增加到3,提高了不同波长下极角/方位角的ApEn值。仿真结果与实验结果吻合较好。我们的工作为研究nlp的随机性提供了一个系统的程序,也为产生低成本、高随机性的超快光源提供了一种新的方法。
{"title":"Randomness enhancement of noise-like pulses in nonlinear polarization rotation fiber cavity","authors":"Yujun Chen,&nbsp;Peng Cai,&nbsp;Yujia Li,&nbsp;Hang Ming,&nbsp;Minnan Wu,&nbsp;Lei Chen,&nbsp;Ligang Huang,&nbsp;Lei Gao,&nbsp;Tao Zhu","doi":"10.1016/j.infrared.2026.106446","DOIUrl":"10.1016/j.infrared.2026.106446","url":null,"abstract":"<div><div>Noise-like pulses (NLPs) with intense randomness play a crucial role in low-coherence spectroscopic measurements, supercontinuum generation, random laser imaging, and chaotic laser sensing. However, systematic analysis addressing both the spectral intensity randomness and the polarization characteristics of NLPs have not yet been explored. Here, we use the dispersive Fourier transform technique and the high-speed wavelength-resolved polarization measurement technique to investigate the spectral randomness and polarization characteristics of NLPs generated by nonlinear polarization rotation mode-locking in a net-normal dispersive cavity with varying lengths of highly nonlinear fiber (HNLF). Through experimental and statistical methods (correlation, Pearson coefficient, mutual information), we demonstrated that HNLF enhances spectral intensity randomness in NLPs. The incorporation of HNLF not only increases the spectral correlation decay rate but also reduces the full width at half maximum of the Pearson coefficient curve corresponding to the peak wavelength from 0.38 nm to below 0.1 nm. Accompanied by the polarization filtering effects of nonlinear polarization rotation, the wavelength-resolved states of polarization (SOP) for the NLPs exhibit partial randomness on the Poincaré sphere. We quantify polarization distribution characteristics and randomness using the relative distance (<em>r</em>) of SOP projection points and approximate entropy (ApEn). HNLF enhances both polarization distribution range and randomness, increasing <em>r_max</em> from 2 to 3 and improving ApEn values for polar/azimuthal angles at different wavelengths. The simulation result is consistent with experiments. Our work provides a systematic routine for investigating the randomness in NLPs, and also offers a new approach for generating low-cost, highly random ultrafast light sources.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"155 ","pages":"Article 106446"},"PeriodicalIF":3.4,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146172813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual-wavelength pulse generation from a 2.8 μm spatiotemporally mode-locking fiber laser based on nonlinear polarization rotation 基于非线性偏振旋转的2.8 μm时空锁模光纤激光器产生双波长脉冲
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-02-03 DOI: 10.1016/j.infrared.2026.106445
Hang Ren , Ying Yang , He Cao , Zhan Hong Lip , Fanlong Dong , Jun Guo , Jiachen Wang , Jinzhang Wang , Tengfei Wu , Chunyu Guo , Shuangchen Ruan
In this paper, to the best of our knowledge, we demonstrate the generation of a dual-wavelength pulse from a mid-infrared spatiotemporally mode-locking (STML) large-mode-area Er: ZBLAN fiber laser based on the nonlinear polarization rotation technology for the first time. Under a pump power of 4.04 W, a dual-wavelength spectrum with a pulse duration of 42 ps is achieved, centered at 2792 nm and 2796 nm, and delivering a pulse energy of 9.94 nJ. Here, we conducted numerical simulations by solving the generalized multimode nonlinear Schrödinger equation. The experimental and simulation results indicate that both long-wavelength and short-wavelength components can independently achieve STML operation, thereby confirming the realization of dual-wavelength STML operation. This work can enhance the understanding of pulse dynamics in mid-infrared multi-wavelength STML fiber lasers.
在本文中,据我们所知,我们首次展示了基于非线性偏振旋转技术的中红外时空锁模(STML)大模面积Er: ZBLAN光纤激光器产生双波长脉冲。在泵浦功率为4.04 W的情况下,实现了以2792 nm和2796 nm为中心、脉冲持续时间为42 ps的双波长光谱,脉冲能量为9.94 nJ。本文通过求解广义多模非线性Schrödinger方程进行数值模拟。实验和仿真结果表明,长波长和短波长的组件都可以独立实现STML操作,从而证实了双波长STML操作的实现。本文的工作有助于提高对中红外多波长STML光纤激光器脉冲动力学的认识。
{"title":"Dual-wavelength pulse generation from a 2.8 μm spatiotemporally mode-locking fiber laser based on nonlinear polarization rotation","authors":"Hang Ren ,&nbsp;Ying Yang ,&nbsp;He Cao ,&nbsp;Zhan Hong Lip ,&nbsp;Fanlong Dong ,&nbsp;Jun Guo ,&nbsp;Jiachen Wang ,&nbsp;Jinzhang Wang ,&nbsp;Tengfei Wu ,&nbsp;Chunyu Guo ,&nbsp;Shuangchen Ruan","doi":"10.1016/j.infrared.2026.106445","DOIUrl":"10.1016/j.infrared.2026.106445","url":null,"abstract":"<div><div>In this paper, to the best of our knowledge, we demonstrate the generation of a dual-wavelength pulse from a mid-infrared spatiotemporally mode-locking (STML) large-mode-area Er: ZBLAN fiber laser based on the nonlinear polarization rotation technology for the first time. Under a pump power of 4.04 W, a dual-wavelength spectrum with a pulse duration of 42 ps is achieved, centered at 2792 nm and 2796 nm, and delivering a pulse energy of 9.94 nJ. Here, we conducted numerical simulations by solving the generalized multimode nonlinear Schrödinger equation. The experimental and simulation results indicate that both long-wavelength and short-wavelength components can independently achieve STML operation, thereby confirming the realization of dual-wavelength STML operation. This work can enhance the understanding of pulse dynamics in mid-infrared multi-wavelength STML fiber lasers.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"155 ","pages":"Article 106445"},"PeriodicalIF":3.4,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146172810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-order Spatial-Frequency Interaction and Detail Compensation Network for infrared and visible image fusion 红外与可见光图像融合的高阶空频交互与细节补偿网络
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-02-03 DOI: 10.1016/j.infrared.2026.106431
Kanglin Jin, Mengtong Guo, Minghao Piao
Infrared and visible image fusion aims to generate a fused image that highlights salient targets while preserving fine textures. Existing deep learning-based methods predominantly rely on spatial-domain representations, which fail to fully capture the modality-specific frequency characteristics, leading to suboptimal texture preservation and detail enhancement. Since infrared and visible images exhibit distinct frequency distributions, relying solely on spatial-domain methods is insufficient for achieving high-quality fusion. To overcome this limitation, we propose a novel High-order Spatial-Frequency Interaction and Detail Compensation Network (HSFIDCNet), which jointly exploits spatial and frequency representations for more effective feature fusion. Specifically, the High-order Spatial-Frequency Interaction (HSFI) module enhances cross-domain feature integration, achieving a balanced fusion of global structures and local details, while the Detail Compensation (DC) module strengthens texture representation and highlights salient objects. Extensive experiments on three benchmark datasets (M3FD, LLVIP, and MSRS) against twelve state-of-the-art methods demonstrate that our approach consistently outperforms existing methods, producing fused images with higher contrast and richer textures. In particular, our method achieves the best performance across all three datasets in CC (0.5298, 0.7134, 0.6180), QAB/F (0.7102, 0.7326, 0.7025), MS-SSIM (0.9573, 0.9696, 0.9778), and QCV (478.6155, 267.7829, 203.6782), highlighting its robust and generalizable fusion capability. Code is available at https://github.com/sdat-max/HSFIDCNet.
红外和可见光图像融合的目的是产生融合图像,突出突出的目标,同时保持良好的纹理。现有的基于深度学习的方法主要依赖于空间域表示,不能完全捕获模态特定的频率特征,导致纹理保存和细节增强不理想。由于红外和可见光图像表现出不同的频率分布,仅依靠空间域方法不足以实现高质量的融合。为了克服这一限制,我们提出了一种新的高阶空间频率交互和细节补偿网络(HSFIDCNet),该网络联合利用空间和频率表示进行更有效的特征融合。其中,高阶空间-频率交互(HSFI)模块增强了跨域特征集成,实现了全局结构和局部细节的平衡融合;细节补偿(DC)模块增强了纹理表示,突出了突出目标。在三个基准数据集(M3FD, LLVIP和MSRS)上对12种最先进的方法进行了广泛的实验,结果表明我们的方法始终优于现有的方法,产生具有更高对比度和更丰富纹理的融合图像。该方法在CC(0.5298, 0.7134, 0.6180)、QAB/F(0.7102, 0.7326, 0.7025)、MS-SSIM(0.9573, 0.9696, 0.9778)和QCV(478.6155, 267.7829, 203.6782)三个数据集上均取得了最佳的融合性能,突出了其鲁棒性和泛化能力。代码可从https://github.com/sdat-max/HSFIDCNet获得。
{"title":"High-order Spatial-Frequency Interaction and Detail Compensation Network for infrared and visible image fusion","authors":"Kanglin Jin,&nbsp;Mengtong Guo,&nbsp;Minghao Piao","doi":"10.1016/j.infrared.2026.106431","DOIUrl":"10.1016/j.infrared.2026.106431","url":null,"abstract":"<div><div>Infrared and visible image fusion aims to generate a fused image that highlights salient targets while preserving fine textures. Existing deep learning-based methods predominantly rely on spatial-domain representations, which fail to fully capture the modality-specific frequency characteristics, leading to suboptimal texture preservation and detail enhancement. Since infrared and visible images exhibit distinct frequency distributions, relying solely on spatial-domain methods is insufficient for achieving high-quality fusion. To overcome this limitation, we propose a novel High-order Spatial-Frequency Interaction and Detail Compensation Network (HSFIDCNet), which jointly exploits spatial and frequency representations for more effective feature fusion. Specifically, the High-order Spatial-Frequency Interaction (HSFI) module enhances cross-domain feature integration, achieving a balanced fusion of global structures and local details, while the Detail Compensation (DC) module strengthens texture representation and highlights salient objects. Extensive experiments on three benchmark datasets (M<sup>3</sup>FD, LLVIP, and MSRS) against twelve state-of-the-art methods demonstrate that our approach consistently outperforms existing methods, producing fused images with higher contrast and richer textures. In particular, our method achieves the best performance across all three datasets in CC (0.5298, 0.7134, 0.6180), <span><math><msup><mrow><mi>Q</mi></mrow><mrow><mi>A</mi><mi>B</mi><mo>/</mo><mi>F</mi></mrow></msup></math></span> (0.7102, 0.7326, 0.7025), MS-SSIM (0.9573, 0.9696, 0.9778), and <span><math><msub><mrow><mi>Q</mi></mrow><mrow><mi>C</mi><mi>V</mi></mrow></msub></math></span> (478.6155, 267.7829, 203.6782), highlighting its robust and generalizable fusion capability. Code is available at <span><span>https://github.com/sdat-max/HSFIDCNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"155 ","pages":"Article 106431"},"PeriodicalIF":3.4,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146172883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MFDFuse: A multi-frequency decomposition-based network for infrared and visible image fusion MFDFuse:一种基于多频分解的红外和可见光图像融合网络
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-02-02 DOI: 10.1016/j.infrared.2026.106442
Hao Song , Longkun He , Shan Zhao, Lijin Gao, Chengjiang Zhou
The infrared and visible image fusion is a technique that extracts and combines information from both images to enhance image quality and target recognition. The image fusion serves the high-level vision task. Therefore, existing fusion algorithms prioritize the prerequisites of these tasks, making it difficult to balance visual quality with task performance. Meanwhile, since the distribution of features is disordered and contains redundant information, the ability to directly capture features without constraints and targeting is limited. To address these issues, we propose an infrared and visible image fusion network based on multi-frequency feature decomposition, termed MFDFuse. Specifically, we build a channel-based collaborative feature extraction branch in the encoder to continuously capture and convey the correlations between channels, thus maintaining the coherence of channel modeling across the network to preserve the channel information of the source image. Secondly, we utilize element-wise multiplication-based single-level and multi-level mappings to integrate high-frequency information and semantic information, achieving a balance between detail preservation and semantic enhancement. Then, we propose a feature separation loss based on feature distance to separate low-high frequency features of different modalities to constrain the results of feature extraction. Finally, we fully consider the feature information lost during the feature extraction process to achieve effective compensation. The experiments performed on publicly accessible datasets, along with downstream tasks, have shown that MFDFuse surpasses state-of-the-art (SOTA) methods in both quantitative and qualitative analysis. The code is available at https://github.com/SunsHine0816/MFDFuse.
红外图像与可见光图像融合是一种从两种图像中提取和组合信息以提高图像质量和目标识别能力的技术。图像融合服务于高级视觉任务。因此,现有的融合算法优先考虑这些任务的先决条件,难以平衡视觉质量和任务性能。同时,由于特征的分布是无序的,并且包含冗余信息,因此不受约束和目标直接捕获特征的能力受到限制。为了解决这些问题,我们提出了一种基于多频特征分解的红外和可见光图像融合网络,称为MFDFuse。具体而言,我们在编码器中构建了一个基于信道的协同特征提取分支,持续捕获和传递信道之间的相关性,从而在整个网络中保持信道建模的一致性,以保留源图像的信道信息。其次,我们利用基于元素乘法的单级和多级映射来整合高频信息和语义信息,在细节保留和语义增强之间取得平衡。然后,我们提出了基于特征距离的特征分离损失来分离不同模态的低频特征,以约束特征提取的结果。最后,充分考虑特征提取过程中丢失的特征信息,实现有效补偿。在可公开访问的数据集上进行的实验以及下游任务表明,MFDFuse在定量和定性分析方面都超过了最先进的(SOTA)方法。代码可在https://github.com/SunsHine0816/MFDFuse上获得。
{"title":"MFDFuse: A multi-frequency decomposition-based network for infrared and visible image fusion","authors":"Hao Song ,&nbsp;Longkun He ,&nbsp;Shan Zhao,&nbsp;Lijin Gao,&nbsp;Chengjiang Zhou","doi":"10.1016/j.infrared.2026.106442","DOIUrl":"10.1016/j.infrared.2026.106442","url":null,"abstract":"<div><div>The infrared and visible image fusion is a technique that extracts and combines information from both images to enhance image quality and target recognition. The image fusion serves the high-level vision task. Therefore, existing fusion algorithms prioritize the prerequisites of these tasks, making it difficult to balance visual quality with task performance. Meanwhile, since the distribution of features is disordered and contains redundant information, the ability to directly capture features without constraints and targeting is limited. To address these issues, we propose an infrared and visible image fusion network based on multi-frequency feature decomposition, termed MFDFuse. Specifically, we build a channel-based collaborative feature extraction branch in the encoder to continuously capture and convey the correlations between channels, thus maintaining the coherence of channel modeling across the network to preserve the channel information of the source image. Secondly, we utilize element-wise multiplication-based single-level and multi-level mappings to integrate high-frequency information and semantic information, achieving a balance between detail preservation and semantic enhancement. Then, we propose a feature separation loss based on feature distance to separate low-high frequency features of different modalities to constrain the results of feature extraction. Finally, we fully consider the feature information lost during the feature extraction process to achieve effective compensation. The experiments performed on publicly accessible datasets, along with downstream tasks, have shown that MFDFuse surpasses state-of-the-art (SOTA) methods in both quantitative and qualitative analysis. The code is available at <span><span>https://github.com/SunsHine0816/MFDFuse.</span><svg><path></path></svg></span></div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"155 ","pages":"Article 106442"},"PeriodicalIF":3.4,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146172873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Study on the influence Laws of different Oxygen-Containing oxidants on the Low-Pressure combustion and radiation performance of MgB2/PTFE type infrared radiation agents 不同含氧氧化剂对MgB2/PTFE型红外辐射剂低压燃烧及辐射性能的影响规律研究
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-01-31 DOI: 10.1016/j.infrared.2026.106440
Jun Huang, Yichao Liu, Zefeng Guo, Tianyu Ren, Yujie Ji, Chengkun Cai, Hua Guan
This study systematically investigated the influence of different oxygen-containing oxidants (NaNO3, KNO3, Ba(NO3) 2) on the combustion performance and radiation characteristics of MgB2/PTFE infrared radiation agents under low-pressure conditions. Through thermal analysis (TG-DTA), combustion product characterization (XRD, SEM), and low-pressure combustion experiments (5–––101 kPa), it was found that the decomposition temperature of the oxidants significantly affected the combustion stability and reaction pathway of the agent in the low-pressure environment. NaNO3 and KNO3, due to their lower decomposition temperatures (374.5 ℃ and 408 ℃ respectively), could still promote the efficient reaction of MgB2 at 5 kPa low pressure, significantly improving the combustion stability and radiation area; while Ba(NO3) 2 had a higher decomposition temperature (577.8 ℃), its system could not burn stably at 5 kPa, but by generating high-emissivity condensed-phase products (such as BaO, BaB6), it effectively enhanced the infrared radiation intensity in the α (1.3–––3 μm), β (3–––5 μm), and γ (8–––14 μm) wavelength bands under low-pressure conditions. The study further revealed the reaction mechanisms of each system: NaNO3/KNO3 mainly produced alkali metal borates (such as Na2B4O7·H2O, K3.67B4O5 (OH) 5), while the Ba(NO3) 2 system formed BaB6 through complex phase changes. The results showed that by regulating the type of oxidant, the combustion efficiency and radiation performance of MgB2/PTFE agents under low pressure could be optimized, providing key theoretical and experimental basis for the design of high-altitude infrared decoy agents.
本研究系统研究了不同含氧氧化剂(NaNO3、KNO3、Ba(NO3) 2)对MgB2/PTFE红外辐射剂在低压条件下燃烧性能和辐射特性的影响。通过热分析(TG-DTA)、燃烧产物表征(XRD、SEM)和低压燃烧实验(5—101 kPa),发现氧化剂的分解温度显著影响剂在低压环境下的燃烧稳定性和反应途径。NaNO3和KNO3由于其较低的分解温度(分别为374.5℃和408℃),在5 kPa低压下仍能促进MgB2的高效反应,显著提高了燃烧稳定性和辐射面积;Ba(NO3) 2具有较高的分解温度(577.8℃),其体系在5 kPa下不能稳定燃烧,但通过生成高发射率凝聚相产物(如BaO、BaB6),在低压条件下有效增强了α(1.3—3 μm)、β(3—5 μm)和γ(8—14 μm)波段的红外辐射强度。研究进一步揭示了各体系的反应机理:NaNO3/KNO3主要生成碱金属硼酸盐(如Na2B4O7·H2O、K3.67B4O5 (OH) 5), Ba(NO3) 2体系通过复杂相变生成BaB6。结果表明,通过调节氧化剂的种类,可以优化MgB2/PTFE阻燃剂在低压下的燃烧效率和辐射性能,为高空红外诱捕剂的设计提供关键的理论和实验依据。
{"title":"Study on the influence Laws of different Oxygen-Containing oxidants on the Low-Pressure combustion and radiation performance of MgB2/PTFE type infrared radiation agents","authors":"Jun Huang,&nbsp;Yichao Liu,&nbsp;Zefeng Guo,&nbsp;Tianyu Ren,&nbsp;Yujie Ji,&nbsp;Chengkun Cai,&nbsp;Hua Guan","doi":"10.1016/j.infrared.2026.106440","DOIUrl":"10.1016/j.infrared.2026.106440","url":null,"abstract":"<div><div>This study systematically investigated the influence of different oxygen-containing oxidants (NaNO<sub>3</sub>, KNO<sub>3</sub>, Ba(NO<sub>3</sub>) <sub>2</sub>) on the combustion performance and radiation characteristics of MgB<sub>2</sub>/PTFE infrared radiation agents under low-pressure conditions. Through thermal analysis (TG-DTA), combustion product characterization (XRD, SEM), and low-pressure combustion experiments (5–––101 kPa), it was found that the decomposition temperature of the oxidants significantly affected the combustion stability and reaction pathway of the agent in the low-pressure environment. NaNO<sub>3</sub> and KNO<sub>3</sub>, due to their lower decomposition temperatures (374.5 ℃ and 408 ℃ respectively), could still promote the efficient reaction of MgB<sub>2</sub> at 5 kPa low pressure, significantly improving the combustion stability and radiation area; while Ba(NO<sub>3</sub>) <sub>2</sub> had a higher decomposition temperature (577.8 ℃), its system could not burn stably at 5 kPa, but by generating high-emissivity condensed-phase products (such as BaO, BaB<sub>6</sub>), it effectively enhanced the infrared radiation intensity in the α (1.3–––3 μm), β (3–––5 μm), and γ (8–––14 μm) wavelength bands under low-pressure conditions. The study further revealed the reaction mechanisms of each system: NaNO<sub>3</sub>/KNO<sub>3</sub> mainly produced alkali metal borates (such as Na<sub>2</sub>B<sub>4</sub>O<sub>7</sub>·H<sub>2</sub>O, K<sub>3.67</sub>B<sub>4</sub>O<sub>5</sub> (OH) <sub>5</sub>), while the Ba(NO<sub>3</sub>) <sub>2</sub> system formed BaB<sub>6</sub> through complex phase changes. The results showed that by regulating the type of oxidant, the combustion efficiency and radiation performance of MgB<sub>2</sub>/PTFE agents under low pressure could be optimized, providing key theoretical and experimental basis for the design of high-altitude infrared decoy agents.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"155 ","pages":"Article 106440"},"PeriodicalIF":3.4,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146172875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Drone detection network based on RGB-thermal imaging multimodal fusion 基于rgb -热成像多模态融合的无人机检测网络
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-01-30 DOI: 10.1016/j.infrared.2026.106426
Xingwei Yan , Kun Liu , Ji Li , Yan Zhang , Yaxiu Zhang , Chenchen Zhang
With the rapid proliferation of unmanned aerial vehicles, the issue of their security has gradually become a focal point of research. In infrared target detection tasks, due to the small target size, complex backgrounds, and low contrast, existing methods often rely solely on the internal features of a single modality, lacking the ability to interact with external information, which limits detection performance. To address this issue, this paper proposes a novel multi-modal image detection method, R2TNet, which can directly process misaligned RGB-T images, effectively avoiding the complexity of traditional manual registration. To achieve efficient modality alignment and fusion, this paper designs a supervised bottom-up multimodal alignment module, which adopts a coarse-to-fine layer-wise registration strategy. This effectively alleviates the modality misalignment issue in multimodal images, thereby achieving precise alignment between RGB and infrared features. On this basis, a semantic-guided module is further employed to optimize cross-modal feature fusion using high-level semantic information, significantly improving the accuracy and robustness of target detection. At the same time, a multi-scale gated dynamic fusion module is incorporated to realize fine-grained fusion of multimodal features, further enhancing the model’s adaptability in complex scenarios. Experimental results demonstrate that the proposed R2TNet significantly outperforms existing state-of-the-art bimodal detection methods across multiple evaluation metrics, including Em, Sm, Fm, and MAE, and exhibits stronger robustness and generalization capability in complex backgrounds and small target detection tasks. Moreover, comparative results with unimodal infrared detection methods further validate the advantages of the proposed method in cross-modal fusion detection.
随着无人飞行器的迅速普及,其安全问题逐渐成为研究的热点。在红外目标检测任务中,由于目标尺寸小、背景复杂、对比度低,现有方法往往只依赖于单一模态的内部特征,缺乏与外部信息交互的能力,限制了检测性能。针对这一问题,本文提出了一种新的多模态图像检测方法R2TNet,该方法可以直接处理不对齐的RGB-T图像,有效避免了传统人工配准的复杂性。为了实现高效的模态对齐和融合,设计了一种监督自底向上的多模态对齐模块,该模块采用从粗到细的分层配准策略。这有效地缓解了多模态图像中的模态不对准问题,从而实现了RGB与红外特征之间的精确对准。在此基础上,进一步采用语义引导模块利用高级语义信息优化跨模态特征融合,显著提高了目标检测的准确性和鲁棒性。同时,引入多尺度门控动态融合模块,实现多模态特征的细粒度融合,进一步增强了模型对复杂场景的适应性。实验结果表明,R2TNet在Em、Sm、Fm和MAE等多个评价指标上显著优于现有的双峰检测方法,在复杂背景和小目标检测任务中表现出更强的鲁棒性和泛化能力。与单峰红外检测方法的对比结果进一步验证了该方法在跨模态融合检测中的优势。
{"title":"Drone detection network based on RGB-thermal imaging multimodal fusion","authors":"Xingwei Yan ,&nbsp;Kun Liu ,&nbsp;Ji Li ,&nbsp;Yan Zhang ,&nbsp;Yaxiu Zhang ,&nbsp;Chenchen Zhang","doi":"10.1016/j.infrared.2026.106426","DOIUrl":"10.1016/j.infrared.2026.106426","url":null,"abstract":"<div><div>With the rapid proliferation of unmanned aerial vehicles, the issue of their security has gradually become a focal point of research. In infrared target detection tasks, due to the small target size, complex backgrounds, and low contrast, existing methods often rely solely on the internal features of a single modality, lacking the ability to interact with external information, which limits detection performance. To address this issue, this paper proposes a novel multi-modal image detection method, R2TNet, which can directly process misaligned RGB-T images, effectively avoiding the complexity of traditional manual registration. To achieve efficient modality alignment and fusion, this paper designs a supervised bottom-up multimodal alignment module, which adopts a coarse-to-fine layer-wise registration strategy. This effectively alleviates the modality misalignment issue in multimodal images, thereby achieving precise alignment between RGB and infrared features. On this basis, a semantic-guided module is further employed to optimize cross-modal feature fusion using high-level semantic information, significantly improving the accuracy and robustness of target detection. At the same time, a multi-scale gated dynamic fusion module is incorporated to realize fine-grained fusion of multimodal features, further enhancing the model’s adaptability in complex scenarios. Experimental results demonstrate that the proposed R2TNet significantly outperforms existing state-of-the-art bimodal detection methods across multiple evaluation metrics, including Em, Sm, Fm, and MAE, and exhibits stronger robustness and generalization capability in complex backgrounds and small target detection tasks. Moreover, comparative results with unimodal infrared detection methods further validate the advantages of the proposed method in cross-modal fusion detection.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"155 ","pages":"Article 106426"},"PeriodicalIF":3.4,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146172874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DWPCNNFusion: Deep pulse-coupled neural networks incorporating Weber’s law for efficient infrared and visible image fusion DWPCNNFusion:采用韦伯定律的深度脉冲耦合神经网络,用于有效的红外和可见光图像融合
IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION Pub Date : 2026-01-29 DOI: 10.1016/j.infrared.2026.106417
Jia Zhao , Sirui Jia , Jing Di , Jing Lian , Yide Ma , Yuelan Xin , Jisheng Dang , Jizhao Liu
Infrared and visible image fusion is a key task in computer vision, aiming to combine complementary multimodal information to generate a salient and texture-rich image. However, existing deep learning-based fusion methods typically rely on increasing network depth to enhance performance, often overlooking the significant computational resources required, which leads to inefficiency. To address this, we propose a novel brain-inspired, end-to-end trainable infrared and visible image fusion method (DWPCNNFusion). Specifically, in the feature extraction stage, we design a deep pulse-coupled neural networks based on Weber’s law (DWPCNN) , where the coupling weight matrix is treated as a learnable parameter, enabling the network to flexibly adapt to varying data characteristics. Additionally, linking strength coefficients are set according to Weber’s law, simulating the nonlinear perception of brightness in the human visual system, which effectively mitigates detail loss in low-light environments. To accommodate dynamic changes in input data over time, a time adaptive batch normalization method is proposed, and temporal information is integrated via a rate encoding scheme, allowing DWPCNN to be efficiently incorporated into existing deep learning frameworks. Furthermore, pulse convolutional dense blocks (PCDB) are employed to extract high-level semantic features, further enhancing the model’s feature representation capability. Experimental results on the TNO and MSRS datasets, compared with 15 representative methods using both objective and subjective metrics, demonstrate that the proposed method excels in detail preservation while achieving a better balance between computational efficiency and fusion performance.
红外图像与可见光图像融合是计算机视觉中的一项关键任务,其目的是将互补的多模态信息结合在一起,生成显著且纹理丰富的图像。然而,现有的基于深度学习的融合方法通常依赖于增加网络深度来提高性能,往往忽略了所需的大量计算资源,从而导致效率低下。为了解决这个问题,我们提出了一种新颖的脑启发,端到端可训练的红外和可见光图像融合方法(DWPCNNFusion)。具体而言,在特征提取阶段,我们设计了基于韦伯定律的深度脉冲耦合神经网络(DWPCNN),将耦合权矩阵作为可学习参数,使网络能够灵活地适应不同的数据特征。此外,根据韦伯定律设置连接强度系数,模拟人类视觉系统对亮度的非线性感知,有效减轻了弱光环境下的细节损失。为了适应输入数据随时间的动态变化,提出了一种时间自适应批归一化方法,并通过速率编码方案集成时间信息,使DWPCNN能够有效地融入现有的深度学习框架。此外,采用脉冲卷积密集块(PCDB)提取高级语义特征,进一步增强了模型的特征表示能力。在TNO和MSRS数据集上的实验结果表明,该方法在计算效率和融合性能之间取得了更好的平衡,并与15种具有代表性的客观和主观度量方法进行了比较。
{"title":"DWPCNNFusion: Deep pulse-coupled neural networks incorporating Weber’s law for efficient infrared and visible image fusion","authors":"Jia Zhao ,&nbsp;Sirui Jia ,&nbsp;Jing Di ,&nbsp;Jing Lian ,&nbsp;Yide Ma ,&nbsp;Yuelan Xin ,&nbsp;Jisheng Dang ,&nbsp;Jizhao Liu","doi":"10.1016/j.infrared.2026.106417","DOIUrl":"10.1016/j.infrared.2026.106417","url":null,"abstract":"<div><div>Infrared and visible image fusion is a key task in computer vision, aiming to combine complementary multimodal information to generate a salient and texture-rich image. However, existing deep learning-based fusion methods typically rely on increasing network depth to enhance performance, often overlooking the significant computational resources required, which leads to inefficiency. To address this, we propose a novel brain-inspired, end-to-end trainable infrared and visible image fusion method (DWPCNNFusion). Specifically, in the feature extraction stage, we design a deep pulse-coupled neural networks based on Weber’s law (DWPCNN) , where the coupling weight matrix is treated as a learnable parameter, enabling the network to flexibly adapt to varying data characteristics. Additionally, linking strength coefficients are set according to Weber’s law, simulating the nonlinear perception of brightness in the human visual system, which effectively mitigates detail loss in low-light environments. To accommodate dynamic changes in input data over time, a time adaptive batch normalization method is proposed, and temporal information is integrated via a rate encoding scheme, allowing DWPCNN to be efficiently incorporated into existing deep learning frameworks. Furthermore, pulse convolutional dense blocks (PCDB) are employed to extract high-level semantic features, further enhancing the model’s feature representation capability. Experimental results on the TNO and MSRS datasets, compared with 15 representative methods using both objective and subjective metrics, demonstrate that the proposed method excels in detail preservation while achieving a better balance between computational efficiency and fusion performance.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"154 ","pages":"Article 106417"},"PeriodicalIF":3.4,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146074070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Infrared Physics & Technology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1