首页 > 最新文献

IEEE Transactions on Instrumentation and Measurement最新文献

英文 中文
A Dynamic High-Fidelity Equivalent Circuit Phantom for Intracardiac Communication in Pacemaker Indications 用于心脏起搏器适应症心内通信的动态高保真等效电路模型
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-22 DOI: 10.1109/TIM.2025.3555701
Dongming Li;Jiamei Wang;Han Wang;Xiaojiang Huang;Jiejie Yang;Yueming Gao;Hung-Chun Li;Mang I Vai;Sio Hang Pun
Conductive intracardiac communication (CIC) is an essential approach for achieving multichamber pacing in leadless pacemakers (LCPs), significantly enhancing the therapeutic outcomes for conditions, such as bradycardia. However, the characteristics of the intracardiac channel are profoundly affected by the heart’s rhythmic contractions. Accurately understanding the dynamic transmission mechanisms and channel parameters under various cardiac pathological states is crucial for enhancing the multichamber pacing functionality of LCPs. In this article, the relationship between cardiac chamber volume and channel impedance is mapped based on the electrocardiogram (ECG) data. This mapping enables precise, real-time adjustments to variable impedance, simulating the impedance changes occurring with each heartbeat. Through this approach, a time-frequency equivalent circuit phantom is proposed to accurately simulate channel characteristics for various pacemaker indications (PIs). Utilizing a quasi-dual-pump structural analogy to the heart, we designed a dynamic experimental measurement platform capable of simulating the cardiac beating process under various PIs, which is employed to validate the accuracy of the circuit phantom. The results demonstrate that the correlation coefficients in the frequency and time domains are greater than 0.9432 and 0.9150, respectively, with a time-domain consistency coefficient of less than 3.25. Through cross validation in both frequency and time domains, the circuit effectively simulates the channel characteristics of normal and PI hearts. The empirical formula established based on the time-domain measurement results can be utilized for the rapid estimation of the right atrium (RA)–right ventricle (RV) channel characteristics. The proposed phantom offers a highly accurate and reproducible experimental method for the design of intracardiac communication transceivers, advancing the development and validation of leadless multichamber pacemaker systems.
导电性心内通信(CIC)是实现无导线起搏器(lcp)多室起搏的重要方法,可显著提高心动过缓等疾病的治疗效果。然而,心内通道的特性受到心脏节律性收缩的深刻影响。准确了解不同心脏病理状态下的动态传递机制和通道参数,对于增强lcp的多室起搏功能至关重要。本文根据心电数据,绘制心室容积与通道阻抗的关系图。这种映射可以精确、实时地调整可变阻抗,模拟每次心跳时发生的阻抗变化。通过这种方法,提出了一种时频等效电路模型来精确模拟各种起搏器指示(pi)的通道特性。我们利用心脏的准双泵结构类比,设计了一个动态实验测量平台,能够模拟不同pi下的心脏跳动过程,并用于验证电路模体的准确性。结果表明,频率域和时间域的相关系数分别大于0.9432和0.9150,时域一致性系数小于3.25。通过频域和时域的交叉验证,电路有效地模拟了正常心脏和PI心脏的通道特性。基于时域测量结果建立的经验公式可用于快速估计右心房-右心室通道特性。所提出的模体为心内通信收发器的设计提供了一种高度精确和可重复的实验方法,促进了无引线多室起搏器系统的开发和验证。
{"title":"A Dynamic High-Fidelity Equivalent Circuit Phantom for Intracardiac Communication in Pacemaker Indications","authors":"Dongming Li;Jiamei Wang;Han Wang;Xiaojiang Huang;Jiejie Yang;Yueming Gao;Hung-Chun Li;Mang I Vai;Sio Hang Pun","doi":"10.1109/TIM.2025.3555701","DOIUrl":"https://doi.org/10.1109/TIM.2025.3555701","url":null,"abstract":"Conductive intracardiac communication (CIC) is an essential approach for achieving multichamber pacing in leadless pacemakers (LCPs), significantly enhancing the therapeutic outcomes for conditions, such as bradycardia. However, the characteristics of the intracardiac channel are profoundly affected by the heart’s rhythmic contractions. Accurately understanding the dynamic transmission mechanisms and channel parameters under various cardiac pathological states is crucial for enhancing the multichamber pacing functionality of LCPs. In this article, the relationship between cardiac chamber volume and channel impedance is mapped based on the electrocardiogram (ECG) data. This mapping enables precise, real-time adjustments to variable impedance, simulating the impedance changes occurring with each heartbeat. Through this approach, a time-frequency equivalent circuit phantom is proposed to accurately simulate channel characteristics for various pacemaker indications (PIs). Utilizing a quasi-dual-pump structural analogy to the heart, we designed a dynamic experimental measurement platform capable of simulating the cardiac beating process under various PIs, which is employed to validate the accuracy of the circuit phantom. The results demonstrate that the correlation coefficients in the frequency and time domains are greater than 0.9432 and 0.9150, respectively, with a time-domain consistency coefficient of less than 3.25. Through cross validation in both frequency and time domains, the circuit effectively simulates the channel characteristics of normal and PI hearts. The empirical formula established based on the time-domain measurement results can be utilized for the rapid estimation of the right atrium (RA)–right ventricle (RV) channel characteristics. The proposed phantom offers a highly accurate and reproducible experimental method for the design of intracardiac communication transceivers, advancing the development and validation of leadless multichamber pacemaker systems.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-12"},"PeriodicalIF":5.6,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143870916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characterization of Inductive Signals of Polymetallic Particles Under Variable Frequency Conditions 多金属粒子在变频条件下的感应信号表征
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-17 DOI: 10.1109/TIM.2025.3557111
Hongpeng Zhang;Xiangming Kan;Chenyong Wang;Zhongyang Cai;Xurui Zhang;Riwei Wang;Chenzhao Bai
As the detection channel of an inductive particle sensor increases, it is inevitable that multiple particles will pass through the sensor simultaneously. However, the movement of multiple particles in fluid and magnetic fields, especially their aggregation behavior, can allow particle sensors to generate misleading wear signals. Therefore, to estimate the influence of the multi-particle aggregation effect on the accuracy of inductive sensors, this study constructed a magnetic coupling model of abrasive particles in strip structure and built the experimental platform required for the study, to investigate the difference in inductance change due to magnetic coupling effect among multiple metal particles at different frequencies. The experimental results reveal the phenomenon of inductance change of the three-particle combinations of the strip structure under two different conditions: in the “sequential entry” type of strip structure, the vortex effect between particles exhibits a mutual weakening, while in the “simultaneous entry” type, the vortex effect between particles exhibits a mutual enhancement. In the “simultaneous entry” type, the eddy current effects between the particles are characterized by mutual enhancement. These findings not only provide a new perspective for understanding the aggregation detection of polymetallic particles but also provide a theoretical basis for improving the accuracy of the sensor.
随着电感式粒子传感器检测通道的增加,不可避免地会有多个粒子同时通过传感器。然而,多个颗粒在流体和磁场中的运动,特别是它们的聚集行为,可能会使颗粒传感器产生误导性的磨损信号。因此,为了评估多粒子聚集效应对电感传感器精度的影响,本研究构建了条形结构中磨料颗粒的磁耦合模型,并搭建了研究所需的实验平台,研究不同频率下多个金属颗粒之间的磁耦合效应对电感变化的影响。实验结果揭示了两种不同条件下带状结构三粒子组合的电感变化现象:在“顺序进入”型带状结构中,粒子间的涡旋效应相互减弱,而在“同时进入”型带状结构中,粒子间的涡旋效应相互增强。在“同时进入”型中,粒子之间的涡流效应以相互增强为特征。这些发现不仅为理解多金属颗粒聚集检测提供了新的视角,也为提高传感器的精度提供了理论依据。
{"title":"Characterization of Inductive Signals of Polymetallic Particles Under Variable Frequency Conditions","authors":"Hongpeng Zhang;Xiangming Kan;Chenyong Wang;Zhongyang Cai;Xurui Zhang;Riwei Wang;Chenzhao Bai","doi":"10.1109/TIM.2025.3557111","DOIUrl":"https://doi.org/10.1109/TIM.2025.3557111","url":null,"abstract":"As the detection channel of an inductive particle sensor increases, it is inevitable that multiple particles will pass through the sensor simultaneously. However, the movement of multiple particles in fluid and magnetic fields, especially their aggregation behavior, can allow particle sensors to generate misleading wear signals. Therefore, to estimate the influence of the multi-particle aggregation effect on the accuracy of inductive sensors, this study constructed a magnetic coupling model of abrasive particles in strip structure and built the experimental platform required for the study, to investigate the difference in inductance change due to magnetic coupling effect among multiple metal particles at different frequencies. The experimental results reveal the phenomenon of inductance change of the three-particle combinations of the strip structure under two different conditions: in the “sequential entry” type of strip structure, the vortex effect between particles exhibits a mutual weakening, while in the “simultaneous entry” type, the vortex effect between particles exhibits a mutual enhancement. In the “simultaneous entry” type, the eddy current effects between the particles are characterized by mutual enhancement. These findings not only provide a new perspective for understanding the aggregation detection of polymetallic particles but also provide a theoretical basis for improving the accuracy of the sensor.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-9"},"PeriodicalIF":5.6,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143865372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimization of Magnetic Anomaly Detection Under 1/fα Noise Based on Karhunen–Loève Expansion and Frequency Characteristic Function 基于卡尔胡宁-洛夫扩展和频率特性函数的 1/fα 噪声下磁性异常检测优化
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-17 DOI: 10.1109/TIM.2025.3557098
Wenqi Li;Zongtan Zhou;Hongxin Li;Jingsheng Tang;Ming Xu
The primary challenge facing the field of magnetic anomaly detection (MAD) currently lies in how to effectively improve detection performance in low signal-to-noise ratio (SNR) and real $1/f^{alpha } $ noise scenarios. To overcome these difficulties, this article proposes an optimized MAD method based on a random forest (RF) classifier. This method utilizes an orthonormal basis function (OBF) detector based on Karhunen-Loève expansion (KLE) to extract energy from the raw data as time-domain (TD) features. Meanwhile, the spectral information derived from low-pass filtering (LPF) and fast Fourier transform (FFT) serves as frequency-domain (FD) features of the raw data. The cutoff frequency of the LPF is determined based on a frequency characteristic function that defines the high-frequency boundary of the target signal. Combining these time and FD features, a simulated dataset is constructed for the training and testing of the detection model. Subsequently, the trained model undergoes further validation and evaluation on semi-real and real datasets built upon measured data from a tunneling magnetoresistance (TMR) magnetic sensor. Through a series of simulations, we demonstrate that our designed method exhibits superior detection capability and stronger generalization ability compared to other similar OBF-based methods. Furthermore, the superiority of this method is also confirmed by experimental results based on measured data.
磁异常检测(MAD)领域目前面临的主要挑战是如何在低信噪比(SNR)和真实$1/f^{alpha} $噪声场景下有效提高检测性能。为了克服这些困难,本文提出了一种基于随机森林(RF)分类器的优化MAD方法。该方法利用基于karhunen - lo展开(KLE)的正交基函数(OBF)检测器从原始数据中提取能量作为时域(TD)特征。同时,通过低通滤波(LPF)和快速傅立叶变换(FFT)得到的频谱信息作为原始数据的频域特征。LPF的截止频率是根据定义目标信号高频边界的频率特性函数确定的。结合这些时间和FD特征,构建模拟数据集,用于检测模型的训练和测试。随后,基于隧道磁阻(TMR)磁传感器的测量数据,对训练好的模型进行半真实和真实数据集的进一步验证和评估。通过一系列的仿真,我们证明了与其他类似的基于obf的方法相比,我们设计的方法具有更好的检测能力和更强的泛化能力。此外,基于实测数据的实验结果也证实了该方法的优越性。
{"title":"Optimization of Magnetic Anomaly Detection Under 1/fα Noise Based on Karhunen–Loève Expansion and Frequency Characteristic Function","authors":"Wenqi Li;Zongtan Zhou;Hongxin Li;Jingsheng Tang;Ming Xu","doi":"10.1109/TIM.2025.3557098","DOIUrl":"https://doi.org/10.1109/TIM.2025.3557098","url":null,"abstract":"The primary challenge facing the field of magnetic anomaly detection (MAD) currently lies in how to effectively improve detection performance in low signal-to-noise ratio (SNR) and real <inline-formula> <tex-math>$1/f^{alpha } $ </tex-math></inline-formula> noise scenarios. To overcome these difficulties, this article proposes an optimized MAD method based on a random forest (RF) classifier. This method utilizes an orthonormal basis function (OBF) detector based on Karhunen-Loève expansion (KLE) to extract energy from the raw data as time-domain (TD) features. Meanwhile, the spectral information derived from low-pass filtering (LPF) and fast Fourier transform (FFT) serves as frequency-domain (FD) features of the raw data. The cutoff frequency of the LPF is determined based on a frequency characteristic function that defines the high-frequency boundary of the target signal. Combining these time and FD features, a simulated dataset is constructed for the training and testing of the detection model. Subsequently, the trained model undergoes further validation and evaluation on semi-real and real datasets built upon measured data from a tunneling magnetoresistance (TMR) magnetic sensor. Through a series of simulations, we demonstrate that our designed method exhibits superior detection capability and stronger generalization ability compared to other similar OBF-based methods. Furthermore, the superiority of this method is also confirmed by experimental results based on measured data.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-13"},"PeriodicalIF":5.6,"publicationDate":"2025-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143875180","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Experimental Study of Cerebral Hemorrhage Imaging Based on Frequency-Differential Electrical Capacitance Tomography 基于频差电容断层扫描的脑出血成像实验研究
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-16 DOI: 10.1109/TIM.2025.3560726
Gui Jin;Wei Zhuang;Mingxin Qin;Feng Wang;Mingsheng Chen;Haocheng Li;Zihong Wang
Currently, electrical capacitance tomography (ECT) is limited to time-differential imaging for monitoring dynamic alterations in cerebral hemorrhage. The inherent constraint of this approach, however, renders it unsuitable for rapid hemorrhage detection, as it requires a reference measurement from a nonhemorrhaging brain. In order to address this limitation, this study proposes a novel approach of frequency-differential ECT (FDECT) for cerebral hemorrhage imaging in practice. The method entails the identification of a frequency range wherein the permittivity variation of cerebral blood with frequency is much greater than the variation of other brain tissues. Within this identified range, two optimal frequencies are selected, and the permittivity difference at these two frequencies is used for imaging. With this method, cerebral hemorrhage is highlighted, and other brain tissues are suppressed, thereby achieving the absolute distribution of cerebral hemorrhage and eliminating the need for nonhemorrhagic baseline data. Simulation results demonstrate that FDECT imaging quality correlates directly with the frequency-dependent permittivity difference between the target and background media, thereby validating FDECT’s theoretical basis and highlighting the critical role of optimal frequency selection. Before conducting in vitro animal imaging, we analyzed the dielectric spectra of ex vivo sheep blood, pig fat, and pig brain tissue to identify the optimal frequency range for differentiating blood from these tissues. In vitro experiments confirmed that FDECT with the optimal frequencies effectively images blood within pig fat or brain tissue, contrasting with the suboptimal results from nonideal frequencies. Although essential for FDECT success, optimal frequency pairing does not eliminate the higher noise levels in FDECT images, largely due to the background brain tissue’s frequency-dependent dielectric characteristics. In order to mitigate this inherent limitation and improve imaging quality, we intend to implement a three-frequency FDECT approach, reducing background tissue interference.
目前,电容断层扫描(ECT)仅限于监测脑出血动态变化的时差成像。然而,这种方法的固有限制使得它不适合快速出血检测,因为它需要从非出血的大脑中进行参考测量。为了解决这一局限性,本研究提出了一种新的频差电痉挛治疗方法(FDECT)用于脑出血成像。该方法需要识别一个频率范围,其中脑血的介电常数随频率的变化远远大于其他脑组织的变化。在确定的范围内,选择两个最佳频率,并使用这两个频率的介电常数差进行成像。该方法突出脑出血,抑制其他脑组织,从而实现脑出血的绝对分布,无需非出血性基线数据。仿真结果表明,FDECT成像质量与目标介质和背景介质频率相关的介电常数差直接相关,从而验证了FDECT的理论基础,突出了最佳频率选择的关键作用。在进行体外动物成像之前,我们分析了离体羊血、猪脂肪和猪脑组织的介电光谱,以确定区分血液和这些组织的最佳频率范围。体外实验证实,与非理想频率下的次优结果相比,最佳频率下的FDECT能有效成像猪脂肪或脑组织内的血液。尽管对于FDECT的成功至关重要,但最佳频率配对并不能消除FDECT图像中较高的噪声水平,这主要是由于背景脑组织的频率相关介电特性。为了减轻这种固有的限制和提高成像质量,我们打算实施三频率FDECT方法,减少背景组织干扰。
{"title":"Experimental Study of Cerebral Hemorrhage Imaging Based on Frequency-Differential Electrical Capacitance Tomography","authors":"Gui Jin;Wei Zhuang;Mingxin Qin;Feng Wang;Mingsheng Chen;Haocheng Li;Zihong Wang","doi":"10.1109/TIM.2025.3560726","DOIUrl":"https://doi.org/10.1109/TIM.2025.3560726","url":null,"abstract":"Currently, electrical capacitance tomography (ECT) is limited to time-differential imaging for monitoring dynamic alterations in cerebral hemorrhage. The inherent constraint of this approach, however, renders it unsuitable for rapid hemorrhage detection, as it requires a reference measurement from a nonhemorrhaging brain. In order to address this limitation, this study proposes a novel approach of frequency-differential ECT (FDECT) for cerebral hemorrhage imaging in practice. The method entails the identification of a frequency range wherein the permittivity variation of cerebral blood with frequency is much greater than the variation of other brain tissues. Within this identified range, two optimal frequencies are selected, and the permittivity difference at these two frequencies is used for imaging. With this method, cerebral hemorrhage is highlighted, and other brain tissues are suppressed, thereby achieving the absolute distribution of cerebral hemorrhage and eliminating the need for nonhemorrhagic baseline data. Simulation results demonstrate that FDECT imaging quality correlates directly with the frequency-dependent permittivity difference between the target and background media, thereby validating FDECT’s theoretical basis and highlighting the critical role of optimal frequency selection. Before conducting in vitro animal imaging, we analyzed the dielectric spectra of ex vivo sheep blood, pig fat, and pig brain tissue to identify the optimal frequency range for differentiating blood from these tissues. In vitro experiments confirmed that FDECT with the optimal frequencies effectively images blood within pig fat or brain tissue, contrasting with the suboptimal results from nonideal frequencies. Although essential for FDECT success, optimal frequency pairing does not eliminate the higher noise levels in FDECT images, largely due to the background brain tissue’s frequency-dependent dielectric characteristics. In order to mitigate this inherent limitation and improve imaging quality, we intend to implement a three-frequency FDECT approach, reducing background tissue interference.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-14"},"PeriodicalIF":5.6,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143875139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MCAFNet: Multiscale Channel Attention Fusion Network for Arbitrary Style Transfer 任意风格迁移的多尺度通道注意融合网络
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-16 DOI: 10.1109/TIM.2025.3561400
Zhongyu Bai;Hongli Xu;Qichuan Ding;Xiangyue Zhang
Recently, attention-based arbitrary style transfer (AST) techniques have been widely applied in image generation and video processing. However, the scale bias of the attention module used for contextual information extraction and multiscale feature aggregation poses a challenge in balancing the content structure and style patterns of images. In this work, a multiscale channel attention fusion network (MCAFNet) is proposed to generate stylization images with well-coordinated content and style. Specifically, the multiscale channel attention module (MCAM) is introduced to extract both local and global contextual information of style features within the channel dimension and subsequently aggregate this information with content features. Following MCAM, an attentional feature fusion module (AFFM) is adopted to effectively integrate both deep and shallow semantic features. Furthermore, a novel contrastive loss based on multi-source feature enhancement is proposed to optimize the spatial distribution between content and style features. Both qualitative and quantitative experimental results compared to the state-of-the-art (SOTA) baseline approaches indicate the superiority of the proposed method for real-time image and video style transfer.
近年来,基于注意力的任意风格迁移技术在图像生成和视频处理中得到了广泛的应用。然而,用于上下文信息提取和多尺度特征聚合的注意力模块的尺度偏差给图像的内容结构和风格模式的平衡带来了挑战。在这项工作中,提出了一种多尺度通道注意力融合网络(MCAFNet)来生成内容和风格协调良好的风格化图像。具体来说,引入了多尺度通道关注模块(MCAM)来提取通道维度内风格特征的局部和全局上下文信息,然后将这些信息与内容特征进行聚合。在MCAM之后,采用了注意特征融合模块(AFFM),有效地融合了深层和浅层语义特征。在此基础上,提出了一种基于多源特征增强的对比损失算法来优化内容特征和风格特征的空间分布。定性和定量实验结果与最先进的(SOTA)基线方法相比,表明了所提出的方法在实时图像和视频风格转移方面的优越性。
{"title":"MCAFNet: Multiscale Channel Attention Fusion Network for Arbitrary Style Transfer","authors":"Zhongyu Bai;Hongli Xu;Qichuan Ding;Xiangyue Zhang","doi":"10.1109/TIM.2025.3561400","DOIUrl":"https://doi.org/10.1109/TIM.2025.3561400","url":null,"abstract":"Recently, attention-based arbitrary style transfer (AST) techniques have been widely applied in image generation and video processing. However, the scale bias of the attention module used for contextual information extraction and multiscale feature aggregation poses a challenge in balancing the content structure and style patterns of images. In this work, a multiscale channel attention fusion network (MCAFNet) is proposed to generate stylization images with well-coordinated content and style. Specifically, the multiscale channel attention module (MCAM) is introduced to extract both local and global contextual information of style features within the channel dimension and subsequently aggregate this information with content features. Following MCAM, an attentional feature fusion module (AFFM) is adopted to effectively integrate both deep and shallow semantic features. Furthermore, a novel contrastive loss based on multi-source feature enhancement is proposed to optimize the spatial distribution between content and style features. Both qualitative and quantitative experimental results compared to the state-of-the-art (SOTA) baseline approaches indicate the superiority of the proposed method for real-time image and video style transfer.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-13"},"PeriodicalIF":5.6,"publicationDate":"2025-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143875106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Unbalanced State Detection of Coriolis Flowmeter Based on Laplace Wavelet Sparse Representation and Customized Detection Indicator 基于拉普拉斯小波稀疏表示和自定义检测指标的科氏流量计不平衡状态检测
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-15 DOI: 10.1109/TIM.2025.3558818
Yue Si;Yanyi Zhang;Lingfei Kong;Chaohui Zhang;Bohan Zhao
Coriolis flowmeters are widely used in the petroleum, chemical, and pharmaceutical industries. To ensure safety and efficiency during production, it is necessary to ensure the measurement accuracy of the flowmeter. The dynamic imbalance of a flowmeter is a key factor that affects the measurement accuracy of the flowmeter. Therefore, a method for evaluating the unbalanced state status of flowmeters is proposed. This study aimed to quickly and accurately identify the dynamic unbalanced state of a flowmeter. First, the frequency response function of the original signal is calculated to reduce the error caused by the influence of excitation. Second, the method of combining empirical wavelet and Laplacian sparse decomposition is used to extract the characteristic information of each order modal. Finally, through experiments, it is found that the second- and third-order modal characteristic details are sensitive to the degree and location of the flowmeter dynamic unbalance fault. Therefore, second- and third-order modal characteristic information were selected to calculate two characteristic indicators that reflect the degree and location of the flowmeter dynamic unbalanced fault to achieve the effect of fault diagnosis. Experiments and simulations show that this method is more accurate and efficient than the other methods.
科里奥利流量计广泛应用于石油、化工和制药行业。为了保证生产过程中的安全、高效,必须保证流量计的测量精度。流量计的动态不平衡是影响流量计测量精度的关键因素。为此,提出了一种评价流量计不平衡状态的方法。本研究旨在快速准确地识别流量计的动态不平衡状态。首先,计算原始信号的频响函数,减小激励影响带来的误差。其次,采用经验小波与拉普拉斯稀疏分解相结合的方法提取各阶模态的特征信息;最后,通过实验发现,二阶和三阶模态特征细节对流量计动不平衡故障的程度和位置敏感。因此,选取二阶和三阶模态特征信息,计算反映流量计动态不平衡故障发生程度和位置的两个特征指标,达到故障诊断的效果。实验和仿真结果表明,该方法比其他方法具有更高的精度和效率。
{"title":"The Unbalanced State Detection of Coriolis Flowmeter Based on Laplace Wavelet Sparse Representation and Customized Detection Indicator","authors":"Yue Si;Yanyi Zhang;Lingfei Kong;Chaohui Zhang;Bohan Zhao","doi":"10.1109/TIM.2025.3558818","DOIUrl":"https://doi.org/10.1109/TIM.2025.3558818","url":null,"abstract":"Coriolis flowmeters are widely used in the petroleum, chemical, and pharmaceutical industries. To ensure safety and efficiency during production, it is necessary to ensure the measurement accuracy of the flowmeter. The dynamic imbalance of a flowmeter is a key factor that affects the measurement accuracy of the flowmeter. Therefore, a method for evaluating the unbalanced state status of flowmeters is proposed. This study aimed to quickly and accurately identify the dynamic unbalanced state of a flowmeter. First, the frequency response function of the original signal is calculated to reduce the error caused by the influence of excitation. Second, the method of combining empirical wavelet and Laplacian sparse decomposition is used to extract the characteristic information of each order modal. Finally, through experiments, it is found that the second- and third-order modal characteristic details are sensitive to the degree and location of the flowmeter dynamic unbalance fault. Therefore, second- and third-order modal characteristic information were selected to calculate two characteristic indicators that reflect the degree and location of the flowmeter dynamic unbalanced fault to achieve the effect of fault diagnosis. Experiments and simulations show that this method is more accurate and efficient than the other methods.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-10"},"PeriodicalIF":5.6,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143856326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multisource Information Fusion for Continuous Prediction of Joint Angles Using TCN Combined With Temporal Pattern Attention Mechanism 结合时间模式注意机制的多源信息融合关节角连续预测
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-15 DOI: 10.1109/TIM.2025.3560752
Tairen Sun;Shaozhe Wang;Hongjun Yang;Jiantao Yang;Zeng-Guang Hou
Continuous motion intention prediction is valuable for human-machine interaction (HMI); however, the accuracy and the efficiency of the existing related results are far from satisfactory. This study proposes a novel continuous motion-intention prediction method that combines multisource information fusion with an improved temporal convolutional neural network (TCN), enhanced by the introduction of the temporal pattern attention (TPA) mechanism. By integrating the temporal features of surface electromyography (sEMG) and mechanomyography (MMG) signals, we fully exploit their synergistic effect in movement intention prediction. Using the TCN network as the continuous motion prediction model improves the training efficiency through parallel computation and a simple network structure. TCN avoids the possible gradient vanishing problem and allows for parameter tuning according to different tasks. By integrating the TPA mechanism with the TCN, the model’s ability to recognize human motion intention in sequential data is significantly improved by focusing on key time steps. This enhancement increases prediction accuracy and strengthens the model’s ability to capture long-term dependencies in time series data. Experiments are conducted to show the effectiveness and the advantages of the proposed TPA-TCN-based motion prediction in comparison with the related results.
连续运动意图预测在人机交互(HMI)中具有重要价值;然而,现有的相关结果的准确性和效率远不能令人满意。本研究提出了一种将多源信息融合与改进的时间卷积神经网络(TCN)相结合,并通过引入时间模式注意(TPA)机制进行增强的连续运动意图预测方法。通过整合肌表电(sEMG)和肌力图(MMG)信号的时间特征,充分利用它们在运动意图预测中的协同作用。采用TCN网络作为连续运动预测模型,通过并行计算和简单的网络结构提高了训练效率。TCN避免了可能的梯度消失问题,并允许根据不同的任务进行参数调整。通过将TPA机制与TCN相结合,通过关注关键时间步,显著提高了模型对序列数据中人体运动意图的识别能力。这种增强提高了预测精度,并增强了模型在时间序列数据中捕获长期依赖关系的能力。通过实验与相关结果对比,验证了基于tpa - tcn的运动预测方法的有效性和优越性。
{"title":"Multisource Information Fusion for Continuous Prediction of Joint Angles Using TCN Combined With Temporal Pattern Attention Mechanism","authors":"Tairen Sun;Shaozhe Wang;Hongjun Yang;Jiantao Yang;Zeng-Guang Hou","doi":"10.1109/TIM.2025.3560752","DOIUrl":"https://doi.org/10.1109/TIM.2025.3560752","url":null,"abstract":"Continuous motion intention prediction is valuable for human-machine interaction (HMI); however, the accuracy and the efficiency of the existing related results are far from satisfactory. This study proposes a novel continuous motion-intention prediction method that combines multisource information fusion with an improved temporal convolutional neural network (TCN), enhanced by the introduction of the temporal pattern attention (TPA) mechanism. By integrating the temporal features of surface electromyography (sEMG) and mechanomyography (MMG) signals, we fully exploit their synergistic effect in movement intention prediction. Using the TCN network as the continuous motion prediction model improves the training efficiency through parallel computation and a simple network structure. TCN avoids the possible gradient vanishing problem and allows for parameter tuning according to different tasks. By integrating the TPA mechanism with the TCN, the model’s ability to recognize human motion intention in sequential data is significantly improved by focusing on key time steps. This enhancement increases prediction accuracy and strengthens the model’s ability to capture long-term dependencies in time series data. Experiments are conducted to show the effectiveness and the advantages of the proposed TPA-TCN-based motion prediction in comparison with the related results.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-12"},"PeriodicalIF":5.6,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143875178","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual-Channel Degradation Monitoring Based on Graph Neural Network for Aero-Engine Remaining Useful Life Prediction 基于图神经网络的双通道退化监测航空发动机剩余使用寿命预测
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-15 DOI: 10.1109/TIM.2025.3560753
Li'ang Cao;Yuanfu Li;Jinwei Chen;Wenjie Wu;Huisheng Zhang
For aircraft engine predictive maintenance (PdM) programs, accurate remaining useful life (RUL) predictions can significantly reduce unscheduled maintenance downtime and ensure engine safety. To this end, we introduce a novel dual-channel degradation monitoring (DCDM) algorithm designed to minimize RUL prediction errors. Unlike traditional RUL prediction algorithms based on graph neural networks (GNNs), the proposed DCDM model extracts fault features from both node embeddings and changes in graph structures. This dual-channel approach allows for the fusion of fault information, improving the model’s ability to extract features across different fault patterns for RUL prediction. During the graph structure learning (GSL) process, domain-specific knowledge and a dynamic graph learning algorithm are integrated to generate graphs, enhancing the interpretability of graph representation. In addition, by introducing node sparse encoding as a model input, the DCDM model’s capability to discern critical features is significantly improved. The predictive performance of the DCDM model and the effectiveness of its individual components are validated using the C-MAPSS dataset. The results demonstrate the superior accuracy of the proposed method compared to existing approaches.
对于飞机发动机预测性维护(PdM)项目,准确的剩余使用寿命(RUL)预测可以显著减少计划外维护停机时间,确保发动机安全。为此,我们引入了一种新的双通道退化监测(DCDM)算法,旨在最大限度地减少RUL预测误差。与传统的基于图神经网络(gnn)的RUL预测算法不同,本文提出的DCDM模型同时从节点嵌入和图结构变化中提取故障特征。这种双通道方法允许融合故障信息,提高模型在不同故障模式中提取特征的能力,用于RUL预测。在图结构学习(GSL)过程中,结合领域特定知识和动态图学习算法生成图,提高了图表示的可解释性。此外,通过引入节点稀疏编码作为模型输入,显著提高了DCDM模型识别关键特征的能力。使用C-MAPSS数据集验证了DCDM模型的预测性能及其各个组件的有效性。结果表明,与现有方法相比,该方法具有更高的精度。
{"title":"Dual-Channel Degradation Monitoring Based on Graph Neural Network for Aero-Engine Remaining Useful Life Prediction","authors":"Li'ang Cao;Yuanfu Li;Jinwei Chen;Wenjie Wu;Huisheng Zhang","doi":"10.1109/TIM.2025.3560753","DOIUrl":"https://doi.org/10.1109/TIM.2025.3560753","url":null,"abstract":"For aircraft engine predictive maintenance (PdM) programs, accurate remaining useful life (RUL) predictions can significantly reduce unscheduled maintenance downtime and ensure engine safety. To this end, we introduce a novel dual-channel degradation monitoring (DCDM) algorithm designed to minimize RUL prediction errors. Unlike traditional RUL prediction algorithms based on graph neural networks (GNNs), the proposed DCDM model extracts fault features from both node embeddings and changes in graph structures. This dual-channel approach allows for the fusion of fault information, improving the model’s ability to extract features across different fault patterns for RUL prediction. During the graph structure learning (GSL) process, domain-specific knowledge and a dynamic graph learning algorithm are integrated to generate graphs, enhancing the interpretability of graph representation. In addition, by introducing node sparse encoding as a model input, the DCDM model’s capability to discern critical features is significantly improved. The predictive performance of the DCDM model and the effectiveness of its individual components are validated using the C-MAPSS dataset. The results demonstrate the superior accuracy of the proposed method compared to existing approaches.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-19"},"PeriodicalIF":5.6,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143870982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Translational Velocity and Dispersed Bubble Distribution Measurement in Slug Flow Based on Fiber Optical Reflectometer 基于光纤反射计的段塞流平动速度和分散气泡分布测量
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-15 DOI: 10.1109/TIM.2025.3560754
Dandan Zheng;Jilin Ye;Maosen Wang;Yongtao Chen
In horizontal gas-liquid slug flow, the translational velocity of the liquid slug and its bubble distribution plays a crucial role in guiding pipeline design and understanding the formation and development mechanisms of the liquid slug. This article proposes the use of a single optical fiber probe (OFP) based on fiber optical reflectometer (FOR) to measure the velocity and chord length distribution of dispersed bubbles within the liquid slug. These acquired local parameters are then used to determine the translational slug velocity and local liquid holdup. Experiments under 24 velocity conditions were conducted in a horizontal DN50 pipeline, with the superficial liquid velocity from 0.2 to 0.7 m/s and superficial gas velocity from 1.04 to 8.14 m/s. The wavelet synchrosqueezed transform (WSST) method was applied to extract the frequency of the oscillation signal and to calculate the local velocity. Maximum velocity at the slug head region was identified as the translational velocity of the slug. Compared with Bendiksen’s empirical formula, this method achieved a mean absolute percentage error (MAPE) of 4.38%. Under most velocity conditions, the relative error did not exceed 10%. The trend in local liquid holdup was also consistent with Gregory’s prediction. Subsequently, the FOR technique was used to measure both size and spatial distribution of dispersed bubbles inside a slug. The average chord length of all dispersed bubbles was found to be 1.86 mm, with 93.8% of bubbles exhibiting a chord length between 0 and 5 mm. As superficial gas velocity increased from 3 to 7 m/s, the bubble chord length distribution shifted from 0-4 mm to 0-2 mm, indicating the enhanced bubble fragmentation within the slug at a higher gas velocity. Additionally, 29.8%–39.7% of large dispersed bubbles were concentrated at the slug head and slug tail. Bubble entrainment contributed to the increased large bubble concentration at the slug head, while bubble coalescence at the slug tail led to a reduction in the proportion of small bubbles. Moreover, the influence of gas velocity increase on bubble spatial distribution was analyzed.
在水平气液段塞流中,液体段塞流的平动速度及其气泡分布对指导管道设计和理解液体段塞流的形成和发展机制具有至关重要的作用。本文提出利用基于光纤反射计的单光纤探头(OFP)测量液塞内分散气泡的速度和弦长分布。这些获得的局部参数然后用于确定平移段塞速度和局部含液率。在水平DN50管道中进行了24种流速条件下的实验,液表流速为0.2 ~ 0.7 m/s,气表流速为1.04 ~ 8.14 m/s。采用小波同步压缩变换(WSST)方法提取振动信号的频率并计算局部速度。在弹头头部区域的最大速度被确定为弹头的平移速度。与Bendiksen经验公式相比,该方法的平均绝对百分比误差(MAPE)为4.38%。在大多数速度条件下,相对误差不超过10%。局部含液率的变化趋势也与Gregory的预测一致。随后,使用FOR技术测量段塞内分散气泡的大小和空间分布。分散气泡的平均弦长为1.86 mm,其中93.8%的气泡弦长在0 ~ 5 mm之间。随着表面气速从3 ~ 7 m/s增加,气泡弦长分布由0 ~ 4 mm变为0 ~ 2 mm,表明在较高气速下段塞内气泡破碎程度增强。此外,29.8% ~ 39.7%的大分散气泡集中在弹头头部和尾部。气泡夹带导致段塞流头部大气泡浓度增加,而段塞流尾部气泡聚并导致小气泡比例降低。分析了气速增大对气泡空间分布的影响。
{"title":"Translational Velocity and Dispersed Bubble Distribution Measurement in Slug Flow Based on Fiber Optical Reflectometer","authors":"Dandan Zheng;Jilin Ye;Maosen Wang;Yongtao Chen","doi":"10.1109/TIM.2025.3560754","DOIUrl":"https://doi.org/10.1109/TIM.2025.3560754","url":null,"abstract":"In horizontal gas-liquid slug flow, the translational velocity of the liquid slug and its bubble distribution plays a crucial role in guiding pipeline design and understanding the formation and development mechanisms of the liquid slug. This article proposes the use of a single optical fiber probe (OFP) based on fiber optical reflectometer (FOR) to measure the velocity and chord length distribution of dispersed bubbles within the liquid slug. These acquired local parameters are then used to determine the translational slug velocity and local liquid holdup. Experiments under 24 velocity conditions were conducted in a horizontal DN50 pipeline, with the superficial liquid velocity from 0.2 to 0.7 m/s and superficial gas velocity from 1.04 to 8.14 m/s. The wavelet synchrosqueezed transform (WSST) method was applied to extract the frequency of the oscillation signal and to calculate the local velocity. Maximum velocity at the slug head region was identified as the translational velocity of the slug. Compared with Bendiksen’s empirical formula, this method achieved a mean absolute percentage error (MAPE) of 4.38%. Under most velocity conditions, the relative error did not exceed 10%. The trend in local liquid holdup was also consistent with Gregory’s prediction. Subsequently, the FOR technique was used to measure both size and spatial distribution of dispersed bubbles inside a slug. The average chord length of all dispersed bubbles was found to be 1.86 mm, with 93.8% of bubbles exhibiting a chord length between 0 and 5 mm. As superficial gas velocity increased from 3 to 7 m/s, the bubble chord length distribution shifted from 0-4 mm to 0-2 mm, indicating the enhanced bubble fragmentation within the slug at a higher gas velocity. Additionally, 29.8%–39.7% of large dispersed bubbles were concentrated at the slug head and slug tail. Bubble entrainment contributed to the increased large bubble concentration at the slug head, while bubble coalescence at the slug tail led to a reduction in the proportion of small bubbles. Moreover, the influence of gas velocity increase on bubble spatial distribution was analyzed.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-12"},"PeriodicalIF":5.6,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143871031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Word Length-Aware Text Spotting: Enhancing Dense Text Detection and Recognition for Camera-Captured Document Image 单词长度感知文本识别:增强密集文本检测和识别相机捕获的文档图像
IF 5.6 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-04-15 DOI: 10.1109/TIM.2025.3560748
Hao Wang;Huabing Zhou;Yanduo Zhang;Jiayi Ma;Haibin Ling
Text spotting in camera-captured document images faces significant challenges, especially with dense text of variable lengths. Existing approaches falter with the long-tailed distribution of word lengths, leading to decreased performance on words with extreme lengths. To address this issue, we present WordLenSpotter, an end-to-end framework incorporating word length awareness to improve detection and recognition across a wide range of word lengths. Our method utilizes a dilated convolutional fusion module in its image encoder and a transformer framework for joint detection and recognition guided by word length priors. Our innovations include a spatial length predictor (SLP) and a length-aware segmentation (LenSeg) proposal head, enhancing the model’s sensitivity to the spatial distribution of text. Evaluated on our newly constructed DSTD1500 dataset and existing public datasets with dense text, WordLenSpotter demonstrates superior text spotting capabilities, especially in handling the diversity of word lengths in dense text scenes. The code is available at https://github.com/unxiaohao/WordLenSpotter
相机捕获的文档图像中的文本定位面临着重大挑战,特别是对于可变长度的密集文本。现有的方法由于单词长度的长尾分布而不稳定,导致在极端长度的单词上性能下降。为了解决这个问题,我们提出了WordLenSpotter,这是一个包含单词长度感知的端到端框架,以提高对大范围单词长度的检测和识别。我们的方法在其图像编码器中使用扩展卷积融合模块,并在单词长度先验指导下使用变压器框架进行联合检测和识别。我们的创新包括空间长度预测器(SLP)和长度感知分割(LenSeg)建议头,增强了模型对文本空间分布的敏感性。在我们新构建的DSTD1500数据集和现有的具有密集文本的公共数据集上进行评估,WordLenSpotter展示了卓越的文本识别能力,特别是在处理密集文本场景中单词长度的多样性方面。代码可在https://github.com/unxiaohao/WordLenSpotter上获得
{"title":"Word Length-Aware Text Spotting: Enhancing Dense Text Detection and Recognition for Camera-Captured Document Image","authors":"Hao Wang;Huabing Zhou;Yanduo Zhang;Jiayi Ma;Haibin Ling","doi":"10.1109/TIM.2025.3560748","DOIUrl":"https://doi.org/10.1109/TIM.2025.3560748","url":null,"abstract":"Text spotting in camera-captured document images faces significant challenges, especially with dense text of variable lengths. Existing approaches falter with the long-tailed distribution of word lengths, leading to decreased performance on words with extreme lengths. To address this issue, we present WordLenSpotter, an end-to-end framework incorporating word length awareness to improve detection and recognition across a wide range of word lengths. Our method utilizes a dilated convolutional fusion module in its image encoder and a transformer framework for joint detection and recognition guided by word length priors. Our innovations include a spatial length predictor (SLP) and a length-aware segmentation (LenSeg) proposal head, enhancing the model’s sensitivity to the spatial distribution of text. Evaluated on our newly constructed DSTD1500 dataset and existing public datasets with dense text, WordLenSpotter demonstrates superior text spotting capabilities, especially in handling the diversity of word lengths in dense text scenes. The code is available at <uri>https://github.com/unxiaohao/WordLenSpotter</uri>","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-15"},"PeriodicalIF":5.6,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143875104","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Instrumentation and Measurement
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1