A Novel Dual-Stage Semi-Supervised Learning Model for Fault Diagnosis in Transformers With Limited Imbalanced Labeled Data

IF 3.1 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Dielectrics and Electrical Insulation Pub Date : 2024-09-10 DOI:10.1109/TDEI.2024.3456777
Yanfei Sun;Tao Zhao;Shuguo Gao
{"title":"A Novel Dual-Stage Semi-Supervised Learning Model for Fault Diagnosis in Transformers With Limited Imbalanced Labeled Data","authors":"Yanfei Sun;Tao Zhao;Shuguo Gao","doi":"10.1109/TDEI.2024.3456777","DOIUrl":null,"url":null,"abstract":"Dissolved gas analysis (DGA) methods, a key technique in transformer fault diagnosis, face a typical multicategory data-imbalance problem due to the extremely heterogeneous distribution of fault types across different transformers in their respective operating environments. In addition, the high cost and time requirements for fault-type labeling result in scarce relevant data. To address this issue, a model called the dual-stage semi-supervised learning model (DS-SSLM) is proposed for transformer fault diagnosis with a small amount of unbalanced labeled data. This model uses a two-phase training approach to mitigate the negative impact caused by dataset imbalance. In the pretraining phase, a cross-entropy loss function is used to minimize the difference between the predicted and actual values of the model. Simultaneously, a student model and a teacher model are introduced to process different augmented versions of the same unlabeled sample, aiming to minimize the discrepancy between their outputs, thereby enhancing the model’s robustness to data imbalance. In the downstream task, the classification head of the model is fine-tuned to improve classification reliability. Model performance tests and comparative experiments conducted on the IEC TC 10 dataset demonstrate that the proposed method is robust to data imbalance and achieves high-precision fault diagnosis.","PeriodicalId":13247,"journal":{"name":"IEEE Transactions on Dielectrics and Electrical Insulation","volume":"32 3","pages":"1837-1846"},"PeriodicalIF":3.1000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Dielectrics and Electrical Insulation","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10670711/","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Dissolved gas analysis (DGA) methods, a key technique in transformer fault diagnosis, face a typical multicategory data-imbalance problem due to the extremely heterogeneous distribution of fault types across different transformers in their respective operating environments. In addition, the high cost and time requirements for fault-type labeling result in scarce relevant data. To address this issue, a model called the dual-stage semi-supervised learning model (DS-SSLM) is proposed for transformer fault diagnosis with a small amount of unbalanced labeled data. This model uses a two-phase training approach to mitigate the negative impact caused by dataset imbalance. In the pretraining phase, a cross-entropy loss function is used to minimize the difference between the predicted and actual values of the model. Simultaneously, a student model and a teacher model are introduced to process different augmented versions of the same unlabeled sample, aiming to minimize the discrepancy between their outputs, thereby enhancing the model’s robustness to data imbalance. In the downstream task, the classification head of the model is fine-tuned to improve classification reliability. Model performance tests and comparative experiments conducted on the IEC TC 10 dataset demonstrate that the proposed method is robust to data imbalance and achieves high-precision fault diagnosis.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用有限的不平衡标记数据进行变压器故障诊断的新型双阶段半监督学习模型
溶解气体分析(DGA)方法作为变压器故障诊断的关键技术,由于故障类型在不同变压器运行环境中的分布极不均匀,面临典型的多类数据不平衡问题。此外,故障类型标记的高成本和时间要求导致相关数据稀缺。针对这一问题,提出了一种双阶段半监督学习模型(DS-SSLM),用于少量不平衡标记数据下的变压器故障诊断。该模型采用两阶段训练方法来减轻数据集不平衡带来的负面影响。在预训练阶段,交叉熵损失函数用于最小化模型预测值与实际值之间的差异。同时,引入学生模型和教师模型来处理同一未标记样本的不同增强版本,以最小化其输出之间的差异,从而增强模型对数据不平衡的鲁棒性。在下游任务中,对模型的分类头进行微调,提高分类可靠性。在IEC TC 10数据集上进行的模型性能测试和对比实验表明,该方法对数据不平衡具有较强的鲁棒性,实现了高精度的故障诊断。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Dielectrics and Electrical Insulation
IEEE Transactions on Dielectrics and Electrical Insulation 工程技术-工程:电子与电气
CiteScore
6.00
自引率
22.60%
发文量
309
审稿时长
5.2 months
期刊介绍: Topics that are concerned with dielectric phenomena and measurements, with development and characterization of gaseous, vacuum, liquid and solid electrical insulating materials and systems; and with utilization of these materials in circuits and systems under condition of use.
期刊最新文献
2025 Index IEEE Transactions on Dielectrics and Electrical Insulation IEEE Transactions on Dielectrics and Electrical Insulation Information for Authors IEEE Dielectrics and Electrical Insulation Society Information Letter of Appreciation Effect of Phase Angle of High-Voltage Harmonics on PD Power Losses
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1