Wind turbine fault detection and identification using a two-tier machine learning framework

Zaid Allal , Hassan N. Noura , Flavien Vernier , Ola Salman , Khaled Chahine
{"title":"Wind turbine fault detection and identification using a two-tier machine learning framework","authors":"Zaid Allal ,&nbsp;Hassan N. Noura ,&nbsp;Flavien Vernier ,&nbsp;Ola Salman ,&nbsp;Khaled Chahine","doi":"10.1016/j.iswa.2024.200372","DOIUrl":null,"url":null,"abstract":"<div><p>A proactive approach is essential to optimize wind turbine maintenance and minimize downtime. By utilizing advanced data analysis techniques on the existing Supervisory Control and Data Acquisition (SCADA) system data, valuable insights can be gained into wind turbine performance without incurring high costs. This allows for early fault detection and predictive maintenance, ensuring that unscheduled or reactive maintenance is minimized and revenue loss is mitigated. In this study, data from a wind turbine SCADA system in the southeast of Ireland were collected, preprocessed, and analyzed using statistical and visualization techniques to uncover hidden patterns related to five fault types within the system. The paper introduces a conditional function designed to test two given scenarios. The first scenario employs a two-tier approach involving fault detection followed by fault identification. Initially, faulty samples are detected in the first tier and then passed to the second tier, which is trained to diagnose the specific fault type for each sample. In contrast, the second scenario involves a simpler solution referred to as naive, which treats fault types and normal cases together in the same dataset and trains a model to distinguish between normal samples and those related to specific fault types. Machine learning models, particularly robust classifiers, were tested in both scenarios. Thirteen classifiers were included, ranging from tree-based to traditional classifiers, neural networks, and ensemble learners. Additionally, an averaging feature importance technique was employed to select the most impactful features on the model decisions as a starting point. A comparison of the results reveals that the proposed two-tier approach is more accurate and less time-consuming, achieving 95% accuracy in separating faulty from normal samples and approximately 91% in diagnosing each fault type. Furthermore, ensemble learners, particularly bagging and stacking, demonstrated superior fault detection and identification performance. The performance of the classifiers was validated using t-SNE and explainable AI techniques, confirming that the impactful features align with the findings and that the proposed two-tier solution outperforms the naive solution. These results strongly indicate that the proposed solution is accurate, independent, and less complex compared to existing solutions.</p></div>","PeriodicalId":100684,"journal":{"name":"Intelligent Systems with Applications","volume":"22 ","pages":"Article 200372"},"PeriodicalIF":0.0000,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667305324000486/pdfft?md5=4606e6eb1accac3bd5df946c38764e84&pid=1-s2.0-S2667305324000486-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Systems with Applications","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667305324000486","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

A proactive approach is essential to optimize wind turbine maintenance and minimize downtime. By utilizing advanced data analysis techniques on the existing Supervisory Control and Data Acquisition (SCADA) system data, valuable insights can be gained into wind turbine performance without incurring high costs. This allows for early fault detection and predictive maintenance, ensuring that unscheduled or reactive maintenance is minimized and revenue loss is mitigated. In this study, data from a wind turbine SCADA system in the southeast of Ireland were collected, preprocessed, and analyzed using statistical and visualization techniques to uncover hidden patterns related to five fault types within the system. The paper introduces a conditional function designed to test two given scenarios. The first scenario employs a two-tier approach involving fault detection followed by fault identification. Initially, faulty samples are detected in the first tier and then passed to the second tier, which is trained to diagnose the specific fault type for each sample. In contrast, the second scenario involves a simpler solution referred to as naive, which treats fault types and normal cases together in the same dataset and trains a model to distinguish between normal samples and those related to specific fault types. Machine learning models, particularly robust classifiers, were tested in both scenarios. Thirteen classifiers were included, ranging from tree-based to traditional classifiers, neural networks, and ensemble learners. Additionally, an averaging feature importance technique was employed to select the most impactful features on the model decisions as a starting point. A comparison of the results reveals that the proposed two-tier approach is more accurate and less time-consuming, achieving 95% accuracy in separating faulty from normal samples and approximately 91% in diagnosing each fault type. Furthermore, ensemble learners, particularly bagging and stacking, demonstrated superior fault detection and identification performance. The performance of the classifiers was validated using t-SNE and explainable AI techniques, confirming that the impactful features align with the findings and that the proposed two-tier solution outperforms the naive solution. These results strongly indicate that the proposed solution is accurate, independent, and less complex compared to existing solutions.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用双层机器学习框架检测和识别风力发电机故障
积极主动的方法对于优化风机维护和减少停机时间至关重要。通过对现有的监控和数据采集 (SCADA) 系统数据采用先进的数据分析技术,可以在不产生高额成本的情况下深入了解风力涡轮机的性能。这样就可以进行早期故障检测和预测性维护,确保最大限度地减少计划外或被动维护,减少收入损失。本研究收集、预处理和分析了爱尔兰东南部风力涡轮机 SCADA 系统的数据,并使用统计和可视化技术揭示了系统内五种故障类型的隐藏模式。论文介绍了一个条件函数,旨在测试两种给定的情景。第一种情况采用了一种双层方法,包括故障检测和故障识别。最初,第一层检测到故障样本,然后将其传递给第二层,第二层经过训练后可诊断出每个样本的特定故障类型。与此相反,第二种方案涉及一种更简单的解决方案,即 "天真 "方案,它将故障类型和正常情况放在同一个数据集中处理,并训练一个模型来区分正常样本和与特定故障类型相关的样本。机器学习模型,尤其是鲁棒分类器,在这两种方案中都进行了测试。其中包括 13 种分类器,从基于树的分类器到传统分类器、神经网络和集合学习器,不一而足。此外,还采用了平均特征重要性技术,以选择对模型决策影响最大的特征作为起点。结果对比显示,所提出的双层方法更准确、更省时,在区分故障样本和正常样本方面达到了 95% 的准确率,在诊断每种故障类型方面达到了约 91% 的准确率。此外,集合学习器,特别是袋装和堆叠学习器,在故障检测和识别方面表现出色。使用 t-SNE 和可解释人工智能技术对分类器的性能进行了验证,证实了有影响的特征与研究结果一致,而且所提出的双层解决方案优于天真解决方案。这些结果有力地表明,与现有解决方案相比,所提出的解决方案准确、独立且复杂度较低。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
5.60
自引率
0.00%
发文量
0
期刊最新文献
MapReduce teaching learning based optimization algorithm for solving CEC-2013 LSGO benchmark Testsuit Intelligent gear decision method for vehicle automatic transmission system based on data mining Design and implementation of EventsKG for situational monitoring and security intelligence in India: An open-source intelligence gathering approach Ideological orientation and extremism detection in online social networking sites: A systematic review Multi-objective optimization of power networks integrating electric vehicles and wind energy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1