Adaptive System-Level Fault Diagnosis of Bijective Connection Networks

IF 5.7 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE IEEE Transactions on Reliability Pub Date : 2024-07-23 DOI:10.1109/TR.2024.3425759
Yanze Huang;Limei Lin;Li Xu;Sun-Yuan Hsieh
{"title":"Adaptive System-Level Fault Diagnosis of Bijective Connection Networks","authors":"Yanze Huang;Limei Lin;Li Xu;Sun-Yuan Hsieh","doi":"10.1109/TR.2024.3425759","DOIUrl":null,"url":null,"abstract":"As the multiprocessor systems are becoming large-scale, fault-diagnosis is crucial to ensure the reliability of multiprocessor systems. In order to improve the self-diagnosis capability of a multiprocessor system, a pessimistic fault diagnosis scheme such as <inline-formula><tex-math>$t/s$</tex-math></inline-formula>-diagnosis allows some fault-free processors to be mistakenly identified as faulty. All faulty processors in a <inline-formula><tex-math>$t/s$</tex-math></inline-formula>-diagnosable multiprocessor system (<inline-formula><tex-math>$t\\leq s$</tex-math></inline-formula>) should be identified into a set with size up to <inline-formula><tex-math>$s$</tex-math></inline-formula>, when the total amount of faulty processors in the system does not exceed <inline-formula><tex-math>$t$</tex-math></inline-formula>. This article focuses on the <inline-formula><tex-math>$t/s$</tex-math></inline-formula>-diagnosis for the <inline-formula><tex-math>$n$</tex-math></inline-formula>-dimensional bijective connection network <inline-formula><tex-math>$X_{n}$</tex-math></inline-formula>. An adaptive <inline-formula><tex-math>$t/s$</tex-math></inline-formula>-diagnosis algorithm APDMM*<inline-formula><tex-math>$t/s$</tex-math></inline-formula> of complexity <inline-formula><tex-math>$O(M(log_{2}\\,M)^{2})$</tex-math></inline-formula> under the comparison model is proposed, where <inline-formula><tex-math>$M$</tex-math></inline-formula> is the total amount of nodes in <inline-formula><tex-math>$X_{n}$</tex-math></inline-formula>. Then, the correctness of algorithm APDMM*<inline-formula><tex-math>$t/s$</tex-math></inline-formula> is proved by the fault-tolerant properties of the network itself. Moreover, we calculate the <inline-formula><tex-math>$t/s$</tex-math></inline-formula>-diagnosability of <inline-formula><tex-math>$X_{n}$</tex-math></inline-formula> by theoretical method in mathematics, which is <inline-formula><tex-math>$-\\frac{1}{2}y^{2}+(n-\\frac{1}{2})y+1$</tex-math></inline-formula> for <inline-formula><tex-math>$2 \\leq y \\leq n$</tex-math></inline-formula> under comparison model, where <inline-formula><tex-math>$s=-\\frac{1}{2}y^{2}+(n-\\frac{1}{2})y+y-1$</tex-math></inline-formula>. Furthermore, we apply algorithm APDMM*<inline-formula><tex-math>$t/s$</tex-math></inline-formula> on the hypercube and the real-world network WSN-DS to verify our main results, and analyze the experimental outcomes in terms of true positive rate, false positive rate, accuracy and precision. The experimental results reveal the advantage and high performance of our algorithm APDMM*<inline-formula><tex-math>$t/s$</tex-math></inline-formula>. Besides, we compare the <inline-formula><tex-math>$t/s$</tex-math></inline-formula>-diagnosability of <inline-formula><tex-math>$X_{n}$</tex-math></inline-formula> with traditional accurate diagnosability, and it turns out that as <inline-formula><tex-math>$n$</tex-math></inline-formula> gets larger, the <inline-formula><tex-math>$t/s$</tex-math></inline-formula>-diagnosability of <inline-formula><tex-math>$X_{n}$</tex-math></inline-formula> is significantly better than traditional accurate diagnosability.","PeriodicalId":56305,"journal":{"name":"IEEE Transactions on Reliability","volume":"74 2","pages":"2916-2926"},"PeriodicalIF":5.7000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Reliability","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10608002/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

As the multiprocessor systems are becoming large-scale, fault-diagnosis is crucial to ensure the reliability of multiprocessor systems. In order to improve the self-diagnosis capability of a multiprocessor system, a pessimistic fault diagnosis scheme such as $t/s$-diagnosis allows some fault-free processors to be mistakenly identified as faulty. All faulty processors in a $t/s$-diagnosable multiprocessor system ($t\leq s$) should be identified into a set with size up to $s$, when the total amount of faulty processors in the system does not exceed $t$. This article focuses on the $t/s$-diagnosis for the $n$-dimensional bijective connection network $X_{n}$. An adaptive $t/s$-diagnosis algorithm APDMM*$t/s$ of complexity $O(M(log_{2}\,M)^{2})$ under the comparison model is proposed, where $M$ is the total amount of nodes in $X_{n}$. Then, the correctness of algorithm APDMM*$t/s$ is proved by the fault-tolerant properties of the network itself. Moreover, we calculate the $t/s$-diagnosability of $X_{n}$ by theoretical method in mathematics, which is $-\frac{1}{2}y^{2}+(n-\frac{1}{2})y+1$ for $2 \leq y \leq n$ under comparison model, where $s=-\frac{1}{2}y^{2}+(n-\frac{1}{2})y+y-1$. Furthermore, we apply algorithm APDMM*$t/s$ on the hypercube and the real-world network WSN-DS to verify our main results, and analyze the experimental outcomes in terms of true positive rate, false positive rate, accuracy and precision. The experimental results reveal the advantage and high performance of our algorithm APDMM*$t/s$. Besides, we compare the $t/s$-diagnosability of $X_{n}$ with traditional accurate diagnosability, and it turns out that as $n$ gets larger, the $t/s$-diagnosability of $X_{n}$ is significantly better than traditional accurate diagnosability.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
双射连接网络的自适应系统级故障诊断
随着多处理机系统的规模化,故障诊断是保证多处理机系统可靠性的关键。为了提高多处理机系统的自诊断能力,提出了一种悲观故障诊断方案(如$t/s$ -diagnosis),该方案允许一些无故障的处理机被错误地识别为故障。当系统中故障处理器的总数不超过$t$时,应将可$t/s$诊断的多处理器系统($t\leq s$)中的所有故障处理器识别为一个大小为$s$的集合。本文主要研究了$n$维双客观连接网络$X_{n}$的$t/s$ -诊断。提出了比较模型下复杂度$O(M(log_{2}\,M)^{2})$的自适应$t/s$ -诊断算法APDMM* $t/s$,其中$M$为$X_{n}$中的节点总数。然后,通过网络本身的容错特性,证明了算法APDMM* $t/s$的正确性。此外,我们用数学的理论方法计算了$X_{n}$的$t/s$ -可诊断性,在比较模型下$2 \leq y \leq n$为$-\frac{1}{2}y^{2}+(n-\frac{1}{2})y+1$,其中$s=-\frac{1}{2}y^{2}+(n-\frac{1}{2})y+y-1$。此外,我们在超立方体和现实网络WSN-DS上应用算法APDMM* $t/s$验证了我们的主要结果,并从真阳性率、假阳性率、正确率和精密度等方面分析了实验结果。实验结果表明了APDMM* $t/s$算法的优越性和高性能。此外,我们将$X_{n}$的$t/s$ -可诊断性与传统的准确可诊断性进行了比较,结果表明,随着$n$的增大,$X_{n}$的$t/s$ -可诊断性明显优于传统的准确可诊断性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Reliability
IEEE Transactions on Reliability 工程技术-工程:电子与电气
CiteScore
12.20
自引率
8.50%
发文量
153
审稿时长
7.5 months
期刊介绍: IEEE Transactions on Reliability is a refereed journal for the reliability and allied disciplines including, but not limited to, maintainability, physics of failure, life testing, prognostics, design and manufacture for reliability, reliability for systems of systems, network availability, mission success, warranty, safety, and various measures of effectiveness. Topics eligible for publication range from hardware to software, from materials to systems, from consumer and industrial devices to manufacturing plants, from individual items to networks, from techniques for making things better to ways of predicting and measuring behavior in the field. As an engineering subject that supports new and existing technologies, we constantly expand into new areas of the assurance sciences.
期刊最新文献
Product Quality Information–Based Integrated Maintenance Strategies for Stochastic Manufacturing System Considering Implicit and Explicit Risks Anonymous Dynamic Formation Control of Multiagent Systems With Obstacle Avoidance Harnessing Transferable Adversarial Examples via Multilayer Attention-Guided Spatial Transformations Zero-Shot Fault Diagnosis in Manufacturing Processes via Attribute Co-Occurrence Relationships An Efficient Direct Algorithm for Computing the Reliability of a Multistate Flow Network Subject to Flow Loss
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1