Authors: R. K. Iyer, M. Hsueh, I. Lee
Venue: 15th DASC. AIAA/IEEE Digital Avionics Systems Conference
Published: 1996-10-27
DOI: 10.1109/DASC.1996.559205 (https://doi.org/10.1109/DASC.1996.559205)
Citations: 2
Fault/failure analysis of the Tandem NonStop-UX operating system
This paper presents the results of an analysis of failures in several releases of Tandem's NonStop-UX operating system. NonStop-UX is based on UNIX System V. The analysis covers software failures from the field and failures reported by Tandem's test center. Faults are classified based on the status of the reported failures, the locations of the code that detected the problems, the panic messages generated by the systems, the faulty source modules, and the types of developer mistakes. We present distributions of the failure and repair times for unique and duplicate failures. We also discuss how the analysis results can be used for assessing the dependability of the operating system and guiding improvement efforts.