{"title":"Reconstruction of Erasure Correcting Codes for Dependable Distributed Storage System without Spare Disks","authors":"H. Kaneko, E. Fujiwara","doi":"10.1109/DFT.2007.26","DOIUrl":null,"url":null,"abstract":"Demand is increasing for large-scale dependable storage systems for use in data-intensive servers, such as database and multimedia servers. Conventional dependable disk arrays utilize erasure correcting code to recover lost data stored on failed disks, and require two types of redundant disks: check disks to store check bits, and spare disks to replace failed disks. To reduce the number of redundant disks, this paper proposes a distributed storage system that does not require spare disks. Specifically, the proposed system replaces a failed disk with a check disk rather than with a spare disk. Erasure correction capability of the system gradually degrades with accumulation of failed disks. This paper proposes a reconstruction method for the parity-check matrix of low-density triple-erasure correcting code. Evaluation of the mean time to data loss (MTTDL) shows that, for a storage system with 120 information disks, the proposed storage system with 21 check disks gives an MTTDL of 7.9 x 104 years, while a conventional system with 26 redundant disks gives an MTTDL of 5.8 x 101 years, where the mean time to failure of each disk is 0.5 x 106 years and the maintenance interval of each system is 5,376 hours (i.e., 32 weeks).","PeriodicalId":259700,"journal":{"name":"22nd IEEE International Symposium on Defect and Fault-Tolerance in VLSI Systems (DFT 2007)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"22nd IEEE International Symposium on Defect and Fault-Tolerance in VLSI Systems (DFT 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DFT.2007.26","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Demand is increasing for large-scale dependable storage systems for use in data-intensive servers, such as database and multimedia servers. Conventional dependable disk arrays utilize erasure correcting code to recover lost data stored on failed disks, and require two types of redundant disks: check disks to store check bits, and spare disks to replace failed disks. To reduce the number of redundant disks, this paper proposes a distributed storage system that does not require spare disks. Specifically, the proposed system replaces a failed disk with a check disk rather than with a spare disk. Erasure correction capability of the system gradually degrades with accumulation of failed disks. This paper proposes a reconstruction method for the parity-check matrix of low-density triple-erasure correcting code. Evaluation of the mean time to data loss (MTTDL) shows that, for a storage system with 120 information disks, the proposed storage system with 21 check disks gives an MTTDL of 7.9 x 104 years, while a conventional system with 26 redundant disks gives an MTTDL of 5.8 x 101 years, where the mean time to failure of each disk is 0.5 x 106 years and the maintenance interval of each system is 5,376 hours (i.e., 32 weeks).