{"title":"A multi-instance LSTM network for failure detection of hard disk drives","authors":"","doi":"10.1109/INDIN45582.2020.9442240","DOIUrl":null,"url":null,"abstract":"Hard disk (HDD) failure is the most important reliability issue in the data center. Therefore, the prediction of hard disk failure has become the focus of attention of major data centers. However, most current research work does not notice the fact that the data on the hard disk is mostly unlabeled data. Since the degradation period in HDD is very short, the mixture of health data and erroneous data can cause serious data imbalance. This makes fault prediction a difficult task. In response to the above problems, a multi-instance long-term sequence classification method based on long-short-term memory (LSTM) network is proposed. By dividing the longterm sequence data packet into multiple instances, the relationship between the instance and the sample label is studied to predict HDD failure. Through the analysis of the hard disk data of a communication company and the Backblaze data center, this method can obtain better results than other methods.","PeriodicalId":185948,"journal":{"name":"2020 IEEE 18th International Conference on Industrial Informatics (INDIN)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 18th International Conference on Industrial Informatics (INDIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIN45582.2020.9442240","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Hard disk (HDD) failure is the most important reliability issue in the data center. Therefore, the prediction of hard disk failure has become the focus of attention of major data centers. However, most current research work does not notice the fact that the data on the hard disk is mostly unlabeled data. Since the degradation period in HDD is very short, the mixture of health data and erroneous data can cause serious data imbalance. This makes fault prediction a difficult task. In response to the above problems, a multi-instance long-term sequence classification method based on long-short-term memory (LSTM) network is proposed. By dividing the longterm sequence data packet into multiple instances, the relationship between the instance and the sample label is studied to predict HDD failure. Through the analysis of the hard disk data of a communication company and the Backblaze data center, this method can obtain better results than other methods.