Bo Li, Mengdi Wang, Yongxin Zhao, G. Pu, Huibiao Zhu, Fu Song
Title: Modeling and Verifying Google File System
Published in: 2015 IEEE 16th International Symposium on High Assurance Systems Engineering
Publication date: 2015-01-08
DOI: 10.1109/HASE.2015.38 (https://doi.org/10.1109/HASE.2015.38)
Citation count: 11
Abstract
Google File System (GFS) is a distributed file system developed by Google for massive data-intensive applications. Its high aggregate performance in delivering massive data to many clients, combined with its use of inexpensive commodity hardware, enables GFS to meet massive storage needs and has led to its wide use in industry. In this paper, we first present a formal model of Google File System in Communicating Sequential Processes (CSP#), which precisely describes the underlying read/write behaviors of GFS. On that basis, both the relaxed consistency and the eventual consistency guaranteed by GFS can be exhibited in our framework. Furthermore, the proposed CSP# model is encoded in the Process Analysis Toolkit (PAT), so that several properties, such as starvation freedom and deadlock freedom, can be automatically checked and verified within a formal-methods framework.