{"title":"混合存储架构中基于数据标签的数据调度","authors":"Liangyuan Wang, Xuan Chen, X. Li","doi":"10.1109/MASS.2018.00083","DOIUrl":null,"url":null,"abstract":"Carrying out high efficient and rapid analysis of big data is essential to big data application. Due to the poor scalability of DRAM, the performance of big data analysis and related applications is difficult to improve. DRAM/NVM hybrid storage architecture has the advantages of non-volatile and high storage density, which brings an opportunity to optimize big data analysis. Because the task itself depends on the data and does not modify the data, it is possible to solve the problem of operation delay if the data is deployed well on the storage system under the background of hybrid storage architecture. In order to optimize the problem of high latency, this paper discusses the data migration between disk and NVM and proposes a data deployment algorithm based on data label. The validity of labeling is verified by calculating the total time of reading data by tasks in the experiment and the efficiency of task execution is improved.","PeriodicalId":146214,"journal":{"name":"2018 IEEE 15th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data Scheduling Based on Data Label in Hybrid Storage Architecture\",\"authors\":\"Liangyuan Wang, Xuan Chen, X. Li\",\"doi\":\"10.1109/MASS.2018.00083\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Carrying out high efficient and rapid analysis of big data is essential to big data application. Due to the poor scalability of DRAM, the performance of big data analysis and related applications is difficult to improve. DRAM/NVM hybrid storage architecture has the advantages of non-volatile and high storage density, which brings an opportunity to optimize big data analysis. Because the task itself depends on the data and does not modify the data, it is possible to solve the problem of operation delay if the data is deployed well on the storage system under the background of hybrid storage architecture. In order to optimize the problem of high latency, this paper discusses the data migration between disk and NVM and proposes a data deployment algorithm based on data label. The validity of labeling is verified by calculating the total time of reading data by tasks in the experiment and the efficiency of task execution is improved.\",\"PeriodicalId\":146214,\"journal\":{\"name\":\"2018 IEEE 15th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE 15th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MASS.2018.00083\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 15th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MASS.2018.00083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Data Scheduling Based on Data Label in Hybrid Storage Architecture
Carrying out high efficient and rapid analysis of big data is essential to big data application. Due to the poor scalability of DRAM, the performance of big data analysis and related applications is difficult to improve. DRAM/NVM hybrid storage architecture has the advantages of non-volatile and high storage density, which brings an opportunity to optimize big data analysis. Because the task itself depends on the data and does not modify the data, it is possible to solve the problem of operation delay if the data is deployed well on the storage system under the background of hybrid storage architecture. In order to optimize the problem of high latency, this paper discusses the data migration between disk and NVM and proposes a data deployment algorithm based on data label. The validity of labeling is verified by calculating the total time of reading data by tasks in the experiment and the efficiency of task execution is improved.