Weitao Zou , Wei Li , Zeyu Wang , Jiaming Pei , Tongtong Lou , Guangsheng Chen , Weipeng Jing , Albert Y. Zomaya
{"title":"基于分布式矩阵的遥感计算强度求解方法","authors":"Weitao Zou , Wei Li , Zeyu Wang , Jiaming Pei , Tongtong Lou , Guangsheng Chen , Weipeng Jing , Albert Y. Zomaya","doi":"10.1016/j.future.2024.107644","DOIUrl":null,"url":null,"abstract":"<div><div>Remote sensing analysis is a dominant yet time-consuming part of geospatial applications. The performance can be optimized based on distributed computing, but current systems still face significant challenges. Firstly, the spatial characteristics of remote sensing data lead to an uneven distribution of computational intensity (CIT), which characterizes computing loads, including computation and IO, in different spatial domains. Secondly, it is hard to achieve load-balancing without introducing new computational costs, thus increasing the CIT and reducing the overall performance. Therefore, resolving CIT by decreasing and balancing it is an important research issue for distributed remote sensing computing. This paper proposes LBM-RS, an efficient distributed framework based on load-balancing matrix computing for remote sensing. It implements remote sensing applications based on the distributed matrix, representing the algorithms with a matrix computation and constructing multi-dimensional spatial domains to model computational costs for matrix operation tasks. It resolves the CIT with the minimum computation load and dynamic spatial domain decomposition strategy to support global load balancing. We also exploit the IO efficiency from the task staging strategy and the cache-aware memory structure for remote sensing data. In this way, it can reduce the bandwidth burden and memory access frequency, thus decreasing the overall CIT. Finally, we evaluate the proposed approach on both real and synthetic datasets, and the results demonstrate significant advantages in computation and communication efficiency compared to the benchmarks.</div></div>","PeriodicalId":55132,"journal":{"name":"Future Generation Computer Systems-The International Journal of Escience","volume":"166 ","pages":"Article 107644"},"PeriodicalIF":6.2000,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient distributed matrix for resolving computational intensity in remote sensing\",\"authors\":\"Weitao Zou , Wei Li , Zeyu Wang , Jiaming Pei , Tongtong Lou , Guangsheng Chen , Weipeng Jing , Albert Y. Zomaya\",\"doi\":\"10.1016/j.future.2024.107644\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Remote sensing analysis is a dominant yet time-consuming part of geospatial applications. The performance can be optimized based on distributed computing, but current systems still face significant challenges. Firstly, the spatial characteristics of remote sensing data lead to an uneven distribution of computational intensity (CIT), which characterizes computing loads, including computation and IO, in different spatial domains. Secondly, it is hard to achieve load-balancing without introducing new computational costs, thus increasing the CIT and reducing the overall performance. Therefore, resolving CIT by decreasing and balancing it is an important research issue for distributed remote sensing computing. This paper proposes LBM-RS, an efficient distributed framework based on load-balancing matrix computing for remote sensing. It implements remote sensing applications based on the distributed matrix, representing the algorithms with a matrix computation and constructing multi-dimensional spatial domains to model computational costs for matrix operation tasks. It resolves the CIT with the minimum computation load and dynamic spatial domain decomposition strategy to support global load balancing. We also exploit the IO efficiency from the task staging strategy and the cache-aware memory structure for remote sensing data. In this way, it can reduce the bandwidth burden and memory access frequency, thus decreasing the overall CIT. Finally, we evaluate the proposed approach on both real and synthetic datasets, and the results demonstrate significant advantages in computation and communication efficiency compared to the benchmarks.</div></div>\",\"PeriodicalId\":55132,\"journal\":{\"name\":\"Future Generation Computer Systems-The International Journal of Escience\",\"volume\":\"166 \",\"pages\":\"Article 107644\"},\"PeriodicalIF\":6.2000,\"publicationDate\":\"2024-12-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Future Generation Computer Systems-The International Journal of Escience\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167739X24006083\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Future Generation Computer Systems-The International Journal of Escience","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167739X24006083","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
Efficient distributed matrix for resolving computational intensity in remote sensing
Remote sensing analysis is a dominant yet time-consuming part of geospatial applications. The performance can be optimized based on distributed computing, but current systems still face significant challenges. Firstly, the spatial characteristics of remote sensing data lead to an uneven distribution of computational intensity (CIT), which characterizes computing loads, including computation and IO, in different spatial domains. Secondly, it is hard to achieve load-balancing without introducing new computational costs, thus increasing the CIT and reducing the overall performance. Therefore, resolving CIT by decreasing and balancing it is an important research issue for distributed remote sensing computing. This paper proposes LBM-RS, an efficient distributed framework based on load-balancing matrix computing for remote sensing. It implements remote sensing applications based on the distributed matrix, representing the algorithms with a matrix computation and constructing multi-dimensional spatial domains to model computational costs for matrix operation tasks. It resolves the CIT with the minimum computation load and dynamic spatial domain decomposition strategy to support global load balancing. We also exploit the IO efficiency from the task staging strategy and the cache-aware memory structure for remote sensing data. In this way, it can reduce the bandwidth burden and memory access frequency, thus decreasing the overall CIT. Finally, we evaluate the proposed approach on both real and synthetic datasets, and the results demonstrate significant advantages in computation and communication efficiency compared to the benchmarks.
期刊介绍:
Computing infrastructures and systems are constantly evolving, resulting in increasingly complex and collaborative scientific applications. To cope with these advancements, there is a growing need for collaborative tools that can effectively map, control, and execute these applications.
Furthermore, with the explosion of Big Data, there is a requirement for innovative methods and infrastructures to collect, analyze, and derive meaningful insights from the vast amount of data generated. This necessitates the integration of computational and storage capabilities, databases, sensors, and human collaboration.
Future Generation Computer Systems aims to pioneer advancements in distributed systems, collaborative environments, high-performance computing, and Big Data analytics. It strives to stay at the forefront of developments in grids, clouds, and the Internet of Things (IoT) to effectively address the challenges posed by these wide-area, fully distributed sensing and computing systems.