Sihan Chen, Sameh Abdulah, Ying Sun, Marc G. Genton
{"title":"On the impact of spatial covariance matrix ordering on tile low-rank estimation of Matérn parameters","authors":"Sihan Chen, Sameh Abdulah, Ying Sun, Marc G. Genton","doi":"10.1002/env.2868","DOIUrl":null,"url":null,"abstract":"<p>Spatial statistical modeling involves processing an <span></span><math>\n <semantics>\n <mrow>\n <mi>n</mi>\n <mo>×</mo>\n <mi>n</mi>\n </mrow>\n <annotation>$$ n\\times n $$</annotation>\n </semantics></math> symmetric positive definite covariance matrix, where <span></span><math>\n <semantics>\n <mrow>\n <mi>n</mi>\n </mrow>\n <annotation>$$ n $$</annotation>\n </semantics></math> denotes the number of locations. However, when <span></span><math>\n <semantics>\n <mrow>\n <mi>n</mi>\n </mrow>\n <annotation>$$ n $$</annotation>\n </semantics></math> is large, processing this covariance matrix using traditional methods becomes prohibitive. Thus, coupling parallel processing with approximation can be an elegant solution by relying on parallel solvers that deal with the matrix as a set of small tiles instead of the full structure. The approximation can also be performed at the tile level for better compression and faster execution. The tile low-rank (TLR) approximation has recently been used to compress the covariance matrix, which mainly relies on ordering the matrix elements, which can impact the compression quality and the efficiency of the underlying solvers. This work investigates the accuracy and performance of location-based ordering algorithms. We highlight the pros and cons of each ordering algorithm and give practitioners hints on carefully choosing the ordering algorithm for TLR approximation. We assess the quality of the compression and the accuracy of the statistical parameter estimates of the Matérn covariance function using TLR approximation under various ordering algorithms and settings of correlations through simulations on irregular grids. Our conclusions are supported by an application to daily soil moisture data in the Mississippi Basin area.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 6","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2868","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmetrics","FirstCategoryId":"93","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/env.2868","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Spatial statistical modeling involves processing an symmetric positive definite covariance matrix, where denotes the number of locations. However, when is large, processing this covariance matrix using traditional methods becomes prohibitive. Thus, coupling parallel processing with approximation can be an elegant solution by relying on parallel solvers that deal with the matrix as a set of small tiles instead of the full structure. The approximation can also be performed at the tile level for better compression and faster execution. The tile low-rank (TLR) approximation has recently been used to compress the covariance matrix, which mainly relies on ordering the matrix elements, which can impact the compression quality and the efficiency of the underlying solvers. This work investigates the accuracy and performance of location-based ordering algorithms. We highlight the pros and cons of each ordering algorithm and give practitioners hints on carefully choosing the ordering algorithm for TLR approximation. We assess the quality of the compression and the accuracy of the statistical parameter estimates of the Matérn covariance function using TLR approximation under various ordering algorithms and settings of correlations through simulations on irregular grids. Our conclusions are supported by an application to daily soil moisture data in the Mississippi Basin area.
期刊介绍:
Environmetrics, the official journal of The International Environmetrics Society (TIES), an Association of the International Statistical Institute, is devoted to the dissemination of high-quality quantitative research in the environmental sciences.
The journal welcomes pertinent and innovative submissions from quantitative disciplines developing new statistical and mathematical techniques, methods, and theories that solve modern environmental problems. Articles must proffer substantive, new statistical or mathematical advances to answer important scientific questions in the environmental sciences, or must develop novel or enhanced statistical methodology with clear applications to environmental science. New methods should be illustrated with recent environmental data.