Building the optimal hybrid spatial Data-Driven Model: Balancing accuracy and complexity

Emanuele Barca, Maria Clementina Caputo, Rita Masciale
{"title":"Building the optimal hybrid spatial Data-Driven Model: Balancing accuracy and complexity","authors":"Emanuele Barca,&nbsp;Maria Clementina Caputo,&nbsp;Rita Masciale","doi":"10.1016/j.jag.2025.104478","DOIUrl":null,"url":null,"abstract":"<div><div>Mapping environmental variables is crucial for natural resource management. Researchers and scholars have continually advanced this field with modern techniques such as Integrated Nested Laplace Approximation (INLA), Deep Learning (DL), and Graph Neural Networks (GNN) models. While effective, these models often present a significant challenge due to their <em>black</em> nature, which obscures the process of generating final maps from raw data. Recent theoretical breakthroughs have shown that white/grey-box models can achieve the same level of accuracy as these advanced techniques, debunking the belief that complex models are necessarily the most accurate. Based on these findings, we have developed a methodology that employs a series of statistical tests and data analytics to identify essential features hidden in spatial data in order to assess the predictive model (of white/grey kind) that best approximates underlying spatial processes. This methodology profiles the model that better adapts to the data, aiding in the selection of the simplest model that achieves the desired accuracy, functioning similarly to a recommender system for model selection. Furthermore, the set of permissible models includes only regressive-like ones to clarify the data’s contribution to map construction and can be applied to a wide range of datasets. By reducing complexity, this approach enhances the transparency of the model’s results. Real-world dataset demonstrates this methodology’s remarkable ability to produce highly accurate results.</div></div>","PeriodicalId":73423,"journal":{"name":"International journal of applied earth observation and geoinformation : ITC journal","volume":"139 ","pages":"Article 104478"},"PeriodicalIF":8.6000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of applied earth observation and geoinformation : ITC journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1569843225001256","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/26 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0

Abstract

Mapping environmental variables is crucial for natural resource management. Researchers and scholars have continually advanced this field with modern techniques such as Integrated Nested Laplace Approximation (INLA), Deep Learning (DL), and Graph Neural Networks (GNN) models. While effective, these models often present a significant challenge due to their black nature, which obscures the process of generating final maps from raw data. Recent theoretical breakthroughs have shown that white/grey-box models can achieve the same level of accuracy as these advanced techniques, debunking the belief that complex models are necessarily the most accurate. Based on these findings, we have developed a methodology that employs a series of statistical tests and data analytics to identify essential features hidden in spatial data in order to assess the predictive model (of white/grey kind) that best approximates underlying spatial processes. This methodology profiles the model that better adapts to the data, aiding in the selection of the simplest model that achieves the desired accuracy, functioning similarly to a recommender system for model selection. Furthermore, the set of permissible models includes only regressive-like ones to clarify the data’s contribution to map construction and can be applied to a wide range of datasets. By reducing complexity, this approach enhances the transparency of the model’s results. Real-world dataset demonstrates this methodology’s remarkable ability to produce highly accurate results.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
构建最优混合空间数据驱动模型:平衡精度与复杂性
绘制环境变量图对于自然资源管理至关重要。研究人员和学者利用集成嵌套拉普拉斯逼近(INLA)、深度学习(DL)和图神经网络(GNN)模型等现代技术不断推进这一领域的发展。这些模型虽然有效,但由于其黑色特性,往往会对从原始数据生成最终地图的过程造成模糊,从而带来巨大挑战。最近的理论突破表明,白盒/灰盒模型可以达到与这些先进技术相同的准确度,从而推翻了 "复杂模型一定是最准确的 "这一观点。基于这些发现,我们开发了一种方法,利用一系列统计测试和数据分析来识别隐藏在空间数据中的基本特征,从而评估最接近潜在空间过程的预测模型(白盒/灰盒模型)。这种方法能剖析出更好地适应数据的模型,帮助选择最简单的模型来达到所需的准确度,其功能类似于模型选择的推荐系统。此外,允许使用的模型集只包括类似回归的模型,以明确数据对地图构建的贡献,并可应用于各种数据集。通过降低复杂性,这种方法提高了模型结果的透明度。真实世界的数据集证明了这种方法能够产生高精度的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International journal of applied earth observation and geoinformation : ITC journal
International journal of applied earth observation and geoinformation : ITC journal Global and Planetary Change, Management, Monitoring, Policy and Law, Earth-Surface Processes, Computers in Earth Sciences
CiteScore
12.00
自引率
0.00%
发文量
0
审稿时长
77 days
期刊介绍: The International Journal of Applied Earth Observation and Geoinformation publishes original papers that utilize earth observation data for natural resource and environmental inventory and management. These data primarily originate from remote sensing platforms, including satellites and aircraft, supplemented by surface and subsurface measurements. Addressing natural resources such as forests, agricultural land, soils, and water, as well as environmental concerns like biodiversity, land degradation, and hazards, the journal explores conceptual and data-driven approaches. It covers geoinformation themes like capturing, databasing, visualization, interpretation, data quality, and spatial uncertainty.
期刊最新文献
Phenology-Aligned multi-task temporal fusion framework for satellite-based triple-seasonal rice yield estimation in Southeast Asia An Arctic underwater terrain matching method integrating template matching and DEM super-resolution MAFNet: A multi-modal adaptive fusion network-based approach for individual building extraction from oblique photogrammetry Seasonal field-scale wheat yield forecasting using XGBoost with radar, optical, and weather data in Morocco Advances in extracting current profiles from X-band radar images with a focus on retrieving subsurface current
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1