High spatial and spectral resolution dataset of hyperspectral look-up tables for 3.5 million traits and structural combinations of Central European temperate broadleaf forests

IF 1 Q3 MULTIDISCIPLINARY SCIENCES Data in Brief Pub Date : 2024-11-03 DOI:10.1016/j.dib.2024.111105

Tomáš Hanousek , Terézia Slanináková , Tomáš Rebok , Růžena Janoutová

{"title":"High spatial and spectral resolution dataset of hyperspectral look-up tables for 3.5 million traits and structural combinations of Central European temperate broadleaf forests","authors":"Tomáš Hanousek , Terézia Slanináková , Tomáš Rebok , Růžena Janoutová","doi":"10.1016/j.dib.2024.111105","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate retrieval of forest functional traits from remote sensing data is critical for monitoring forest health and productivity. To achieve sufficient accuracy using inverse methods it is essential to have representative database of simulated or measured spectral properties together with corresponding forest traits. However, existing datasets are often limited in scope, covering specific sites and times with simplified structures. This limitation hinders the development of generalizable machine learning models for trait prediction. To address this issue, we present a comprehensive high-resolution dataset of hyperspectral Look-Up Tables (LUT) designed for Central European temperate broadleaf forests.</div><div>The dataset includes 3.5 million unique combinations of leaf biochemical and canopy structural characteristics of forest scenes together with a variety of sun geometry. The spectral data cover wavelengths from 450 nm to 2300 nm, with a resolution of 2 nm. The dataset is organised into two files: one capturing the average reflectance of all scene pixels and another focusing solely on sunlit leaf pixels. LUT were generated using the Discrete Anisotropic Radiative Transfer model version 5.10.0. Virtual forest scenes were based on 3D tree representations derived from Terrestrial Laser Scanning of European beech trees, adjusted to various leaf area index values and structural configurations to simulate natural forest variability. The reflectance data were processed using MATLAB and Python scripts, resulting in hyperspectral cubes that were processed to generate the LUT.</div><div>The dataset can be used to train machine learning models, such as Random Forest and Support Vector Machines, for predicting forest functional traits and assisting in the calibration of remote sensing algorithms. The biggest advantage of the dataset is high spectral and spatial resolution, together with the high number of different trait combinations, which allows for adaptability to different times, locations, and hyper- and multispectral sensors, and can support up-coming hyperspectral satellite missions. ESA Copernicus Hyperspectral Imaging Mission for the Environment (CHIME) and NASA Surface Biology and Geology (SBG) future satellite missions can utilise this dataset to develop their product processors for monitoring forest traits.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"Article 111105"},"PeriodicalIF":1.0000,"publicationDate":"2024-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340924010679","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate retrieval of forest functional traits from remote sensing data is critical for monitoring forest health and productivity. To achieve sufficient accuracy using inverse methods it is essential to have representative database of simulated or measured spectral properties together with corresponding forest traits. However, existing datasets are often limited in scope, covering specific sites and times with simplified structures. This limitation hinders the development of generalizable machine learning models for trait prediction. To address this issue, we present a comprehensive high-resolution dataset of hyperspectral Look-Up Tables (LUT) designed for Central European temperate broadleaf forests.

The dataset includes 3.5 million unique combinations of leaf biochemical and canopy structural characteristics of forest scenes together with a variety of sun geometry. The spectral data cover wavelengths from 450 nm to 2300 nm, with a resolution of 2 nm. The dataset is organised into two files: one capturing the average reflectance of all scene pixels and another focusing solely on sunlit leaf pixels. LUT were generated using the Discrete Anisotropic Radiative Transfer model version 5.10.0. Virtual forest scenes were based on 3D tree representations derived from Terrestrial Laser Scanning of European beech trees, adjusted to various leaf area index values and structural configurations to simulate natural forest variability. The reflectance data were processed using MATLAB and Python scripts, resulting in hyperspectral cubes that were processed to generate the LUT.

The dataset can be used to train machine learning models, such as Random Forest and Support Vector Machines, for predicting forest functional traits and assisting in the calibration of remote sensing algorithms. The biggest advantage of the dataset is high spectral and spatial resolution, together with the high number of different trait combinations, which allows for adaptability to different times, locations, and hyper- and multispectral sensors, and can support up-coming hyperspectral satellite missions. ESA Copernicus Hyperspectral Imaging Mission for the Environment (CHIME) and NASA Surface Biology and Geology (SBG) future satellite missions can utilise this dataset to develop their product processors for monitoring forest traits.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

中欧温带阔叶林 350 万个性状和结构组合的高光谱查找表的高空间和光谱分辨率数据集

从遥感数据中准确检索森林功能特征对于监测森林健康和生产力至关重要。要利用反演方法达到足够的精确度，必须拥有具有代表性的模拟或测量光谱特性数据库以及相应的森林特征。然而，现有的数据集通常范围有限，涵盖特定地点和时间，结构简化。这种局限性阻碍了用于性状预测的通用机器学习模型的开发。为了解决这个问题，我们提出了一个针对中欧温带阔叶林的高光谱查找表（LUT）综合高分辨率数据集。该数据集包括 350 万个森林场景的叶片生化和冠层结构特征的独特组合以及各种太阳几何形状。光谱数据涵盖 450 纳米到 2300 纳米的波长，分辨率为 2 纳米。数据集分为两个文件：一个捕捉所有场景像素的平均反射率，另一个只关注阳光下的叶片像素。LUT 使用离散各向异性辐射传输模型 5.10.0 版生成。虚拟森林场景基于欧洲山毛榉树地面激光扫描得出的三维树形，并根据不同的叶面积指数值和结构配置进行了调整，以模拟森林的自然变化。该数据集可用于训练随机森林和支持向量机等机器学习模型，以预测森林功能特征并协助校准遥感算法。该数据集的最大优势是光谱和空间分辨率高，同时具有大量不同的性状组合，可适应不同的时间、地点以及高光谱和多光谱传感器，并可支持即将到来的高光谱卫星任务。欧空局哥白尼环境高光谱成像任务（CHIME）和美国国家航空航天局（NASA）地表生物学和地质学（SBG）未来的卫星任务可利用该数据集开发其产品处理器，用于监测森林特征。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Data in Brief MULTIDISCIPLINARY SCIENCES-

CiteScore

3.10

自引率

0.00%

发文量

996

审稿时长

70 days

期刊介绍： Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.