High spatial and spectral resolution dataset of hyperspectral look-up tables for 3.5 million traits and structural combinations of Central European temperate broadleaf forests
Tomáš Hanousek , Terézia Slanináková , Tomáš Rebok , Růžena Janoutová
{"title":"High spatial and spectral resolution dataset of hyperspectral look-up tables for 3.5 million traits and structural combinations of Central European temperate broadleaf forests","authors":"Tomáš Hanousek , Terézia Slanináková , Tomáš Rebok , Růžena Janoutová","doi":"10.1016/j.dib.2024.111105","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate retrieval of forest functional traits from remote sensing data is critical for monitoring forest health and productivity. To achieve sufficient accuracy using inverse methods it is essential to have representative database of simulated or measured spectral properties together with corresponding forest traits. However, existing datasets are often limited in scope, covering specific sites and times with simplified structures. This limitation hinders the development of generalizable machine learning models for trait prediction. To address this issue, we present a comprehensive high-resolution dataset of hyperspectral Look-Up Tables (LUT) designed for Central European temperate broadleaf forests.</div><div>The dataset includes 3.5 million unique combinations of leaf biochemical and canopy structural characteristics of forest scenes together with a variety of sun geometry. The spectral data cover wavelengths from 450 nm to 2300 nm, with a resolution of 2 nm. The dataset is organised into two files: one capturing the average reflectance of all scene pixels and another focusing solely on sunlit leaf pixels. LUT were generated using the Discrete Anisotropic Radiative Transfer model version 5.10.0. Virtual forest scenes were based on 3D tree representations derived from Terrestrial Laser Scanning of European beech trees, adjusted to various leaf area index values and structural configurations to simulate natural forest variability. The reflectance data were processed using MATLAB and Python scripts, resulting in hyperspectral cubes that were processed to generate the LUT.</div><div>The dataset can be used to train machine learning models, such as Random Forest and Support Vector Machines, for predicting forest functional traits and assisting in the calibration of remote sensing algorithms. The biggest advantage of the dataset is high spectral and spatial resolution, together with the high number of different trait combinations, which allows for adaptability to different times, locations, and hyper- and multispectral sensors, and can support up-coming hyperspectral satellite missions. ESA Copernicus Hyperspectral Imaging Mission for the Environment (CHIME) and NASA Surface Biology and Geology (SBG) future satellite missions can utilise this dataset to develop their product processors for monitoring forest traits.</div></div>","PeriodicalId":10973,"journal":{"name":"Data in Brief","volume":"57 ","pages":"Article 111105"},"PeriodicalIF":1.0000,"publicationDate":"2024-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data in Brief","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352340924010679","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate retrieval of forest functional traits from remote sensing data is critical for monitoring forest health and productivity. To achieve sufficient accuracy using inverse methods it is essential to have representative database of simulated or measured spectral properties together with corresponding forest traits. However, existing datasets are often limited in scope, covering specific sites and times with simplified structures. This limitation hinders the development of generalizable machine learning models for trait prediction. To address this issue, we present a comprehensive high-resolution dataset of hyperspectral Look-Up Tables (LUT) designed for Central European temperate broadleaf forests.
The dataset includes 3.5 million unique combinations of leaf biochemical and canopy structural characteristics of forest scenes together with a variety of sun geometry. The spectral data cover wavelengths from 450 nm to 2300 nm, with a resolution of 2 nm. The dataset is organised into two files: one capturing the average reflectance of all scene pixels and another focusing solely on sunlit leaf pixels. LUT were generated using the Discrete Anisotropic Radiative Transfer model version 5.10.0. Virtual forest scenes were based on 3D tree representations derived from Terrestrial Laser Scanning of European beech trees, adjusted to various leaf area index values and structural configurations to simulate natural forest variability. The reflectance data were processed using MATLAB and Python scripts, resulting in hyperspectral cubes that were processed to generate the LUT.
The dataset can be used to train machine learning models, such as Random Forest and Support Vector Machines, for predicting forest functional traits and assisting in the calibration of remote sensing algorithms. The biggest advantage of the dataset is high spectral and spatial resolution, together with the high number of different trait combinations, which allows for adaptability to different times, locations, and hyper- and multispectral sensors, and can support up-coming hyperspectral satellite missions. ESA Copernicus Hyperspectral Imaging Mission for the Environment (CHIME) and NASA Surface Biology and Geology (SBG) future satellite missions can utilise this dataset to develop their product processors for monitoring forest traits.
期刊介绍:
Data in Brief provides a way for researchers to easily share and reuse each other''s datasets by publishing data articles that: -Thoroughly describe your data, facilitating reproducibility. -Make your data, which is often buried in supplementary material, easier to find. -Increase traffic towards associated research articles and data, leading to more citations. -Open up doors for new collaborations. Because you never know what data will be useful to someone else, Data in Brief welcomes submissions that describe data from all research areas.