Trang Thi Kieu Tran , Sayed M. Bateni , Hamid Mohebzadeh , Changhyun Jun , Manish Pandey , Dongkyn Kim
{"title":"Filling gaps in MODIS NDVI data using hybrid multiple imputation–Machine learning and DINCAE techniques: Case study of the State of Hawaii","authors":"Trang Thi Kieu Tran , Sayed M. Bateni , Hamid Mohebzadeh , Changhyun Jun , Manish Pandey , Dongkyn Kim","doi":"10.1016/j.advengsoft.2024.103856","DOIUrl":null,"url":null,"abstract":"<div><div>Normalized difference vegetation index (NDVI) data are vital for monitoring vegetation dynamics and health. However, NDVI time-series data obtained via remote sensing often contain missing values due to factors such as cloud cover, snow, and hardware failures. To address this problem and fill gaps in NDVI data from the Moderate Resolution Imaging Spectroradiometer (MODIS), this study combines the multiple imputations by chained equations (MICE) model with three machine learning techniques: Knearest neighbor, multilayer perceptron (MLP), and boosted regression tree. Additionally, the data interpolating convolutional auto-encoder (DINCAE), a recently proposed imputation method, is employed for imputation and comparison. The performance of all these models is evaluated using MODIS NDVI data from Oahu, Hawaii for training and validation. Synthetic scenarios with gap sizes of 20 %, 40 %, 60 %, and 80 % are created to assess the models’ feasibility for each gap size. Furthermore, all models are tested using data from Hawaii Island and Maui. Results indicate that the MICE-MLP model achieves the highest accuracy in imputing missing NDVI values on Oahu, with root mean square error (RMSE) values of 0.1028, 0.1112, and 0.1224 for missing ratios of 20 %, 40 %, and 60 %, respectively. Similarly, MICE-MLP outperforms other models using Hawaii Island and Maui data at gap sizes below 80 %. While the DINCAE model demonstrates superior accuracy at an 80 % gap size, its computational speed is slower than MICE-MLP. Overall, the findings underscore the robustness and accuracy of the MICE-MLP model in imputing missing NDVI data, making it a reliable alternative to existing methods.</div></div>","PeriodicalId":50866,"journal":{"name":"Advances in Engineering Software","volume":"201 ","pages":"Article 103856"},"PeriodicalIF":4.0000,"publicationDate":"2024-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Engineering Software","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0965997824002631","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Normalized difference vegetation index (NDVI) data are vital for monitoring vegetation dynamics and health. However, NDVI time-series data obtained via remote sensing often contain missing values due to factors such as cloud cover, snow, and hardware failures. To address this problem and fill gaps in NDVI data from the Moderate Resolution Imaging Spectroradiometer (MODIS), this study combines the multiple imputations by chained equations (MICE) model with three machine learning techniques: Knearest neighbor, multilayer perceptron (MLP), and boosted regression tree. Additionally, the data interpolating convolutional auto-encoder (DINCAE), a recently proposed imputation method, is employed for imputation and comparison. The performance of all these models is evaluated using MODIS NDVI data from Oahu, Hawaii for training and validation. Synthetic scenarios with gap sizes of 20 %, 40 %, 60 %, and 80 % are created to assess the models’ feasibility for each gap size. Furthermore, all models are tested using data from Hawaii Island and Maui. Results indicate that the MICE-MLP model achieves the highest accuracy in imputing missing NDVI values on Oahu, with root mean square error (RMSE) values of 0.1028, 0.1112, and 0.1224 for missing ratios of 20 %, 40 %, and 60 %, respectively. Similarly, MICE-MLP outperforms other models using Hawaii Island and Maui data at gap sizes below 80 %. While the DINCAE model demonstrates superior accuracy at an 80 % gap size, its computational speed is slower than MICE-MLP. Overall, the findings underscore the robustness and accuracy of the MICE-MLP model in imputing missing NDVI data, making it a reliable alternative to existing methods.
期刊介绍:
The objective of this journal is to communicate recent and projected advances in computer-based engineering techniques. The fields covered include mechanical, aerospace, civil and environmental engineering, with an emphasis on research and development leading to practical problem-solving.
The scope of the journal includes:
• Innovative computational strategies and numerical algorithms for large-scale engineering problems
• Analysis and simulation techniques and systems
• Model and mesh generation
• Control of the accuracy, stability and efficiency of computational process
• Exploitation of new computing environments (eg distributed hetergeneous and collaborative computing)
• Advanced visualization techniques, virtual environments and prototyping
• Applications of AI, knowledge-based systems, computational intelligence, including fuzzy logic, neural networks and evolutionary computations
• Application of object-oriented technology to engineering problems
• Intelligent human computer interfaces
• Design automation, multidisciplinary design and optimization
• CAD, CAE and integrated process and product development systems
• Quality and reliability.