Soil organic carbon (SOC) is a critical indicator of soil health and carbon cycling, essential for sustainable agricultural development and climate change mitigation. Visible near-infrared reflectance spectroscopy (vis-NIR) technology combined with machine learning models has shown potential for rapid and accurate SOC prediction. However, due to significant spatial heterogeneity in SOC distribution across different regions, global models based on soil spectral libraries (SSLs) often lack sufficient generalization capability for local applications. To address this challenge, we propose an innovative sample selection framework for instance-based transfer learning, KNN-FRSS, a forward recursive sample selection method combined with K-nearest neighbor algorithm. This framework identifies the most representative samples from the SSL for the target region using a two-step process: step 1: preliminary selection by local similarity metric, and step 2: precise selection by forward recursive sample selection mechanism, thereby enhancing the adaptability of cross-regional SOC modeling. We employed three predictive models (1D-CNN, Cubist, and PLSR) to evaluate the transferability of the KNN-FRSS strategy in cross-region modeling. In addition, we compared the performance of KNN-FRSS with another sample selection method (RS-LOCAL-v2.0) and feature-based transfer learning approach. These transfer learning methods were evaluated across four distinct regions. The results indicate that all transfer learning methods improved model predictive accuracy in four study regions. Notably, the combination of KNN-FRSS with the 1D-CNN model consistently outperformed the others. Compared to the best-performing models built using local data, this combined approach achieved an improvement in R²ranging from 3 % to 26 %, and a reduction in RMSE by 12.82–43.4 %. Finally, this study provides a feasible path to enhance the effectiveness of transfer learning in soil spectral modeling, and provides methodological support for rapid and high-accuracy SOC prediction across diverse geographic regions.
扫码关注我们
求助内容:
应助结果提醒方式:
