{"title":"PS-MTL-LUCAS: A partially shared multi-task learning model for simultaneously predicting multiple soil properties","authors":"Zhaoyu Zhai, Fuji Chen, Hongfeng Yu, Jun Hu, Xinfei Zhou, Huanliang Xu","doi":"10.1016/j.ecoinf.2024.102784","DOIUrl":null,"url":null,"abstract":"Soil acts as a foundation for human survival and social development and soil quality has a great effect on the growth of agricultural products. Visible/near-infrared spectroscopy has been acknowledged as a rapid and non-destructive method for predicting soil properties, and multi-task learning is a preferable approach to analyze the correlation between the spectroscopy data and soil properties. However, current multi-task learning models with the soft parameter sharing structure extremely rely on the task relatedness. To tackle this limitation, we proposed PS-MTL-LUCAS, a multi-task learning with a partially shared structure in this study. An additional shared layer was utilized to learn the general informative representations and interact with each task-specific layer. The partially shared structure ensured the maximum information flow between layers, thereby boosting the prediction performance. Also, the SHapley Addictive exPlanations (SHAP) algorithm was adopted to extract the feature wavelengths of each soil property. PS-MTL-LUCAS was validated on the LUCAS topsoil dataset (2009), and the experimental result suggested that PS-MTL-LUCAS dominated state-of-the-art models by achieving the determination of coefficient at 0.945, 0.936, 0.413, 0.624, 0.837, 0.952, and 0.956 for pH, N, P, K, CEC, OC, and CaCO, respectively. In summary, this study highlighted the use of the soil spectroscopy and multi-task learning techniques in the soil property prediction task and provided a very promising approach for soil studies.","PeriodicalId":51024,"journal":{"name":"Ecological Informatics","volume":"10 1","pages":""},"PeriodicalIF":5.8000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecological Informatics","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.ecoinf.2024.102784","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Soil acts as a foundation for human survival and social development and soil quality has a great effect on the growth of agricultural products. Visible/near-infrared spectroscopy has been acknowledged as a rapid and non-destructive method for predicting soil properties, and multi-task learning is a preferable approach to analyze the correlation between the spectroscopy data and soil properties. However, current multi-task learning models with the soft parameter sharing structure extremely rely on the task relatedness. To tackle this limitation, we proposed PS-MTL-LUCAS, a multi-task learning with a partially shared structure in this study. An additional shared layer was utilized to learn the general informative representations and interact with each task-specific layer. The partially shared structure ensured the maximum information flow between layers, thereby boosting the prediction performance. Also, the SHapley Addictive exPlanations (SHAP) algorithm was adopted to extract the feature wavelengths of each soil property. PS-MTL-LUCAS was validated on the LUCAS topsoil dataset (2009), and the experimental result suggested that PS-MTL-LUCAS dominated state-of-the-art models by achieving the determination of coefficient at 0.945, 0.936, 0.413, 0.624, 0.837, 0.952, and 0.956 for pH, N, P, K, CEC, OC, and CaCO, respectively. In summary, this study highlighted the use of the soil spectroscopy and multi-task learning techniques in the soil property prediction task and provided a very promising approach for soil studies.
期刊介绍:
The journal Ecological Informatics is devoted to the publication of high quality, peer-reviewed articles on all aspects of computational ecology, data science and biogeography. The scope of the journal takes into account the data-intensive nature of ecology, the growing capacity of information technology to access, harness and leverage complex data as well as the critical need for informing sustainable management in view of global environmental and climate change.
The nature of the journal is interdisciplinary at the crossover between ecology and informatics. It focuses on novel concepts and techniques for image- and genome-based monitoring and interpretation, sensor- and multimedia-based data acquisition, internet-based data archiving and sharing, data assimilation, modelling and prediction of ecological data.