{"title":"聚类方法在数字土壤制图中的应用——以土壤质地区划为例","authors":"I. Dunkl, Mareike Ließ","doi":"10.5194/SOIL-2020-102","DOIUrl":null,"url":null,"abstract":"Abstract. High resolution soil maps are urgently needed by land managers and researchers for a variety of applications. Digital Soil Mapping (DSM) allows to regionalize soil properties by relating them to environmental covariates with the help of an empirical model. In this study, a legacy soil data set was used to train a machine learning algorithm in order to predict the particle size distribution within the catchment of the Bode river in Saxony-Anhalt (Germany). The ensemble learning method random forest was used to predict soil texture based on environmental covariates originating from a digital elevation model, land cover data and geologic maps. We studied the usefulness of clustering applications in addressing various aspects of the DSM procedure. To investigate the role of the imbalanced data problem in the learning process, the environmental variables were used to cluster the landscape of the study area. Different sampling strategies were used to create balanced training data and were evaluated on their ability to improve model performance. Clustering applications were also involved in feature selection and stratified cross-validation. Overall, clustering applications appear to be a versatile tool to be employed at various steps of the DSM procedure. Beyond their successful application, further application fields in DSM were identified. One of them is to find adequate means to include expert knowledge.\n","PeriodicalId":22015,"journal":{"name":"Soil Science","volume":"73 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On the benefits of clustering approaches in digital soil mapping: an application example concerning soil texture regionalization\",\"authors\":\"I. Dunkl, Mareike Ließ\",\"doi\":\"10.5194/SOIL-2020-102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract. High resolution soil maps are urgently needed by land managers and researchers for a variety of applications. Digital Soil Mapping (DSM) allows to regionalize soil properties by relating them to environmental covariates with the help of an empirical model. In this study, a legacy soil data set was used to train a machine learning algorithm in order to predict the particle size distribution within the catchment of the Bode river in Saxony-Anhalt (Germany). The ensemble learning method random forest was used to predict soil texture based on environmental covariates originating from a digital elevation model, land cover data and geologic maps. We studied the usefulness of clustering applications in addressing various aspects of the DSM procedure. To investigate the role of the imbalanced data problem in the learning process, the environmental variables were used to cluster the landscape of the study area. Different sampling strategies were used to create balanced training data and were evaluated on their ability to improve model performance. Clustering applications were also involved in feature selection and stratified cross-validation. Overall, clustering applications appear to be a versatile tool to be employed at various steps of the DSM procedure. Beyond their successful application, further application fields in DSM were identified. One of them is to find adequate means to include expert knowledge.\\n\",\"PeriodicalId\":22015,\"journal\":{\"name\":\"Soil Science\",\"volume\":\"73 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Soil Science\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.5194/SOIL-2020-102\",\"RegionNum\":4,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Agricultural and Biological Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soil Science","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.5194/SOIL-2020-102","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
On the benefits of clustering approaches in digital soil mapping: an application example concerning soil texture regionalization
Abstract. High resolution soil maps are urgently needed by land managers and researchers for a variety of applications. Digital Soil Mapping (DSM) allows to regionalize soil properties by relating them to environmental covariates with the help of an empirical model. In this study, a legacy soil data set was used to train a machine learning algorithm in order to predict the particle size distribution within the catchment of the Bode river in Saxony-Anhalt (Germany). The ensemble learning method random forest was used to predict soil texture based on environmental covariates originating from a digital elevation model, land cover data and geologic maps. We studied the usefulness of clustering applications in addressing various aspects of the DSM procedure. To investigate the role of the imbalanced data problem in the learning process, the environmental variables were used to cluster the landscape of the study area. Different sampling strategies were used to create balanced training data and were evaluated on their ability to improve model performance. Clustering applications were also involved in feature selection and stratified cross-validation. Overall, clustering applications appear to be a versatile tool to be employed at various steps of the DSM procedure. Beyond their successful application, further application fields in DSM were identified. One of them is to find adequate means to include expert knowledge.
期刊介绍:
Cessation.Soil Science satisfies the professional needs of all scientists and laboratory personnel involved in soil and plant research by publishing primary research reports and critical reviews of basic and applied soil science, especially as it relates to soil and plant studies and general environmental soil science.
Each month, Soil Science presents authoritative research articles from an impressive array of discipline: soil chemistry and biochemistry, physics, fertility and nutrition, soil genesis and morphology, soil microbiology and mineralogy. Of immediate relevance to soil scientists-both industrial and academic-this unique publication also has long-range value for agronomists and environmental scientists.