{"title":"Data transformations cause altered edaphic-climatic controls and reduced predictability on soil carbon decomposition rates","authors":"Daifeng Xiang, Gangsheng Wang, Zehao Lv, Wanyu Li, Jing Tian","doi":"10.1002/saj2.20759","DOIUrl":null,"url":null,"abstract":"<p>Data transformation of the reference soil organic matter (SOM) decomposition rates (<i>k</i><sub>ref</sub>), often derived as turnover times or in alternative formats, is commonly used to develop ecological models for projecting the persistence of SOM. However, the effects of reciprocal or logarithmic transformation of <i>k</i><sub>ref</sub> on model performance and edaphic-climatic patterns remain uncertain. Here, we convert published <i>k</i><sub>ref</sub> values into reciprocal or logarithmic formats and establish machine learning models between the transformed <i>k</i><sub>ref</sub> and edaphic-climatic predictors. We show that models trained with the transformed <i>k</i><sub>ref</sub> exhibit a 11.6%−68.4% reduction in performance upon re-conversion to <i>k</i><sub>ref</sub> compared to those trained with the original <i>k</i><sub>ref</sub>. The variable importance analysis identifies distinct key predictors governing the original <i>k</i><sub>ref</sub> and its transformed counterparts. This suggests that data transformation alters the relative significance of predictors without necessarily improving <i>k</i><sub>ref</sub> prediction performance. Consequently, our study underscores the importance of directly focusing on the original values rather than alternative representations when dissecting a given variable's patterns and mechanisms in ecological modeling.</p>","PeriodicalId":101043,"journal":{"name":"Proceedings - Soil Science Society of America","volume":"88 6","pages":"1971-1982"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings - Soil Science Society of America","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/saj2.20759","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Data transformation of the reference soil organic matter (SOM) decomposition rates (kref), often derived as turnover times or in alternative formats, is commonly used to develop ecological models for projecting the persistence of SOM. However, the effects of reciprocal or logarithmic transformation of kref on model performance and edaphic-climatic patterns remain uncertain. Here, we convert published kref values into reciprocal or logarithmic formats and establish machine learning models between the transformed kref and edaphic-climatic predictors. We show that models trained with the transformed kref exhibit a 11.6%−68.4% reduction in performance upon re-conversion to kref compared to those trained with the original kref. The variable importance analysis identifies distinct key predictors governing the original kref and its transformed counterparts. This suggests that data transformation alters the relative significance of predictors without necessarily improving kref prediction performance. Consequently, our study underscores the importance of directly focusing on the original values rather than alternative representations when dissecting a given variable's patterns and mechanisms in ecological modeling.