Status: this discussion paper is a preprint. It has been under review for the journal Hydrology and Earth System Sciences (HESS). The manuscript was not accepted for further review after discussion.
Compositional balance should be considered in soil particle-size fractions mapping using hybrid interpolators
Mo Zhangand Wenjiao Shi
Abstract. Digital soil mapping of soil particle-size fractions (PSFs) using log-ratio methods has been widely used. As a hybrid interpolator, regression kriging (RK) is an alternative way to improve prediction accuracy. However, there is still a lack of systematic comparison and recommendation when RK was applied for compositional data. Whether performance based on different balances of isometric log-ratio (ILR) transformation is robust. Here, we systematically compared the generalized linear model (GLM), random forest (RF), and their hybrid pattern (RK) using different balances of ILR transformed data of soil PSFs with 29 environmental covariables for prediction of soil PSFs on the upper reaches of the Heihe River Basin. The results showed that RF had better performance with more accurate predictions, but GLM had a more unbiased prediction. For the hybrid interpolators, RK was recommended because it widened data ranges of the prediction results, and modified bias and accuracy for most models, especially for RF. The drawback, however, existed due to the data distributions and model algorithms. Moreover, prediction maps generated from RK demonstrated more details of soil sampling points. Three ILR transformed data based on sequential binary partitions (SBP) made different distributions, and it is not recommended to use the most abundant component of compositions as the first component of permutations. This study can reference spatial simulation of soil PSFs combined with environmental covariables and transformed data at a regional scale.
Received: 23 Jul 2020 – Discussion started: 31 Aug 2020
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
School of Earth Sciences and Resources, China University of Geosciences, Beijing 100083, China
Wenjiao Shi
Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China
College of Resources and Environment, University of Chinese Academy of Sciences, Beijing 100049, China
We used generalized linear model (GLM), random forest (RF), and their hybrid patterns (regression kriging, RK) to map soil particle-size fractions (PSF) using isometric log-ratio (ILR) transformed data. RF and GLM did well in accuracy and bias, respectively. RK can modify bias and accuracy for most models. Different ILR transformed data based on sequential binary partitions was also discussed further. This study can reference soil PSF spatial simulation and how to choose the ILR balances.
We used generalized linear model (GLM), random forest (RF), and their hybrid patterns...