Impact of multi-scale predictor selection for modeling soil properties

Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data ava...

Full description

Saved in:
Bibliographic Details
Published inGeoderma Vol. 239-240; pp. 97 - 106
Main Authors Miller, Bradley A., Koszinski, Sylvia, Wehrhan, Marc, Sommer, Michael
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.02.2015
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Applying a data mining tool used regularly in digital soil mapping, this research focuses on the optimal inclusion of predictors for soil–landscape modeling by utilizing as wide of a pool of variables as possible. Predictor variables for digital soil mapping are often chosen on the basis of data availability and the researcher's expert knowledge. Predictor variables commonly overlooked include alternative analysis scales for land-surface derivatives and additional remote sensing products. For this study, a pool of 412 potential predictors was assembled, which included qualitative location classes, elevation, land-surface derivatives (with a wide range of analysis scales), hydrologic indicators, as well as proximal and remote sensing (from multiple sources with a variety of resolutions). Subsets of the full pool were also examined for comparison. The performance for the models built from the different starting predictor pools was analyzed for seven target variables. Results suggest that models with limited predictor pools can substitute other predictors to compensate for the missing variables. However, a better performing model was always found by considering predictor variables at multiple scales. Compared with baseline subsets with the most commonly used predictors for digital soil mapping at a single scale, the use of multi-scale predictor variables produced an improvement in model performance ranging from negligible to a 70% increase in the adjusted R2. Although the scale effect of the modifiable area unit problem is generally well known, this study suggests digital soil mapping efforts would be enhanced by the greater consideration of predictor variables at multiple analysis scales. •Potentially useful predictors for digital soil mapping are often overlooked.•Different analysis scales should be treated as unique predictor variables.•The use of multi-scale predictor variables can greatly increase model performance.•Experimentation with subsets of predictor pools for data mining tools can be productive.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0016-7061
1872-6259
DOI:10.1016/j.geoderma.2014.09.018