SciELO - Scientific Electronic Library Online

 
vol.51 issue9RapidEye image processing for soil use mapping in rugged landscapeSoil chemical attributes and organic matter quality in sugarcane harvest systems with and without burning author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

Share


Pesquisa Agropecuária Brasileira

Print version ISSN 0100-204XOn-line version ISSN 1678-3921

Abstract

CARVALHO JUNIOR, Waldir de et al. Multiple linear regression and Random Forest model to estimate soil bulk density in mountainous regions. Pesq. agropec. bras. [online]. 2016, vol.51, n.9, pp.1428-1437. ISSN 0100-204X.  https://doi.org/10.1590/s0100-204x2016000900041.

The objective of this work was the development of models with different sets of data for estimating soil bulk density in tropical mountainous regions, from soil attributes commonly found in the analyses of soil profiles described in regional surveys. The complete dataset is composed of 163 samples and it was divided into six groups, of which three groups have 73 samples and the maximum of 32 covariables, and three have 163 samples and the maximum of 18 covariables. The linear regression (RLM) and randomForest (RF) models were tested. The lowest uncertainty between the models was achieved by RLM2, with R2 of 0.56, 13 covariables, and 73 samples. Considering the groups with 163 samples, the best models were the RFs with mean R2 of 0.48. The root mean squared error ranged between 0.09 and 0.14. The most important covariables in the RF model were: organic carbon, hydrogen, fine and coarse sand, base saturation, and cation exchange capacity. By the stepwise backward regression, the main covariables were: silt and clay relation; fine and coarse sand; organic carbon; base saturation; and potassium.

Keywords : carbon stock; pedotransfer functions; data-driven models; stepwise.

        · abstract in Portuguese     · text in Portuguese     · Portuguese ( pdf )