Acessibilidade / Reportar erro

GAUSSIAN SPATIAL LINEAR MODEL OF SOYBEAN YIELD USING BOOTSTRAP METHODS

ABSTRACT

This study aims to quantify the uncertainties associated to the parameters of a Gaussian spatial linear model (GSLM) and the assumption of normality residuals in the modeling of the spatial dependence of the soybean yield as a function of soil chemical attributes. The spatial bootstrap methods were used to determine the point and interval estimators associated with the model parameters. Hypothesis tests were carried out on the significance of the model parameters and the quantile-quantile probability plot was elaborated to verify the data normality. The uncertainties associated to the parameters of the spatial dependence structure were quantified and the potassium content, phosphorus content and soil pH covariates were significant to explain the soybean yield mean. These covariates were used in the elaboration of a new model, which provided the elaboration of a contour map of soybean yield. Analysis of the quantile-quantile plot indicated that soybean yield data follow a normal probability distribution.

KEYWORDS
geostatistics; spatial resampling; spatial variability

INTRODUCTION

Soybean is the most important legume species in the world (Gallon et al., 2016Gallon M, Buzzello GL, Trezzi MM, Diesel F, Silva HL (2016) Ação de herbicidas inibidores da PROTOX sobre o desenvolvimento, acamamento e produtividade da soja. Revista Brasileira de Herbicidas 15(3):232-240. DOI: http://dx.doi.org/10.7824/rbh.v15i3.471
http://dx.doi.org/10.7824/rbh.v15i3.471...
) and it stands out in the Brazilian agribusiness for its high productive potential in different regions (Cruz et al., 2016Cruz SCS, Sena-Junior DG, Santos DMA, Lunezzo LO, Machado CG (2016) Cultivo de soja sob diferentes densidades de semeadura e arranjos espaciais. Revista de Agricultura Neotropical 3(1):1-6.). Soybean is widely used for the elaboration of animal feed and oil and as Ávila & Albrecht (2010)Ávila MR, Albrecht LP (2010) Isoflavonas e a qualidade das sementes de soja. Informativo Abrates 20(1):15-29. explained, it has been emphasized as an alternative in the prevention of diseases and in human food, being able to be transformed into several protein foods.

Among the studies involving soybean yield, we highlight the geostatistics methods used to detect the spatial variability in the crops (Borssoi et al., 2011Borssoi JA, Uribe-Opazo MA, Galea M (2011) Técnicas de diagnóstico de influência local na análise da produtividade da soja. Engenharia Agrícola 31(2):376-387. DOI: http://dx.doi.org/10.1590/S0100-69162011000200018
http://dx.doi.org/10.1590/S0100-69162011...
; Kestring et al., 2015Kestring FBF, Guedes LPC, De Bastiani F, Uribe-Opazo, MA (2015) Comparação de mapas temáticos de diferentes grades amostrais para a produtividade da soja. Engenharia Agrícola 35(4):733-743. DOI: http://dx.doi.org/10.1590/1809-4430-Eng.Agric.v35n4p733-743/2015
http://dx.doi.org/10.1590/1809-4430-Eng....
; Guedes et al., 2016Guedes LPC, Ribeiro-Jr PJ, Uribe-Opazo MA, De Bastiani F (2016) Mapas da produtividade da soja usando configurações amostrais regulares e otimizadas pela têmpera simulada. Engenharia Agrícola 36(1):114-125. DOI: http://dx.doi.org/10.1590/1809-4430-Eng.Agric.v36n1p114-125/2016
http://dx.doi.org/10.1590/1809-4430-Eng....
). Although geostatistics allows the understanding of spatial variability of soybean yield, the samples used in the analyzes are generally few and sparse (Pardo-Igúzquiza & Olea, 2012Pardo-Igúzquiza E, Olea RA (2012) VARBOOT: A spatial bootstrap program for semivariogram uncertainty assessment. Computers & Geosciences 41(1):188-198. DOI: http://dx.doi.org/10.1016/j.cageo.2011.09.002.
http://dx.doi.org/10.1016/j.cageo.2011.0...
), then there are uncertainties associated with the results obtained.

The uncertainties occur because the spatial linear model is estimated by parametric methods, such as those presented by Mardia & Marshall (1984)Mardia KV, Marshall RJ (1984) Maximum likelihood estimation of models for residual covariance in spatial regression. Biometrika 71(1):135-146. DOI: http://dx.doi.org/10.2307/2336405
http://dx.doi.org/10.2307/2336405...
. To quantify the uncertainties associated with the results of a geostatistics analysis, an alternative to traditional inference methods is the use of spatial bootstrap resampling (SB) (Solow, 1985Solow A (1985) Bootstrapping correlated data. Mathematical Geology 17(7):769-775. DOI: http://dx.doi.org/10.1007/BF01031616
http://dx.doi.org/10.1007/BF01031616...
) and parametric spatial bootstrap (PSB) methods (Tang et al., 2006Tang L, Schucany W, Woodward W, Gunst R (2006) A parametric spatial bootstrap. Technical Report SMU-TR-337. Dallas, Southern Methodist University.), adaptations of the bootstrap method (Efron, 1979Efron B (1979) Bootstrap methods: Another look at the jackknife. Annals of Statistics 7(1):1-26) for spatially dependent data.

The bootstrap method is well known and has been used in studies involving independent samples of soybean yield (Dalposso et al., 2016Dalposso GH, Uribe-Opazo MA, Johann JA (2016) Soybean yield modeling using bootstrap methods for small samples. Spanish Journal of Agricultural Research 14(3). DOI: http://dx.doi.org/10.5424/sjar/2016143-8635
http://dx.doi.org/10.5424/sjar/2016143-8...
; Gupta & Manjaya, 2016Gupta SK, Manjaya JG (2016) Assessment of genetic variation at soybean mosaic virus resistance loci in Indian Soybean (Glycine max L. Merill) genotypes using SSR markers. Eletronic Journal of Plant Breeding 7(2):392-400. DOI: http://dx.doi.org/10.5958/0975-928X.2016.00048.X
http://dx.doi.org/10.5958/0975-928X.2016...
). The bootstrap methods for spatially dependent data have been highlighted in the literature, due to the importance of uncertainty modeling in the analyzes, as can be observed in the works of Kang et al. (2008)Kang C, Choi S, Yoo SH (2008) A spatial bootstrap method for kriging variance. Journal of the Korean Data Analysis Society 10(3):1247-1254., Schelin & Sjöstedt-De Luna (2010)Schelin L, Sjöstedt-De Luna S (2010) Kriging prediction intervals based on semiparametric bootstrap, Mathematical Geosciences 42(1):985-1000. DOI: http://dx.doi.org/10.1007/s11004-010-9302-9
http://dx.doi.org/10.1007/s11004-010-930...
, Olea & Pardo-Igúzquiza (2011)Olea RA, Pardo-Igúzquiza E (2011) Generalized bootstrap method for assessment of uncertainty in semivariogram inference. Mathematical Geosciences 43(2):203-228. DOI: http://dx.doi.org/10.1007/s11004-010-9269-6.
http://dx.doi.org/10.1007/s11004-010-926...
and Pardo-Igúzquiza & Olea (2012)Pardo-Igúzquiza E, Olea RA (2012) VARBOOT: A spatial bootstrap program for semivariogram uncertainty assessment. Computers & Geosciences 41(1):188-198. DOI: http://dx.doi.org/10.1016/j.cageo.2011.09.002.
http://dx.doi.org/10.1016/j.cageo.2011.0...
.

The aim of this study was to use spatial bootstrap (SB) and parametric spatial bootstrap (PSB) methods to quantify the uncertainties associated with spatial dependence structure parameters and the assumption of residuals normality distributed in spatial dependence modeling of soybean yield with chemical attributes of the soil considered as covariates.

MATERIAL AND METHODS

Study area and data

The data set was collected in the 2012/2013 agricultural year and comes from an area (Figure 1) of 167.35 ha located in the western region of Paraná, Brazil, near the municipality of Cascavel, with central coordinates defined by latitude 24°57'25”S and longitude 53°34'29”W, and average altitude of 714 m. According to the Köppen classification, the climate of the agricultural area is the Cfa type (Aparecido et al., 2016Aparecido LE, Rolim GS, Richetti J, Souza PS, Johann JA (2016) Köppe, Thornthwaite and Camargo climate classifications for climatic zoning in the State of Paraná, Brazil. Ciência e Agrotecnologia 40(4):405-417. DOI: http://dx.doi.org/10.1590/1413-70542016404003916
http://dx.doi.org/10.1590/1413-705420164...
) and the soil is classified as Oxisol (Embrapa, 2009Embrapa (2009) Sistema brasileiro de classificação de solos. Rio de Janeiro, Embrapa.).

FIGURE 1
Geographic location map of the study area.

A systematic centered sampling with lattice plus close pairs was performed composed of 99 sample elements (Figure 1), georeferenced with a GEOEXPLORE 3 GPS device with a precision of 5 m. At each sampling point, the soybean yield (Prod, t ha−1) and the following soil chemical attributes were determined: calcium (Ca, cmolc dm−3), magnesium (Mg, cmolc dm−3), potassium (K, mg dm−3), phosphorus (P, mg dm−3), manganese (Mn, mg dm−3) and soil pH.

Geostatistical analysis

Consider a Gaussian stochastic process {Z(s), s ϵ S} with S ⊂ ℜ2, being ℜ2 the two- dimensional Euclidean space. Suppose that the data of this process, Z(s1), …, Z(sn) are recorded in known spatial locations si (i = 1,…,n) and generated by the following model:

(1) Z ( s ) = μ ( s ) + ε ( s )

where,

Z(s): an n × 1 vector of the variable of interest;

μ(s): an n × 1 vector that represents the mean of the process, and

ε(s); an n × 1 vector of errors, with zero mean vector E[ε(s)]=0 and covariance matrix Σ = C[(σiu)], where σiu = cov(ε(si), ε(su)), for i = u = 1, …,n

The mean vector μ(s) can be written as a linear model μ(s) = Xβ, where β = (β1 ,…, βp)T is a p × 1 vector of unknown parameters and X = X(s) is an n × p matrix formed with p covariates at site s ϵ S. According to Mardia & Marshall (1984)Mardia KV, Marshall RJ (1984) Maximum likelihood estimation of models for residual covariance in spatial regression. Biometrika 71(1):135-146. DOI: http://dx.doi.org/10.2307/2336405
http://dx.doi.org/10.2307/2336405...
, the covariance matrix can be expressed in the parametric form Σ = φ1In + φ2 R (φ3) where In is an n × n identity matrix, φ1 ≥ 0 is the parameter called nugget effect, φ2 ≥ 0 the parameter called contribution, φ3 ≥ 0 is the parameter that defines the range (α) of the model and R(φ3) = [(rij)] is a n × n symmetric matrix. The elements rij of the matrix R, for i, j = 1, …, n, represent the correlation function between the points si and sj, with rij = 1 if i = j ; rij = 0 if ij and φ2 = 0 and rij= φ2−1 σij if ij and φ2−1 ≠ 0, with σij = C(hij) is a function that depends on hij = ‖si - sj‖, which is the Euclidean distance between points si and sj. Thus, Z(s) follows a noπnal n-variate distribution with mean vector μ(s) and covariance matrix Σ (De Bastiani et al., 2015De Bastiani F, Cysneiros AHMA, Uribe-Opazo MA, Galea M (2015) Influence diagnostics in elliptical spatial linear models. Test 24(2):322-340. DOI: http://dx.doi.org/10.1007/s11749-014-0409-z
http://dx.doi.org/10.1007/s11749-014-040...
).

In order to identify the spatial dependence structure of soybean yield, the omnidirectional experimental semivariogram Y(hij) was constructed using the Matheron estimator (Cressie, 2015Cressie N (2015) Statistics for spatial data. New York, John Wiley & Sons. 928p.) and to model the structure of spatial dependence were used models of the Matérn family (Matérn, 1986Matérn B (1986) Spatial variation. Berlin, Springer-Verlag, 2 ed. 151p.), presented in [eq. (2)]:

(2) C ( h i j ) = { φ 1 + φ 2 , φ 2 2 k-1 Γ ( k ) ( h i j φ 3 ) k K k ( h i j φ 3 ) , i = j i j

where,

Kk(.); Bessel function of the third order type k > 0;

Γ(.): Gamma function, and

hij: Euclidean distance between points si and sj.

For second-order stationary Gaussian and isotropic processes, the semivariance function has the following relation: γ (hij) = C(0) – C(hij) (Uribe-Opazo et al., 2012Uribe-Opazo MA, Borssoi JA, Galea, M (2012) Influence diagnostics in gaussian spatial linear models. Journal of Applied Statistics 3(39):615-630. DOI: http://dx.doi.org/10.1080/02664763.2011.607802.
http://dx.doi.org/10.1080/02664763.2011....
), for i, j = 1,…,n, being hij ≥ 0 and C(0) the variance of Z. in [eq. (2)], the form parameter k controls the behavior close to the origin and the process analytical smoothing. In this study, we considered the fixed values k = {0.5;1.5;2.5;4.5} (Diggle & Ribeiro-Jr., 2007Diggle PJ, Ribeiro-Jr PJ (2007) Model-based geostatistics. New York, Springer. 232p.).

The parameters estimation of the models was carried out using the maximum likelihood (ML) method and the criteria for selecting the geostatistics model for the covariance matrix were the cross-validation (CrV), the trace of the asymptotic covariance matrix of the estimated mean (Tr) and the maximum value of the loglikelihood function (LML) (De Bastiani et al., 2015De Bastiani F, Cysneiros AHMA, Uribe-Opazo MA, Galea M (2015) Influence diagnostics in elliptical spatial linear models. Test 24(2):322-340. DOI: http://dx.doi.org/10.1007/s11749-014-0409-z
http://dx.doi.org/10.1007/s11749-014-040...
). To investigate the significance of covariates, the likelihood ratio test (Rao, 1973Rao CR (1973) Linear statistical inference and its applications. New York, John Wiley e Sons. 656p.) was used, and the kriging with external drift was used to obtain the predicted values (Wackernagel, 1995Wackernagel H (1995) Multivariate geostatistics. Berlin, Springer-Verlag. 256p.).

Bootstrap for spatial data

In this study, the spatial bootstrap (SB) methods, proposed by Solow (1985)Solow A (1985) Bootstrapping correlated data. Mathematical Geology 17(7):769-775. DOI: http://dx.doi.org/10.1007/BF01031616
http://dx.doi.org/10.1007/BF01031616...
and presented in Algorithm 1 and parametric spatial bootstrap (PSB), proposed by Tang et al. (2006)Tang L, Schucany W, Woodward W, Gunst R (2006) A parametric spatial bootstrap. Technical Report SMU-TR-337. Dallas, Southern Methodist University. and presented in Algorithm 2 were used.

Algorithm 1: Spatial Bootstrap (Solow, 1985Solow A (1985) Bootstrapping correlated data. Mathematical Geology 17(7):769-775. DOI: http://dx.doi.org/10.1007/BF01031616
http://dx.doi.org/10.1007/BF01031616...
).

a) Considering the spatial data set {Z(S1),…, Z(sn)} determine the vector of residuals ε^(s)=Z(s)μ^ where μ^=(XT^1X)1XT^1Z is the ML estimator of μ and ^ the ML estimator of the covariance matrix Σ. b) Considering the estimated covariance matrix ^, use the Cholesky decomposition method to obtain ^=L^L^T, that L^ is a lower triangular matrix of order n; c) Using the inverse matrix L^1, determine ε^unc=L^1ε^, the vector of uncorrelated residuals; d) Considering the elements of the vector of uncorrelated residuals, carry out n resamples with replacement to form the vector of bootstrap residuals εSB*=(ε1*,,εn*)T; e) The spatial bootstrap sample is obtained by recorrelating bootstrap residuals Z*=μ^+L^εSB*.

Algorithm 2: Parametric Spatial Bootstrap (Tang et al., 2006Tang L, Schucany W, Woodward W, Gunst R (2006) A parametric spatial bootstrap. Technical Report SMU-TR-337. Dallas, Southern Methodist University.).

a) Considering the spatial data set {Z(s1),…, Z(sn)}, determine the residue vector ε^(s)=Z(s)μ^ being μ^=(XT^1X)1XT^1Z the ML estimator of μ and ^ the ML estimator of Σ ; b) Use the Cholesky decomposition method to obtain ^=L^L^T, that L^ is a lower triangular matrix of order n; c) Use the standard normal distribution N (0.1) to create a vector of size n, called the parametric bootstrap vector εPSB*=(ε1*,,εn*)T, that εj*N(0,1) for j = 1, …,n; d) The spatial bootstrap sample is obtained by recorrelating bootstrap residuals Z*=μ^+L^εPSB*.

As Tang et al. (2006)Tang L, Schucany W, Woodward W, Gunst R (2006) A parametric spatial bootstrap. Technical Report SMU-TR-337. Dallas, Southern Methodist University. explained, the BSP method does not uncorrelated the residuals vector ε^ as in the BS method. Instead, the residuals are generated independent from a standard normal distribution. The theoretical basis of the BSP method can be seen in the study of Sjöstedt-De Luna & Young (2003)Sjöstedt-De Luna S, Young (2003) The bootstrap and kriging prediction intervals. Scandinavian Journal of Statistics 30(1):175-192..

Quantification of uncertainties in geostatistics analysis

The Algorithms 1 and 2 were used to determine B = 1000 bootstrap samples from the soybean yield dataset. For each sample, a model was adjusted, which allowed to construct the empirical distribution of the parameters of the model and, consequently, to determine point and interval estimators of the parameters using the percentile bootstrap method (Efron, 1982Efron B (1982) The jackknife, the bootstrap and other resampling plans. Philadelphia, SIAM. 93p.).

To verify the assumption of normality, the uncorrelated residuals ε^unc obtained by the spatial bootstrap method (Algorithm 1) were ordered and plotted versus the theoretical quantis of the standard normal distribution, resulting in a quantile-quantile plot (q-q plot).

Computational resources

The calculations performed in this study were developed in the R software (R Core Team, 2017R Core Team (2017) R: A language and environment for statistical computing. R foundation for statistical computing. Vienna.). The criteria used to choose the best model, the likelihood ratio test, the bootstrap methods and the multivariate plots q-q plots were implemented by the authors.

RESULTS AND DISCUSSION

According to Conab (2016)Conab (2016) Soja – Brasil: Série histórica de produtividade. Available: http://www.conab.gov.br. Accessed: Nov 17, 2016.
http://www.conab.gov.br...
, the average soybean yield in the study area (Table 1) was lower than the average of the State of Paraná (3.38 t ha−1) and higher than the national average (2.94 t ha−1).

TABLE 1
Descriptive analysis of soybean yield and covariates.

The calcium (Ca) contents varied from 2.30 cmolc-dm−3 to 13.10 cmolc-dm−3 and the magnesium (Mg) contents ranged from 0.85 cmolc dm−3 to 5.75 cmolc dm−3 (Table 1), and they were classified as high (Tomé Jr., 1997Tomé Jr JB (1997) Manual para interpretação de análise de solo. Guaíba, Agropecuária. 247p.). Analyzing the classification of the coefficient of variation (CV) (Table 1) proposed by Pimentel Gomes (1985)Pimentel Gomes, F (1985) Curso de estatística experimental. São Paulo, Nobel. 467p., the high dispersion of potassium (K) and phosphorus (P) contents and homogeneity of pH data are highlighted.

In Table 2, in all models of spatial dependence fitted for covariance matrix (Σ), the estimated values of the nugget effect parameter (φ1) were equal to zero, the estimated values of the contribution parameter 2) were similar and the estimated values of the range parameter ((φ)3) indicated a radius of spatial dependence ranging from 109 to 112 m. According to the criteria CrV, LML and Tr (Table 2), the best fit for the covariance matrix (Σ) was provided by the Matérn model with parameter k = 4.5.

TABLE 2
Criteria for selecting the soybean yield model elaborated with covariates considering the covariance function Matérn with different form parameters.

According to the fitted Gaussian linear spatial model (GSLM), the value associated with the potassium content (K) (Table 3) was the highest and presented a positive sign, indicating that it was the covariate that contributed the most to an increase in the average of the soybean yield (t ha-1). This result corroborates with the study of Dos Passos et al. (2015)Dos Passos AMA, Rezende PM, Carvalho ER, Ávila FW (2015) Biochar, farmyard manure and poultry litter on chemical attributes of a Distrophic Cambissol and soybean crop. Revista Brasileira de Ciências Agrárias 10(3):382-388. DOI: http://dx.doi.org/10.5039/agraria.v10i3a4546
http://dx.doi.org/10.5039/agraria.v10i3a...
, considering that this nutrient is the second most absorbed by the soybean plant, being essential for the growth and development of the plants (Li et al., 2015Li L, Xu L, Wang X, Pan G, Lu L (2015) De novo characterization of the alligator weed (Alternanthera philoxeroides) transcriptome illuminates gene expression under potassium deprivation. Journal of Genetics 94(1):95-104. DOI: http://dx.doi.org/10.1007/s12041-015-0493-1
http://dx.doi.org/10.1007/s12041-015-049...
). The estimates of the parameters associated to the covariates of phosphorus (P) and manganese (Mn) contents indicated that these covariates contributed little to the estimated soybean yield (Table 3).

TABLE 3
Estimated parameters for the linear spatial model by the maximum likelihood method considering the covariance function Matérn with form parameter k = 4.5.

The fact that the nugget effect is null indicates that, at small distances, spatial dependence is also small (Vieira & Gonzalez, 2003Vieira RS, Gonzalez AP (2003) Analysis of the spatial variability of crop yield and soil properties in small agricultural plots. Bragantia 62(1):127-138. DOI: http://dx.doi.org/10.1590/S0006-87052003000100016
http://dx.doi.org/10.1590/S0006-87052003...
), meaning that the distances considered among the samples were adequate.

To evaluate the reliability of the parameter estimates, the descriptive statistics of the bootstrap replicates obtained by the BS and BSP methods were calculated and the Efron percentile confidence intervals were determined (Table 4).

TABLE 4
Descriptive statistics and Efron percentile 95% confidence intervals of the bootstrap distribution of the model parameters of the spatial dependence structure of soybean yield considering the covariates Ca, Mg, K, P, Mn and pH.

From the B = 1000 models adjusted by the BS method, two models (0.2%) were excluded. By the BSP method, only one model (0.1%) was excluded. These exclusions were carried out due to, in these settings; the radius of spatial dependence was greater than the maximum distance between samples (1766 m) in the monitored area. As pointed out by Dalposso et al. (2009)Dalposso GH, Uribe-Opazo MA, Borssoi JA, Johann JA, Mercante E (2009) Previsão da produção de trigo utilizando métodos geoestatísticos. In: Di Leo N, Montico S, Nardón G. Avances en Ingenieria Rural 2007-2009. Rosario, Editorial de la Universidad Nacional de Rosario. p78-86., the models that present this behavior should be discarded, considering information that goes beyond the area under study.

For both bootstrap methods, the confidence intervals for the parameters associated with calcium (CA), magnesium (Mg) and manganese (Mn) contents were zero, indicating that these covariates may not be significant (Table 4).

Although the zero value is contained in the confidence intervals of the bootstrap replicates of the contribution (φ2) (Table 4), there is evidence to assume that the contribution is not null, since in addition to the value obtained in the original sample, 0.18, 94% of the bootstrap replicates obtained by the SB method and 95.1% of the bootstrap replicates obtained by the PSB method presented contribution different than zero. Despite the bootstrap methods used to generate replicates of the soybean yield data set, some models provided high range parameters, emphasizing the positive asymmetry (Table 4).

When comparing the standard deviations of the model parameters (Table 3), with the respective standard deviations obtained by the bootstrap (Table 4), except for the standard deviation of the range parameter (φ3). the others presented similar values. In the analysis of significance of the GSLM parameters (Table 5) the hypotheses ℋ0:βCa = 0, ℋ0 βMg = 0 and ℋ0: βMn = 0 are not rejected at 5% significance, demonstrating that spatial bootstrap methods are an alternative to test the individual effect of covariates.

TABLE 5
Likelihood ratio test (LR) value and p-value for hypotheses ℋ0: βi = 0, i = {Ca, Mg, K, P, Mn, PH} in the linear spatial model.

Due to the parameters associated with calcium (Ca, cmolc dm-3), magnesium (Mg, cmolc dm-3) and manganese (Mn, mg dm-3) contents were not significant (Table 5), a new spatial linear model considering potassium (K, mg dm-3), phosphorus (P, mg dm-3) and soil pH was adjusted, which presented significant parameters.

According to the CrV, LML and Tr criteria, the best fit for the GSLM was provided by the Matérn model with parameter of k form = 4.5 (Table 6).

TABLE 6
Parameters estimated by ML for the linear spatial model considering the covariance function Matérn with form parameter k = 4.5 and the covariates K, P and pH.

Comparing the complete model (Table 3) with the model elaborated with the covariates identified as significant (Table 6), the value associated with potassium content (K) remained high and positive. The value near zero of the parameter associated with the phosphorus content indicates that its influence on the average yield is minimal.

In relation to soil pH, the estimated value of the parameter presented a loss of intensity, going from -0.44 to -0.16. The negative signal of this parameter evidences an inverse relation with the yield, which was already expected, since, in general, the increase of soil pH decreases the availability of the micronutrients (Sousa et al., 2007Sousa DMG, Miranda LN, Oliveira AS (2007) Acidez do solo e sua correção. In: Novais RF, Alvarez VH, Barros NF, Fontes RL, Cantarutti RB, Neves JCL. Fertilidade do solo. SBCS, p205-274.), which, consequently, can result in yield losses.

Observing the productivity contour map (Figure 2) elaborated with the parameters of the fitted spatial linear model (Table 6), the majority of the area (84%) presented yield values varying from 3.175 t ha-1 to 3.592 t ha-1.

FIGURE 2
Map of soybean yield generated using kriging with external drift.

Another characteristic observed in the soybean yield map (Figure 2) is the presence of circular regions centered on the sample points. These regions represent the phenomenon known as bull eyes effect that, as explained by Menezes et al. (2016)Menezes MD, Silva SHG, Mello CR, Owens PR, Curi, N (2016) Spatial prediction of soil properties in two contrasting physiographic regions in Brazil. Scientia Agricola 73(3):274-285. DOI: http://dx.doi.org/10.1590/0103-9016-2015-0071
http://dx.doi.org/10.1590/0103-9016-2015...
, it occurs when the model has a spatial dependence radius less than the distance between neighboring sample points. Analyzing the distances between the pairs of sample points in the monitored area (Figure 1), only 33 pairs of points (0.68%) had a distance less than the spatial dependence radius (121 m). Thus, for presenting a behavior similar to a pure nugget effect, as highlighted by Margalho et al. (2014)Margalho L, Menezes R, Souza I (2014) Assessing interpolation error for space-time monitoring data. Stochastic Environmental Research and Risk Assessment 28(1): 1307-1321. DOI: http://dx.doi.org/10.1007/s00477-013-0826-7
http://dx.doi.org/10.1007/s00477-013-082...
, the map was expected to show constant values.

Analyzing the q-q plot chart (Figure 3), traced points are on or near the straight line that passes through the ordered pairs (Q↓(0.25(theoretical)), Q↓(0.25(observed))) and (Q↓(0.75(theoretical)), Q↓(0.75(observed))) that Q0.25 and Q0.75 represent, respectively, the quantiles of order 0.25 and 0.75. As Fowlkes (1987)Fowlkes EB (1987) A folio of distributions: A collection of theoretical quantile quantile plots (78). New York, Marcel Dekker. 560p. explained, this fact indicates that the theoretical quantis are in agreement with the quantis observed; therefore, it is coherent to assume that the data of soybean yield follow a normal distribution.

FIGURE 3
Quantile-quantile plot of soybean yield (t ha-1). In particular, the straight line that passes through ordered pairs formed with quantiles of order 0.25 and 0.75.

CONCLUSIONS

The SB and PSB methods allowed to quantify the uncertainties associated to the spatial dependence structure and to evaluate the individual significance of the parameters associated to the average of the spatial linear model, allowing the determination of a model with a smaller number of parameters.

The fact that the linear spatial model formed with the potassium (K, mg dm-3), phosphorus (P, mg dm-3) and soil pH covariates had a spatial dependence radius close to the minimum distance between points, provided a map characterized by circular regions and a high amount of near-average values.

The use of the uncorrected residuals in the q-q plot elaboration allowed verifying the normality assumption of the soybean yield data, a presupposition that, in many cases, is simply assumed without verification. This finding is extremely important in the analysis since a violation of the normality assumption can invalidate statistical hypothesis tests.

ACKNOWLEDGEMENTS

The authors would like to thank to UTFPR, UNIOESTE, UFPE, FUNDAÇÃO ARAUCÁRIA and CAPES and to the Fondecyt Project 1150325 - Chile for the support provided to carry out this study.

REFERENCES

  • Aparecido LE, Rolim GS, Richetti J, Souza PS, Johann JA (2016) Köppe, Thornthwaite and Camargo climate classifications for climatic zoning in the State of Paraná, Brazil. Ciência e Agrotecnologia 40(4):405-417. DOI: http://dx.doi.org/10.1590/1413-70542016404003916
    » http://dx.doi.org/10.1590/1413-70542016404003916
  • Ávila MR, Albrecht LP (2010) Isoflavonas e a qualidade das sementes de soja. Informativo Abrates 20(1):15-29.
  • Borssoi JA, Uribe-Opazo MA, Galea M (2011) Técnicas de diagnóstico de influência local na análise da produtividade da soja. Engenharia Agrícola 31(2):376-387. DOI: http://dx.doi.org/10.1590/S0100-69162011000200018
    » http://dx.doi.org/10.1590/S0100-69162011000200018
  • Conab (2016) Soja – Brasil: Série histórica de produtividade. Available: http://www.conab.gov.br Accessed: Nov 17, 2016.
    » http://www.conab.gov.br
  • Cressie N (2015) Statistics for spatial data. New York, John Wiley & Sons. 928p.
  • Cruz SCS, Sena-Junior DG, Santos DMA, Lunezzo LO, Machado CG (2016) Cultivo de soja sob diferentes densidades de semeadura e arranjos espaciais. Revista de Agricultura Neotropical 3(1):1-6.
  • Dalposso GH, Uribe-Opazo MA, Borssoi JA, Johann JA, Mercante E (2009) Previsão da produção de trigo utilizando métodos geoestatísticos. In: Di Leo N, Montico S, Nardón G. Avances en Ingenieria Rural 2007-2009. Rosario, Editorial de la Universidad Nacional de Rosario. p78-86.
  • Dalposso GH, Uribe-Opazo MA, Johann JA (2016) Soybean yield modeling using bootstrap methods for small samples. Spanish Journal of Agricultural Research 14(3). DOI: http://dx.doi.org/10.5424/sjar/2016143-8635
    » http://dx.doi.org/10.5424/sjar/2016143-8635
  • De Bastiani F, Cysneiros AHMA, Uribe-Opazo MA, Galea M (2015) Influence diagnostics in elliptical spatial linear models. Test 24(2):322-340. DOI: http://dx.doi.org/10.1007/s11749-014-0409-z
    » http://dx.doi.org/10.1007/s11749-014-0409-z
  • Diggle PJ, Ribeiro-Jr PJ (2007) Model-based geostatistics. New York, Springer. 232p.
  • Dos Passos AMA, Rezende PM, Carvalho ER, Ávila FW (2015) Biochar, farmyard manure and poultry litter on chemical attributes of a Distrophic Cambissol and soybean crop. Revista Brasileira de Ciências Agrárias 10(3):382-388. DOI: http://dx.doi.org/10.5039/agraria.v10i3a4546
    » http://dx.doi.org/10.5039/agraria.v10i3a4546
  • Efron B (1979) Bootstrap methods: Another look at the jackknife. Annals of Statistics 7(1):1-26
  • Efron B (1982) The jackknife, the bootstrap and other resampling plans. Philadelphia, SIAM. 93p.
  • Embrapa (2009) Sistema brasileiro de classificação de solos. Rio de Janeiro, Embrapa.
  • Fowlkes EB (1987) A folio of distributions: A collection of theoretical quantile quantile plots (78). New York, Marcel Dekker. 560p.
  • Gallon M, Buzzello GL, Trezzi MM, Diesel F, Silva HL (2016) Ação de herbicidas inibidores da PROTOX sobre o desenvolvimento, acamamento e produtividade da soja. Revista Brasileira de Herbicidas 15(3):232-240. DOI: http://dx.doi.org/10.7824/rbh.v15i3.471
    » http://dx.doi.org/10.7824/rbh.v15i3.471
  • Guedes LPC, Ribeiro-Jr PJ, Uribe-Opazo MA, De Bastiani F (2016) Mapas da produtividade da soja usando configurações amostrais regulares e otimizadas pela têmpera simulada. Engenharia Agrícola 36(1):114-125. DOI: http://dx.doi.org/10.1590/1809-4430-Eng.Agric.v36n1p114-125/2016
    » http://dx.doi.org/10.1590/1809-4430-Eng.Agric.v36n1p114-125/2016
  • Gupta SK, Manjaya JG (2016) Assessment of genetic variation at soybean mosaic virus resistance loci in Indian Soybean (Glycine max L. Merill) genotypes using SSR markers. Eletronic Journal of Plant Breeding 7(2):392-400. DOI: http://dx.doi.org/10.5958/0975-928X.2016.00048.X
    » http://dx.doi.org/10.5958/0975-928X.2016.00048.X
  • Kang C, Choi S, Yoo SH (2008) A spatial bootstrap method for kriging variance. Journal of the Korean Data Analysis Society 10(3):1247-1254.
  • Kestring FBF, Guedes LPC, De Bastiani F, Uribe-Opazo, MA (2015) Comparação de mapas temáticos de diferentes grades amostrais para a produtividade da soja. Engenharia Agrícola 35(4):733-743. DOI: http://dx.doi.org/10.1590/1809-4430-Eng.Agric.v35n4p733-743/2015
    » http://dx.doi.org/10.1590/1809-4430-Eng.Agric.v35n4p733-743/2015
  • Li L, Xu L, Wang X, Pan G, Lu L (2015) De novo characterization of the alligator weed (Alternanthera philoxeroides) transcriptome illuminates gene expression under potassium deprivation. Journal of Genetics 94(1):95-104. DOI: http://dx.doi.org/10.1007/s12041-015-0493-1
    » http://dx.doi.org/10.1007/s12041-015-0493-1
  • Mardia KV, Marshall RJ (1984) Maximum likelihood estimation of models for residual covariance in spatial regression. Biometrika 71(1):135-146. DOI: http://dx.doi.org/10.2307/2336405
    » http://dx.doi.org/10.2307/2336405
  • Margalho L, Menezes R, Souza I (2014) Assessing interpolation error for space-time monitoring data. Stochastic Environmental Research and Risk Assessment 28(1): 1307-1321. DOI: http://dx.doi.org/10.1007/s00477-013-0826-7
    » http://dx.doi.org/10.1007/s00477-013-0826-7
  • Matérn B (1986) Spatial variation. Berlin, Springer-Verlag, 2 ed. 151p.
  • Menezes MD, Silva SHG, Mello CR, Owens PR, Curi, N (2016) Spatial prediction of soil properties in two contrasting physiographic regions in Brazil. Scientia Agricola 73(3):274-285. DOI: http://dx.doi.org/10.1590/0103-9016-2015-0071
    » http://dx.doi.org/10.1590/0103-9016-2015-0071
  • Olea RA, Pardo-Igúzquiza E (2011) Generalized bootstrap method for assessment of uncertainty in semivariogram inference. Mathematical Geosciences 43(2):203-228. DOI: http://dx.doi.org/10.1007/s11004-010-9269-6
    » http://dx.doi.org/10.1007/s11004-010-9269-6
  • Pardo-Igúzquiza E, Olea RA (2012) VARBOOT: A spatial bootstrap program for semivariogram uncertainty assessment. Computers & Geosciences 41(1):188-198. DOI: http://dx.doi.org/10.1016/j.cageo.2011.09.002
    » http://dx.doi.org/10.1016/j.cageo.2011.09.002
  • Pimentel Gomes, F (1985) Curso de estatística experimental. São Paulo, Nobel. 467p.
  • R Core Team (2017) R: A language and environment for statistical computing. R foundation for statistical computing. Vienna.
  • Rao CR (1973) Linear statistical inference and its applications. New York, John Wiley e Sons. 656p.
  • Schelin L, Sjöstedt-De Luna S (2010) Kriging prediction intervals based on semiparametric bootstrap, Mathematical Geosciences 42(1):985-1000. DOI: http://dx.doi.org/10.1007/s11004-010-9302-9
    » http://dx.doi.org/10.1007/s11004-010-9302-9
  • Sjöstedt-De Luna S, Young (2003) The bootstrap and kriging prediction intervals. Scandinavian Journal of Statistics 30(1):175-192.
  • Solow A (1985) Bootstrapping correlated data. Mathematical Geology 17(7):769-775. DOI: http://dx.doi.org/10.1007/BF01031616
    » http://dx.doi.org/10.1007/BF01031616
  • Sousa DMG, Miranda LN, Oliveira AS (2007) Acidez do solo e sua correção. In: Novais RF, Alvarez VH, Barros NF, Fontes RL, Cantarutti RB, Neves JCL. Fertilidade do solo. SBCS, p205-274.
  • Tang L, Schucany W, Woodward W, Gunst R (2006) A parametric spatial bootstrap. Technical Report SMU-TR-337. Dallas, Southern Methodist University.
  • Tomé Jr JB (1997) Manual para interpretação de análise de solo. Guaíba, Agropecuária. 247p.
  • Uribe-Opazo MA, Borssoi JA, Galea, M (2012) Influence diagnostics in gaussian spatial linear models. Journal of Applied Statistics 3(39):615-630. DOI: http://dx.doi.org/10.1080/02664763.2011.607802
    » http://dx.doi.org/10.1080/02664763.2011.607802
  • Vieira RS, Gonzalez AP (2003) Analysis of the spatial variability of crop yield and soil properties in small agricultural plots. Bragantia 62(1):127-138. DOI: http://dx.doi.org/10.1590/S0006-87052003000100016
    » http://dx.doi.org/10.1590/S0006-87052003000100016
  • Wackernagel H (1995) Multivariate geostatistics. Berlin, Springer-Verlag. 256p.

Publication Dates

  • Publication in this collection
    Jan-Feb 2018

History

  • Received
    08 June 2017
  • Accepted
    03 Nov 2017
Associação Brasileira de Engenharia Agrícola SBEA - Associação Brasileira de Engenharia Agrícola, Departamento de Engenharia e Ciências Exatas FCAV/UNESP, Prof. Paulo Donato Castellane, km 5, 14884.900 | Jaboticabal - SP, Tel./Fax: +55 16 3209 7619 - Jaboticabal - SP - Brazil
E-mail: revistasbea@sbea.org.br