Comparison between artificial neural networks and maximum likelihood classification in digital soil mapping

Chagas, César da Silva; Vieira, Carlos Antônio Oliveira; Fernandes Filho, Elpídio Inácio

doi:10.1590/S0100-06832013000200005

Abstracts

Soil surveys are the main source of spatial information on soils and have a range of different applications, mainly in agriculture. The continuity of this activity has however been severely compromised, mainly due to a lack of governmental funding. The purpose of this study was to evaluate the feasibility of two different classifiers (artificial neural networks and a maximum likelihood algorithm) in the prediction of soil classes in the northwest of the state of Rio de Janeiro. Terrain attributes such as elevation, slope, aspect, plan curvature and compound topographic index (CTI) and indices of clay minerals, iron oxide and Normalized Difference Vegetation Index (NDVI), derived from Landsat 7 ETM+ sensor imagery, were used as discriminating variables. The two classifiers were trained and validated for each soil class using 300 and 150 samples respectively, representing the characteristics of these classes in terms of the discriminating variables. According to the statistical tests, the accuracy of the classifier based on artificial neural networks (ANNs) was greater than of the classic Maximum Likelihood Classifier (MLC). Comparing the results with 126 points of reference showed that the resulting ANN map (73.81 %) was superior to the MLC map (57.94 %). The main errors when using the two classifiers were caused by: a) the geological heterogeneity of the area coupled with problems related to the geological map; b) the depth of lithic contact and/or rock exposure, and c) problems with the environmental correlation model used due to the polygenetic nature of the soils. This study confirms that the use of terrain attributes together with remote sensing data by an ANN approach can be a tool to facilitate soil mapping in Brazil, primarily due to the availability of low-cost remote sensing data and the ease by which terrain attributes can be obtained.

terrain attributes; neural networks; maximum likelihood

O levantamento de solos é a principal fonte de informação espacial sobre solos para diferentes usos, principalmente o uso agrícola. No entanto, a continuidade dessa atividade tem sido grandemente comprometida, principalmente pela escassez de recursos financeiros. O objetivo deste estudo foi avaliar a eficiência da utilização de dois classificadores distintos (redes neurais artificiais - RNAs e o algoritmo da máxima verossimilhança - Maxver) na predição de classes de solos em uma área na região noroeste do Estado do Rio de Janeiro. As variáveis discriminantes usadas incluem atributos do terreno, como elevação, declividade, aspecto, plano de curvatura e índice topográfico combinado (CTI) e índices clay minerals, iron oxide e de vegetação NDVI, derivados de uma imagem do sensor ETM+ do LANDSAT 7. Para o treinamento e a validação dos classificadores, foram utilizadas, respectivamente, 300 e 150 amostras por classe de solo, representativas das características dessas classes, com relação às variáveis discriminantes utilizadas. De acordo com os testes estatísticos realizados, o classificador com base na RNA produziu maior exatidão do que o classificador clássico da máxima verossimilhança (Maxver). A comparação com 126 pontos de referência coletados no campo evidenciou que o mapa produzido pela RNA teve desempenho superior (73,81 %) ao mapa produzido pelo algoritmo Maxver (57,94 %). As principais causas de erros detectadas na utilização desses classificadores foram: a heterogeneidade geológica da área aliada a problemas no mapa geológico utilizado; profundidade do contato lítico e, ou, exposição da rocha; e problemas com o modelo de correlação ambiental utilizado em razão da natureza poligenética dos solos. Os resultados obtidos permitem inferir que a utilização de atributos do terreno juntamente com dados de sensoriamento remoto em uma abordagem por RNAs pode contribuir para facilitar o mapeamento de solos no Brasil, principalmente por causa da disponibilidade de dados de sensores remotos a custos mais baixos e da facilidade de obtenção dos atributos do terreno.

atributos do terreno; redes neurais artificiais; máxima verossimilhança

DIVISÃO 1 - SOLO NO ESPAÇO E NO TEMPO

COMISSÃO 1.3 - PEDOMETRIA

Comparison between artificial neural networks and maximum likelihood classification in digital soil mapping

Comparação entre redes neurais artificiais e classificação por máxima verossimilhança no mapeamento digital de solos

César da Silva Chagas^I; Carlos Antônio Oliveira Vieira^II; Elpídio Inácio Fernandes Filho^III

^IPesquisador A da Embrapa Solos. Rua Jardim Botânico, 1024. CEP 22460-000 Rio de Janeiro (RJ). E-mail: cesar@cnps.embrapa.br; chagas.rj@gmail.com

^IIProfessor do Departamento de Geociências, Universidade Federal de Santa Catarina - UFSC. Caixa-Postal 476. CEP 88040-900 Florianópolis (SC). E-mail: carlos.vieira@cfh.ufsc.br

^IIIProfessor do Departamento de Solos, Universidade Federal de Viçosa - UFV. Av. P.H. Rolfs, s/n. CEP 36570-000 Viçosa (MG). E-mail: elpidio@ufv.br

SUMMARY

Soil surveys are the main source of spatial information on soils and have a range of different applications, mainly in agriculture. The continuity of this activity has however been severely compromised, mainly due to a lack of governmental funding. The purpose of this study was to evaluate the feasibility of two different classifiers (artificial neural networks and a maximum likelihood algorithm) in the prediction of soil classes in the northwest of the state of Rio de Janeiro. Terrain attributes such as elevation, slope, aspect, plan curvature and compound topographic index (CTI) and indices of clay minerals, iron oxide and Normalized Difference Vegetation Index (NDVI), derived from Landsat 7 ETM⁺ sensor imagery, were used as discriminating variables. The two classifiers were trained and validated for each soil class using 300 and 150 samples respectively, representing the characteristics of these classes in terms of the discriminating variables. According to the statistical tests, the accuracy of the classifier based on artificial neural networks (ANNs) was greater than of the classic Maximum Likelihood Classifier (MLC). Comparing the results with 126 points of reference showed that the resulting ANN map (73.81 %) was superior to the MLC map (57.94 %). The main errors when using the two classifiers were caused by: a) the geological heterogeneity of the area coupled with problems related to the geological map; b) the depth of lithic contact and/or rock exposure, and c) problems with the environmental correlation model used due to the polygenetic nature of the soils. This study confirms that the use of terrain attributes together with remote sensing data by an ANN approach can be a tool to facilitate soil mapping in Brazil, primarily due to the availability of low-cost remote sensing data and the ease by which terrain attributes can be obtained.

Index terms: terrain attributes; neural networks; maximum likelihood.

RESUMO

O levantamento de solos é a principal fonte de informação espacial sobre solos para diferentes usos, principalmente o uso agrícola. No entanto, a continuidade dessa atividade tem sido grandemente comprometida, principalmente pela escassez de recursos financeiros. O objetivo deste estudo foi avaliar a eficiência da utilização de dois classificadores distintos (redes neurais artificiais - RNAs e o algoritmo da máxima verossimilhança - Maxver) na predição de classes de solos em uma área na região noroeste do Estado do Rio de Janeiro. As variáveis discriminantes usadas incluem atributos do terreno, como elevação, declividade, aspecto, plano de curvatura e índice topográfico combinado (CTI) e índices clay minerals, iron oxide e de vegetação NDVI, derivados de uma imagem do sensor ETM⁺ do LANDSAT 7. Para o treinamento e a validação dos classificadores, foram utilizadas, respectivamente, 300 e 150 amostras por classe de solo, representativas das características dessas classes, com relação às variáveis discriminantes utilizadas. De acordo com os testes estatísticos realizados, o classificador com base na RNA produziu maior exatidão do que o classificador clássico da máxima verossimilhança (Maxver). A comparação com 126 pontos de referência coletados no campo evidenciou que o mapa produzido pela RNA teve desempenho superior (73,81 %) ao mapa produzido pelo algoritmo Maxver (57,94 %). As principais causas de erros detectadas na utilização desses classificadores foram: a heterogeneidade geológica da área aliada a problemas no mapa geológico utilizado; profundidade do contato lítico e, ou, exposição da rocha; e problemas com o modelo de correlação ambiental utilizado em razão da natureza poligenética dos solos. Os resultados obtidos permitem inferir que a utilização de atributos do terreno juntamente com dados de sensoriamento remoto em uma abordagem por RNAs pode contribuir para facilitar o mapeamento de solos no Brasil, principalmente por causa da disponibilidade de dados de sensores remotos a custos mais baixos e da facilidade de obtenção dos atributos do terreno.

Termos de indexação: atributos do terreno; redes neurais artificiais; máxima verossimilhança.

INTRODUCTION

Soil survey is one of the main sources of spatial information on soils and has a wide range of uses. The qualitative landscape approach used in these surveys based on the interpretation of aerial photographs has however been criticized, for not allowing an understanding of the quantitative relationships between the shape of the land surface and the soils and their properties (McBratney et al., 2000).

Therefore, various quantitative methods (McBratney et al., 2003) have been developed over the last few years to describe, classify and study patterns of spatial distribution in soils. These methods are grouped together in an emerging field of soil science known as pedometrics (McBratney et al., 2000). This way of mapping soils digitally involves the quantitative prediction of soils and their properties using measured and/or observed and auxiliary data representing factors of soil formation. Various techniques have been used to predict soil properties and/or classes, such as kriging (Brus & Heuvelink, 2007; Weindorf & Zhu, 2010), multiple logistic regression (Hengl et al., 2007; Ten Caten et al., 2011), decision trees (Elnaggar & Noller, 2010; Giasson et al., 2011), and artificial neural networks (Behrens et al., 2005; Boruvka & Penizek, 2007; Chagas et al., 2010), among others.

The main advantages of artificial neural networks (ANNs) are an efficient handling of large quantities of data and their capacity for generalization. Furthermore, they require no specific type of data distribution, unlike the traditional parametric statistical approach, which implies normal data distribution, and allow the manipulation of data from different sources, with different levels of precision and noise (Atkinson & Tatnall, 1997; Benediktsson & Sveinsson, 1997).

In soil science, ANNs have already been used to map soil properties, mainly (McBratney et al., 2003). However, their application in the mapping of soil classes is less common and has been reported in only a few studies. Zhu (2000) used ANNs to populate a soil similarity model constructed to represent soil landscape as spatial continua. In this study, a set of soil formative factors were used as input data for the network, and the map derived by this approach was more detailed and superior to that derived from a conventional soil map.

An approach to the digital mapping of soil classes based on ANNs was developed by Behrens et al. (2005), using 69 terrain attributes, 53 geological units and three types of land use, extracted from existing maps and databases. In general, this approach proved fairly satisfactory, saved time, reduced financial costs and provided reliable results, with an average accuracy of the evaluated soil units of over 92 % for both training and validation. Boruvka & Penizek (2007) used ANNs for the mapping of soil classes with data of pH, clay levels and textural gradient from pre-existing soil surveys, as well as data on elevation, aspect and plan curvature as discriminating variables. The results demonstrated differences in the allocation of the soil classes.

In Brazil, the use of ANNs for the prediction of soil class was described by Chagas et al. (2010, 2011), who demonstrated the efficiency of this approach in digital soil mapping. Despite the growing use in recent years, there is still an enormous lack of studies involving digital soil mapping in Brazil, and mainly with a view to the selection of predictors, soil sampling techniques, evaluation of prediction methods (MLC, ANNs, decision trees, multiple logistic regressions, etc.), validation methods, and even the evaluation of the different environmental conditions in the country.

In this context, this study differs from previous work in its objective to compare the efficiency of the two prediction methods (ANNs and MLC) for digital soil mapping, using terrain attributes derived from a digital elevation model (DEM) and data from an orbital remote sensor (Landsat 7), to evaluate the real possibility of using these approaches in medium-scale soil surveys.

MATERIALS AND METHODS

The study was carried out in an area of approximately 10,000 ha in the basin of the São Domingos River, a tributary of the Muriaé River in the northwestern region of the State of Rio de Janeiro, between the UTM coordinates 7,629,223N to 7,638,239N and 186,146E to 197,180E, Zone 24K, Datum Córrego Alegre (Figure 1). Two geomorphological systems can be found in the region: the hillslope area Mar de Morros and mountain chains and layered structures (NE-SW); the predominant lithology structure in the region comprise noritic granulites, heterogeneous migmatites and Quaternary alluvial deposits (DRM-RJ, 1980).

To understand the soil-landscape relationships, the area was surveyed according to procedures established for pedological surveys (Embrapa, 1995). For the characterization of the identified soil classes, 73 soil profiles and 11 extra samples were collected with augers and classified according to the Brazilian Soil Classification System (Embrapa, 2006). In view of the high number of soil classes and the high complexity of the training and validation processes, mainly for ANNs, as well as the limitations of the different programs used at each step, the area was subdivided (shown in table 1). The Quaternary deposit areas were considered to be the flattened patches in the areas of noritic granulites and heterogeneous migmatites.

Thumbnail

This approach, which uses two different supervised classification algorithms (ANNs and the maximum likelihood algorithm), is based on the soil-landscape concept developed by Jenny (1941), according to which the soil is the result of interactions between formation factors over time.

Firstly, a digital elevation model (DEM) with a spatial resolution of 30 m was generated using the Topo to Raster option of the ArcGIS 10 program (ESRI, 2010). For this purpose, contour lines (20 m intervals), hydrography and elevation points contained in the topographic maps of IBGE SF-23-X-D-III-4 (Miracema) and SF-24-G-I-3 (São João do Paraíso) were used, both at a scale of 1:50,000. Then, after removing the spurious pits, this DEM was used to calculate the terrain attributes: elevation, slope, aspect, plan curvature and compound topographic index (CTI), according to Gallant & Wilson (2000), which together with the clay minerals, iron oxide (Sabins Junior, 1997) and Normalized Difference Vegetation Indices (NDVI) (Wiegand et al., 1991), derived from Landsat 7 ETM+ sensor imagery, represent the main factors of soil formation (discriminating variables) for the study area.

The compound topographic index (CTI) was obtained by equation 1:

where A_sis the contributing area ((flow accumulation + 1) x cell size of the grid in m²) and β is the slope expressed in radians.

The indices derived from the Landsat 7 ETM+ sensor images were used to reduce the work involved with data analysis by maximizing spectral information for a reduced number of sensor bands. Some studies demonstrated the relationship between the NDVI and soil moisture (Narasimha Rao et al., 1993) and the soil physical properties (Yang et al., 1997). This index was calculated as follows:

Similarly, clay minerals and iron oxide indices are frequently included in geological studies, underlying the distinction of soils with different physical and mineralogical characteristics (Sabins Junior, 1997). The clay mineral index was obtained by the ratio of band 5 (1.55-1.75 µm) by band 7 (2.08-2.35 µm) and the iron oxide index by the division of band 3 (0.63-0.69 µm) by band 1 (0.45-0.52 µm), both obtained from images of ETM⁺ sensor (LANDSAT 7) taken in August, 1999, using ERDAS Imagine 8.5 software.

The application of ANNs starts with the network training process (Tso & Mather, 2009). Firstly, the values related to all discriminating values were normalised to the interval between 0 and 1 [(discriminating variable - minimum value)/(maximum value - minimum value)] to facilitate the training process and avoid ANN saturation, as very large values can hamper the problem solution (saturation of the transfer functions could hinder the convergence of the network). Furthermore, efforts were made to prevent large variations in unimportant variables from inhibiting small variations in other more important variables.

The Java Neural Network Simulator was used for the classification by ANNs, based on the Stuttgart Neural Network Simulator 4.2 Kernel (Zell et al., 1996), and an applicable algorithm (funcpow) developed by Vieira (2000) for the classification by a Maximum Likelihood Classifier (MLC).

Two sets of independent stratified samples were collected from the areas containing profiles considered to be representative of each soil class, one for the training process and the other for validation of the two classifiers. In this way, efforts were made to represent the characteristics of each one in relation to the discrimination variables used (elevation, slope, aspect, plan curvature, CTI index, clay mineral index, iron oxide index, and NDVI index). It should be highlighted that the soil classes differed in relation to one or more of the discriminating variables. The training samples were used so that the classifiers could establish relationships between the discriminating variables (input data) and the soil classes (output data) through a learning process. The validation samples were then used to test this relationship by statistical means.

The training samples for both classifiers consisted of 300 samples or pixels per class, i.e., 3,600 samples for Area 1 and 3,000 samples for Area 2. The number of validation samples was limited to 50 % of the number of training samples (150 samples or pixels per class), with 1,800 samples for Area 1 and 1,500 samples for Area 2, according to Zhu (2000).

In the training step, 10 ANN architectures were tested, which differed only in terms of the number of neurons in the internal layer (3, 5, 6, 7, 8, 9, 10, 15, 20, and 25 neurons). All of these had the same number of neurons in the input layer (number of discriminating variables) and in the output layer (12 soil classes for Area 1 and 10 for Area 2). The learning algorithm used was the backpropagation method, with random allocation of interneuron weights between -0.5 and 0.5 and a learning rate of 0.2, involving 10,000 learning cycles.

The results were evaluated through statistical means using the Kappa coefficient and overall classification accuracy, derived from a confusion matrix (Congalton & Green, 1999) from the validation samples. In this way, the ANN architecture obtaining the best result for the Kappa coefficient was selected for the prediction of the soil classes.

A statistical test to verify the significance of the differences between two or more classifiers (represented by their respective confusion matrices) using Kappa coefficient analysis is proposed as an alternative approach for the comparison of independent confusion matrices. In this way, a significance matrix was generated using the results of the statistical tests for the chosen ANN and the classification with MLC, using the Kappa values and Kappa variance between the classifications, by means of the Z statistical test. The formula to test the significance between two independent Kappa coefficients (Z test) is given in the following equation:

Where K_a₁ and K_a₂ are the two Kappa coefficients that are being compared (Congalton & Mead, 1986) and var is the variance of the Kappa coefficient (Ka).

Thus, the Z test firstly verifies if the classification differs from a causal classification. In other words, if Z calculated > Zα/2 tabulated, the classification is significantly better than a random classification, where α/2 is the confidence level of the Z test and the number of degrees of freedom is assumed to be infinite. In a second analysis, the test verifies if there is a significant difference between the Kappa values resulting from the evaluation by the different classifiers. Values < than 1.96 (95 % confidence level) indicate no significant difference between the estimated Kappa values, while values > 1.96 demonstrate a significant difference.

The only existing soil survey in the area has a scale of 1:250,000 (Embrapa, 2003), a scale incompatible with the level of detail required for comparison with the digital maps produced by the two classifiers (between 1:100,000 and 1:50,000, medium scale). Therefore, 126 points of reference were used to determine the percentage of locations classified correctly on the maps, according to the procedure used by Zhu (2000). These reference points were collected by projects developed by Embrapa Solos in the area (the CNPq/CTHidro project, the Radema project and the Aquiferos project), including soil profiles, extra samples and observation points that were not used for the training and validation processes (independent points). The highest concentration of points in the middle of the area refers to a semi-detailed study carried out in the Barro Branco watershed for the Aquiferos project.

RESULTS AND DISCUSSION

The contribution of the different variables in the discrimination of soil class in Areas 1 and 2 are shown in table 2.

Thumbnail

In general, the differentiation of the soil classes detected by terrain attributes was much clearer than by the Landsat 7 indices. Studies carried out by Dobos et al. (2000, 2001) using terrain attributes and data from the Advanced Very High Resolution Radiometer (AVHRR) on the NOAA satellite for soil mapping showed that terrain attributes alone were not sufficient for the discrimination of the soils in the areas studied, while integrating this data with the AVHRR data increased classification performance.

The best architecture for final data classification was selected based on the results of the Kappa coefficient and the overall accuracy, obtained using the validation samples (Figure 2).

In this evaluation, the highest values for the overall accuracy and the Kappa coefficient for Area 1, obtained using the validation samples, were achieved using network architecture with only one input layer containing 8 neurons (Kappa value of 0.908). The Kappa coefficient for Area 2 performed best when using a network architecture with an input layer containing five neurons (Kappa value of 0.893). An analysis of the Kappa significance matrix (Table 3) showed significant differences between these networks and the others. These two architectures were therefore selected for the final data classification.

Thumbnail

On the other hand, using MLC in the classification led to an overall accuracy of 84.4 % and a Kappa coefficient of 0.830 for Area 1, and an overall accuracy of 72.3 % and a Kappa coefficient of 0.693 for Area 2. A comparison of the performances of the two classification types, based on an analysis of the Kappa significance matrix, indicated significant differences between these classifications for both areas (Table 4).

Thumbnail

The results obtained in this evaluation are similar to those in other studies, in which the approach via ANNs generally led to better results than by the MLC, mainly in the classification of soil use and cover using remote sensing data (Heermann & Khazenie, 1992; Bischof et al., 1992; Paola & Schowengerdt, 1995; McBratney et al., 2000).

Despite the significant differences between the two classifiers, with a better performance of the ANN classification (Kappa values of 0.908 and 0.893 for Areas 1 and 2 respectively), it should be emphasized that this approach requires more time and more computational resources for the training process than the classic approach using the MLC. However, for Yool (1998), ANNs have an advantage over conventional supervised classification methods such as MLC, as these are inadequate and impractical for the mapping of large areas.

According to Kanellopoulos & Wilkinson (1997), one aspect that has been neglected in the comparisons between statistical classifiers and artificial neural networks is the existence of significant differences in the performance of these classifiers when classes are considered individually. The confusion matrices obtained for the tested classifications based on the validation set are presented in tables 5 and 6.

Thumbnail

In the area characterized by noritic granulites (Area 1), the worst performance for ANN classification occurred for class PAe1 (62.0 %), with greatest confusion with classes PVAe2, GXbe and LVAd. In the case of PVAe2 and GXbe, the confusion can be explained by the extremely similar characteristics of the discriminating variables between these classes, mainly for slope, CTI index and the indices derived from Landsat 7 (Table 2). For class LVAd, the differentiation was clearest with regard to elevation, slope and CTI index (Table 2); therefore, the confusion observed must be attributed to an error in the classifier. Accuracies exceeded 85 % for all other soil classes (Table 5).

In the area characterized by Heterogeneous Migmatites (Area 2), the worst performance occurred for class PVe5 (70.7 %), with greatest confusion with class PVe6 (Table 5). These classes have very similar environmental characteristics and can be differentiated only by their aspect (Table 2). Accuracies exceeded 95 % for all other soil classes (Table 5).

For the Maximum Likelihood classification (Table 6), the performances in the classification of the area of noritic granulites (Area 1) was poorest for the classes PVAd (60 %) and PAe1 (56 %), versus 92.7 and 62 % obtained using ANNs. The confusion of PVAd was greater with PVe1, and since the average values of the discriminating variables of these classes are very different (Table 2), it can be concluded that MLC was less efficient than ANNs in the discrimination of these classes. The confusion of class PAe1 was greatest with class PVAe2 (Table 6), in part related to the similarity of the discriminating variables between these classes (Table 2). Therefore, the result of ANNs for class PAe1 (62 %) suggests that MLC is less efficient.

In the area characterized by Heterogeneous Migmatites (Area 2), the worst performances were obtained for classes PVe6, PVe4 and GXve, with accuracies of 44.7, 48.0 and 49.3 % respectively. Accuracies of 83.3, 92.7 and 97.3 % were obtained using ANNs for the same classes, which shows that the MLC was less efficient than ANNs in the discrimination of these classes. For PVe6, the confusion with class PVe5 was greatest, as well as for GXve with PAe2 and PVe4 with PVe6 (Table 6). The MLC performed well in discriminating the other classes, despite some confusion, with accuracy values exceeding 78.7 %.

Finally, a comparison between ANN and MLC shows a differentiated classification in Area 1, with ANNs performing better for the classes AR1, GXbe, PAe1, PVe1, PVe2, PVe3, PVAd, PVAe2, and RLe1, while MLC was only better in the classification of CXbe and PVAe1. The LVAd was classified equally by the two classifiers (Tables 5 and 6). In Area 2, the ANN approach was better in the classification of all classes except PVe5.

According to Landis & Koch (1977), the Kappa coefficient value obtained through the neural network approach indicates a very good to excellent performance for this classifier (Kappa > 0.75) for both Area1 (0.908) and Area 2 (0.893). On the other hand, the MLC presented a very good to excellent classification for Area 1 (0.830) and a moderate performance (0.4 < Kappa < 0.75) for Area 2.

Considering that some classes occupy small and/or very fragmented patches in the study area, some were grouped together to facilitate understanding of the ANN- and MLC-based maps. The criteria for this grouping was similarity in the characteristics of the area of occurrence, as mentioned in the key to Figure 3.

Although the results for the Kappa coefficient obtained from the validation samples were satisfactory for both classifiers, the resulting maps were different. The agreement between the ANN- and MLC-based maps was only 47.28 %. According to Kanellopoulos & Wilkinson (1997), this low agreement demonstrates the differing nature of the mathematic models used in the classifications and the way they structure the space.

Figure 3 shows that both maps contain many spatial details, reported by Zhu (2000). Using terrain attributes and geological properties as discriminating environmental variables therefore showed the soil-landscape relationships in the area more clearly, resulting in spatially more detailed maps.

In the absence of a conventional soil map on an adequate scale (> 50,000), reference points were used to compare and evaluate the maps obtained using ANNs and MLC, according to Zhu (2000). In this comparison, the classification using ANNs correctly inferred the soil classes at 93 locations (73.81 %) and MLC at 73 (57.94 %) (Table 6), which was comparable to results reported in other studies (Zhu, 2000; Zhu et al., 2001).

The matrix of Kappa significance (Table 7) indicated a significant difference between the classifiers. Better results were obtained using ANNs (Kappa coefficient of 0.709), which were significantly different from those obtained by MLC.

Thumbnail

In the ANN classification, possible causes of the low agreement with the reference points include the rather complex geological nature of the area, which in some situations impeded the correct determination of soil-landscape relationships; the quality of the geological map used; the lack of information on the depth of lithic contact; and problems related to the environmental correlation model used. Most misclassified observations occurred at the boundaries between the units of the geological map. Problems related to the quality of geological maps used in environmental correlation studies were also reported by Thomas et al. (1999) and McKenzie & Ryan (1999).

McKenzie & Austin (1993) found that the presence of an impediment layer at low depths is a strong predictor of soil properties and the presence of geological structures such as dikes can control the soil distribution pattern. Dikes of basic material are common in the area studied, but are not shown on the geological map (DRM-RJ, 1980).

On the other hand, McKenzie & Ryan (1999) emphasized that there are circumstances in which soil variation occurs without any direct relation to easily observable environmental variables (lack of a reliable predictor) and in these cases, detailed sampling is inevitable and some form of interpolation is necessary to generate a spatial prediction.

In the case of MLC-based classifications, the low agreement with the reference points was mostly due to the inefficiency of the classifier.

The main lack of agreement between ANN- and MLC-based classifications can be verified in the flatter areas of the landscape (flat and gently undulating topography), classified by ANNs as belonging to the Haplic Gleysol class (GXbe and GXve for Areas 1 and 2 respectively) and by MLC as belonging to classes PVAe2 and PVe7 (for Areas 1 and 2 respectively), which occurs in undulating topography. In this case, the ANN-based classification was more accurate while MLC was less effective in adequately discriminating areas with very similar slope values.

Conversely, by the ANN-based classification the class LVd (top-left corner of the image) was incorrectly identified in areas that should have been classified as belonging to class PVe7. In spite of the similar slope values of these classes, they occupy different positions on the landscape, with class LVd in even areas at high elevations and class PVe7 in the lower third of slopes close to wetlands. This result demonstrates an inefficiency in the ANN-based classification that was not observed for the MLC-based classification.

CONCLUSIONS

1. The Artificial Neutral Network classifier resulted in a higher accuracy for general classification when validation samples were used, producing significantly better statistical results than the Maximum Likelihood Classifier.

2. A comparison of reference points showed that the ANN-based was more accurate than the MLC-based map, which was significantly different. The main cause of data classification errors of the classifiers may be related to the geological heterogeneity of the area, the depth of lithic contact and problems associated with the environmental correlation model used, including the discriminating variables selected.

3. The use of terrain attributes and remote sensing data with spatial resolution compatible with the objectives of the study, together with a neural network approach, can help accelerate, extend and cheapen soil mapping in Brazil, due to the availability of lower-cost digital satellite images and the ease at which terrain attributes can be obtained.

LITERATURE CITED

Received for publication on July 18, 2012 and approved on February 28, 2013.

ATKINSON, P.M. & TATNALL, A.R.L. Neural networks in remote sensing. Inter. J. Remote Sens., 18:699-709, 1997.
BEHRENS, T.; FOSTER, H.; SCHOLTEN, T.; STEINRUCKEN, U.; SPIES, E.D. & GOLDSCHMITT, M. Digital soil mapping using artificial neural networks. J. Plant Nutr. Soil Sci., 168:21-33, 2005.
BENEDIKTSSON, J.A. & SVEINSSON, J.R. Feature extraction for multisource data classification with artificial neural networks. Inter. J. Remote Sens., 18:727-740, 1997.
BISCHOF, H.; SCHNEIDER, W. & PINZ, A.J. Multispectral classification of Landsat-images using neural networks. IEEE Trans. Geosci. Remote Sens., 30:482-490, 1992.
BORUVKA, L. & PENIZEK, V. A test of an artificial neural network allocation procedure using the Czech soil survey of agricultural land data. In: LAGACHERIE, P.; McBRATNEY, A. B. & VOLTZ, M., eds. Digital soil mapping: An introductory perspective. Amsterdam, Elsevier, 2007. p.415-424. (Developments in Soil Science, 31)
BRUS, D.J. & HEUVELINK, G.B.M. Optimization of sample patterns for universal kriging of environmental variables. Geoderma, 138:86-95, 2007.
CHAGAS, C.S.; FERNANDES FILHO, E.I.; VIEIRA, C.A.O.; SCHAEFER, C.E.G.R. & CARVALHO JÚNIOR, W. Atributos topográficos e dados do Landsat7 no mapeamento digital de solos com uso de redes neurais. Pesq. Agropec. Bras., 45:497-507, 2010.
CHAGAS, C.S.; CARVALHO JÚNIOR, W. & BHERING, S.B. Integração de dados do Quickbird e atributos do terreno no mapeamento digital de solos por redes neurais artificiais. R. Bras. Ci. Solo, 35:693-704, 2011.
CONGALTON, R.G. & GREEN, K. Assessing the accuracy of remotely sensed data: Principles and practices. New York, Lewis Publishers, 1999. 160p.
CONGALTON, R.G. & MEAD, R.A. A review of discrete multivariate analysis techniques used in assessing the accuracy of remotely sensed data from error matrices. IEEE Trans. Geosci. Remote Sens., 24:169-174, 1986.
DEPARTAMENTO DE RECURSOS MINERAIS - DRM-RJ. Projeto Carta Geológica do Estado do Rio de Janeiro na Escala 1:50.000. Folhas: Miracema e São João do Paraíso. 1980.
DOBOS, E.; MICHELI, E.; BAUMGARDNER, M.F.; BIEHL, L. & HELT, T. Use of combined digital elevation model and satellite data for regional soil mapping. Geoderma, 97:367-391, 2000.
DOBOS, E.; MONTANARELLA, L.; NÈGRE, T. & MICHELI, E. A regional scale soil mapping approach using integrated AVHRR and DEM data. Inter. J. Appl. Earth Obs. Geoinf., 3:30-42, 2001.
ELNAGGAR, A.A. & NOLLER, J.S. Application of remote sensing data and decision-tree analysis to mapping of salt-affected soils over large areas. Remote Sens., 2:151-165, 2010.
EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. Centro Nacional de Pesquisa de Solos. Levantamento de reconhecimento de baixa intensidade dos solos do Estado do Rio de Janeiro. Rio de Janeiro, 2003. 242p. (Boletim de Pesquisa e Desenvolvimento, 32)
EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. Centro Nacional de Pesquisa de Solos. Procedimentos normativos de levantamentos pedológicos. Brasília, 1995. 101p.
EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. Centro Nacional de Pesquisa de Solos. Sistema brasileiro de classificação de solos. 2.ed. Rio de Janeiro, 2006. 306p.
ENVIRONMENTAL SYSTEMS RESEARCH INSTITUTE - ESRI. ArcGIS Desktop. Redlands, 2010. v.10.
GALLANT, J.C. & WILSON, J.P. Primary topographic attributes. In: WILSON, J.P. & GALLANT, J.C., eds. Terrain analysis: Principles and applications. New York, John Wiley & Sons, 2000. p.51-85.
GIASSON, E.; SARMENTO, E.C.; WEBER, E.; FLORES, C.A. & HASENACK, H. Decision trees for digital soil mapping on subtropical basaltic steeplands. Sci. Agric., 68:167-174, 2011.
HEERMANN, P.D. & KHAZENIE, N. Classification of multispectral remote sensing data using a back-propagation neural network. IEEE Trans. Geosci. Remote Sens., 30:81-88, 1992.
HENGL, T.; TOOMANIAN, N.; REUTER, H.I. & MALAKOUTI, M.J. Methods to interpolate soil categorical variables from profile observations: Lessons from Iran. Geoderma, 140:417-427, 2007.
JENNY, H. Factors of soil formation; a system of quantitative pedology. New York, McGraw-Hill, 1941. 281p.
KANELLOPOULOS, I. & WILKINSON, G.G. Strategies and best practice for neural network image classification. Inter. J. Remote Sens., 18:711-725, 1997.
LANDIS, J.R. & KOCH, G.G. The measurement of observer agreement for categorical data. Biometrics, 33:159-174, 1977.
McBRATNEY, A.B.; ODEH, I.O.A.; BISHOP, T.F.A.; DUNBAR, M.S. & SHATAR, T.M. An overview of pedometrics techniques for use in soil survey. Geoderma, 97:293-327, 2000.
McBRATNEY, A.B.; SANTOS, M.L.M. & MINASNY, B. On digital soil mapping. Geoderma, 117:3-52, 2003.
McKENZIE, N.J. & AUSTIN, M.P. A quantitative Australian approach to medium and small scale surveys based on soil stratigraphy and environmental correlation. Geoderma, 57:329-355, 1993.
McKENZIE, N.J. & RYAN, P.J. Spatial prediction of soil properties using environmental correlation. Geoderma, 89:67-94, 1999.
NARASIMHA RAO, P.V.; VENKATARATNAM, L.; KRISHNA RAO, P.V.; RAMANA, K.V. & SINGARAO, M.N. Relation between root zone soil moisture and normalized difference vegetation index of vegetated fields. Inter. J. Remote Sens., 14:441-449, 1993.
PAOLA, J.D. & SCHOWENGERDT, R.A. A detailed comparison of backpropagation neural network and maximum-likelihood classifiers for urban land use classification. IEEE Trans. Geosci. Remote Sens., 33:981-996, 1995.
SABINS JUNIOR, F.F. Remote sensing: Principles and interpretation. 3.ed. New York, W. H. Freeman and Company, 1997. 432p.
THOMAS, A.L.; KING, D.; DAMBRINE, E. & COUTURIER, A. Predicting soil classes with parameters derived from relief and geologic materials in a sandstone region of the Vosges mountains (Northeastern France). Geoderma, 90:291-305, 1999.
TSO, B. & MATHER, P.M. Classification methods for remotely sensed data. 2.ed. Boca Raton, CRC, 2009. 356p.
WEINDORF, D.C. & ZHU, Y. Spatial variability of soil properties at Capulin Volcano, New Mexico, USA: implications for sampling strategy. Pedosphere, 20:185-197, 2010.
WIEGAND, C.L.; RICHARDSON, A.J.; ESCOBAR, D.E. & GERBERMANN, A.H. Vegetation indices in crop assessments. Remote Sens. Environ., 35:105-119, 1991.
YANG, W.; YANG, L. & MERCHANT, J.W. An assessment of AVHRR/NDVI-ecoclimatological relations in Nebraska, USA. Inter. J. Remote Sens., 18:2161-2180, 1997.
YOOL, S.R. Land cover classification in rugged areas using simulated moderate-resolution remote sensor data and an artificial neural network. Inter. J. Remote Sens., 19:85-96, 1998.
ZELL, A.; MAMIER, G.; VOGT, M.; MACHE, N.; HUBNER, R.; DORING, S.; HERRMANN, K.; SOYEZ, T.; SCHMALZL, M.; SOMMER, T.; HATZIGEORGIOU, A.; POSSELT, D.; SCHREINER, T.; KETT, B.; CLEMENTE, G.; WIELAND, J. & GATTER, J. Stuttgart neural network simulator: User manual. Version 4.2. Stuttgart, University of Stuttgart, 1996. 338p.
TEN CATEN, A.; DALMOLIN, R.S.D.; PEDRON, F.A. & MENDONÇA-SANTOS, M.L. Regressões logísticas múltiplas: Fatores que influenciam sua aplicação na predição de classes de solos. R. Bras. Ci. Solo, 35:53-62, 2011.
ZHU, A.X. Mapping soil landscape as spatial continua: The neural network approach. Water Res. Res., 36:663-677, 2000.
ZHU, A.X.; HUDSON, B.; BURT, J.; LUBICH, K. & SIMONSON, D. Soil mapping using GIS, expert knowledge, and fuzzy logic. Soil Sci. Soc. Am. J., 65:1463-1472, 2001.

Publication Dates

Publication in this collection
03 June 2013
Date of issue
Apr 2013

History

Received
18 July 2012
Accepted
28 Feb 2013

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

[1] ATKINSON, P.M. & TATNALL, A.R.L. Neural networks in remote sensing. Inter. J. Remote Sens., 18:699-709, 1997.

[2] BEHRENS, T.; FOSTER, H.; SCHOLTEN, T.; STEINRUCKEN, U.; SPIES, E.D. & GOLDSCHMITT, M. Digital soil mapping using artificial neural networks. J. Plant Nutr. Soil Sci., 168:21-33, 2005.

[3] BENEDIKTSSON, J.A. & SVEINSSON, J.R. Feature extraction for multisource data classification with artificial neural networks. Inter. J. Remote Sens., 18:727-740, 1997.

[4] BISCHOF, H.; SCHNEIDER, W. & PINZ, A.J. Multispectral classification of Landsat-images using neural networks. IEEE Trans. Geosci. Remote Sens., 30:482-490, 1992.

[5] BORUVKA, L. & PENIZEK, V. A test of an artificial neural network allocation procedure using the Czech soil survey of agricultural land data. In: LAGACHERIE, P.; McBRATNEY, A. B. & VOLTZ, M., eds. Digital soil mapping: An introductory perspective. Amsterdam, Elsevier, 2007. p.415-424. (Developments in Soil Science, 31)

[6] BRUS, D.J. & HEUVELINK, G.B.M. Optimization of sample patterns for universal kriging of environmental variables. Geoderma, 138:86-95, 2007.

[7] CHAGAS, C.S.; FERNANDES FILHO, E.I.; VIEIRA, C.A.O.; SCHAEFER, C.E.G.R. & CARVALHO JÚNIOR, W. Atributos topográficos e dados do Landsat7 no mapeamento digital de solos com uso de redes neurais. Pesq. Agropec. Bras., 45:497-507, 2010.

[8] CHAGAS, C.S.; CARVALHO JÚNIOR, W. & BHERING, S.B. Integração de dados do Quickbird e atributos do terreno no mapeamento digital de solos por redes neurais artificiais. R. Bras. Ci. Solo, 35:693-704, 2011.

[9] CONGALTON, R.G. & GREEN, K. Assessing the accuracy of remotely sensed data: Principles and practices. New York, Lewis Publishers, 1999. 160p.

[10] CONGALTON, R.G. & MEAD, R.A. A review of discrete multivariate analysis techniques used in assessing the accuracy of remotely sensed data from error matrices. IEEE Trans. Geosci. Remote Sens., 24:169-174, 1986.

[11] DEPARTAMENTO DE RECURSOS MINERAIS - DRM-RJ. Projeto Carta Geológica do Estado do Rio de Janeiro na Escala 1:50.000. Folhas: Miracema e São João do Paraíso. 1980.

[12] DOBOS, E.; MICHELI, E.; BAUMGARDNER, M.F.; BIEHL, L. & HELT, T. Use of combined digital elevation model and satellite data for regional soil mapping. Geoderma, 97:367-391, 2000.

[13] DOBOS, E.; MONTANARELLA, L.; NÈGRE, T. & MICHELI, E. A regional scale soil mapping approach using integrated AVHRR and DEM data. Inter. J. Appl. Earth Obs. Geoinf., 3:30-42, 2001.

[14] ELNAGGAR, A.A. & NOLLER, J.S. Application of remote sensing data and decision-tree analysis to mapping of salt-affected soils over large areas. Remote Sens., 2:151-165, 2010.

[15] EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. Centro Nacional de Pesquisa de Solos. Levantamento de reconhecimento de baixa intensidade dos solos do Estado do Rio de Janeiro. Rio de Janeiro, 2003. 242p. (Boletim de Pesquisa e Desenvolvimento, 32)

[16] EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. Centro Nacional de Pesquisa de Solos. Procedimentos normativos de levantamentos pedológicos. Brasília, 1995. 101p.

[17] EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. Centro Nacional de Pesquisa de Solos. Sistema brasileiro de classificação de solos. 2.ed. Rio de Janeiro, 2006. 306p.

[18] ENVIRONMENTAL SYSTEMS RESEARCH INSTITUTE - ESRI. ArcGIS Desktop. Redlands, 2010. v.10.

[19] GALLANT, J.C. & WILSON, J.P. Primary topographic attributes. In: WILSON, J.P. & GALLANT, J.C., eds. Terrain analysis: Principles and applications. New York, John Wiley & Sons, 2000. p.51-85.

[20] GIASSON, E.; SARMENTO, E.C.; WEBER, E.; FLORES, C.A. & HASENACK, H. Decision trees for digital soil mapping on subtropical basaltic steeplands. Sci. Agric., 68:167-174, 2011.

[21] HEERMANN, P.D. & KHAZENIE, N. Classification of multispectral remote sensing data using a back-propagation neural network. IEEE Trans. Geosci. Remote Sens., 30:81-88, 1992.

[22] HENGL, T.; TOOMANIAN, N.; REUTER, H.I. & MALAKOUTI, M.J. Methods to interpolate soil categorical variables from profile observations: Lessons from Iran. Geoderma, 140:417-427, 2007.

[23] JENNY, H. Factors of soil formation; a system of quantitative pedology. New York, McGraw-Hill, 1941. 281p.

[24] KANELLOPOULOS, I. & WILKINSON, G.G. Strategies and best practice for neural network image classification. Inter. J. Remote Sens., 18:711-725, 1997.

[25] LANDIS, J.R. & KOCH, G.G. The measurement of observer agreement for categorical data. Biometrics, 33:159-174, 1977.

[26] McBRATNEY, A.B.; ODEH, I.O.A.; BISHOP, T.F.A.; DUNBAR, M.S. & SHATAR, T.M. An overview of pedometrics techniques for use in soil survey. Geoderma, 97:293-327, 2000.

[27] McBRATNEY, A.B.; SANTOS, M.L.M. & MINASNY, B. On digital soil mapping. Geoderma, 117:3-52, 2003.

[28] McKENZIE, N.J. & AUSTIN, M.P. A quantitative Australian approach to medium and small scale surveys based on soil stratigraphy and environmental correlation. Geoderma, 57:329-355, 1993.

[29] McKENZIE, N.J. & RYAN, P.J. Spatial prediction of soil properties using environmental correlation. Geoderma, 89:67-94, 1999.

[30] NARASIMHA RAO, P.V.; VENKATARATNAM, L.; KRISHNA RAO, P.V.; RAMANA, K.V. & SINGARAO, M.N. Relation between root zone soil moisture and normalized difference vegetation index of vegetated fields. Inter. J. Remote Sens., 14:441-449, 1993.

[31] PAOLA, J.D. & SCHOWENGERDT, R.A. A detailed comparison of backpropagation neural network and maximum-likelihood classifiers for urban land use classification. IEEE Trans. Geosci. Remote Sens., 33:981-996, 1995.

[32] SABINS JUNIOR, F.F. Remote sensing: Principles and interpretation. 3.ed. New York, W. H. Freeman and Company, 1997. 432p.

[33] THOMAS, A.L.; KING, D.; DAMBRINE, E. & COUTURIER, A. Predicting soil classes with parameters derived from relief and geologic materials in a sandstone region of the Vosges mountains (Northeastern France). Geoderma, 90:291-305, 1999.

[34] TSO, B. & MATHER, P.M. Classification methods for remotely sensed data. 2.ed. Boca Raton, CRC, 2009. 356p.

[35] WEINDORF, D.C. & ZHU, Y. Spatial variability of soil properties at Capulin Volcano, New Mexico, USA: implications for sampling strategy. Pedosphere, 20:185-197, 2010.

[36] WIEGAND, C.L.; RICHARDSON, A.J.; ESCOBAR, D.E. & GERBERMANN, A.H. Vegetation indices in crop assessments. Remote Sens. Environ., 35:105-119, 1991.

[37] YANG, W.; YANG, L. & MERCHANT, J.W. An assessment of AVHRR/NDVI-ecoclimatological relations in Nebraska, USA. Inter. J. Remote Sens., 18:2161-2180, 1997.

[38] YOOL, S.R. Land cover classification in rugged areas using simulated moderate-resolution remote sensor data and an artificial neural network. Inter. J. Remote Sens., 19:85-96, 1998.

[39] ZELL, A.; MAMIER, G.; VOGT, M.; MACHE, N.; HUBNER, R.; DORING, S.; HERRMANN, K.; SOYEZ, T.; SCHMALZL, M.; SOMMER, T.; HATZIGEORGIOU, A.; POSSELT, D.; SCHREINER, T.; KETT, B.; CLEMENTE, G.; WIELAND, J. & GATTER, J. Stuttgart neural network simulator: User manual. Version 4.2. Stuttgart, University of Stuttgart, 1996. 338p.

[40] TEN CATEN, A.; DALMOLIN, R.S.D.; PEDRON, F.A. & MENDONÇA-SANTOS, M.L. Regressões logísticas múltiplas: Fatores que influenciam sua aplicação na predição de classes de solos. R. Bras. Ci. Solo, 35:53-62, 2011.

[41] ZHU, A.X. Mapping soil landscape as spatial continua: The neural network approach. Water Res. Res., 36:663-677, 2000.

[42] ZHU, A.X.; HUDSON, B.; BURT, J.; LUBICH, K. & SIMONSON, D. Soil mapping using GIS, expert knowledge, and fuzzy logic. Soil Sci. Soc. Am. J., 65:1463-1472, 2001.