Acessibilidade / Reportar erro

Comparative analysis of orbital sensors in soybean yield estimation by the random forest algorithm

Análise comparativa de sensores orbitais na estimativa de produtividade de soja pelo algoritmo random forest

ABSTRACT

Remote sensing has proven to be a promising tool allowing crop monitoring over large geographic areas. In addition, when combined with machine learning methods, the algorithms can be used for estimating crop yield. This study sought to estimate soybean yield through the enhanced vegetation index and normalized difference vegetation index. These vegetation indices were obtained using moderate-resolution imaging spectro-radiometer (MODIS) sensors on AQUA and TERRA satellites and multispectral instrument (MSI) sensor on Sentinel-2 satellite. Random forest (RF) algorithm was used to predict soybean yield and the estimation models were compared with the actual plot’s yield. The RF algorithm showed good performance to estimate soybean yield with our models (R2 = 0.60 and RMSE = 0.50 for MSI; R² = 0.63 and RMSE = 0.59 for MODIS). Vegetation indices with imaging dates corresponding to the crop’s maturation had a higher degree of importance in its predictive ability. However, when comparing the actual and predicted soybean production values, differences of 145 kg ha-1 in contrast to 4 kg ha-1 were found for the MODIS and MSI models, respectively. Therefore, the MSI sensor integrated with machine learning algorithms accurately estimated crop yields.

Index terms:
Remote sensing; yield estimation; machine learning.

RESUMO

O Sensoriamento Remoto orbital (SR) tem se mostrado uma ferramenta promissora, pois permite o monitoramento de culturas em grandes áreas geográficas. Além disso, quando métodos de Aprendizado de Máquina (AM) são combinados, os algoritmos podem ser usados para estimativas de produtividade de culturas. Assim, o estudo teve como objetivo estimar a produtividade da soja por meio dos índices de vegetação EVI (Enhanced vegetation index) e NDVI (Normalized Difference vegetation index), obtidos por meio dos sensores MODIS (Moderate-Resolution Imaging Spectroradiometer) dos satélites ACQUA e TERRA e MSI (Multispectral Instrument) do satélite orbital Sentinel-2. Neste estudo, o algoritmo Random Forest (RF) foi usado por ser amplamente difundido no estudos previsão de safras, e os modelos de estimativa de rendimento da soja foram comparados com o rendimento real da parcela. Os resultados mostraram bom desempenho do algoritmo de RF para estimar a produtividade da soja, obtendo R2 de 0,60 e RMSE de 0,50 para MSI; e R² de 0,63 e RMSE de 0,59 para MODIS na validação. Na modelagem, os índices de vegetação com datas de imagem correspondentes à maturação da cultura tiveram maior grau de importância na previsão. No entanto, ao comparar os valores reais e previstos de produção de soja, houve uma diferença de 145 kg ha-1 para o modelo gerado pelo MODIS e apenas 4 kg ha-1 para o MSI. Portanto, o sensor MSI integrado aos algoritmos de aprendizado de máquina estima com precisão o rendimento das culturas.

Termos para indexação:
Sensoriamento remoto; estimativa de produção; aprendizado máquina.

INTRODUCTION

Brazil is the second largest soybean producer in the world, and part of this success is due to solid investments in technologies allowing this crop to adapt to its soil and climate conditions. The development of highly productive cultivars resistant to tropical climates, advances in plant mineral nutrition technology, and strategies for pest and disease control are among the most significant advances (Empresa Brasileira de Pesquisa Agropecuária - EMBRAPA, 2020EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. História da soja. 2020. Available in: <Available in: https://www.embrapa.br/web/portal/soja/cultivos/soja1/historia >. Access in: May 15, 2023.
https://www.embrapa.br/web/portal/soja/c...
).

During the last 10 years, the soybean cultivation area in Brazil increased from approximately 24 to 39 million hectares, with production rising from 75 to 124 million tons and 3.526 kg ha-1 on average (Companhia Nacional de Abastecimento - CONAB, 2023COMPANHIA NACIONAL DE ABASTECIMENTO - CONAB. Safra - Série histórica de grãos. 2023. Available in: <Available in: https://www.conab.gov.br/info-agro/safras/serie-historica-das-safras >. Access in: May 19, 2023.
https://www.conab.gov.br/info-agro/safra...
). The Brazilian soybean 2019/2020 harvest was approximately 125 million tons. The largest producers were the Mato Grosso state reaching about 36 million tons, Paraná with 21 million, Goiás with 13 million, Rio Grande do Sul and Mato Grosso do Sul with 11 million tons each (CONAB, 2020COMPANHIA NACIONAL DE ABASTECIMENTO - CONAB. Previsão de Safras. 2020. Available in: <Available in: https://www.conab.gov.br/infoagro/safras/graos >. Access in: December, 12 2020.
https://www.conab.gov.br/infoagro/safras...
). For the Paraná state, the productivity was 3.792 kg ha-1 (Departamento de Economia Rural - DERAL, 2020DEPARTAMENTO DE ECONOMIA RURAL - DERAL. Boletins conjunturais. 2020. Available in: <Available in: https://www.agricultura.pr.gov.br/sites/default/arquivos_restritos/files/documento/2020-12/Prognostico%20Soja%20-%202020_21.pdf >. Access in: May 17, 2023.
https://www.agricultura.pr.gov.br/sites/...
).

As soybean productivity increases, production costs also rise. Thus, monitoring crop nutritional status and adopting more sustainable techniques that guarantee the rational use of inputs are becoming more necessary for farmers. Precision agriculture has emerged as a tool allowing the management of the production unit through its spatial and temporal variations using numerous methodologies, including remote sensing (RS).

Through images obtained by sensors embedded in satellites, RS can generate information about the physiological and developmental conditions of crops, even in large areas, in a practical and low-cost manner. This type of technology can be used to predict productivity in regions where crops are being grown (Weiss; Jacob; Duveiller, 2020WEISS, M.; JACOB, F.; DUVEILLER, G. Remote sensing for agricultural applications: A meta: Review. Remote Sensing of Environment , 236:111402, 2020.). Although orbital sensors have limitations concerning resolution, especially regarding spectral and spatial accuracy and cloud cover, they are well positioned on stable platforms compared to airborne sensors, automatically generating images with less distortion (Singh et al., 2020SINGH, P. et al. Hyperspectral remote sensing in precision agriculture: Present status challenges and future trends. In: PANDEY, P. et al. Earth observation: Hyperspectral remote Sensing. Elsevier: Netherlands, p.121-146, 2020.).

The application of RS techniques to crops includes understanding the interaction processes between electromagnetic radiation and the different vegetation physiognomic types (Ponzoni; Shimabukuro; Kuplich 2012PONZONI, F. J.; SHIMABUKURO, Y. E.; KUPLICH, T. M. Sensoriamento remoto da vegetação. 2ª ed. atual. Ampl. São Paulo: Oficina de Textos. 2012.176p.). The spectral response of crops depends on a series of biochemical factors of the targeted plant species, in addition to the physical characteristics of the canopy. These factors are specific to the canopy architecture, plant development stages, agronomic parameters, and atmospheric conditions (Martins; Gallo, 2015MARTINS, G. D.; GALO, M. L. B. T. Caracterização espectral da cana-de-açúcar infectada por nematoides e Migdolus Fryanus por espectrorradiometria de campo. Boletim de Ciências Geodésicas, 21(4):783-796, 2015.). In this context, the enhanced vegetation index (EVI) and the normalized difference vegetation index (NDVI) are widely used (Ba et al., 2022BA, R. et al. Informational analysis of MODIS NDVI and EVI time series of sites affected and unaffected by wildfires. Physica A: Statistical Mechanics and its Applications, 604:127911, 2022.) because they capture the status and trend of crop growth (Shammi; Meng, 2021SHAMMI, S. A.; MENG, Q. Use time series NDVI and EVI to develop dynamic crop growth metrics for yield modeling. Ecological Indicators, 121:107124, 2021.). NDVI typically relies on the pigment absorption feature in the red and near-infrared regions. EVI relies on the electromagnetic spectrum’s red, blue, and near-infrared regions. Compared to NDVI, EVI is less sensitive to different soil compositions (Huete; Justice; Leeuwen, 1999HUETE, A.; JUSTICE, C.; LEEUWEN, W. Modis vegetation index (MOD 13): Algorithm theoretical basis. 1999. Available in: <Available in: https://modis.gsfc.nasa.gov/data/atbd/atbd_mod13.pdf >. Access in: January 15, 2021.
https://modis.gsfc.nasa.gov/data/atbd/at...
).

The estimation of agricultural production is essential for market planning and the adoption of public policies to combat hunger. RS has been widely used for data analyses of farming systems. However, this requires processing vast amounts of data from different orbital and suborbital platforms. Machine learning (ML) methods have been employed in this complex scenario due to their high capacity to process large amounts of input data and deal with linear tasks. Recently, advances in target detection technologies and ML methodologies have provided greater cost-effectiveness and solutions for better estimating the state of crops. This will soon be a routine practice in precision agriculture (Chlingaryan; Sukkarieh; Whelan, 2018CHLINGARYAN, A.; SUKKARIEH, S.; WHELAN, B. Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review. Computers and Electronics in Agriculture, 151(8):61-69, 2018.).

In this context, several agricultural studies have integrated data from orbital sensors and ML algorithms. Stepanov et al. (2020STEPANOV, A. et al. Predicting soybean yield at the regional scale using remote sensing and climatic data. Remote Sensing, 12(12):1-23, 2020.) used data from the moderate-resolution imaging spectro-radiometer (MODIS) sensor to monitor soybean crop yields in the Far East of Russia. In turn, Habibi et al. (2021HABIBI, L. N. et al. Machine learning techniques to predict soybean plant density using UAV and satellite-based remote sensing. Remote sensing, 13(13):1-20, 2021.) analyzed the spatial variability of soybean plant density with images from the commercial PlanetScope sensor. Xin et al. (2013XIN, Q. et al. A production efficiency model-based method for satellite estimates of corn and soybean yields in the midwestern US. Remote Sensing , 5(11):5926-5943, 2013.) developed models to estimate corn and soybean production efficiency with MODIS sensor data. Li et al. (2022LI, L. et al. Developing machine learning models with multi-source environmental data to predict wheat yield in China. Computers and Electronics in Agriculture , 194:106790, 2022.) used data from the MODIS sensor associated with environmental variables to estimate wheat yields in northwest China.

Among the different ML methods, the decision tree-based random forest (RF) method has been widely used in different research areas (Minnoor; Baths, 2023MINNOOR, M.; BATHS, V. Diagnosis of breast cancer using random forests. Procedia Computer Science, 218:429-437, 2023.), with good performance in estimating crop productivity (Alabi et al., 2022ALABI, T. R. et al. Estimation of soybean grain yield from multispectral high-resolution UAV data with machine learning models in West Africa. Remote Sensing Applications: Society and Environment, 27:100782, 2022.; Khanal et al., 2018KHANAL, S. et al. Integration of high resolution remotely sensed data and machine learning techniques for spatial prediction of soil properties and corn yield. Computers and Electronics in Agriculture , 153:213-225, 2018.). Furthermore, RF can identify the relative importance of each predictor for the response variable. This study used the RF regression algorithm to compare the performances of MODIS and MSI orbital sensors in estimating soybean yield through EVI and NDVI vegetation indices.

MATERIAL AND METHODS

This study was conducted in 16 agricultural plots, with areas between 12 and 150 ha, concerning the 2020/2021 harvest. This area is located in the Pato Branco Regional Nucleus of Paraná State Department of Agriculture and Supply (SEAB, Brazilian acronym), in municipalities in the southwest of the Paraná State (Figure 1), in the southern region of Brazil.

Figure 1:
Location of study areas.

The Pato Branco Regional Nucleus had a 12.9% increase in soybean cultivation area and a 33.9% increase in soybean grain production, totaling 1,292,682 tons. These values allowed this regional nucleus to occupy the fifth-ranked position in production in the Paraná State in the 2019/2020 harvest (DERAL, 2020DEPARTAMENTO DE ECONOMIA RURAL - DERAL. Boletins conjunturais. 2020. Available in: <Available in: https://www.agricultura.pr.gov.br/sites/default/arquivos_restritos/files/documento/2020-12/Prognostico%20Soja%20-%202020_21.pdf >. Access in: May 17, 2023.
https://www.agricultura.pr.gov.br/sites/...
).

The region’s climate is predominantly Cfa and Cfb according to the Köppen classification, with a historical average annual precipitation (from 1977 to 2006) ranging from 1800 to 2000 mm (Agência Nacional de Águas - ANA, 2013AGÊNCIA NACIONAL DE ÁGUAS - ANA. Pluviometria. Atlas geográfico de recursos hídricos do Brasil. 2013. Available in: <Available in: https://portal1.snirh.gov.br/atlasrh2013/ >. Access in: May 19, 2023.
https://portal1.snirh.gov.br/atlasrh2013...
). The pedology is comprised of Latosol, Nitisol, Chernozem, and Neosol in those areas with greater slope (EMBRAPA, 2006EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. Levantamento de reconhecimento dos solos do estado do Paraná. 2006. Rio de Janeiro. Scale 1:250.000. Available in: <Available in: http://geoinfo.cnps.embrapa.br/layers/geonode%3Aparana_solos_20201105 >. Access in: May 17, 2023.
http://geoinfo.cnps.embrapa.br/layers/ge...
).

To obtain real data on soybean production in the 2020/2021 harvest, as well as the spatial variability of each plot, a John Deere GreenStar TM 3 (2630) harvest monitor was used, which made it possible to obtain production data with approximate dimensions of 1.5 x 8.5 meters (Figure 2).

Figure 2:
General scheme for obtaining soybean harvest data in plots. Adapted from John Deere (2022)JOHN DEERE. Monitor GreenStar™3 2630. 2022. Available in: <Available in: https://www.deere.com.br/pt/agricultura-de-precis%C3%A3o/receptores-monitores-e-rtk/monitor-greenstar-gs2-2630/ >. Access in: August 01, 2022.
https://www.deere.com.br/pt/agricultura-...
.

To estimate soybean production, the predictor variables NDVI (Equation 1) and EVI (Equation 2) corresponding to the period from sowing to harvest (October to March) were generated using RS. Visible and infrared images from the MSI sensor of SENTINEL-2A and 2B (L2A) satellites of the European Space Agency (ESA) were obtained without the presence of clouds, with a spatial resolution of 10 m and temporal resolution of five days, adjusted to the top of the atmosphere, totaling 16 variables.

Images from the MODIS sensor were obtained from the portal of the Brazilian Agricultural Research Corporation (EMBRAPA, Brazilian acronym), which distributes cloud-free images from the National Aeronautics and Space Administration (NASA). The MODIS image collection had 38 variables with a spatial resolution of 250 m and a temporal resolution of two days. The preparation of the indices and the data extraction were performed using QGIS 3.10 software.

N D V I = N I R R N I R + R (1)

Wherein:

NIR = Near Infrared;

R = Red.

E V I = G N I R R L + N I R + C 1 R C 2 B (2)

G = Gain factor;

L = Adjustment factor for the ground;

NIR = Near Infrared;

R = Red;

B = Blue;

C1 and C2 = adjustment coefficients for the aerosol effect of the atmosphere.

The knowledge-discovery in databases (KDD) method is used to process a large dataset through the pre-processing, mining, and post-processing of data (Goldschmidt; Passos; Bezerra, 2015GOLDSCHMIDT, R.; PASSOS, E.; BEZERRA, E. Data Mining: Conceitos, técnicas, algoritmos, orientações e aplicações. 2. ed. Rio de Janeiro: Elsevier. 2015. 296p.). Production granularity of the pre-processing stage, from the specific data on soybean productivity in the plots, was adjusted according to the spatial resolutions of the MODIS and MSI images. The average was calculated for each pixel of 250 and 10 meters, respectively. After the adjustment, the average values of each sensor’s EVI and NDVI vegetation indices were extracted using the Zonal Statistics tools of the QGIS 3.10 software.

Processing and extraction of soybean vegetation indices in the selected plots made it possible to create two data sets for the 2020/2021 harvest, one for MSI with 17 attributes and 74,570 instances; and another for the MODIS sensor, with 39 attributes and 311 instances. Subsequently, outliers and extreme values were identified and excluded.

The data mining step was carried out in R Studio (R Core Team, 2021R CORE TEAM. R: A language and environment for statistical computing. R Foundation for Statistical Computing. 2021. Vienna. Austria. Available in: <Available in: https://www.R-project.org/ >. Access in: May 19, 2023.
https://www.R-project.org/...
), where the division of the sets first took place, 70% for training and 30% for model validation. RF regression was performed using the Random Forest package based on the production response variable and the vegetation indices, the latter used as predictor variables.

A ntree of 100 was determined for RF regression and the predictor variables were set as the default. Decision trees are represented as a set of rules that start at the tree’s root and group to one of its leaves. The final product of these decisions consists of a directed acyclic graph in which each leaf node corresponds to a class or a decision node containing a test of some attribute (Monard; Baranauskas, 2003MONARD, M. C.; BARANAUSKAS, J. A. Conceitos sobre aprendizado de máquina. In: REZENDE, S. O. Sistemas inteligentes fundamentos e aplicações. Barueri: Manole Ltda, p.89-114, 2003.).

The RF regression algorithm builds a multitude of decision trees when training the samples through an average prediction of the individual trees. For James et al. (2013JAMES, G. et al. An introduction to Statistical Learning: With application in R. London: Springer. 2013. 441p.), this algorithm builds decision trees each time a split in a tree was considered, causing a random sample of n predictors to be chosen as candidates, and dividing complete sets of predictors (Figure 3).

Figure 3:
Representative scheme of the RF regression algorithm. Adapted from Rodriguez-Galiano et al. (2016)RODRIGUEZ-GALIANO, V. F. et al. Modelling interannual variation in the spring and autumn land surface phenology of the European forest. Biogeosciences, 13(11):3305-3317, 2016..

From the analyzes carried out by RF, it was possible to evaluate the performance of the models using data from the MODIS and MSI sensors, generate descriptive statistics, and consider which were the five most important variables for estimating soybean yield.

RESULTS AND DISCUSSION

In our test model, the RF regression models have a coefficient of determination (R²) of 0.63 or 0.60 for vegetation indices generated by MSI (Figure 4c) and MODIS (Figure 4d) data, respectively. Root mean squared deviation (RMSE) values showed that MSI and MODIS sensor data models were similar.

Figure 4:
Comparison of actual and predicted soybean yield dispersion in the field by using RF regression in testing data obtained from MODIS (a, c) and MSI (b, d) sensors.

These results were similar to other studies using vegetation indices as predictive variables to estimate the productivity of crops. For example, the estimation models from Johnson (2014JOHNSON, D. M. An assessment of pre- and within-season remotely sensed variables for forecasting corn and soybean yields in the United States. Remote Sensing of Environment, 141:116-128, 2014.) showed R2 = 0.71 for soybean and R2 = 0.77 for corn. Liu et al. (2020LIU, J. et al. Crop yield estimation in the canadian prairies using terra/MODIS-Derived crop metrics. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. 13:2685-2697, 2020.) applied the MODIS NDVI to estimate barley, rapeseed, and wheat yields in humid regions and obtained R2 values between 0.53 and 0.70.

Khanal et al. (2018KHANAL, S. et al. Integration of high resolution remotely sensed data and machine learning techniques for spatial prediction of soil properties and corn yield. Computers and Electronics in Agriculture , 153:213-225, 2018.) used a high-resolution multispectral image of bare land and topographic terrain to predict soil properties and corn yield using machine learning algorithms and obtained R² = 0.53 with the RF algorithm. Pantazi et al. (2016PANTAZI, X. E. et al. Wheat yield prediction using machine learning and advanced sensing techniques. Computers and Electronics in Agriculture , 121:57-65, 2016.) estimated wheat production with counter-propagation artificial neural networks with an average overall accuracy of 78.3%. In the work of Li et al. (2022LI, L. et al. Developing machine learning models with multi-source environmental data to predict wheat yield in China. Computers and Electronics in Agriculture , 194:106790, 2022.), vegetation indices from MODIS images were used to estimate wheat yields in Northwest China, with R² = 0.74 and RMSE of 0.758.

It is worth noting the recent use of unmanned aerial vehicles in monitoring crops, which allows greater autonomy when obtaining images, especially with their high spatial resolution. However, the study carried out by Alabi et al. (2022ALABI, T. R. et al. Estimation of soybean grain yield from multispectral high-resolution UAV data with machine learning models in West Africa. Remote Sensing Applications: Society and Environment, 27:100782, 2022.) monitored a soybean field located in Nigeria with images of 12 cm of spatial resolution and algorithms such as RF and the Cubist model obtained an R² of 0.89; higher than the results presented in this research.

Regarding the average of actual and estimated productivity of the fields studied (Table 1), a difference of only 4 kg ha-1 was found from the MSI data and 145 kg ha-1 for the MODIS sensor. These results showed that the MSI sensor was more sensitive in generating the vegetation indices and consequently gave a better predictive performance by ML.

The RF regression algorithm performed well in estimating soybean yields in the 16 fields studied (Table 1 and Figure 3). Jeong et al. (2016JEONG, J. H. et al. Random forests for global and regional crop yield predictions. PLoS ONE, 11(6):1-15, 2016.) the potential of the RF regression algorithm for estimating global and regional crops. RF can model complex cropping systems such as wheat, corn, and potato, and configures itself as an alternative statistical modeling method for crop yield prediction.

Table 1:
Descriptive statistics of actual and estimated soybean yield (tons ha-1) of the plots by RF.

The regression models generated from MODIS and MSI images showed differences in predicting the average productivity of the plots measured in tons per hectare. However, they presented similar R² and RMSE. The differences in the resolution of the sensors could explain this, despite the low temporality of the MSI images with its spatial resolution of 10 m, which allowed to estimate the production of each 100 m2. In contrast, the MODIS sensor has a pixel of 250 m allowing to estimate the production for every 6.25 ha.

Among these analyses, it was also possible to evaluate which five variables contributed most to generating the productivity estimation models. The importance of the input variables can be evaluated through the impurities implemented in the RF algorithm. The impurities are extracted from the regression trees by calculating IncNodePurity, corresponding to the total decrease in node impurities from the division in the predictors. An increase in the IncNodePurity value implies a reduction in the mean squared error, which means that the highest values represent the essential variables for the response (Habibi et al., 2021HABIBI, L. N. et al. Machine learning techniques to predict soybean plant density using UAV and satellite-based remote sensing. Remote sensing, 13(13):1-20, 2021.). Figure 5 shows the relative importance of the variables in the models used in this study.

Figure 5:
Ranking of the most important predictor variables for the MODIS (a) and MSI (b) sensor models.

Both EVI and NDVI vegetation indices, despite having differences in their composition and sensitivity to different soil compositions, as highlighted by Huete, Justice, and Leeuwen (1999HUETE, A.; JUSTICE, C.; LEEUWEN, W. Modis vegetation index (MOD 13): Algorithm theoretical basis. 1999. Available in: <Available in: https://modis.gsfc.nasa.gov/data/atbd/atbd_mod13.pdf >. Access in: January 15, 2021.
https://modis.gsfc.nasa.gov/data/atbd/at...
), had the same importance in predicting soybean yield with the models generated for both MODIS and MSI sensors.

The most prominent predictor variables were those corresponding to January and February, equivalent to the physiological maturation stages of the soybean crop called R7 and R8. In these stages, the predominant characteristic is the appearance of the first normal pod on the main stem with mature color (until 95% yellowing of the pods) (Neumaier et al., 2020NEUMAIER, N. et al. Ecofisiologia da soja. In: SEIXAS, C. D. et al. Tecnologias de produção de soja. Londrina: Embrapa Soja. p. 33-54, 2020.).

In contrast, the study carried out by Shami and Meng (2021) in the Mississippi Delta with the MODIS EVI and NDVI indices was based on growth metrics from the beginning to the complete development of the pods (R3, R5, and R6), and showed the best-predicted soybean crop productivity with 95% accuracy. However, the response variable inserted in the model was at the municipality level obtained through agricultural statistics, unlike the present study, which uses production data from orbital sensors and vegetation indices.

CONCLUSIONS

We obtained good performance in estimating agricultural yield by the high-performance RF algorithm associated with EVI and NDVI vegetation indices from MSI sensor images, which have a lower temporal resolution but with high spatial resolution. Based on the modeling obtained in this study, it will be possible to estimate production at regional levels in the future.

AUTHOR CONTRIBUTION

Conceptual idea: Batistella, D.; Methodology design: Batistella, D.; Lima, V. A. Data collection: Batistella, D.; Data analysis and interpretation: Campos, J. R. R.; Modolo, A. J.; and Writing and editing: Batistella, D.; Campos, J. R. R.; Modolo, A. J.

ACKNOWLEDGEMENTS

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - EDITAL SIMPLIFICADO Nº 08/2023 - PROPPG/UTFPR. We thank the rural producers for providing the harvest data.

REFERENCES

  • AGÊNCIA NACIONAL DE ÁGUAS - ANA. Pluviometria. Atlas geográfico de recursos hídricos do Brasil. 2013. Available in: <Available in: https://portal1.snirh.gov.br/atlasrh2013/ >. Access in: May 19, 2023.
    » https://portal1.snirh.gov.br/atlasrh2013/
  • ALABI, T. R. et al. Estimation of soybean grain yield from multispectral high-resolution UAV data with machine learning models in West Africa. Remote Sensing Applications: Society and Environment, 27:100782, 2022.
  • BA, R. et al. Informational analysis of MODIS NDVI and EVI time series of sites affected and unaffected by wildfires. Physica A: Statistical Mechanics and its Applications, 604:127911, 2022.
  • CHLINGARYAN, A.; SUKKARIEH, S.; WHELAN, B. Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review. Computers and Electronics in Agriculture, 151(8):61-69, 2018.
  • COMPANHIA NACIONAL DE ABASTECIMENTO - CONAB. Previsão de Safras. 2020. Available in: <Available in: https://www.conab.gov.br/infoagro/safras/graos >. Access in: December, 12 2020.
    » https://www.conab.gov.br/infoagro/safras/graos
  • COMPANHIA NACIONAL DE ABASTECIMENTO - CONAB. Safra - Série histórica de grãos. 2023. Available in: <Available in: https://www.conab.gov.br/info-agro/safras/serie-historica-das-safras >. Access in: May 19, 2023.
    » https://www.conab.gov.br/info-agro/safras/serie-historica-das-safras
  • DEPARTAMENTO DE ECONOMIA RURAL - DERAL. Boletins conjunturais. 2020. Available in: <Available in: https://www.agricultura.pr.gov.br/sites/default/arquivos_restritos/files/documento/2020-12/Prognostico%20Soja%20-%202020_21.pdf >. Access in: May 17, 2023.
    » https://www.agricultura.pr.gov.br/sites/default/arquivos_restritos/files/documento/2020-12/Prognostico%20Soja%20-%202020_21.pdf
  • EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. História da soja. 2020. Available in: <Available in: https://www.embrapa.br/web/portal/soja/cultivos/soja1/historia >. Access in: May 15, 2023.
    » https://www.embrapa.br/web/portal/soja/cultivos/soja1/historia
  • EMPRESA BRASILEIRA DE PESQUISA AGROPECUÁRIA - EMBRAPA. Levantamento de reconhecimento dos solos do estado do Paraná. 2006. Rio de Janeiro. Scale 1:250.000. Available in: <Available in: http://geoinfo.cnps.embrapa.br/layers/geonode%3Aparana_solos_20201105 >. Access in: May 17, 2023.
    » http://geoinfo.cnps.embrapa.br/layers/geonode%3Aparana_solos_20201105
  • GOLDSCHMIDT, R.; PASSOS, E.; BEZERRA, E. Data Mining: Conceitos, técnicas, algoritmos, orientações e aplicações. 2. ed. Rio de Janeiro: Elsevier. 2015. 296p.
  • HABIBI, L. N. et al. Machine learning techniques to predict soybean plant density using UAV and satellite-based remote sensing. Remote sensing, 13(13):1-20, 2021.
  • HUETE, A.; JUSTICE, C.; LEEUWEN, W. Modis vegetation index (MOD 13): Algorithm theoretical basis. 1999. Available in: <Available in: https://modis.gsfc.nasa.gov/data/atbd/atbd_mod13.pdf >. Access in: January 15, 2021.
    » https://modis.gsfc.nasa.gov/data/atbd/atbd_mod13.pdf
  • JAMES, G. et al. An introduction to Statistical Learning: With application in R. London: Springer. 2013. 441p.
  • JEONG, J. H. et al. Random forests for global and regional crop yield predictions. PLoS ONE, 11(6):1-15, 2016.
  • JOHN DEERE. Monitor GreenStar™3 2630. 2022. Available in: <Available in: https://www.deere.com.br/pt/agricultura-de-precis%C3%A3o/receptores-monitores-e-rtk/monitor-greenstar-gs2-2630/ >. Access in: August 01, 2022.
    » https://www.deere.com.br/pt/agricultura-de-precis%C3%A3o/receptores-monitores-e-rtk/monitor-greenstar-gs2-2630/
  • JOHNSON, D. M. An assessment of pre- and within-season remotely sensed variables for forecasting corn and soybean yields in the United States. Remote Sensing of Environment, 141:116-128, 2014.
  • KHANAL, S. et al. Integration of high resolution remotely sensed data and machine learning techniques for spatial prediction of soil properties and corn yield. Computers and Electronics in Agriculture , 153:213-225, 2018.
  • LI, L. et al. Developing machine learning models with multi-source environmental data to predict wheat yield in China. Computers and Electronics in Agriculture , 194:106790, 2022.
  • LIU, J. et al. Crop yield estimation in the canadian prairies using terra/MODIS-Derived crop metrics. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. 13:2685-2697, 2020.
  • MARTINS, G. D.; GALO, M. L. B. T. Caracterização espectral da cana-de-açúcar infectada por nematoides e Migdolus Fryanus por espectrorradiometria de campo. Boletim de Ciências Geodésicas, 21(4):783-796, 2015.
  • MINNOOR, M.; BATHS, V. Diagnosis of breast cancer using random forests. Procedia Computer Science, 218:429-437, 2023.
  • MONARD, M. C.; BARANAUSKAS, J. A. Conceitos sobre aprendizado de máquina. In: REZENDE, S. O. Sistemas inteligentes fundamentos e aplicações. Barueri: Manole Ltda, p.89-114, 2003.
  • NEUMAIER, N. et al. Ecofisiologia da soja. In: SEIXAS, C. D. et al. Tecnologias de produção de soja. Londrina: Embrapa Soja. p. 33-54, 2020.
  • PANTAZI, X. E. et al. Wheat yield prediction using machine learning and advanced sensing techniques. Computers and Electronics in Agriculture , 121:57-65, 2016.
  • PONZONI, F. J.; SHIMABUKURO, Y. E.; KUPLICH, T. M. Sensoriamento remoto da vegetação. 2ª ed. atual. Ampl. São Paulo: Oficina de Textos. 2012.176p.
  • R CORE TEAM. R: A language and environment for statistical computing. R Foundation for Statistical Computing. 2021. Vienna. Austria. Available in: <Available in: https://www.R-project.org/ >. Access in: May 19, 2023.
    » https://www.R-project.org/
  • RODRIGUEZ-GALIANO, V. F. et al. Modelling interannual variation in the spring and autumn land surface phenology of the European forest. Biogeosciences, 13(11):3305-3317, 2016.
  • SHAMMI, S. A.; MENG, Q. Use time series NDVI and EVI to develop dynamic crop growth metrics for yield modeling. Ecological Indicators, 121:107124, 2021.
  • SINGH, P. et al. Hyperspectral remote sensing in precision agriculture: Present status challenges and future trends. In: PANDEY, P. et al. Earth observation: Hyperspectral remote Sensing. Elsevier: Netherlands, p.121-146, 2020.
  • STEPANOV, A. et al. Predicting soybean yield at the regional scale using remote sensing and climatic data. Remote Sensing, 12(12):1-23, 2020.
  • WEISS, M.; JACOB, F.; DUVEILLER, G. Remote sensing for agricultural applications: A meta: Review. Remote Sensing of Environment , 236:111402, 2020.
  • XIN, Q. et al. A production efficiency model-based method for satellite estimates of corn and soybean yields in the midwestern US. Remote Sensing , 5(11):5926-5943, 2013.

Publication Dates

  • Publication in this collection
    03 July 2023
  • Date of issue
    2023

History

  • Received
    24 Feb 2023
  • Accepted
    05 May 2023
Editora da Universidade Federal de Lavras Editora da UFLA, Caixa Postal 3037 - 37200-900 - Lavras - MG - Brasil, Telefone: 35 3829-1115 - Lavras - MG - Brazil
E-mail: revista.ca.editora@ufla.br