Acessibilidade / Reportar erro

High-efficiency phenotyping for vitamin A in banana using artificial neural networks and colorimetric data

ABSTRACT

Banana is one of the most consumed fruits in Brazil and an important source of minerals, vitamins and carbohydrates for human diet. The characterization of banana superior genotypes allows identifying those with nutritional quality for cultivation and to integrate genetic improvement programs. However, identification and quantification of the provitamin carotenoids are hampered by the instruments and reagents cost for chemical analyzes, and it may become unworkable if the number of samples to be analyzed is high. Thus, the objective was to verify the potential of indirect phenotyping of the vitamin A content in banana through artificial neural networks (ANNs) using colorimetric data. Fifteen banana cultivars with four replications were evaluated, totaling 60 samples.

For each sample, colorimetric data were obtained and the vitamin A content was estimated in the ripe banana pulp. For the prediction of the vitamin A content by colorimetric data, multilayer perceptron ANNs were used. Ten network architectures were tested with a single hidden layer. The network selected by the best fit (least mean square error) had four neurons in the hidden layer, enabling high efficiency in prediction of vitamin A (r2 = 0.98). The colorimetric parameters a* and Hue angle were the most important in this study. High-scale indirect phenotyping of vitamin A by ANNs on banana pulp is possible and feasible.

Key words
Musa spp.; colorimetric parameters; computational intelligence; multilayer perceptro; phenomic

RESUMO

A banana é uma das frutas mais consumidas no Brasil, sendo importante fonte de minerais, vitaminas e carboidratos na dieta humana. A caracterização de genótipos superiores de banana permite identificar aqueles com qualidade nutricional para cultivo e para integrar programas de melhoramento genético. Porém, a identificação e quantificação dos carotenoides provitamínicos são dificultadas pelo custo instrumental e dos reagentes químicos para as análises, podendo se tornar inviável caso o número de amostras a serem analisadas seja elevado. Assim, objetivou-se verificar o potencial da fenotipagem indireta do teor de vitamina A em banana por redes neurais artificiais (RNAs) utilizando-se dados colorimétricos. Foram avaliadas 15 cultivares de bananeira com quatro repetições, totalizando 60 amostras. Para cada amostra, foram obtidos dados colorimétricos, estimando-se o teor de vitamina A na polpa dos frutos maduros. Para a predição do teor de vitamina A por dados colorimétricos, utilizaram-se RNAs do tipo perceptron multicamadas. Foram testadas dez arquiteturas de rede com uma única camada intermediária. A rede selecionada pelo melhor ajuste (menor erro quadrático médio) teve quatro neurônios na camada intermediária, possibilitando alta eficiência na predição de vitamina A (r2 = 0,98). Os parâmetros colorimétricos a* e ângulo Hue foram os mais importantes neste estudo. A fenotipagem indireta em alta escala da vitamina A por meio de RNAs na polpa de banana é possível e viável.

Palavras-chave
Musa spp.; parâmetros colorimétricos; inteligência computacional; perceptron multicamadas; fenômica

INTRODUCTION

The banana tree (Musa spp.) is one of the most cultivated fruit trees in tropical and subtropical countries. In Brazil, the production of bananas and plantains was 6.89 million tons in 485,000 hectares of harvested area in 2013 (FAO 2015Food and Agriculture Organization of the United Nations (2015). [accessed 2015 Aug 12]. http://faostat.fao.org/site/339;defaut.aspx
http://faostat.fao.org/site/339;defaut.a...
). Due to its good organoleptic properties and low cost, bananas are consumed by people across the social spectrum, representing a good source of minerals, vitamins and carbohydrates, with a high potential as a functional and nutraceutical food (Amorim et al. 2011Amorim, E. P., Cohen, K. O., Amorim, V. B. O., Paes, N. S., Sousa H. N., Santos-Serejo J. A. and Silva, S. O. (2011). Caracterização de acessos de bananeira com base na concentração de compostos funcionais. Ciência Rural, 41, 592-598. http://dx.doi.org/10.1590/S0103-84782011005000042.
http://dx.doi.org/10.1590/S0103-84782011...
; Aquino et al. 2014Aquino, C. F., Salomão, L. C. C., Siqueira, D. L., Cecon, P. R. and Ribeiro, S. M. R. (2014). Teores de minerais em polpas e cascas de frutos de cultivares de bananeira. Pesquisa Agropecuária Brasileira, 49, 546-553. http://dx.doi.org/10.1590/S0100-204X2014000700007.
http://dx.doi.org/10.1590/S0100-204X2014...
). The carotenoid contents, such as lutein, β-carotene, and α-carotene, which play an important role in the operation of the human body, stand out among the functional and nutraceutical properties. Moreover, β-carotene and α-carotene are converted to vitamin A in the human body (Davey et al. 2009Davey, M. W., Bergh, V. D., Markham, R., Swnnen, R. and Keulemans, J. (2009). Genetic variability in Musa fruit provitamin A carotenoids, lutein and mineral micronutrient contents. Food Chemistry, 115, 806-813. http://dx.doi.org/10.1016/j.foodchem.2008.12.088.
http://dx.doi.org/10.1016/j.foodchem.200...
).

Vitamin A deficiency is considered a serious nutritional disease and is the most common cause of preventable blindness in the world (Santos et al. 2010Santos, E. M., Velarde, L. G. C. and Ferreira, V. A. (2010). Associação entre deficiência de vitamina A e variáveis socioeconômicas, nutricionais e obstétricas de gestantes. Ciência & Saúde Coletiva, 15, 1021-1030. http://dx.doi.org/10.1590/S1413-81232010000700008.
http://dx.doi.org/10.1590/S1413-81232010...
). One of the sustainable ways to mitigate the problem of vitamin A deficiency is to encourage the consumption of natural foods rich in provitamin carotenoids, such as fruits and vegetables (Ekesa et al. 2012Ekesa, B., Poulaert, M., Davey, M.W., Kimiywe, J. Van Den Bergh, I., Blomm, G. and Dhuique-Mayer, C. (2012). Bioaccessibility of provitamin A carotenoids in bananas (Musa spp.) and derived dishes in African countries. Food Chemistry, 133, 1471-1477. http://dx.doi.org/10.1016/j.foodchem.2012.02.036.
http://dx.doi.org/10.1016/j.foodchem.201...
). Thus, the prospection of banana access into collections is important to breeding programs, focusing on the development of cultivars with better nutraceutical properties (Amorim et al. 2011Amorim, E. P., Cohen, K. O., Amorim, V. B. O., Paes, N. S., Sousa H. N., Santos-Serejo J. A. and Silva, S. O. (2011). Caracterização de acessos de bananeira com base na concentração de compostos funcionais. Ciência Rural, 41, 592-598. http://dx.doi.org/10.1590/S0103-84782011005000042.
http://dx.doi.org/10.1590/S0103-84782011...
). However, the quantification of vitamin A content is expensive and it may become unfeasible if the number of samples to be analyzed is high.

The indirect estimate of the carotenoid content and, consequently, the provitamin one is possible by using colorimetric data, which are easily measured in the pulp or peel of the fruit using the colorimeter. This analytical approach has been used in tomato (Carvalho et al. 2005Carvalho, W., Fonseca, M. E. N., Silva, H. R., Boiteux, L. S. and Giordano, L. B. (2005). Estimativa indireta de teores de licopeno em frutos de genótipos de tomateiro via análise colorimétrica. Horticultura Brasileira, 23, 819-825. http://dx.doi.org/10.1590/S0102-05362005000300026.
http://dx.doi.org/10.1590/S0102-05362005...
; Fernandez-Ruiz et al. 2010), pumpkin (Seroczyńska et al. 2006Seroczyńska, A., Korzeniewska, A., Sztangret-Wiśniewska, J., Niemirowicz-Szczytt, K. and Gajewski, M. (2006). Relationship between carotenoids content and flower or fruit flesh colour of winter squash (Cucurbita máxima). Folia Horticulturae, 18, 51-61.; Itle and Kabelka 2009Itle, R. A. and Kabelka, E. A. (2009). Correlation between L. a. b. color space values and carotenoid content in pumpkins and squash (Cucurbita spp.). HortScience, 44, 633-637.; Doka et al. 2013), and potato (Lu et al. 2001Lu, W., Haynes, K., Wiley, E. and Clevidence, B. (2001). Carotenoid content and color in diploid potatoes. Journal of the American Society for Horticultural Science, 126, 722-726.). The indirect estimate of the carotenoid content can reduce the time, labor and financial resources in the evaluation stages.

Because artificial neural networks (ANNs) are efficient to model complex problems (Barbosa et al. 2011Barbosa, C. D., Viana, A. P., Quintal, S. S. R. and Pereira, M. G. (2011). Artificial neural network analysis of genetic diversity in Carica papaya L. Crop Breeding and Applied Biotechnology, 11, 224-231. http://dx.doi.org/10.1590/S1984-70332011000300004.
http://dx.doi.org/10.1590/S1984-70332011...
; Nascimento et al. 2013Nascimento, M., Peternelli, L. A., Cruz, C. D., Nascimento, A. C. C., Ferreira, R. P., Bhering, L. P. and Salgado, C. C. (2013). Artificial neural networks for adaptability and stability evaluation in alfalfa genotypes. Crop Breeding and Applied Biotechnology, 13, 152-156.; Azevedo et al. 2015Azevedo, A. M., Andrade Júnior, V. C., Pedrosa, C. E., Oliveira, C. M., Dornas, M. F. S., Cruz, C. D. and Valadares, N. R. (2015). Application of artificial neural networks in indirect selection: a case study on the breeding of lettuce. Bragantia, 74, 1-7. http://dx.doi.org/10.1590/1678-4499.0088.
http://dx.doi.org/10.1590/1678-4499.0088...
; Brasileiro et al. 2015Brasileiro, B. P., Marinho, C. D., Costa, P. M. A., Cruz, C. D., Peternelli, L. A. and Barbosa, M. H. P. (2015). Selection in sugarcane families with artificial neural networks. Crop Breeding and Applied Biotechnology, 15, 72-78. http://dx.doi.org/10.1590/1984-70332015v15n2a14.
http://dx.doi.org/10.1590/1984-70332015v...
), they may also be effective in the indirect phenotyping of vitamin A content by using colorimetric data. The ANNs are computational models of the human brain that can recognize patterns and regularities of the data, becoming an alternative as universal approximator of complex functions (Gianola et al. 2011Gianola, D., Okut, H., Weigel, K. A. and Rosa, G. J. (2011). Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat. BMC Genetics, 12, 87-101. http://dx.doi.org/10.1186/1471-2156-12-87.
http://dx.doi.org/10.1186/1471-2156-12-8...
). Consequently, they may perform better than conventional statistical models, with the advantage of being non-parametric, do not require detailed information about the physical processes of the system to be modeled, and tolerate data loss (Azevedo et al. 2015Azevedo, A. M., Andrade Júnior, V. C., Pedrosa, C. E., Oliveira, C. M., Dornas, M. F. S., Cruz, C. D. and Valadares, N. R. (2015). Application of artificial neural networks in indirect selection: a case study on the breeding of lettuce. Bragantia, 74, 1-7. http://dx.doi.org/10.1590/1678-4499.0088.
http://dx.doi.org/10.1590/1678-4499.0088...
).

Thus, the objective of the present research was to verify the phenotyping potential of the vitamin A content in banana, using ANNs and colorimetric data.

MATERIAL AND METHODS

Banana bunches of the cultivars Ouro (AA), Nanica (AAA), Nanicão (AAA), Caru-Verde (AAA), Caru-Roxa (AAA), Caipira (AAA), Prata (AAB), Prata-Anã (AAB), Maçã (AAB), Mysore (AAB), Pacovan (AAB), Marmelo (ABB), Prata-Graúda (AAAB) and Caju (unidentified genomic group), as well as Terrinha plantain (AAB), were harvested from an experimental orchard in the Universidade Federal de Viçosa, Viçosa, Minas Gerais.

The banana bunches were harvested when the first signs of yellow color appeared in the fruits of each cultivar. The bananas were removed from the second, third and fourth tiers hands, and the damaged, diseased and malformed ones were discarded. Subsequently, they were immersed in ethephon solution (1.2 g∙L–1) for 8 min to even the ripening. After drying in air for 15 min, they were dipped in Prochloraz fungicide solution (0.49 g∙L–1) for 5 min. Then, the fruits remained at room temperature until the complete ripening.

The completely randomized design was adopted, with 15 treatments (cultivars) and four replications (clusters) — six fruits per sample unit. The bananas were peeled, cut longitudinally, and the colorimetric reading was performed inside the fruit using the colorimeter Konica-Minolta, model CR 10. The values of L*, a*, b*, C* and Hue angle (°hue) were determined. L* (brightness) ranges from 0 (black) to 100 (white); a* varies from green (−60) to yellow (+60); b* ranges from blue (−60) to yellow (+60); C* is chroma/saturation or color intensity. The °hue value ranges from 0° to 360°, being 0° (red), 90° (yellow), 180° (green) and 270° (blue) (McGuire 1992McGuire, R. G. (1992). Reporting of objective color measurements. HortScience, 27, 1254-1260.).

Carotenoids were extracted according to the method proposed by Rodríguez-Amaya (2001)Rodríguez-Amaya, D. B. (2001). A guide to carotenoid analysis in foods. Washington: International Life Sciences Institute Press. with modifications. A 5-g sample of plant material was weighed; 60 mL of 100% acetone (which was cooled) were added. Then, the material was processed in an Ultra-Turrax homogenizer (model T18 Basic) for 6 min. Subsequently, the extract was vacuum-filtered through Buchner funnel using filter paper; then it was transferred to a separatory funnel containing 20 mL of cooled petroleum ether and washed with distilled water to remove the acetone completely. Anhydrous sodium sulfate p.a. was added to remove the residual water contained in the extract.

The carotenoids were analyzed by high-performance liquid chromatography HPLC-DAD, following the chromatographic conditions adjusted by PinheiroSant’ana et al. (1998). The Shimadzu chromatograph was equipped with a high-pressure pump, LC-10AT VP model, with SIL-10AF automatic injector, and UV-visible diode array detector, SPD-M10A model, controlled by the Multi System software, Class VP 6.12. We employed a chromatographic column Phenomenex Gemini RP-18, 250 × 4.6 mm, with 5 μm internal particle, equipped with Phenomenex ODS guard column (C18), 4 × 3 mm, and detection at 450 nm. The mobile phase consisted of methanol:ethyl acetate:acetonitrile (80:10:10, v/v/v), HPLC grade, 2.0 mL∙min–1 flow rate, and 13 min run time.

The peaks of interest were identified by comparing the retention times of the standard and samples and, especially, through the absorption spectrum. The quantification was performed using the standard curves of concentration versus area, and the results are expressed in μg per 100 g of each plant, on a wet basis. The vitamin A content was obtained according to the recommendations of the Institute of Medicine (2001)Institute of Medicine (2001). Dietary Reference Intakes (DRIs): vitamin A, vitamin K, arsenic, boron, chromium, copper, iodine, iron, manganese, molybdenum, nickel, silicon, vanadium and zinc. Washington: National Academy Press.. The carotenoids (β-carotene and α-carotene) were quantified, and the steps were performed, being protected from direct light to prevent degradation of the material.

Thus, the colorimetric parameters (L*, a*, b*, C* and °hue) and vitamin A content for 60 samples (four replicates of 15 cultivars) were obtained. These data were analyzed in the R software (R Development Core Team 2012R Development Core Team (2012). The R Project for Statistical Computing; [accessed 2016 May 17]. http://www.R-project.org/
http://www.R-project.org/...
) by ANNs. For the best efficiency in the training of networks, both input (color data) and output (vitamin A content) data were normalized to the range between 0 and 1 by the “normalize Data” function of the RSNNS package (Bergmeir and Benitez 2012).

The analysis by ANNs showed that 70% of the data (42 samples) were used to train the network and 30%, for validation (18 samples). The samples that formed the training and validation fractions were randomly selected. The Multi-Layer-Perceptron (MLP) networks were used for the analysis and developed using the “mlp” function of the RSNNS package with back propagation algorithm and learning rate of 0.1. The maximum number of training/epochs was 1,000, the activation function for the hidden layer was the logistics and the output layer was the linear. Ten network architectures were tested to determine a trained network with good fit, with 1, 2, 3, …, 9 and 10 neurons in the hidden layer. Considering that, at the beginning of the training, the free parameters were randomly generated and that these initial values can influence the final result of the training (Soares et al. 2014Soares, F. C., Robaina, A. D., Peiter, M. X., Russi, J. L. and Vivan, G. A. (2014). Redes neurais artificiais na estimativa da retenção de água do solo. Ciência Rural, 44, 293-300. http://dx.doi.org/10.1590/S0103-4782014000200016.
http://dx.doi.org/10.1590/S0103-47820140...
), each ANN architecture was trained 1,000 times. The network with the best fit was selected using the mean squared error (MSE) for the validation sample.

For the best-selected network, the diagram of the network topology was obtained using “plotnet” function (Neural Net Tools package). In addition, the relative importance of the input traits was obtained using the Garson method (1991) and the “garson” function (Neural Net Tools package). To determine the efficiency of network training, we performed the regression analysis of vitamin A levels predicted and observed for the training and validation samples. The multiple comparison test by bootstrapping (Ramos and Ferreira 2009Ramos, S. P. and Ferreira, D. F. (2009). Agrupamento de médias via bootstrap de populações normais e não-normais. Revista Ceres, 56, 140-149.) was used to compare the best network architectures, and the BCa bootstrap test, to obtain the 95% confidence intervals. The vitamin A level estimates, observed and predicted by the ANNs, were compared by the bootstrap paired test. In all analyses using the bootstrap technique, 10,000 simulations were used.

RESULTS AND DISCUSSION

The vitamin A content of the studied samples varied greatly, from 1.330 to 141.968 µg per 100 g of pulp, with a coefficient of variation equal to 149.144% (Table 1). This high variability is essential, so that the trained networks are general enough (Azevedo et al. 2015Azevedo, A. M., Andrade Júnior, V. C., Pedrosa, C. E., Oliveira, C. M., Dornas, M. F. S., Cruz, C. D. and Valadares, N. R. (2015). Application of artificial neural networks in indirect selection: a case study on the breeding of lettuce. Bragantia, 74, 1-7. http://dx.doi.org/10.1590/1678-4499.0088.
http://dx.doi.org/10.1590/1678-4499.0088...
). Among the colorimetric data, the parameter a* had the greatest variability, with the highest coefficient of variation (103.527%). The red/green opponent colors are represented on the a* axis, where the positive values are red; the negative ones are green; and 0 value is neutral (Trevisan et al. 2008Trevisan, R., Gonçalves. E. D., Gonçalves, R. S., Antunes, L. E. A. C. and Herter, F. G. (2008). Influência do plástico branco, poda verde e amino quelant®-K na qualidade de pêssegos ‘Santa Áurea’. Bragantia, 67, 243-247. http://dx.doi.org/10.1590/S0006-87052008000100029.
http://dx.doi.org/10.1590/S0006-87052008...
). On the other hand, the L* parameter displayed the lowest variability, with coefficient of variation of 6.191%. This parameter relates to light, ranging from 0 (perfect black) to 100 (perfect white).

Table 1
Descriptive analysis and Pearson correlation between colorimetric parameters and vitamin A level in ripe banana pulp.

For the ten tested network architectures, the smallest MSEs were observed for the lower numbers of neurons in the hidden layer (Figure 1a). Small MSE estimates indicate that the values, actual and predicted by the ANN, are close, or, in other words, indicate great efficiency of the networks. The multiple comparison test by bootstrapping (Ramos and Ferreira 2009Ramos, S. P. and Ferreira, D. F. (2009). Agrupamento de médias via bootstrap de populações normais e não-normais. Revista Ceres, 56, 140-149.) showed that, when using only one neuron in the hidden layer, the average network efficiency was better. A similar conclusion was evident when analyzing the coefficient of determination (r2) in Figure 1b in which smaller numbers of neurons in the hidden layer also yielded better results. The use of non-parametric tests such as the bootstrap for multiple comparisons (Ramos and Ferreira 2009Ramos, S. P. and Ferreira, D. F. (2009). Agrupamento de médias via bootstrap de populações normais e não-normais. Revista Ceres, 56, 140-149.) is feasible in studies similar to this, when, in general, the MSE and r2 do not follow a normal distribution.

Figure 1
Mean square error (a) and coefficient of determination (b) not identify the associations between the data in the for different numbers of neurons in the hidden layer.

Generally, the increased number of neurons per layer does not ensure the best network performance. Similar results were found by Soares et al. (2014)Soares, F. C., Robaina, A. D., Peiter, M. X., Russi, J. L. and Vivan, G. A. (2014). Redes neurais artificiais na estimativa da retenção de água do solo. Ciência Rural, 44, 293-300. http://dx.doi.org/10.1590/S0103-4782014000200016.
http://dx.doi.org/10.1590/S0103-47820140...
and Azevedo et al. (2015)Azevedo, A. M., Andrade Júnior, V. C., Pedrosa, C. E., Oliveira, C. M., Dornas, M. F. S., Cruz, C. D. and Valadares, N. R. (2015). Application of artificial neural networks in indirect selection: a case study on the breeding of lettuce. Bragantia, 74, 1-7. http://dx.doi.org/10.1590/1678-4499.0088.
http://dx.doi.org/10.1590/1678-4499.0088...
. An explanation for this is that the increased number of neurons in the network may lead to overfitting, which occurs when the network training process stores the data in the training sample and does not identify the associations between the data in the input and output layers (Silva et al. 2010Silva, I. N., Spatti, D. H. and Flauzino, R. A. (2010). Redes neurais artificiais: para engenharia e ciências aplicadas. São Paulo: Artliber.). In this case, a good fit is observed for the sample training while a very poor one is found for the validation sample. Therefore, network efficiency should always be checked with a sample whose data were not used in the training process, which is the validation sample.

The evaluation of the relative importance of the explanatory variables by Garson method (1991) showed that the parameters a* and °hue were the most important (Figure 2), with relative contribution of 28.87 and 20.08%, respectively. This is important, especially when it becomes advantageous to exclude traits to reduce the computational effort (Paliwal and Kumar 2011Paliwal, M. and Kumar, U. A. (2011). Assessing the contribution of variables in feed forward neural network. Applied Soft Computing, 11, 3690-3696.). A major contribution is expected for these traits due to the highest correlation estimates with vitamin A (Table 1), 0.821 and −0.765 for a. and °hue, respectively.

Figure 2
Relative contribution, obtained by the method of Garson (1991)Garson, G. D. (1991). Interpreting neural-network connection weights. Journal AI Expert, 6, 47-51., of the colorimetric parameters in the input layer to predict the vitamin A level using artifi cial neural networks. The deviations refer to the 95% confi dence intervals obtained by bootstrap BCa with 10,000 simulations.

Although there was on average a good fit with only one neuron in the hidden layer (Figure 1a,b), the best fit was observed when using four neurons in the hidden layer (Figure 3). This can be explained by the high number of trainings (1,000) for each network architecture. The use of a large number of trainings for each network architecture is suggested, since, at the beginning of the training, the synaptic weights are randomly generated (Soares et al. 2014Soares, F. C., Robaina, A. D., Peiter, M. X., Russi, J. L. and Vivan, G. A. (2014). Redes neurais artificiais na estimativa da retenção de água do solo. Ciência Rural, 44, 293-300. http://dx.doi.org/10.1590/S0103-4782014000200016.
http://dx.doi.org/10.1590/S0103-47820140...
) and, therefore, at each training, different results are found for the same architecture.

Figure 3
Best-fi t network topology trained to predict vitamin A level from colorimetric data of ripe banana pulp.

For the best-fitted network, optimum fittings were found, with r2 = 95.11% for the training sample (Figure 4a) and 98.50% for the validation sample (Figure 4b). The high r2 value estimated for the validation sample indicates that the trained network is efficient and has the power of generalization. The prediction efficiency found in this study is higher than that observed by Carvalho et al. (2005)Carvalho, W., Fonseca, M. E. N., Silva, H. R., Boiteux, L. S. and Giordano, L. B. (2005). Estimativa indireta de teores de licopeno em frutos de genótipos de tomateiro via análise colorimétrica. Horticultura Brasileira, 23, 819-825. http://dx.doi.org/10.1590/S0102-05362005000300026.
http://dx.doi.org/10.1590/S0102-05362005...
, r2 = 0.90% for lycopene prediction, using the colorimetric data of tomatoes. On the other hand, Seroczyńska et al. (2006)Seroczyńska, A., Korzeniewska, A., Sztangret-Wiśniewska, J., Niemirowicz-Szczytt, K. and Gajewski, M. (2006). Relationship between carotenoids content and flower or fruit flesh colour of winter squash (Cucurbita máxima). Folia Horticulturae, 18, 51-61. and Doka et al. (2013)Dóka, O., Ficzek, G., Luterotti, S., Bicanic, D., Spruijt, R., Buijnsters, J. G., Szalay, L. and Végvári, G. (2013). Simple and rapid quantification of total carotenoids in lyophilized apricots (Prunus armeniaca L.) by means of reflectance colorimetry and photoacoustic spectroscopy. Food Technology and Biotechnology, 51, 453-459. found r2 of 0.92 and 0.96%, respectively, when predicting the β-carotene content using the colorimetric data of pumpkin. The good results of this work can be explained by the good fit of neural networks for non-linear systems (Gionola et al. 2011Gianola, D., Okut, H., Weigel, K. A. and Rosa, G. J. (2011). Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat. BMC Genetics, 12, 87-101. http://dx.doi.org/10.1186/1471-2156-12-87.
http://dx.doi.org/10.1186/1471-2156-12-8...
). Also, this technique allows considering many explanatory variables simultaneously, which can become impractical for multiple linear regression. Fernandez-Ruiz et al. (2010)Fernández-Ruiz, A. V., Torrecilla, J. S., Cámara, M., Mata, M. C. and Shoemaker, C. (2010). Radial basis network analysis of color parameters to estimate lycopene content on tomato fruits. Talanta, 83, 9-13. http://dx.doi.org/10.1016/j.talanta.2010.08.020.
http://dx.doi.org/10.1016/j.talanta.2010...
also found high r2 (0.99%) estimates to predict lycopene content in tomatoes by ANNs, using colorimetric data.

Figure 4
Regression of the predicted and estimated vitamin A (µg per 100 g of pulp) in the training (a) and validation (b) samples using artificial neural networks.

The actual and predicted vitamin A levels were compared by the non-parametric bootstrapping paired test for training and validation samples, with estimated p-values of 44 and 48%, respectively. This means that, at 5% significance level, there is not enough evidence to reject the null hypothesis. In this case, the null hypothesis considers that the mean difference of each observation between the actual and predicted data is zero. This reinforces the conclusion of the prediction efficiency found in this study. Thus, the content of vitamin A can be easily estimated by using only color data. This strategy allows reducing evaluation time, labor and financial costs (Fernandez-Ruiz et al. 2010Fernández-Ruiz, A. V., Torrecilla, J. S., Cámara, M., Mata, M. C. and Shoemaker, C. (2010). Radial basis network analysis of color parameters to estimate lycopene content on tomato fruits. Talanta, 83, 9-13. http://dx.doi.org/10.1016/j.talanta.2010.08.020.
http://dx.doi.org/10.1016/j.talanta.2010...
).

CONCLUSION

The colorimetric parameters a* and °hue were the most important in predicting the level of vitamin A in ripe banana pulp. High-level phenotyping of vitamin A in banana pulp by colorimetric data and artificial neural networks is feasible, allowing reducing evaluation time, labor, and financial costs.

ACKNOWLEDGMENTS

Thanks are due to Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG) and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) for the grants and financial support.

REFERENCES

  • Amorim, E. P., Cohen, K. O., Amorim, V. B. O., Paes, N. S., Sousa H. N., Santos-Serejo J. A. and Silva, S. O. (2011). Caracterização de acessos de bananeira com base na concentração de compostos funcionais. Ciência Rural, 41, 592-598. http://dx.doi.org/10.1590/S0103-84782011005000042
    » http://dx.doi.org/10.1590/S0103-84782011005000042
  • Aquino, C. F., Salomão, L. C. C., Siqueira, D. L., Cecon, P. R. and Ribeiro, S. M. R. (2014). Teores de minerais em polpas e cascas de frutos de cultivares de bananeira. Pesquisa Agropecuária Brasileira, 49, 546-553. http://dx.doi.org/10.1590/S0100-204X2014000700007
    » http://dx.doi.org/10.1590/S0100-204X2014000700007
  • Azevedo, A. M., Andrade Júnior, V. C., Pedrosa, C. E., Oliveira, C. M., Dornas, M. F. S., Cruz, C. D. and Valadares, N. R. (2015). Application of artificial neural networks in indirect selection: a case study on the breeding of lettuce. Bragantia, 74, 1-7. http://dx.doi.org/10.1590/1678-4499.0088
    » http://dx.doi.org/10.1590/1678-4499.0088
  • Barbosa, C. D., Viana, A. P., Quintal, S. S. R. and Pereira, M. G. (2011). Artificial neural network analysis of genetic diversity in Carica papaya L. Crop Breeding and Applied Biotechnology, 11, 224-231. http://dx.doi.org/10.1590/S1984-70332011000300004
    » http://dx.doi.org/10.1590/S1984-70332011000300004
  • Bergmeir, C. and Benítez, M. J. (2012). Neural networks in R using the Stuttgart Neural Network Simulator: RSNNS. Journal of Statistical Software, 46, 1-26.
  • Brasileiro, B. P., Marinho, C. D., Costa, P. M. A., Cruz, C. D., Peternelli, L. A. and Barbosa, M. H. P. (2015). Selection in sugarcane families with artificial neural networks. Crop Breeding and Applied Biotechnology, 15, 72-78. http://dx.doi.org/10.1590/1984-70332015v15n2a14
    » http://dx.doi.org/10.1590/1984-70332015v15n2a14
  • Carvalho, W., Fonseca, M. E. N., Silva, H. R., Boiteux, L. S. and Giordano, L. B. (2005). Estimativa indireta de teores de licopeno em frutos de genótipos de tomateiro via análise colorimétrica. Horticultura Brasileira, 23, 819-825. http://dx.doi.org/10.1590/S0102-05362005000300026
    » http://dx.doi.org/10.1590/S0102-05362005000300026
  • Davey, M. W., Bergh, V. D., Markham, R., Swnnen, R. and Keulemans, J. (2009). Genetic variability in Musa fruit provitamin A carotenoids, lutein and mineral micronutrient contents. Food Chemistry, 115, 806-813. http://dx.doi.org/10.1016/j.foodchem.2008.12.088
    » http://dx.doi.org/10.1016/j.foodchem.2008.12.088
  • Dóka, O., Ficzek, G., Luterotti, S., Bicanic, D., Spruijt, R., Buijnsters, J. G., Szalay, L. and Végvári, G. (2013). Simple and rapid quantification of total carotenoids in lyophilized apricots (Prunus armeniaca L.) by means of reflectance colorimetry and photoacoustic spectroscopy. Food Technology and Biotechnology, 51, 453-459.
  • Ekesa, B., Poulaert, M., Davey, M.W., Kimiywe, J. Van Den Bergh, I., Blomm, G. and Dhuique-Mayer, C. (2012). Bioaccessibility of provitamin A carotenoids in bananas (Musa spp.) and derived dishes in African countries. Food Chemistry, 133, 1471-1477. http://dx.doi.org/10.1016/j.foodchem.2012.02.036
    » http://dx.doi.org/10.1016/j.foodchem.2012.02.036
  • Fernández-Ruiz, A. V., Torrecilla, J. S., Cámara, M., Mata, M. C. and Shoemaker, C. (2010). Radial basis network analysis of color parameters to estimate lycopene content on tomato fruits. Talanta, 83, 9-13. http://dx.doi.org/10.1016/j.talanta.2010.08.020
    » http://dx.doi.org/10.1016/j.talanta.2010.08.020
  • Food and Agriculture Organization of the United Nations (2015). [accessed 2015 Aug 12]. http://faostat.fao.org/site/339;defaut.aspx
  • Garson, G. D. (1991). Interpreting neural-network connection weights. Journal AI Expert, 6, 47-51.
  • Gianola, D., Okut, H., Weigel, K. A. and Rosa, G. J. (2011). Predicting complex quantitative traits with Bayesian neural networks: a case study with Jersey cows and wheat. BMC Genetics, 12, 87-101. http://dx.doi.org/10.1186/1471-2156-12-87
    » http://dx.doi.org/10.1186/1471-2156-12-87
  • Institute of Medicine (2001). Dietary Reference Intakes (DRIs): vitamin A, vitamin K, arsenic, boron, chromium, copper, iodine, iron, manganese, molybdenum, nickel, silicon, vanadium and zinc. Washington: National Academy Press.
  • Itle, R. A. and Kabelka, E. A. (2009). Correlation between L. a. b. color space values and carotenoid content in pumpkins and squash (Cucurbita spp.). HortScience, 44, 633-637.
  • Lu, W., Haynes, K., Wiley, E. and Clevidence, B. (2001). Carotenoid content and color in diploid potatoes. Journal of the American Society for Horticultural Science, 126, 722-726.
  • McGuire, R. G. (1992). Reporting of objective color measurements. HortScience, 27, 1254-1260.
  • Nascimento, M., Peternelli, L. A., Cruz, C. D., Nascimento, A. C. C., Ferreira, R. P., Bhering, L. P. and Salgado, C. C. (2013). Artificial neural networks for adaptability and stability evaluation in alfalfa genotypes. Crop Breeding and Applied Biotechnology, 13, 152-156.
  • Paliwal, M. and Kumar, U. A. (2011). Assessing the contribution of variables in feed forward neural network. Applied Soft Computing, 11, 3690-3696.
  • Pinheiro-Sant’ana, H. M., Stringheta, P. C., Brandão, S. C. C. and Azeredo, R. M. C. (1998). Carotenoid retention and vitamin A value in carrot (Daucus carota L.) prepared by food service. Food Chemistry, 61, 145-151. http://dx.doi.org/10.1016/j.asoc.2011.01.040
    » http://dx.doi.org/10.1016/j.asoc.2011.01.040
  • R Development Core Team (2012). The R Project for Statistical Computing; [accessed 2016 May 17]. http://www.R-project.org/
    » http://www.R-project.org/
  • Ramos, S. P. and Ferreira, D. F. (2009). Agrupamento de médias via bootstrap de populações normais e não-normais. Revista Ceres, 56, 140-149.
  • Rodríguez-Amaya, D. B. (2001). A guide to carotenoid analysis in foods. Washington: International Life Sciences Institute Press.
  • Santos, E. M., Velarde, L. G. C. and Ferreira, V. A. (2010). Associação entre deficiência de vitamina A e variáveis socioeconômicas, nutricionais e obstétricas de gestantes. Ciência & Saúde Coletiva, 15, 1021-1030. http://dx.doi.org/10.1590/S1413-81232010000700008
    » http://dx.doi.org/10.1590/S1413-81232010000700008
  • Seroczyńska, A., Korzeniewska, A., Sztangret-Wiśniewska, J., Niemirowicz-Szczytt, K. and Gajewski, M. (2006). Relationship between carotenoids content and flower or fruit flesh colour of winter squash (Cucurbita máxima). Folia Horticulturae, 18, 51-61.
  • Silva, I. N., Spatti, D. H. and Flauzino, R. A. (2010). Redes neurais artificiais: para engenharia e ciências aplicadas. São Paulo: Artliber.
  • Soares, F. C., Robaina, A. D., Peiter, M. X., Russi, J. L. and Vivan, G. A. (2014). Redes neurais artificiais na estimativa da retenção de água do solo. Ciência Rural, 44, 293-300. http://dx.doi.org/10.1590/S0103-4782014000200016
    » http://dx.doi.org/10.1590/S0103-4782014000200016
  • Trevisan, R., Gonçalves. E. D., Gonçalves, R. S., Antunes, L. E. A. C. and Herter, F. G. (2008). Influência do plástico branco, poda verde e amino quelant®-K na qualidade de pêssegos ‘Santa Áurea’. Bragantia, 67, 243-247. http://dx.doi.org/10.1590/S0006-87052008000100029
    » http://dx.doi.org/10.1590/S0006-87052008000100029

Publication Dates

  • Publication in this collection
    16 June 2016
  • Date of issue
    Jul-Sep 2016

History

  • Received
    02 Oct 2015
  • Accepted
    17 Dec 2015
Instituto Agronômico de Campinas Avenida Barão de Itapura, 1481, 13020-902, Tel.: +55 19 2137-0653, Fax: +55 19 2137-0666 - Campinas - SP - Brazil
E-mail: bragantia@iac.sp.gov.br