Joint analysis of phenotypic and molecular diversity provides new insights on the genetic variability of the Brazilian physic nut germplasm bank

The genetic variability of the Brazilian physic nut (Jatropha curcas) germplasm bank (117 accessions) was assessed using a combination of phenotypic and molecular data. The joint dissimilarity matrix showed moderate correlation with the original matrices of phenotypic and molecular data. However, the correlation between the phenotypic dissimilarity matrix and the genotypic dissimilarity matrix was low. This finding indicated that molecular markers (RAPD and SSR) did not adequately sample the genomic regions that were relevant for phenotypic differentiation of the accessions. The dissimilarity values of the joint dissimilarity matrix were used to measure phenotypic + molecular diversity. This diversity varied from 0 to 1.29 among the 117 accessions, with an average dissimilarity among genotypes of 0.51. Joint analysis of phenotypic and molecular diversity indicated that the genetic diversity of the physic nut germplasm was 156% and 64% higher than the diversity estimated from phenotypic and molecular data, respectively. These results show that Jatropha genetic variability in Brazil is not as limited as previously thought.


Introduction
The high quality of physic nut (Jatropha curcas L.) oil (which meets Brazilian, US and European quality standards) and the ease with which it can be converted into biodiesel by either chemical (Berchmans and Hirata, 2008) or biological transesterification reactions (Modi et al., 2007) have made this species, together with the oil palm (Elaeis guineensis), the most promising plant species  for complementing soybean as a source of oil for biodiesel production in the near future (Durães et al., 2011). The existence of an intense, well-organized research program led by the Brazilian Agricultural Research Corporation (Embrapa) that addresses most of the problems related to Jatropha cultivation and oil production (Durães et al., 2011) has favored this species to the detriment of others that lack such an initiative.
As one of the first steps to ensure that Jatropha may be cultivated in large scale production fields, Embrapa has initiated a breeding program. The initial results have been very promising (Drumond et al., 2010;Rosado et al., 2010;Gurgel et al., 2011;Laviola et al., 2010Laviola et al., , 2012aRocha et al., 2012;Bhering et al., 2012Bhering et al., , 2013. Specifically, these results show that (1) it is possible to phenotypically select su-perior physic nut families and individuals, regardless of the selection strategy used, (2) physic nut genotypes respond differentially to different selection methods, (3) combined selection and BLUP (Best Linear Unbiased Prediction) are the best methods for rapid improvement of the species, (4) the application of combined selection or BLUP to the Brazilian physic nut leads to high genetic gains in the initial stages of the breeding program and (5) genotypes with enhanced productivity can be selected and released as improved cultivars. The application of molecular tools is also being investigated as a means of accelerating the domestication of this species and for identifying superior genotypes to be released for commercial purposes (Durães et al., 2011;Silva-Junior et al., 2011).
Virtually all of these results rely on the evaluation of the Jatropha germplasm collection established by Embrapa. This germplasm bank is one of the largest in the world in terms of absolute number of accessions (~200) and plants (> 2000), and is thought to represent most of the genetic variability of the species in Brazil . Initial characterization of the Brazilian Jatropha germplasm bank using molecular markers  indicated the existence of limited genetic variability. Similarity-based clustering revealed only 43 unique multi-locus profiles in the 192 accessions evaluated. Considering that the physic nut is preferentially an outcrossing species (Ginwal et al., 2005) a high degree of genetic variation was expected to occur. Other reports that have examined the genetic diversity of J. curcas using different types of molecular markers have also shown that in general there is limited variation in the most accessible germplasm resources of this species in various countries (Ram et al., 2008;Sun et al., 2008;Pamidimarri et al., 2009;Subramanyam et al., 2009).
The recurrent finding of low genetic diversity indicates that this is a worldwide phenomenon, probably because physic nut dispersal has depended mainly on anthropic action. During the initial dispersal, seeds from a limited number of plants were probably collected and planted abroad. In addition, since physic nut plants often occur as isolated individuals interbreeding is not very frequent, and this may further limit the genetic diversity. Experimentally, neutral molecular marker techniques such as RAPD (random amplified polymorphic DNA) or SSR (simple sequence repeats) that are commonly used in molecular diversity studies may sample diversity in non-coding regions of the genome (Collard et al., 2005) and may therefore have limited use in predicting the phenotypic diversity of individuals, especially in complex traits such as yield. Consequently, low molecular variability may not be accompanied by low phenotypic variability for important traits. In agreement with this conclusion, and as confirmation of the representativeness of the Embrapa germplasm collection, we have reported the existence of moderate phenotypic diversity that allows elevated genetic gains to be obtained within a short period (Laviola et al., , 2012a. Based on the foregoing considerations, it is clear that the genetic variability of the Brazilian Jatropha germplasm bank has not been fully investigated. Given the importance of this germplasm bank, it would be interesting not only to measure genetic diversity based on molecular or phenotypic traits separately, but to combine them into a single analysis in order to get a clear picture of the actual diversity available. Methods that allow such an analysis exist , but are not used very often because only a few germplasm collections have been characterized phenotypically and molecularly. Since the Brazilian Jatropha germplasm collection has already been characterized phenotypically and molecularly, the objectives of this study were to (1) determine the extent to which molecular diversity resembles phenotypic diversity, (2) determine how well the combined distance matrix correlates with distance matrices based on phenotypic and molecular data, (3) evaluate the suitability of different procedures for performing a joint analysis of phenotypic and molecular diversity, (4) verify whether a joint analysis of phenotypic and molecular diversity could help discriminate genotypes within the Brazilian Jatropha population and (5) verify whether a joint analysis of phenotypic and molecular diversity can reveal novel patterns of genotype grouping. By using two complementary data sets (phenotypic and molecular data previously generated by our group) we sought to provide a better assessment of the existing diversity. The results of this study may be helpful in determining the need for additional collections to improve the representativeness of the germplasm bank and allow further genetic characterization of Jatropha.

Plant material and experimental design
A set of 117 J. curcas accessions collected in different geographical regions throughout Brazil  and belonging to the Embrapa germplasm bank was used in the experiment. The germplasm bank is located in the experimental area of Embrapa Cerrados at Planaltina, DF, Brazil (15°35'30" south latitude and 47°42'30" west longitude, at 1007 m altitude)]. The field experiment was done using a randomized blocks design with two replicates and five adult plants per plot arranged in rows (4 m x 2 m spacing). The cultural management practices, i.e., nutrition, pest and disease control, for maintenance of the germplasm bank were those recommended for the species (Dias et al., 2007;Laviola et al., 2010).

Phenotypic and molecular characterization
Phenotypic evaluation of the 117 accessions was done in the second year of cultivation and was based on reproductive and vegetative traits . The following phenotypic traits were evaluated: (1) grain yield (Yield, g), (2) weight of 100 seeds (W 100 , g), (3) number of seeds per fruit (NSF, n), (4) endosperm/seed ratio (E/S), (5) seed length (SL, mm), (6) seed width (SW, mm), (7) seed thickness (ST, mm), (8) total number of branches per plant at 0.5 m above ground (TNB 0.5), (9) total number of branches per plant at 1 m above ground (TNB 1.0), (10) total number of branches per plant at 1.5 m above ground (TNB 1.5), (11) plant height (height, m), (12) stem diameter (SD, mm), (13) canopy projection on the row measured as the extent of the canopy (in meters) in the planting row (CP R , m), (14) canopy projection between lines (CP BL , m), (15) canopy volume, measured as the extent of the canopy (in meters) between adjacent planting lines (CV, m 3 ), (16) juvenility measured as the number of days from planting to first flowering (NDF, days) and (17) height of the first inflorescence (HFI, m). The canopy volume was estimated by approximating the volume of a cylinder with an elliptical base using the formula where V canopy = canopy volume (m 3 ), p = 3.14159, D 1 = canopy diameter at the widest point (m), D 2 = canopy diameter at the narrowest point (m) and h = plant height (m), as previously used by Laviola et al. (2010). We have previously estimated genetic and environmental parameters using mixed models (REML/BLUP) and have found that medium to high accuracies can be obtained for breeding values by using only one measure in a single environment (Laviola et al., , 2012a. This finding demonstrates that the type of evaluation used in this experiment is useful for predicting the genetic value of accessions. The molecular data used in this study were obtained as reported by Rosado et al. (2010). Briefly, total genomic DNA was extracted from the young leaves of two plants per accession, i.e., two biological replicates, using a CTAB protocol (Grattapaglia and Sederoff, 1994). The DNA concentration was measured using a Nanodrop spectrophotometer (Nanodrop Products, Wilmington, DE, USA) and the concentration of each sample was adjusted to 2-5 ng/mL. Fifteen RAPD primers (Operon Technologies, Inc., Huntsville, AL, USA; DT Technologies, Coralville, IA, USA) and two SSR primer pairs (jcps20-1 and jcps20-2) were scored. The selected primers were previously tested for polymorphism in this same set of Jatropha accessions and found to amplify robust (reproducible) loci . RAPD and SSR loci were amplified by the polymerase chain reaction (PCR) using standard procedures . RAPD products were visualized on 1.5% agarose gels and SSR products were electroinjected in an ABI 3100XL Genetic Analyzer (Applied Biosystems, Foster City, CA, USA) and data were collected with virtual filter D using GeneScan 2.0. Alleles were declared using Genotyper 2.1 (Applied Biosystems). An internal size standard labeled with ROX (Brondani and Grattapaglia, 2001) was used for sizing SSR alleles. All analyses related to the amplification, interpretation and robustness of RAPD markers were done as described by Grattapaglia and Sederoff (1994) and Reis and Grattapaglia (2004).

Genetic diversity analyses
Since we had access to morphological and molecular data, different strategies for studying the genetic diversity of physic nut based on parallel molecular and phenotypic analyses were tested. Initially, the phenotypic data were used to generate a dissimilarity matrix based on Gower's method (Gower, 1971) and the molecular data were used to generate a dissimilarity matrix based on Jaccard's coefficient (Jaccard, 1901). To combine phenotypic and molecular data, the Gower and Jaccard dissimilarity matrices were summed algebraically to generate a new matrix referred to as the joint dissimilarity matrix. By combining dissimilarity matrices based on different data sets, i.e., phenotypic and molecular, our aim was to simultaneously assess all information at once, thereby capturing all available information.
Graphic representations of the matrices were generated based on color gradients that were used to express the dissimilarity between genotypes. Red was used to indicate lower genetic dissimilarity while green was used to indicate the most dissimilar accessions. A score of zero corresponded to the lowest genetic dissimilarity (represented by red) while a score of 1 corresponded to maximum genetic dissimilarity (represented by green). The correlation between the dissimilarity matrices described above was also examined. The correlations and their significances were tested by using Mantel's Z test with 1000 permutations (Manly, 1997). This procedure allowed us to assess the resemblance between the genotypic and phenotypic matrices and between the genetic dissimilarity matrices and joint dissimilarity matrix.
After generating the dissimilarity matrices the genetic diversity of the Brazilian physic nut population was initially assessed based on a 3D dispersion analysis. For this, the dissimilarity matrices generated with Gower's index, Jaccard's index and by combining the Gower and Jaccard information were used to produce three groups of x, y and z coordinates, i.e., a distance projection in three dimensional space using Genes software (Cruz, 2009). The coordinates were later used to construct 3D scatter plots with SigmaPlot v.12.0. In these graphs, greater dot dispersion indicated greater genetic diversity of the population. We later expanded this group analysis by using Tocher's clustering method. In this method the accessions were assigned to groups consisting of equivalent genotypes. The groups were presented using letter coded group profiles. Accessions allocated to the same group based on Tocher's cluster analysis were presented in the same color in the 3D scatter plots.

Results
The phenotypic and molecular diversity of physic nut was assessed by generating various dissimilarity matrices based on Gower's and Jaccard's indices. Table 1 shows the minimum, maximum and mean genetic dissimilarities and Figure 1 shows the graphic representation of the dissimilarity matrices. The data in Table 1 and Figure 1 indicate that physic nut diversity is limited at the molecular and phenotypic levels, as shown by the yellow and red points in the figure. Red and yellow points indicate highly and moderately similar genotype pairs, whereas green points indicate highly divergent genotype pairs. Although numerous reports have indicated low molecular diversity, the present data indicate that the mean molecular dissimilarity is greater than the mean phenotypic dissimilarity. However, Genetic diversity of Jatropha 373 more genotypic pairs are closely related to each other based on molecular markers than on phenotypic data ( Figure 1). The higher individual dissimilarities in the molecular data compared to phenotypic data most likely accounted for the higher general mean in the former. In accordance with this conclusion, the 3D dispersion graphs representing the relationship among genotypes (accessions) based on the two dissimilarity indices (Figure 2) demonstrated that most of the genotypes were concentrated in a dense dot "cloud" when analyzed with phenotypic data. On the other hand, the molecular data allowed genotypes to be allocated into distinct groups that were more divergent among each other. Table 2 shows the results of the statistical analysis of the 3D scatterplots, including the distortion associated with the estimated coordinates. As can be seen, the projections presented in Figure 3 were not error free since there was a small distortion in all cases.
Since the grouping of genotypes into significant clusters was found to be impossible based on visual examination of the scatter plots, we undertook a cluster analysis 374 Alves et al.

Figure 1 -(A)
Gower's dissimilarity matrix and (B) Jaccard's dissimilarity matrix. A color gradient was used to graphically express the dissimilarity between genotypes. Red indicates the most similar accessions, i.e., lowest genetic dissimilarity, while green indicates the most dissimilar accessions, i.e., lowest genetic similarity. A value of zero corresponded to the lowest genetic dissimilarity (represented by red) while 1 corresponded to the maximum genetic dissimilarity (represented by green). The dissimilarity matrices were symmetric and, consequently, the dissimilarity values below the diagonal were equivalent to those above the diagonal. based on Tocher's method (Tables 3 and 4). This analysis revealed 14 and 15 distinct groups for the 117 accessions based on phenotypic and molecular data, respectively. Dots representing distinct groups of accessions were then differentiated in the scatter plot using different colors ( Figure 2). Interestingly, nearly 60% of 117 accessions occurred in a single group (Group A) based on phenotypic and molecular information, thus confirming the previous conclusion regarding limited phenotypic and molecular diversity in the Brazilian cultivated germplasm.
Although the diversity analyses based on phenotypic or molecular data resulted in a similar number of indistinguishable groups and a similar concentration of genotypes in a single group, the correlation between Gower's dissimilarity matrix and Jaccard's dissimilarity matrix was nevertheless low (Table 5). This finding indicates that molecular markers (RAPDs and SSRs) are not sampling the same genomic regions that are relevant to phenotypic differentiation of the accessions.
After these analyses, the phenotypic dissimilarity matrix was combined with the molecular dissimilarity matrix generated using Jaccard's index. The combination was done by algebraically summing the matrices. No data standardization was required since these indices generated dissimilarity coefficients that varied between 0 and 1. The correlation between the joint dissimilarity matrix and the original dissimilarity matrices was then examined. As shown in Table 5, the new dissimilarity matrix displayed moderate correlation with the original matrices (phenotypic or molecular). This finding indicated a moderate degree of likelihood between this matrix and the original information, and that the procedure may be employed when one desires to use both phenotypic and molecular information in a single analysis. The dissimilarity values in the joint dissimilarity matrix were used to measure phenotypic + molecular diversity. This diversity varied from 0 to 1.29 among the 117 accessions, with an average dissimilarity among genotypes of 0.51 (Table 1). Figure 3 shows the combined dis-Genetic diversity of Jatropha 375  similarity matrix along with the 3D dispersion graph representing the relationship among genotypes (accessions) in terms of their joint phenotypic and molecular diversity. Comparison of this graphical representation with the graphical representation of Gower's or Jaccard's dissimilarity matrices (Figure 1) suggests that the joint dissimilarity matrix shares a high degree of likelihood with the original matrices (shown earlier by correlation analysis) and, at the same time, that this procedure is efficient in enhancing the sampling of genetic diversity. Such a matrix is therefore probably more realistic than the other matrices.
Together, these results indicate that the genetic diversity in the germplasm bank of the Brazilian physic nut is small, but not as limited as previously thought based on the studies by Rosado et al. (2010) and Laviola et al. (2010Laviola et al. ( , 2012b. These results also indicate that the joint analysis of phenotypic and molecular data is effective in capturing all existing genetic variability.

Discussion
To the best of our knowledge this is the most comprehensive assessment of the genetic diversity of the cultivated physic nut germplasm in Brazil, and one of the first to be based on phenotypic and molecular data. The phenotypic data used in this study were derived from two of the most comprehensive studies of genetic parameters in physic nut performed to date (Laviola et al., , 2012b. The traits list encompassed vegetative (tree architecture traits) and reproductive traits, including grain yield, considered to be the most important trait to be improved in forthcoming years to Genetic diversity of Jatropha 377 Table 5 -Correlation coefficients between the dissimilarity matrices for phenotypic and molecular data.
Physic nut genetic diversity was low based on both molecular and phenotypic data. The average dissimilarity among the 117 genotypes was 0.31 and 0.19 based on molecular and phenotypic data, respectively. These values agree with those previously reported for the most accessible physic nut germplasm worldwide (Ram et al., 2008;Sun et al., 2008;Kumar et al., 2009;Pamidimarri et al., 2009;Subramanyam et al., 2009;Rosado et al., 2010;Sunil et al., 2011). The average genetic dissimilarity estimated among the 117 accessions was also low when compared to another preferentially allogamous and closely related Euphorbiaceae species (Asante and Offei, 2003;Colombo et al., 2000). Compared to the levels of genetic diversity found in 307 plant studies surveyed with molecular mark-ers (Nybom, 2004), the genetic diversity in the germplasm of the cultivated Brazilian physic nut is limited. According to Rosado et al. (2010) the limited genetic diversity and considerable extent of duplications found in this germplasm collection despite its widespread origin and large size most likely reflects a common ancestry, drift and intensive selection of the currently cultivated material since its introduction into Brazil. Rosado et al. (2010) also suggested the occurrence of extensive homozygosity in the germplasm of cultivated J. curcas, despite its allogamous habit (Divakara et al., 2010). Other recent reports that have sampled diversity using different molecular markers have reached similar conclusions regarding the limited variation in the most accessible germplasm resources of J. curcas in various countries (Kumar et al., 2009;Pamidimarri et al., 2009;Subramanyam et al., 2009).
Comparison of the genetic dissimilarity determined here based on molecular markers with the data provided by Rosado et al. (2010) revealed a slight increase in the corresponding value (from 0.11 to 0.31). This may be related to the set of accessions used. Whereas Rosado et al. (2010) analyzed 192 accessions our analysis was based on a set of 117 accessions, for which we had good quality phenotypic data; the remaining 75 accessions were all related, [according to Rosado et al. (2010); data not shown]. Consequently, our optimized set of physic nut accessions consisted of the most diverse collection of Brazilian genotypes. This observation indicates that the germplasm collection can be optimized by maintaining only a set of the most diverse genotypes.
The limited genetic diversity and considerable homozygosity found in the Brazilian germplasm collection contrasted with previous reports of phenotypic variability (Laviola et al., , 2012aRocha et al., 2012). In this regard, we have previously unequivocally demonstrated the breeding potential of this population. Laviola et al. (2010) first described the genetic parameters for the traits used here to assess the phenotypic diversity of this germplasm. Based on the existence and magnitude of the additive genetic variance we considered that this population was adequate for selection purposes. Laviola et al. (2012b) latter re-estimated the genetic parameters using new data and confirmed the existence of genetic variability for traits such as grain yield. Laviola et al. (2012a) and Bhering et al. (2012Bhering et al. ( , 2013 later demonstrated that the existing genetic variability could support high genetic gains, regardless of the selection strategy used. Based on these findings, it is clear that there is some discordance between the data corresponding to molecular variability and those related to the variability available for breeding purposes (phenotypic variability). We anticipated (Laviola et al., 2012b) that this disparity could arise from the fact that neutral molecular markers, such as RAPD or SSR, commonly used in molecular diversity studies, may be located in non-coding regions of the genome (Collard et 378 Alves et al. Tocher-phenotypic/molecular data-based clustering (blue circle). Forty-two of the accessions grouped based on molecular markers in Group A were also grouped based on phenotypic data, 35 of the accessions grouped based on phenotypic/molecular data in Group A were also grouped based on phenotypic data, 52 of the accessions grouped based on phenotypic/molecular data in Group A were also grouped based on molecular data and 35 accessions were consistently grouped together, regardless of the similarity matrix in which the cluster analysis was performed. The numbers in each non-overlapping part of the circles indicate the number of genotypes grouped solely based on one specific strategy. Numbers outside the circles indicate the total number of accessions grouped based on each strategy.
al ., 2005) and therefore be of limited use in predicting the phenotypic diversity of individuals, especially in complex traits such as yield.
As shown here, the molecular markers used to assess molecular diversity indeed do not sample the same genomic regions that are involved in the genetic control of phenotypes. The correlation between the genetic dissimilarity matrices obtained using molecular data and phenotypic data was very low. If the markers were sampling the genes responsible for the phenotypic expression of the traits evaluated then a high correlation between the dissimilarity matrices would be expected. Many studies have found weak relationships between genetic distances estimated from molecular data and genetic distances estimated from morphological data (Lewontin, 1984;Reed and Frankham, 2001;Koehler-Santos et al., 2003;Martins et al., 2011). For instance, Sunil et al. (2011) detected such a weak relationship in a Jatropha germplasm in India. In their study, the accessions identified as the most diverse based on the analysis of eight phenotypic traits (quantitative traits) were not corroborated as diverse by molecular analysis. Similarly, the accessions that were detected as outliers in our molecular analysis were also not unique based on phenotypic data. Hamza et al. (2011), in a study of the genetic diversity of date palm (Phoenix dactylifera L.), also did not detect significant correlation between the genetic distances estimated based on morphological data and genetic distances estimated based on molecular data in which SSR markers were instead of RAPD. In both cases, the natural and human selective forces acting on molecular variation apparently differed from those acting on morphological traits. A further consideration is that morphological traits are heavily affected by the environment when they are expressed, whereas molecular markers are not subject to such variation and their variation is based directly on DNA sequence variation (Collard et al., 2005).
Together, these findings indicate that arbitrary molecular markers such as RAPD and anonymous markers such as SSR are not efficient in capturing the polymorphism that results in phenotypic variation in physic nuts. This scenario could change if, instead of a small battery of RAPD markers and anonymous SSR, a battery encompassing thousands of single nucleotide polymorphisms (SNPs) densely covering the entire genome (and possibly every predicted gene-model) and EST-derived SSR are used. In this regard, some reports have indicated that when an adequate set of molecular markers is used to assess the genetic diversity of germplasm collections, the estimated molecular genetic distances can be successfully correlated with phenotypic variation (Sestili et al., 2011). Since we have already developed a large number of SNPs for Jatropha (Silva-Junior et al., 2011) and in view of the availability of genomic resources (EST sequences generated using Illumina's GA IIx) that could be used to design primers to amplify EST-derived SSR (primer design is underway), a more detailed study of molecular diversity and of its relationship with phenotypic variation is within the reach in the near future.
By analyzing phenotypic and molecular data together, the average dissimilarity among genotypes increased by 156% and 64% with respect to the original mean phenotypic and molecular dissimilarities. This occurred basically because non-overlapping information was combined into one data set. By combining phenotypic and molecular data the sampling intensity of the genetic diversity was markedly increased; a similar trend was described by Bosetti et al. (2011) and Martins et al. (2011).
Our results indicate that physic nut variability in Brazil is indeed limited (more than 50% of the average similarity), but not as much as previously thought. The slight change in the sampling intensity also affected the clustering of genotypes, and indicated that the joint analysis of phenotypic and molecular data was effective in capturing most existing genetic variability. Considering Tocher's clustering based on the Gower and Jaccard dissimilarity matrix, respectively, joint analysis provided an increase of nearly 29% and 6% in the number of groups. Joint analysis also decreased the genotype concentration in a single group and was effective in discriminating among genotypes. Genotypes that were considered to be most diverse based solely on phenotypic or molecular data were now grouped together with other genotypes. The opposite also occurred, i.e., some genotypes that could not be correctly discriminated based solely on phenotypic or molecular data were now placed in single accession groups.
These associations completely changed the picture regarding the genetic diversity of physic nuts in Brazil. For a species that is only now starting to be improved based on systematic methods, the clear definition of sets of diverse genotypes is an extremely important finding as it can be used to guide future crosses and breeding strategies Bhering et al., 2012;Laviola et al., 2012b;Bhering et al., 2013). However, considering that the Brazilian physic nut germplasm collection forms the basis of the breeding program for the species, it is important not only to identify the existing genetic diversity, but also to use the genetic diversity data to establish some guidelines. The finding that genetic dissimilarity among physic nut genotypes is indeed limited reinforces the need of adding more diversity to the germplasm bank to ensure long-term use of the germplasm in breeding programs. This initiative would be ideally performed by collecting new samples from the center of origin/diversity of the species, as suggested by Rosado et al. (2010) and Laviola et al. (2012b) in Brazil and other countries (Divakara et al., 2010).
Taken together, the results presented here demonstrate that: (1) the molecular diversity of the physic nut does not resemble its phenotypic variability, (2) the algebraic combination of Gower's phenotypic dissimilarity matrix with Jaccard's molecular dissimilarity matrix can be used to successfully perform a joint genetic diversity analysis, (3) the combination of phenotypic and molecular diversity matrices increases sampling of the physic nut genetic diversity, (4) joint analysis of phenotypic and molecular diversity reveals that physic nut variability in Brazil is indeed limited, but not as much as previously thought, and (5) additional collections are urgently needed to enhance the germplasm representativeness and usefulness in terms of species improvement.