Selection parameters for improvement of yield and quality in tomatillo

ABSTRACT The extended use of the same variety by farmers over successive generations and reuse of the “seeds” caused natural variability that can be a magnificent opportunity for genetic improvement of tomatillo. Two successive seasons were accomplished to select genotypes with good performance from a heterogeneous population of the local tomatillo “Balady”. The results revealed significant differences in early yield as fruit number and average fruit weight over the original population. Additionally, the results revealed a highly significant correlation between fruit set percentage and total soluble solids (TSS) at 60 days and the marketable yield with fruit length and TSS at 60 days. In conclusion, the path analysis along with heritability and selection genetic gain showed that early yield and fruit set are the two most critical component traits for fruit yield in tomatillo. In addition, the average fruit weight showed a low indirect effect on fruit yield per plant. Twenty promising genotypes were selected as a kernel for pure line development through further breeding programs.


INTRODUCTION
The tomatillo (Physalis ixocarpa Brot.2n = 2x = 24) is a species that is native to Mexico and Central America, and it is currently one of Mexico's most significant vegetable crops.In 2020, this crop was the sixth vegetable with the largest planted area in Mexico, with 40,117 ha and 766,515 tons of production (Sistema de Información Agroalimentaria y Pesquera -SIAP, 2021).It is a fruit vegetable that belongs to the family Solanaceae and bears round or spherical and green or green-purple fruit.The tomatillo fruit is surrounded by an inedible, paper-like husk formed from the calyx (Waterfall, 1967).Tomatillos make a great addition to a high-antioxidant eating plan focused on cancer prevention (Dolson, 2020) due to their high nutritional value, containing 8.24 °Brix, 0.75-1.06%protein, 1.12-2.10%fat and 0.77-1.42%ash (Shenstone et al., 2020).Tomatillo fruits are mainly rich in potassium, and they are also a rich source of Mg, Ca, Na, P, and bioactive compounds (González-Pérez; Guerrero-Beltrán, 2021).
The phenotypic value of an individual, P, measured as a deviation from the population mean, is the sum of two parts: the deviation of its family mean from the population mean, P f, and the deviation of the individual from the family mean, P w .The procedure of selection varies according to the attention given or the weight given to these two parts.If we select based on individual values only, we give equal weight to the two components P f and P w of the individual's value P.This is known as individual selection, where individuals are selected solely by their phenotypic values.This method is usually the simplest to operate and, in many circumstances, yields the most rapid response.The genetic principles of individual selection appear to be similar to the pure line selection strategy that has been used in many breeding programs, such as in coffee (Carvalho et al., 1952), beans (Yokoyama et al., 1999;Santos;Abreu;Ramalho, 2002), and rice (Rangel;Zimmermann;Neves, 1998).It is expected that during selection process, a high selection differential could be obtained, to provide greater progress in selection.Genetic gain is closely associated with the heritability of the trait of interest (Araújo et al., 2021).The higher heritability suggests that much of the total genetic variability associated with the selected trait is due to the additive action of genes, indicating that selection for this trait should be effective in the first segregating generations (Cruz;Regazzi;Carneiro 2012).
Correlation and path coefficient analyses are prerequisites for the improvement of any cultivar, including tomatillo, for the selection of superior genotypes (Elsayed, 2017).The information obtained from the correlation coefficients can be enhanced by partitioning into direct and indirect effects for a set of pairwise cause-effect interrelationships (Kang;Miller;Tai, 1983).It is standardized partial regression analysis and deals with a closed system of variables that are linearly related.Hence, this approach would facilitate selection based on more than one trait, which can produce balanced outcomes with selection (Araújo et al., 2021).
In tomatillo, few studies on correlation without investigating the effect of indirect selection, such as path coefficient analysis as a tool for enhancement selection gain, have been reported to estimate the associations between yield and other traits (Peña- Lomelí et al., 2008;Santos et al., 2021).In this respect, multiple traits should be taken into consideration simultaneously in selection to meet farmers, manufacturers and consumer requirements (Carvalho et al., 2021).
Few tomatillo genetic studies have been conducted in Egypt with modest efforts in improvement programs.Hence, the local variety "Balady" suffered from different levels of deterioration with many diseases and pests, resulting in a significant reduction in yield and fruit quality characteristics and asymmetry.
Therefore, the goal of this investigation was to identify and select superior genotypes that possess good performance in traits of interest from a heterogeneous population of the local tomatillo "Balady cv".Additionally, to purify a mixed population with differing phenotypes to develop new lines, thereby improving the average performance of the population.

MATERIAL AND METHODS
This study was conducted during two successive seasons, 2019 and 2020, at Elbaramoon Research Farm, Mansoura city, 31°2′25″N 31°22′58″E, 15 m altitude, Horticulture Research Institute, Egypt.The Köppen-Geiger climate classification system classifies its climate as hot desert (BWh).The experimental site is characterized by its silt clay loam soil with pH 8.13 and organic matter 1.7%.
Seeds of the basic population were sown on 20 February 2019 in 209-seedling trays.By the first week of April at 40 days of age, the seedlings were transplanted to the field.During the first season, transplanting of the tomatillo "Balady cv" seedlings was conducted with approximately 500 plants of tomatillo grown in bulk in an open field under clay soil conditions with a surface irrigation system.At the flowering stage, visual identification, labeling and selfing of plants with diverse phenotypic characterizations were performed.However, tomatillo plants are non-self-pollinated species, and to guarantee and ensure no cross-pollination by insects that could happen, the flowers were bagged at the bud stage.At the end of the maturity stage, seeds of individuals with superior agricultural and yield traits were collected and picked from each individual separately for seed extraction.
In the following season, seeds of every selected plant at an independent line in 209-seedling trays were sown.tomatillo seedlings were transplanted into the field on the 7 th of April 2020.The experimental design used was a randomized complete block design (RCBD) with three replicates.Each entry (plot) contained 20 plants for each genotype.The plot area was 8.4 m 2 and included three rows, each row 0.7 cm apart and 4 m in length.The plants were distributed to complete the number of plants for each genotype, labeling the final plant of each genotype in the third row.The transplants were sown in hills on one side of the ridge as one plant at 0.5 m.All agricultural practices were applied whenever they were needed.Chemical fertilizers, such as phosphorus and potassium fertilizers, were added during soil preparation according to the recommendations of Ministry of Agriculture.Nitrogen fertilizer was divided into three equal portions and added to the soil at 30, 55 and 80 days after transplanting.
Ciência e Agrotecnologia, 47:e013722, 2023 Growth characteristics: plant height (cm), number of main branches per plant and number of leaves per plant were recorded.Yield components were estimated: number of flowers per plant, early fruit number (the sum of first five pickings of maturity fruits), early yield (the first five pickings were weighed for plant and recorded as grams plant -1 ), total number of fruits (estimated by summing number of all picked fruits), total plant yield (determined by summing weight of all picked fruits (g plant -1 ), total yield per feddan, (0.42 hectare) that was estimated by multiplication the average yield of each plot in kg by the number of plants per feddan, average fruit weight [determined by dividing the weight of 10 fruits by their numbers (gram)], fruit set percentage was estimated by the following Equation 1: interaction with the environments; the second refers to the controls, which are evaluated in a factorial system, quantifying the variation between them and the variation in the interaction with the environment.Contrasts between controls and families were included along with the sources of variation inherent to the environments, characterized by blocks, assay and residual effects.Equation 2, where: (1) Marketable yield [determined by summing the weight of all picked fruits without damage (kg)]; fruit length (cm), measured during the harvest stage as the average length of ten randomly selected fruits per replicate using a digital slide caliper; fruit diameter (cm), measured during the harvest stage as the average diameter of ten randomly selected fruits per replicate using a digital slide caliper; shape index, estimated by dividing the value of fruit length by the value of fruit diameter and total soluble solids percentage, estimated two times at 45 and 60 days using a hand refractometer.

Statistical analysis
The statistical model was applied to obtain the analyses of variance of all traits according to (Steel;Torrie, 1960) using a random model as follows: where Y ij : the i-th genotype value in the j-th replication; μ: population means; G i : the i-th genotype effect; Rj: the j-th replicate effect;: E ij the experimental error effect.

Combined analysis
In the combined analysis of several assays, the Z ijk and Y ijk values are admitted, which refer to the same variable but are used to represent the values of the controls and the genotypes (families), being more enlightening, since the effects considered in the determination of each observation are differentiated (Bailey, 2008).In this analysis, it considers two sources of variance regarding the genotypes; the first concerns the families in which each group occurs in a given trial, and it is not possible to quantify their where: DS X X   ) P: parental control, this value is equal to one when the relationship between progeny and offspring is obtained by self-pollination or vegetative propagation.Phenotypic correlations between the different vegetative, earliness, and fruit characters were estimated by Pearson's coefficient, and their significance was evaluated by t tests.All statistical analyses procedures were performed with the aid of Genes software program v. 2015.5.0 (Cruz, 2013).
number of fruit set in the first five flower clusters 1 00 total of flowers anthesis in these clusters x (2) (3) Since: Z ijk : value evaluated in the i-th common treatment, and j-th repetitions of the k trial; µ: general mean of trial; Te i : effect of i-th control (common treatment); Bj (K) : effect of the j-th block within the k-th trial; E k : effect of k-th trail; Te Eik : effect of interaction between the common treatment and the trial (season) and R ijk : random error and Equation 3: where: Y ij(k) : value evaluated in i-th treatment (family) in j-th replication, for a particular trial k; µ: general mean of the experiment; F i : effect of i-th family in a k trial; B j : effect of j-th block in a k trial and R ij(k) : random error in a k trial.
The following parameters were estimated: broadsense heritability (h 2 ), experimental coefficient of variation (CV), coefficient of genotypic variation (CV G ) and experimental variation (CV E ).Thereafter, the selection gains (GS) were estimated, adopting a selection intensity of 8% according to Vencovsky (1987) as Equation 4:

Mean performance of selected individuals
Desirable individuals with superior values are identified, selected and selfed to generate the following generation to develop inbred lines at the end.Twenty individuals for each studied trait, representing approximately 8% of the selection intensity, were selected from the original population that showed the best values for each evaluated trait.Plant number and its mean performance for each trait are presented in Table 1.
Regarding plant height, the mean performance of two selected individuals (G. 13 and 11) were more than 150 cm in height and more than the original population mean (P 0 = 133 cm).However, no significant differences were observed for the number of branches between the selected individuals and the original population, and only two individuals (G. 8 and 20) gave slightly more branches than the original mean (P 0 = 22.7) (Table 1).For the number of leaves per plant, approximately 17 of the selected plants had significant differences compared to the original population mean (P 0 = 173.3).On the other hand, the majority of selected plants gave superior values regarding the number of flowers per plant, whereas genotypes 11 and 15 gave the highest number of flowers.Regarding early yield as fruit number, four genotypes (16, 6, 15 and 14) gave significant differences in early yield as fruit number over the original population (P 0 = 23.3fruit number).The same genotypes maintained their arrangement to the early yield per plant (g plant -1 ).These values of early yield simultaneously with the total number of fruits were obtained over similar individual plants (Elsayed;Elsaid;Habiba, 2015).
For average fruit weight (Table 2), only a selected individual had significant differences compared to the original population mean (no 13 with 9.567 g).The increase in fruit weight ranged between one and two grams over the original mean.For fruit set percentage, 14 plants of 20 gave an increase in fruit set % compared to the original population mean, where genotype 18 gave the highest fruit set, followed by genotypes 16, 12, 10 and 15.On the other hand, nine genotypes had significant differences from the original population mean.Regarding the marketable yield per feddan (0.42 hectare), the majority of the selected plants recorded a significant increase at 5% over the original population means (Table 2).Regarding fruit shape, fruit diameter and length (Table 3) gave the lowest values for selection, where the round type for tomatillo fruits is considered the preferable shape for the consumer.The regular and uniform round shape is controlled by many other traits, such as the maturity stage, harvest date, fruit set, climatic conditions and other factors that complicate genetic improvement, such as this type of character.When the values of the shape index are approximately zero, this is a preferable value that reflects the regular round shape of tomatillo fruits.Hence, genotypes 11, 5, 18, 4, 19 and 3 could be used for further breeding programs for fruit uniformity improvement.

Path coefficient analysis
Before conducting path analysis, the multicollinearity test was applied as the previous step to avoid collinearity among the traits of interest.According to multicollinearity analysis, the variance inflation factors, VIF, were more than 10, indicating the existence of multicollinearity (Table 4).Moreover, the condition number, the ratio between maximum and minimum eigenvalues, was more than 1000, which indicates severe collinearity before exclusion of the traits with a coefficient of correlation of more than 0.900.Hence, this ratio was adjusted to 941.4,revealing the presence of moderate collinearity.Under these new circumstances, the outcomes of the direct and indirect effects of the vegetative and fruit characteristics and different yield component parameters on total yield can reliably be drawn by adopting path coefficient analysis.
For quantification of the magnitude of phenotypic selection and direction of the interrelationship among studied traits, simple correlation coefficients were estimated among some vegetative, yield components and fruit characters.The correlation analysis was obtained by applying the mean genotypes over all the estimated traits.A negative and significant correlation at 5% probability was obtained between the number of branches per plant and total soluble solids at 45 days (-0.461) with a coefficient of coincidence (C %) of 41.94%.Early fruit number was associated with early yield per plant, the total number of fruits, total yield per plant (TY) and fruit diameter, with significant positive correlations estimated by 0.812, 0.814, 0.438, and 0.627, respectively.
Significant and positive correlation coefficients were observed between early yield per plant and the total number of fruits (0.964), fruit set % (0.608), total yield per plant (0.672), marketable yield (0.551), fruit diameter (0.638), and TSS at 60 days old (0.524).Similar associations were found for the total number of fruits with other yield components in addition to fruit length and diameter.Regarding the correlation between fruit set % and other estimated traits, in addition to its strong positive association with other yield components, a highly significant correlation between fruit set % and TSS at 60 days old was observed (0.600, C % 54.84).Additionally, the marketable yield had a significant positive correlation with fruit length and TSS at 60 days old (Table 5).
Vegetative traits such as plant height and number of branches had a negative correlation but no significant association with yield components, which indicates that short plants with moderate shoots have a higher yield than tall plants.A significant positive association with high coefficient coincidence was estimated between early yield and other yield components, indicating the possibility of practicing early selection for yield using this parameter as a form of indirect selection.
As explained, the correlation of total fruit yield with most of the studied traits except plant height, number of branches, average fruit weight, shape index and TSS at 45 days indicated that indirect selection of yield through early yield per plant is possible because of the high and significant indirect effects.
As outlined previously, nonetheless, the highest direct effect of fruit set percentage on total fruit yield and earliness traits at the beginning of the stage were found to be the most important selection index of tomatillo under our field conditions (Table 5).
In contrast, Shushay, Derbew and Abay (2014) revealed that average fruit weight had the highest direct effect on total fruit yield in cultivars similar to tomato (0.644).The selection indexes aggregate information to multiple characters, and with this, they are able to carry out the selection of a set of variables simultaneously (Carneiro et al., 2021).Hence, direct selection based on earliness can reliably be used for yield improvement in tomatillo.
The residual effects estimated by 0.649 revealed that the causal factors explained approximately 35.10% of the variability in the dependent factor, total fruit yield per plant.The coefficient of determination was 0.578 between early yield and total fruit yield per plant, meaning that 57.80% of the variance in total yield can be explained by early yield%.In a similar study, high additive genetic correlations between the first bifurcation height (FBH), fruit set number (FSN), yield per plant in the first cut (YP) and fruit number per plant (FP) with final yield (FY) were reported by Peña- Lomelí et al. (2008).The same authors reported the possibility of obtaining high gains by indirect selection according to heritability values in the first four characters (secondary variables) than in the main character (FY) based on higher FSN, YP, FP and lower FBH (Hallauer;Miranda, 1981;Falconer, 1986).Both simple and combined ANOVA revealed a significant genotype effect (P < 0.01) for 50% of the evaluated traits, including the number of leaves, earliness parameters, fruit set%, yield and TSS at 60 days old, indicating the existence of genetic variability among the population for these traits (data not shown).On the other hand, statistically, no significant differences among the population individuals could be detected for plant height, number of branches, number of flowers, average fruit weight, fruit morphology characteristics and TSS at 45 days old (data not shown).
The knowledge of population genetic parameters is of great interest for a better understanding of the population structure regarding its genetic variability, expected genetic gains and consequently possible success in the breeding program (Elsayed;Elsaid;Habiba;2015).The high heritability value estimates for earliness parameters (94.72%, 87.73%) and number of leaves (84.48%) demonstrate high selection gain.On the other hand, direct selection based on these traits could achieve great progress attributed to the insignificant impact of these traits by the environment (Falconer;Mackay, 1996).Moderate estimates for average fruit weight, fruit set%, and marketable yield, and particularly the low heritability estimates for plant height, number of flowers, and TSS at 45 days old, revealed a favorable influence of the environment rather than genotype, and hence, selection gain for these traits would not be rewarding in early generations.
Few of the evaluated traits showed highly significant variation as genetic causes, as revealed by the high values of the coefficients of genetic variation.The possibility of improving the mean performance during the next phases of the selection cycles is high for these traits.In this regard, high values of the coefficient of genotypic variation (CV G ) reveal proper conditions for selection.Except for plant height, number of branches, number of flowers, average fruit weight, fruit set, fruit morphology characteristics and TSS at 45 days, the other traits had high estimates of CV G , especially the traits related to earliness, which had the highest estimates ranging from 15.27% to 30.20% (Table 6).studies on tomato (Asati;Rai;Singh, 2008;Pradeepkumar et al., 2001).From the estimates of phenotypic and genotypic variance and heritability, Vieira et al. (2021) found that with the use of 130 progenies, these estimates tend to stabilize, implying that a high number of progenies does not interfere decisively in the quality of most parameters, except for the limits of maximum and minimum variation.
The CV G /CV E ratio (coefficients of genotypic variation and experimental variation) is another parameter for indicating selection feasibility.Values close to or greater than unity for a trait mean that selection is promising because the major portion of the observed variability is attributed to genetic causes (Cruz;Regazzi;Carneiro, 2012).Seven traits, including leaf number, earliness, total yield per plant and per feddan, marketable yield and TSS at 45 days old, presented estimates of CV G /CV higher than 1 (Table 6).High lighting early fruit number by presenting higher CV G , h 2 , and CV G /CV E values.Despite the relatively high values of heritability regarding certain traits such as early fruit number, this is not a guarantee that the selection gain will be large.Hence, higher genetic variability in the In a similar study on the same species, Lomelí et al. (2004Lomelí et al. ( , 2008) ) found similar values to those in the present study for CV G of 28.7% and 14.9% for fruit yield and number of fruits per plant, respectively.In similar study on tomato, Prema, Indiresh andSanthosha (2011) indicated the prevalence of sufficient genetic variation from all the characters studied except plant height at 60 and 90 days after transplanting.Also the same authors recorded high phenotypic and genotypic coefficient for average fruit weight, fruit yield per plant (kg), lycopene content (μg/100g), fruit length (cm), total soluble solids of fruit (°Brix) and fruit width (cm) among cherry tomato genotypes.Kumar et al. (2004) observed moderate heritability and low genetic advance for the number of branches.
According to our findings, the selection gain obtained by the highest values of both CV G and heritability for the early fruit number trait did not show great gain by selection.While the marketable yield per feddan gave the highest gain by the selection, (32.33%) based on fewer values of both CV G and h 2 .In contrast, high values of CV G (coefficient of genetic variation) and h 2 have been reported in similar original population is necessary to allow the formation of a proportion of individuals whose mean is higher than the original population mean so that when multiplied by heritability, the result will be optimal (Grajales et al., 2000).Among the objectives of the current investigation was to identify genotypes with higher fruit yield and fruit flavor quality.TSS is considered one of the most important traits related to the organoleptic quality of fruits.TSS is a fast and easy test that can give a general impression of the fruit's sweetness (Mitchell;Uchanski;Elliott, 2019).The means of the individuals selected for the evaluated traits are listed in Tables 1, 2 and 3.In this respect, Pérez-Herrera et al. ( 2021) reported a high variation in TSS content among genotypes of wild Physalis spp ranging from 4.2 to 7.0 °Brix.On the other hand, the direct selection for yield per plant based on plant height, number of branches per plant and TSS at 45 days was not successful, resulting in undesirable gains for these traits.
Relatively high CV G and h 2 estimates associated with higher gains for the traits of interest indicate the importance of additive gene action (Narayan et al., 1996), enhancing the selection procedure.Grajales et al. (2000) reported lower heritability values through an improved material of P. ixocarpa compared to our finding, ranging from 0.26 to 0.23 for fruit yield and number of fruits per plant, respectively.Additionally, Ara et al. (2009) and Saeed et al. (2007) showed high heritability followed by high genetic gain for the total number of fruits per plant.
The species P. ixocarpa is known to be selfcompatible (Pandey, 1957), so mass selection, halfsibling and blended families of half-siblings are the most appropriate selection methods for their improvement (Peña- Lomelí et al., 2013).However, the formation of hybrids using dihaploid lines obtained by another culture has great potential for use in the species (Ortuño-Olea; Manzo-González; Peña- Lomelí, 1998).
The results obtained here by direct selection on early yield traits emphasize the fact that the high gains for these traits are associated with satisfactory gains for TSS at 60 days old.On the other hand, indirect selection on early yield per plant and fruit set appears to be promising to obtain indirect gains for fruit yield and soluble solids content at 60 days old.According to our findings, individual selection of tomatillo genotypes for early yield and soluble solids content was sufficient for the breeding process 60 days after transplanting, while earlier selection for TSS resulted in undesirable gains.
Early maturing cultivars are preferred for both producers and markets, where they could guarantee high prices due to the lack of this item in the market during this period of the year.In addition, earliness in yield is an important selection criterion by alleviating unfavorable climatic conditions such as drought, heat stress and diseases by escaping.In certain cases, such as a lack of water, the short cycle of varieties with concentrated yield in the first pickings could be a good option under these circumstances.
On the other hand, taste and flavor are the main components of the marketability of tomatillos and other fruits.Total soluble solids are significantly related to fruit flavor and the plant's accumulation of sugars (Jiménez-Santana et al., 2012).Therefore, greater efforts are being made to improve these traits by increasing the total soluble solid content.Through this study, the total soluble solids were affected by the age of the plant when estimating this trait, according to the previous data.The data showed that the increase in the TSS content increased with the age of the plants, i.e., at 60 days compared to 45 days.The nature of the association between TSS content and the yield also changed, as it showed an increase in its value in conjunction with the increase in the total and early yield.

CONCLUSIONS
Path analysis along with heritability and selection gain showed that early yield and fruit set are the two most critical component traits for fruit yield in phenotypic selection in tomatillo.In addition, the average fruit weight revealed a low indirect effect on fruit yield per plant.Among the evaluated materials, ten promising genotypes, 18, 16, 6, 20, 9, 15, 10, 14, 5, and 13, were identified and selected that possess better performance under the current conditions of our investigation.

Table 1 :
Mean performance of 20 genotypes with a selection intensity of 8% (from 250 plants) among the S 1 population for vegetative and early yield traits in tomatillo resulting from phenotypic selection.

Table 2 :
Mean performance of 20 genotypes with a selection intensity of 8% (from 250 plants) among the S 1 population for yield traits in tomatillo resulting from phenotypic selection.
P 0 and S 1 refer to the means of the original and selected populations, respectively.LSD: Least significant difference, Fisher's test.n.s , *, ** and *** significance of the F test at not significance, 0.05, 0.01 and 0.01% probability, respectively.ELSAYED, A. Y. A. M. et al.

Table 3 :
Mean performance of 20 genotypes with a selection intensity of 8% (from 250 plants) among the S 1 population for fruit shape and TSS in tomatillo resulting from phenotypic selection.

Table 4 :
Multicollinearity diagnosis of the correlation matrix including some vegetable traits, yield components and fruit characters in tomatillo under individual phenotypic selection.
*Small single value indicates collinearity, while high condition index indicates collinearity.Condition number (Max/Min): 941.4 indicates moderate to strong collinearity.

Table 5 :
The direct and indirect effects of nine secondary components on the total fruit yield per plant as the primary component in tomatillo plants resulted from individual phenotypic selection (value of k: 0.218)

Table 6 :
Estimates of genetic and phenotypic parameters for vegetative, yield components and fruit characters in tomatillo obtained by phenotypic of individual selection for two successive seasons.