Critical analyses when modeling tree biomass to ensure additivity of its components

It is presented the theme additivity of biomass of tree components. To evaluate and discuss this context, experimental information collected in forests of Acacia mearnsii De Wild. was used. Equations for components (stem and crown) and total biomass were fitted by means of two procedures: 1) generalized nonlinear least squares and 2) weighted-nonlinear seemingly unrelated regressions. Analyzing the performance of the estimators, it can be concluded that the two tested procedures are equivalent. On the other hand, this conclusion differs when evaluated the consistency and efficiency of the estimators. Fitting equations for the components and for the total biomass by an independent way is not realistic, because from a biological point of view the estimates of biomass are inconsistent, i.e., are not additive. The biomass estimates of the components and of the total, resulting from equations adjusted by means of systems of equations, provided narrower confidence intervals in relation to the equations adjusted independently, and is therefore more efficient. The second procedure presents better biological properties and statistics to estimate allometric equations for biomass of the components and for the total when compared with the independent estimation, thus it should be the method to be used.


1-INTRODUCTION
A basic premise must be considered when modeling biomass: the additivity.It means compatibility of biomass estimates of tree components (leaves, branches, bark, stem and root) with the tree total biomass.When modeling the tree biomass of components and its total, frequently fitting of the models are performed independently, i.e., it is done for each component without considering the interdependence existing between them.When using these equations, the sum of the components' biomass will not result equal to the total biomass and thus is not biologically consistent.As Chiyenda and Kozak (1984) stated, it is unrealistic to consider that the components of the same tree are independent of each other.
In this sense, some assumptions need to be stated to ensure accuracy in the prediction of forest biomass on a large scale.Genet et al. (2011) state ALEXANDRE BEHLING et al. that a good set of equations for this purpose should meet the following characteristics: i) consistency -referring to the separation of the total biomass of a tree in standard components, ensuring, however, its additivity, ii) robustness -referring to the composition of an operating system that works correctly for wide variations in the sample population and with low sensitivity to the sampling procedure and the assumptions made and (iii) accuracy.
In this research, only the characteristics i and iii will be treated; robustness will not be included in its scope, because this is a more generic topic, important in large-scale inventories.For these purposes, when adjusting biomass equations for the total and for the components, three fundamental aspects are highlighted: performance of the equations, biological consistency of the equations and efficiency of the equation estimators.The performance refers to the quality of the estimates of biomass, while the biological consistency refers to the additivity of components with the total biomass.Regarding the efficiency of the estimators, we analyze the equations tested and choose the one that results in a smaller variance, which determines narrower confidence intervals for the dependent variables and, therefore, is more efficient.
Taking the best equations, evaluated through their performance, two important properties remain in modeling components' biomass, because it is not satisfactory that only biomass equations are additive, but the estimator of the equations may be less efficient, or vice-versa.These issues have been researched over the past 50 years, highlighting the pioneering spirit of the researchers Kozak (1970), Cunia (1979), Jacobs and Cunia (1980), Chiyenda and Kozak (1984), Cunia andBriggs (1984, 1985) and Reed and Green (1985), which presented methods to achieve the additivity of the equations of biomass and, additionally, the review and dissemination of these procedures by Parresol (1999Parresol ( , 2001)).Nevertheless, Nord-Larsen et al. (2017) observed that estimating model parameters using simultaneous estimation (NSUR) increased model bias compared with OLS estimation for most biomass components, i.e., contrary to the expectations that NSUR estimation always reduces the standard error of estimates, in certain restrictive conditions discussed by the authors.Parresol (1999) has researched three procedures to ensure additivity of linear biomass ' equations and Parresol (2001) two procedures for equations that are not linear.In his research, he concluded that the procedure Seemingly Unrelated Regressions (SUR) for linear functions and Nonlinear Seemingly Unrelated Regressions (NSUR) for non-linear functions have resulted in more efficient estimators for components and total biomass, and the application of these procedures in component modeling is recommended.These works have provoked considerable influence in the dissemination of this type of modeling in various parts of the world, although in Brazil the application is restricted to studies conducted by Sanquetta et al. (2014Sanquetta et al. ( , 2015)).An alternative methodology has been presented by Dong et al. (2015), in which additivity is ensured by decomposing the component functions in a multiplicative manner, but it is difficult to work with this method, since it is difficult to constrain the component equation to show a logical behavior.
Some issues are worthy of attention because they justify the importance of the application of SUR and NSUR: (i) what is the gain in efficiency of these estimators over those obtained from a single equation?(ii) do the estimators of single equations produce biologically consistent estimates?(iii) are the estimates of biomass from the system of equations equivalent to those produced by equations estimated independently? (iv) are the estimated parameters and the statistics used to evaluate their quality of fit similar for systems of equations and for independently adjusted equations?
These issues have led to the presentation of this work and compose the following hypotheses, which were tested by means of conducting an experiment in black wattle forests (Acacia mearnsii De Wild.), in the state of Rio Grande do Sul, Brazil: i) "Considering that the equations obtained by means of fitting of single equation and of systems of equations are consistent statistically (without bias), then the estimates of biomass and the statistics of the fitted equations (coefficient of determination and coefficient of variation) obtained through these procedures do not differ among themselves".
(ii) "Given that the estimators for individual equations do not include restrictions to their coefficients, their estimates will not result in additivity of the components leading to biological inconsistency".
(iii) "Given that the estimator of systems of equations include contemporaneous correlations of components' biomass in their fitting, the variance of biomass estimates are expected to be smaller when compared to the estimators obtained from a single equation and therefore should result in greater efficiency".
Considering the assumptions made in this study, the objectives were to compare estimators of biomass through a single equation and by a system of equations, to evaluate their respective statistical estimates, their biological consistencies and, finally, their efficiencies.

2.1-ORIGIN OF EXPERIMENTAL DATA
The field research was conducted during the months of June and July 2014 in forest stands of black wattle (Acacia mearnsii De Wild.), located in places that concentrate the plantations of this species in the state of Rio Grande do Sul, Brazil.We decided to divide these places in three regions: Municipality of Crystal, Encruzilhada do Sul and Piratini.These regions are characterized according to climate, type of relief and soil in Mochiutti (2007).
In each region four forest areas were sampled in a sequence of ages after planting, so as to cover all the rotation period of the culture (10 years).In Cristal, forest stands of 1.75, 2.75, 5.00 and 10.08 years old were sampled, in Encruzilhada do Sul of 1.83; 3.08; 5.75 and 10.75 years old and in Piratini 1; 2.33; 5.25 and 9.83 years old.
In a random manner, four circular plots with diameter of 22.56 m (400 m²) were allocated in each forest stand and in the center of each one also a subplot of 10 m in diameter (78.54 m 2 ).This procedure was performed because it would not be possible to measure the variables of all trees in the plots, due to the high cost.Thus, it was decided to allocate a smaller sub-plot within the plot, in order to make feasible the proposed evaluations without affecting the research objectives.
The variables diameter at breast height and total height were measured in the plots.Yet, all trees inside each plot were felled and measured for the following characteristics: diameter at breast height, total height, and stem and crown biomass.
Thus, the present study encompassed the allocation of 48 plots, measurement of 3,462 pairs of diameter at breast height and total height and 670 trees for obtaining stem, crown and total biomass (defined as the sum of the stem and crown biomass) .

2.1.1-Measurement of variables in the plot
The goal of measuring the variables in the plot was to obtain the number of living trees per hectare.The values of the pairs of diameter at breast height and total height aimed to quantify, by means of equations, the biomass stocks of components per hectare.
The diameter at breast height (d) was measured using a dendrometric tape and the total height (h) a hypsometer (Haglöf).Forked trees below diameter at breast height were considered individual trees.ALEXANDRE BEHLING et al.

2.1.2-Measurement of variables in the sub-plot
All trees within the sub-plot were felled and assessed for the following characteristics: a) The diameter at breast height was measured using a dendrometric tape and the total height a measuring tape; b) The measurement of biomass was performed for the stem component (wood of the stem + bark) and for the crown (live and dead branches, leaves, flowers and fruits), taking the concepts defined in Picard et al. (2012).For each tree, these components were separated and weighed to assess the wet biomass using a digital scale (Portable Electronic Scale) with 5 g of precision; c) The dry crown and stem biomass were obtained from samples and immediately taken their masses using a digital scale (Hoyle) with a 1 g of precision.The crown samples weighted approximately 1,500 g and were taken along the crown of the tree at positions 0%, 25%, 50%, 75% and 95% in relation to the crown total length (away from the first branch, regardless of being alive or dead, to the crown apex).For the stem, 5 discs of 2 centimeters thick in the positions: 0%, 25%, 50%, 75% and 95% of the total height were removed; d) The samples were dried in an oven with movement and renewal of air at 100°C, and after constant mass, obtained through successive evaluations; the material was weighed in a digital scale with 1 g of precision.
The dry biomass was obtained in [1].
Where: B m = dry biomass of the m th component for stem or crown, in kg, for t th observation; MU mi = fresh mass of m th the component for stem or crown, in kg, for t th observation; MS i = dry mass of the sample of the m th component for stem or crown, in kg, for t th observation; MUA i = fresh mass of the sample of the m th component for the stem or crown, in kg, for t th observation.

2.2-EQUATIONS FOR ABOVE-GROUND BIOMASS
The equations for above-ground biomass of the components and for the total have been proposed considering the age of the forest stands, as is shown in Table I.We used the title group of equations for those that were independently estimated and system of equations for those jointly estimated.
The qualitative classification of the stand age was proposed by the authors.

2.2.1-Biomass equations with independent estimates -Procedure 1
The models of Schumacher and Hall (1933) and Spurr (1952) were selected to express the allometry for the component biomass and for the total, as a function of the variables d and h.The models were fitted using estimated generalized nonlinear least squares, with definition of the estimators presented in Greene (2008).The resulting equations are presented in [2] to [16].The White's test (1980) was applied to test the hypothesis of homogeneity of residues in each equation, at 95% probability.For those cases in which the hypothesis was rejected, weights were obtained through the structure of

2.2.2-Systems of Equations of biomass -Procedure 2
We used the model nonlinear seemingly unrelated regressions to fit the system of equations for the components and total biomass, with definitions of estimators in Srivastava and Giles (1987), Fiebig (2001), Greene (2008) andParresol (2001).The White's test (1980) was applied to test the hypothesis of homogeneity of the residuals for the equations of the components and for the total, at 95% probability.For those cases in which the hypothesis was rejected, the weights obtained in section 2.2.1 were used and the system was again fitted by means of the estimator weightednonlinear seemingly unrelated regressions.

2.3-PERFORMANCE, CONSISTENCY AND EFFICIENCY OF THE ESTIMATORS OF THE BIOMASS EQUATIONS
The performance of the equations for total biomass and for the components obtained in procedure 1, in 2.2.1 and in procedure 2, in 2.2.2 were compared using the values of the coefficients of determination, the coefficients of variation and the coefficients of the estimated equations.Additionally, estimates of biomass for the components and for the total were obtained, using the data from the plots, at the tree level and per hectare.These estimates were compared by means of a Chi-square test, whose null hypothesis assumed that the values estimated by means of the procedure 1 are equal to those estimated by the procedure 2, at 95% probability.ALEXANDRE BEHLING et al.
The biological consistency was assessed in relation to the additivity of the biomass of the components to the total, only tested for the equations fitted by means of procedure 1. Taking the expression in [32], estimates of biomass were considered with biological consistency if ≠ , then the estimates of biomass were considered without biological consistency, and it was evaluated at tree and plot levels, and per hectare.
( ) The efficiency of the estimators of the biomass equations fitted by means of procedures 1 and 2 was assessed in relation to the precision of confidence intervals in [33].The confidence intervals were calculated for all the trees from the plots, using the formulations presented in Parresol (2001).
Where: PIC i = the precision of the confidence interval for the t th observation of the i th component; IC i = the confidence interval for the t th observation of the i th component; ŷ i = the estimated value for the t th observation of the predictor variables.
The confidence intervals for ENGLS were calculated according to Greene (2008) and for WNSUR as presented in Parresol (2001).

3.1-PERFORMANCE OF THE ESTIMATORS OF BIOMASS EQUATIONS
The first hypothesis outlined in this paper: "considering that the equations obtained by means of the adjustment of single equation and of systems of equations are consistent statistically (without bias), then the estimates of biomass and the statistics of fitting of the equations (coefficient of determination and coefficient of variation) obtained through these procedures do not differ among themselves", was confirmed as stated in the following paragraphs.
The values of the coefficients of the equations for the components' biomass estimated by the procedure 1 ( ) The statistics for the assessment of the fitting quality, 2 R and CV, for the biomass equations of the components and total, fitted by means of the procedure 1 and 2, showed similar trends in behavior in relation to the group or system of equations, and also with similar values.The differences for 2 R ranged from -0.52% and 0.60%, whereas for CV between -1.46% and 3.17%.Still, by means of a graph to express the tendency among the observed values and estimated by procedures 1 and 2, were not observed deviations in relation to the estimated and observed values.
The estimates obtained through procedures 1 and 2 were compared by means of a Chi-square test and the obtained values were not significant, i.e., values were estimated by equations fitted by means of procedure 1 were similar to those estimated by equations fitted by means of procedure 2.

3.2-BIOLOGICAL CONSISTENCY OF THE BIOMASS EQUATIONS
The restriction of biological consistency, i.e., ˆˆT otal s C y y y = + , was not observed logically, only for the equations of biomass of the components and for the total fitted by means of procedure 1. ≠ , these equations were considered without biological consistency.The non-additivity of these equations (NA) at tree level is shown in Figure 1 for the variable d, when variations between -19.47% to 9.19% were observed.
The biological inconsistency was also observed at plot level, in which the non-additivity of the equations ranged between -4.09% to 2.22%.In the same way, was observed per hectare, with variations between -0.97% and 0.05% (Figure 2).
Therefore, the second hypothesis outlined in this paper -"since the estimators of systems of equations present restrictions to obtain the coefficients in their settings, their estimates should result in additivity of components, when compared with the estimators obtained by a single equation, and, consequently, result in biological consistency" -has been proven.

3.3-EFFICIENCY OF THE ESTIMATORS OF THE EQUATIONS OF BIOMASS
The precision of the estimates, using the equations fitted by means of procedure 1, for the crown component ranged from 5.46% to 41.11%, for the stem component from 2.12% to 21.30% and for the total from 1.29% to 21.74%.For the equations fitted by means of procedure 2, the specifications for the crown component vary between 0.75% to 28.24%, for the stem component from 1.62% to 20.43% and for the total from 0.28% to 20.62%.
The estimates of biomass for the components and for the total, using the equations fitted by means of procedure 2, resulted in narrower confidence intervals, therefore more efficient.The efficiencies in the confidence intervals between (EIC) varied from 5.39% to 77.04% (Figure 3), depending on the component and the system of equations.On average, in general, EIC were higher for the crown biomass component, followed by the total and by the stem components, respectively of 51.64%, 27.79% and 15.98%.Thus, the third hypothesis outlined in this paper -"since the estimators of systems of equations include the contemporaneous correlations of the biomass components in their fittings, their estimates should result in a smaller variance, when compared with the estimators obtained by single equation, and, consequently, result in greater efficiency" -has been proven.

4.1-PERFORMANCE OF THE ESTIMATORS OF THE BIOMASS EQUATIONS
The statistics for the assessment of the fitting quality, 2 R and CV, were similar in relation to the method of fitting the equations.Some of the statistics obtained from the equations fitted by procedure 2 led to a slight improvement and others did not.This is a direct effect of the flexibility of the estimator to meet the requirement of additivity of the components, and it has also been observed by Reed and Green (1985).Even if it occurs, does not affect the predictions, since the estimates of the components' biomass and of the total obtained by the systems or groups of equations, did not differ among themselves, as proven by the Chi-square test.
The systems and groups of Equations 1, 2 and 3 were defined by the same regressors to the functions of the components and this resulted in similar values for the obtained coefficients.This is a special case, in which the estimates of the coefficients, by means of the system of equations or individually, are equivalent.Greene (2008) demonstrated this condition in a comparative approach of the generalized least squares estimator (equivalent to the Seemingly Unrelated Regressions) with ordinary least squares (equivalent to the equations fitted independently).In this condition, assuming that X i =X j =X, then, '   ( ) The inverse of the matrix on the right side becomes, therefore, ( ) . Using the product properties of Kronecker, in which ( ) ( ) Still, on the right side of the expression in [34], each term ' i j X y is equivalent to X'y i , which in turn is equivalent to X'Xb j .With these results, after the elimination of the common values of X'X, the estimate of β is defined in [35].
Taking the first sub-vector to estimate β, after multiplication and cancellation of the momentum matrix is defined in [36].Figure 3 -Efficiency in the confidence interval (EIC) for estimates of biomass for the components and for the total from equations fitted by procedure 2. Where: , in percentage, and PIC WNSUR the average of accuracy of the confidence interval for the i th component, obtained for equations adjusted by means of procedure 1 and PIC ENGLS the average of accuracy of the confidence interval for the i th component obtained for equations fitted by means of procedure 1.
The terms in parentheses are elements of the first line of ∑∑ -1 =I, so the final result corresponds to 1 1 ˆb β = , i.e. ˆi i b β = , what would determine that the more efficient estimator would be the single equation.However, in some special cases, as the modeling of biomass components, results are compromised, since there are restrictions on β, which can be inside of the equations or between the equations.Such guidance indicates that, given the condition of the equations are composed by the same regressors, and due to restrictions that are imposed on the coefficients of the equations of the components to be additive, the coefficients will tend to similarity, even if the gain in efficiency by the system of equations still prevail (as approach performed below).
Through the performance analysis of the estimators, particularly in relation to the estimates of biomass and the statistics of fitting the equations, it can be concluded that the two tested procedures 1 and 2 are equivalent.On the other hand, this conclusion differs when evaluated the consistency and efficiency of the estimators.

4.2-BIOLOGICAL CONSISTENCY OF THE BIOMASS EQUATIONS
Whereas the statistical consistency of the estimators becomes better as we increase the sampling intensity, this reality can also be improved through statistical and computer techniques for obtaining allometric relationships.In other words, this consistency becomes more enhanced the closer are the allometric estimators of the true population values.If statistical modeling can approximate estimators of real population values (which is provided by least squares estimators), and also ensure an additive system of the parts of the organism with its whole, they are therefore biologically consistent, hence the nomination of Biological Consistency.
The biological inconsistency, in percentage terms, was higher at tree level.In this condition, positive and negative NA values do not cancel out, resulting in relative greater inconsistency than when compared at plot level, or per hectare.There were no trends established for the lack of additivity concerning the independent variables.
The biological inconsistency in absolute or relative terms, at tree level, was higher in groups of equations 4 and 5, i.e., in older forest stands, and relates to the composition of the models for components' biomass and total.The proposed models for components' biomass and total for the groups of Equations 1, 2 and 3 were the same, for groups of equations 4 and 5 the models proposed for crown and stem biomass were different.The lack of additivity was not even higher, because the used model for the total biomass was the same as for the stem.
The non-additivity was lower in percentage terms at plot level.In this case, the biomass of a component or for the total ( ˆ) i y for the h th plot corresponds to the sum of biomass estimated for t th trees.Thus, when adding the total numbers of sampled trees, positive and negative values that can generate NA at the tree level cancel out.This convergence does not tend to zero, due to the combination of the probabilistic distribution of independent variables in the plot with the randomness of NA at tree level.
In the same way, this also occurs when taking the average values of the plots to compose the biomass estimator per hectare.For this condition, the biological inconsistency for the 5 groups of equations ranged from -0.05% to -0.97%, is equivalent -0.001 to -1.31 t.ha -1 .Although the biological inconsistency decreases as the estimators of biomass are calculated in t.ha -1 , still remains and becomes significant when the estimator is extrapolated to the entire area of the forest stand.In addition, it is more severe when the inventories of biomass are done in large areas, for example, in the inventories of biomass at regional, state and national levels.
The equations adjusted independently generate biological inconsistent results, which implies that models for components' biomass and for the total should be estimated by means of systems of equations.Most of the equations used for the species black watlle, as well as for other species in Brazilian forests, were fitted independently.Therefore, these equations do not reach biological consistency and, consequently, we must take certain precautions before we simply use them.
An alternative to take advantage of the hundreds of equations of biomass, that were applied without considering the additivity principle, is to set for the total biomass the sum of the masses of the components.In this case, as it is also of interest to know the variance of the total biomass ( )  , as defined in [37].This approach is presented in Parresol (1999Parresol ( , 2001) ) and Sanquetta et al. (2015), and it is recommended that it be applied to non-additive biomass equations.

4.3-EFFICIENCY OF THE ESTIMATORS OF THE BIOMASS EQUATIONS
Hypothetically, having been solved the biological inconsistency of the independently estimated equations (using the references already cited), and knowing the slight difference in performance between the independent estimators and of systems of equations for the components, it could be concluded once more on the non-difference of the equations fitted logically by means of procedures 1 and 2. However, in a third analysis, which considers the efficiency of the estimators, i.e., by a process which results in the minimum variance, the equations fitted by means of procedure 2 are more efficient.This superiority, both for components' biomass as well as for total biomass, it is undeniable, since they will produce estimates with smaller variance, i.e., with narrower confidence intervals, by considering the correlation that exists between the components of the tree.As highlight by Zellner (1962) and Zellner and Huang (1962), the gain in efficiency occurs when the terms of the disturbances of different equations are correlated.Therefore, it is likely that this always happens when it comes to the tree biomass.
For the systems of equations adjusted by procedure 2, the contemporaneous correlation matrices were presented (Systems S1-S5 -Supplementary Material).Through them it was possible to demonstrate the correlations that occur between the measured components on the same tree, i.e., the dependence between the components of biomass.These correlations are the ones that will determine the efficiency gains due to the application of seemingly unrelated regressions.
Chiyenda and Kozak (1984) emphasize that it is unrealistic to consider that the tree components are independent, or that the residuals 1, , i k ε = … of components are not correlated, since the same tree is composed of information for more than one component of biomass.When considering these correlations, the equations adjusted through procedure 2 will provide estimates with smaller variance, that is, the result that determines the higher efficiency of the estimator.Thus, if a particular estimate of biomass results in smaller variance, hence its confidence interval is also narrower and, therefore, the estimate is more accurate.In addition, the existence of contemporaneous correlations makes it possible to obtain the best unbiased estimators for the equations of the components and for the total.Therefore, it is reasonable to conclude that this estimator should be used to adjust biomass equations for the components and for the total.
The estimates of the components' biomass and of the total resulting from equations fitted through procedure 2 made it possible to obtain narrower confidence intervals in relation to the equations adjusted through procedure 1, which was proved by means of the analysis of the obtained confidence intervals.This demonstrates that the most efficient strategy to adjust biomass by seemingly unrelated regressions.These perceptions are also conclusive in studies conducted by Parresol (1999, 2001), Sanquetta et al. (2015) and Zhao et al. (2015).
The values of the coefficients expressing the intercepts of the equations ( ) e ˆˆi β β led to close relations with the fitting method, mainly in what refers to the standard errors of coefficients, since the equations adjusted by means of procedure 2 resulted in smaller standard errors when compared to those obtained by equations fitted by means of the procedure 1.This is a direct effect of the inclusion of correlations in procedure 2, which was clearly evidenced by Parresol (1999Parresol ( , 2001)).
The efficiency provided by procedure 2 changed in the forests too young to mature.In young forest stands the contemporaneous correlation between biomass components was stronger, evidencing that in these ages the biomass distribution on the components is dependent on intrinsic factors of the species.In the older forest stands, the correlation between biomass components decreased, evidencing that the biomass allocation is dependent on external factors of the plant which affects its correlation.This is a consequence of the biological growth of plants, since the allocation of components' biomass as a function of their age is changed, therefore determines different contemporaneous correlations and, consequently, different degrees of accuracy in the estimates.In this sense, an aspect which denotes special attention is that concerning the crown component.
By means of the contemporaneous correlation matrices it is observed that the crown correlations with the stem and total biomass, estimated in the same tree, decrease from the very young forest to the mature one.The opposite was observed for the stem in relation to the total biomass, reflecting the effect of the allocation of components' biomass along the growth stages of the forest.Although the correlation of the crown component with the other components of the tree decreases over the years, it still has to be considered, since it will be an important element to make the estimates of this component more precise.By its nature, the crown is a difficult component to be modeled, so using an estimator that results in more accurate estimates is crucial, especially because this component gains importance in mature forests, since it is the raw material for energy production.These results ALEXANDRE BEHLING et al. confirm the importance of implementation of procedure 2.

4.4-IMPORTANCE OF ADDITIVE EQUATIONS AND SMALLER VARIANCE
In agreement with the analysis of the equations perfomance, the biological consistency and efficiency of the statistics, undoubtedly it is concluded that the equations fitted by means of procedure 2 are the ones that should be used, because they provide accurate estimates for the components' biomass: crown biomass and total, with the main characteristic of being obtained with smaller variance.
The use of procedure 2 results in statistically correlated equations, which are definitively related through contemporaneous disturbances of the equations -which gives rise to the nomenclature: apparently unrelated equations.The inclusion of the contemporaneous correlations between the biomass of the components and total biomass in the fitting of the equation systems through this procedure resulted in narrower confidence intervals, thus showing one of its most important properties, efficiency.The equations adjusted by means of procedure 1 do not take into account the contemporaneous correlations, which impairs the efficiency of this estimator.
How much is this efficiency?This issue is worthy of attention because it is what justifies the importance of applying procedure 2. In this study, the efficiencies in the 95% confidence intervals (EIC) ranged from 5.39% to 77.04%, depending on the component and the system of equations, therefore an outcome that should be considered relevant.Zellner (1962), andDwivedi andSrivastava (1978) analyzed this issue in detail, noting the conditions to be the gain in efficiency of the regression estimators is apparently not related in comparison to the average of single equation.The authors showed that the main condition for efficiency gain is due to the existence of a correlation between residuals of the equations, since, the stronger it is, greater is the gain in efficiency of the estimators of seemingly unrelated regressions, than when the average of a single equation is considered.
Another important property of systems of equations adjusted by means of procedure 2 is the additivity of the equations, desirable characteristic and fundamental in modeling tree biomass.The additivity of biomass estimates has been recommended by several researches: Kozak (1970), Chiyenda and Kozak (1984), Cunia and Briggs (1984), Paressol (1999, 2001), Carvalho andParresol (2003), Sanquetta et al. (2015), Zhao et al. (2015) and Affleck and Diéguez-Aranda (2016).In this work, the additivity of the components to compose the total biomass was guaranteed using procedure 2 and, having been established a model for each biomass component, that for the total biomass was defined in relation to the variables that composed the functions of the components.
There are other specifications of models and methods of estimation to force the additivity of biomass components to compose the total biomass for linear and non-linear functions, according to approaches that were performed by Kozak (1970), Jacobs and Cunia (1980), Chiyenda andKozak (1984), Cunia andBriggs (1984, 1985), Reed and Green (1985), Parresol (1999, 2001), Carvalho andParresol (2003), Sanquetta et al. (2015).In this context, Parresol (2001) highlights: I almost unilaterally recommend the joint generalized least squares approach for both linear or nonlinear modeling of biomass equations systems.This is due to the following aspects: • Once the components of biomass are not independent, they will be contemporaneously correlated; • The apparently unrelated regression approach allows us to account for these correlations to establish equations with smaller variances; • The intensity of the contemporaneous correlation is what determines the values of variance of equations.Thus, the larger this correlation is, the smaller will be the variance, therefore, the greater will be the gain in efficiency on the confidence intervals of the coefficients and of the estimates.The structure of the models presented in procedure 2 was used to ensure the property of additivity for nonlinear models of biomass.The proposed system guaranteed additive equations and, by including the structure of the variance of the error in the fitting of functions, also minimized estimates of variance.The structure of the systems of equations presented is similar to that proposed by Parresol (2001).In this case, the estimates of the components' biomass and total are assured by the following aspects: • Each component is expressed by a function, considered the best through the process of choice of competitive models; • The model for the total biomass is a function of the independent variables that compose the components, resulting in additivity; • Each equation of biomass, for components or total, has its own weighted function, so as to ensure estimates with minimum variance.In addition to the biomass additivity, it is important to ensure that the resulting functions have been adjusted for the condition that provides estimates of minimum variance.Experimentally, the heteroscedasticity of the residues in each of the biomass equations worsened as the forest became older.In fact, this result is not surprising, because in the mature forest condition the biomass deviations from their mean increase from the smaller to the larger classes.This allows to emphasize the importance of the correction of heteroscedasticity, given its effect on the estimates of the biomass equations.
To eliminate the heteroscedasticity of the residuals, a weighted function was established for each component of the biomass in order to model the structure of the variance os its residuals.By means of these functions, weights were calculated to stabilize the variance disturbances, whose conditions allowed the weighted estimator to generate equations with minimum variance and narrower confidence intervals.These results highlight the importance of conducting the modeling of the error structure, in order to ensure reliable estimates of each of the biomass components and also for the total with minimum variance.
Additivity and efficiency of biomass estimates of the components and total are prerequisites in the inventories of biomass.Thus, the approach presented in this work for the set of functions of biomass, by means of the procedure 2, proved to be appropriate, because of the following reasons: • Performance: estimation of biomass and good fitness of the equation similar to the method that maximizes the quality of the equation fitted independently; • Biological Consistency: the equations of biomass of the components are additive to compose the equation of total biomass; • Statistical Efficiency: estimation of biomass and coefficients with smaller variance, therefore more efficient.Through the estimates of equations, using procedure 2, it is possible to estimate systems of equations statistically correlated and, through the inclusion of correlations in the fitting, it was obtained equations with smaller variance, which propitiated to obtain more efficient estimates of the parameters and more reliable ranges for the estimates.The consideration of the additivity of systems of equations ensured the consistency of biological estimates of components' biomass and the total.
Due to the smaller variance, the systems of equations adjusted by means of procedure 2 produced more accurate estimates, since the confidence intervals were narrower.This was ALEXANDRE BEHLING et al. also evidenced by Parresol (1999Parresol ( , 2001)), who demonstrated the superiority of statistical efficiency and consistency of biological equations fitted by seemingly unrelated regressions, respectively for linear and non-linear functions.This allows an important application in estimates of biomass in forest inventories, since the errors from the equations should be considered in the calculation of the total sampling error, as shown by Cunia (1987).
Therefore, if the equations adjusted by means of procedure 2 result in smaller variance, it becomes acceptable that this minor error also propagate in the estimates per hectare, resulting in more accurate estimates when compared with the equations fitted by ENGLS.In this case, it is reinforced the importance of entering the structure of the error in the adjustment of the biomass equations in order to achieve efficient estimates and narrower confidence intervals.Therefore, these concepts reinforce the importance of using procedure 2 to establish biomass equations.
Parresol (2001) emphasizes that the additivity property ensures that regression functions be consistent one with another, since a component of the tree is part of the other component, therefore it is logical to expect that the estimates of the parties do not exceed the whole.This property has determined the consistency of the equations adjusted by means of procedure 2 for the reported examples, in which the total biomass above ground was divided into two components: stem biomass and crown biomass.According to Parresol (2001), this property can be applied to any quantity that is disaggregated in a logical system of components.

5-CONCLUSIONS
All assumptions made in this study were duly confirmed.

i) Performance of equations
The two procedures tested do not differ in relation to performance evaluation.

ii) Biological consistency of the equations
To estimate the equations of biomass independently for the components and for the total, is not realistic, i.e., are not additive.
A property in setting equations by means of the regression model apparently unrelated is the additivity, i.e., the estimates of the components' biomass are added to form the total.iii) Efficiency of the estimators of the equations Equations estimated by the seemingly unrelated regression produce estimates with smaller variance.Therefore, from the statistical point of view they are more efficient when compared with the independent estimated equations.
Estimating the biomass equations of the components and total biomass separately, means disregarding their interdependencies, resulting in equations that will provide estimates with larger variance, therefore, less efficient.

iiii) Systems of equations
The apparently unrelated regression model presents better biological and statistical properties to estimate biomass allometric equations for components and total biomass when compared to independent estimation.Therefore, it is the method that should be used.
ˆi β and through the procedure 2 ( ) ˆij β showed similar trends in behavior in relation to the group or system of equations, although differences in the values of the coefficients has occurred.The largest differences -D were originated in the coefficients expressing the intercept of the equations ( )1 1 e ˆˆiβ β , which ranged from -56.30%to 24.66%.The other coefficients -( ) between -3.71% to 11.18%.

Figure 1 -
Figure 1 -Variation of non-additivity, i.e., discrepancy (D) of estimates of biomass components in relation to the total biomass, derived from equations adjusted by means of procedure 1, at tree level, observed in forest stands of black wattle species in the state of Rio Grande do Sul.Where: ˆˆ ˆi i i i s C Total D y y y = + − all i and j on[34].

Figure 2 -
Figure 2 -Non-additivity ( ) NA of biomass estimates of the components in relation to the total from equations fitted by means of procedure 1, at plot level and per hectare.Where: the calculation for the plot is given by ˆ ˆî i i i s C Total NA y y y = + − in t.ha -1 , and related to the i th plot of the group of equations, and per hectare as ˆˆ ˆs C Total NA y y y = + − , in t.ha -1 .
, it can be obtained as a function of the variances and covariances of estimates of the components.The values of the variance tend to be larger, because they are dependent on the variances of each component