Accessibility / Report Error

Orthogonal contrasts: definitions and concepts

Contrastes ortogonais: definições e conceitos

Abstracts

The single degree of freedom of orthogonal contrasts is a useful technique for the analysis of experimental data and helpful in obtaining estimates of main, nested and interaction effects, for mean comparisons between groups of data and in obtaining specific residuals. Furthermore, the application of orthogonal contrasts is an alternative way of doing statistical analysis on data from non-conventional experiments, whithout a definite structure. To justify its application, an extensive review is made on the definitions and concepts involving contrasts.

analysis of variance; partition of sum of squares; experiments with additional treatments


A técnica de contrastes ortogonais com um grau de liberdade é simples e bastante eficiente na análise de dados experimentais, como por exemplo, na obtenção de efeitos principais, de efeito de interação e de efeitos aninhados, nas comparações entre grupos de médias e na obtenção dos resíduos específicos. Além disso, sua aplicação tem revelado ser uma forma alternativa para análise de dados obtidos de um experimento que não segue uma estrutura definida. Com o objetivo de justificar a sua aplicação, foi realizada uma revisão sobre as definições e os conceitos envolvendo contrastes.

análise da variância; partição da soma de quadrados; experimentos com tratamentos adicionais


REVIEW

Orthogonal contrasts: definitions and concepts

Contrastes ortogonais: definições e conceitos

Maria Cristina Stolf Nogueira

USP/ESALQ - Depto. de Ciências Exatas, C.P. 9 - 13418-900 - Piracicaba, SP - Brasil. e-mail <mcsnogue@esalq.usp.br>

ABSTRACT

The single degree of freedom of orthogonal contrasts is a useful technique for the analysis of experimental data and helpful in obtaining estimates of main, nested and interaction effects, for mean comparisons between groups of data and in obtaining specific residuals. Furthermore, the application of orthogonal contrasts is an alternative way of doing statistical analysis on data from non-conventional experiments, whithout a definite structure. To justify its application, an extensive review is made on the definitions and concepts involving contrasts.

Key words: analysis of variance, partition of sum of squares, experiments with additional treatments

RESUMO

A técnica de contrastes ortogonais com um grau de liberdade é simples e bastante eficiente na análise de dados experimentais, como por exemplo, na obtenção de efeitos principais, de efeito de interação e de efeitos aninhados, nas comparações entre grupos de médias e na obtenção dos resíduos específicos. Além disso, sua aplicação tem revelado ser uma forma alternativa para análise de dados obtidos de um experimento que não segue uma estrutura definida. Com o objetivo de justificar a sua aplicação, foi realizada uma revisão sobre as definições e os conceitos envolvendo contrastes.

Palavras-chave: análise da variância, partição da soma de quadrados, experimentos com tratamentos adicionais

INTRODUCTION

The orthogonal contrast technique is a simple and efficient way of analysing experimental data to obtain, for instance, the main effects, interaction effects and nested effects, for comparisons between groups of means and/or to obtain specific residuals. Additionally, the application of orthogonal contrasts is an alternative way of doing statistical analysis on data from experiments without a definite structure, like the experiments with additional treatments. The objective of this paper review was to justify the application of the single degree of freedom orthogonal contrasts in the analysis of experimental data from non-conventional experiments, recently published by Nogueira & Corrente (2000) and Corrente et al. (2001).

Definitions and concepts for mean contrasts with equal number of replications

Scheffé (1959), Winer (1971), Steel & Torrie (1981), Mead (1988) and Hinkelmann & Kempthorne (1994), among others, define a contrast between treatment means, represented by Y, as a linear function that can be estimated, considering an equal number of replications for all treatments, as follows:

where ci are values of coefficients associated to mi, and mi the mean attributed to the treatment i, so that

Supposing the mathematical model Yij = m + ti + eij, when i = 1,..., I and j = 1, ..., r, and m being a constant, ti the treatment effect i, so that: , and eij the experimental error, so that eij ~ N(0, s2) and independent of each other.

Supposing that mi = m + ti, then

Two contrasts and , since h ¹ h' and h = 1, ..., (I-1), are orthogonal, if Cov(Yh , Yh') = 0. Thus, for the adopted model Cov (Yh , Yh') = , occurs when ách' i, ch' iñ, that is .

Supposing isan estimator in such way that , with ,

considering eij ~ N ( 0, s2) and independent of each other, then is an unbiased estimator for Yh and

A special characteristic of orthogonal contrasts is that they may easily be included in the analysis of variance, in such way that they originate sums of squares with one degree of freedom which correspond, each of them, to the (I-1) subdivisions of the sum of squares due to the treatments with (I-1) degrees of freedom. That is, the sum of squares due to the treatments can be decomposed in (I-1) sums of squares due to the contrasts with one degree of freedom.

Mead (1988) demonstrated this characteristic of the orthogonal contrasts and defined that the sum of squares due to Yh is given by:

with one degree of freedom and consequently, the sum of squares due to treatments is given by:

with (I-1) degrees of freedom.

Hence, supposing is a contrast of treatment effects, so that and , with h ¹ h' , for h = 1, ... , (I-1) , and ghi is a coefficient defined as:

so that , with h ¹ h' , h = 1, ..., (I-1) and , originating orthonormal contrasts. Consider also a group of (I-1) orthogonal linear functions of the treatment effects expressed in the following matrix form:

where = (gh1, gh2,...,ghI)for h =1, ... , (I-1);t' = (t1, t2 ,...,tI)is the vector of the estimates of the treatment effects. Thus, the vector z will be a group of (I-1) estimates of the considered orthogonal contrasts, and that, due to the contrast orthogonality, the contrast estimates are independent, that is, the zh are independent of each other. Thus, for h =1, 2, ... , (I-1). If the variable observations follow a normal distribution, then the vector z is a normal aleatory variable vector and independently distributed, and each element has mean = zero and variance = 1.

Adding to the matrix G' a line vector g'0 , so that , where E is a vector of 1's, the data matrix will take the form:

where the data matrix G*' is an orthogonal matrix, having G*'G* = I, and for this reason G* = (G*')-1 , resulting G* G*' = I .

Supposing

where , for h = 1, ... , (I-1), thus

considering , and yij = m + ti + eij , the expression can be written:

Substituting equation (2) into equation (1) the following expression is obtained:

Considering that , ever h = 1, ... , (I-1), and applying to (3), the obtained form is:

Considering that E(ti) = ti , E (eij) = 0 and and also, E( ti eij ) = 0 , thus

If Yh = 0, E(z h ) = 0, and since zh follow a normal distribution, and supposing the null hypothesis H0 : Yh = 0, thus

Now supposing

and that

where EE' is a matrix of elements 1, thus

and like matrix G*' is also a orthogonal matrix and . Due to the restriction relative to , thus

Considering that G* G*' = I , due to the orthogonality of matrix G*', which means

The comparison between (5) with (4) evidences that and consequently,

The one by one element multiplication of (6) by r, results in:

In this way, the sum of squares due to Yh is obtained by , with one degree of freedom, having the following properties:

(i) If Yh = 0 , the SSYh ~ ;

(ii) The , with (I-1) degrees of freedom.

Hinkelmann & Kempthorne (1994) considered that, , with (I-1) degrees of freedom in order to demonstrate that

where G' is an orthogonal matrix, and thus

These authors observed also that

and thus,

resulting

Using equation (7) in (8), results that

The one by one element multiplication of (9) by r, results in:

with (I-1) degrees of freedom.

This result can be included into the analysis of variance table, originating a more detailed table, as the example in table 1.

In order to test the hypothesis H0 : Yh = 0, which may also be written as H0 : c' t = 0, test F is applied, so that

where MSR is the mean square residual and v corresponds to the degrees of freedom related to the F denominator, that is, in this case, v = I(r-1).

Alternatively, a confidence interval for Yh , with (1-a) 100 % , is given:

where v refers to the degrees of freedom related to

, so that

and where refers to SSR, with I(r-1) degrees of freedom, hence v = I(r-1).

According Scheffé (1959), the hypothesis H0 : t1 = t2 = ... = tI = 0, initially tested by the test F, described in table 1, is equivalent to the hypothesis

H0 : Y1 = Y2 = ... = YI-1 = 0,

(an alternative hypothesis Ha which consists of at least one contrast Yh diferent from zero),and where {Y1 , Y2 , ... , YI-1} is a group of predictable independent linear functions.

Supposing is an unbiased estimator of Yh , given by , where ~ N (mi , ) and independents and the author demonstrates that the probability of all contrast values to fulfill simultaneously the unequal expression written below, is (1- a):

or else,

where the constant S is obtained by the following expression

and .

The method described is the Scheffé method applied to multiple comparisons using intervals for the contrasts. This method is related to the F test, when testing the hypothesis H0 : t1 = t2 = ...=tI = 0, as follows: if is considered significantly different from zero, which implies that Yh = 0 is excluded of the interval (10). Now, if 0 is not considered significantly different from zero, this implies that Yh = 0 is included in the interval (10). Thus, when H0 is rejected by F test, the author infers that at least one contrast should be significantly different from zero. In other words, if (and only if) under the significance level a it is concluded by the F test that the contrasts are not all nulls, then the Scheffé method will show contrast estimates that are significantly different from zero.

The level of significance a is the probability of the global type I error or experimentwise, that is, the probability of at least one contrast be significantly different from zero, from a group of (I-1) contrasts, which means:

a = 1 - (1-a')(I-1)

where a' is the probability of type I error for a particular contrast (or comparisonwise) applied when the null hypothesis is rejected H0 : Yh = 0, for h = 1, ..., (I-1) .

According to Kuehl (1994), the formula

expresses the probability of type I error for a particular contrast (comparisonwise) as a function of the global type I error (experimentwise). For ordinary calculations a' is considered closely a/(I-1).

However, it may occur that none of these contrasts significantly different from zero are of practical interest. An example of a non-practical contrast is the normalized maximal contrast estimate.

Definition and estimation of

Scheffé (1959) demonstrated that SST = SSYmax = , where is the normalized contrast estimate for the effect {ti } maximal, defined as: suppose L is the sampling space for all the contrasts in {ti}, and L" is the group of all normalized contrasts in L, that is, the group of all Y in L, so that the Var() = C s2, and that C =1. Thus, is maximal with Y in L".

If

where wj = 1/r , so that wj > 0 and , and thus

with considering in this case Kij = 1, for all i = 1, ..., I and j = 1, ..., r , which represents the number of times in which i and j appear together, that is, for the present case in study the considered structure is a treatment factor with I levels and completely random r replications, each level i of the considered treatment factor occurring only once with the j replication. Supposing the above considerations the following expression may be written:

The restrictions for being in L", are the following:

Hence, to maximize , variations on {ci} are observed while {} are remained fixed and thus, it is convenient to consider that

and thereafter, this leads to .

In order to maximize , the following conditions are imposed:

Applying the technique Lagrange's multipliers, we have:

calculating , and considering , for all i, leads to the form

.

Isolating the ci element we obtain:

Adding (11) in , thus

resulting

Hence, placing (12) in (11) , leads to

and placing (13) in , leads to

Finally, placing the result obtained in (13) in the formula

substituting into (15) the result of (14) ,

Thus, it is concluded that =SQYmax = SST.

If the interest is to test the null hypothesis H0: Ymax = 0, the rejection area for this test is given by

or by

where a is the significance level that gives the quantity of the F distribution with (I-1) and I(r-1) degrees of freedom.

It is noticed that the way the test was applied to check the null hypothesis H0 : Ymax = 0 is identical to the test applied to verify the hypothesis of no variation due to treatments. If the tested hypothesis in terms of comparisons is not rejected, the resulted implication is in the correspondent comparison in populational terms that it is not significantly different from zero. Then, if the null hypothesis, H0 : Ymax = 0, is not rejected, that is, if Ymax is not significantly different from zero, then no other comparison to be tested will be significantly different from zero, when the Scheffé method is used. If the null hypothesis H0 : Ymax = 0 is rejected, this impplies that the correspondent comparison is significantly different from zero. For the group of tests used in the considered comparisons, the Scheffé method is referred to the a level of significance, which means that the a level of significance is the probability of the global type I error (or experimentwise), correspondent to the totality of tests used.

Definitions and Concepts for mean contrasts with unequal number of replications

In the case of treatments with unequal number of replications, that is, for j = 1, ..., ri , (Steel & Torrie, 1981, cited in Nogueira, 1997), consider that for a given contrast

where and independent of each other, and, ri is the number of replications of treatment i; chi is the coefficient to be attributed to ; and is the mean estimate of treatment i, so that

Moreover, and . Thus, is defined as the contrast between treatment means obtained from data with unequal number of replications.

Supposing that mi = m + ti, where m is a constant and Ti is the effect of treatment i, the following expression is written:

Two contrasts and where h ¹ h' and h = 1, ..., (I-1), are orthogonal when ári chi, ri ch'iñ = 0, that is,

.

Hence, the null hypothesis

can be tested by means of

where and = = n-I, where n is the total number of observations, that is, n = .

Kirk (1968), for average data from unequal number of replications, considers that

so that and the orthogonal condition is as follows:

Thus, the null hypothesis

can be tested by means of

where and .

Winer (1971) and Kirk (1968) showed that the sum of squares due to Yh , for the case of unequal number of replications, is given by the following expression:

when

and, suggested that the SSYmax is obtained when and this leads

to according to Scheffé (1959).

Received October 11, 2002

Accepted October 29, 2003

  • CORRENTE, J.E.; NOGUEIRA, M.C.S.; COSTA, B.M. Contrastes ortogonais na análise do controle de volatilização de amônia em compostagem. Sci. Agric. (Piracicaba Braz.) v.58, p.407-412, 2001.
  • HINKELMANN, K.; KEMPTHORNE, O. Design and analysis of experiments. New York: Wiley-Interscience, 1994. v.1, 495p.
  • KUEHL, R.O. Statistical principles of research design and analysis. Belmont: Duxbury Press. 1994.681p.
  • MEAD, R. The design of experiment: Statistical principles for practical application. Cambridge: Cambridge University Press, 1988. 618p.
  • NOGUEIRA, M.C.S. Estatística experimental aplicada à experimentação agronômica Piracicaba: ESALQ, DME, 1997. 250p.
  • NOGUEIRA, M.C.S.; CORRENTE, J.E. Decomposição da interação tripla significativa utilizando o comando contrast do PROC GLM do SAS, aplicado ao modelo de classificação tripla para dados balanceados. Bragantia v.59, p.109-115, 2000.
  • SCHEFFÉ, H. Analysis of variance. London: John Wiley & Sons, 1959. 467p.
  • STEEL, R.G.D.; TORRIE, J.H. Principles and procedures of statistics: A biometrical approach. 2.ed. New York: McGraw-Hill International Book Company, 1981. 625p.
  • WINER, B.J. Statistical principles in experimental design2.ed. New York: MacGraw-Hill Book Company, 1971. 897p.

Publication Dates

  • Publication in this collection
    13 Feb 2004
  • Date of issue
    Feb 2004

History

  • Accepted
    29 Oct 2003
  • Received
    11 Oct 2002
Escola Superior de Agricultura "Luiz de Queiroz" USP/ESALQ - Scientia Agricola, Av. Pádua Dias, 11, 13418-900 Piracicaba SP Brazil, Phone: +55 19 3429-4401 / 3429-4486 - Piracicaba - SP - Brazil
E-mail: scientia@usp.br