Acessibilidade / Reportar erro

ON HYPOTHESIS TESTS FOR COVARIANCE MATRICES UNDER MULTIVARIATE NORMALITY

Abstract

In this paper we proposed a new statistical test for testing the covariance matrix in one population under multivariate normal assumption. In general, the proposed and the likelihood-ratio tests resulted in larger values of estimated powers than VMAX for bivariate and trivariate cases. VMAX was not sensitive to general changes in the covariance (correlation) structure. The advantage of the new test is that it is based on the comparison of all elements of the postulated covariance matrix under the null hypothesis with their respective maximum likelihood sample estimates and therefore, it does not restrict the information of the covariance matrix into a scalar number such as the determinant or trace, for example. Due to the fact that it is based on the maximum likelihood estimates and the Fisher information matrix, it can be used for data coming from distribution other than the multivariate normal.

covariance matrix; hypothesis testing; Fisher information matrix


1 INTRODUCTION

Statistical tests for covariance matrix has been the issue of many papers particularly in quality control for monitoring processes using several quality characteristics simultaneously, or in calibrations studies (Li & Tsung, 201113 LI Y & TSUNG F. 2011.Detecting and diagnosing covariance matrix changes in multistage processes. IIE Transactions, 43(4): 259-274.x). They are also used to verify the assumption of equal covariance matrices when performing statistical tests to compare the vector means of independent populations, such as in marketing research in the comparison of consumers profiles of different segments, among many other applications (Montgomery, 200815 MONTGOMERY DC. 2008. Introduction to Statistical Quality Control. New York: JohnWiley.).

A well-known statistical test used to verify if the covariance matrix of a population (or process) has a certain specified structure is the generalized variance, |S|, which is based on the determinant of the sample covariance matrix (see Alt, 1985ALT F. 1985.Multivariate quality control. In: Encyclopedia of Statistical Sciences [edited by S.Kotz, & N.L. Johnson], vol. 6, JohnWiley, 110-122.; Djauhari, 200510 DJAUHARI MA 2005. Improved monitoring of multivariate process variability. Journal of Quality Technology, 37(1): 32-39.). Tests based on transformations of |S| are also found in the literature (García-Diaz, 200711 GARCÍA-DIAZ JC. 2007. The effective variance control chart for monitoring the dispersion process with missing data. European Journal of Industrial Engineering, 1(1): 40-55.). Although the idea of summarizing the information of the covariance matrix into a scalar number is appealing, the fact that different matrices can have equal or similar determinants makes the generalized variance to fail in detecting some types of changes in the covariance matrix when they take place. The assumption of a normal distribution as an approximated distribution for |S|, may also restrict its use in certain areas since it just holds for large sample sizes (Djauhari, 2009DJAUHARI MA. 2009. Asymptotic distribution of sample covariance determinant. Matematika, 25(1): 79-85.). In quality control, for example, it is common to use samples of sizes n = 4 or 5, cases for which the normal approximation would not be valid. The exact distribution of |S| for the bivariate case is known and it is related to a chi-square distribution. However, for p > 2 variables the exact distribution of |S| does not belong to any class of known probability distributions (Anderson, 1958ANDERSON TW. 1958. An Introduction to Multivariate Statistical Analysis. New York: JohnWiley & Sons, Inc.; Aparisi et al., 1999APARISI F, JABAIOYES J & CARRION A. 1999. Statistical properties of the |s| multivariate control chart. Communications in Statistics-Theory and Methods, 28(11): 2671-2686.).

The likelihood-ratio test was presented by Anderson (1958)ANDERSON TW. 1958. An Introduction to Multivariate Statistical Analysis. New York: JohnWiley & Sons, Inc., in a general context for multivariate normal populations and by Alt (1985)ALT F. 1985.Multivariate quality control. In: Encyclopedia of Statistical Sciences [edited by S.Kotz, & N.L. Johnson], vol. 6, JohnWiley, 110-122., as a tool to monitor the covariance matrix in quality control. However, the test was based on the chi-square distribution for the test statistic which is valid asymptotically, only. (Costa & Machado 2009COSTA AFB & MACHADO MAG. 2009. A new chart based on sample variances for monitoring the covariance matrix of multivariate processes. International Journal of Advanced Manufacturing Technology, 41(7-8): 770-779.; 2008COSTA AFB&. MACHADO MAG 2008.A new multivariate chart for monitoring the covariance matrix of bivariate processes. Communications in Statistics. Simulation and Computation, 37(7): 1453-1465.), proposed the VMAX based on the maximum of the p standardized sample variances, p being the number of variables or quality characteristics. VMAX was more efficient than the generalized variance to monitor processes presenting better capability at detecting variances shifts. The critical region of the test (or control chart) was obtained by numerical integrationwhich involves non-central chi-square distributions. More recently, Quinino et al. (2012)14 QUININO R, COSTA A & HO LL. 2012. A single statistic for monitoring the covariance matrix of bivariate processes. Quality Engineering, 24(3): 423-430., proposed the VMIX as a tool to monitor the variability structure of two quality characteristics under the assumption of bivariate normal distribution. In their paper it was shown that VMIX was more efficient than the generalized variance control chart and better than VMAX, for some cases.

A more general method was proposed by Sullivan et al. (2007)16 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84., considering the vector of parameters θ which includes the elements of the mean vector and the covariance matrix. The estimated vector of θ is compared with the vector θ specified in the null hypothesis by means of a test statistic which has an approximated chi-square distribution under the null hypothesis. What is interesting in Sullivan et al. (2007)16 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84. test is that there is no restriction on the structure of the covariance matrix postulated on the alternative hypothesis and it can be used for normal and non-normal distributions being capable of testing the whole vector of parameters as well as parts of it. As an illustration for the bivariate case, one could be interested in testing the vector θ containing all 5 parameters (two means, two variances and one covariance); the parameter vector θ containing the two means or two variances only, or containing just the variances and the covariances, among other possibilities. Other statistical tests for covariance matrices can be found in the literature (see Yeh et al., 200618 YEH AB LIN DKJ & MCGRATH RN. 2006. Multivariate control charts for monitoring covariance matrix: a review. Quality of Technology and Quantitative Managament, 3(4): 415-436.; 201217 YEH AB, LI B & WANG K. 2012. Monitoring multivariate process variability with individual observations via penalised likelihood estimation. International Journal of Production Research, 50(22): 6624-6638., among others).

In this paper we discuss the performance of the Sullivan et al. (2007)16 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84., when used to test the covariance matrix. Its performance will be compared to the likelihood-ratio and the VMAX tests by using the exact distribution of the test statistic, as well as the chi-square distribution as an approximation. This paper is organized as follows: in Section 2 the statistical tests are presented; in Section 3 the results of the comparisons of the tests are shown followed by numerical examples in Section 4 and final remarks in Section 5.

2 STATISTICAL TESTS FOR COVARIANCE MATRICES

For all tests presented it this section, it is assumed that X 1, X 2, ..., X n, where X k = (X k1, X k2, ..., X kp)', k = 1, 2, ..., n, is a random sample of size n from a p-variate normal distribution with the mean vector μ = (μ 1, μ 2, ..., μ p)' and covariance matrix Σ, a p × p positive definite matrix, where p is the number of random variables.

2.1 The likelihood-ratio test

The likelihood-ratio is a well-known method of hypothesis testing (Alt, 1985ALT F. 1985.Multivariate quality control. In: Encyclopedia of Statistical Sciences [edited by S.Kotz, & N.L. Johnson], vol. 6, JohnWiley, 110-122.; Casella & Berger, 2002CASELLA G & BERGER RL. 2002. Statistical Inference. California: Duxbury Thomson Learning.), and it can be used to construct a test for the covariance matrix as follows. Under the p-variate normal distribution assumption, the likelihood function is given as

where x k = (x k1, x k2, ..., x kp)' is the observed vector for the kth sample element, k = 1, 2, ..., n, and | . | denotes the determinant of the matrix. Let the null and the alternative hypothesis be defined as H 0: Σ = Σ 0, H 1: ΣΣ 0, μ unknown. Under the null hypothesis the maximum (L o) of the likelihood function (1) is achieved when the vector μ is taken as the sample mean vector and Σ = Σ 0. Under the whole parametric space the maximum (L 1) is achieved when the vector μ is taken as the sample mean vector and the covariance matrix is estimated as , where

Sp×p = (n−1)−1 (x k )(x k )' is the sample covariance matrix. The likelihood-ratio test statistic is then given by

where A = (n − 1)S and tr(.) is the trace operator. The distribution of W under the null hypothesis is asymptotically chi-square and H 0 is rejected for values of W larger than the constant LSC = , which is a value taken from a chi-square distribution with p(p + 1)/2 degrees of freedom, whose area above is equal to the significance level of the test, α, 0 < α < 1. In the quality control terminology α is the false alarms rate. When the vector mean μ is known under the null hypothesis, the test statistic in (3) reduces to

where = (n)−1 (x kμ)(x kμ)'.

2.2 The VMAX test

Costa & Machado (2008)COSTA AFB&. MACHADO MAG 2008.A new multivariate chart for monitoring the covariance matrix of bivariate processes. Communications in Statistics. Simulation and Computation, 37(7): 1453-1465., proposed a new control chart to monitor the covariance matrix Σ for bivariate normal populations when the mean vector μ = (μ 1, μ 2)', was known. Let X = (X 1, X 2)' be the random vector of interest and H 0: Σ = Σ 0, H 1: Σ = Σ 1, be the null and the alternative hypothesis, where

a > 0, b > 0 are constants, , and σ 12 = σ 21 are the variances and the covariances of X 1 and X 2, respectively, under the null hypothesis; a 2 , b 2 and abσ 12, are the variances and the covariances of X 1 and X 2, respectively, under the alternative hypothesis. Under this structure, changes in the variances and covariances values of X 1 and X 2 are allowed but the correlation structure is not affected by the changes.

The VMAX test statistic is defined as

Where

(x k1, x k2), k = 1, 2, ..., n, are the sample values of the random vector X = (X 1, X 2)'. For a given significance level α, 0 < α < 1, the critical region of the test is obtained by solving the equation (6) for CL, considering a = b = 1. The null hypothesis is then rejected for any value of (5) larger than the critical constant CL. In the quality control field the constant CL is the control limit.

where represents a chi-square distribution with n degrees of freedom and noncentrality parameter given by 2/(1 − ρ 2), Γ(.) is the Gamma function, ρ is the correlation coefficient between X 1 and X 2 and Pr[.] denotes the probability of the considered event. Under the alternative hypothesis the expression in (5) is used to compute the power of VMAX test. When the means (μ 1, μ 2) are unknown, they can be replaced by the respective sample means.

Details about the equation (6) can be found in the Appendix of Costa & Machado's (2008COSTA AFB&. MACHADO MAG 2008.A new multivariate chart for monitoring the covariance matrix of bivariate processes. Communications in Statistics. Simulation and Computation, 37(7): 1453-1465.; 2009COSTA AFB & MACHADO MAG. 2009. A new chart based on sample variances for monitoring the covariance matrix of multivariate processes. International Journal of Advanced Manufacturing Technology, 41(7-8): 770-779.) papers. Basically, it was derived taking into consideration the fact that under the bivariate normal distribution assumption, has a chi-square distributionwith n degrees of freedom ( ). It can also be shown that conditioned on xk = (x k1, x k2, ..., x kp)', the random variable defined as

has a non-central chi-square distributionwith n degrees of freedom and non-centrality parameter given as (ρ 2/1 − ρ 2) . The equation (6) follows taking into account that

where p w = α when a = b = 1 (under the null hypothesis), and p w is the power of the test under the alternative hypothesis.

Costa & Machado (2008)COSTA AFB&. MACHADO MAG 2008.A new multivariate chart for monitoring the covariance matrix of bivariate processes. Communications in Statistics. Simulation and Computation, 37(7): 1453-1465., also considered the case where the covariance matrix Σ1 was defined as

In this case, under the alternative hypothesis the correlation coefficient between X 1 and X 2 would change to ρ = σ ij/abσ 1 σ 2, where σ 1, σ 2, are the standard deviations of the variables X 1 and X 2, respectively.

VMAX test was extended by Costa & Machado (2009)COSTA AFB & MACHADO MAG. 2009. A new chart based on sample variances for monitoring the covariance matrix of multivariate processes. International Journal of Advanced Manufacturing Technology, 41(7-8): 770-779., for the multivariate case (p > 2) by using a similar approach as taken to the bivariate normal population. Numerical integration was used to obtain the critical constant CL of the test.

2.3 Sullivan et al. (2007)16 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84. statistical test

Sullivan et al. (2007)16 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84., proposed a statistical test that can be applied for two groups of consecutive independent homogeneous observations with the purpose of comparing the parameters of the distributions of both groups. This test is of importance in quality control since it can be used to compare the process parameters before and after some identified change point in the structure of the sample observations. In a more general context Sullivan et al. (2007)16 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84. test could be applied in the comparison of the vector means and/or the covariance matrices of two independent groups.

Let's suppose that the parameters of the distributions before (A) and after (B) the change point, are described by the parameter vector θ which consists of the elements of the vector mean μ and the covariance matrix Σ. As an illustration, for the bivariate normal distribution the vector θ can be defined as θ = (μ 1, μ 2, , σ 12, )' or θ = (μ 1, μ 2, σ 1, ρ 12, σ 2)', where σ 12 and ρ 12 are the covariance and correlation between the two variables and σ i is the standard deviation of the variable X i , i = 1, 2. Let θ A and θ B be the parameter vectors of the groups of observations A and B respectively, and A and B be their respective maximum likelihood estimators. Let the (k 0 × 1) vector = ( A − B), be the maximum likelihood estimator of the difference vector δ = (θ Aθ B), where k 0 is the number of parameters being compared. As well known, under general regularity conditions, the maximum likelihood estimators have asymptotically a multivariate normal distribution (Casella & Berger, 2002CASELLA G & BERGER RL. 2002. Statistical Inference. California: Duxbury Thomson Learning.). Therefore, under the null hypothesis, asymptotically, the quadratic form

has a chi-square distribution with k 0 degrees of freedom, where Σ δ and are the theoretical and the estimated covariance matrices of , respectively. The null hypothesis corresponds to the non-changing in the vector of parameters, i.e., δ = 0. The covariance matrix is approximated by the Fisher information matrix (Casella & Berger, 2002CASELLA G & BERGER RL. 2002. Statistical Inference. California: Duxbury Thomson Learning.), and it is estimated by using an algebraic expression for the expected value of the Hessian matrix, which is then evaluated at the maximum likelihood estimates of the parameters.

The framework used in Sullivan et al. (2007)16 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84. test makes it possible to build a test for the covariance matrix components as shown next.

2.4 An Adaptation of Sullivan et al. (2007)16 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84. to test the covariance matrix

In this section we will describe an adaptation of the Sullivan et al. (2007)16 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84. test for the situation in which only the covariance matrix of the population is concerned. From all over this paper we will call this test as an adapted Sullivan et al.

Let p = 2, μ 1 and μ 2 pre-specified and θ = (σ 1, ρ 12, σ 2)' be the parameter vector from a bivariate normal distribution. Let be the vector containing the maximum likelihood estimates of the parameters in θ. Let H 0: θ = θ 0 and H 1: θ = θ 1, be the null and the alternative hypothesis which are equivalent to H 0: δ = 0 and H 0: δ 0, where δ = θ θ 0. Let the test statistic be defined as

where = θ 0. Under H 0, asymptotically, the test statistic in (7) has a chi-square distribution with k 0 = 3 degrees of freedom being the null hypothesis rejected for large values of (7), according to the significance level of the test. Under the bivariate normality assumption the covariance matrix is obtained through the Fisher information matrix (see Beyer, 1978BEYER WH. 1978. StandardMathematical Tables.West Palm Beach: CRC.; Sullivan et al., 200716 SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84.), which is given by

Where

being the values of σ 1, σ 2, ρ 12, , , σ 12 as specified in H 0, where , are the variances of X 1 and X 2 variables, respectively.

The adapted Sullivan et al. test for the covariance matrix can also be extended for p > 2 variables. Let X be a (p × 1) random vector with multivariate normal distribution, with mean vector μ ( θ ) and covariance matrix Σ( θ ), where μ ( θ ) = (μ 1(θ), μ 2(θ), ..., μ p(θ))'. In this case the typical (i,j) element of the Fisher information matrix is given by

Where

tr(.) is the trace operator and Σi, j is the (i,j) element of the covariance matrix Σ.

A possible advantage of the adapted Sullivan et al. test regarding to the VMAX and the likelihood-ratio tests comes from the fact that all the parameters of the covariance matrix postulated in the null hypothesis are compared in its test statistic, one by one, with their respective maximum likelihood estimates, so that there is no reduction of the covariance matrix information into a single measure such as the determinant, or others. It also allows the correlation coefficients under the null and the alternative hypothesis to be completely different being therefore, a more general test. The test can also be used for non-normal multivariate data since it is based on the maximum likelihood estimates and the Fisher information matrix. It also can be used to test shifts in the variances of the variables only.

3 COMPARISON OF THE STATISTICAL TESTS

The power of the adapted Sullivan et al., the VMAX and the likelihood-ratio tests were estimated by using Monte Carlo simulation under the multivariate normal distribution assumption. The null hypothesis is H 0: Σ = Σ 0, where Σ 0 is pre-specified. For the adapted Sullivan et al. test as well as for the likelihood-ratio, the exact and the asymptotic chi-square distributions of the test statistics were also consider to determine the respective rejection regions of H 0.

In this paper, in the implementation of the VMAX test for the bivariate normal distribution, the Gauss-Legendre and the Pegasus method were used to sort out the numeric integration and to determine the zeros of the function given in (6) in Section 2.2 (see Davis & Rabinowitz, 1984DAVIS PJ & RABINOWITZ P. 1984. Methods of Numerical Integration. Orlando: Academic Press., for details). Due to the fact that the values of CL for p = 2 obtained by using numeric integration were similar to those obtained by using simulation, the value of CL for p = 3 was obtained by simulation only.

In order to determine the critical region of the tests by the exact distribution (except for VMAX, p = 2), M = 50.000 random samples of fixed sizes n = 5, 10, 25, 50 and 100 were generated from a p-variate normal distributionwith mean vector μ and covariance matrix Σ 0 as stipulated in the null hypothesis. For each sample the respective test statistic was calculated and the empirical distribution was built based on the M obtained sample values. By using this distribution the critical region of the test was determined according to the pre-specified significance level α = 0.05 for the tests. After the determination of the critical region, the respective type I error and power of the tests were estimated as follows. First, 10.000 random samples were generated from a multivariate normal distribution under the null hypothesis considering Σ = Σ 0. For each sample and each test, the null hypothesis was tested and the proportion of rejection was calculated given an estimate of type I error of the test. Afterwards, 10.000 samples were generated from a multivariate normal distribution under the alternative hypothesis, H 1: Σ = Σ 1, considering different Σ 1 settings, Σ 1 ≠ Σ 0. Again for each sample and each test, the null hypothesis was tested and the proportion of rejection was computed given an estimate of the power of the test. This procedure was repeated k = 50 times under the null and the alternative hypothesis. At the end, average estimates of the type I error and the power were obtained by taking the average over all 50 repetitions for each test, respectively. Without loss of generality this study was performed considering one particular structure for the matrix Σ 0 for p = 2, 3 and assuming μ = 0 under H 0 and H 1.

The simulated models under the alternative hypothesis (see Tables 1 and 2), were chosen to make it possible to evaluate whether the traditional likelihood-ratio, the adapted Sullivan et al. and the VMAX tests were able to detect small and large differences from Σ 0. In models 1-6 for p = 2, and models 1, 6, 7 for p = 3, changes in the variances and in the correlation structure were allowed. Models 7-10 for p = 2, 4 and 8 for p = 3, were more restrict since the variances were allowed to change but the correlation (in model 4) and the covariance (in model 10) structures remained the same as under the null hypothesis. On the contrary, themodels 2, 3 and 5 for p = 3, have the same variances as H 0 but different correlation and covariance structures.

Table 1
Simulated models − p = 2 variables.
Table 2
Simulated models − p = 3 variables.

Tables 1 and 2 also present the ratio of the covariance matrices traces and determinants under H 0 and H 1 as well as the ratio of the maximum of the variances under the alternative and null simulated models. The ratio of the determinants for the bivariate models ranged from 1 (cases 2 and 3) to 2.67 (case 5), which corresponds to an increase of 0 to 166,67% in the determinant under H 1 compared to the determinant under H 0. For the trivariate models the ratio ranged from 1.13 to 9.73 (case 8).

3.1 Comparison by using the exact distribution of the tests statistics

The average proportions of rejection of H 0 for each test discussed in this paper are shown in Tables 3 and 4, for p = 2, under the null and the alternative hypothesis models. When the exact distribution was used to build the critical region of the tests, the estimates of the type I error were 0.05 as expected. Considering only the exact distributions of the test statistics, the adapted Sullivan et al. test (SE) performed better than the likelihood-ratio (LRE) for all sample sizes except in cases 2, 4 and 5, for n = 50 and 100, cases which both tests were similar. Compared to VMAX the SE test presented larger power values in cases 1-6 for all sample sizes. Both tests had similar performance for cases 7-10 with some advantage to VMAX for smaller sample sizes.

Table 3
Type I error and power estimates of the tests − p = 2, n = 5, 10 and 25.
Table 4
Type I error and power estimates of the tests − p = 2, n = 50 and 100.

It is important to point out the outstanding superiority of the SE test in case 2 when compared to VMAX even for smaller samples sizes (SE average power > 0.58; VMAX average power around 0.05 for all samples sizes). This is an important point in favour of SE since the correlation structure of Σ 1 and Σ 0 are completely different (opposite covariance structure), although the variances of the two random variables are the same as well as the generalized variance. VMAX presented also poor performance for situations where the matrix Σ 1 was diagonal with variances similar to the matrix Σ 0 (cases 1, 3 and 4). In these cases, the maximum power values were achieved at n = 100 being equal to 0.05 (case 1), 0.18 (case 3) and 0.41 (case 4), values much lower than the obtained results from SE and LRE. Just as an example, for n = 25 the power values from SE were above 0.7 (0.77; 0.79; 0.82) and above 0.6 from LRE (0.63; 0.67; 0.72). Even for case 5 which the increase of the generalized variance under the alternative hypothesis was 166,67% compared to the null hypothesis, the VMAX did not performed well for samples of size n = 5 (power = 0.29) reaching a power equal 0.74 for n = 25 (for this particular value of n, the power values of SE and LRE were 0.96 and 0.93, respectively). That is not very surprising since the test statistic of VMAX is based only upon the maximum ratio between the sample and the population variances postulated in the null hypothesis.

When the matrix Σ 1 is diagonal with variances larger than Σ 0 (cases 5 and 6), the power of VMAX increases for all sample sizes but the values are still lower than SE and LRE, except for n = 50 and 100, where they all achieved similar power (around 1). For the models 7-10 which the correlation structures are the same under the null and the alternative hypothesis, the performance of the likelihood-ratio test decreased and its power estimates were always lower than SE and VMAX, except in cases 9 and 10 for n > 50, which all tests presented similar power values (around 1). On the other hand, the performance of VMAX increased but the SE was still very competitive.

Tables 5 and 6 present the estimates of type I error and power for p = 3. Considering only the exact distributions to compare the tests (they all reach 0.05 type I error under the null hypothesis), it can be seen that the VMAX did not perform well for cases 2, 3 and 5 (estimated power 0.05 for all sample sizes). These results are expected since the test statistic of VMAX just takes into account changes in variances of the variables. For these models the adapted Sullivan et al. was more efficient than themaximum likelihood ratio test (SE: power > 0.30; LRE power > 0.16, for n > 10), particularly for case 5 (power = 0.38 for n = 5 and > 0.70 for n > 10). That was due to the fact that Σ 1 and Σ 0 have equal diagonal values although completely different covariance structures.

Table 5
Type I error and power estimates of the tests −p = 3, n = 5, 10 and 25.
Table 6
Type I error and power estimates of the tests −p = 3, n = 50 and 100.

The performance of VMAX improved for the cases which the variances changed as well as the correlation structure (cases 1, 6 and 7), particularly for case 1 whose Σ 1 is a diagonal matrix and the variance of the first variable doubled the respective value giving in Σ 0. The adapted Sullivan et al. and the maximum likelihood ratio were still more efficient than VMAX in all cases (power: SE > 0.78; LRE: > 0.57; VMAX: > 0.18, for n > 10). For larger sample sizes (n = 100), the VMAX estimated power for case 1 was equal to SE and LRE values and for n = 50 they were similar (0.93 for VMAX and 1 for the other two tests). In case 6 which corresponds to 1.2 increase of variance, VMAX achieved its maximum estimated power when n = 100 (power = 0.55), as long as LRE and SE reached power values close to 1 when n = 25. SE test was the most efficient to detect the increase of variance 1.5 (case 7) since the power estimates are larger or equal 0.82 for n > 10. Both LRE and SE reached power values closer or equal 1.0 for n > 25. The best values for VMAX occurred for n = 50 (power = 0.61) and for n = 100 (power = 0.86).

Although n = 5 is a small sample size to test hypothesis about the covariance matrix when p = 3, the adapted Sullivan et al. performed well for the cases 1, 6 and 7, (power: 0.73, 0.43 and 0.46, respectively), being the estimated power about 3 times larger than VMAX and 2.5 larger than LRE values.

The efficiency of VMAX in detecting the shifts of the covariance matrix improved a lot for case 8 (power: > 0.66; = 1.0 for n > 25), case which the covariance structure remains the same as H 0 but the variance of the second variable (see Σ 1) is 4 times larger than the value giving in Σ 0. For this particular case, VMAX power estimates were closer to the SE and LRE values.

Comparing the results of cases 4 and 6 (both have HVR = 1.2, see Table 2), it is possible to see that the different correlation structure affected the SE and LRE efficiencies since the tests were more powerful to detect the shits in case 6 than in the case 4. This result is due to the fact that the values of SE and LRE test statistics take into account all the variances and covariances values from the Σ 0 and Σ 1 matrices, and the matrix Σ 1 of case 4 is more similar to Σ 0 than the covariance matrix of case 6. Both SE and LRE did not perform well in case 4 and VMAX had power estimates larger than these two tests (maximum power estimates: VMAX = 0.46; SE = 0.37; LRE = 0.31). It is important to point out that cases 4 and 8 belong to the framework which VMAX was proposed by (Costa & Machado 2008COSTA AFB&. MACHADO MAG 2008.A new multivariate chart for monitoring the covariance matrix of bivariate processes. Communications in Statistics. Simulation and Computation, 37(7): 1453-1465.; 2009COSTA AFB & MACHADO MAG. 2009. A new chart based on sample variances for monitoring the covariance matrix of multivariate processes. International Journal of Advanced Manufacturing Technology, 41(7-8): 770-779.) since only increases of variances were allowed being the correlation, or the covariance, structures the same as in H 0. Therefore, the results suggest that under this framework the performance of VMAX is similar to the performance of the adapted Sullivan et al. test.

As far as the maximum likelihood-ratio test is concerned its performance was inferior than the adapted Sullivan et al. in all cases for n < 25 and similar in the majority of cases for n > 50.

Considering all the results observed for p = 2 and 3, we can conclude that for the evaluated cases, when the exact distribution was used to build the critical region of the tests, the adapted Sullivan et al. was more efficient than the maximum likelihood-ratio, both were more efficient than VMAX with few exceptions. The exceptions were related to the cases whose variances changed but the correlation (or the covariance) structure under the alternative hypothesis was kept the same as in H 0, situations that fit well in the framework of VMAX test. One of the main goals in quality control is to detect variances increases and in this context the results showed that VMAX performed well.

3.2 Discussion of the results from the asymptotic distribution of the test statistic

As far as the chi-square approximation is concerned, for the bivariate cases and small samples sizes (n = 5 and 10) the maximum likelihood-ratio (LR) and the adapted Sullivan et al. (SC) tests resulted in estimates of type I error larger than the pre-specified nominal significance level of 5% (estimates were 19 and 10% respectively, for LR; 12 and 8% for SC). Therefore, their power estimates could not be compared with the results from the exact distributions of the tests and were not presented in Table 3. For samples of sizes n = 25 and 50 the estimates were more reasonable (around 6 to 7%) and equal 5% for n = 100. This is perfectly explained by the fact that the chi-square approximation is valid only for larger sample sizes. For n > 25 the average power estimates from the chi-square distribution are shown in Tables 3 and 4. The results were similar to the estimates obtained by the exact distributions.

Similar pattern was found for p = 3. For small samples (n = 5,10), the estimates of the type I error based on the chi-square distribution were also much larger than the pre-specified 5% significance level. For the maximum likelihood ratio test (LR), the estimates were equal 37% for n = 5, 14% for n = 10, 8% for n = 25, being the approximation to 5% achieved only for samples of sizes 50 and 100 (estimates around 5 to 6%). For the adapted Sullivan et al. test (SC), the estimates were 22, 14 and 9% for n = 5, 10 and 25, respectively, achieving 5% only for larger samples (estimates around 7 to 5% for n = 50 and 100). For n > 50 the average power estimates from the chi-square distribution are shown in Tables 5 and 6 for the LR an SC tests and the results were similar to the estimates obtained by the exact distribution.

Therefore, by the results presented for p = 2, 3, it is clear that the chi-square approximation did not work well for small sample sizes being not recommended in these situations. The exact distribution of the test statistics obtained by Monte Carlo simulation should be used for small sample sizes.

4 EXAMPLES OF APPLICATION

In this section two examples of application are presented. The first is a general situation and the second is a particularly case of quality control. In all examples the adapted Sullivan et al. test described in Section 2.4 is compared with the maximum likelihood-ratio and VMAX tests.

Example 1. The transpiration data of 20 healthy females was given on Table 5.1 from Johnson & Wichern (2002)12 JOHNSON RA & WICHERN WD. 2002.Applied Multivariate Statistical Analysis. New Jersey: Prentice Hall.. The variables of interest were: rate of sweat (X 1), quantity of sodium (X 2) and quantity of potassium (X 3). Suppose the researcher wants to test the null hypothesis H 0 : Σ = Σ 0 against H 1: Σ ≠ Σ 0, where

By the data from Johnson & Wichern (2002)12 JOHNSON RA & WICHERN WD. 2002.Applied Multivariate Statistical Analysis. New Jersey: Prentice Hall., one can find the maximum likelihood estimate of the population covariance matrix as

The parameter vector under the null hypothesis, the respective maximum likelihood vector estimate and the difference between these two vectors are given by

θ0 = (σ 1, ρ 12, σ 2, ρ 13, ρ 23, σ 3)' = (1.732, 0.449, 14.142, 0.597, 0.219, 1.933)'

= (1.654, 0.417, 13.776, −0.559, −0.232, 1.856)'

= θ 0 = (−0.078, −0.032, −0.366, −1.156, −0.451, −0.076)'.

The results of the three tests discussed in this paper are shown in Table 7. All tests except VMAX, rejected the null hypothesis. From the structure of Σ 0 and matrices, it is easy to notice that VMAX was not sensitive to the covariances deviations from these two matrices. Since the ratios between the variances of Σ 0 and are close to 1, the result of VMAX is consistent with its theoretical formulation.

Table 7
Test statistic values and critical limits - Example 1.

Example 2. In Aparisi et al. (1999)APARISI F, JABAIOYES J & CARRION A. 1999. Statistical properties of the |s| multivariate control chart. Communications in Statistics-Theory and Methods, 28(11): 2671-2686. an example was given where three quality characteristics were measure in a part (see Fig. 1): the distance between centers, X 1 (cm), and the diameters X 2 (cm) and X 3 (cm). According to the authors the vector mean and the covariance matrix when the process was under control were known and given by

Figure 1
Part of the example 2. Source: See [Aparisi et. al, 1999].

To illustrate the statistical tests discussed in this paper we perform the following: (i) initially 5 samples of size n = 5, 10, 25, were simulated from a multivariate normal distribution with parameters μ 0, Σ 0; (2) the remaining 5 samples were generated from a multivariate normal distribution with mean vector μ 0 but different covariance matrices as follows: samples 6 and 7 from matrix Σ 1, samples 8 and 9 from Σ 2, sample 10 from Σ 3, where

The highest ratio between the variances under the alternative and the null hypothesiswere 1.7 for Σ 2 and 3.2 for Σ 3. The variances in Σ 1 are smaller than the variances in Σ 0. For each sample the null hypothesis H 0: Σ = Σ 0, was tested against H 1: Σ ≠ Σ 0, by using themaximum likelihoodratio, the adapted Sullivan et al. and VMAX tests at α = 0.005 significance level. The results are given in Tables 8 and 9 with the critical limits of each test considering the exact distribution as well as the asymptotic chi-square for the maximum likelihood-ratio and the adapted Sullivan et al. tests. For sample sizes of n = 5, the sample covariance matrices are also shown. By the exact distribution it can be seen that for the first 5 samples the null hypothesis was not rejected by any of the statistical tests and all n. For samples 6, 7, 8 and 9, H 0 was rejected by the adapted Sullivan et al. (SE) test for all sample sizes; by the maximum likelihood-ratio (LRE) it was rejected for n = 10 and 25, except for sample 8 whose rejection occurred for n = 25, only. On the other hand, VMAX rejected the null hypothesis only for sample 9 when n = 25 and did not rejected for samples 6, 7 and 8 for any sample size.

The null hypothesis was rejected for sample 10 by all statistical tests as expected considering the fact that this sample came from a multivariate normal distribution whose variance of X 1 is 3.2 times larger than the respective variance given in Σ 0.

Table 8
Results of the Example 2 - samples sizes n = 10 and n = 25 - α = 0.005.
Table 9
Results of the Example 2 - sample size n = 5.

Considering the asymptotic chi-square distribution the null hypothesis was rejected for samples 6-10 by likelihood-ratio (LR) as well as by the adapted Sullivan et al. (SC) for all sample sizes. However, it is important to remind that as shown in Section 3.2, for p = 3 and n = 5, 10, 25, the type I error of these two tests for small samples is inflated being larger than the pre-specified significance level, particularly for n = 5, 10.

It is interesting to point out that the same parameters μ 0 and Σ 0 were used by Costa & Machado (2009)COSTA AFB & MACHADO MAG. 2009. A new chart based on sample variances for monitoring the covariance matrix of multivariate processes. International Journal of Advanced Manufacturing Technology, 41(7-8): 770-779., in the illustrative example given on Section 5 of their paper. The matrix Σ 1, which was used in our example, is the covariance matrix of the sample number 4 presented in Costa & Machado's example (2009COSTA AFB & MACHADO MAG. 2009. A new chart based on sample variances for monitoring the covariance matrix of multivariate processes. International Journal of Advanced Manufacturing Technology, 41(7-8): 770-779.), and it was considered similar to the matrix Σ 0 by the VMAX test in their paper.

5 FINAL REMARKS

Considering the exact distribution of the tests statistics, in the majority of the simulated cases, the adapted Sullivan et al. and the likelihood-ratio tests resulted in larger values of estimated power than VMAX for bivariate and trivariate normal distributions. VMAX was not sensitive to general changes in the covariance (or correlation) structure. Its estimated powers increased when the variances were larger under the alternative than the null hypothesis and the covariance (or correlation) structure remained similar under H 0 and H 1. This result is no very surprising since VMAX was built with the main purpose of detecting variances shifts and it has been proved to be efficient on this type of situation (see Costa & Machado, 2008COSTA AFB&. MACHADO MAG 2008.A new multivariate chart for monitoring the covariance matrix of bivariate processes. Communications in Statistics. Simulation and Computation, 37(7): 1453-1465., 2009COSTA AFB & MACHADO MAG. 2009. A new chart based on sample variances for monitoring the covariance matrix of multivariate processes. International Journal of Advanced Manufacturing Technology, 41(7-8): 770-779.; Quinino et al., 201214 QUININO R, COSTA A & HO LL. 2012. A single statistic for monitoring the covariance matrix of bivariate processes. Quality Engineering, 24(3): 423-430.). It still can be used for this purpose but it is important to point out that the adapted Sullivan et al. was also competitive in these particular cases.

As far as the efficiency of the likelihood-ratio test is concerned, the results showed that the adapted Sullivan et al. test was more efficient in most of the cases, except for larger samples were their performance were similar. In fact for p = 3 the adapted Sullivan et al. was always superior. In some cases for p = 2, VMAX was more efficient than the likelihood-ratio test.

The results presented in this paper had indicated that the adapted Sullivan et al. is a good alternative for testing hypothesis about the structure of the covariance matrix. It is a very flexible test and it can be applied in more general situations to detect differences in the variances as well as in other parameters of the covariance matrix. Also the fact that the adapted Sullivan et al. test depends upon the calculation of Fisher Information matrix, makes it possible to be used for multivariate normal and non-normal populations.

For small samples sizes the chi-square approximation did not performed well being not recommended since the estimates of type I error were larger than the pre-specified significance level for the adapted Sullivan et al. and for the likelihood-ratio tests. However, this fact does not make the use of the adapted Sullivan et al. or the likelihood-ratio tests restrictive since the exact distribution of the respective test statistics under the null hypothesis are easily obtained by Monte Carlo simulation. The same is true for the VMAX test for p > 2 since in these situations the use of numerical integration to find the critical region of the test is more complex and Monte Carlo simulation can be very helpful.

ACKNOWLEDGMENTS

This work was partially supported by the Brazilian Institutions CNPq and CAPES (PROCADProject). The authors wish to thank the anonymous referees for the suggestions that led to the improvement of the paper.

  • 1
    ALT F. 1985.Multivariate quality control. In: Encyclopedia of Statistical Sciences [edited by S.Kotz, & N.L. Johnson], vol. 6, JohnWiley, 110-122.
  • 2
    ANDERSON TW. 1958. An Introduction to Multivariate Statistical Analysis. New York: JohnWiley & Sons, Inc.
  • 3
    APARISI F, JABAIOYES J & CARRION A. 1999. Statistical properties of the |s| multivariate control chart. Communications in Statistics-Theory and Methods, 28(11): 2671-2686.
  • 4
    BEYER WH. 1978. StandardMathematical Tables.West Palm Beach: CRC.
  • 5
    CASELLA G & BERGER RL. 2002. Statistical Inference. California: Duxbury Thomson Learning.
  • 6
    COSTA AFB & MACHADO MAG. 2009. A new chart based on sample variances for monitoring the covariance matrix of multivariate processes. International Journal of Advanced Manufacturing Technology, 41(7-8): 770-779.
  • 7
    COSTA AFB&. MACHADO MAG 2008.A new multivariate chart for monitoring the covariance matrix of bivariate processes. Communications in Statistics. Simulation and Computation, 37(7): 1453-1465.
  • 8
    DAVIS PJ & RABINOWITZ P. 1984. Methods of Numerical Integration. Orlando: Academic Press.
  • 9
    DJAUHARI MA. 2009. Asymptotic distribution of sample covariance determinant. Matematika, 25(1): 79-85.
  • 10
    DJAUHARI MA 2005. Improved monitoring of multivariate process variability. Journal of Quality Technology, 37(1): 32-39.
  • 11
    GARCÍA-DIAZ JC. 2007. The effective variance control chart for monitoring the dispersion process with missing data. European Journal of Industrial Engineering, 1(1): 40-55.
  • 12
    JOHNSON RA & WICHERN WD. 2002.Applied Multivariate Statistical Analysis. New Jersey: Prentice Hall.
  • 13
    LI Y & TSUNG F. 2011.Detecting and diagnosing covariance matrix changes in multistage processes. IIE Transactions, 43(4): 259-274.x
  • 14
    QUININO R, COSTA A & HO LL. 2012. A single statistic for monitoring the covariance matrix of bivariate processes. Quality Engineering, 24(3): 423-430.
  • 15
    MONTGOMERY DC. 2008. Introduction to Statistical Quality Control. New York: JohnWiley.
  • 16
    SULLIVAN JH, STOUMBOS ZG, MASON RL & YOUNG JC. 2007. Step-down analysis for change in the covariance matrix and other parameters. Journal of Quality Technology, 39(1): 66-84.
  • 17
    YEH AB, LI B & WANG K. 2012. Monitoring multivariate process variability with individual observations via penalised likelihood estimation. International Journal of Production Research, 50(22): 6624-6638.
  • 18
    YEH AB LIN DKJ & MCGRATH RN. 2006. Multivariate control charts for monitoring covariance matrix: a review. Quality of Technology and Quantitative Managament, 3(4): 415-436.

Publication Dates

  • Publication in this collection
    Jan-Apr 2015

History

  • Received
    11 June 2012
  • Accepted
    06 Apr 2014
Sociedade Brasileira de Pesquisa Operacional Rua Mayrink Veiga, 32 - sala 601 - Centro, 20090-050 Rio de Janeiro RJ - Brasil, Tel.: +55 21 2263-0499, Fax: +55 21 2263-0501 - Rio de Janeiro - RJ - Brazil
E-mail: sobrapo@sobrapo.org.br