Acessibilidade / Reportar erro

Linear combination of chi-squares for multinomial process monitoring

Combinação linear de qui-quadrados para monitoramento de processo multinomial

Abstract:

Marcucci (1985) proposed a chi square goodness of fit statistic based generalized p-chart for multinomial process monitoring. A chi square distribution quantile was considered as a control chart limit. A weighted chi square goodness of fit statistic-based control chart is proposed for multinomial process monitoring in this paper, where more important weights are advocated to poor quality categories. The statistic distribution is approximated by a well-known linear combination of chi squares distribution. The approximation is assessed through a simulation, an extreme percentile of the approximated distribution is used as an upper control chart limit and a comparison is carried out with a chi square goodness of fit statistic-based control chart. The average run length is used as a benchmark and the comparison is performed using simulations considering two process shifts scenarios. Under some restrictions, the weighted statistic-based control chart allows an earlier detection of process shift in case of deterioration and postpones out of control signals in case of improvement. This benefit is clearer when the process is improved by a decrease in the poor quality probability category and an increase in the best quality category probability.

Keywords:
Multinomial Process; Generalized p-Chart; Distribution Approximation; Simulation

Resumo:

Marcucci (1985) propôs um gráfico generalizado baseado na estatística de bondade de ajuste do qui-quadrado para monitoramento de processos multinominais. Um quantil de distribuição do qui quadrado foi considerado como limite do gráfico de controle. Um gráfico de controle baseado em estatística de qualidade de ajuste chi-quadrado ponderado é proposto para monitoramento de processo multinominal neste artigo, onde pesos mais importantes são defendidos para categorias de baixa qualidade. A distribuição estatística é aproximada por uma combinação linear bem conhecida de distribuição de qui-quadrados. A aproximação é avaliada por meio de uma simulação, um percentil extremo da distribuição aproximada é usado como um limite superior do gráfico de controle e uma comparação é realizada com um gráfico de controle baseado na estatística de qualidade do qui quadrado do ajuste. O comprimento médio de execução é usado como referência e a comparação é realizada por meio de simulações considerando dois cenários de deslocamento de processo. Sob algumas restrições, o gráfico de controle baseado em estatística ponderada permite uma detecção precoce de mudança de processo em caso de deterioração e adia sinais fora de controle em caso de melhoria. Esse benefício fica mais claro quando o processo é aprimorado por uma diminuição na categoria de probabilidade de qualidade ruim e um aumento na probabilidade da categoria de melhor qualidade.

Palavras-chave:
Processo Multinominal; Gráfico generalizado; Aproximação de distribuição; Simulação

1 Introduction

Control charts for attributes are used when focus is on the classification of product units into categories rather than quality characteristic measurements. The inspected units are generally classified into conforming or nonconforming units; however, different criteria of nonconforming classification could be used following defect seriousness. Such processes are called multinomial processes.

Raz & Wang (1990)Raz, T., & Wang, J. (1990). On the construction of control charts using linguistic variables. International Journal of Production Research, 28(3), 477-487. http://dx.doi.org/10.1080/00207549008942731.
http://dx.doi.org/10.1080/00207549008942...
and Taleb & Limam (2002)Taleb, H., & Limam, M. (2002). On fuzzy and probabilistic control charts. International Journal of Production Research, 40(12), 2849-2863. http://dx.doi.org/10.1080/00207540210137602.
http://dx.doi.org/10.1080/00207540210137...
have introduced and discussed the construction of univariate control charts for multinomial process monitoring using both probability and fuzzy theory. Moreover, (Taleb et al., 2006Taleb, H., Limam, M., & Hirota, H. (2006). Multivariate fuzzy multinomial control charts. Quality Technology & Quantitative Management, 3(4), 437-453. http://dx.doi.org/10.1080/16843703.2006.11673125.
http://dx.doi.org/10.1080/16843703.2006....
) discussed methods based on multivariate fuzzy multinomial control charts. A review of multinomial and multi attribute quality control charts is given by (Topalidou & Psarakis, 2009Topalidou, E., & Psarakis, S. (2009). Review of multinomial and multi-attribute quality control charts. Quality and Reliability Engineering International, 25(7), 773-804. http://dx.doi.org/10.1002/qre.999.
http://dx.doi.org/10.1002/qre.999...
).

A probabilistic approach is considered in this paper as no classification ambiguity is considered. It is based on the work of (Duncan, 1950Duncan, A. J. (1950). A Chi-Square chart for controlling a set of percentages. Industrial Quality Control., 7, 11-15.) who developed a chi square chart for controlling a set of percentages and (Marcucci, 1985Marcucci, M. (1985). Monitoring multinomial processes. Journal of Quality Technology, 17(2), 86-91. http://dx.doi.org/10.1080/00224065.1985.11978941.
http://dx.doi.org/10.1080/00224065.1985....
) who introduced a one-sided generalized p-chart for multinomial process monitoring. (Cozzucoli, 2009Cozzucoli, P. C. (2009). Process monitoring with multivariate p-Control chart. International Journal of Quality, Statistics, and Reliability, 2009, 1-11. http://dx.doi.org/10.1155/2009/707583.
http://dx.doi.org/10.1155/2009/707583...
) used an overall defectiveness index based two-sided multivariate p-chart to distinguish between process improvement and process deterioration by points less than LCL and greater than UCL, respectively. The overall defectiveness index is a weighted proportions of defects statistic where weights are between 0 and 1. It was noticed that the proposed control chart performance depends on weight values. (Yashchin, 2012Yashchin, E. (2012). On detection of changes in categorical data. Quality Technology & Quantitative Management, 9(1), 79-96. http://dx.doi.org/10.1080/16843703.2012.11673279.
http://dx.doi.org/10.1080/16843703.2012....
) derived changepoint detection schemes based on generalized likelihood ratio tests. The considered schemes are based on predefined acceptable and unacceptable regions for the monitored parameters. (Li et al., 2014aLi, J., Tsung, F., & Zou, C. (2014a). A simple categorical chart for detecting location shifts with ordinal information. International Journal of Production Research, 52(2), 550-562. http://dx.doi.org/10.1080/00207543.2013.838329.
http://dx.doi.org/10.1080/00207543.2013....
) considered that the ordinal attribute levels of a categorical variable are formed by some thresholds of the latent continuous variable. An example where flash on the head of toothbrush is classified into four categories according to its length is considered. A simple ordinal categorical control chart to detect location shift in factor’s latent continuous variable distribution was then proposed. (Li et al., 2014bLi, J., Tsung, F., & Zou, C. (2014b). Multivariate binomial/multinomial control chart. IIE Transactions, 46(5), 526-542. http://dx.doi.org/10.1080/0740817X.2013.849830.
http://dx.doi.org/10.1080/0740817X.2013....
) proposed a multivariate binomial and multinomial control chart. An EWMA-type control chart is implemented using log-linear models for characterizing the relationship among categorical factors. (Weiss, 2018Weiss, C. H. (2018). Control Charts for Time-Dependent Categorical Processes. In: S. Knoth & W. Schmid (Eds.) Frontiers in statistical quality control 12 (pp. 211-231). Springer International Publishing.) proposed a control chart for serially dependent categorical processes monitoring. (Perry, 2019Perry, M. B. (2019). An EWMA control chart for categorical processes with applications to social network monitoring. Journal of Quality Technology. http://dx.doi.org/10.1080/00224065.2019.1571343.
https://doi.org/10.1080/00224065.2019.15...
) used an EWMA control chart for categorical processes where weighted category counts were used as a control chart statistic. The weight assigned to each category is the inverse of its expected count. (Marcucci, 1985) control chart statistic depends already on the inverse of the expected count. The aim of this work is to propose an improved (Marcucci, 1985Marcucci, M. (1985). Monitoring multinomial processes. Journal of Quality Technology, 17(2), 86-91. http://dx.doi.org/10.1080/00224065.1985.11978941.
http://dx.doi.org/10.1080/00224065.1985....
) generalized p-chart rather than improvement detection. The proposed control chart is based on a weighted chi-square goodness of fit statistic and aims at earlier detection of process deterioration and delay improvement detection. A weighted goodness of fit statistic called Q statistic is presented in section 2, as the distribution of the Q statistic is unknown, the distribution approximation issue is discussed in section 3 and the approximated distribution is assessed in section 4. The Q statistic control chart is presented through an illustrative example in section 5, finally its performance is compared to Marcucci (1985)Marcucci, M. (1985). Monitoring multinomial processes. Journal of Quality Technology, 17(2), 86-91. http://dx.doi.org/10.1080/00224065.1985.11978941.
http://dx.doi.org/10.1080/00224065.1985....
control chart in section 6.

2 The Weighted Goodness of Fit Statistic

Multinomial process output is classified into q categories. Let p1, p2, ..., pq be the probabilities of classification into the exhaustive and mutually exclusive categories with j=1qpj=1 and pj known ∀j = 1, ..., q, where conform units are classified into the first category and worst quality units are classified into the qth category. Xj is the observed number of output units classified into the jth category. If m samples are selected randomly without replacement, then Xij is the observed number of output units in the ith sample and classified into the jth category with i = 1, ..., m and let ni be the sample sizes, it is common to suppose that all samples have the same sample size n.

Marcucci (1985)Marcucci, M. (1985). Monitoring multinomial processes. Journal of Quality Technology, 17(2), 86-91. http://dx.doi.org/10.1080/00224065.1985.11978941.
http://dx.doi.org/10.1080/00224065.1985....
proposed to use the following chi square (Equation 1) goodness of fit statistic for multinomial process monitoring:

Y i 2 = i = 1 q X i j n i p j 2 n i p j (1)

where nipj is the expected count at the ith sample and jth category. The proposed statistic is a measure of overall deviations from expected counts and has a known distribution, hence, a quantile of the distribution is used as an upper control limit for the χ2 control chart. Moreover, the goodness of fit statistic does not take into consideration whether poor quality categories expected count decrease or increase and all deviations have similar impact. The control chart statistic does not distinguish between quality improvement and quality deterioration.

For that purpose, different weights are attributed to the categories considering that deviations from expected counts in categories with poor process output quality have negative impact on the overall process output quality. Then, more important weights are advocated to poor quality categories. Let w1, w2, ..., wq be the categories constant weights. The overall deviations from expected counts assessment is given by the following Q statistic (Equation 2):

Q = j = 1 q w j X j n p j 2 n p j (2)

if several samples are considered, then the Q statistic for the ith sample is given by Equation 3:

Q i = j = 1 q w j X i j n i p j 2 n i p j (3)

3 Q Statistic distribution approximation

Qi is considered as the control chart positive statistic and a one-sided control chart is the appropriate choice. in what follows, Qi distribution is approximated by a linear combination of chi squares as the exact distribution is not known (Equation 4)

Q i j = 1 q w j χ i j 2 ( 1 ) (4)

Applications of linear combination of chi squares are presented in (Jensen & Solomon, 1972Jensen, D. R., & Solomon, H. (1972). A Gaussian approximation to the distribution of a definite quadratic form. Journal of the American Statistical Association, 67(340), 898-902. http://dx.doi.org/10.1080/01621459.1972.10481313.
http://dx.doi.org/10.1080/01621459.1972....
) and in (Mathai & Provost, 1992Mathai, A. M., & Provost, S. B. (1992). Quadratic forms in random variables: Theory and Applications (vol. 126). New York: Marcel Dekker Inc.). (Kotz et al., 1967Kotz, S., Johnson, N. L., & Boyd, D. W. (1967). Series representations of distributions of quadratic forms in normal variables. I. Central case. Annals of Mathematical Statistics, 38(3), 823-837. http://dx.doi.org/10.1214/aoms/1177698877.
http://dx.doi.org/10.1214/aoms/117769887...
) used power series expansion to determine the approximated Q statistic distribution. (Davis, 1977Davis, A. W. (1977). A differential equation approach to linear combination of independent chi-squares. Journal of the American Statistical Association, 72(357), 212-214. http://dx.doi.org/10.1080/01621459.1977.10479941.
http://dx.doi.org/10.1080/01621459.1977....
) tried to solve the slow convergence of the power series for large values of the linear combination of chi squares using differential equations and assuming positive coefficients for a useful range of parameters. (Oman & Zacks, 1981Oman, S. D., & Zacks, S. (1981). A mixture approximation to the distribution of a weighted sum of chi-squared variables. Journal of Statistical Computation and Simulation, 13(3-4), 215-224. http://dx.doi.org/10.1080/00949658108810498.
http://dx.doi.org/10.1080/00949658108810...
) noticed that all available methods suffer of lengthy computations or are insufficiently accurate. However, the work of (Feiveson & Delaney, 1968Feiveson, A. H., & Delaney, F. C. (1968). The distribution and properties of a weighted sum of chi squares (NASA Technical Note, D-4575). Houston, Texas: Manned Spacecraft Center.) was not considered. (Moschopoulos & Canada, 1984Moschopoulos, P. G., & Canada, W. B. (1984). The distribution function of linear combination of chi-squares. Computers & Mathematics with Applications (Oxford, England), 10(4/5), 383-386. http://dx.doi.org/10.1016/0898-1221(84)90066-X.
http://dx.doi.org/10.1016/0898-1221(84)9...
) presented an easily programmed method for the distribution computation and better or at least as accurate as (Oman & Zacks, 1981Oman, S. D., & Zacks, S. (1981). A mixture approximation to the distribution of a weighted sum of chi-squared variables. Journal of Statistical Computation and Simulation, 13(3-4), 215-224. http://dx.doi.org/10.1080/00949658108810498.
http://dx.doi.org/10.1080/00949658108810...
) method, they noticed that their moment generating function-based method has equivalent results than (Imhof, 1961Imhof, J. P. (1961). Computing the distribution of quadratic forms in normal variables. Biometrika, 48(3-4), 419-426. http://dx.doi.org/10.1093/biomet/48.3-4.419.
http://dx.doi.org/10.1093/biomet/48.3-4....
) method. Moreover, (Davies, 1980Davies, R. B. (1980). Algorithm AS 155: The distribution of a linear combination of chi-square random variables. Journal of the Royal Statistical Society. Series C, Applied Statistics, 29(3), 323-333.) presented an algorithm to compute the distribution function of linear combination of non-central chi square variables and (Castano-Martinez & Lopez-Blazquez, 2005Castano-Martinez, A., & Lopez-Blazquez, F. (2005). Distribution of a sum of weighted noncentral chi-square variables. Test, 14(2), 397-415. http://dx.doi.org/10.1007/BF02595410.
http://dx.doi.org/10.1007/BF02595410...
) derived Laguerre expansions for the density and distribution functions for a sum of positive weighted noncentral chi square variables. (Bodenham & Adams, 2016Bodenham, D. A., & Adams, N. M. (2016). A comparison of efficient approximations for a weighted sum of chi-squared random variables. Statistics and Computing, 26(4), 917-928. http://dx.doi.org/10.1007/s11222-015-9583-4.
http://dx.doi.org/10.1007/s11222-015-958...
) compared different approximations for weighted sum of chi squared random variables in the case of streaming data and noticed that Imhof’s method is essentially exact, however, it is not suitable for a streaming data scenario. (Moschopoulos & Canada, 1984Moschopoulos, P. G., & Canada, W. B. (1984). The distribution function of linear combination of chi-squares. Computers & Mathematics with Applications (Oxford, England), 10(4/5), 383-386. http://dx.doi.org/10.1016/0898-1221(84)90066-X.
http://dx.doi.org/10.1016/0898-1221(84)9...
) point out that the linear combination of independent central chi-square random variables distribution function is given by Equation 5:

F x = Pr Q i x = i = 2 q b i j = 0 a j 0 x g j t d t (5)

where Qi is expressed as in Equation 4, gj(t) is the p.d.f. of the Gamma distribution with a shape parameter λ = q2+j and a rate parameter β=12w1 . The coefficients bi=w1wi12 and the coefficients aj=Ajq where Aji are determined recursively with Aji=k=0jAki1A(wi,jk), Ar2=Aw2,r;Awi,r=12r1w1wirr!; 12r=1212+112+r1; r=0,1,2,; i=3,,q; j=0,1,2, and jk=0,1,2,

(Moschopoulos & Canada, 1984Moschopoulos, P. G., & Canada, W. B. (1984). The distribution function of linear combination of chi-squares. Computers & Mathematics with Applications (Oxford, England), 10(4/5), 383-386. http://dx.doi.org/10.1016/0898-1221(84)90066-X.
http://dx.doi.org/10.1016/0898-1221(84)9...
) provided some percentiles corresponding to different wj and different degrees of freedom, they recommend using 40 terms in Equation 5 to determine numerically upper percentiles of the linear combination of chi squares. However, it is noticed that for extreme quantiles and large weight values, result convergence needs more than 85 terms. Notice that if wj = 1, ∀j = 1, ..., q then Qiχ2(q) this property is used as a benchmark for program code validation. Figures 1,2 and 3 compare the approximated Q distribution to different chi square distributions and it is noticed that the approximated Q distribution depends on the category number and is different from a chi square distribution. Approximated Q quantiles are given in Table 1 for different values of q and different weights where α is the quantile order. (Feiveson & Delaney, 1968Feiveson, A. H., & Delaney, F. C. (1968). The distribution and properties of a weighted sum of chi squares (NASA Technical Note, D-4575). Houston, Texas: Manned Spacecraft Center.) approximated the linear combination of chi-squares distribution by a gamma distribution with a rate parameter β=12 i=1qwii=1qwi2 and a shape parameter λ=12 (i=1qwi)2i=1qwi2 and pointed out that large standard deviation of weights overestimate the true functional value in the right tail of the distribution.

Figure 1
Approximated Three Categories Q Distribution with Weights iq Against Chi Square Distributions.
Figure 2
Approximated Four Categories Q Distribution with Weights iq Against Chi Square Distributions.
Figure 3
Approximated Five Categories Q Distribution with Weights iq Against Chi Square Distributions
Table 1
Linear Combination Percentiles for Different Weights and q Values.

4 Approximations assessment

Using (Moschopoulos & Canada, 1984Moschopoulos, P. G., & Canada, W. B. (1984). The distribution function of linear combination of chi-squares. Computers & Mathematics with Applications (Oxford, England), 10(4/5), 383-386. http://dx.doi.org/10.1016/0898-1221(84)90066-X.
http://dx.doi.org/10.1016/0898-1221(84)9...
) approximation the Q statistic approximated cumulative distribution is represented graphically against some chi square distributions. The considered weights are wj=jq with j = 1, ..., q. The Q statistic approximated cumulative distribution could be then compared graphically to χ2q1 distribution. It becomes obvious that the upper control chart limit value based on linear combination of chi squares distribution approximation with wj=jq is less than the upper limit determined from the chi square distribution as wj ≤ 1, ∀j = 1...q. Moreover, Figure 4 shows that the linear combination of chi squares distribution is more likely to be a gamma distribution rather than a chi square distribution and that it is sensitive to weight change. However, the Q statistic is a weighted chi square goodness of fit statistic, it would be interesting to compare graphically the linear combination of chi squares distribution and the weighted chi square goodness of fit statistic empirical distribution. In order to represent graphically the empirical distribution different multinomial distribution parameters were considered, 100000 samples were generated from each multinomial distribution and the Q statistic was computed for each sample of each multinomial distribution. Figure 5 illustrates both distributions where the empirical distribution is obtained for several category probabilities combinations and for huge sample size n=20000. It becomes obvious that the empirical distribution is sensitive to category probabilities changes and that the closest distribution to the approximation distribution is the empirical distribution with the lowest poor quality category classification probability.

Figure 4
Approximated Three Categories Q Distribution with different weights against the gamma distribution.
Figure 5
Q Statistic approximated cumulative distribution against empirical distributions.

A simulation is implemented to assess the Q statistic distribution approximation by linear combination of chi squares. As the approximated quantile with order 0.0027 would be used as the control chart upper control limit, then, first, the quantile x0.0027 is determined for q = 3,4,5 using (5), then 5000 samples are generated from a multinomial distribution, different sample sizes will be used, finally the Q statistic is computed for each sample and the number of Q statistic values beyond the x0.0027 are counted. These steps are repeated 5000 times, hence, the average count of Q statistic values beyond the x0.0027 is determined and divided by 5000 in order to determine the proportion α of Q statistic values beyond the x0.0027. The approximation is satisfactory if the determined proportions are close and converge to the targeted value α = 0.0027 when the sample size is increased. Simulation results are in Tables 2, 3 and 4.

Table 2
α for different sample sizes and q = 3.
Table 3
α for different sample sizes and q = 4.
Table 4
α for different sample sizes and q = 5.

The remarkable fact is that α does not converge exactly to the targeted value 0.0027 for all considered multinomial distributions, however, it is noticed computationally and graphically that the approximation is more efficient when pq ≤ 0.01. The quantile x0.0027 could be used as an upper control limit for a Q control chart especially when pq ≤ 0.01. As in Figure 4 it was noticed that linear combination of chi squares distribution is more likely to be the gamma distribution, then, in what follows (Feiveson & Delaney, 1968Feiveson, A. H., & Delaney, F. C. (1968). The distribution and properties of a weighted sum of chi squares (NASA Technical Note, D-4575). Houston, Texas: Manned Spacecraft Center.) approximation is assessed for the Q distribution x0.0027 quantile using the same steps explained previously. Results are in Tables 5, 6 and 7 where no approximation improvements are observed. Since Imhof’s method the essentially exact and that (Moschopoulos & Canada, 1984Moschopoulos, P. G., & Canada, W. B. (1984). The distribution function of linear combination of chi-squares. Computers & Mathematics with Applications (Oxford, England), 10(4/5), 383-386. http://dx.doi.org/10.1016/0898-1221(84)90066-X.
http://dx.doi.org/10.1016/0898-1221(84)9...
) method is in fact the Imhof’s method equivalent, then, only (Moschopoulos & Canada, 1984Moschopoulos, P. G., & Canada, W. B. (1984). The distribution function of linear combination of chi-squares. Computers & Mathematics with Applications (Oxford, England), 10(4/5), 383-386. http://dx.doi.org/10.1016/0898-1221(84)90066-X.
http://dx.doi.org/10.1016/0898-1221(84)9...
) approximation will be used for the control chart performance assessment where the A.R.L. is used as a benchmark.

Table 5
α for different sample sizes and q = 3 using Gamma distribution x0.0027 quantile.
Table 6
α for different sample sizes and q = 4 using Gamma distribution x0.0027 quantile.
Table 7
α for different sample sizes and q = 5 using Gamma distribution x0.0027 quantile

5 Q Statistic Control Chart

Multinomial processes with q categories are considered. Quality levels are ordered and let the first category has the best quality level, the qth category has the worst quality level. Let p1, p2, ..., pq be the probabilities to classify a given product unit into one of the q categories with j=1qpj=1. Different weights w1w2...wq are assigned to the considered categories. For a given taken randomly sample and including n product units, let Xj be the count of product units in the jth category. The Q statistic is computed for the considered sample as Equation 2 and if several samples are considered, then the Q statistic for the ith sample is given by Equation 3, where ni is the size of the ith sample and Xij is the observed count of product units in the ith sample and classified into the jth category.

Higher weight values are assigned to categories with poor quality in order to detect earlier process deterioration and delay process improvement detection. It is proposed that wj=jq.

The Q statistic is a weighted distance between the observed counts and theoretical counts computed under the hypothesis that count occurrence in different categories are independent. In order to control the Q statistic values, a one-sided control chart is considered where the Q statistic distribution quantile of order α is the upper control limit, where α is the targeted false alarm rate. Since the Q statistic distribution is unknown, the α quantile is approximated by a quantile of the distribution in Equation 5 as explained previously. Q statistic control chart is illustrated through the following example. The example of (Marcucci, 1985) is reconsidered, where in brick manufacturing, a brick could be classified as suitable for all uses (standard), or (chipped-face) if sound but not suitable for all uses or unacceptable for use (cull). Classification probabilities are 95%, 3%, 2% respectively and the 95th percentile of the χ2(2) distribution is used as the upper control limit for the Yi2 statistic defined in Equation 1. In the same way the 95th percentile of the approximated distribution of the Q statistic is used as the upper control limit for Q statistic control chart where wj=jq with j=1,2,3 in this case. Hence, 5.99 and 5.47 are the control limits for the χ2 chart and the Q chart respectively. Table 8 gives the control charts statistics computation. Both charts behave identically and show that samples 5, 10, 11 and 14 are out of control.

Table 8
Control Charts Statistics Computation.

6 Control Charts Comparison

A simulation is implemented in order to compare between the (Marcucci, 1985Marcucci, M. (1985). Monitoring multinomial processes. Journal of Quality Technology, 17(2), 86-91. http://dx.doi.org/10.1080/00224065.1985.11978941.
http://dx.doi.org/10.1080/00224065.1985....
) χ2 control chart and the Q statistic-based control chart with a UCL approximated by a linear combination of independent chi squares distribution quantile. The upper control limits for the Q chart are 10.6475, 11.5732 and 12.4866 for q = 3, q = 4 and q = 5 respectively. A multinomial distribution is used as an in-control situation, large number of samples are generated and the order of the first out of control statistic value is determined. These steps are repeated 5000 times and the inverse of the average order is considered as the A.R.L. when the process is in control. Two scenarios are then considered to represent out of control situations. The A.R.L. is computed in the case of process improvement and in the case of process deterioration.

Simulation results are in Tables 9, 10 and 11. It is noticed that Q control chart produces less false alarms than the χ2 control chart and that both are appropriate for large samples n ≥ 300. The Q chart has a higher A.R.L. than the goodness of fit control chart in case of process improvement and a smaller A.R.L. in case of process deterioration. The Q control chart shows less points in average beyond the control chart limit in case of process improvement and allows earlier detection in case of process deterioration, however, these main benefits are observed specifically when the process improvement and process deterioration concern only extreme categories, for q = 4 and q = 5 and only when the UCL for the Q chart is efficiently approximated.

Table 9
Average Run Length for q = 3.
Table 10
Average Run Length for q = 4.
Table 11
Average Run Length for q = 5.

7 Conclusion

A weighted goodness of fit statistic is proposed to improve the (Marcucci, 1985Marcucci, M. (1985). Monitoring multinomial processes. Journal of Quality Technology, 17(2), 86-91. http://dx.doi.org/10.1080/00224065.1985.11978941.
http://dx.doi.org/10.1080/00224065.1985....
) one-sided control chart performance in the case of multinomial process monitoring. As the proposed statistic distribution is not known, the control limit is approximated by a linear combination of independent chi squares distribution quantile. Performance assessment through simulation was performed for the cases were the UCL is well approximated and considering process improvement and process deterioration scenarios. Main benefits of earlier deterioration detection and postponing improvement detection are observed specifically for processes with less than 1% of product units into the worst quality category. Moreover, control chart performance improvement is clearer for q = 4 and q = 5. As the linear combination of chi squares offers poor approximation for the weighted chi squares statistic distribution, better performance is expected with the exact distribution of the control chart statistic.

  • Financial support: None.
  • How to cite: Talmoudi, R., Achouri A., & Taleb H (2021). Linear Combination of Chi-Squares for Multinomial Process Monitoring. Gestão & Produção, 28(3), e41. http://doi.org/101590/1806-9649-2020v28e41
  • ERRATUM

    1University of Carthage, LR18ES48 ENVIE Laboratory, Faculté des Sciences Economiques et de Gestion de Nabeul, Tunisia. E-mail: ramzi.telmoudi@fsegn.u-carthage.tn
    2University of Carthage, ARBRE Laboratory, Tunisia
    Due to desktop publishing error the article “Linear combination of chi-squares for multinomial process monitoring” (DOI https://doi.org/10.1590/1806-9649-2021v28e41), published in Gestão & Produção, 28(3), e41 was published with an error.
    On page 1, where the text reads:
    ORIGINAL ARTICLE
    It should read:
    THEMATIC SECTION: STATISTICAL PROCESS MONITORING AND CONTROL
    The publisher apologizes for the errors.

References

  • Bodenham, D. A., & Adams, N. M. (2016). A comparison of efficient approximations for a weighted sum of chi-squared random variables. Statistics and Computing, 26(4), 917-928. http://dx.doi.org/10.1007/s11222-015-9583-4
    » http://dx.doi.org/10.1007/s11222-015-9583-4
  • Castano-Martinez, A., & Lopez-Blazquez, F. (2005). Distribution of a sum of weighted noncentral chi-square variables. Test, 14(2), 397-415. http://dx.doi.org/10.1007/BF02595410
    » http://dx.doi.org/10.1007/BF02595410
  • Cozzucoli, P. C. (2009). Process monitoring with multivariate p-Control chart. International Journal of Quality, Statistics, and Reliability, 2009, 1-11. http://dx.doi.org/10.1155/2009/707583
    » http://dx.doi.org/10.1155/2009/707583
  • Davies, R. B. (1980). Algorithm AS 155: The distribution of a linear combination of chi-square random variables. Journal of the Royal Statistical Society. Series C, Applied Statistics, 29(3), 323-333.
  • Davis, A. W. (1977). A differential equation approach to linear combination of independent chi-squares. Journal of the American Statistical Association, 72(357), 212-214. http://dx.doi.org/10.1080/01621459.1977.10479941
    » http://dx.doi.org/10.1080/01621459.1977.10479941
  • Duncan, A. J. (1950). A Chi-Square chart for controlling a set of percentages. Industrial Quality Control., 7, 11-15.
  • Feiveson, A. H., & Delaney, F. C. (1968). The distribution and properties of a weighted sum of chi squares (NASA Technical Note, D-4575). Houston, Texas: Manned Spacecraft Center.
  • Imhof, J. P. (1961). Computing the distribution of quadratic forms in normal variables. Biometrika, 48(3-4), 419-426. http://dx.doi.org/10.1093/biomet/48.3-4.419
    » http://dx.doi.org/10.1093/biomet/48.3-4.419
  • Jensen, D. R., & Solomon, H. (1972). A Gaussian approximation to the distribution of a definite quadratic form. Journal of the American Statistical Association, 67(340), 898-902. http://dx.doi.org/10.1080/01621459.1972.10481313
    » http://dx.doi.org/10.1080/01621459.1972.10481313
  • Kotz, S., Johnson, N. L., & Boyd, D. W. (1967). Series representations of distributions of quadratic forms in normal variables. I. Central case. Annals of Mathematical Statistics, 38(3), 823-837. http://dx.doi.org/10.1214/aoms/1177698877
    » http://dx.doi.org/10.1214/aoms/1177698877
  • Li, J., Tsung, F., & Zou, C. (2014a). A simple categorical chart for detecting location shifts with ordinal information. International Journal of Production Research, 52(2), 550-562. http://dx.doi.org/10.1080/00207543.2013.838329
    » http://dx.doi.org/10.1080/00207543.2013.838329
  • Li, J., Tsung, F., & Zou, C. (2014b). Multivariate binomial/multinomial control chart. IIE Transactions, 46(5), 526-542. http://dx.doi.org/10.1080/0740817X.2013.849830
    » http://dx.doi.org/10.1080/0740817X.2013.849830
  • Marcucci, M. (1985). Monitoring multinomial processes. Journal of Quality Technology, 17(2), 86-91. http://dx.doi.org/10.1080/00224065.1985.11978941
    » http://dx.doi.org/10.1080/00224065.1985.11978941
  • Mathai, A. M., & Provost, S. B. (1992). Quadratic forms in random variables: Theory and Applications (vol. 126). New York: Marcel Dekker Inc.
  • Moschopoulos, P. G., & Canada, W. B. (1984). The distribution function of linear combination of chi-squares. Computers & Mathematics with Applications (Oxford, England), 10(4/5), 383-386. http://dx.doi.org/10.1016/0898-1221(84)90066-X
    » http://dx.doi.org/10.1016/0898-1221(84)90066-X
  • Oman, S. D., & Zacks, S. (1981). A mixture approximation to the distribution of a weighted sum of chi-squared variables. Journal of Statistical Computation and Simulation, 13(3-4), 215-224. http://dx.doi.org/10.1080/00949658108810498
    » http://dx.doi.org/10.1080/00949658108810498
  • Perry, M. B. (2019). An EWMA control chart for categorical processes with applications to social network monitoring. Journal of Quality Technology http://dx.doi.org/10.1080/00224065.2019.1571343.
    » https://doi.org/10.1080/00224065.2019.1571343
  • Raz, T., & Wang, J. (1990). On the construction of control charts using linguistic variables. International Journal of Production Research, 28(3), 477-487. http://dx.doi.org/10.1080/00207549008942731
    » http://dx.doi.org/10.1080/00207549008942731
  • Taleb, H., & Limam, M. (2002). On fuzzy and probabilistic control charts. International Journal of Production Research, 40(12), 2849-2863. http://dx.doi.org/10.1080/00207540210137602
    » http://dx.doi.org/10.1080/00207540210137602
  • Taleb, H., Limam, M., & Hirota, H. (2006). Multivariate fuzzy multinomial control charts. Quality Technology & Quantitative Management, 3(4), 437-453. http://dx.doi.org/10.1080/16843703.2006.11673125
    » http://dx.doi.org/10.1080/16843703.2006.11673125
  • Topalidou, E., & Psarakis, S. (2009). Review of multinomial and multi-attribute quality control charts. Quality and Reliability Engineering International, 25(7), 773-804. http://dx.doi.org/10.1002/qre.999
    » http://dx.doi.org/10.1002/qre.999
  • Weiss, C. H. (2018). Control Charts for Time-Dependent Categorical Processes. In: S. Knoth & W. Schmid (Eds.) Frontiers in statistical quality control 12 (pp. 211-231). Springer International Publishing.
  • Yashchin, E. (2012). On detection of changes in categorical data. Quality Technology & Quantitative Management, 9(1), 79-96. http://dx.doi.org/10.1080/16843703.2012.11673279
    » http://dx.doi.org/10.1080/16843703.2012.11673279

Publication Dates

  • Publication in this collection
    02 July 2021
  • Date of issue
    2021

History

  • Received
    23 Oct 2020
  • Accepted
    25 Jan 2021
Universidade Federal de São Carlos Departamento de Engenharia de Produção , Caixa Postal 676 , 13.565-905 São Carlos SP Brazil, Tel.: +55 16 3351 8471 - São Carlos - SP - Brazil
E-mail: gp@dep.ufscar.br