New statistical process control charts for overdispersed count data based on the Bell distribution

BOAVENTURA, LAION L.; FERREIRA, PAULO H.; FIACCONE, ROSEMEIRE L.; RAMOS, PEDRO L.; LOUZADA, FRANCISCO

doi:10.1590/0001-3765202320200246

Abstract

The Poisson distribution is a popular discrete model used to describe count data, from which traditional control charts for count data, such as the c and u charts, have been established in the literature. However, several studies recognize the need for alternative control charts that allow for data overdispersion, which can be encountered in many fields, including ecology, healthcare, industry, and others. The Bell distribution, recently proposed by Castellares et al. (2018), is a particular solution of a multiple Poisson process able to accommodate overdispersed data. It can be used as an alternative to the usual Poisson (which, although not nested in the Bell family, is approached for small values of the Bell parameter), negative binomial, and COM-Poisson distributions for modeling count data in several areas. In this paper, we consider the Bell distribution to introduce two new, exciting, and useful statistical control charts for counting processes, which are capable of monitoring count data with overdispersion. The performance of the so-called Bell charts, namely the Bell-c and Bell-u charts, is evaluated by the average run length in numerical simulation. Some artificial and real data sets are used to illustrate the applicability of the proposed control charts.

Key words
average run length; Bell charts; count data; overdispersion

Introduction

Count data arise in many fields, including biology, ecology, healthcare, marketing, economics, and industry. Overdispersion (variance $>$ mean) in count data is quite common (Hougaard et al. 1997, Okamura et al. 2012, Coly et al. 2016) and can occur for several reasons, including mechanisms that generate excessive zero counts or censoring (Avcı et al. 2015). In the presence of overdispersed count data, the Poisson model, which is the most popular distribution for analyzing count data and assumes equidispersion (variance $=$ mean), may result in incorrect inference about parameter estimates, standard errors, tests, and confidence intervals (Avcı et al. 2015). Particularly in Statistical Process Control (SPC), a misspecified Poisson model ($c$ or $u$ chart) may increase the false alarm rate (Saghir & Lin 2015). Indeed, when dealing with the application of control charts to data that show excessive dispersion, several works (Spiegelhalter 2005, Mohammed & Laney 2006, Albers 2011) express concern about the use of such a structure. The concern is that the increased variation may cause many data values to be flagged as out of control when they are merely false positives.

Sheaffer & Leavenworth (1976) and Kaminsky et al. (1992) considered the negative binomial distribution, while Sellers (2012) used the Conway–Maxwell–Poisson (COM-Poisson) model ($cmpc$ and $cmpu$ charts) as flexible alternatives to the Poisson control charts ($c$ and $u$ charts). In particular, Kaminsky et al. (1992) also developed control charts that plot the total or the average number of events based on the geometric distribution ($g$ and $h$ charts, respectively). However, as pointed out by Saghir & Lin (2015), there are still very few articles on monitoring dispersed count data. Other proposals involve using compound distributions, e.g., the Poisson-gamma mixture (Cheng & Yu 2013), the shifted (or zero-truncated) generalized Poisson distribution (Famoye 1994), among others.

Recently, Castellares et al. (2018) introduced a one-parameter discrete probability distribution named the Bell distribution. It is infinitely divisible, capable of modeling count data with overdispersion, and has many other attractive properties, e.g., it is a single-parameter distribution that belongs to the exponential family of distributions. Although the Poisson distribution is not nested in the Bell family, the Bell distribution approaches the Poisson distribution for small values of the parameter. That is, the Poisson distribution is a limiting case of the Bell distribution, which arises when the Bell parameter tends to zero. The Bell distribution motivated us to propose two new statistical control charts to describe the total and the average number of events per inspection unit. The so-called Bell charts, namely the Bell-$c$ and Bell-$u$ charts, represent interesting and useful alternatives to the charts mentioned above when monitoring counting processes with overdispersed data. It is worth pointing out that, in a typical Shewhart control chart, such as the ones proposed in this paper, the main aim is to detect large shifts in the process parameter. This kind of control chart also ignores historical data, which is why it is called “without memory”.

The remainder of this paper is organized as follows. The “Bell distribution” section reviews the Bell distribution and some of its basic properties. The “New control charts” section presents the new attribute control charts based on the Bell distribution. The “Performance evaluation” section provides simulation studies to assess the performance of the proposed Bell charts, also in comparison with some traditional control charts. The “Applications” section illustrates the usefulness of the Bell charts through several examples. Finally, the “Final remarks” section concludes the paper with a few remarks and discussions on future works.

Bell distribution

In this section, we present a brief review of the Bell distribution and some of its properties. We also discuss point estimation via the maximum likelihood estimator for its parameter.

Following Castellares et al. (2018), a random variable $Y$ is said to be Bell distributed with parameter $θ > 0$, denoted by $Y \sim \mathrm{Bell}(θ)$, if its probability mass function (PMF) is given by

$$P(Y = y \mid θ) = \frac{θ^{y} e^{-e^{θ} + 1} B_{y}}{y!}, \quad \text{for } y = 0, 1, 2, \dots, \qquad (1)$$

where $B_{y} = \frac{1}{e} \sum_{k = 0}^{\infty} \frac{k^{y}}{k!}$, $y = 0, 1, 2, \dots$, are the Bell numbers¹ (Bell 1934a, b), which can be computed via the bell(.) function of the “numbers” package (Borchers 2018) in R software (R Core Team 2018).

¹ It is worth noting that the Bell number $B_{y}$ is the $y$-th moment of a Poisson distribution with parameter equal to one; see Remark 1 of Castellares et al. (2018).
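To make the PMF in (1) concrete, the following is a minimal Python sketch (the paper itself works in R). The function names `bell_numbers` and `bell_pmf_table` are illustrative and not part of any package cited here; the Bell numbers are obtained with the Bell triangle recurrence rather than with the infinite series.

```python
import math

def bell_numbers(n_max):
    """Return [B_0, ..., B_{n_max}] as exact integers via the Bell triangle."""
    bells = [1]          # B_0 = 1
    row = [1]            # first row of the Bell triangle
    for _ in range(n_max):
        # Each new row starts with the last entry of the previous row,
        # and every further entry adds the entry above-left.
        new_row = [row[-1]]
        for value in row:
            new_row.append(new_row[-1] + value)
        row = new_row
        bells.append(row[0])
    return bells

def bell_pmf_table(theta, y_max):
    """Return [P(Y=0), ..., P(Y=y_max)] for Y ~ Bell(theta), theta > 0."""
    if theta <= 0:
        raise ValueError("theta must be positive")
    bells = bell_numbers(y_max)
    norm = math.exp(1.0 - math.exp(theta))   # e^{-e^theta + 1}
    return [
        theta**y * norm * (bells[y] / math.factorial(y))
        for y in range(y_max + 1)
    ]
```

For moderate $θ$ the truncated table should sum to essentially one, which gives a quick sanity check on both the Bell numbers and the normalizing constant.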

If $Y \sim \mathrm{Bell}(θ)$, then the mean and variance of $Y$ are given, respectively, by

$$E[Y] = θ e^{θ} \quad \text{and} \quad \mathrm{Var}[Y] = θ (1 + θ) e^{θ}. \qquad (2)$$

Note that $\mathrm{Var}[Y] / E[Y] = 1 + θ > 1$, $\forall θ > 0$. Therefore, the Bell distribution may be a suitable distribution for modeling count data with overdispersion, although it may not accommodate all possible forms of overdispersion; see Remark 5 of Castellares et al. (2018).

Regarding pseudo-random sample generation from $Y \sim \mathrm{Bell}(θ)$, we resort to Proposition 3 of Castellares et al. (2018), which states that the random variable $Y$ has the same distribution as the sum of $N$ independent and identically distributed (IID) zero-truncated Poisson (ZTP) random variables with parameter $θ > 0$, where $N$ is Poisson distributed with parameter $e^{θ} - 1$. In other words, if $X_{i} \overset{IID}{\sim} \mathrm{ZTP}(θ)$, for $i = 1, \dots, N$, with $N \sim \mathrm{Poisson}(e^{θ} - 1)$ and independent of $\{X_{1}, \dots, X_{N}\}$, then $Y = X_{1} + \dots + X_{N} \sim \mathrm{Bell}(θ)$. In R software, we can generate pseudo-random observations from the ZTP distribution by using the rztpois(.) function of the “actuar” package (Dutang et al. 2008).
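The compound representation above translates directly into a sampler. The sketch below is in Python with the standard library only, not the R route through `actuar` mentioned in the text. The helper names are hypothetical. The Poisson draws use Knuth's multiplication method and the ZTP draws simply reject zeros; both are simple choices assumed here for illustration and are only practical for small to moderate parameters.

```python
import math
import random

def sample_poisson(lam, rng):
    """Poisson(lam) draw by Knuth's multiplication method (small lam only)."""
    limit = math.exp(-lam)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count

def sample_ztp(theta, rng):
    """Zero-truncated Poisson(theta) draw, obtained by rejecting zeros."""
    while True:
        x = sample_poisson(theta, rng)
        if x > 0:
            return x

def sample_bell(theta, rng):
    """Bell(theta) draw as a Poisson(e^theta - 1) sum of IID ZTP(theta) terms."""
    n_terms = sample_poisson(math.exp(theta) - 1.0, rng)
    return sum(sample_ztp(theta, rng) for _ in range(n_terms))
```

With a seeded generator, for example `rng = random.Random(1)`, the sample mean and variance of many draws can be compared with $θ e^{θ}$ and $θ (1 + θ) e^{θ}$ from Equation (2).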

We can easily estimate the parameter $θ$ via the maximum likelihood method. By considering the observed sample $𝐲 = {(y_{1}, y_{2}, \dots, y_{n})}^{'}$ of size $n$ from $Y \sim \mathrm{Bell}(θ)$, we obtain the likelihood function

$$L(θ \mid 𝐲) = \prod_{i = 1}^{n} \frac{θ^{y_{i}} e^{-e^{θ} + 1} B_{y_{i}}}{y_{i}!} = \frac{θ^{\sum_{i = 1}^{n} y_{i}} e^{n (-e^{θ} + 1)} \prod_{i = 1}^{n} B_{y_{i}}}{\prod_{i = 1}^{n} y_{i}!} \propto θ^{\sum_{i = 1}^{n} y_{i}} e^{-n e^{θ}}. \qquad (3)$$

Castellares et al. (2018) showed that the maximum likelihood estimator (MLE) $\hat{θ}$ of $θ$ has a closed-form expression and is given by

$$\hat{θ} = W_{0}(\overline{Y}), \qquad (4)$$

where $W_{0}(.)$ is the Lambert $W$ function (Corless et al. 1996), which can be computed via the W(.) function of the “LambertW” package (Goerg 2011, Goerg 2016) in R, and $\overline{Y} = \sum_{i = 1}^{n} Y_{i} / n$ is the sample mean.
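Equation (4) can be evaluated without any special package once $W_{0}$ is available. The sketch below is an assumption-laden illustration in Python: `lambert_w0` is a hand-written Newton iteration for the principal branch on $x \geq 0$ (not the R “LambertW” routine cited above), and `bell_mle_theta` is a hypothetical helper name.

```python
import math

def lambert_w0(x, tol=1e-12, max_iter=100):
    """Principal branch W0(x) for x >= 0: solves w * exp(w) = x by Newton's method."""
    if x < 0:
        raise ValueError("this sketch only handles x >= 0")
    w = math.log1p(x)  # starting value at or above the root; exact at x = 0
    for _ in range(max_iter):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) <= tol * (1.0 + abs(w)):
            break
    return w

def bell_mle_theta(counts):
    """MLE of theta for IID Bell counts: W0 of the sample mean, as in Equation (4)."""
    if len(counts) == 0:
        raise ValueError("need at least one observation")
    return lambert_w0(sum(counts) / len(counts))
```

By construction, the estimate satisfies $\hat{θ} e^{\hat{θ}} = \overline{Y}$, which is the likelihood equation obtained by differentiating the logarithm of (3).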

Castellares et al. (2018) also presented an alternative reparametrization of the Bell distribution, where $μ = E[Y] = θ e^{θ}$ and, hence, $θ = W_{0}(μ)$. This reparametrization is useful mainly in a regression modeling framework, in which it is very common to model the mean of the response variable as a function of several other variables, also called explanatory variables or regressors. In this case, the PMF, mean and variance of $Y \sim \mathrm{Bell}(μ)$, $μ > 0$, are written, respectively, as

$$P(Y = y \mid μ) = \exp\{-e^{W_{0}(μ)} + 1\} \frac{{[W_{0}(μ)]}^{y} B_{y}}{y!}, \quad \text{for } y = 0, 1, 2, \dots,$$

$$E[Y] = μ \quad \text{and} \quad \mathrm{Var}[Y] = μ [1 + W_{0}(μ)].$$

Furthermore, it can be easily shown that the MLE $\hat{μ}$ of $μ$ is the sample mean, i.e., $\hat{μ} = \overline{Y} = \sum_{i = 1}^{n} Y_{i} / n$.
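As a quick worked illustration of the mean parametrization, consider the two means used later in the simulation study, $μ = 3$ and $μ = 15$. The rounded values below are approximations obtained by solving $θ e^{θ} = μ$ numerically, not figures quoted from Castellares et al. (2018).

```latex
\begin{aligned}
μ = 3:  &\quad θ = W_{0}(3) \approx 1.050, \quad \mathrm{Var}[Y] = 3\,[1 + W_{0}(3)] \approx 6.15, \\
μ = 15: &\quad θ = W_{0}(15) \approx 2.010, \quad \mathrm{Var}[Y] = 15\,[1 + W_{0}(15)] \approx 45.15.
\end{aligned}
```

In both cases the variance-to-mean ratio $1 + W_{0}(μ)$ exceeds one, at roughly 2.05 and 3.01, so the implied overdispersion grows with the mean.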

New control charts

Suppose that a process (e.g., an industrial process) generates events (e.g., nonconformities or defects) according to a $\mathrm{Bell}(θ)$ distribution. Letting $Y$ denote the number of events per process unit, the PMF of $Y$ is given by (1). Also, let $Y_{1}, Y_{2}, \dots, Y_{n}$ be a random sample of size $n$ from $Y \sim \mathrm{Bell}(θ)$, and consider $T = \sum_{i = 1}^{n} Y_{i}$ and $\overline{Y} = T / n = \sum_{i = 1}^{n} Y_{i} / n$, which represent, respectively, the total number of events and the average number of events per unit. Hence, it follows from Equation (2), using the properties of the expected value and variance operators, that the exact mean and variance of $T$ and $\overline{Y}$ are given by

$$\begin{aligned} E[T] &= n θ e^{θ}, & \mathrm{Var}[T] &= n θ (1 + θ) e^{θ}, \\ E[\overline{Y}] &= θ e^{θ}, & \mathrm{Var}[\overline{Y}] &= \frac{θ (1 + θ) e^{θ}}{n}. \end{aligned}$$

The above quantities can be used to construct the proposed control charts in the usual manner for Shewhart charts. Assuming that a standard value for $θ$ is available, the center lines and $Lσ$ control limits² for each chart, namely the Bell-$c$ (total number of events per sample) and Bell-$u$ (average number of events per unit) charts, are shown in Table I. It is worth pointing out that, since the Bell distribution approaches the Poisson distribution for small values of $θ$, the control limits presented in Table I also approach, as special cases, the control limits of the $c$ and $u$ charts derived from the Poisson distribution. Moreover, similar to the $c$ chart, the Bell-$c$ chart should not be used when the sample sizes are unequal, because both the center line and the control limits of the Bell-$c$ chart will vary with the sample size. Therefore, the Bell-$u$ chart is recommended over the Bell-$c$ chart for variable sample size, as the former is easier to interpret in this scenario (the center line of the Bell-$u$ chart will not vary across samples).

² In the usual Six Sigma quality control program, $L = 3$.

Table I
Control limits for the Bell-c and Bell-u charts (standards given). UCL = upper control limit, CL = center line, LCL = lower control limit.
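The Shewhart construction described above can be written out explicitly. The following is a sketch obtained by placing the center line at the mean and the limits $L$ standard deviations away, using the moments of $T$ and $\overline{Y}$. It is an assumption that Table I has exactly this form, and any truncation of a negative LCL at zero is not addressed here.

```latex
\begin{aligned}
\text{Bell-}c: \quad
  & \mathrm{UCL} = n θ e^{θ} + L \sqrt{n θ (1 + θ) e^{θ}}, \\
  & \mathrm{CL}  = n θ e^{θ}, \\
  & \mathrm{LCL} = n θ e^{θ} - L \sqrt{n θ (1 + θ) e^{θ}}, \\[4pt]
\text{Bell-}u: \quad
  & \mathrm{UCL} = θ e^{θ} + L \sqrt{\frac{θ (1 + θ) e^{θ}}{n}}, \\
  & \mathrm{CL}  = θ e^{θ}, \\
  & \mathrm{LCL} = θ e^{θ} - L \sqrt{\frac{θ (1 + θ) e^{θ}}{n}}.
\end{aligned}
```

Writing $λ = θ e^{θ}$ for the mean, the variance per unit equals $λ (1 + θ)$, so as $θ \to 0$ these limits take the familiar Poisson forms $nλ \pm L \sqrt{nλ}$ and $λ \pm L \sqrt{λ / n}$.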

To build the so-called Bell charts when $θ$ is unknown, we can apply, for example, the MLE (4), computed from all the available $m$ samples, in place of $θ$ (plug-in approach) in the control limits shown in Table I. In this case, the obtained control limits are commonly treated as trial control limits (Montgomery 2013).

Since the $θ$ parameter has little or no direct interpretation, it can be difficult for industry professionals, such as engineers and technicians, to provide a standard (or reference) value. To overcome this difficulty, we can use a convenient reparametrization of the Bell distribution in terms of the mean $μ = E[Y] = θ e^{θ}$, which was described at the end of the previous section. The derived control limits for the proposed Bell charts under this reparametrization are displayed in Table II. When no standard is given, $μ$ can be estimated by the overall sample mean, that is, $\hat{μ} = \overline{\overline{Y}} = \frac{1}{m n} \sum_{j = 1}^{m} \sum_{i = 1}^{n} Y_{i j}$ (in the cases of the Bell-$c$ and Bell-$u$ charts with equal sample sizes) or $\hat{μ} = \overline{\overline{Y}} = \frac{\sum_{j = 1}^{m} \sum_{i = 1}^{n_{j}} Y_{i j}}{\sum_{j = 1}^{m} n_{j}}$ (in the case of the Bell-$u$ chart with unequal sample sizes³), where $Y_{i j}$ denotes the number of events for the $i$-th inspection unit of the $j$-th sample. Both estimators can easily be shown to be unbiased for $μ$.

³ Note that, in this case, the control limits of the Bell-$u$ chart will also vary with the sample size, since we should replace $n$ by $n_{j}$ in their expressions. Alternatively, Montgomery (2013) suggested basing the control limit calculations on an average sample size $\overline{n}$, which results in constant limits and is particularly helpful if the charts will be presented to management.

Therefore, because of their practical advantage as well as simplicity, we shall hereafter consider the control limits provided in Table II.

Table II
Control limits for the Bell-c and Bell-u charts, considering a different reparametrization of the Bell distribution (standards given).
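A minimal Python sketch of limits of this kind in the mean parametrization is given below. It assumes the “Mean ± L × SD” form with $\mathrm{Var}[Y] = μ [1 + W_{0}(μ)]$, leaves a negative LCL untruncated, and computes $W_{0}$ with a hand-written Newton iteration. The names `lambert_w0` and `bell_limits` are illustrative, not the authors' code.

```python
import math

def lambert_w0(x, tol=1e-12, max_iter=100):
    """Principal branch W0(x) for x >= 0: solves w * exp(w) = x by Newton's method."""
    if x < 0:
        raise ValueError("this sketch only handles x >= 0")
    w = math.log1p(x)  # starting value at or above the root; exact at x = 0
    for _ in range(max_iter):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) <= tol * (1.0 + abs(w)):
            break
    return w

def bell_limits(mu, n, L=3.0, chart="c"):
    """Return (LCL, CL, UCL) for a Bell-c ("c") or Bell-u ("u") chart.

    mu is the in-control mean per unit (standard or estimate), n the sample size.
    """
    var_unit = mu * (1.0 + lambert_w0(mu))  # Var[Y] under Bell(mu)
    if chart == "c":
        center, sd = n * mu, math.sqrt(n * var_unit)
    elif chart == "u":
        center, sd = mu, math.sqrt(var_unit / n)
    else:
        raise ValueError("chart must be 'c' or 'u'")
    return center - L * sd, center, center + L * sd
```

For a Bell-$u$ chart with unequal sample sizes, one option is to call `bell_limits(mu, n_j, chart="u")` once per sample, so that only the limits, and not the center line, change with $n_{j}$.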

Performance evaluation

In this section, we conduct Monte Carlo (MC) simulation studies to assess the performance of the proposed Bell charts, as well as of some existing/traditional control charts for count data (namely, the Poisson-based $c$ and $u$ charts, and the COM-Poisson-based $c m p c$ and $c m p u$ charts) when the true data-generating process is Bell distributed. All simulations and computations were performed using the R software version 3.6.1. Interested readers can email the authors for the corresponding R codes.

The average run length (ARL) is a measure commonly used to evaluate the performance of control charts. The in-control ARL, also denoted as ${ARL}_{0}$ , is defined as the average number of samples (or monitoring points) before a signal is given (that is, a single point falls outside the control limits), assuming that the process is in control; while the out-of-control ARL (or ${ARL}_{1}$ ) is the average number of samples that are taken until a mean shift is observed when the process is out of control (Saghir & Lin 2015).

Let us assume that $Y \sim \mathrm{Bell}(μ)$ comes from a process with in-control mean number of nonconformities $μ$, and let $μ_{s}$ be the mean after a shift occurs in $μ$, that is, $Y \sim \mathrm{Bell}(μ_{s})$. For the proposed Bell charts, the ${ARL}_{0}$ is defined as

$${ARL}_{0} = \frac{1}{α},$$

where $α = 1 - P(LCL < Y < UCL \mid μ)$, while the ${ARL}_{1}$ is given by

$${ARL}_{1} = \frac{1}{1 - β},$$

with $β = P(LCL < Y < UCL \mid μ_{s})$.

For instance, in the usual Six Sigma programs, $α = 0.0027$ and, thus, ${ARL}_{0} = 1 / 0.0027 \approx 370$ . That is, even if the process is in control, an out-of-control signal will be given every 370 samples, on average (Montgomery 2013). On the other hand, ${ARL}_{1}$ values near one are desirable, mainly for large-size shifts in a process mean (Shewhart control charts).
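The relation between the signal probability and the ARL can be checked numerically. The sketch below assumes independent plotted points with a constant signal probability, so that the run length (counted up to and including the signalling point) is geometric. The function names are hypothetical and the simulation is only a sanity check of ${ARL} = 1 / p$.

```python
import random

def arl_from_signal_prob(p):
    """ARL of a chart whose points signal independently with probability p."""
    if not 0.0 < p <= 1.0:
        raise ValueError("p must lie in (0, 1]")
    return 1.0 / p

def simulate_run_length(p, rng):
    """Number of plotted points up to and including the first signal."""
    run = 1
    while rng.random() >= p:  # no signal at this point, keep monitoring
        run += 1
    return run

def simulated_arl(p, n_runs, seed=0):
    """Monte Carlo estimate of the ARL from n_runs independent run lengths."""
    rng = random.Random(seed)
    return sum(simulate_run_length(p, rng) for _ in range(n_runs)) / n_runs
```

Taking `p` equal to $α$ gives ${ARL}_{0}$, and taking `p` equal to $1 - β$ gives ${ARL}_{1}$; for instance, `arl_from_signal_prob(0.0027)` reproduces the value of about 370 quoted above.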

In-control ARL

Without loss of generality, in this subsection we consider a Bell process with mean nonconformity rates $μ = 3$ and $15$, as well as two values for the probability of false alarm: $α = 0.01$ and $0.1$. These $α$ values correspond, respectively, to ${ARL}_{0} = 100$ and $10$. We also assume three sample sizes for the process, $n = 50$, $100$ and $500$ (or $\overline{n} = 50$, $100$ and $500$, in the case of the Bell-$u$ chart with variable sample size⁴), as well as three different numbers of phase 1 samples: $m = 20$, $50$ and $100$.

⁴ Here, each sample size $n_{j}$ is obtained by $n_{j} = \overline{n} + k_{j}$, for $j = 1, \dots, m$, where $k_{j}$ is a random integer ranging over the interval $[-\overline{n}/5; \overline{n}/5]$.
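The variable sample sizes of footnote 4 can be generated as in the following Python sketch. It assumes that $k_{j}$ is drawn uniformly over the integers in the stated interval and that $\overline{n}/5$ is rounded down when it is not an integer; neither detail is specified in the text, and the function name is illustrative.

```python
import random

def variable_sample_sizes(n_bar, m, rng):
    """Return m sample sizes n_j = n_bar + k_j, with k_j an integer in [-n_bar/5, n_bar/5]."""
    half_width = n_bar // 5  # assumed: integer part of n_bar / 5
    return [n_bar + rng.randint(-half_width, half_width) for _ in range(m)]
```

For example, `variable_sample_sizes(100, 50, random.Random(0))` yields 50 sample sizes between 80 and 120.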

Figures 1-4 and 9-12 (see Appendix A) show the results obtained from 1,000 MC simulations (or replicates) with $m^{*} = 5,000$ phase 2 samples each⁵, performed for each scenario studied, that is, for each combination of the number of phase 1 samples, the sample size, the Bell distribution parameter, and the probability of false alarm. In particular, the ${ARL}_{0}$ results in Figures 1-4 concern the Bell-$c$ chart with equal sample sizes, while Figures 9-12 contain the ${ARL}_{0}$ values of the Bell-$u$ chart with unequal sample sizes. In all these figures, the dashed line represents the nominal ${ARL}_{0}$ value, and the asterisk inside the boxplot indicates the average ${ARL}_{0}$ estimate of the 1,000 simulations.

⁵ As pointed out by Montgomery (2013), phases 1 and 2 of control chart application have distinct objectives. In phase 1, a set of process data is gathered and analyzed at once, constructing trial control limits to determine whether the process was in control when the initial $m$ samples were collected (retrospective analysis). On the other hand, in phase 2, the chart built from a “clean” set of process data exhibiting control (reliable control limits) is used for monitoring future production (prospective process monitoring).

Despite some slight to moderate discrepancies from the target ${ARL}_{0}$ value in some cases, which are indeed expected due to the parameter estimation effect⁶, the results presented in Figures 1-4 and 9-12 seem to indicate the good performance of the proposed Bell charts. Note that, in general, the results improve, i.e., the ${ARL}_{0}$ values approach the nominal one, when both $m$ and $n$ (or $\overline{n}$) increase.

⁶ For a further discussion on this relevant issue, see, e.g., the review paper by Jensen et al. (2006).

Figure 1. The ${ARL}_{0}$ values of the Bell-c chart for various m and n ($μ = 3$ and $α = 0.01$).

Figure 2. The ${ARL}_{0}$ values of the Bell-c chart for various m and n ($μ = 15$ and $α = 0.01$).

Figure 3. The ${ARL}_{0}$ values of the Bell-c chart for various m and n ($μ = 3$ and $α = 0.1$).

Figure 4. The ${ARL}_{0}$ values of the Bell-c chart for various m and n ($μ = 15$ and $α = 0.1$).

Out-of-control ARL

In this subsection, we evaluate the detection ability of the proposed Bell charts by means of ${ARL}_{1}$, for the same scenarios as in the “In-control ARL” subsection. Due to space constraints, we consider only shifts at three levels that represent percentage increases $p$ in the nonconformities rate $μ$ of the Bell process. The hypothesized levels are as follows: $p = 0.5\%$ ($μ_{s} = 3.015$ and $15.075$), $1\%$ ($μ_{s} = 3.03$ and $15.15$) and $10\%$ ($μ_{s} = 3.3$ and $16.5$).

The ${ARL}_{1}$ values from the 1,000 MC simulations with $m^{*} = 5,000$ samples each are presented in Table III for the Bell-$c$ chart with equal sample sizes and in Table 6 (see Appendix A) for the Bell-$u$ chart with unequal sample sizes. From these tables, it can be clearly seen that the ${ARL}_{1}$ on average decreases with increasing $p$. Note also that, for both control charts based on the Bell distribution, the ${ARL}_{1}$ values are quite close to one when $p \geq 1\%$, regardless of the $m$, $n$ (or $\overline{n}$), $μ$ and $α$ values.

Table III
Mean (standard deviation in parentheses) values of ${ARL}_{1}$ of the Bell-c chart, for the different scenarios studied.

The impact of Bell data on some standard control charts

The Bell control chart theory introduced here may be applied in practical situations as a useful and exciting alternative to the well-known $c$ and $u$ charts developed via the Poisson assumption, and the $c m p c$ and $c m p u$ charts derived from the COM-Poisson distribution, among others, when data are overdispersed. Hence, in-control count data with overdispersion can be modeled well via the proposed Bell charts.

In this subsection, we apply the above-mentioned Poisson- and COM-Poisson-based control charts to sample data generated from the Bell distribution. The aim is to investigate, by means of simulations, the performance (in terms of ${ARL}_{0}$ and ${ARL}_{1}$) of these well-known control charts when they are applied to Bell processes. For comparison purposes, we also use the Bell-$c$ and Bell-$u$ charts for the cases when the samples are of equal and unequal sizes. The simulations were conducted using the same settings as described in the previous subsections. However, due to space and time limitations, we consider only $m = 100$ and $n = 100$ (or $\overline{n} = 100$). Of course, it may also be of interest to examine further the performance of the Bell-based control charts when analyzing count data generated from other distributions, including the Poisson and COM-Poisson distributions with some level of dispersion. Although we provide some preliminary results (indeed, based on a single artificial sample only) in the “Poisson data” and “COM-Poisson data” subsubsections, this issue will be better addressed in our future work.

Tables IV and V display, respectively, the ${ARL}_{0}$ and ${ARL}_{1}$ results for the control charts with equal sample sizes (or $c$-type control charts), while the performance measures for the control charts with unequal sample sizes (or $u$-type control charts) are shown in Tables AII and AIII of Appendix A. From these tables, it can be seen that, regardless of the scenario configuration, the Poisson-based control charts produced poor results (i.e., gave many false alarms) for the in-control Bell samples. This result is in agreement with some authors, e.g., Saghir & Lin (2015). On the other hand, the COM-Poisson-based control charts generally provided reasonable to good results for both the in-control and out-of-control Bell samples (notice that such results are sometimes close to the ones obtained from the Bell-based control charts). This finding is somewhat expected, as the COM-Poisson distribution has one extra parameter, which adds some flexibility to the model and, consequently, to the corresponding control chart.

Finally, it is worth pointing out that we obtained the CL and control limits (UCL and LCL) by finding the respective mean and standard deviation (SD) for each case, thus calculating the limits by the “$Mean \pm L \times SD$” rule of thumb. For the Poisson and Bell distributions, the mean is easily determined. Meanwhile, the COM-Poisson summary statistics have complicated formulas, but can be readily computed using the “compoisson” package (Dunn 2012) in R.

Table IV
Mean (standard deviation in parentheses) values of ${ARL}_{0}$ of the competitor c-type control charts when the true data-generating process is Bell($μ$) distributed, for some $μ$ and $α$ values (m=100 and n=100).

Table V
Mean (standard deviation in parentheses) values of ${ARL}_{1}$ of the competitor c-type control charts when the true data-generating process is Bell($μ$) distributed, for some $μ$, $α$ and p values (m=100 and n=100).

Applications

It is well known that there are practical situations in which the nature of the production process allows an item or product to contain several nonconformities (defects) and not be classified as nonconforming. For example, a manufactured personal computer might have one or more very minor flaws in the cabinet finish, but since these flaws do not seriously affect the unit’s functional operation, it could be classified as conforming (Montgomery 2013). Thus, in this section, we apply SPC charts to inspect the total or average number of defects per sample, and determine if the process is in control.

Simulated data examples

Here, we consider three artificial data sets containing some levels of overdispersion to illustrate the usefulness of the Bell- $c$ and Bell- $u$ charts, as well as their ability to produce bounds comparable to those established by the classical SPC theory.

In the cases where the samples are of equal sizes, the control charts chosen for comparison were the traditional $c$ chart and the $cmpc$ chart. The $c$ chart assumes that the Poisson distribution models the number of defects per inspection unit well. However, according to Sellers (2012), the $c$ chart, while having a better performance for large samples, does not work well in the cases of overdispersion. Thus, the author developed a control chart using a more general count distribution that relaxes the equidispersion assumption of the Poisson distribution. The so-called $cmpc$ chart, based on the assumption of a COM-Poisson distribution, has shown excellent results when fitting overdispersed count data. Therefore, it can also be used as a reference tool to compare the performance of the Bell-$c$ chart in this work. For the cases when the samples are of unequal sizes, the control charts selected for the comparison were the $u$ chart and the $cmpu$ chart.

Similar to the “Performance evaluation” section, all the analyses were done using the R programming language, and we assumed $L = 3$.

Poisson data

In this application, we generated $m = 50$ samples of size $n = 100$ from a Poisson distribution with parameter $λ = 3$. The idea here was to compare the performance of the Bell-$c$ and $c$ charts, mainly in phase 2. More specifically, in phase 1, we calculated the control limits of both charts from the aforementioned $m = 50$ samples, while in phase 2 we simulated $m^{*} = 70$ new samples of size $n = 100$ from the same distribution (Poisson), but with the last 20 samples disturbed by $λ_{s} = 3.3$. Then, for both estimated control charts, we examined the occurrence (or not) of false alarms in the first $50$ samples and the number of runs until the first of the 20 nonconforming samples was detected.

Figure 5. Performance comparison of Bell-c and c charts, both constructed from Poisson distribution samples (phase 2).

Figure 5 illustrates this application, where the points (samples) to the right of the blue dashed vertical line are the disturbing observations (out-of-control samples). Furthermore, the red solid horizontal lines represent the UCL and LCL, the blue solid horizontal line indicates the CL, and the green points correspond to the out-of-control signals.

For this particular artificial example, the figure shows that the Bell-based control chart produced better results than the Poisson-based control chart for the in-control samples, even though the true data-generating process is not Bell distributed: the Bell-$c$ chart gave a single false alarm, while the $c$ chart gave three.

Finally, for the samples that are known to be out-of-control, both charts performed similarly, showing that the Bell- $c$ chart may be a useful alternative to the traditional $c$ chart.

COM-Poisson data

As in the “Poisson data” subsubsection, in this application we generated $m = 50$ samples of equal size $n = 100$, but this time from a COM-Poisson distribution with parameters $λ = 1.5$ and $ν = 0.5$. Here, the idea was also to compare the performance of the Bell-$c$ and $cmpc$ charts in phase 2 monitoring. Phases 1 and 2 analyses were carried out in the same way as before, except that the last 20 samples (out of a total of $m^{*} = 70$ new samples) were perturbed by $ν_{s} = 0.75$.

Figure 6 shows the estimated control charts applied to the phase 2 samples. As in the previous subsubsection, the Bell-based control chart provided better results than the COM-Poisson-based control chart for the in-control samples: although the COM-Poisson is the true data-generating distribution, the Bell-$c$ chart gave a single false alarm, while the $cmpc$ chart gave three. Such excellent performance of the Bell-$c$ chart is maintained when analyzing out-of-control samples, showing that the Bell-$c$ chart can also be a useful, as well as simpler, alternative to the $cmpc$ chart.

Figure 6. Performance comparison of Bell-c and cmpc charts, both constructed from COM-Poisson distribution samples (phase 2).

Bell data

In this last application, we simulated $m = 50$ samples of unequal sizes (with $\overline{n} = 100$ , and each sample size obtained by following the same approach described in footnote 4) from a Bell distribution with parameter $μ = 3$ , and compared the performance of the Bell- $u$ chart against the $u$ chart and $c m p u$ chart. Once again, as in the previous subsubsections, we considered the application to phase 2 samples. The idea here was to evaluate the Poisson- and COM-Poisson-based control charts against the Bell-based control chart when the data actually come from a Bell distribution.
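The simulation set-up just described can be sketched in Python as follows. This is only an illustrative sketch, not the authors' R code. It relies on the compound Poisson representation of the Bell distribution (a Poisson( $e^{θ} - 1$ ) number of zero-truncated Poisson( $θ$ ) summands, with $θ = W_{0}(μ)$ ), and it assumes that $k_{j}$ in footnote 4 is drawn uniformly over the integers of the stated interval. All function names and the seed are hypothetical.

```python
import math
import random


def lambert_w0(x, tol=1e-12, max_iter=100):
    """Principal branch W0(x) for x >= 0: solves w * exp(w) = x by Newton's method."""
    if x < 0:
        raise ValueError("this sketch only handles x >= 0")
    w = math.log1p(x)  # start at or above the root, so the iterates decrease to it
    for _ in range(max_iter):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) < tol:
            break
    return w


def poisson_draw(rng, lam):
    """Knuth's multiplication method; adequate for the small rates used here."""
    threshold = math.exp(-lam)
    k, prod = 0, rng.random()
    while prod > threshold:
        k += 1
        prod *= rng.random()
    return k


def zero_truncated_poisson_draw(rng, lam):
    """Rejection sampling: redraw until a strictly positive count is obtained."""
    while True:
        k = poisson_draw(rng, lam)
        if k > 0:
            return k


def bell_draw(rng, theta):
    """One Bell(theta) variate via its compound Poisson representation."""
    n_jumps = poisson_draw(rng, math.exp(theta) - 1.0)
    return sum(zero_truncated_poisson_draw(rng, theta) for _ in range(n_jumps))


def simulate_bell_samples(mu, m, n_bar, seed=0):
    """m samples with sizes n_j = n_bar + k_j, k_j an integer in [-n_bar/5, n_bar/5]."""
    rng = random.Random(seed)
    theta = lambert_w0(mu)  # Bell mean is mu = theta * exp(theta)
    half_width = n_bar // 5
    samples = []
    for _ in range(m):
        n_j = n_bar + rng.randint(-half_width, half_width)  # assumed uniform k_j
        samples.append([bell_draw(rng, theta) for _ in range(n_j)])
    return samples


samples = simulate_bell_samples(mu=3.0, m=50, n_bar=100, seed=0)
rates = [sum(s) / len(s) for s in samples]  # plotted statistic of a u-type chart
```

Each entry of `rates` is the average count per inspection unit of one sample, which is the quantity a $u$ -type chart would compare against its control limits.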

From the analysis of Figure 7, it can be noticed that, in the case of conforming (in-control) samples, the Bell- $u$ chart, like the $c m p u$ chart, did not produce false alarms, while the $u$ chart identified four false alarms. However, the Bell-based control chart performed slightly better than the COM-Poisson-based control chart in the presence of nonconforming samples, where the Bell out-of-control samples were generated with $μ_{s} = 3.3$ .

Figure 7. Performance comparison of Bell-u, u and cmpu charts, all constructed from Bell distribution samples (phase 2).
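One way to see why the $u$ chart signals more often on Bell data is to compare the widths of the two sets of limits. The sketch below is a hedged illustration that assumes a known $μ$ , $L = 3$ and the usual Poisson-based limits $μ \pm L \sqrt{μ / n}$ for the $u$ chart; the helper names are hypothetical. The Bell- $u$ limits are wider by the factor $\sqrt{1 + W_{0}(μ)}$ , whatever the sample size.

```python
import math


def lambert_w0(x, tol=1e-12, max_iter=100):
    """Principal branch W0(x) for x >= 0: solves w * exp(w) = x by Newton's method."""
    if x < 0:
        raise ValueError("this sketch only handles x >= 0")
    w = math.log1p(x)
    for _ in range(max_iter):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) < tol:
            break
    return w


def u_chart_limits(mu, n, L=3.0):
    """Poisson-based u chart: variance of the per-unit rate is mu / n."""
    half = L * math.sqrt(mu / n)
    return mu - half, mu + half


def bell_u_chart_limits(mu, n, L=3.0):
    """Bell-u chart: variance of the per-unit rate is mu * (1 + W0(mu)) / n."""
    half = L * math.sqrt(mu * (1.0 + lambert_w0(mu)) / n)
    return mu - half, mu + half


mu, n = 3.0, 100
lcl_u, ucl_u = u_chart_limits(mu, n)
lcl_bell, ucl_bell = bell_u_chart_limits(mu, n)

# Ratio of half-widths; it equals sqrt(1 + W0(mu)) and does not depend on n.
width_ratio = (ucl_bell - mu) / (ucl_u - mu)
print(f"u chart limits:      ({lcl_u:.3f}, {ucl_u:.3f})")
print(f"Bell-u chart limits: ({lcl_bell:.3f}, {ucl_bell:.3f})")
print(f"width ratio:         {width_ratio:.3f}")
```

With varying sample sizes, the same functions can be called with $n_{j}$ in place of $n$ for each sample.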

Real data example

In this subsection, we considered the Bell- $c$ , $c m p c$ and $c$ charts to analyze the real data set related to the number of nonconformities in samples of 100 printed circuit boards; see Chapter 7 of Montgomery (2013). The data analysis initially comprises 20 samples with overdispersion (variance/mean $\approx$ 1.21), which suggests that the process can be described by a COM-Poisson distribution with $\hat{λ} = 2.87$ and $\hat{ν} = 0.37$ (Sellers 2012). Further, Alevizakos & Koukouvinos (2022) considered 20 additional samples, also with overdispersion (variance/mean $\approx$ 2.00), simulated from a COM-Poisson distribution with $λ = 3.09$ and $ν = 0.37$ , which implies a deterioration in the process; for more details, see Table 10 of Alevizakos & Koukouvinos (2022). Here, we checked the goodness-of-fit of the Bell distribution ( $\hat{μ} \approx 20$ ) by applying Pearson’s $χ^{2}$ test to the first 20 samples, for which we found a p-value $=$ 0.57. We also applied the same test to the last 20 samples, for which we obtained a p-value $=$ 0.98 ( $\hat{μ} \approx 24$ ).

Figure 8 shows the estimated Bell- $c$ , $c m p c$ , and $c$ charts, applied to the printed circuit boards data set, with control limits calculated from the initial 20 samples. That is, the left part of the green dashed vertical line is precisely the monitoring of the first 20 samples (phase 1), while the right part of this line represents the monitoring of the remaining 20 samples, which did not participate in the construction of the control limits (phase 2).

It can be observed from Figure 8 that the Bell- $c$ chart ⁷ detected the upward shift in the process mean as soon as it occurred. In contrast, the $c m p c$ chart did not identify any nonconformity in the process, neither on the left side nor on the right side of the green dashed vertical line. The $c$ chart, as expected, produced false alarms just to the left of this line, since it is not flexible enough to accommodate overdispersed data. Furthermore, it is essential to note that Alevizakos & Koukouvinos (2022) also considered the data set analyzed by Sellers (2012); they had already observed that the proposed chart did not signal the decline with any point above the UCL, which is undesirable. In our case, with the Bell- $c$ chart, the first warning appeared on unit #22, with three units outside the UCL correctly showing process deterioration. This result indicates that the Bell- $c$ chart is an excellent alternative to usual control charts, such as the $c m p c$ and $c$ charts.

Figure 8. Performance comparison of Bell-c, cmpc and c charts, constructed from the printed circuit boards data set.

Final remarks

The one-parameter discrete Bell distribution, introduced by Castellares et al. (2018), has been established as a viable alternative to the Poisson distribution for analyzing count data with overdispersion. As demonstrated by the examples provided in the “Applications” section, the Bell-based control charts can be useful alternatives to the traditional ones (e.g., the $c$ and $u$ charts derived from the Poisson distribution) for process monitoring of overdispersed count data. Notably, the proposed Bell charts provided satisfactory results (few false alarms and fast detection of process shifts) even when the sample data were generated from the two-parameter (and thus, more complex) COM-Poisson distribution.

In the “Performance evaluation“ section, simulation results also demonstrated the excellent performance of the proposed Bell charts under different scenarios (i.e., by varying the number of phase 1 samples, the sample size, the Bell distribution parameter and the probability of false alarm). In this case, the ARL criterion was used to measure control chart performance.
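As a reminder of how the ARL criterion behaves in the idealized case, the following Python sketch assumes independent plotted points and a fixed per-sample signal probability (that is, known control limits), so that the run length is geometric and the ARL equals the reciprocal of that probability. The function names are hypothetical, and the sketch does not reproduce the paper's simulations, in which $μ$ is estimated from phase 1 data.

```python
import random


def theoretical_arl(signal_prob):
    """ARL of a chart whose points signal independently with probability signal_prob."""
    return 1.0 / signal_prob


def simulated_arl(signal_prob, n_runs=20000, seed=1):
    """Monte Carlo estimate: average number of samples until the first signal."""
    rng = random.Random(seed)
    total = 0
    for _ in range(n_runs):
        run_length = 1
        while rng.random() >= signal_prob:  # no signal yet, draw the next sample
            run_length += 1
        total += run_length
    return total / n_runs


for alpha in (0.1, 0.01):
    print(alpha, theoretical_arl(alpha), round(simulated_arl(alpha, n_runs=5000), 2))
```

Under these assumptions, a false alarm probability of $α = 0.01$ corresponds to a nominal in-control ARL of 100, and $α = 0.1$ to a nominal in-control ARL of 10.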

In the applications provided in the “Applications” section, we determined the control limits based on the standard three-sigma rule (Six Sigma program). Alternatively, as in the “Performance evaluation” section, one can obtain the UCL and LCL via a specified type I error rate $α$ , so that the probability of finding sample points outside the control region is equal to $α$ . Finally, we considered a real data set related to the number of nonconformities in samples of 100 printed circuit boards, which demonstrated that our proposed Bell- $c$ chart outperformed the standard charts in detecting process deterioration.
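For concreteness, the three-sigma limits of the Bell charts can be computed as in the Python sketch below, which uses the expressions in terms of $μ$ and $W_{0}(μ)$ from the control-limit table. It is a minimal illustration with hypothetical function names, assuming a known (or already estimated) $μ$ ; truncating a negative LCL at zero is a common convention that we adopt here as an assumption.

```python
import math


def lambert_w0(x, tol=1e-12, max_iter=100):
    """Principal branch W0(x) for x >= 0: solves w * exp(w) = x by Newton's method."""
    if x < 0:
        raise ValueError("this sketch only handles x >= 0")
    w = math.log1p(x)
    for _ in range(max_iter):
        ew = math.exp(w)
        step = (w * ew - x) / (ew * (w + 1.0))
        w -= step
        if abs(step) < tol:
            break
    return w


def bell_limits(mu, n, L=3.0, chart="c"):
    """Return (LCL, CL, UCL) of the Bell-c or Bell-u chart for sample size n."""
    w = lambert_w0(mu)
    if chart == "c":    # total count per sample
        center = n * mu
        sd = math.sqrt(n * mu * (1.0 + w))
    elif chart == "u":  # average count per inspection unit
        center = mu
        sd = math.sqrt(mu * (1.0 + w) / n)
    else:
        raise ValueError("chart must be 'c' or 'u'")
    lcl = max(0.0, center - L * sd)  # assumption: a negative LCL is set to zero
    return lcl, center, center + L * sd


for kind in ("c", "u"):
    lcl, cl, ucl = bell_limits(mu=3.0, n=100, chart=kind)
    print(f"Bell-{kind}: LCL={lcl:.3f}, CL={cl:.3f}, UCL={ucl:.3f}")
```

Limits based on a type I error rate $α$ would instead require quantiles of the distribution of the plotted statistic, which this sketch does not attempt.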

There are many possible extensions of the current work. For instance, following Braun (1999), Chakraborti & Human (2008), Castagliola & Wu (2012), and Castagliola et al. (2014), we intend to develop optimization designs in order to increase the performance of the Bell control charts when the unknown process parameter $μ$ is estimated from phase 1 data.

Acknowledgments

The authors thank the anonymous referees, the editor, and the associate editor for all their useful comments and suggestions, which remarkably improved the quality of this article. Laion L. Boaventura acknowledges support from Bahia State Research Foundation (FAPESB, Proc. 19.573.201.5418). Paulo H. Ferreira acknowledges support from the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq, grant no. 307221/2022-9). The research of Rosemeire L. Fiaccone has been supported by FAPESB (n. app 0071/2016). Francisco Louzada is supported by CNPq (grant no. 301976/2017-1) and Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP, grant no. 2013/07375-0).

1
It is worth noting that the Bell number $B_{y}$ is the $y$ -th moment of a Poisson distribution with parameter equal to one; see Remark 1 of Castellares et al. (2018).
2
In the usual Six Sigma quality control program, $L = 3$ .
3
Note that, in this case, the control limits of the Bell- $u$ chart will also vary with the sample size, since we should replace $n$ by $n_{j}$ in their expressions. Alternatively, Montgomery (2013) suggested basing the control limit calculations on an average sample size $\overline{n}$ , which results in constant limits and is particularly helpful if the charts will be presented to management.
4
Here, each sample size $n_{j}$ is obtained by $n_{j} = \overline{n} + k_{j}$ , for $j = 1, \dots, m$ , where $k_{j}$ is a random integer constant ranging over the interval $[- \overline{n} / 5; \overline{n} / 5]$ .
5
As pointed out by Montgomery (2013), phases 1 and 2 of control chart application have distinct objectives. In phase 1, a set of process data is gathered and analyzed at once, constructing trial control limits to determine whether the process was in control when the initial $m$ samples were collected (retrospective analysis). On the other hand, in phase 2, the chart built from a “clean” set of process data exhibiting control (reliable control limits) is used for monitoring future production (prospective process monitoring).
6
For a further discussion on this relevant issue, see, e.g., the review paper by Jensen et al. (2006).
7
The R code used for generating this Bell- $c$ chart is available in Appendix B.


ALBERS W. 2011. Control charts for health care monitoring under overdispersion. Metrika 74(1): 67-83.
ALEVIZAKOS V & KOUKOUVINOS C. 2022. A progressive mean control chart for COM-Poisson distribution. Communications in Statistics-Simulation and Computation 51(3): 849-867.
AVCI E, ALTÜRK S & SOYLU EN. 2015. Comparison count regression models for overdispersed alga data. Int J Res Rev Appl Sci 25(1): 1-5.
BELL ET. 1934a. Exponential numbers. The American Mathematical Monthly 41(7): 411-419.
BELL ET. 1934b. Exponential polynomials. Ann Math 35(2): 258-277.
BRAUN WJ. 1999. Run length distributions for estimated attributes charts. Metrika 50(2): 121-129.
CASTAGLIOLA P & WU S. 2012. Design of the c and np charts when the parameters are estimated. Int J Reliab Qual Saf Eng 19(02): 1250010.
CASTAGLIOLA P, WU S, KHOO MBC & CHAKRABORTI S. 2014. Synthetic phase II Shewhart-type attributes control charts when process parameters are estimated. Qual Reliab Eng Int 30(3): 315-335.
CASTELLARES F, FERRARI SLP & LEMONTE AJ. 2018. On the Bell distribution and its associated regression model for count data. Applied Mathematical Modelling 56: 172-185.
CHAKRABORTI S & HUMAN SW. 2008. Properties and performance of the c-chart for attributes data. J Appl Stat 35(1): 89-100.
CHENG SS & YU FJ. 2013. A CUSUM control chart to monitor wafer quality. In: Proceedings of World Academy of Science, Engineering and Technology. 78. p. 1300. Citeseer.
COLY S, YAO AF, ABRIAL D & CHARRAS-GARRIDO M. 2016. Distributions to model overdispersed count data. Journal de la Société Française de Statistique 157(2): 39-64.
CORLESS RM, GONNET GH, HARE DEG, JEFFREY DJ & KNUTH DE. 1996. On the LambertW function. Adv Comput Math 5(1): 329-359.
DUNN J. 2012. compoisson: Conway-Maxwell-Poisson Distribution. URL https://CRAN.R-project.org/package=compoisson R package version 0.3.
DUTANG C, GOULET V & PIGEON M. 2008. actuar: An R package for Actuarial Science. J Stat Soft 25(7): 1-37.
FAMOYE F. 1994. Statistical control charts for shifted generalized Poisson distribution. J Ital Stat Soc 3(3): 339-354.
GOERG GM. 2011. Lambert W random variables - a new family of generalized skewed distributions with applications to risk estimation. Ann Appl Stat 5(3): 2197-2230.
GOERG GM. 2016. LambertW: An R package for Lambert W × F Random Variables. URL https://CRAN.R-project.org/package=LambertW R package version 0.6.4.
HOUGAARD P, LEE MLT & WHITMORE GA. 1997. Analysis of overdispersed count data by mixtures of Poisson variables and Poisson processes. Biometrics 53(4): 1225-1238.
JENSEN WA, JONES-FARMER LA, CHAMP CW & WOODALL WH. 2006. Effects of parameter estimation on control chart properties: a literature review. J Qual Technol 38(4): 349-364.
KAMINSKY FC, BENNEYAN JC, DAVIS RD & BURKE RJ. 1992. Statistical control charts based on a geometric distribution. J Qual Technol 24(2): 63-69.
MOHAMMED MA & LANEY D. 2006. Overdispersion in health care performance data: Laney’s approach. BMJ Quality & Safety 15(5): 383-384.
MONTGOMERY DC. 2013. Introduction to Statistical Quality Control. 7th ed. John Wiley & Sons.
OKAMURA H, PUNT AE & AMANO T. 2012. A generalized model for overdispersed count data. Pop Ecol 54(3): 467-474.
R CORE TEAM. 2018. R: A Language and Environment for Statistical Computing. (Version 3.6.1). R Foundation for Statistical Computing. Vienna, Austria.
SAGHIR A & LIN Z. 2015. Control charts for dispersed count data: an overview. Qual Reliab Eng Int 31(5): 725-739.
SELLERS KF. 2012. A generalized statistical control chart for over- or under-dispersed data. Qual Reliab Eng Int 28(1): 59-65.
SHEAFFER RL & LEAVENWORTH RS. 1976. The negative binomial model for counts in units of varying size. J Qual Technol 8(3): 158-163.
SPIEGELHALTER DJ. 2005. Handling over-dispersion of performance indicators. BMJ Quality & Safety 14(5): 347-351.

APPENDIX A

Figure A1. The $ARL_{0}$ values of the Bell-u chart for various m and $\overline{n}$ (μ = 3 and α = 0.01).

Figure A2. The $ARL_{0}$ values of the Bell-u chart for various m and $\overline{n}$ (μ = 15 and α = 0.01).

Figure A3. The $ARL_{0}$ values of the Bell-u chart for various m and $\overline{n}$ (μ = 3 and α = 0.1).

Figure A4. The $ARL_{0}$ values of the Bell-u chart for various m and $\overline{n}$ (μ = 15 and α = 0.1).

Table AI. Mean (standard deviation in parentheses) values of $ARL_{1}$ of the Bell-u chart, for the different scenarios studied.

Table AII. Mean (standard deviation in parentheses) values of $ARL_{0}$ of the competitor u-type control charts when the true data-generating process is Bell(μ) distributed, for some μ and α values (m = 100 and $\overline{n}$ = 100).

Table AIII. Mean (standard deviation in parentheses) values of $ARL_{1}$ of the competitor u-type control charts, when the true data-generating process is Bell(μ) distributed, for some μ, α and p values (m = 100 and $\overline{n}$ = 100).

Appendix B

Publication Dates

Publication in this collection
05 June 2023
Date of issue
2023

History

Received
3 Mar 2020
Accepted
30 Aug 2020

This is an open-access article distributed under the terms of the Creative Commons Attribution License


	Bell- $c$ chart	Bell- $u$ chart
UCL	$n θ e^{θ} + L \sqrt{n θ (1 + θ) e^{θ}}$	$θ e^{θ} + L \sqrt{\frac{θ (1 + θ) e^{θ}}{n}}$
CL	$n θ e^{θ}$	$θ e^{θ}$
LCL	$n θ e^{θ} - L \sqrt{n θ (1 + θ) e^{θ}}$	$θ e^{θ} - L \sqrt{\frac{θ (1 + θ) e^{θ}}{n}}$

			$n = 50$			$n = 100$			$n = 500$
			$p = 0.5 %$	$p = 1 %$	$p = 10 %$	$p = 0.5 %$	$p = 1 %$	$p = 10 %$	$p = 0.5 %$	$p = 1 %$	$p = 10 %$
$m = 20$	$μ = 3$	$α = 0.1$	12.442	1.272	1.013	12.942	1.178	1.015	14.331	1.251	1.011
		$α = 0.1$	(0.112)	(0.011)	(0.023)	(0.132)	(0.011)	(0.024)	(0.111)	(0.012)	(0.011)
		$α = 0.01$	117.212	1.244	1.021	112.732	1.162	1.016	110.213	1.267	1.013
		$α = 0.01$	(0.223)	(0.011)	(0.024)	(0.263)	(0.011)	(0.023)	(0.314)	(0.012)	(0.011)
	$μ = 15$	$α = 0.1$	17.131	1.151	1.063	12.622	1.265	1.011	11.972	1.145	1.023
		$α = 0.1$	(0.114)	(0.012)	(0.022)	(0.211)	(0.011)	(0.022)	(0.111)	(0.010)	(0.011)
		$α = 0.01$	119.634	1.242	1.092	117.000	1.171	1.027	118.303	1.263	1.018
		$α = 0.01$	(0.411)	(0.010)	(0.021)	(0.410)	(0.011)	(0.023)	(0.382)	(0.012)	(0.014)
$m = 50$	$μ = 3$	$α = 0.1$	13.721	1.127	1.043	17.092	1.264	1.018	19.191	1.233	1.006
		$α = 0.1$	(0.110)	(0.011)	(0.023)	(0.111)	(0.014)	(0.023)	(0.112)	(0.011)	(0.012)
		$α = 0.01$	123.232	1.257	1.021	117.500	1.265	1.016	112.621	1.237	1.002
		$α = 0.01$	(0.411)	(0.014)	(0.022)	(0.332)	(0.011)	(0.024)	(0.373)	(0.012)	(0.011)
	$μ = 15$	$α = 0.1$	12.202	1.171	1.033	12.210	1.227	1.018	11.993	1.221	1.011
		$α = 0.1$	(0.111)	(0.013)	(0.024)	(0.104)	(0.011)	(0.023)	(0.214)	(0.013)	(0.012)
		$α = 0.01$	112.532	1.103	1.092	119.124	1.174	1.019	112.6	1.249	1.014
		$α = 0.01$	(0.311)	(0.014)	(0.023)	(0.282)	(0.011)	(0.024)	(0.423)	(0.012)	(0.010)
$m = 100$	$μ = 3$	$α = 0.1$	11.922	1.114	1.005	11.961	1.097	1.008	10.992	1.281	1.009
		$α = 0.1$	(0.111)	(0.014)	(0.023)	(0.102)	(0.012)	(0.024)	(0.113)	(0.012)	(0.011)
		$α = 0.01$	112.124	1.194	1.032	101.812	1.267	1.018	101.924	1.152	1.011
		$α = 0.01$	(0.410)	(0.012)	(0.024)	(0.352)	(0.011)	(0.024)	(0.253)	(0.012)	(0.012)
	$μ = 15$	$α = 0.1$	12.304	1.113	1.042	11.994	1.154	1.009	12.261	1.219	1.004
		$α = 0.1$	(0.110)	(0.012)	(0.024)	(0.213)	(0.012)	(0.021)	(0.224)	(0.012)	(0.013)
		$α = 0.01$	117.311	1.124	1.008	116.223	1.227	1.009	102.172	1.107	1.000
		$α = 0.01$	(0.182)	(0.013)	(0.024)	(0.213)	(0.012)	(0.022)	(0.214)	(0.014)	(0.012)

	$μ = 3$		$μ = 15$
Control chart	$α = 0.01$	$α = 0.1$	$α = 0.01$	$α = 0.1$
Bell- $c$	97.0814	9.8157	101.1592	9.9066
	(13.4512)	(0.4083)	(15.0990)	(0.3851)
$c$	13.8551	4.0038	7.3268	2.9126
	(0.6888)	(0.0965)	(0.2633)	(0.0558)
$c m p c$	117.8809	10.6975	105.1765	9.4851
	(17.9508)	(0.4711)	(16.4531)	(0.3732)

	$μ = 3$
	$α = 0.01$			$α = 0.1$
Control chart	$p = 0.5 %$	$p = 1 %$	$p = 10 %$	$p = 0.5 %$	$p = 1 %$	$p = 10 %$
Bell- $c$	91.2050	74.9862	12.6206	9.7069	9.2611	2.8119
	(12.6312)	(9.7344)	(0.6236)	(0.3904)	(0.3662)	(0.0549)
$c$	13.5571	12.4846	4.1535	3.9747	3.8935	1.8493
	(0.6768)	(0.6042)	(0.1048)	(0.0929)	(0.0908)	(0.0243)
$c m p c$	113.8549	86.4678	14.4834	10.9720	10.2203	3.0474
	(17.8464)	(12.2304)	(0.7670)	(0.4775)	(0.4228)	(0.0633)
	$μ = 15$
	$α = 0.01$			$α = 0.1$
Control chart	$p = 0.5 %$	$p = 1 %$	$p = 10 %$	$p = 0.5 %$	$p = 1 %$	$p = 10 %$
Bell- $c$	91.9277	50.3049	2.6450	9.8956	8.3245	1.5320
	(12.4999)	(5.2314)	(0.0472)	(0.4038)	(0.3101)	(0.0158)
$c$	7.1346	5.5831	1.3094	2.9266	2.7210	1.1663
	(0.2476)	(0.1671)	(0.0105)	(0.0573)	(0.0505)	(0.0066)
$c m p c$	102.9636	27.0492	2.8850	9.8275	9.1967	1.5444
	(15.3876)	(2.0920)	(0.0543)	(0.4035)	(0.3691)	(0.0160)

			$\overline{n} = 50$			$\overline{n} = 100$			$\overline{n} = 500$
			$p = 0.5 %$	$p = 1 %$	$p = 10 %$	$p = 0.5 %$	$p = 1 %$	$p = 10 %$	$p = 0.5 %$	$p = 1 %$	$p = 10 %$
$m = 20$	$μ = 3$	$α = 0.1$	14.233	1.172	1.013	13.242	1.228	1.015	15.233	1.151	1.011
		$α = 0.1$	(0.142)	(0.024)	(0.012)	(0.271)	(0.024)	(0.022)	(0.111)	(0.011)	(0.014)
		$α = 0.01$	111.223	1.214	1.011	106.024	1.162	1.016	121.242	1.227	1.011
		$α = 0.01$	(0.282)	(0.022)	(0.024)	(0.482)	(0.021)	(0.024)	(0.261)	(0.022)	(0.014)
	$μ = 15$	$α = 0.1$	11.631	1.124	1.046	17.261	1.242	1.011	14.242	1.224	1.024
		$α = 0.1$	(0.124)	(0.013)	(0.022)	(0.113)	(0.023)	(0.022)	(0.111)	(0.014)	(0.010)
		$α = 0.01$	113.224	1.174	1.072	112.424	1.161	1.024	115.222	1.262	1.012
		$α = 0.01$	(0.212)	(0.014)	(0.014)	(0.213)	(0.013)	(0.011)	(0.222)	(0.013)	(0.014)
$m = 50$	$μ = 3$	$α = 0.1$	12.274	1.156	1.022	13.292	1.245	1.018	12.262	1.232	1.005
		$α = 0.1$	(0.114)	(0.012)	(0.013)	(0.112)	(0.014)	(0.024)	(0.112)	(0.014)	(0.011)
		$α = 0.01$	116.142	1.246	1.026	113.524	1.244	1.026	111.263	1.235	1.002
		$α = 0.01$	(0.211)	(0.013)	(0.014)	(0.222)	(0.011)	(0.022)	(0.203)	(0.014)	(0.012)
	$μ = 15$	$α = 0.1$	11.242	1.151	1.045	12.561	1.213	1.018	12.291	1.121	1.011
		$α = 0.1$	(0.091)	(0.011)	(0.014)	(0.113)	(0.012)	(0.023)	(0.102)	(0.012)	(0.014)
		$α = 0.01$	118.524	1.193	1.032	112.100	1.171	1.018	114.100	1.159	1.004
		$α = 0.01$	(0.314)	(0.022)	(0.024)	(0.222)	(0.014)	(0.020)	(0.202)	(0.014)	(0.011)
$m = 100$	$μ = 3$	$α = 0.1$	12.522	1.104	1.005	16.120	1.197	1.007	13.692	1.126	1.009
		$α = 0.1$	(0.112)	(0.013)	(0.023)	(0.214)	(0.012)	(0.024)	(0.142)	(0.012)	(0.014)
		$α = 0.01$	118.124	1.163	1.012	115.242	1.197	1.018	103.024	1.151	1.006
		$α = 0.01$	(0.292)	(0.014)	(0.012)	(0.262)	(0.014)	(0.014)	(0.412)	(0.022)	(0.014)
	$μ = 15$	$α = 0.1$	12.392	1.102	1.041	16.221	1.151	1.009	12.224	1.222	1.004
		$α = 0.1$	(0.114)	(0.010)	(0.023)	(0.210)	(0.012)	(0.024)	(0.113)	(0.012)	(0.011)
		$α = 0.01$	115.224	1.183	1.008	116.524	1.126	1.008	119.503	1.206	1.000
		$α = 0.01$	(0.432)	(0.014)	(0.022)	(0.330)	(0.012)	(0.024)	(0.332)	(0.013)	(0.010)


	Bell- $c$ chart	Bell- $u$ chart
UCL	$n μ + L \sqrt{n μ [1 + W_{0} (μ)]}$	$μ + L \sqrt{\frac{μ [1 + W_{0} (μ)]}{n}}$
CL	$n μ$	$μ$
LCL	$n μ - L \sqrt{n μ [1 + W_{0} (μ)]}$	$μ - L \sqrt{\frac{μ [1 + W_{0} (μ)]}{n}}$

	$μ = 3$		$μ = 15$
Control chart	$α = 0.01$	$α = 0.1$	$α = 0.01$	$α = 0.1$
Bell- $u$	96.3690	10.2200	96.5712	9.7825
	(13.5892)	(0.4345)	(13.6545)	(0.4296)
$u$	13.7370	4.0295	7.2248	2.8840
	(0.6982)	(0.0972)	(0.2526)	(0.0551)
$c m p u$	112.8635	10.6225	76.3384	7.1528
	(17.3837)	(0.4540)	(9.3869)	(0.2550)

	$μ = 3$
	$α = 0.01$			$α = 0.1$
Control chart	$p = 0.5 %$	$p = 1 %$	$p = 10 %$	$p = 0.5 %$	$p = 1 %$	$p = 10 %$
Bell- $u$	90.8468	84.1567	13.4427	10.1559	9.9885	3.0521
	(12.3245)	(11.6707)	(0.6791)	(0.4489)	(0.4074)	(0.0619)
$u$	13.5156	13.1820	4.2784	4.0293	3.9949	1.9329
	(0.6548)	(0.6323)	(0.1085)	(0.0989)	(0.0984)	(0.0267)
$c m p u$	108.1874	89.3612	15.3133	10.6599	10.5533	3.2587
	(16.5333)	(12.5799)	(0.8142)	(0.4826)	(0.4375)	(0.0693)
	$μ = 15$
	$α = 0.01$			$α = 0.1$
Control chart	$p = 0.5 %$	$p = 1 %$	$p = 10 %$	$p = 0.5 %$	$p = 1 %$	$p = 10 %$
Bell- $u$	93.6208	83.0078	3.0950	10.0302	9.1960	1.4209
	(12.9182)	(10.8026)	(0.0612)	(0.4335)	(0.3735)	(0.0131)
$u$	7.2176	6.9502	1.3911	2.9273	2.8343	1.1289
	(0.2531)	(0.2402)	(0.0123)	(0.0584)	(0.0562)	(0.0055)
$c m p u$	69.3032	82.7237	3.0015	7.4049	9.7473	1.4464
	(8.0359)	(10.6946)	(0.0575)	(0.2543)	(0.4111)	(0.0138)