A weighted negative binomial Lindley distribution with applications to dispersed data

BAKOUCH, HASSAN S

doi:10.1590/0001-3765201820170733

Abstract

A new discrete distribution is introduced. The distribution involves the negative binomial and size biased negative binomial distributions as sub-models among others and it is a weighted version of the two parameter discrete Lindley distribution. The distribution has various interesting properties, such as bathtub shape hazard function along with increasing/decreasing hazard rate, positive skewness, symmetric behavior, and over- and under-dispersion. Moreover, it is self decomposable and infinitely divisible, which makes the proposed distribution well suited for count data modeling. Other properties are investigated, including probability generating function, ordinary moments, factorial moments, negative moments and characterization. Estimation of the model parameters is investigated by the methods of moments and maximum likelihood, and a performance of the estimators is assessed by a simulation study. The credibility of the proposed distribution over the negative binomial, Poisson and generalized Poisson distributions is discussed based on some test statistics and four real data sets.

Key words
characterization; discrete distributions; Estimation; Vuong test statistic; mixture distributions; thunderstorms data

INTRODUCTION

In observational studies it is observed that due to lack of well defined sampling frames for plant, human, insect, wildlife and fish populations, the scientists/ researchers cannot select sampling units with equal probability. As a result the recorded observations on individuals in these populations are biased unless every observation is given an equal chance of being recorded. As such biased data arise in all disciplines of science, so statisticians and researchers have done their best to find out solutions for correction of the biases. In this regard, the weighted distribution theory gives a unified approach for modeling biased data, Fisher (1934)FISHER RA. 1934. The effects of methods of ascertainment upon the estimation of frequencies. Ann Eugenic 6: 13-25., later on Rao (1985)RAO CR. 1985. Weighted distributions arising out of methods of ascertainment. In Atkinson AC AND Fienberg SE (Eds.), A Celebration of Statistics Chapter 24. pp. 543-569. New York: Springer-Verlag. and Patil (1991)PATIL GP. 1991. Encountered data, statistical ecology, environmental statistics, and weighted distribution methods. Environmetrics 2: 377-423. studied it in a unified manner. Those pioneers have pointed out the situations in which the recorded observations cannot be considered as a random sample from the original distribution like non-experimental, non-replicated and non-random categories. This may be due to one or more reasons, such as non-observability of some events, damage caused to original observations and adoption of unequal probability sampling (see Jain et al. 2014JAIN K, SINGLA N AND GUPTA RD. 2014. A weighted version of gamma distribution. Discus Math Prob Stat 34: 89-111.). Patil and Rao (1977)PATIL GP AND RAO CR. 1977. Weighted distributions: a survey of their application. In Krishnaiah PR (Ed.), Applications of Statistics, pp. 383-405. North Holland Publishing Company. stated that " Although the situations that involve weighted distributions seem to occur frequently in various fields, the underlying concept of weighted distributions as a major stochastic concept does not seem to have been widely recognized", the same quote is applicable today but, unfortunately a rare work is seen on this topic.

In order to remove the biases and to obtain the suitable distribution, the researchers usually adopt the approach of weighted concept of biased observation which leads towards the development of a weighted distribution. For a non-negative integer valued random variable X, its probability mass function can be defined as $P (X = k) = p (k; θ,$ where θ is a vector parameter, and let ω(k) be a non negative function on $N_{0} = 0, 1, . . .$ which does not need to be zero or one and may exceed unity but it should have a finite expectation, i.e., $𝔼 (ω (X = k)) = \sum_{k = 0}^{\infty} ω (k) P (X = k) < \infty$ . Suppose that when the event occurs, the weight $ω (k)$ is used to adjust the probability. Therefore, the record k is thus the realization of count random variable Y which is a weighted version of X and its probability mass function (pmf) is given by

P (Y = k) = p_{Y} (k; θ) = \frac{w (k; ψ) P (X = k)}{𝔼 (w (X; ψ))}, k \in N_{0},

(1)

where $ω (k; ψ) : \equiv ω (k)$ can depend on a parameter ψ representing the recording mechanism, and it may also be connected to the underlying initial vector parameter θ. A detailed account of weight functions is available in Patil and Rao (1986)PATILL GP, RAO CR AND RATNAPARKHI M. 1986. On discrete weighted distribution and their use in model choice for observed data. Commun Statisit Theory Math 15: 907-918.. Some particular cases of the weight functions are the standard distributions with $ω (k; ψ) =$ Constant, $\forall k \in N_{0}$ , the size-biased distributions with weight $ω (k; ψ) = k,$ $\forall k \in N_{0}$ , the popular COM-Poisson with weight $ω (k; ψ) = (k!)^{1 - ψ},$ $\forall k \in N_{0}$ (see Balakrishnan 2014BALAKRISHNAN N. 2014. Methods and Applications of Statistics in Clinical Trials: Planning, Analysis and Inferential Methods. J Wiley & Sons, Inc., pp. 511-512), Neel and Schull (1966)NEEL JV AND SCHULL WJ. 1966. Human Heredity. Chicago: University of Chicago Press. proposed the weighted Poisson distribution with a weight as Poisson distribution, i.e., $ω (k; ψ) = \frac{ψ^{k} e^{- ψ}}{k!}, ψ > 0, \forall k \in N_{0}$ and binomial distribution with weight function $ω (k; ψ) = 1 - (1 - ψ)^{k},$ $0 < ψ < 1, \forall k \in N_{0}$ (see Kokonendji and Casany 2012KOKONENDJI CC AND CASANY MP. 2012. A note on weighted count distributions. J Stat Theory Appl 11: 337-352.).

In this paper, we construct a weighted version of the two parameter discrete Lindley distribution introduced by Hussain et al. (2016)HUSSAIN T, ASLAM M AND AHMAD M. 2016. A two parameter discrete Lindley distribution. Rev Colomb Estad 39: 45-61. by making use of the weight function $ω (k; β) = (\frac{β + k - 1}{k})$ . Construction and motivations of the introduced distribution are outlined in the section below. Rest of the paper is organized as follows. The section ’Statistical Properties Of The WNBL Distribution’ gives several properties of the proposed distribution, such as probability generating function, moments, factorial moments, recurrence relation between moments and negative moments. In the section ’Parameter Estimation and Inference’, the estimation of the model parameters is investigated by the methods of moments and maximum likelihood, and performance of the estimators is assessed by a simulation study. A characterization for the distribution via the probability generating function is investigated with a discussion to self decomposability and infinite divisibility. Credibility of the proposed distribution over the negative binomial, Poisson and generalized Poisson distributions is also discussed based on some evaluation statistics and four real data sets. The last section gives the drawn conclusion.

THE PROPOSED DISTRIBUTION WITH MOTIVATIONS

Hussain et al. (2016) developed the two parameter discrete Lindley distribution with the pmf

P (X = k) = \frac{(1 - p)^{2} (1 + β k) p^{k}}{(1 + β p - p)},

(1)

$k \in N_{0},$ $p \in (0, 1)$ and $β \geq 0$

Considering the weight function $ω (k; β) = (\frac{β + k - 1}{k})$ and substituting it along with equation (2) into equation (1), we get

P (Y = k) = p_{k} = \frac{1}{q + β^{2} p} (\frac{β + k - 1}{k}) (1 + β k) p^{k} q^{β + 1},

(3)

$k \in N_{0},$ $q = 1 - p,$ $p \in (0, 1)$ and $β \geq 0$ .

Hereafter, the weighted version of the two parameter discrete Lindley distribution shall be denoted by the weighted negative binomial Lindley (WNBL) distribution. Now, we give some motivations for the proposed distribution.

Motivation 1. The pmf of the negative binomial (NB) distribution is given as

P (Y = k) = (\frac{β + k - 1}{k}) p^{k} {(1 - p)}^{β}

$k,$ $β \in N_{0}$ and $p \in (0, 1),$

or equivalently

P (Y = k) = C (\frac{β + k - 1}{k}) p^{k},

where k denotes the number of failures/success before the $β^{t h}$ failure/success, p is the probability of success and C is the normalization constant which equals ${(1 - p)}^{β}$ . In the NB distribution, we repeat the number of Bernoulli trials in order to obtain a fixed number of successes, while if the number of failures/successes increases linearly on each trial then the probability of exactly k failures/successes can be expressed as $P (Y = k) = 𝔸 (\frac{β + k - 1}{k}) (1 + β k) p^{k},$ where $𝔸 = \frac{q^{β + 1}}{q + β^{2} p},$ which is the pmf of the WNBL distribution.

Moreover, it can be noted that the probability function given by (2) is a generalized form of the NB with vector parameter $(2, p)$ and also the discrete Lindley with vector parameter $(1, p) .$

Motivation 2.

The binomial coefficient $(\frac{β + k - 1}{k})$ in the equation (3) represents the number of ways in which the fixed number of failures/successes are obtained when these failures/successes increase linearly on each trial.

Motivation 3.

It is also worth mentioning that the model given in equation (3) is recognized as a mixture of the NB distribution and the size biased negative binomial distribution (SBNB), that is,

p_{k} = \frac{q}{q + p β^{2}} N B (k; p, β) + \frac{p β^{2}}{q + p β^{2}} S B N B (k - 1; p, β) .

Motivation 4.

The model given by (3) can describe the thunderstorm activities as shall be outlined in the application section using two real data sets from such area. Thunderstorms activities often affect the planning of space missions and launch operations at different launching stations because of unstable meteorological norms associated with them. For this purpose several statistical distributions are used to represent the variation of the thunderstorms activities and search out that negative binomial and its modified forms are usually preferred for such activities (see Sakamoto 1973SAKAMOTO CM. 1973. Application of the Poisson and negative binomial models to thunderstorm and hail days probabilities in Nevada. Monthly Weather Review 101: 350-355.). Therefore, we model such activities by introducing a weighted and mixed version of the negative binomial and Lindley distributions. The weighted distributions are important for the adjustment of probability of occurrences and removal of biasedness from the data, while the mixed distributions are usually used in heterogeneous and over-dispersed data sets. The proposed model not only overcomes the heterogeneity issue but also handles the over-dispersion because of its mixture representation which makes it better than the negative binomial, Poisson and generalized Poisson distributions.

Further motivations for the WNBL distribution are given in the next notes.

Figure 1
Probability Graphs for the Indicated Values of p and b of WNBL.

Figure 1 portrays that the WNBL distribution shows symmetrical behavior when p < 0.20 and β increases. Whereas, for smaller β and higher p, the distribution becomes positively skewed. The distribution also exhibits bimodal and reverse J shapes.

Now, we give two prime measures of the distribution.

The survival and hazard rate functions of the WNBL distribution are

S_{k} = \frac{(1 - p)^{β + 1} p^{k} (β)_{k}}{(1 + p (β^{2} - 1)) x k!} \times {(1 + β k)_{2} F_{1} (β + k, 1; k + 1; p) + \frac{β p (β + k)_{2} F_{1} (β + k + 1, 2; k + 2; p)}{(k + 1)}}

(4)

and

h_{k} = \frac{(1 + β k) (k + 1)}{(1 + β k) (k + 1)_2F_1 (β + k, 1; k + 1; p) + β p (β + k)_2F_1 (β + k + 1, 2; k + 2; p)},

(5)

respectively. Where

_2F_1 (a, b; c; z) = \sum_{n = 0}^{\infty} \frac{(a)_{n} (b)_{n} z^{n}}{(c)_{n} n!},

and $(a)_{n} = a (a + 1) (a + 2) \dots (a + n - 1)$ , are the hypergeometric series function and the Pochhammer’s symbol, respectively, the series converges for $a, b, c \geq 0$ and $| z | \leq 1$ .

Clearly

l i m_{k \to \infty} h_{k} = (1 - p)^{2},

therefore, the hazard rate function of the WNBL distribution is bounded above, which is an important property for the lifetime models.

Figure 2 displays increasing, decreasing and bathtub shape (BTS) hazard functions. It is observed that the traditional statistical distributions, such as the Poisson, generalized Poisson and NB distributions, cannot be used efficiently in models of count data with many zeros. The Poisson distribution tends to under-estimate the number of zeros, while the NB may over-estimate zeros (see Saengthong and Bodhisuwan 2013SAENGTHONG P AND BODHISUWAN W. 2013. Negative binomial-crack (NB-CR) distribution. Int J Pure Appl Math 84: 213-230.). Although the traditional count models have generally increasing or decreasing failure rates yet unable to exhibit BTS hazard rates.

Figure 2
Hazard Function Graphs for the Indicated Values of p and b of WNBL.

In view of the discussion above, some other motivational factors of the proposed model shall be outlined later and they are: i) Possessing BTS hazard function along with increasing/decreasing hazard rate characteristics which are seldom observed in discrete distributions. ii) Possessing bounded hazard function iii) Being an over- and under-dispersed statistical model. iv) Being a self decomposable and infinitely divisible model. v) Being a characterizable function which is a milestone in model selection.

STATISTICAL PROPERTIES OF THE WNBL DISTRIBUTION

The recognition of any discrete probability distribution is usually based on its probability generating function (pgf). The following theorem gives the pgf of the WNBL distribution.

Proposittion1: If $Y \sim$ WNBL $(p, β),$ then the pgf of the random variable Y is expressed as

G_{Y} (t) = \frac{(1 + p t (β^{2} - 1)) q^{β + 1}}{(1 + p (β^{2} - 1)) (1 - p t)^{β + 1}},

(6)

where $p \in (0, 1)$ and $β \geq 0 .$

Proof: The pgf from the definition is expressed as $G_{Y} (t) = \sum_{x = 0}^{\infty} t^{x} P (Y = x)$ . Using equation (3), we have

G_{Y} (t) = \frac{q^{β + 1}}{1 + p (β^{2} - 1)} \sum_{x = 0}^{\infty} (\frac{β + x - 1}{x}) (p t)^{x} (1 + β x)

which by simplification, equation (6) is obtained, which completes the proof.

Corollary 1: In equation (6), if t is replaced by e^t, we get the moment generating function (mgf) of the WNBL distribution as

M_{Y} (t) = \frac{(1 + p e^{t} (β^{2} - 1)) q^{β + 1}}{(1 + p (β^{2} - 1)) (1 - p e^{t})^{β + 1}},

(7)

where $0 < p e^{t} < 1$ .

The r^th derivatives of equation (7) with respect to t at t = 0 yield the moments. In particular, for r = 1 we have

μ_{1}^{'} = \frac{β p (1 + β + p (β^{2} - 1))}{(1 - p) (1 + p (β^{2} - 1))},

for r = 2 we have

μ_{2}^{'} = \frac{β (1 + β) p (1 - p (1 - 3 β - (β - 1) β p))}{(1 - p)^{2} (1 + p (β^{2} - 1))},

for r = 3 we have

μ_{3}^{'} = \frac{β (1 + β) p (1 + p (- p + β (7 + p (- 1 + 6 β + (- 1 + β) β p))))}{(1 - p)^{3} (1 + p (β^{2} - 1))}

and for r = 4 we have

\begin{matrix} μ_{4}^{'} & = & \frac{1}{(1 - p)^{4} (1 + p (β^{2} - 1))} \times \\ {β (1 + β) p (1 + p (3 + 15 β + (- 3 + β (11 + 25 β)) p \\ + (- 1 + β (- 2 + β + 10 β^{2})) p^{2} + (- 1 + β) β^{3} p^{3}))} . \end{matrix}

Moreover, by definition, the r^th non central moment can be expressed as

μ_{r}^{'} = \frac{(1 - p)^{β + 1}}{1 + p (β^{2} - 1)} \sum_{x = 0}^{\infty} x^{r} (\frac{β + x - 1}{x}) (1 + β x) p^{x} .

Differentiating the equation above with respect to p we get

\begin{matrix} \frac{d μ_{r}^{'}}{d p} & = & - \sum_{x = 0}^{\infty} x^{r} (\frac{β + x - 1}{x}) (1 + β x) p^{x} {\frac{(β^{2} - 1) (1 - p)^{β + 1}}{(1 + p (β^{2} - 1))^{2}} + \frac{(β + 1) (1 - p)^{β}}{(1 + p (β^{2} - 1))}} \\ + \frac{(1 - p)^{β + 1}}{1 + p (β^{2} - 1)} \sum_{x = 0}^{\infty} x^{r + 1} (\frac{β + x - 1}{x}) (1 + β x) p^{x - 1}, \end{matrix}

which by simplification yields the recurrence relation

μ_{r + 1}^{'} = p \frac{d μ_{r}^{'}}{d p} + \frac{p (β + 1) ((β - 1) (1 - p) + 1 + p (β^{2} - 1))}{(1 - p) (1 + p (β^{2} - 1))^{2}} μ_{r}^{'},

$r = 0, 1, \dots,$ $μ_{0}^{'} = 1$ .

Similarly, by definition, the r^th central moment can be expressed as

μ_{r} = \frac{(1 - p)^{β + 1}}{1 + p (β^{2} - 1)} \sum_{x = 0}^{\infty} (x - \frac{β p (1 + β + p (β^{2} - 1))}{(1 - p) (1 + p (β^{2} - 1))})^{r} (\frac{β + x - 1}{x}) (1 + β x) p^{x} .

Differentiating the above equation with respect to p and simplifying it, we get

μ_{r + 1} + μ_{1}^{'} μ_{r} = p \frac{d μ_{r}}{d p} + \frac{p β (β + 1) (1 - p)^{β} (1 + p (β - 1))}{(1 + p (β^{2} - 1))^{2}} μ_{r} + \frac{r (β + 1) ((1 + p (β^{2} - 1))^{2} + (β - 1) (1 - p)^{2})}{(1 - p)^{2} (1 + p (β^{2} - 1))^{2}} μ_{r - 1}

$r = 1, 2, \dots, μ_{0}^{'} = 1$ .

OVER-AND UNDER-DISPERSION

In statistics, the phenomenon of over- and under-dispersion relative to the Poisson distribution is generally observed in count data and well known in statistical literature. There are various causes of such phenomenon, like heterogeneity and aggregation for over-dispersion and repulsion for under-dispersion although less frequent (see Kokonendji and Mizre 2005KOKONENDJI CC AND MIZRE D. 2005. Overdispersion and Underdispersion Characterization of Weighted Poisson Distribution (Technical Report No. 0523) France LMA. Technical report.). In order to see the reflection of over- and under-dispersion pattern, the researchers usually take the support of the index of dispersion (ID) which is defined as variance-to-mean ratio, which indicates the suitability of distribution in, under- or over-dispersed data sets. If $I D > 1 (< 1)$ the distribution is over-dispersed (under-dispersed). The index of dispersion for the WNBL distribution is

I D = \frac{1}{1 - p} - \frac{1}{1 - p + β p} + \frac{1}{1 + p (β^{2} - 1)} .

For indicating the dispersion pattern we first consider that β < 1 which implies that $β^{2} - 1 \leq β - 1$ . If $β^{2} - 1 = - C$ and $β - 1 = - D$ then ID can be re expressed as

I D = \frac{1}{1 - p} - \frac{1}{1 - p D} + \frac{1}{1 - p C},

since C > D then 1- pC < 1-pD or $\frac{1}{1 - p C} - \frac{1}{1 - p D} > 0$ . Therefore, the WNBL is an over-dispersed model for all values of p. However, if β >1 this implies that $β^{2} - 1 > β - 1$ . Let $β^{2} - 1 = L$ and $β - 1 = S,$ then ID can be rewritten as

I D = \frac{1}{1 - p} - \frac{1}{1 + p S} + \frac{1}{1 + p L} .

As it is evident that L > S, then $\frac{1}{1 + p L} < \frac{1}{1 + p S}$ or $0 < \frac{1}{1 + p S} - \frac{1}{1 + p L} < 1$ . Hence, ID< 1 if $p \to 0$ and $β \to \infty$ .

OTHER MOMENTS MEASURES

The following theorems gives factorial moments and negative moments.

Theorem 1: If $Y \sim$ WNBL $(p, β)$ , then the r^th descending factorial moment of Y is given by

μ_{(r)}^{'} = \frac{(β)_{r} p^{r} (1 - p)^{- r} (1 + β r + (β^{2} - 1) p)}{(1 + p (β^{2} - 1))},

(8)

where $p \in (0, 1),$ $β \geq 0,$ $r = 0, 1, \dots, (a)_{n} = a (a + 1) (a + 2) \dots (a + n - 1)$ and $μ_{(0)}^{'} = 1$ .

Proof: The r^th descending factorial moment for Y can be defined as $μ_{(r)}^{'} = E (Y^{(r)}) = \sum_{x = 0}^{\infty} x^{(r)} P (Y = x)$ . Using the expression $x^{(r)} = x (x - 1) \dots (x - r + 1) = \frac{x!}{(x - r)!}$ , we have

μ_{(r)}^{'} = \frac{(1 - p)^{β + 1}}{(1 + p (β^{2} - 1))} \sum_{x = r}^{\infty} \frac{x!}{(x - r)!} (\frac{β + x - 1}{x}) (1 + β x) p^{x} .

By using the binomial series $(1 - z)^{- a} = \sum_{x = 0}^{\infty} \frac{(a)_{n} z^{n}}{n!}$ , we obtain

μ_{(r)}^{'} = \frac{(1 - p)^{β + 1} (β + r - 1)! p^{r}}{(β - 1)! (1 + p (β^{2} - 1))} {(1 - p)^{- (β + r)} + β (r + (r + 1) (β + r) p + (β + r)_{2} (r + 2) \frac{p^{2}}{2!} \dots)},

hence, we get

μ_{(r)}^{'} = \frac{(1 - p)^{- r} (β + r - 1)! p^{r}}{(β - 1)! (1 + p (β^{2} - 1))} {(1 - p)^{- (β + r)} + (1 - p)^{- (β + r + 1)} β (r + β p)} .

After some algebraic manipulation, the equation (8) is attained and it generates a recursive relation between r and (r - 1) descending factorial moments as

μ_{(r)}^{'} {(1 - p) (1 + β (r - 1) + (β^{2} - 1) p)} = {(β + r - 1) p (1 + β r + (β^{2} - 1) p)} μ_{(r - 1)}^{'},

(9)

where $r = 1, 2, \dots,$ which completes the proof.

Theorem 2: If $Y \sim$ WNBL $(p, β)$ , the r^th ascending factorial moment of Y is given by

μ_{[r]}^{'} = \frac{β p r! (1 - p)^{β + 1}}{(1 + p (β^{2} - 1))} {_2F_1 (r + 1, β + 1; 2; p) + β_2F_1 (r + 1, β + 1; 1; p)},

(10)

where $p \in (0, 1),$ $β \geq 0,$ $r = 0, 1, . . .,$ $μ_{[0]}^{'} = 1 - \frac{(1 - p)^{β + 1}}{(1 + p (β^{2} - 1))}$ .

Proof: The r^th ascending factorial moment is defined as $μ_{[r]}^{'} = E ((Y)_{r}) = \sum_{x = 0}^{\infty} (x)_{r} P (Y = x)$ , this implies that

μ_{[r]}^{'} = \frac{(1 - p)^{β + 1}}{(1 + p (β^{2} - 1))} {\sum_{x = 0}^{\infty} (\frac{β + x - 1}{x}) (x)_{r} p^{x} + β \sum_{x = 0}^{\infty} (\frac{β + x - 1}{x}) x (x)_{r} p^{x}} .

By using the hypergeometric series function $_2F_1 (a, b; c; z) = \sum_{n = 0}^{\infty} \frac{(a)_{n} (b)_{n} z^{n}}{(c)_{n} n!}$ , we get the equation (10), where

\sum_{x = 0}^{\infty} (\frac{β + x - 1}{x}) (x)_{r} p^{x} = β r! p_{2} F_{1} (r + 1, β + 1; 2; p)

and

β \sum_{x = 0}^{\infty} (\frac{β + x - 1}{x}) x (x)_{r} p^{x} = β^{2} r! p_{2} F_{1} (r + 1, β + 1; 1; p)

Thus the theorem is proved.

As it is known, the ordinary moments are generally helpful in estimating the unknown parameters. However, due to recent developments in inverse theory of the random variables, the negative moments are gaining momentum in life testing phenomena, estimation purposes and identifying the models. The negative moments are being used in irreversible damage to manufacturing materials due to damage process such as creep, corrosion, creep fracture, wear, shrinkage, aging and cracking (see Ahmad 2007AHMAD M. 2007. On the theory of inversion. Int J Stat Sci 6: 43-53.). Therefore, the next theorem deals with the negative moments of the WNBL distribution.

Theorem 3: If $Y \sim$ WNBL $(p, β),$ the first order negative moment of Y is given by

E ((Y + a)^{- 1}) = \frac{(1 - p)^{β + 1}}{(1 + p (β^{2} - 1))} {\frac{1}{a}_{2} F_{1} (a, β; a + 1; p) + \frac{β^{2} p}{a + 1}_{2} F_{1} (a + 1, β + 1; a + 2; p)}

(11)

where $p \in (0, 1),$ $β \geq 0,$ $a > 0$ and $_2F_1 (a, b; c; z) = \sum_{n = 0}^{\infty} \frac{(a)_{n} (b)_{n} z^{n}}{(c)_{n} n!}$ .

Proof: By definition, $E (\frac{1}{Y + a}) = \sum_{x = 0}^{\infty} \frac{P (Y = x)}{x + a}$ , we have

E (\frac{1}{Y + a}) = \frac{(1 - p)^{β + 1}}{(1 + p (β^{2} - 1))} \sum_{x = 0}^{\infty} (\frac{β + x - 1}{x}) (x + a)^{- 1} p^{x} (1 + β x) .

By simplification, we get the equation (11) where

\sum_{x = 0}^{\infty} (\frac{β + x - 1}{x}) \frac{p^{x}}{x + a} = \frac{1}{a}_2F_1 (a, β; a + 1; p)

and

\sum_{x = 0}^{\infty} (\frac{β + x - 1}{x}) \frac{x p^{x}}{x + a} = \frac{β p}{(a + 1)}_{2} F_{1} (a + 1, β + 1; a + 2; p)

This completes the proof.

Corollary 2: The r^th order negative moment of $Y \sim$ WNBL $(p, β)$ can be expressed as

\begin{matrix} E ((Y + a)^{- s}) & = & \frac{(1 - p)^{β + 1}}{(1 + p (β^{2} - 1))} {\frac{1}{a^{s}}_{s + 1} F_{s} (a, . . ., a, β; a + 1, . . ., a + 1; p) \\ + \frac{β^{2} p}{(a + 1)^{s}}_{s + 1} F_{s} (a + 1, . . ., a + 1, β + 1; a + 2, . . ., a + 2; p)}, \end{matrix}

where

_{s} F_{u} (a_{1}, . . ., a_{s}; b_{1}, . . ., b_{u}; z) = \sum_{n = 0}^{\infty} \frac{(a_{1})_{n}, . . ., (a_{s})_{n} z^{n}}{(b_{1})_{n}, . . ., (b_{u})_{n} n!},

the series converges for $s = u + 1$ and $| z | < 1$ .

PARAMETER ESTIMATION AND INFERENCE

In this section, estimation of parameters of the distribution is investigated by the methods of moments (MM) and maximum likelihood (ML), and a performance of the estimators is assessed by a simulation study.

MOMENTS METHOD

Let $Y_{1}, Y_{2}, \dots, Y_{n}$ be a random sample drawn from the WNBL distribution with the observed values $x_{1}, x_{2}, \dots, x_{n}$ . Equating the first two sample moments, $m_{1}^{'} = \overline{x} = \frac{1}{n} \sum_{i = 1}^{n} x_{i}$ and $m_{2}^{'} = \frac{1}{n} \sum_{i = 1}^{n} x_{i}^{2},$ with their associated population moments

μ_{1}^{'} = \frac{(β + 1) p}{1 - p} + \frac{(β^{2} - 1) p}{1 + p (β^{2} - 1)}

and

μ_{2}^{'} = μ_{1}^{'} + \frac{((β + 1) (β + 2) p^{2})}{(1 - p)^{2}} + \frac{2 (β + 1) (β^{2} - 1) p^{2}}{(1 - p) (1 + p (β^{2} - 1))}

then the MM estimators of the WNBL are obtained.

Alternatively, the MM estimators can also be obtained using the suggested approach by Khan et al. (1989)KHAN AMS, KHALIQUE A AND ABOUAMMOTH AM. 1989. On estimating parameters in a discrete Weibull distribution. IEEE T Reliab 38: 348-350. via minimizing the expression

𝔖 = (μ_{1}^{'} - m_{1}^{'})^{2} + (μ_{2}^{'} - m_{2}^{'})^{2},

with respect to p and β. Hence,

S = (\frac{(β + 1) p}{1 - p} + \frac{(β^{2} - 1) p}{1 + p (β^{2} - 1)} - m_{1}^{'})^{2} + (m_{1}^{'} + \frac{((β + 1) (β + 2) p^{2})}{(1 - p)^{2}} + \frac{2 (β + 1) (β^{2} - 1) p^{2}}{(1 - p) (1 + p (β^{2} - 1))} - m_{2}^{'})^{2}

yields two equations which are not in closed form, so parameters are estimated using numerical optimization techniques via Mathematica 8 computational packages.

MAXIMUM LIKELIHOOD METHOD

If $Y_{1}, Y_{2}, \dots, Y_{n}$ is a random sample drawn from the WNBL distribution with the observed values $x_{1}, x_{2}, \dots, x_{n},$ then we get the log-likelihood function

\begin{matrix} κ_{p, β} & = & l n (L (p; β)) = (β + 1) n l n (1 - p) - n l n ((1 + p (β^{2} - 1))) + \sum_{i = 1}^{n} x_{i} l n p \\ + \sum_{i = 1}^{n} l n (\frac{β + x_{i} - 1}{x_{i}}) + \sum_{i = 1}^{n} l n (1 + β x_{i}) . \end{matrix}

By partially differentiating both sides of the equation above with respect to p and β and equating them to zero, we get the MLEs of pand β respectively, as

\frac{n (β + 1)}{1 - p} + \frac{n (β^{2} - 1)}{1 + p (β^{2} - 1)} = \frac{\bar{x}}{p}

(12)

and

n l n (1 - p) - \frac{2 n p β}{1 + p (β^{2} - 1)} + \sum_{i = 1}^{n} \frac{x_{i}}{1 + β x_{i}} + \sum_{i = 1}^{n} {ψ^{(0)} (β + x_{i}) - ψ^{(0)} (β)} = 0

(13)

Similarly, the second derivatives of the equation (12)) and equation (13) with respect to p and β, respectively, are

\frac{\partial^{2} κ_{p, β}}{\partial p^{2}} = - \frac{n (β + 1)}{(1 - p)^{2}} + \frac{n (β^{2} - 1)^{2}}{1 + p (β^{2} - 1)} - \frac{\sum_{i = 1}^{n} x_{i}}{p^{2}}

and

\frac{\partial^{2} κ_{p, β}}{\partial β^{2}} = \frac{4 n p^{2} β^{2}}{(1 + p (β^{2} - 1))^{2}} - \sum_{i = 1}^{n} \frac{x_{i}^{2}}{(1 + β x_{i})^{2}} - \frac{2 n p}{1 + p (β^{2} - 1)} + \sum_{i = 1}^{n} {ψ^{(1)} (β + x_{i}) - ψ^{(1)} (β)}

Also, the second derivative of equation (12) with respect to β or equation (13) with respect to p yields

\frac{\partial^{2} κ_{p, β}}{\partial β \partial p} = \frac{2 β (β^{2} - 1) n p}{(1 + p (β^{2} - 1))^{2}} - \frac{n}{1 - p} - \frac{β n}{1 + p (β^{2} - 1)},

where $ψ^{(n)} (z) = \frac{d^{n} ψ (z)}{d z^{n}}$ is the logarithmic derivative of the gamma function. The MLEs are computed using computational packages, such as Mathematica 7. In view of the regularity conditions (see Rohatgi and Saleh 2002ROHATGI VK AND SALEH AKE. 2002. An Introduction to probability and Statistics. Singapore: John Wiley and Sons (Asia) Pte Ltd., pp. 419), we observed that the MLEs of WNBL parameters satisfy such conditions, where i) The parameters $p \in (0, 1), β \in (0, \infty)$ are subset of the real line, ii) $\frac{\partial κ_{p, β}}{\partial p}, \frac{\partial κ_{p, β}}{\partial β}, \frac{\partial^{2} κ_{p, β}}{\partial p^{2}}, \frac{\partial^{2} κ_{p, β}}{\partial β^{2}}, \frac{\partial^{2} κ_{p, β}}{\partial p \partial β}$ exist for all values of p and β, iii) $E (\frac{\partial κ_{p, β}}{\partial p}) = 0, E (\frac{\partial κ_{p, β}}{\partial β}) = 0,$ iv) $\infty \leq E (\frac{\partial^{2} κ_{p, β}}{\partial p^{2}}) \leq 0, - \infty \leq E (\frac{\partial^{2} κ_{p, β}}{\partial β^{2}}) \leq 0$ for all values of p and β.

Moreover, the MLEs $\hat{p}$ and $\hat{β}$ of the WNBL distribution have an asymptotic bivariate normal distribution with vector mean (p,β) and variance-covariance matrix (I(p,β))^-1, where I(p,β) denotes the information matrix given by

$I ((p, β) |_{p = \hat{p}, β = \hat{β}}) = [\begin{matrix} 𝔼 (\frac{- \partial^{2} κ_{p, β}}{\partial p^{2}}) & 𝔼 (\frac{- \partial^{2} κ_{p, β}}{\partial p \partial β}) \\ 𝔼 (\frac{- \partial^{2} κ_{p, β}}{\partial p \partial β}) & 𝔼 (\frac{- \partial^{2} κ_{p, β}}{\partial β^{2}}) \end{matrix}]$ ,

where

𝔼 (\frac{- \partial^{2} κ_{p, β}}{\partial p^{2}}) = \frac{n (\hat{β} + 1)}{(1 - \hat{p})^{2}} - \frac{n ({\hat{β}}^{2} - 1)^{2}}{1 + \hat{p} ({\hat{β}}^{2} - 1)} + \frac{n 𝔼 (Y)}{{\hat{p}}^{2}},

𝔼 (\frac{- \partial^{2} κ_{p, β}}{\partial p \partial β}) = - \frac{2 \hat{β} ({\hat{β}}^{2} - 1) n \hat{p}}{(1 + \hat{p} ({\hat{β}}^{2} - 1))^{2}} + \frac{n}{1 - \hat{p}} - \frac{2 \hat{β} n}{1 + \hat{p} ({\hat{β}}^{2} - 1)},

𝔼 (\frac{- \partial^{2} κ_{p, β}}{\partial β^{2}}) = - \frac{4 n {\hat{p}}^{2} {\hat{β}}^{2}}{(1 + \hat{p} ({\hat{β}}^{2} - 1))^{2}} + n 𝔼 (\frac{Y^{2}}{(1 + \hat{β} Y)^{2}}) + \frac{2 n \hat{p}}{1 + \hat{p} ({\hat{β}}^{2} - 1)} - n 𝔼 {ψ^{(1)} (\hat{β} + Y) - ψ^{(1)} (\hat{β})} .

The expectations above can be obtained numerically.

SIMULATION SCHEME

As it is mentioned in the introduction that the WNBL(p,β) distribution can be viewed as a mixture of the negative binomial, NB(p,β), and size biased negative binomial, SBNB(p,β), distributions. So, in order to generate random data k_i, i=1,2,...n, from the WNBL we used the following algorithm.

Algorithm: i) Generate U_i,i = 1,2,...,n,from the uniform distribution on the interval (o,1). ii) Generate NB_i,i=1,2,...,n from the negative binomial distribution NB(p,β) with support k=0,1,.... iii)Set Y₁=NB_i. iv) Generate SBNB_i,i=1,2,...,n, from the size biased negative binomial distribution SBNB(p,β) with support k=1,2,.... v) Set Y₂=SBNB_i. If $U_{i} \leq \frac{1 - p}{(1 + p (β^{2} - 1))}$ , then set k_i=Y₁ otherwise set k_i=Y₂,i=1,2,...,n.

SIMULATION STUDY

In this section, we perform some numerical experiments to see how the estimates studied by using the above listed methods (MM and ML) as well as their asymptotic results for finite samples. All the numerical results are performed via Mathematica 8 using the random numbers generator code. We consider the following different model parameters. Model-I: $β = 0.0542, p = 0.7487$ , Model-II: $β = 30.4389, p = 0.0157$ and Model-III: $β = 5.2856, p = 0.6398$ for both MMEs and MLEs. We consider the following sample sizes n=20(small), 50 (moderate), and 100,200,500 (large). For each set of the model parameters and for each sample size, we compute the MMEs and MLEs of each β and p. We repeat this process 1000 times and compute the average bias and mean square error (MSE) for all replications in the relevant sample sizes. The results are reported in Table I and III.

Discussion: Some notes are very clear from the simulation studies, such as the bias decreases as sample size increases for both MLEs and MMEs. Moreover, we conclude that the bias takes negative and positive signs, and it approaches to the value zero for both signs while the MSE decreases as the sample size increases.

Parameter	Sample Size	Bias( $\hat{β}$ )	Bias( $\hat{p}$ )	MSE( $\hat{β}$ )	MSE( $\hat{p}$ )
β = 0:0542, p = 0:7487	n=20	26.6700	-0.7485	742.1	0.5603
	n=50	14.5694	-0.6458	411.023	0.4276
	n=100	1.1141	-0.6022	1.4983	0.4265
	n=200	0.9135	-0.5743	0.9130	0.3924
	n=500	0.8783	-0.5517	0.8940	0.3822
β = 30:4389, p = 0:0157	n=20	139.4975	-0.7418	37071.8	0.5502
	n=50	4.7926	-0.7071	23.0087	0.5001
	n=100	4.4596	-0.6744	19.4900	0.4995
	n=200	3.8780	-0.6688	18.8034	0.4872
	n=500	2.8913	-0.6217	15.8204	0.4644
β= 5:2856, p = 0:6398	n=20	4.0410	-0.0942	12.6543	0.0125
	n=50	3.6904	-0.0907	12.1628	0.0063
	n=100	3.6522	-0.0890	9.8270	0.0058
	n=200	3.3172	-0.0836	8.5084	0.0056
	n=500	2.6516	-0.0798	8.2922	0.0047

Parameter	Sample Size	Bias( $\hat{β}$ )	Bias( $\hat{p}$ )	MSE( $\hat{β}$ )	MSE( $\hat{p}$ )
β = 0:0542, p = 0:7487	n=20	32.0968	-0.7486	1030.22	0.5604
	n=50	14.498	-0.6101	410.9645	0.3914
	n=100	0.9708	-0.5826	1.1641	0.3570
	n=200	0.7625	-0.5720	0.6627	0.3545
	n=500	0.6575	-0.5606	0.6321	0.3532
β = 30:4389, p = 0:0157	n=20	138.775	-0.7417	36685.5	0.5502
	n=50	4.1503	-0.6758	18.827	0.4968
	n=100	3.6311	-0.6408	17.574	0.4888
	n=200	3.3901	-0.6212	16.8538	0.4837
	n=500	2.3361	-0.5537	14.3888	0.4327
β = 5:2856, p = 0:6398	n=20	3.3960	-0.0869	11.5333	0.0075
	n=50	3.0078	-0.0866	11.3053	0.0055
	n=100	2.5571	-0.0866	10.3211	0.0052
	n=200	2.4807	-0.0752	7.7243	0.0045
	n=500	1.6978	-0.0728	5.1232	0.0042

Distribution	$\hat{p}$	$\hat{β}$	KS	- $l$	df	$χ^{2}$	p-value	AIC	BIC	AICc
WNBL( $\hat{p}$ , $\hat{β}$ )	0.2087(0.0154)	0.4770(0.0436)	0.5951	222.3063	3	2.3842	0.497	448.6126	454.6339	445.6926
NB( $\hat{p}$ , $\hat{β}$ )	0.5281(0.0275)	1.0245( 0.1051)	0.5979	222.4371	3	2.5147	0.473	448.8742	454.8955	445.9542
GP( $\hat{p}$ , $\hat{β}$ )	0.3209(0.0551)	0.7786(0.0796)	0.6019	222.7452	3	2.9475	0.400	449.4904	455.5117	446.5704
Pois.( $\hat{β}$ )		1.1466(0.0874)	0.6408	242.8099	2	26.6513	0.000	487.6198	490.6304	485.6465

Distribution	$\hat{p}$	$\hat{β}$	KS	- $l$	df	$χ^{2}$	p-value	AIC	BIC	AICc
WNBL( $\hat{p}$ , $\hat{β}$ )	0.3006(0.0140)	0.6160(0.0314)	0.7295	592.0683	2	2.9852	0.225	1188.137	1197.081	1185.155
NB( $\hat{p}$ , $\hat{β}$ )	0.3497(0.0137)	0.8651(0.0587)	0.7309	592.2670	2	3.2742	0.195	1188.534	1197.479	1185.553
GP( $\hat{p}$ , $\hat{β}$ )	0.1963(0.0332)	0.3739(0.0253)	0.7328	592.6005	2	3.7961	0.150	1189.201	1198.146	1186.22
Pois.( $\hat{β}$ )		0.4652(0.0268)	0.7534	617.1843	2	65.0106	0.000	1236.369	1240.841	1234.375

Distribution	$\hat{p}$	$\hat{β}$	KS	- $l$	df	$χ^{2}$	p-value	AIC	BIC	AICc
WNBL( $\hat{p}$ , $\hat{β}$ )	0.0833(0.0077)	3.9294(0.2916)	0.5729	187.3976	1	1.1206	0.290	378.7952	384.8949	375.8721
NB( $\hat{p}$ , $\hat{β}$ )	0.0017(5.6903 $\times 10^{- 06}$ )	572.1488(45.9855)	0.5395	191.9723	1	9.4912	0.002	387.9446	394.0443	385.0215
GP( $\hat{p}$ , $\hat{β}$ )	-0.1450(0.0403)	1.1376(0.0779)	0.5487	188.8228	1	4.5304	0.033	381.6456	387.7453	378.7225
Pois.( $\hat{β}$ )		0.9936(0.0798)	0.5381	191.9362	2	9.4145	0.009	385.8724	388.9223	383.898

x	January	February	March	April	May	June	July	August	September	October	Nov.	Dec.	Spring	Summer	Fall
0	335	295	308	299	266	187	177	185	228	311	321	334	873	549	860
1	4	9	20	18	43	77	80	89	54	17	6	3	81	246	77
2	2	4	9	10	25	40	47	30	33	9	3	2	44	117	45
3		2	3	3	3	17	26	24	12	4		2	9	67	16
4			1		3	6	9	10	3				4	25	3
5					0	2	2	3					0	7
6					1	1							1	1
Total	341	310	341	330	341	330	341	341	330	341	330	341	1012	1012	1001

x	Jan.	Feb.	March	April	May	June	July	Aug.	Sep.	Oct.	Nov.	Dec.	Spring	Summer	Fall
0	334.99	294.92	307.72	298.62	265.09	185.42	173.55	184.55	225.47	310.66	320.97	333.98	871.38	543.54	857.62
1	4.63	10.34	22.56	21.67	49.00	83.09	91.84	87.20	65.33	20.41	6.95	4.197	93.70	262.13	93.87
2	0.98	2.92	6.64	6.16	16.57	36.22	43.18	39.67	24.21	6.06	1.48	1.43	28.96	119.04	29.91
3		1.06	2.42	2.16	6.25	15.14	18.98	17.26	9.25	2.25		0.65	10.69	51.37	11.41
4			0.96		2.45	6.14	8.00	7.28	3.55				4.24	21.43	4.67
5					0.98	2.44	3.28	3.01					1.74	8.73
6					0.39	0.96							0.73	3.49
Total	341	310	341	330	341	330	341	341	330	341	330	341	1012	1012	1001
LL( $- l$ )	34.86	75.33	142.11	133.79	259.57	394.33	439.78	423.75	317.00	133.19	48.29	43.62	549.79	1258.98	560.87
$\hat{p}$	0.399	0.499	0.461	0.442	0.401	0.354	0.358	0.367	0.361	0.477	0.386	0.659	0.439	0.360	0.455
$\hat{β}$	0.034	0.066	0.141	0.144	0.731	0.815	0.740	0.526	0.123	0.053	0.019	0.019	0.204	0.760	0.200
Var $(\hat{p})$	0.012	0.0051	0.0021	0.002	0.001	0.0003	0.0003	0.0003	0.0005	0.0023	0.0073	0.011	0.0005	0.0001	0.0005
Var $(\hat{β})$	0.0002	0.0002	0.0004	0.001	0.001	0.0019	0.0021	0.0018	0.0014	0.0004	0.0003	0.00003	0.0003	0.0006	0.0002
Cov $(\hat{β}, \hat{p})$	0.0002	0.0012	0.001	0.001	0.001	0.0008	0.0008	0.0008	0.0009	0.0011	0.0015	0.0009	0.0003	0.0003	0.0003
Sk. $((\hat{β}, \hat{p})$	161.49	85.3649	51.053	50.412	36.69	44.767	49.662	46.471	37.537	55.072	108.482	241.63	42.2	46.884	42.551
Kur. $((\hat{β}, \hat{p})$	217.87	110.767	59.456	58.598	34.25	32.069	33.828	32.770	30.661	65.549	142.418	343.444	45.438	32.851	45.680
ID $((\hat{β}, \hat{p})$	1.69	2.109	2.0204	1.946	1.860	1.640	1.6217	1.6728	1.7132	2.079	1.678	3.030	1.973	1.647	2.037
AIC	73.72	154.66	288.22	271.58	523.14	792.66	883.56	851.5	638	270.38	100.58	91.24	1103.58	2521.96	1125.74
$χ^{2}$	0.027	0.032	1.173	3.250	5.504	2.392	4.538	5.745	5.559	3.306	0.038	0.083	9.80	7.90	11.18
df	1	1	2	2	3	4	4	4	3	3	2	1	4	5	3
p-value	0.869	0.858	0.556	0.197	0.138	0.664	0.338	0.219	0.135	0.347	0.981	0.773	0.044	0.162	0.011

x	Jan.	Feb.	March	April	May	June	July	Aug.	Sep.	Oct.	Nov.	Dec.	Spring	Summer	Fall
0	334.99	294.94	307.79	298.71	265.09	184.64	171.83	183.86	225.14	310.75	320.97	333.98	871.68	540.51	858.12
1	4.62	10.32	22.55	21.62	49.39	84.55	94.06	88.66	29.33	20.37	6.94	4.195	93.83	267.18	93.81
2	0.98	2.91	6.58	6.09	16.22	35.87	43.32	39.28	23.74	6.01	1.48	1.43	28.52	118.32	29.42
3		1.06	2.39	2.15	6.09	14.82	18.69	16.90	9.00	2.23		0.65	10.54	50.37	11.23
4			0.96		2.44	6.04	7.79	7.16	3.51				4.24	21.01	4.67
5					0.43	2.44	3.18	3.01					1.79	8.66
6					0.39	0.98							0.78	3.54
Total	341	310	341	330	341	330	341	341	330	341	330	341	1012	1012	1001
LL( $- l$ )	34.86	75.34	142.17	133.86	259.78	394.59	440.18	423.95	317.57	133.26	48.29	43.62	550.14	1259.91	561.41
$\hat{p}$	0.412	0.528	0.510	0.492	0.471	0.391	0.374	0.404	0.422	0.524	0.405	0.659	0.500	0.391	0.518
$\hat{β}$	0.034	0.066	0.144	0.147	0.396	1.172	1.465	1.192	0.698	0.125	0.053	0.019	0.215	1.263	0.211
AIC	73.72	154.68	288.34	271.72	523.56	793.18	884.36	851.9	639.14	270.52	100.58	91.24	1104.28	2523.82	1126.82
$χ^{2}$	0.028	0.035	1.232	3.347	6.008	2.888	5.428	5.967	6.429	3.307	0.039	0.084	10.38	9.51	11.87
df	1	1	2	2	3	4	4	4	3	3	2	1	4	5	3
p-value	0.867	0.852	0.540	0.188	0.111	0.577	0.246	0.202	0.093	0.347	0.981	0.772	0.034	0.090	0.008

x	Jan.	Feb.	March	April	May	June	July	Aug.	Sep.	Oct.	Nov.	Dec.	Spring	Summer	Fall
0	334.98	294.01	307.65	298.55	264.57	183.74	170.46	183.04	224.09	310.59	320.96	333.97	857.081	537.41	857.08
1	4.68	12.18	23.18	22.22	50.81	86.27	96.14	90.36	68.25	21.01	7.04	4.42	97.04	272.77	97.04
2	0.92	2.59	6.17	5.73	15.59	35.53	43.26	38.89	23.26	5.62	1.39	1.30	27.83	117.51	27.83
3		0.77	2.23	2.01	5.76	14.40	18.28	16.43	8.62	2.06		0.57	10.45	49.04	10.45
4			0.96		2.44	6.04	7.79	7.16	3.51				4.47	20.42	4.47
5					0.43	2.44	3.18	3.01					8.55	8.73
6					0.39	0.98							3.61	3.61
Total	341	310	341	330	341	330	341	341	330	341	330	341	1012	1012	1001
LL( $- l$ )	34.94	75.88	142.57	134.35	260.26	394.98	440.81	424.34	318.36	133.79	48.41	43.92	551.65	1261.37	563.64
$\hat{p}$	0.241	0.245	0.312	0.297	0.279	0.221	0.207	0.231	0.239	0.323	0.236	0.453	0.303	0.221	0.316
$\hat{β}$	0.018	0.053	0.103	0.100	0.254	0.586	0.694	0.622	0.387	0.093	0.028	0.021	0.150	0.633	0.1552
AIC	73.88	155.76	289.14	272.7	524.52	793.96	885.62	852.68	640.72	271.58	100.82	91.84	1107.3	2526.74	1131.28
$χ^{2}$	0.028	0.022	1.879	4.385	7.167	3.553	6.556	6.496	7.867	4.440	0.039	0.081	13.32	11.91	15.87
df	1	1	2	2	3	4	4	4	3	3	2	1	4	5	3
p-value	0.867	0.882	0.391	0.112	0.067	0.470	0.161	0.165	0.049	0.218	0.981	0.776	0.010	0.036	0.001

Competing Models	Statistics	January	February	March	April	May	June
WNBL-NB	𝐙 p-value	83.5625	31.42032	36.5475	108.1294	7.2535	69.2805
WNBL-NB	𝐙 p-value	0.0001	0.0001	0.0001	0.0001	0.0001	0.0001
WNBL-GP	𝐙 p-value	3.4798	1.0256	12.113	15.0277	5.5650	40.252
WNBL-GP	𝐙 p-value	0.0005	0.3051	0.0001	0.0001	0.0001	0.0001

Competing Models	Statistics	July	August	September	October	November	December
WNBL-NB	𝐙 p-value	90.2142	125.0967	26.7266	183.6862	1032.339	1838.215
WNBL-NB	𝐙 p-value	0.0001	0.0001	0.0001	0.0001	0.0001	0.0001
WNBL-GP	𝐙 p-value	69.8226	54.8183	12.113	13.5131	6.1042	3.1211
WNBL-GP	𝐙 p-value	0.0005	0.0001	0.0001	0.0001	0.0001	0.0018

Brasil

Brasil

A weighted negative binomial Lindley distribution with applications to dispersed data

Abstract

INTRODUCTION

THE PROPOSED DISTRIBUTION WITH MOTIVATIONS

STATISTICAL PROPERTIES OF THE WNBL DISTRIBUTION

OVER-AND UNDER-DISPERSION

OTHER MOMENTS MEASURES

PARAMETER ESTIMATION AND INFERENCE

MOMENTS METHOD

MAXIMUM LIKELIHOOD METHOD

SIMULATION SCHEME

SIMULATION STUDY

CHARACTERIZATION

DISCRETE ANALOGUES OF SELF-DECOMPOSABILITY

TEST STATISTICS WITH DATA APPLICATIONS

DESCRIPTION OF THE DATA SETS WITH THEIR FITTING

CONCLUSIONS

ACKNOWLEGMENTS

REFERENCES

APPENDIX

Publication Dates

History