Acessibilidade / Reportar erro

THE COMPOSED ZERO TRUNCATED LINDLEY-POISSON DISTRIBUTION

ABSTRACT

In this paper, a new compounding distribution, named zero truncated Lindley-Poisson distribution is introduced. The probability density function, cumulative distribution function, survival function, failure rate function and quantiles expressions of it are provided. The parameters estimatives were obtained by six methods: maximum likelihood (MLE), ordinary least-squares (OLS), weighted least-squares (WLS), maximum product of spacings (MPS), Crame´r-von-Mises (CM) and Anderson-Darling (AD), and intensive simulation studies are conducted to evaluate the performance of parameter estimation. Some generalizations are also proposed. Application in a real data set is given and shows that the composed zero truncated Lindley-Poisson distribution provides better fit than the Lindley distribution and three of its generalizations. The paper is motivated by application in real data set and we hope this model may be able to attract wider applicability in survival and reliability.

Keywords:
compounding; estimation methods; Lindley distribution; survival analysis; zero truncated Poisson distribution

1 INTRODUCTION

The one parameter Lindley distribution was introduced by Lindley (see, Lindley 195826 LINDLEY DV. 1958. Fiducial distributions and Bayes’ theorem. It Journal of the Royal Statistical Society. Series B. Methodological, 20: 102-107. and 196525 LINDLEY D. 1965. Introduction to Probability and Statistics from a Bayesian Viewpoint, Part II: Inference. Cambridge University Press, New York.) as a new distribution useful to analyze lifetime data, especially in stress-strength reliability modeling. Suppose that T 1 , ... , TM are independent and identically distributed random variables following the one parameter Lindley distribution with probability density function and distribution function written, respectively, as:

(1)

(2)

where t > 0 and θ > 0.

For a random variable with the one parameter Lindley distribution, the probability density function, (1), is unimodal for 0 < θ < 1 and decreasing when θ > 1 (see Fig. 1-a). The hazard rate function is an increasing function in t and θ (see Fig. 1-b) and given by:

(3)

Figure 1
Probability density function and hazard rate function behavior for different values of θ .

Ghitany et al. (2008b19 GHITANY ME, ATIEH B & NADARAJAH S. 2008. Lindley distribution and its application. Mathematics and Computers in Simulation, 78(4): 493-506.) studied the properties of the one parameter Lindley distribution under a careful mathematical treatment. They also showed, in a numerical example, that the Lindley distribution is a better model than the Exponential distribution. A generalized Lindley distribution, which includes as special cases the Exponential and Gamma distributions was proposed by Zakerzadeh & Dolati (200938 ZAKERZADEH H & DOLATI A. 2009. Generalized Lindley distribution. Journal of Mathematical Extension, 3: 13-25.), and Nadarajah et al. (201131 NADARAJAH S, BAKOUCH HS & TAHMASBI R. 2011. A generalized Lindley distribution. Sankhya˜, B 73(2): 331-359.) introduced the exponentiated Lindley distribution. Ghitany & Al-Mutari (200817 GHITANY ME & AL-MUTARI DK. 2008. Size-biased Poisson-Lindley distribution and its application. METRON - International Journal of Statistics, 66(3): 299-311.) considered a size-biased Poisson-Lindley distribution and Sankaran (197036 SANKARAN M. 1970. The discrete Poisson-Lindley distribution. Biometrics, 26: 145-149.) proposed the Poisson-Lindley distribution to model count data. Some properties of Poisson-Lindley distribution and its derived distributions were considered in Borah & Begum (20026 BORAH M & BEGUM RA. 2002. Some properties of Poisson-Lindley and its derived distributions. Journal of the Indian Statistical Association, 40(1): 13-25.) while Borah & Deka (2001a7 BORAH M & DEKA NA. 2001. Poisson-Lindley and some of its mixture distributions. Pure and Applied Mathematika Sciences, 53(1-2): 1-8.) considered the Poisson-Lindley and some of its mixture distributions. The zero-truncated Poisson-Lindley distribution and the generalized Poisson-Lindley distribution were considered in Ghitany et al. (2008a16 GHITANY ME, AL-MUTAIRI DK & NADARAJAH S. 2008. Zero-truncated Poisson-Lindley distribution and its application. Mathematics and Computers in Simulation, 79(3): 279-287.) and Mahmoudi & Zakerzadeh (201029 MAHMOUDI E & ZAKERZADEH H. 2010. Generalized Poisson Lindley distribution. Communications in Statistics Theory and Methods, 39: 1785-1798.), respectively. A study on the inflated Poisson-Lindley distribution was presented in Borah & Deka (2001b8 BORAH M & DEKA NA. 2001. A study on the inflated Poisson Lindley distribution. Journal of the Indian Society of Agricultural Statistics, 54(3): 317-323.) and Zamani & Ismail (201039 ZAMANI H & ISMAIL N. 2010. Negative Binomial-Lindley distribution and its application. Journal of Mathematics and Statistics, 6(1): 4-9.) considering the Negative BinomialLindley distribution. The weighted and extended Lindley distribution were considered by Ghitany et al. (201118 GHITANY ME, ALQALLAF F, AL-MUTAIRI DK & HUSAIN HA . 2011. A two-parameter weighted Lindley distribution and its applications to survival data. Mathematics and Computers in Simulation, 81: 1190-1201.) and Bakouch et al. (20123 BAKOUCH HS, AL-ZAHRANI BM, AL-SHOMRANI AA, MARCHI, VITOR AA & LOUZADA-NETO F. 2012. An extended Lindley distribution. Journal of the Korean Statistical Society, 41(1): 75-85.), respectively. The one parameter Lindley distribution in the competing risks scenario was considered in Mazucheli & Achcar (201130 MAZUCHELI J & ACHCAR JA. 2011. The Lindley distribution applied to competing risks lifetime data. Computer Methods and Programs in Biomedicine, 104(2): 188-192.). The exponential Poisson Lindley distribution was presented in Barreto-Souza & Bakouch (20134 BARRETO-SOUZA W & BAKOUCH HS. 2013. A new lifetime model with decreasing failure rate. Statistics: A Journal of Theoretical and Applied Statistics, 47(2): 465-476.). Ghitany et al. (201314 GHITANY M, AL-MUTAIRI D, BALAKRISHNAN N& AL-ENEZI L. 2013. Power lindley distribution and associated inference. Computational Statistics & Data Analysis, 64: 20-33.) introduced the power Lindley distribution. Ali (20151 ALI S. 2015. On the bayesian estimation of the weighted lindley distribution. Journal of Statistical Computation and Simulation, 85(5): 855-880.) investigated various properties of the weighted Lindley distribution which main focus was the Bayesian analysis. A new four-parameter class of generalized Lindley distribution called the beta-generalized Lindley distribution is proposed by Oluyede & Yang (201532 OLUYEDE B & YANG T. 2015. A new class of generalized lindley distributions with applications. Journal of Statistical Computation and Simulation, 85(10): 2072-2100.).

Aim to offers more flexible distributions for modeling lifetime data set, in this paper, is proposed an extension of the Lindley distribution. We consider that Tj, j = 1, . . ., M is a random sample from the one parameter Lindley distribution and that our variable of interest is defined as:

(i) Y = min (T 1, . . ., TM ) and (ii) Y = max (T 1 , .. ., TM )

representing, respectively, the first and the last failure time of a certain device subject to the presence of an unknown number M of causes of failures. Furthermore, we consider that M has a zero truncated Poisson distribution, MPoissonT runc (λ), λ > 0, and that Tj , j = 1, .. ., M, and M are independent random variables, leading to the composed zero truncated Lindley-Poisson distribution. The process of composition using the zero truncated Poisson distribution has been fairly used in the literature. In Kus¸ (200723 KUS¸ C. 2007. A new lifetime distribution. Computational Statistics & Data Analysis, 51(9): 4497- 4509.) was considered the zero truncated Exponential-Poisson distribution in the competing risks scenario. Hemmati et al. (201121 HEMMATI F, KHORRAM E& REZAKHAH S. 2011. A new three-parameter ageing distribution. Journal of Statistical Planning and Inference, 141(7): 2266-2275.) developed the zero truncated Weibull-Poisson distribution. Also in 2011, the same distribution was studied by Ristic´ & Nadarajah (201235 RISTIC´ MM & NADARAJAH S. 2012. A new lifetime distribution. Journal of Statistical Computation and Simulation iFirst, 1-16.) and Lu & Shi (201227 LU W & SHI D. 2012. A new compounding life distribution: the Weibull-Poisson distribution. Journal of Applied Statistics, 39(1): 21-38.). The zero truncated Exponential-Poisson distribution in the complementary risks scenario was introduced by Rezaei & Tahmasbi (201234 REZAEI S & TAHMASBI R. 2012. A new lifetime distribution with increasing failure rate: Exponential truncated Poisson. Journal of Basic and Applied Scientific Research, 2(2): 1749-1762.).

The paper is organized as follows: in Section 2 the zero truncated Lindley-Poisson distribution is formulated. In Section 3 six estimation methods are presented. A simulation study is introduced in Section 4. The Section 5 brings a real data application. And finally, conclusions are presented in Section 6.

2 MODEL FORMULATION

In the theory of competing risks and complementary risks the number of risk factors (or causes) that may lead to the event of interest, is known and denoted as M. However, in models of distributions composition is assumed that M is unknown. Therefore, there is a number M of latent risk factors competing to cause the event of interest. In what follows, let us consider the situation where an individual or unit is exposed to M possible causes of death or failure, such that the exact cause is fully known (David & Moeschberger, 197812 DAVID HA & MOESCHBERGER ML. 1978. The Theory of Competing Risks. Vol. 39 of Griffin’s Statistical Monograph Series. Macmillan Co., New York.). The model for lifetime in the presence of suchcompeting risks structure or complementary risks structure is known as model of composition distributions. If Tj , j = 1, ..., M denote the latent failure times of a individual subject to M risks, which are independent of M, what is observed is the time to failure Y = min (T 1 , ..., TM ). Given M = m, under the assumption that the latent failure times Tj , j = 1, ... , M are independent and identically distributed random variables with the distribution function (2), the probability density function and the cumulative distribution function are written, respectively, as:

(4)

(5)

It is important to note that (4) and (5) are uniquely determined by the distribution function of the minimum, that is, P(Yy) = 1 − [1 − F1(y | θ )]m , (Arnold et al., 20082 ARNOLD BC, BALAKRISHNAN N & NAGARAJA HN. 2008. A first course in order statistics. Vol. 54 of Classics in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA.).

Now, assuming the number of causes of death or failure, M, is a zero truncated Poisson random variable with probability mass function given by:

(6)

where m = 1, 2, ... and λ> 0, Rezaei & Tahmasbi (201234 REZAEI S & TAHMASBI R. 2012. A new lifetime distribution with increasing failure rate: Exponential truncated Poisson. Journal of Basic and Applied Scientific Research, 2(2): 1749-1762.).

The marginal probability density function, f min (y | θ, λ), the marginal cumulative distribution function, F min (y | θ, λ), and the marginal hazard rate function, h min (y | θ, λ) of Y = min (T 1 , .. ., TM ) are given, respectively, by:

(7)

(8)

(9)

where θ > 0,λ > 0 and y > 0, which defines the zero truncated Lindley-Poisson distribution in the competing risks scenario. Taking the λ = 0 we have the one parameter Lindley distribution as a particular case. Note that f min (0 | θ, λ) = λθ 2/(θ + 1)(1 - e - λ ) and f min (∞ | θ, λ) = 0. For all θ > 0 and λ > 0, the probability density function, (7), is decreasing or unimodal (see Fig. 2). For values of λ close to 1, the curve resembles the one parameter Lindley distribution, while when λ → 0 the curve tends to be symmetric.

Figure 2
The zero truncated Lindley-Poisson probability density function for different values of the λ and θ = 0.5 if Y = min(T 1 , . . . , TM ).

The hazard rate, (9), is increasing, increasing-decreasing-increasing and decreasing (see Fig. 3). Is easy to see that h min (0 | θ, λ) = f min (0 | θ, λ) = λθ 2/(θ + 1)(1 - e - λ ) and h min (∞ | θ, λ) = θ.

Figure 3
The zero truncated Lindley-Poisson hazard rate function for different values of the λ and θ = 2.0 if Y = min(T 1 , . . . , TM ).

Now, under the same assumptions and considering a complementary risks scenario, (Basu, 19815 BASU AP. 1981. Identifiability problems in the theory of competing and complementary risks - a survey. In: Statistical distributions in scientific work, Vol. 5 (Trieste, 1980). Vol. 79 of NATO Adv. Study Inst. Ser. C: Math. Phys. Sci. Reidel, Dordrecht, pp. 335-347.), where Y = max (T 1 , . . ., TM ) is observed, the marginal probability density function, f max (y | θ, λ), the cumulative distribution function, F max (y | θ, λ), and the hazard rate function, h max (y | θ, λ), are given, respectively, by:

(10)

(11)

(12)

where θ > 0, λ > 0 and y > 0.

Note that f max (0 | θ, λ) = λθ 2 e - λ /(θ + 1)(1 - e - λ ) and f max (∞ | θ, λ) = 0. For all θ > 0 and λ > 0 the probability density function (10), is decreasing or unimodal (see Fig. 4). For values of λ close to 1, the curve resembles the one parameter Lindley distribution, while when λ → ∞ the curve tends to be symmetric.

Figure 4
The zero truncated Lindley-Poisson probability density function for different values of the λ and θ = 2.0 if Y = max(T 1 , . . . , TM ).

For all θ > 0 and λ > 0, the hazard rate function, (12), is increasing (see Fig. 5). Is easy to see that h max (0 | θ, λ) = f max (0 | θ, λ) = λθ 2 e - λ /(θ + 1)(1 - e - λ ) and h max (∞ | θ, λ) = θ. Note that h min (∞ | θ, λ) = h max (∞ | θ, λ) = θ.

Figure 5
The zero truncated Lindley-Poisson hazard rate function for different values of the λ and θ = 2.0 if Y = max(T 1 ,. .. , TM ).

Glaser (198020 GLASER RE. 1980. Bathtub and related failure rate characterizations. Journal of the American Statistical Association, 75(371): 667-672.) and Chechile (20039 CHECHILE RA. 2003. Mathematical tools for hazard function analysis. Journal of Mathematical Psychology, 47(5-6): 478-494.) studied the hazard rate function behavior by the η (y) = -f’(y|θ, λ)/f(y|θ, λ) function and its derivative η’(y). Because of the complexity of such studies, this work only presents the functions η (y) and η’(y). Considering the hazard rate functions (9) and (12), we have:

(13)

(14)

and its first derivatives are:

Therefore, the hazard rate function behavior properties of the zero truncated Lindley-Poisson distribution follows from the results in Glaser (198020 GLASER RE. 1980. Bathtub and related failure rate characterizations. Journal of the American Statistical Association, 75(371): 667-672.) and Chechile (20039 CHECHILE RA. 2003. Mathematical tools for hazard function analysis. Journal of Mathematical Psychology, 47(5-6): 478-494.).

2.1 Quantile function

The quantile function of the zero truncated Lindley-Poisson distribution is given by:

if Y = min (T 1 , . . ., TM), where 0 < u < 1 and W−1 (·) denotes the negative branch of the Lambert W function (i.e., the solution of the equation W(z)eW ( z ) = z) because (1 + θ + θ y) > 1 and e, 0). And, the quantile function of the zero truncated Lindley- Poisson distribution is given by: ∈ (-1/

if Y = max (T 1 , . . ., TM), where 0 < u < 1 and W−1 (·) denotes the negative branch of the Lambert W function because (1 + θ + θ y) > 1 and e, 0) (Jodrá, 201022 JODRA´ P. 2010. Computer generation of random variables with Lindley or Poisson-Lindley distribution via the Lambert W function. Mathematics and Computers in Simulation, 81(4): 851-859.; Ghitany et al., 201215 GHITANY ME, AL-MUTAIRI DK, AL-AWADHI FA & AL-BURAIS MM. 2012. Marshall-Olkin extended Lindley distribution and its application. International Journal of Applied Mathematics, 25(5): 709-721.). ∈ (-1/

Our approach may be generalized in some different ways, for instance, it is important to note that for any probability density function f 1 (y | θ), θ = θ 1 , . . . , θp), and MPoissonT runc (λ) as the discrete distribution, the general marginal probability density function can be written as:

(15)

where Fp (y | θ) = F 1 (y | θ) when Y = min (T 1 , . . ., TM) and Fp (y | θ) = 1−F 1 (y | θ) when Y = max (T 1 , . . ., TM ).

From (15), the cumulative distribution, survival and hazard functions for Y = min (T 1 , . . ., TM ) and Y = max (T 1 , . . ., TM ) can be generically written as:

3 ESTIMATION METHODS

In this section, considering the distribution obtained by the composition of distributions we describe six methods used to estimate λ and θ. For all methods we consider the case when both λ and θ are unknown. This is also considered in the simulation study presented in Section 4. Note that the methods were presented for a general baseline function f 1 (y | θ).

3.1 Maximum Likelihood

Let y = (y 1 , ... , yn ) be a random sample of n size from the distribution obtained by the composition of distributions with parameters λ and θ , the likelihood and log-likelihood function are, respectively:

(16)

(17)

where:

  1. i) Fp (yi | θ) = F 1 (yi | θ) if Y = min (T 1 , . . ., TM )

  2. ii) Fp (yi | θ) = 1 − F 1 (yi | θ) if Y = max (T 1 , . . ., TM ).

The maximum likelihood estimates of θ and λ, MLE and MLE respectively, can be obtained numerically by maximizing the log-likelihood function (17). In this case, the log-likelihood function is maximized by solving numerically ∂/∂θ l (θ,λ | y) = 0 and ∂/∂λ l (θ,λ | y) = 0 in θ and λ, respectively, where:

(18)

(19)

where f 1 (yi | θ) = ∂/∂θ f 1 (yi | θ) and F’p (yi | θ) = ∂/∂θ Fp (yi | θ).

3.2 Ordinary Least-Squares

Let y 1: n < y 2: n ··· < yn : n be the order statistics of a random sample of n size from a distribution with cumulative distribution function F (y). It’s well known that:

(20)

For the distribution obtained by the composition process, the least square estimates OL S and OL S of the parameters θ and λ, respectively, are obtained by minimizing the function:

(21)

when Y = min (T 1 , .. ., TM), and minimizing

(22)

when Y = max (T 1, . . ., TM ).

Therefore, if Y = min (T 1 , .. ., TM), these estimates can also be obtained by solving the nonlinear equations:

(23)

(24)

where:

(25)

(26)

But, if Y = max (T 1 , .. ., TM ), these estimates can also be obtained by solving the nonlinear equations:

(27)

(28)

where:

(29)

(30)

Note that ∆1 and ∆2 are derivative from first order distribution function for parameters θ and λ, respectively.

3.3 Weighted Least-Squares

The weighted least-squares estimates WL S of the parameters θ and λ, respectively, are obtained by minimizing the function:WL S and

(31)

if Y = min (T 1 , .. ., TM ), and minimizing

(32)

if Y = max (T 1 , .. ., TM ).

The correction factor wi is given by:

(33)

Therefore, if Y = min (T 1 , .. ., TM), these estimates can also be obtained by solving the nonlinear equations:

(34)

(35)

where ∆1 (yi : n |θ, λ) and ∆2 (yi : n |θ, α) are given by (25) and (26), respectively.

Thus, if Y = max (T 1 , . . ., TM ), these estimates can also be obtained by solving the nonlinear equations:

(36)

(37)

where ∆1 (yi : n |θ, λ) and ∆2 (yi : n |θ, α) are given by (29) and (30), respectively.

3.4 Maximum Product of Spacings

Cheng & Amin (197910 CHENG RCH & AMIN NAK. 1979. Maximum product-of-spacings estimation with applications to the lognormal distribution. Tech. Rep. 1, University of Wales IST, Department of Mathematics., 198311 CHENG RCH & AMIN NAK. 1983. Estimating parameters in continuous univariate distributions with a shifted origin. Journal of the Royal Statistical Society, Series B45(3): 394-403.) introduced the maximum product of spacings (MPS) method as alternative to MLE for the estimation of parameters of continuous univariate distributions. Ranneby (198433 RANNEBY B. 1984. The maximum spacing method. An estimation method related to the maximum likelihood method. Scandinavian Journal of Statistics. Theory and Applications, 11(2): 93-112.) independently developed the same method as an approximation of Kullback-Leibler measure of information. In what follows, let y1:n < y2:n < ··· < yn : n be an ordered random sample drawn from the general model of composition distribution. Are defined as the uniform spacings of the sample the quantities: D 1 = F (y 1: n | θ, λ), Dn +1 = 1 − F (tn : n | θ, λ) and Di = F (ti : n | θ, λ) − F t ( i −1): n | θ, λ , i = 2, ... , n. There are (n + 1) spacings of the first order.

Following Cheng & Amin (198311 CHENG RCH & AMIN NAK. 1983. Estimating parameters in continuous univariate distributions with a shifted origin. Journal of the Royal Statistical Society, Series B45(3): 394-403.), the maximum product of spacings method consists in finding the values of θ and λ which maximize the geometric mean of the spacings, the MPS statistics, is given by:

(38)

or, equivalently, its logarithm H = log(G). Considering 0 = F (t 0: n | θ, λ) < F (y 1: n | θ, λ) < ··· < F (yn : n | θ, λ) < F y ( n +1 ) : n | θ, λ = 1 the quantitie H = log(G) can be calculated as:

(39)

The estimates for θ and λ can be found solving, respectively in θ and λ, the nonlinear equations:

(40)

(41)

where ∆ is the first order difference operator.

Cheng & Amin (198311 CHENG RCH & AMIN NAK. 1983. Estimating parameters in continuous univariate distributions with a shifted origin. Journal of the Royal Statistical Society, Series B45(3): 394-403.) showed that maximizing H as a method of parameter estimation is as efficient as MLE estimation and the MPS estimators are consistent under more general conditions than the MLE estimators.

Therefore, if Y = min (T 1 , .. ., TM ), the estimates MP S and MP S can be obtained by solving the nonlinear equations:

(42)

(43)

Thus, if Y = max (T 1 , .. ., TM ), the estimates MP S can be obtained by solving the nonlinear equations:MP S and

(44)

(45)

3.5 Minimum distance methods

In this subsection we present two estimation methods for θ and λ based on the minimization of the goodness-of-fit statistics. This class of statistics is based on the difference between the estimate of the cumulative distribution function and the empirical distribution function (Luceño, 200628 LUCEN˜O A. 2006. Fitting the generalized pareto distribution to data using maximum goodness-of-fit estimators. Computational Statistics & Data Analysis, 51(2): 904-917.).

3.5.1 Crame´r-von-Mises

The Crame´r-von-Mises estimates of the parameters CM and CM, respectively, are obtained by minimizing, in θ and λ, the function:

(46)

These estimates can also be obtained by solving the nonlinear equations:

(47)

(48)

where ∆1 (·|θ, λ) and ∆2 (·|θ, λ) are given, respectively, by (25) and (26) if Y = min(T 1 ,... , TM ) and, respectively, by (29) and (30) if Y = max(T 1 , ... , TM ).

3.5.2 Anderson-Darling

The Anderson-Darling estimates of the parameters AD , respectively, are obtained by minimizing, with respect to θ and λ, the function:AD and

(49)

These estimates can also be obtained by solving the nonlinear equations:

(50)

(51)

where ∆1 (·|θ, λ) and ∆2 (·|θ, λ) are given, respectively, by (25) and (26) if Y = min(T 1 ,... , TM ) and, respectively, by (29) and (30) if Y = max(T 1 , ... , TM ).

4 SIMULATION STUDY

In this section we present results of some numerical experiments to compare the performance of the different estimation methods discussed in the previous section. We have taken sample sizes n = 20, 50, 100 and 200, θ = 1.0 and λ = 0.5, 1.0, 2.0, 3.0 and 5.0. For each combination (n,θ, λ) we have generated B = 500, 000 pseudo random samples from the zero truncated Lindley-Poisson distribution.

The estimates were obtained in Ox version 6.20 (Doornik, 200713 DOORNIK JA. 2007. Object-Oriented Matrix Programming Using Ox, 3rd ed. London: Timberlake Consultants Press and Oxford.) using MaxBFGS function in MLE, OLS, WLS, MPS, CM and AD methods. For each estimate we computed the bias, the root mean-squared error, the average absolute difference between the true and estimate distributions functions and the maximum absolute difference between the true and estimate distributions functions, respectively, as:

(52)

(53)

(54)

(55)

In Tables 1, 2, 3, 4 and 5 we show the calculated values of (52)-(55). The superscript values indicate the rank obtained by each of the methods considered, and the total line shows the global rank for each method based on measures (52)-(55).

For the simulations, the MLE method proved to be the most efficient for estimate the parameters of zero truncated Lindley-Poisson distribution to Y = min(T1, . . . , TM ) when λ= 0.5 and λ = 1.0. For λ = 2.0, 3.0 and 5.0, the OLS, in general, proved to be better.

Table 1
Simulations results for θ = 1.0 and λ = 0.5.
Table 2
Simulations results for θ = 1.0 and λ = 1.0.
Table 3
Simulations results for θ = 1.0 and λ = 2.0.
Table 4
Simulations results for θ = 1.0 and λ = 3.0.
Table 5
Simulations results for θ = 1.0 and λ = 5.0.

In Tables 6, 7, 8, 9 and 10 we show the calculated values of (52)-(55). The superscript values indicate the rank obtained by each of the methods considered, and the total line shows the global rank for each method based on measures (52)-(55).

Table 6
Simulations results for θ = 1.0 and λ = 0.5.
Table 7
Simulations results for θ = 1.0 and λ = 1.0.
Table 8
Simulations results for θ = 1.0 and λ = 2.0.
Table 9
Simulations results for θ = 1.0 and λ = 3.0.
Table 10
Simulations results for θ = 1.0 and λ = 5.0.

In general, the MPS method proved to be the best method to estimate the parameters of the zero truncated Lindley-Poisson distribution to Y = max(T 1 , ..., TM ). The MLE method showed the worst results even for the large sample size. For future work, further study the zero truncated Lindley-Poisson distribution to Y = max(T 1 , ... , TM ) to understand why the MLE method was not as good would be very relevant.

For λ = 0.5 and 1.0 the MPS method had the highest rank and AD method the second. For λ = 3.0 and 5.0 the AD method was the best, the MPS rank was the second one, only when n = 20 the MPS was better, and for λ = 2.0, the MPS was better for n = 20 and 50 while the AD was better for n = 100 and 200.

5 REAL DATA APPLICATION

In this section we fit the zero truncated Lindley-Poisson distribution (LP) to a real data set. For comparison, we also have considered four alternative models: the one parameter Lindley f (y| θ) = θ 2/1+θ (1 + y) e−θ y , the weighted Lindley distribution (WL)

the exponentiated or generalized Lindley distribution (EL)

and the power Lindley distribution (PL)

The data set was extracted from Lee & Wang (200324 LEE ET & WANG JW. 2003. Statistical methods for survival data analysis, 3rd Edition. Wiley Series in Probability and Statistics. Hoboken, NJ.) and refers to remission times (in months) of a randomly censored of 137 bladder cancer patients. Out of 137 data points, 9 observations are right censored. We considered (y 1 , y 2 , ..., yn ) the observed values from Y = min (T 1 , ... , TM ).

In Table 11 we present, for all models, the maximum likelihood, maximum product of spacings, ordinary least-squares, weighted least-squares, Crame´r-von-Mises and Anderson-Darling estimates for θ and λ and its respectivally standard errors estimates. The maximum likelihood estimates were obtained in SAS/SEVERITY procedure (SAS, 201137 SAS. 2011. SAS/ETS® User’s Guide, Version 9.33. Cary, NC: SAS Institute Inc.) and others estimates were obtained in R version 2.15, using the “fitdist”, “max.Lik” and “nls” functions. The dotted in Table 11 indicates is not possible to calculate standard errors estimates to the Crame´r-von-Mises and Anderson-Darling methods.

Table 11
Maximum likelihood, Maximum product of spacings, Ordinary least-squares, Weighted least-squares, Crame´r-von-Mises and Anderson-Darling estimates and (standard errors) estimates.

From Table 11, it is observed that all estimation methods were effective to estimate the parameters θ and λ, in addition, the standard errors there were small.

The SAS/SEVERITY procedure can fit multiple distributions at the same time and choose the best distribution according to a specified selection criterion. Seven different statistics of fit can be used as selection criteria. They are log likelihood, Akaike’s information criterion (AIC), corrected Akaike’s information criterion (AICC), Schwarz Bayesian information criterion (BIC), Kolmogorov-Smirnov statistic (KS), Anderson-Darling statistic (AD) and Crame´r-von-Mises statistic (CvM). The calculed values of theses statistics are report in Table 12. In Figure 6 is possible to see similar fit for the five models applied to the data set.

Table 12
2log-likelihood values and goodness of fit measures.

Figure 6
Fitted survival curves.

A close examination of Table 12 reveals that the zero truncated Lindley-Poisson model is the best choice among the competing models, since it has the lowest AIC, AICC and others statistics. This is also supported by the survival curves in Figure 6.

6 CONCLUDING REMARKS

In this paper we proposed the composed zero truncated Lindley-Poisson distribution, which was obtained by compounding an one parameter Lindley distribution with a zero truncated Poisson under the first and last failure time when a device is subjected to the presence of an unknown number M of causes of failures. Both alternative distributions have the one parameter Lindley distribution as a particular case. For the first distribution we assume we have a series system and observe the time to the first failure, Y = min(T 1 , ... , TM ) while for the second distribution we assume we have a parallel system and observe the time to the last failure of the device,Y = max(T 1 , ... , TM ).

We compared, via intensive simulation experiments, the estimation of parameters of the zero truncated Lindley-Poisson distribution using six known estimation methods, namely: the maximum likelihood, maximum product of spacings, ordinal and weighted least-squares, Crame´r-von Mises and Anderson-Darling.

In general, the methods of estimation showed to be efficient to estimate the parameters of the zero truncated Lindley-Poisson distribution. Motivated by application in real data set, we hope this model may be able to attract wider applicability in survival and reliability. For possible future works, there the interest of authors in studies of the Fisher information matrix, Confidence intervals, Hypothesis test and Bayesian estimates.

REFERENCES

  • 1
    ALI S. 2015. On the bayesian estimation of the weighted lindley distribution. Journal of Statistical Computation and Simulation, 85(5): 855-880.
  • 2
    ARNOLD BC, BALAKRISHNAN N & NAGARAJA HN. 2008. A first course in order statistics. Vol. 54 of Classics in Applied Mathematics. Society for Industrial and Applied Mathematics (SIAM), Philadelphia, PA.
  • 3
    BAKOUCH HS, AL-ZAHRANI BM, AL-SHOMRANI AA, MARCHI, VITOR AA & LOUZADA-NETO F. 2012. An extended Lindley distribution. Journal of the Korean Statistical Society, 41(1): 75-85.
  • 4
    BARRETO-SOUZA W & BAKOUCH HS. 2013. A new lifetime model with decreasing failure rate. Statistics: A Journal of Theoretical and Applied Statistics, 47(2): 465-476.
  • 5
    BASU AP. 1981. Identifiability problems in the theory of competing and complementary risks - a survey. In: Statistical distributions in scientific work, Vol. 5 (Trieste, 1980). Vol. 79 of NATO Adv. Study Inst. Ser. C: Math. Phys. Sci. Reidel, Dordrecht, pp. 335-347.
  • 6
    BORAH M & BEGUM RA. 2002. Some properties of Poisson-Lindley and its derived distributions. Journal of the Indian Statistical Association, 40(1): 13-25.
  • 7
    BORAH M & DEKA NA. 2001. Poisson-Lindley and some of its mixture distributions. Pure and Applied Mathematika Sciences, 53(1-2): 1-8.
  • 8
    BORAH M & DEKA NA. 2001. A study on the inflated Poisson Lindley distribution. Journal of the Indian Society of Agricultural Statistics, 54(3): 317-323.
  • 9
    CHECHILE RA. 2003. Mathematical tools for hazard function analysis. Journal of Mathematical Psychology, 47(5-6): 478-494.
  • 10
    CHENG RCH & AMIN NAK. 1979. Maximum product-of-spacings estimation with applications to the lognormal distribution. Tech. Rep. 1, University of Wales IST, Department of Mathematics.
  • 11
    CHENG RCH & AMIN NAK. 1983. Estimating parameters in continuous univariate distributions with a shifted origin. Journal of the Royal Statistical Society, Series B45(3): 394-403.
  • 12
    DAVID HA & MOESCHBERGER ML. 1978. The Theory of Competing Risks. Vol. 39 of Griffin’s Statistical Monograph Series. Macmillan Co., New York.
  • 13
    DOORNIK JA. 2007. Object-Oriented Matrix Programming Using Ox, 3rd ed. London: Timberlake Consultants Press and Oxford.
  • 14
    GHITANY M, AL-MUTAIRI D, BALAKRISHNAN N& AL-ENEZI L. 2013. Power lindley distribution and associated inference. Computational Statistics & Data Analysis, 64: 20-33.
  • 15
    GHITANY ME, AL-MUTAIRI DK, AL-AWADHI FA & AL-BURAIS MM. 2012. Marshall-Olkin extended Lindley distribution and its application. International Journal of Applied Mathematics, 25(5): 709-721.
  • 16
    GHITANY ME, AL-MUTAIRI DK & NADARAJAH S. 2008. Zero-truncated Poisson-Lindley distribution and its application. Mathematics and Computers in Simulation, 79(3): 279-287.
  • 17
    GHITANY ME & AL-MUTARI DK. 2008. Size-biased Poisson-Lindley distribution and its application. METRON - International Journal of Statistics, 66(3): 299-311.
  • 18
    GHITANY ME, ALQALLAF F, AL-MUTAIRI DK & HUSAIN HA . 2011. A two-parameter weighted Lindley distribution and its applications to survival data. Mathematics and Computers in Simulation, 81: 1190-1201.
  • 19
    GHITANY ME, ATIEH B & NADARAJAH S. 2008. Lindley distribution and its application. Mathematics and Computers in Simulation, 78(4): 493-506.
  • 20
    GLASER RE. 1980. Bathtub and related failure rate characterizations. Journal of the American Statistical Association, 75(371): 667-672.
  • 21
    HEMMATI F, KHORRAM E& REZAKHAH S. 2011. A new three-parameter ageing distribution. Journal of Statistical Planning and Inference, 141(7): 2266-2275.
  • 22
    JODRA´ P. 2010. Computer generation of random variables with Lindley or Poisson-Lindley distribution via the Lambert W function. Mathematics and Computers in Simulation, 81(4): 851-859.
  • 23
    KUS¸ C. 2007. A new lifetime distribution. Computational Statistics & Data Analysis, 51(9): 4497- 4509.
  • 24
    LEE ET & WANG JW. 2003. Statistical methods for survival data analysis, 3rd Edition. Wiley Series in Probability and Statistics. Hoboken, NJ.
  • 25
    LINDLEY D. 1965. Introduction to Probability and Statistics from a Bayesian Viewpoint, Part II: Inference. Cambridge University Press, New York.
  • 26
    LINDLEY DV. 1958. Fiducial distributions and Bayes’ theorem. It Journal of the Royal Statistical Society. Series B. Methodological, 20: 102-107.
  • 27
    LU W & SHI D. 2012. A new compounding life distribution: the Weibull-Poisson distribution. Journal of Applied Statistics, 39(1): 21-38.
  • 28
    LUCEN˜O A. 2006. Fitting the generalized pareto distribution to data using maximum goodness-of-fit estimators. Computational Statistics & Data Analysis, 51(2): 904-917.
  • 29
    MAHMOUDI E & ZAKERZADEH H. 2010. Generalized Poisson Lindley distribution. Communications in Statistics Theory and Methods, 39: 1785-1798.
  • 30
    MAZUCHELI J & ACHCAR JA. 2011. The Lindley distribution applied to competing risks lifetime data. Computer Methods and Programs in Biomedicine, 104(2): 188-192.
  • 31
    NADARAJAH S, BAKOUCH HS & TAHMASBI R. 2011. A generalized Lindley distribution. Sankhya˜, B 73(2): 331-359.
  • 32
    OLUYEDE B & YANG T. 2015. A new class of generalized lindley distributions with applications. Journal of Statistical Computation and Simulation, 85(10): 2072-2100.
  • 33
    RANNEBY B. 1984. The maximum spacing method. An estimation method related to the maximum likelihood method. Scandinavian Journal of Statistics. Theory and Applications, 11(2): 93-112.
  • 34
    REZAEI S & TAHMASBI R. 2012. A new lifetime distribution with increasing failure rate: Exponential truncated Poisson. Journal of Basic and Applied Scientific Research, 2(2): 1749-1762.
  • 35
    RISTIC´ MM & NADARAJAH S. 2012. A new lifetime distribution. Journal of Statistical Computation and Simulation iFirst, 1-16.
  • 36
    SANKARAN M. 1970. The discrete Poisson-Lindley distribution. Biometrics, 26: 145-149.
  • 37
    SAS. 2011. SAS/ETS® User’s Guide, Version 9.33. Cary, NC: SAS Institute Inc.
  • 38
    ZAKERZADEH H & DOLATI A. 2009. Generalized Lindley distribution. Journal of Mathematical Extension, 3: 13-25.
  • 39
    ZAMANI H & ISMAIL N. 2010. Negative Binomial-Lindley distribution and its application. Journal of Mathematics and Statistics, 6(1): 4-9.

Publication Dates

  • Publication in this collection
    Sep-Dec 2016

History

  • Received
    23 Jan 2015
  • Accepted
    20 Oct 2016
Sociedade Brasileira de Pesquisa Operacional Rua Mayrink Veiga, 32 - sala 601 - Centro, 20090-050 Rio de Janeiro RJ - Brasil, Tel.: +55 21 2263-0499, Fax: +55 21 2263-0501 - Rio de Janeiro - RJ - Brazil
E-mail: sobrapo@sobrapo.org.br