SciELO - Scientific Electronic Library Online

 
vol.36 número2A COMPARATIVE STUDY BETWEEN ARTIFICIAL NEURAL NETWORK AND SUPPORT VECTOR MACHINE FOR ACUTE CORONARY SYNDROME PROGNOSISINFERENCE OF THE MARKET VALUE OF URBAN LOTS FROM THE PERSPECTIVE OF SURVIVAL ANALYSIS. CASE STUDY: CITY OF SÃO CARLOS, SÃO PAULO, BRAZIL índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

Compartilhar


Pesquisa Operacional

versão impressa ISSN 0101-7438versão On-line ISSN 1678-5142

Pesqui. Oper. vol.36 no.2 Rio de Janeiro mai./ago. 2016

https://doi.org/10.1590/0101-7438.2016.036.02.0345 

Articles

AN ALTERNATIVE REPARAMETRIZATION FOR THE WEIGHTED LINDLEY DISTRIBUTION

Josmar Mazucheli1  * 

Emílio Augusto Coelho-Barros2 

Jorge Alberto Achcar3 

1Universidade Estadual de Maringá, Departamento de Estatìstica, Maringá, PR, Brasil. E-mail: jmazucheli@gmail.com

2Universidade Tecnológica Federal do Paraná, Departamento Acadêmico de Matemática, Cornélio Procópio, PR, Brasil. E-mail: eabarros@utfpr.edu.br

3Universidade de São Paulo, Departamento de Medicina Social, Ribeirão Preto, SP, Brasil. E-mail: achcar@fmrp.usp.br


ABSTRACT

Recently,12 introduced a generalization of a one parameter Lindley distribution and named it as a weighted Lindley distribution. Considering this new introduced weighted Lindley distribution, we propose a reparametrization on the shape parameter leading it to be orthogonal to the other shape parameter. In this alternative parametrization, we get a direct interpretation for this transformed parameter which is the mean survival time. For illustrative purposes, the weighted Lindley distribution on the new parametrization is applied on two real data sets. The one parameter Lindley distribution and its generalized form are fitted for the considered data sets.

Keywords: generalized Lindley distribution; Lindley distribution; orthogonal parameters; survival analysis; weighted Lindley distribution

1 INTRODUCTION

A non negative random variable T follows the two-parameter weighted Lindley distribution,12, with shape parameters μ > 0 and β > 0 if its probability density function is given by:

(1)

where t>0 and Γ = (β) = ∫0 t β - 1 e - tdt is the gamma function. From (1), the corresponding survival and hazard functions, are given, respectively, by:

(2)

and

(3)

where Γ(a, b), a > 0 and b ≥ 0, is the upper incomplete gamma function (see,29), defined as ∫0 t β - 1 e - tdt.

In (1), taking the shape parameter β = 1 we have the one parameter Lindley distribution as a special case. The one parameter Lindley distribution was introduced by Lindley (see, Lindley, 1958 and 1965) as a new distribution useful to analyze lifetime data, especially in applications modeling stress-strength reliability.13 studied the properties of the one parameter Lindleydistribution under a careful mathematical approach. These authors also showed, in a numerical example, that the Lindley distribution usually gives better fit for the data when compared to the standard Exponential distribution. A generalized Lindley distribution, which includes as special cases the Exponential and Gamma distributions was introduced by36. (Ghitany and Al-Mutari, 2008) considered a size-biased Poisson-Lindley distribution and31 introduced the Poisson-Lindley distribution to model count data. Some properties of the Poisson-Lindley distribution, its derived distributions and some mixtures of this distribution were studied by5), (6), (24. A zero-truncated Poisson-Lindley was considered in10. A study on the inflated Poisson-Lindley distribution was presented in7 and the Negative Binomial-Lindley distribution was introduced in37. The one parameter Lindley distribution in the competing risks scenario wasconsidered in26.

Since the standard one parameter Lindley distribution does not provide enough flexibility to analyze different types of lifetime data, the two-parameter weighted Lindley distribution could be a good alternative in the analysis of lifetime data. A nice feature of the two-parameter weighted Lindley distribution is that its hazard function has a bathtub form for 0 < β < 1 and it is increasing for β ≥ 1, for all μ > 0.

It is important to point out, that in the last years, several distributions have been introduced in the literature to model bathtub hazard functions but in general these distributions have three or more parameters usually depending on numerical methods to find the maximum likelihood estimates which could be, in general, not very accurate. In this case good reparametrizations with less parameters could be very useful in applications. A comprehensive review of the existing know distributions that exhibit bathtub shape is provided in30), (17), (3), (28. In addition to the weighted Lindley distribution, that can be used to model bathtub-shaped failure rate, we also could consider as alternatives, four other two-parameter distributions introduced in the literature18), (8), (15), (35 with this behavior.

The main goal of this paper is to propose an alternative parametrization for the one shape parameter of the weighted Lindley distribution. In the proposed parametrization, we get the new parameter orthogonal to the other shape parameter where this new reparametrized form of the parameter gives the mean survival time. The obtained orthogonality of the reparametrized form of the parameter is related to the observed Fisher information2. Orthogonal parameters have many advantages in the inference results as, for example, for large sample sizes we have independence among the maximum likelihood of the orthogonal parameters, since the Fisher information matrix is diagonal. Other advantage of orthogonal parameters is related to the conditional likelihood approach (for further details see, Cox & Reid, 1987; Lancaster, 2002; Louzada-Neto & Pardo-Fernandez, 2001; Louzada & Cavali, 2014).

The paper is organized as follows. In Section 2 the likelihood function for the two-parameter weighted Lindley distribution is formulated where we also present the proposed orthogonal reparametrization. Two examples considering real data sets are provided in Section 3 where its observed that the weighted Lindley distribution gives better fit for the data when compared to the one-parameter Lindley distribution and the generalized Lindley distribution. Some conclusions are presented in Section 4.

2 THE LIKELIHOOD FUNCTION

Let t = (t 1, ..., tn ) be a realization of the random sample T = (T 1, ..., Tn ), where T 1, ..., Tn are i.i.d. (identically independent distribution) random variables according to a two-parameter Lindley distribution, with shape parameters μ > 0 and β > 0. From (1) the likelihood function can be written as:

(4)

where T 0 = ti and Γ = (β) = ∫0∞yβ - 1e-ydy is the gamma function. From (4), the log-likelihood function for μ and β, l(μ, β|t), is given by:

(5)

where T1 = log(ti) and T2 = log(1 + ti).

Differentiating (5) with respect to μ and β and setting the results equal to zero we have:

(6)

(7)

where ψ(β) = logΓ(β) is the digamma function. The maximum likelihood estimates, and , for μ and β, respectively, are obtained by solving equations (6) and (7) in μ and β, respectively.

From (6), the maximum likelihood estimate for μ is obtained as a function of β, (β), given by:

(8)

In this way, replace μ in (7) by (β) given by (8), which leads to an equation with only one variable β. After choose an initial value for β, use standard Newton-Raphson algorithm to find the maximum likelihood estimator for β. With the obtained maximum likelihood estimator for β get the maximum likelihood estimator for μ using equation (8).

Based on a single observation, the observed information matrix, , is given by:

(9)

where ψ'(β) = logΓ(β), and the terms in the (2 × 2) observed Fisher information matrix are obtained from the second derivatives given by,

and

The maximum likelihood estimates for μ and β have asymptotic bivariate normal distribution with mean (μ, β) and variance-covariance matrix given by the inverse of the Fisher Information matrix (9) locally at the maximum likelihood estimates and . Since the data is independent, the information matrix is equal to the expected information matrix.

In this paper we propose to reparametrize the two-parameter weighted Lindley distribution such that (μ, β) is transformed to (θ, β), where:

(10)

where θ > 0 is the mean of the weighted Lindley distribution with parameters μ and β and

Using the construction method of orthogonality parameters, proposed in(9), and from (9) we observe that μ is obtained as solution of the following orthogonality differential equation:

(11)

In this new parametrization we have that the maximum likelihood estimate for θ is given by = n -1ti and Coυ(, ) = 0. The orthogonality between θ and β implies that the information matrix is asymptotically diagonal which implies that the the maximum likelihood estimates and are asymptotically independent. The orthogonality simplify the parameters estimation process and its interpretation. For the weighted Lindley distribution the parameter interpretation in the orthogonal parametrization is obvious since is the mean time to failure. Further orthogonality consequences are pointed out in9.

3 APPLICATIONS

In this section we fit the two-parameter weighted Lindley distribution (WL) to two real data sets. For comparative purposes we also have considered two alternative models: (L): the one parameter Lindley distribution, f(t|μ) = (1 + t)e-μt, and (GL): the generalized Lindley distribution f(t|μ, β, γ) = (β + γt)tβ - 1e-μt,(36).

The first data set was reported by(4), and employed by(14) among others, represents the survival times (in days) of 72 guinea pigs infected with virulent tubercle bacilli, regimen 4.3. The regimen number is the common log of the number of bacillary units in 0.5 ml of challenge solution. The second data set was extracted from(33), see also(25), representing hours to failure of 59 test conductors of 400-micrometer length. All specimens ran to failure at a certain high temperature and current density. The 59 specimens were all tested under the same temperature and current density.

Table 1 list for the two data sets and models L, WL and GL the maximum likelihood estimates and their standard errors. For comparative purposes the estimates are also presented in the original parameterization and were obtained using SAS/NLMIXED procedure,(32), by applying the Newton-Raphson algorithm. For the WL model, in the orthogonal parameterization, we have = = 176.82 (data set 1) and = = 6.98 (data set 2). The standard errors are given, respectively, by 11.86 and 0.21.

Table 1 Maximum likelihood (standard error) estimates for Lindley (L), weighted Lindley (WL) and generalized Lindley (GL) distribution. 

Data Model μ β γ
L 0.0112 (0.0009)
data 1 WL 0.0175 (0.0030) 2.1052 (0.4872)
GL 0.0182 (0.0034) 2.3367 (0.7700) 0.1474 (0.4428)
L 0.2572 (0.0239)
data 2 WL 2.6100 (0.4796) 17.3488 (3.3002)
GL 2.6242 (0.4949) 17.6186 (4.4834) 6.0704 (62.5868)

In Table 2 are listed standard model selection measures: -2×log-likelihood, AIC (Akaike's Information Criterion,1) and BIC (Schwarz's Bayesian Information Criterion,34). From the values of these statistics we conclude that the two parameter Lindley distribution provides a better fit for the data sets when compared to the two alternative models. For the WL model the obtained estimates for θ are respectively given by: 176.82 (data set 1) and 6.98 (data set 2). The standard errors are given by, 11.86 and 0.21, respectively.

Table 2 Model selection measures. 

Data Model -2 log-like AIC BIC
L 858.6 860.6 862.8
Data 1 WL 851.5 855.5 860.1
GL 851.3 857.3 864.2
L 316.7 318.7 320.8
Data 2 WL 223.6 227.6 231.8
GL 223.6 229.6 235.9

For illustrative purposes, we present in Figure 1 the 50%, 90% and 95% likelihood contour plots in the original and the proposed orthogonal parametrization. In contrast to panels (b, data set 1) and (d, data set 2), the orientation of the contours in panels (a, data set 1) and (c, data set 2) revels a high positive correlation between and . We observe in panels (b) and (d) that the axes of the elliptical contours are parallel to the coordinate axes, and for this reason we have an indication that the correlation is equal to zero. Naturally, this is expected since θ and β are estimated independently. These contours were built using the procedure described in16 and also presented in27.

Figure 1 (a, c): Contour plot for the joint relative likelihood function of β and μ. (b, d): Contour plot for the joint relative likelihood function of β and θ. 

In Table 3 we present, for both data sets, the corresponding p-values to Kolmogorov-Smirnov (K-S) and Anderson-Darling (A-D) goodness-of-fit statistics. From these results, it is clear that the WL distribution provides a good fit to the given data sets. We also consider the Log-Normal distribution (LN) in the data analysis, since this distribution was considered by14 (data set 1) and by25 (data-set 2).

Table 3 Kolmogorov-Smirnov and Anderson-Darling goodness-of-fit statistics. 

data set 1
Test L WL GL LN
K-S 0.0274 0.5681 < 0.0001 0.3200
A-D 0.1037 0.6668 < 0.0001 0.5274
data set 2
Test L WL GL LN
K-S < 0.0001 0.9085 < 0.0001 0.7335
A-D < 0.0001 0.9776 < 0.0001 0.8744

4 CONCLUDING REMARKS

In this paper we introduced an alternative parametrization for the shape parameter of the weighted Lindley distribution (WL) introduced by12, which generalizes the one parameter Lindley distribution. In the proposed parametrization, the new parameter have a direct interpretation and it is orthogonal to the shape parameter.

In the last years, the Lindley distribution have been considered in several applications as an alternative lifetime model and its generalization called weighted Lindley distribution considering the orthogonal parametrization could be another good alternative distribution to modeling lifetime data. We fitted the WL distribution to two real data sets and compared the obtained results with those of L and GL distributions, which showed the great potentialities of the WL distribution.

REFERENCES

1 AKAIKE H. 1983. Information measures and model selection. In: 'Proceedings of the 44th session of the International Statistical Institute, Vol. 1 (Madrid, 1983)', Vol. 50, pp. 277-290. With a discussion in Vol. 3, pp. 209-219. [ Links ]

2 BARNDORFF-NIELSEN OE & COX DR. 1994. Inference and asymptotics, Vol. 52 of Monographs on Statistics and Applied Probability, Chapman & Hall, London. [ Links ]

3 BEBBINGTON M, LAI C & ZITIKIS R. 2007. Bathtub-type curves in reliability and beyond. Australian & New Zealand Journal of Statistics, 49(3): 251-265. [ Links ]

4 BJERKEDAL T. 1960. Acquisition of resistance in guinea pigs infected with different doses of virulent tubercle bacilii. Amer. J. Hyg., 72: 130-148. [ Links ]

5 BORAH M & BEGUM RA. 2002. Some properties of Poisson-Lindley and its derived distributions. Journal of the Indian Statistical Association, 40(1): 13-25. [ Links ]

6 BORAH M & DEKA NATH A. 2001. Poisson-Lindley and some of its mixture distributions. Pure and Applied Mathematika Sciences, 53(1-2): 1-8. [ Links ]

7 BORAH M & DEKA NATH A. 2001. A study on the inflated Poisson Lindley distribution. Journal of the Indian Society of Agricultural Statistics, 54(3): 317-323. [ Links ]

8 CHEN Z. 2000. A new two-parameter lifetime distribution with bathtub shape or increasing failure rate function. Statistics & Probability Letters, 49: 155-161. [ Links ]

9 COX DR & REID N. 1987. Parameter orthogonality and approximate conditional inference. Journal of the Royal Statistical Society. Series B, 49(1): 1-39. With a discussion. [ Links ]

10 GHITANY ME, AL-MUTAIRI DK & NADARAJAH S. 2008. Zero-truncated Poisson-Lindley distribution and its application. Mathematics and Computers in Simulation, 79(3): 279-287. [ Links ]

11 GHITANY ME & AL-MUTARI DK. 2008. Size-biased Poisson-Lindley distribution and its application. METRON - International Journal of Statistics, 66(3): 299-311. [ Links ]

12 GHITANY ME, ALQALLAF F, AL-MUTAIRI DK & HUSAIN HA. 2011. A two-parameter weighted Lindley distribution and its applications to survival data', Mathematics and Computers in Simulation, 81: 1190-1201. [ Links ]

13 GHITANY ME, ATIEH B & NADARAJAH S. 2008. Lindley distribution and its application. Mathematics and Computers in Simulation, 78(4): 493-506. [ Links ]

14 GUPTA RC, KANNAN N & RAYCHAUDHURI A. 1997. Analysis of Lognormal survival data. Mathematical Biosciences, 139: 103-105. [ Links ]

15 HAUPT E & SCHÄBE H. 1992. A new model for a lifetime distribution with bathtub shaped failure rate. Microelectronics Reliability, 32(5): 33-639. [ Links ]

16 KALBFLEISCH JG. 1985. Probability and statistical inference. Vol. 2, Springer Texts in Statistics, second edn, Springer-Verlag, New York. [ Links ]

17 LAI CD, XIE M & MURTHY DNP. 2001. Bathtub-shaped failure rate life distributions. In:'Advances in reliability', Vol. 20 of Handbook of Statistics. North-Holland, Amsterdam, pp. 69-104. [ Links ]

18 LAN Y & LEEMIS LM. 2008. The logistic-exponential survival distribution. Naval Res. Logist., 55(3): 252-264. [ Links ]

19 LANCASTER T. 2002. Orthogonal parameters and panel data. Review of Economic Studies, 69(3): 647-666. [ Links ]

20 LINDLEY D. 1965. Introduction to Probability and Statistics from a Bayesian Viewpoint, Part II: Inference. Cambridge University Press, New York. [ Links ]

21 LINDLEY DV. 1958. Fiducial distributions and Bayes' theorem. Journal of the Royal Statistical Society. Series B. Methodological, 20: 102-107. [ Links ]

22 LOUZADA F & CAVALI W. 2014. On a double reparametrization for accelerated lifetime testing. Chilean Journal of Statistics, 5(1): 37-48. [ Links ]

23 LOUZADA-NETO F & PARDO-FERNANDEZ. 2001. The effect of reparametrization on the accuracy of inferences for accelerated lifetime tests. Journal of Applied Statistics, 28: 703-711. [ Links ]

24 MAHMOUDI E & ZAKERZADEH H. 2010. Generalized Poisson Lindley distribution. Communications in Statistics Theory and Methods, 39: 1785-1798. [ Links ]

25 MARTÍN J & PÉREZ CJ. 2009. Bayesian analysis of a generalized lognormal distribution. Computational Statistics & Data Analysis, 53: 1377-1387. [ Links ]

26 MAZUCHELI J & ACHCAR JA. 2011. The Lindley Distribution Applied to Competing Risks Lifetime Data. Computer Methods and Programs in Biomedicine, 104(2): 188-192. [ Links ]

27 MEEKER WQ & ESCOBAR LA. 1998. Statistical Methods for Reliability Data. John Wiley & Sons, New York. [ Links ]

28 NADARAJAH S. 2008. Bathtub-shaped failure rate functions. Quality & Quantity, 43(5): 855-863. [ Links ]

29 OLVER FWJ, LOZIER DW, BOISVERT RF & CLARK CW. (eds.) 2010. NIST Handbook of mathematical functions, U.S. Department of Commerce National Institute of Standards and Technology, Washington, DC. [ Links ]

30 RAJARSHI S & RAJARSHI MB. 1988. Bathtub distributions: A review. Communications in Statistics. Theory and Methods, 17: 2597-2621. [ Links ]

31 SANKARAN M. 1970. The discrete Poisson-Lindley distribution. Biometrics, 26: 145-149. [ Links ]

32 SAS. 2010. The NLMIXED Procedure, SAS/STAT(r) User's Guide, Version 9.22, Cary, NC: SAS Institute Inc. [ Links ]

33 SCHAFFT HA, STATON TC, MANDEL J & SHOTT JD. 1987. Reproducibility of electromigration measurements. IEEE Transactions on Electronic Device, 34(3): 673-681. [ Links ]

34 SCHWARZ GE. 1978. Estimating the dimension of a model. Annals of Statistics, 6(2): 461-464. [ Links ]

35 SMITH RM & BAIN LJ. 1975. An Exponential Power Life-Test Distribution. Communications in Statistics, 4(5): 469-481. [ Links ]

36 ZAKERZADEH H & DOLATI A. 2009. Generalized Lindley distribution. Journal of Mathematical Extension, 3(4): 13-25. [ Links ]

37 ZAMANI H & ISMAIL N. 2010. Negative Binomial-Lindley distribution and its application. Journal of Mathematics and Statistics, 6(1): 4-9. [ Links ]

Received: February 19, 2015; Accepted: June 04, 2016

*Corresponding author.

Creative Commons License This is an open-access article distributed under the terms of the Creative Commons Attribution License