Acessibilidade / Reportar erro

General results for the Marshall and Olkin's family of distributions

Abstracts

Abstract Marshall and Olkin (1997)Marshall AW and Olkin I. 1997. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84: 641-652. introduced an interesting method of adding a parameter to a well-established distribution. However, they did not investigate general mathematical properties of their family of distributions. We provide for this family of distributions general expansions for the density function, explicit expressions for the moments and moments of the order statistics. Several especial models are investigated. We discuss estimation of the model parameters. An application to a real data set is presented for illustrative purposes.

Marshall-Olkin extended distribution; maximum likelihood estimation; Monte Carlo simulation; Rényi entropy


Marshall and Olkin (1997)Marshall AW and Olkin I. 1997. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84: 641-652.introduziram um método interessante de adicionar um parâmetro a uma distribuição especificada para obter uma família ampla de distribuições. Entretanto, eles não investigaram propriedades matemáticas gerais dessa família. Nós deduzimos para essa classe de distribuições, expansões gerais para a função densidade, expressões explícitas para os momentos e momentos das estatísticas de ordem. Vários modelos especiais são investigados. Nós discutimos a estimação dos parâmetros do modelo. Uma aplicação de um conjunto de dados reais é apresentada para fins ilustrativos.

distribuição estendida Marshall-Olkin; estimação por máxima verossimilhança; simulação de Monte Carlo; entropia de Rényi


INTRODUCTION

Adding parameters to a well-established distribution is a time honored device for obtaining more flexible new families of distributions. (Marshall and Olkin 2007Marshall AW and Olkin I. 2007. Life Distributions. Structure of Nonparametric, Semiparametric and Parametric Families. Springer, New York.) introduced an interesting method of adding a new parameter to an existing distribution. The resulting distribution, known as Marshall-Olkin (M-O) extended distribution, includes the baseline distribution as a special case and gives more flexibility to model various types of data. The M-O family of distributions is also known as the proportional odds family (proportional odds model) or family with tilt parameter (Marshall and Olkin 2007Marshall AW and Olkin I. 2007. Life Distributions. Structure of Nonparametric, Semiparametric and Parametric Families. Springer, New York.).

Let F(x) = 1−F(x) denote the survival function of a continuous random variable X which depends on a parameter vector β = (β 1,…, β q)T of dimension q. Then, the corresponding M-O extended distribution has survival function defined by

(1)
where α > 0 andα = 1−α. Forα = 1, G(x) = F(x). Marshall and Olkin (1997)Marshall AW and Olkin I. 1997. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84: 641-652. have noted that the method has a stability property, i.e., if the method is applied twice, nothing new is obtained in the second time around. Additionally, the extended model is geometrically extremely stable. If Xi = (i = 1, 2, …) is a sequence of independent and identically distributed random variables with cdf F(x) and if N has a geometric distribution taken values {1, 2, …}, then the random variables U = min {X1 , …, XN } and V = max{X1 , …, XN } are distributed as in (1). It implies that the new distribution is geometrically extremely stable. Marshall and Olkin (2007)Marshall AW and Olkin I. 2007. Life Distributions. Structure of Nonparametric, Semiparametric and Parametric Families. Springer, New York. have called the additional shape parameter “tilt parameter”, since the hazard rate of the new family is shifted below (α ≥ 1) or above (0 < α ≤ 1) the hazard rate of the underlying distribution, that is, for all x ≥ 0, h(x) ≤ r(x) when α ≥ 1, and h(x) ≥ r(x) when 0 <α ≤ 1, where h(x) denotes the hazard rate of the transformed distribution and r(x) is that of the original distribution.

Some special cases discussed in the literature include the M-O extensions of the Weibull distribution (Ghitany et al. 2005Ghitany ME, Al-Hussaini EK and Al-Jarallah RA. 2005. Marshall-Olkin extended Weibull distribution and its application to censored data. J Appl Stat 32: 1025-1034., Zhang and Xie 2007Zhang T and XIE M. 2007. Failure data analysis with extended Weibull distribution. Commun Stat Simulat 36: 579-592.), Pareto distribution (Ghitany 2005Ghitany ME. 2005. Marshall-Olkin extended Pareto distribution and its application. Int J Appl Math 18: 17-32.), gamma distribution (Ristić et al. 2007Ristić MM, Jose KK and Ancy J. 2007. A Marshall-Olkin gamma distribution and minification process. STARS: Stress and Anxiety Research Society 11: 107-117.), Lomax distribution (Ghitany et al. 2007Ghitany ME, Al-Awadhi FA and Alkhalfan LA. 2007. Marshall-Olkin extended Lomax distribution and its application to censored data. Commun Stat Theory M 36: 1855-1866.) and linear failure-rate distribution (Ghitany and Kotz 2007Ghitany ME and Kotz S. 2007. Reliability properties of extended linear failure-rate distributions. Probab Eng Inform Sc 21: 441-450.). More recently, Gómez-Déniz (2010)Gómez-Déniz E. 2010. Another generalization of the geometric distribution. Test 19: 399-415. presented a new generalization of the geometric distribution using the M-O scheme. Economou and Caroni (2007)Economou P and Caroni C. 2007. Parametric proportional odds frailty models. Commun Stat Simulat 36: 579-592. showed that the M-O extended distributions have a proportional odds property and Caroni (2010)Caroni C. 2010. Testing for the Marshall-Olkin extended form of the Weibull distribution. Stat Pap 51: 325-336. presented some Monte Carlo simulations considering hypothesis testing on the parameterα for the extended Weibull distribution. Maximum likelihood estimation in M-O family is given in Lam and Leung (2001)Lam KF and Leung TL. 2001. Marginal likelihood estimation for proportional odds models with right censored data. Lifetime Data Anal 7: 39-54. and Gupta and Peng (2009)Gupta RD and Peng C. 2009. Estimating reliability in proportional odds ratio models. Comput Stat Data An 53: 1495-1510.. Gupta et al. (2010)Gupta RC, Lvin S and Peng C. 2010. Estimating turning points of the failure rate of the extended Weibull distribution. Comput Stat Data An 54: 924-934. compared this family and the original distribution with respect to some stochastic orderings and also investigate thoroughly the monotonicity of the failure rate of the resulting distribution when the baseline distribution is taken as Weibull. Nanda and Das (2012)Nanda AK and DAS S. 2012. Stochastic orders of the Marshall-Olkin extended distribution. Stat Probabil Lett 82: 295-302. investigated the tilt parameter of the M-O extended family. The probability density function (pdf) of the M-O extended-F distribution, say g(x), is given by

(2)
where f(x) =dF(x) /dx is the baseline density function corresponding toF(x). Here after, we refer to the family (2) as the M-O extended-F distribution.

General mathematical properties of the M-O extended-F distribution were not derived by Marshall and Olkin (1997)Marshall AW and Olkin I. 1997. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84: 641-652. such as moments and moments of order statistics. In this article, we derive some general structural properties of the M-O extended-F distribution including: (i) expansions for the pdf; (ii) general expressions for the moments; (iii) moments of order statistics; (iv) Rényi entropy. We propose several M-O extended-F distributions taken as baseline in the definitions the Weibull, Fréchet, Pareto, generalized exponential, Kumaraswamy and power function distributions. We discuss maximum likelihood estimation of the model parameters.

The article is organized as follows. Section 2 presents expansions for the density function and for the density function of the order statistics. Explicit expressions for the moments and moments of the order statistics of the M-O extended-F distribution are given in Section 3. Rényi entropy is derived in Section 4. Estimation of the model parameters by maximum likelihood is discussed in Section 5. Section 6 presents an alternative method to estimate the model parameters. In Section 7, we propose several M-O extended-F distributions and discuss some of their properties. Simulation results are performed in Section 8. An application of the current family to a real data set is explored in Section 9. Finally, some concluding remarks are presented in Section 10.

Expansions

Consider the series representation

(3)
which is valid for |z| < 1 and k > 0, where Γ(·) is the gamma function. Ifα ∈ (0,1) using (3) in (2), we obtain
(4)
where

The density function (2) can be expressed as

Hence, for α > 1, using (3) in the last equation yields
(5)
where
We now give the pdf of the ith order statisticXi:n , say gi:n (x), in a random sample of sizen from the M-O extended-F distribution. The pdf ofXi:n can be expressed as
If α ∈ (0,1) using expansion (??) in the last equation, we obtain
(6)
where
For α > 1, we writegi:n (x) becomes
(7)
where
and using (Ref: exp),

Equations (4)-(7) reveal that the density functions of the M-O extended-F distribution and of its order statistics can be expressed as the baseline densityf(x) multiplied by an infinite power series of F(x). They play an important role and will be used to obtain explicit expressions for the moments of the M-O extended-F distribution and of its order statistics in a general framework and for special models.

Moments

Here after, suppose that X has the density function (2). We derive general expressions for the moments ofX and its order statistics in terms of the probability weighted moments (PWMs) of the F distribution. The PWMs, first proposed by Greenwood et al. (1979)Greenwood JA, Landwehr JM, Matalas NC and Wallis JR. 1979. Probability weighted moments: definition and relation to parameters of several distributions expressable in inverse form. Water Resour Res 15: 1049-1054., are expectations of certain functions of a random variable whose mean exists. A general theory for these moments covers the summarization and description of theoretical probability distributions and observed data samples, non parametric estimation of the underlying distribution of an observed sample, estimation of parameters, quantiles of probability distributions and hypothesis tests. The PWMs method can generally be used for estimating parameters of a distribution whose inverse form cannot be expressed explicitly. The PWMs for the baseline F distribution are formally defined by

Thus, from equations (4) and (5), thesth moment of X forα ∈ (0,1) andα > 1 can be written as
(8)
respectively.

Now, using equations (6) and (7), we can determine the sth moment of the ith order statistic Xi:n in a random sample of size n from Xfor α ∈ (0,1) andα > 1 as

(9)
respectively, where the quantities wj,k , vj , uj,l,k and cj,l are defined in Section 2. Thus, the moments of X andXi:n are obtained in terms of infinite weighted sums of PWMs of the baseline F distribution.

Rényi Entropy

The entropy of a random variable is a measure of uncertainty variation and has been used in various situations in science and engineering. The Rényi entropy is defined by

where δ > 0 andσ ≠ 1. For furthers details, the reader is referred to Song (2001)Song KS. 2001. Rényi information, loglikelihood and an intrinsic distribution measure. em J Stat Plan Infer 93: 51-69.. For α ∈ (0,1), using expansion (3), we can write
For α > 1, we obtain
Thus, the Rényi entropy of X can be obtained for α ∈ (0,1) andα > 1 as
(10)
and
(11)
respectively, where

An interesting quantity based on the Rényi entropy is defined bySg = –2d IR (δ) / dδ|δ =1. It is a location and scale-free positive functional and measures the intrinsic shape of a distribution (see, Song 2001Song KS. 2001. Rényi information, loglikelihood and an intrinsic distribution measure. em J Stat Plan Infer 93: 51-69.).

For the purpose of illustration, we derive this quantity for the M-O extended exponential (M-O-EE) distribution, whose density function isx > 0 and λ > 0). Consider the case α ∈ (0,1); the case α > 1 follows in a similar way. Define

The Song's measure for the M-O-EE distributionα ∈ (0,1) takes the form
where
and
where Ψ (·) is the digamma function, Γ′(z)/ = dΓ (z)/ dz andΓ″(z) = dΓ′(z) / dz.(for

Maximum Likelihood

The model parameters of the M-O extended-F distribution can be estimated by maximum likelihood. Letn fromX with unknown parameter vectorθ is

By taking the partial derivatives of the log-likelihood function with respect to α and β, we obtain the components of the score vector
given bybe a random sample of size , wherecorresponds to the parameter vector of the baseline distribution. The log-likelihood function for

Setting these equations to zero, Uθ = 0, and solving them simultaneously yields the maximum likelihood estimate (MLE)Nocedal and Wright 1999Nocedal J and Wright SJ. 1999. Numerical Optimization. Springer: New York.,Press et al. 2007Press WH, Teulosky SA, Vetterling WT and Flannery BP. 2007. Numerical Recipes in C: The Art of Scientific Computing: 3rd ed., Cambridge University Press.) with analytical derivatives can be used for maximizing the log-likelihood function. These equations cannot be solved analytically and statistical software can be used to solve them numerically. For example, the BFGS method (see, .of

The normal approximation for the θ can be used for constructing approximate confidence intervals and confidence regions for the parameters α andβ. Under conditions that are fulfilled for the parameters in the interior of the parameter space, we haveK(θ) is the unit expected information matrix given by

whose elements are given by
, wheremeans approximately distributed and

The asymptotic behavior remains valid if K(θ) = lim n →∞ n −1 J n(θ), where J n(θ) is is the observed information matrix, it is replaced by the average sample information matrix evaluated atn −1

whose elements are
, i.e. . The observed information matrix is given by

We can easily check if the fit of the M-O extended-F model is statistically “superior” to a fit using the F model by testing the null hypothesisthe likelihood ratio (LR) statistic is given byare the unrestricted MLEs obtained from the maximization ofunder the null hypothesis. The null hypothesis is rejected if the test statistic exceeds the upper 100(1–γ)% quantile of thedistribution.and. The limiting distribution of this statistic isunder, whereand. For testing

Estimation-Type Method of Moments

We now present an alternative method to estimate the model parameters. Since the moments cannot be obtained in closed form, the estimation by the method of moments is complicated. However, after some algebra, we obtain

(12)

Thus, we can use (12) to construct a new method of estimation, i.e., if

is a random sample with survival function (1), we can estimate the model parameters from the equation, …,

In Section, we apply the two methods (maximum likelihood and estimation-type method of moments) to estimate the model parameters of the M-O extended family.

Special M-O Extended Models

We motivate the study of Marshall and Olkin's distributions by considering some special models to illustrate the applicability of the previous results. Here, we obtain the moments and Rényi entropy for some special M-O extended-F distributions when the baseline F distribution follows the Weibull, Fréchet, Pareto, generalized exponential, Kumaraswamy and power distributions. Some others M-O extended-F distributions could be proposed and our general results applied to them. Clearly, the quantitiesτp,r are determined from the baseline F cdf.

M-O Extended Weibull Distribution

The M-O extended Weibull (M-O-EW) distribution is defined by takingx, λ, γ > 0. Its associated pdf and cdf are respectively given by

The M-O-EW distribution was studied by Ghitany et al. (2005)Ghitany ME, Al-Hussaini EK and Al-Jarallah RA. 2005. Marshall-Olkin extended Weibull distribution and its application to censored data. J Appl Stat 32: 1025-1034.; see also Barreto-Souza et al. (2011)Barreto-Souza W, de Morais AL and Cordeiro GM. 2011. The Weibull-geometric distribution. J Stat Comput Sim 40: 798-811.. We obtain
and
where the last equation holds for (δ–1)(γ–1) > – 1 From these quantities, we immediately obtain explicit expressions for the moments, moments of the order statistics and Rényi entropy. If γ = 1, the results correspond to the M-O extended exponential distribution.and,

M-O Extended Fréchet Distribution

Here, we consider the Fréchet distribution (forx, σ, λ > 0) with cdf and pdf given byx > 0) reduce to

respectively. After some algebra, we obtain
which is valid for p < λ. Applying this result in (8) and (9), it follows simple expressions for the moments and moments of the order statistics of the M-O-EF distribution. An expression for the Rényi entropy of this distribution is obtained by inserting
in (10) and (11)., respectively. The pdf and survival function of the M-O extended Fréchet (M-O-EF) distribution (forand

M-O Extended Exponentiated Pareto Distribution

The pdf and cdf of the exponentiated Pareto distribution, introduced by Nadarajah (2005)Nadarajah S. 2005. Exponentiated Pareto distributions. Statistics 39: 255-260., forx > θ andθ, k, γ > 0, are given by f(x) = γk x −(k +1) {1– (θ / x)k}γ−1 andF(x) = {1– (θ / x)k}γ, respectively. Applying the functionF(x) in (1), the M-O extended exponentiated Pareto (M-O-EEP) survival function reduces to

The corresponding density function becomes
The M-O-EEP distribution has as special sub-models the Pareto, exponentiated Pareto and extended Pareto distributions for the choicesα = γ = 1,α = 1 and γ = 1, respectively. The extended M-O-Pareto (γ = 1) distribution was studied by Ghitany (2005)Ghitany ME. 2005. Marshall-Olkin extended Pareto distribution and its application. Int J Appl Math 18: 17-32.. Algebraic manipulations lead to
restricted to p < k, and
which is valid forδ(γ–1) > –1 and δ > (k+1)−1, whereis the beta function.

M-O Extended Generalized Exponential Distribution

The pdf and cdf of the generalized exponential distribution, introduced by Gupta and Kundu (1999)Gupta RD and Kundu D. 1999. Generalized exponential distributions. Aust N Z J Stat 41: 173-188., for x > 0 and λ, γ > 0, are given byF(x) = (1 – eλx )γ, respectively. By replacing these quantities in (1) and (2), the pdf and survival function of the M-O extended generalized exponential (M-O-EGE) distribution reduce respectively to (for x > 0)

From (4) and (5), we obtain the moment generating function (mgf) of the M-O-EGE distribution as
for α ∈ (0,1) and
for α > 1. Both formulas hold fort < λ min {1,1+γ}. Hence, their moments can be obtained from the derivatives of the mgf at t= 0. We also have
which leads to a simple expression for the Rényi entropy.and

M-O Extended Kumaraswamy Distribution

Now, we propose a new distribution for modeling rates and proportions. Consider the Kumaraswamy distribution with pdf and cdf in the forms [for x ∈ (0,1) anda, b > 0] f(x) = abxa −1 and F(x) = 1– (1 – xa )b, respectively. This distribution, introduced by Kumaraswamy (1980)Kumaraswamy P. 1980. A generalized probability density function for double-bounded random processes. J Hydrol 46: 79-88., was well investigated byJones (2009)Jones MC. 2009. Kumaraswamy's distribution: A beta-type distribution with some tractability advantages. Stat Methodol 6: 70-81.; see alsoLemonte (2011)Lemonte AJ. 2011. Improved point estimation for the Kumaraswamy distribution. J Stat Comput Sim 81: 1971-1982.. The M-O extended Kumaraswamy distribution, for x ∈ (0,1), have pdf and cdf given respectively by

Hence,
and
restricted to (δ–1)(1– α −1) > –1.

M-O Extended Power Distribution

Our final special case concentrates on the M-O extended power (M-O-EPo) distribution defined (for x∈ (0, 1/θ) andθ > 0) by takingF(x) = (θx)k in (1). For x ∈ (0, 1/θ), the pdf and cdf of the M-O-EPo distribution are given respectively by

Their moments and moments of the order statistics can be obtained by setting
in (8) and (9). We also have
valid for δ (1–k)−1. Replacing the last expression in (10) and (11), it follows simple expressions for the Rényi entropy.

SIMULATION RESULTS

In what follows, we shall present Monte Carlo simulation results. All the Monte Carlo simulation experiments are performed using the Ox matrix programming language (Doornik 2006Doornik JA. 2006. An Object-Oriented Matrix Language - Ox 4: 5th ed. Timberlake Consultants Press: London.). Ox is freely distributed for academic purposes and available at http://www.doornik.com. The number of Monte Carlo replications was R = 20,000.

We apply the estimation methods before discussed in order to estimate the model parameters of the M-O extended-F distribution. We adopt the M-O-EE distribution with pdf and hazard rate given by

respectively, where α > 0 and λ > 0. According to Marshall and Olkin (1997)Marshall AW and Olkin I. 1997. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84: 641-652., this distribution may sometimes be a competitor to the Weibull and gamma distributions. The authors derived several properties of the M-O-EE distribution. For example, they showed thath(x) is decreasing inx for 0 < α ≤ 1 and thath(x) is increasing inx for α ≥ 1. Additionally, λ ≤ h(x) ≤ λ / α for 0 ≤ α≤ 1, λ /αh(x) ≤ λ forα ≤ 1,α ≥ 1 andα ≥ 1. Rao et al. (2009)Rao GS, Ghitany ME and Kantam RRL. 2009. Reliability test plans for Marshall-Olkin extended exponential distribution. Appl Math Sci 55: 2745-2755. developed a reliability test plan for acceptance/rejection of a lot of products submitted for inspection with lifetimes governed by the M-O-EE distribution.for for 0 ≤

For a random sample of size n from this distribution, the total log-likelihood function for the parameter vector

The components of the score vector
Setting Uθ = 0 and solving them simultaneously yields the MLEsα and λ, respectively. The observed information matrix for the parameter vector
The simulation of the M-O-EE independent deviates can be performed using
where, come from the simultaneous solution of the non linear equationsis given byandandis given in the Appendix. The estimates ofof are.

The evaluation of point estimation was performed based on the following quantities for each sample size: (i) mean; (ii) relative bias (the relative bias of an estimator (θ is defined asR Monte Carlo replications. The sample sizes are taken asn = 50 and 150. The values of the parameters were set at λ = 0.5 and α = 0.2, 0.6, 1.5, 1.8, 2.0, 2.5, 3.0, 4.0, 5.0, 7.0, 10 and 20.by Monte Carlo); (iii) root mean squared error,, its estimate being obtained by estimating, where MSE is the mean squared error estimated from!!!) of a parameter

The point estimates are presented in Tables I and II forn = 50 and n = 150, respectively. From these tables, note that the root mean squared errors ofα whereas the root mean squared error ofα. The relative bias ofα increases. Additionally, in most of the cases, the relative bias ofα and λ than the estimation-type method of moments.increase with . Also, the relative bias and the root mean squared error ofandis smaller than the relative bias (in absolute value) ofin most of the cases. Also noteworthy is that as the sample size increases, the root mean squared error decreases. In general, the maximum likelihood method yields better estimates of decreases with also decreases as is smaller than the relative bias (in absolute value) and the root mean squared error of

TABLE I
Point estimates for n=50 and different values of α.

Method of moments α Estimates of α Estimates of λ Mean Rel. Bias Mean Rel. Bias 0.2 0.3781 0.8906 0.3438 0.8216 0.6432 0.6851 0.6 0.8682 0.4469 0.3256 0.6601 0.3201 0.2320 1.5 1.1207 -0.2528 0.4437 0.4197 -0.1607 0.1129 1.8 1.2927 -0.2818 0.6802 0.4064 -0.1872 0.1328 2.0 1.4299 -0.2850 0.8329 0.4033 -0.1935 0.1429 2.5 1.8539 -0.2584 1.2343 0.4058 -0.1884 0.1582 3.0 2.3946 -0.2018 1.7244 0.4160 -0.1680 0.1671 4.0 3.6432 -0.0892 3.0690 0.4338 -0.1324 0.1843 5.0 5.0381 0.0076 4.8922 0.4446 -0.1108 0.1971 7.0 8.2577 0.1797 11.7298 0.4591 -0.0818 0.3087 10.0 13.4647 0.3465 25.7848 0.4603 -0.0794 0.3387 20.0 28.3708 0.4185 53.4771 0.4343 -0.1315 0.2879 MSE: mean squared error; Rel. Bias: relative bias.

TABLE II
Point estimates for n=50 and different values of α.

Method of moments α Estimates of α Estimates of λ Mean Rel. Bias Mean Rel. Bias 0.2 0.2881 0.4407 0.2331 0.6595 0.3191 0.4515 0.6 0.7899 0.3165 0.2714 0.6090 0.2180 0.1732 1.5 1.2103 -0.1932 0.3948 0.4364 -0.1272 0.0936 1.8 1.4839 -0.1756 0.5785 0.4397 -0.1206 0.1080 2.0 1.6932 -0.1534 0.6985 0.4453 -0.1095 0.1133 2.5 2.2637 -0.0945 1.0076 0.4595 -0.0811 0.1192 3.0 2.8963 -0.0346 1.3633 0.4731 -0.0539 0.1212 4.0 4.2494 0.0624 2.3092 0.4919 -0.0162 0.1262 5.0 5.6095 0.1219 3.5487 0.5003 0.0005 0.1330 7.0 8.5945 0.2278 8.6495 0.5117 0.0235 0.1461 10.0 13.3775 0.3378 15.3995 0.5182 0.0365 0.1606 20.0 32.1990 0.6099 53.5464 0.5267 0.0534 0.1878 MSE: mean squared error; Rel. Bias: relative bias.

Tables III and IV evaluate the overall performance of each of the two different estimators, for each value of n. Each entry in Table III corresponds to a specific estimator and a specific value of n. We obtain what we call the integrated relative bias squared norm (Cribari-Neto and Vasconcellos 2002Cribari-Neto F and Vasconcellos KLP. 2002. Nearly unbiased maximum likelihood estimation for the beta distribution. J Stat Comput Sim 72: 107-118.). This is computed as

where the rh 's (h = 1, …, 12) correspond to the twelve different values of the relative bias of each estimator. Similarly, Table IV provides the average root mean squared error expressed as
whereh is the root mean squared error corresponding to the twelve different values of then, Table III provides a measure of the overall performance of the estimators regarding the bias, and Table IV gives a measure of their overall performance regarding the root mean squared error. In short, the figures in these tables reveal that the maximum likelihood method should be preferred than the estimation-type method of moments in order to estimate the model parameters. of each estimator. For each value of

TABLE III
Integrated relative bias squared norm.
TABLE IV
Average root mean squared error.

Application

As Marshall and Olkin (1997)Marshall AW and Olkin I. 1997. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84: 641-652. pointed out the M-O-EE distribution considered in the previous section may be an alternative to the Weibull and gamma distributions. In what follows, we shall present an empirical application to a real data set in which the M-O-EE distribution may be preferred than the Weibull and gamma models. We consider the active repair times (hours) for an airborne communication transceiver given in Jørgensen (1982)Jørgensen B. 1982. Statistical Properties of the Generalized Inverse Gaussian Distribution. Springer-Verlag, New York.. All the computations were done using the Ox matrix programming language (Doornik 2006Doornik JA. 2006. An Object-Oriented Matrix Language - Ox 4: 5th ed. Timberlake Consultants Press: London.).

Table V lists the MLEs of the parameters (standard errors in parentheses) and the values of the log-likelihood functions. The M-O-EE distribution yields the highest value of the log-likelihood function. Plots of the estimated density of all fitted models are given in Figure 1. Note that the M-O-EE model provides a better fit than the other models. In Figure 2, we plot the estimated hazard rate for the M-O-EE, Weibull and gamma distributions. Notice that the estimated hazard ratio of the M-O-EE distribution is decreasing and belongs to the interval, expected.

Figure 1
Estimated pdf of the Marshall-Olkin extended exponential (M-O-EE), Weibull and gamma models for a real data set.

Figure 2
Estimated hazard rate of the Marshall-Olkin extended exponential (M-O-EE), Weibull and gamma models for a real data set.

TABLE V
MLEs of the model parameters and log-likelihood functions.

Now, we shall apply formal goodness-of-fit tests in order to verify which distribution fits better to these data. We apply the Cramér-von Mises (W*) and Anderson-Darling (A*) statistics. The statisticsW* and A* are described in details in Chen and Balakrishnan (1995)Chen G and Balakrishnan N. 1995. A general purpose approximate goodness-of-fit test. J Qual Technol 27: 154-161.. In general, the smaller the values of the statistics W* and A*, the better the fit to the data. Let H(x; θ) be the cdf, where the form of H is known but θ (a k-dimensional parameter vector, say) is unknown. To obtain the statistics W* andA*, one can proceed as follows: (i) Computexi 's are in ascending order; (ii) Computeyi = Φ−1(vi ), where Φ(·) is the standard normal cdf and Φ−1(·) its inverse; (iii) Compute

and
(v) Modify W 2 into W* = (1 + 0.5 / n) and A 2 into A* = A 2(1+0.75 / n+2.25 / n 2). The values of the statistics W* and A* for all models are given in Table VI. According to these statistics, the M-O-EE model fits the current data better than the other models.and, where, where the ; (iv) Calculate

TABLE VI
MLEs of the model parameters and log-likelihood functions.

CONCLUDING REMARKS

We study some mathematical properties of the Marshall and Olkin's family of distributions (Marshall and Olkin 1997Marshall AW and Olkin I. 1997. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84: 641-652.). This family is defined by adding a parameter to a baseline distribution, giving more flexibility to model various type of data. In the last few years, several authors proposed Marshall-Olkin models to extend well-known distributions (see, for example, Ghitany 2005Ghitany ME. 2005. Marshall-Olkin extended Pareto distribution and its application. Int J Appl Math 18: 17-32., Zhang and Xie 2007Zhang T and XIE M. 2007. Failure data analysis with extended Weibull distribution. Commun Stat Simulat 36: 579-592., Ristić et al. 2007Ristić MM, Jose KK and Ancy J. 2007. A Marshall-Olkin gamma distribution and minification process. STARS: Stress and Anxiety Research Society 11: 107-117., Ghitany et al. 2007Ghitany ME, Al-Awadhi FA and Alkhalfan LA. 2007. Marshall-Olkin extended Lomax distribution and its application to censored data. Commun Stat Theory M 36: 1855-1866., Ghitany and Kotz 2007Ghitany ME and Kotz S. 2007. Reliability properties of extended linear failure-rate distributions. Probab Eng Inform Sc 21: 441-450., Gómez-Déniz 2010Gómez-Déniz E. 2010. Another generalization of the geometric distribution. Test 19: 399-415.). In this article, we derive various structural properties of the Marshall-Olkin extended-F distribution not explored before including simple expansions for the density function and explicit expressions for the moments, moments of the order statistics and Rénvy entropy. Our formulas related with this class of distributions are manageable, and with the use of modern computer resources with analytic and numerical capabilities, may turn into adequate tools comprising the arsenal of applied statisticians. Several Marshall-Olkin extended-F distributions are proposed and some of their mathematical properties are given. We discuss maximum likelihood estimation of the model parameters and propose an alternative estimation method. Monte Carlo simulation experiments are also considered. Finally, an empirical application to a real data set is presented.

The observed information matrix for J n(θ), is given by

where the elements are
,

ACKNOWLEDGMENTS

We gratefully acknowledge grants from Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) and Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) Brazil. We thank an anonymous referee for helpful comments that lead to several improvements in this paper.

REFERENCES

  • Barreto-Souza W, de Morais AL and Cordeiro GM. 2011. The Weibull-geometric distribution. J Stat Comput Sim 40: 798-811.
  • Caroni C. 2010. Testing for the Marshall-Olkin extended form of the Weibull distribution. Stat Pap 51: 325-336.
  • Chen G and Balakrishnan N. 1995. A general purpose approximate goodness-of-fit test. J Qual Technol 27: 154-161.
  • Cribari-Neto F and Vasconcellos KLP. 2002. Nearly unbiased maximum likelihood estimation for the beta distribution. J Stat Comput Sim 72: 107-118.
  • Doornik JA. 2006. An Object-Oriented Matrix Language - Ox 4: 5th ed. Timberlake Consultants Press: London.
  • Economou P and Caroni C. 2007. Parametric proportional odds frailty models. Commun Stat Simulat 36: 579-592.
  • Ghitany ME. 2005. Marshall-Olkin extended Pareto distribution and its application. Int J Appl Math 18: 17-32.
  • Ghitany ME, Al-Hussaini EK and Al-Jarallah RA. 2005. Marshall-Olkin extended Weibull distribution and its application to censored data. J Appl Stat 32: 1025-1034.
  • Ghitany ME, Al-Awadhi FA and Alkhalfan LA. 2007. Marshall-Olkin extended Lomax distribution and its application to censored data. Commun Stat Theory M 36: 1855-1866.
  • Ghitany ME and Kotz S. 2007. Reliability properties of extended linear failure-rate distributions. Probab Eng Inform Sc 21: 441-450.
  • Gómez-Déniz E. 2010. Another generalization of the geometric distribution. Test 19: 399-415.
  • Greenwood JA, Landwehr JM, Matalas NC and Wallis JR. 1979. Probability weighted moments: definition and relation to parameters of several distributions expressable in inverse form. Water Resour Res 15: 1049-1054.
  • Gupta RD and Kundu D. 1999. Generalized exponential distributions. Aust N Z J Stat 41: 173-188.
  • Gupta RC, Lvin S and Peng C. 2010. Estimating turning points of the failure rate of the extended Weibull distribution. Comput Stat Data An 54: 924-934.
  • Gupta RD and Peng C. 2009. Estimating reliability in proportional odds ratio models. Comput Stat Data An 53: 1495-1510.
  • Jones MC. 2009. Kumaraswamy's distribution: A beta-type distribution with some tractability advantages. Stat Methodol 6: 70-81.
  • Jørgensen B. 1982. Statistical Properties of the Generalized Inverse Gaussian Distribution. Springer-Verlag, New York.
  • Kumaraswamy P. 1980. A generalized probability density function for double-bounded random processes. J Hydrol 46: 79-88.
  • Lam KF and Leung TL. 2001. Marginal likelihood estimation for proportional odds models with right censored data. Lifetime Data Anal 7: 39-54.
  • Lemonte AJ. 2011. Improved point estimation for the Kumaraswamy distribution. J Stat Comput Sim 81: 1971-1982.
  • Marshall AW and Olkin I. 1997. A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Biometrika 84: 641-652.
  • Marshall AW and Olkin I. 2007. Life Distributions. Structure of Nonparametric, Semiparametric and Parametric Families. Springer, New York.
  • Nadarajah S. 2005. Exponentiated Pareto distributions. Statistics 39: 255-260.
  • Nanda AK and DAS S. 2012. Stochastic orders of the Marshall-Olkin extended distribution. Stat Probabil Lett 82: 295-302.
  • Nocedal J and Wright SJ. 1999. Numerical Optimization. Springer: New York.
  • Press WH, Teulosky SA, Vetterling WT and Flannery BP. 2007. Numerical Recipes in C: The Art of Scientific Computing: 3rd ed., Cambridge University Press.
  • Rao GS, Ghitany ME and Kantam RRL. 2009. Reliability test plans for Marshall-Olkin extended exponential distribution. Appl Math Sci 55: 2745-2755.
  • Ristić MM, Jose KK and Ancy J. 2007. A Marshall-Olkin gamma distribution and minification process. STARS: Stress and Anxiety Research Society 11: 107-117.
  • Song KS. 2001. Rényi information, loglikelihood and an intrinsic distribution measure. em J Stat Plan Infer 93: 51-69.
  • Zhang T and XIE M. 2007. Failure data analysis with extended Weibull distribution. Commun Stat Simulat 36: 579-592.

Publication Dates

  • Publication in this collection
    Jan-Mar 2013

History

  • Received
    2 July 2011
  • Accepted
    24 Apr 2012
Academia Brasileira de Ciências Rua Anfilófio de Carvalho, 29, 3º andar, 20030-060 Rio de Janeiro RJ Brasil, Tel: +55 21 3907-8100 - Rio de Janeiro - RJ - Brazil
E-mail: aabc@abc.org.br