A new class of distributions as a finite functional mixture using functional weights

In this paper, we introduce a new family of distributions whose probability density function is defined as a weighted sum of two probability density functions; one is defined as a warped version of the other. We focus our attention on a special case based on the exponential distribution with three parameters, a dilation transformation and a weight with polynomial decay, leading to a new life-time distribution. The explicit expressions of the moments generating function, moments and quantile function of the proposed distribution are provided. For estimating the parameters, the method of maximum likelihood estimation is used. Two applications with practical data sets are given.


INTRODUCTION
The mixture distributions arise in a wide variety of applications, including children's heights distribution, discussed by Everitt & Hand (1981), and plasma concentration of Beta-Carotene given in Schlattmann (2009). Also, a natural application of mixture distributions is in the modelling of heterogeneous data where each component of the mixture distribution corresponds to a cluster of the data. Since the mixture distributions have the potential to model a wide variety of random phenomena, they have received increasing attention in the literature and have been explored by many researchers in various contexts, see McLachlan & Peel (2000). The k-component mixture distribution is defined via the following probability density function (pdf): where p i ∈ [0, 1], k i=1 p i = 1, and f i (x) is the pdf of the ith cluster of population. The mixture distribution in (1) is expressed as a weighted sum of pdfs and shows enough flexibility in modeling heterogeneous data having multiple modes, see Elmahdy & Aboutahoun (2013), Elmahdy (2017), Frühwirth-Schnatter (2006), Seidel (2010) and the monographs cited above. On the other side, a given distribution can be generalized by employing a composite function. To be more specific, let f (x) be a pdf with support on (a, b), G(x) be increasing function on (a, b) with lim one can show that g(x)f (G(x)) is also a pdf based on the warping of f (x). For instance, when f (x) is the pdf of an exponential distribution, then g(x)f (G(x)) is the pdf of the Weibull distribution for a polynomial term G(x). Further developments and examples can be found in, e.g., AL-Hussaini (2012), Alzaatreh et al. (2013Alzaatreh et al. ( , 2012, Sharma et al. (2017), and the references therein. Combining the two approaches mentioned above, a natural way to increase the flexibility of f (x) and g(x)f (G(x)) is to consider a mixture of these two pdfs defined by h(x) = pf (x) + (1 -p)g(x)f (G(x)) with p ∈ [0, 1]. The parameter p operates a compromise between f (x) and g(x)f (G(x)), with h(x) = f (x) if p = 1 and h(x) = g(x)f (G(x)) if p = 0. It is important to note that the proportion p is assumed to be a fixed constant regardless of the support of the random variable, although this can seem impractical in certain cases. Keeping this in mind, we introduce a new generator of distributions which generalizes the finite mixture of the pdfs f (x) and g(x)f (G(x)) by introducing a Lebesgue measurable and monotonic function w(x) with w(x) ∈ [0, 1] for any x ∈ (a, b). The new family of distributions is characterized by the following pdf: (further details are given in Proposition 1 below). Thus, h(x) can be viewed as two components "functional mixture" of f (x) and g(x)f (G(x)) with the "functional weights" {w(x), 1 -w(G(x))}. It provides a compromise between f (x) and g(x)f (G(x)), using a monotonic weight function which depends on the variations of x. The introduction of such a functional weight in a finite mixture of distributions is also motivated by the weighted distributions' utility in efficient modeling and prediction from data, see Saghir et al. (2017), and the references therein. The rest of the paper is organized as follows. In the second section (The proposed family of distributions and some of its properties) presents the fundamentals of our proposed family of distributions. Some special cases are discussed. We also stress the significance of the family, with a highlight on existing connections with other well-known families of distributions. Expressions of the ordinary moments are derived. The third section (The FWE distribution and its properties) is devoted to a special case, providing a new lifetime distribution with three parameters. It is based on the exponential distribution for f (x), a dilation transformation for G(x), and a weight with polynomial decay for w(x). The moments of this distribution are also provided. The maximum likelihood estimations of the parameters are considered in the forth section (Applications), and two real life applications are presented to demonstrate the applicability of the distribution. A concluding remark is given in the last section (Concluding remarks).

THE PROPOSED FAMILY OF DISTRIBUTIONS AND SOME OF ITS PROPERTIES
In this section, we formally define the proposed family of distributions along with some particular cases. Expressions of ordinary moments are also investigated in the general case.

Construction of the family
The pdf of the family is defined in the next proposition.
with w(x) ∈ [0, 1] for any x ∈ (a, b). Then the following function is a pdf: (3) Proof of Proposition 1. Using the assumptions on f (x), w(x) and G(x), and after some standard analytical arguments, we can show that h(x) ≥ 0. Now, we need to show that b a h(x)dx = 1. By using the change of variables for y This completes the proof. Table I lists some special cases of pdfs as presented in (3) with various choices of f (x), w(x) and G(x). It is important to note there that the pdfs that are given in Table I are new to the statistics literature. The particular distribution based on exponential density with w(x) = 1 1+αx 2 and G(x) = x σ will be discussed in detail in the next section (The FWE distribution and its properties). Readers may explore other cases for their properties and potential applications in future studies. The motivation for the choice of this distribution will be clear later.
It is important to note that the pdf h(x) is very flexible and can be expressed as a sum of the functions of different natures/shapes. Due to the complex structure, the cumulative distribution function (cdf) associated to h(x) does not necessarily have a closed form. However, if we take w(x) = F(x), the cdf of the family has a very nice closed form expression and a probabilistic interpretation. The associated pdf and cdf are given by and DALAL LALA BOUALI et al.

A NEW CLASS OF DISTRIBUTIONS USING FUNCTIONAL MIXTURE
respectively. The random variable associated with (4) can be interpreted as the following random variable: where ε is a random variable following the Bernoulli distribution with parameter 1 2 , X and Y are independent and identically distributed random variables with common pdf f (x), and G -1 (x) denotes the inverse/quantile function of G(x). Therefore, the proposed family of distributions includes finite mixtures of sup and inf of random variables.
Another point is to show the possible connection between the proposed family and skewed distributions.
The proposed family of distributions can be used to generate another skewed family of distributions investigated by Huang & Chen (2007). It can be defined as follows. Let k(x) be a pdf symmetric around 0 and m( , we can derive a new skewed family of distributions; for a given k(x), we get Based on this idea, it is possible to develop new skewed families of distributions.

Some properties of the family
We now present the ordinary moments of the proposed family of distributions specified by (3). Let X be a random variable with pdf h(x), defined by (3), and Y be a random variable with pdf f (x). The rth non-central moment of X is Upon rearranging this equality, an alternative formula is (2021) 93(2)  In particular, the mean of X is given by

An Acad Bras Cienc
The moments generating function is One can verify that μ r = M (k) (t) | t=0 . Using the same mathematical arguments, the rth non-central conditional moment of X is given by, for t ∈ (a, b), The mean deviation about the median M can be written as All the expectations above can be calculated or approximated for specific functions f (x), w(x) and G(x).
In the next section, we focus on a submodel of the family with three parameters based on the exponential distribution, a dilation transformation and a weight with polynomial decay, called the functional weighted exponential distribution.

THE FWE DISTRIBUTION AND ITS PROPERTIES
In this section, we consider a special submodel of the proposed family based on the exponential distribution and discuss some of its properties.

Definition
We now consider the exponential pdf We call the distribution with pdf (5) the functional weighted exponential distribution, FWE for short. It is of interest because of the compromise made between the functions with exponential decay f (x) and g(x)f (G(x)), and a function with polynomial decay w(x). This ensures greater flexibility in terms of the rates of decay, which is an advantage for modeling a wide variety of lifetime data. Also, note that the FWE distribution is reduced to the exponential distribution when α = 0 or σ = 1. Thus the proposed distribution can be considered as an extension of the exponential distribution. Figure 1 DALAL LALA BOUALI et al.

A NEW CLASS OF DISTRIBUTIONS USING FUNCTIONAL MIXTURE
shows the pdf plots of the FWE distribution for selected values of the parameters. The pdf of the FWE distribution takes decreasing and uni-modal shapes depending on the choices of the parameters. The first derivative of h(x) is and, when it exists, the mode of the distribution, x 0 , satisfies the following equation: see the role of the parameters in the curvature of the pdf around x = 0, mainly for the polynomial term x 2 . Also, we have lim From a probabilistic point of view, the FWE distribution comes from the simple stochastic representation, Y = S X X, where X is a random variable with pdf f (x), σ = 1, and S X is a random variable such that DALAL LALA BOUALI et al.

We can observe that h(x) is a weighted exponential distribution function since it can be written as
Note that W(0) = 1 and lim x→∞ W(x) = 0 if σ < 1, lim x→∞ W(x) = 1 if σ = 1 and lim x→∞ W(x) = ∞ if σ > 1. For more information on weighted distributions see Saghir et al. (2017). The practical aspects of the FWE distribution are studied in the applications section.

Moments of the FWE distribution
Let X be a random variable with pdf h(x), defined by (5). Then the rth non-central moment of X is given by Using the expression 1 1+α(x/θ) 2 = 1 α(x/θ) 2 1 1+(α(x/θ) 2 ) -1 and geometric series, we obtain x r+2i e -x + α θ 2 σ r x r+2i+2 e -x dx where γ(a, x) = x 0 s a-1 e -s ds, a, x > 0 and Γ(a, x) = ∞ x s a-1 e -s ds, a ∈ R, x > 0, are the upper and lower incomplete gamma functions, respectively. In particular, the mean of X is given by The variance of X can be obtained as V(X) = μ 2 -(μ 1 ) 2 .

A NEW CLASS OF DISTRIBUTIONS USING FUNCTIONAL MIXTURE
Using similar mathematical arguments, the moment generating function can be expressed as, for t < θ/ max(1, σ), The mean deviation about the median M is given by The integrals appeared above can be written in terms of sums as done in (8)

Using the series expansion E
The expansions of the expectations can be done via similar mathematical arguments used for the moments.

Weighted Weibull (WtW) distribution by Shahbaz et al. (2010) with pdf
Extended generalized gamma (EGG) distribution used by Lee & Wang (2013) with pdf The EGG distribution is reduced to the Weibull distribution for α = 1.
The fitting results are compared using two practical data sets related to reliability and survival analysis. Data sets are discussed in the following subsections.
with H -1 (u) = F -1 (u) 0 [1 -F(y)]dy, u ∈ (0, 1), and its empirical version is where r = 1, 2, . . . , n and x i:n , i = 1, 2, . . . , n, represent the order statistics of the sample. It has been shown that the scaled TTT transform is convex (concave) if the hazard rate is decreasing (increasing), and for bathtub (unimodal) hazard rates, the scaled TTT transform is first convex (concave) and then concave (convex). Figure 2 indicates that the failure times data set has an increasing hazard rate.  The MLEs of the distribution parameters along with their standard errors (SEs) are shown in Table  III for this data set. From this table, it is clear that the SEs corresponding to the estimates of parameters of the FWE distribution are the smallest among the others.
We now apply formal goodness-of-fit tests in order to verify which distribution fits better the given data set. We consider Akaike Information Criterion (AIC = 2p -2 ln( )), Bayesian Information Criterion (BIC = p ln(n) -2 ln( )), -ln( ) and Kolmogorov-Smirnov (K-S) statistic along with p-value as goodness-of-fit criterion, where ln( ) is the value of the likelihood function evaluated at the parameter estimates, n is the number of observations, and p is the number of estimated parameters. For a given data set, the smaller AIC or BIC indicates a better fit. These statistics are computed using MLEs of the parameters based on the data set and presented in Table IV. We can note from this table that the FWE distribution has smaller values of AIC, BIC and KS statistics, among others. Therefore, we can conclude that FWE distribution fits better than the considered distributions for the given set of data. The Probability-Probability (PP) plots of the distributions are given in Figure 3 for failure times data set. Figures 4 and 5 show the fitted pdf and cdf of the FWE distribution for this data set, respectively. The fitted and empirical estimates are extremely close. These figures indicate that the FWE distribution can provide good estimates of the probabilities associated with lifetimes of fatigue fracture of Kevlar 373/epoxy, e.g., q = P (2 < X < 3) = 0.184, and its estimate isq = 0.176. Table III. The MLEs and SEs of the parameters of the distributions for failure times data.

Model
Estimate (

Survival times data
In this subsection, we present the modelling of survival times of 33 patients who died from acute myelogenous leukaemia. The survival times are noted in weeks. The data set is obtained from Feigl & Zelen (1965) and is also available in "MASS" package of R software. The frequency distribution of the data is heavy tailed and right skewed, see Table II. From Figure 2, we can see that the TTT plot for failure times data set first convex and then concave, which means the data set has a bathtub shaped hazard rate.
We compute the MLEs, along with respective SEs, of the parameters of all distributions for survival times data. They are presented in Table V. For each distribution, the log-likelihood, AIC, BIC, KS and p-values are obtained using the MLEs. They are shown in Table VI. From this table, we see that the FWE distribution has the smallest AIC, BIC and KS values over all other distributions. The p-value of the KS test statistic is maximum for the FWE distribution. Therefore, we can conclude that the FWE distribution is a better model for modelling survival times than the EG, EM, EGG WtW and MW distributions. PP plots of the distributions are given in Figure 6 for the survival times data set. Figure 7 shows the fitted pdf of the FWE distribution for the given data set. Figure 8 shows the fitted cdf of the FWE distribution. Since the fitted and empirical estimates are very close to each other, we can say that the FWE distribution fits well with this frequency distribution.

CONCLUDING REMARKS
We introduce and study a new family of distributions based on a finite functional mixture using functional weights. Some mathematical properties of the new family are investigated. A special case based on the polynomial weights and the exponential distribution, called the functional weighted exponential (FWE) distribution, is studied in detail. The estimates of the unknown parameters of the FWE distribution are obtained using the maximum likelihood method. The usefulness of the proposed submodel, FWE, is demonstrated via two real life data sets.