Processing math: 100%
Submit manuscript...
eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Research Article Volume 6 Issue 5

On two-parameter akash distribution

Rama Shanker, Kamlesh Kumar Shukla

Department of Statistics, Eritrea Institute of Technology, Eritrea

Correspondence: Rama Shanker, Department of Statistics, Eritrea Institute of Technology, Asmara, Eritrea

Received: November 03, 2017 | Published: November 22, 2017

Citation: Shanker R, Shukla KK. On two-parameter akash distribution. Biom Biostat Int J. 2017;6(5):416-425. DOI: 10.15406/bbij.2017.06.00178

Download PDF

Abstract

In this paper a two-parameter Akash distribution (TPAD), of which one parameter Akash distribution of Shanker1 is a particular case, has been introduced. Its mathematical and statistical properties including its shapes, moments, skewness, kurtosis, hazard rate function, mean residual life function, stochastic ordering, mean deviations, order statistics, Renyi entropy, Bonferroni and Lorenz curves and stress-strength reliability has been discussed. The estimation of its parameters has been discussed using method of moments and maximum likelihood estimation. A real lifetime data has been presented to test the goodness of fit of TPAD over exponential, Akash and Lognormal distributions.

Keywords: akash distribution, moments and associated measures, reliability measures, stochastic ordering, order statistics, renyi entropy measure, mean deviations, bonferroni and lorenz curves, estimation of parameters, goodness of fit

Introduction

Shanker1 proposed a one-parameter lifetime distribution, known as Akash distribution, defined by its probability density function (pdf) and cumulative distribution function (cdf)

  f1(x;θ)=θ3θ2+2(1+x2)eθx;     x>0, θ>0  (1.1)
F1(x;θ)=1[1+θx(θx+2)θ2+2]eθx;x>0,θ>0  (1.2)

Akash distribution is a convex combination of exponential (θ) and gamma (3,θ) distributions with their mixing proportions θ2θ2+2 and 1θ2+2  respectively. Shanker1 has discussed its various mathematical and statistical properties and showed that in many ways (1.1) provides a better model for modeling lifetime data from medical science and engineering than Lindley2 and exponential distributions.

The first four moments about origin of Akash distribution obtained by Shanker1 are given by
μ1=θ2+6θ(θ2+2) , μ2=2(θ2+12)θ2(θ2+2) , μ3=6(θ2+20)θ3(θ2+2) , μ4=24(θ2+30)θ4(θ2+2)

The central moments of Akash distribution obtained by Shanker1 are given by

μ2=θ4+16θ2+12θ2(θ2+2)2
μ3=2(θ6+30θ4+36θ2+24)θ3(θ2+2)3
μ4=3(3θ8+128θ6+408θ4+576θ2+240)θ4(θ2+2)4

Shanker3 obtained a Poisson- Akash distribution (PAD), a Poisson mixture of Akash distribution, and discussed its various statistical and mathematical properties along with estimation of parameter and applications for count data from different fields of knowledge. Shanker4,5 has also introduced size-biased and zero-truncated version of PAD and studied their properties, estimation of parameter using both the method of moments and the maximum likelihood estimation and applications for count datasets which structurally excludes zero counts. Shanker6 has introduced a quasi Akash distribution for modeling lifetime data and discussed its statistical properties, estimation of parameters using both the method of moments and the maximum likelihood estimation. Shanker and Shukla7 have obtained a weighted Akash distribution for modeling lifetime data and observed that it gives better fit than several one parameter and two-parameter lifetime distributions

Note that the pdf and the cdf of Lindley distribution introduced by Lindley2) are defined as

f2(x;θ)=θ2θ+1(1+x)eθx  (1.3)
F2(x;θ)=1[1+θxθ+1]eθx  (1.4)

Ghitany et al.8 studied Lindley distribution and discussed its various statistical and mathematical properties, estimation of parameter and application to model waiting time data in a Bank and showed that it gives better fit than exponential distribution. Shanker9 have detailed critical and comparative study on modeling of lifetime data using one parameter exponential and Lindley distribution and showed that there are many lifetime data where exponential distribution gives better fit than Lindley distribution. Further, Shanker10 have comparative study on lifetime data using one parameter Akash, Lindley and exponential distribution and showed that Akash distribution gives better fit in some of the datasets than both exponential and Lindley distributions.

In this paper, a two-parameter Akash distribution (TPAD) which includes one parameter Akash distribution of Shanker1 as a particular case has been suggested. Its shapes, moments and moments based properties have been derived and discussed. The hazard rate function, mean residual life function, stochastic ordering, mean deviations, order statistics, Renyi entropy measure, Bonferroni and Lorenz curves, stress-strength reliability of TPAD have been derived and discussed. The estimation of parameters has been discussed using both the maximum likelihood estimation and that of method of moments. Finally, goodness of fit of the proposed distribution has been discussed with a real lifetime dataset and the fit has been compared with some well known lifetime distributions.

A Two-parameter akash distribution

 A two-parameter Akash distribution (TPAD) with parameters θ  and α  is defined by its pdf and cdf

f3(x;θ,α)=θ3αθ2+2(α+x2)eθx;x>0,θ>0,α>0 ; (2.1)
F3(x;θ,α)=1[1+θx(θx+2)αθ2+2]eθx;x>0,θ>0,α>0 (2.2)

It can be easily verified that the Akash distribution defined in (1.1) is a particular case of TPAD (2.1) at α=1 . Like the pdf of Akash distribution, the pdf of TPAD is also a convex combination of exponential (θ) and gamma (3,θ) distributions. We have

f3(x;θ,α)=pg1(x,θ)+(1p)g2(x;3,θ)  (2.3)

where p=αθ2αθ2+2,g1(x,θ)=θeθx,g2(x)=θ3Γ(3)eθxx31

The graph of the pdf of TPAD has been drawn for varying values of the parameter and shown in figure 1. It is obvious that the pdf takes different shapes for varying values of the parameters. The graph of the cdf of TPAD has been shown for varying values of the parameters in figure 2.

Figure 1 pdf of TPAD for varying values of parameters θ and α.

Figure 2 cdf of TPAD for varying values of the parameters θ and α.

Moments associated measures

The r th moment about origin of TPAD can be obtained as

μr=r!{αθ2+(r+1)(r+2)}θr(αθ2+2); r=1,2,3,...  (3.1)

Taking r=1,2,3  and 4 in (3.1), the first four moments about origin of TPAD are obtained as

  μ1=αθ2+6θ(αθ2+2),μ2=2(αθ2+12)θ2(αθ2+2),  μ3=6(αθ2+20)θ3(αθ2+2),   μ4=24(αθ2+30)θ4(αθ2+2)  

Using the relationship between moments about origin and central moments, the central moments of TPAD are obtained as

μ2= α2θ4+16αθ2+12θ2(αθ2+2)2μ3= 2(α3θ6+30α2θ4+36αθ2+24)θ3(αθ2+2)3  μ4= 3(3α4θ8+128α3θ6+408α2θ4+576αθ2+240)θ4(αθ2+2)4     

It can be easily verified that at α=1 , these raw moments and central moments of TPAD (2.1) reduce to the corresponding moments of the Akash distribution.

The coefficients of variation (C.V), skewness (β1) , kurtosis (β2) and index of dispersion (γ) of TPAD are given by

C.V=σμ1=α2θ4+16αθ2+12αθ2+6β1=μ3μ23/2=2(α3θ6+30α2θ4+36αθ2+24)(α2θ4+16αθ2+12)3/2β2=μ4μ22=3(3α4θ8+128α3θ6+408α2θ4+576αθ2+240)(α2θ4+16αθ2+12)2    γ=σ2μ1α2θ4+16αθ2+12θ(αθ2+2)(αθ2+6)

Graphs of coefficients of variation (C.V), skewness (β1) , kurtosis (β2) and index of dispersion (γ) of TPAD for varying values of the parameters are shown in figure 3.

Figure 3 Graphs of Coefficient of variation, skewness, kurtosis and index of dispersion of TPAD for varying values of parameters θ and α.

Statistical and mathematical properties

Hazard rate and mean residual life functions

Let f(x) and F(x)  be the pdf and cdf of a continuous random variable X .The hazard rate function (also known as the failure rate function) h(x)  and the mean residual life function m(x) are respectively defined as

h(x)=limΔx0P(X<x+Δx|X>x)Δx=f(x)1F(x)  (4.1.1)
and  m(x)=E[Xx|X>x]=11F(x)x[1F(t)]dt  (4.1.2)

The corresponding h(x) and m(x) of TPAD are thus obtained as

h(x)=θ3(α+x2)θx(θx+2)+(αθ2+2)  (4.1.3)
And m(x)=1[θx(θx+2)+(αθ2+2)]eθxx[θt(θt+2)+(αθ2+2)]eθtdt
=θ2x2+4θx+(αθ2+6)θ[θx(θx+2)+(αθ2+2)] (4.1.4)

It can be easily verified that h(0)=αθ3αθ2+2=f(0) and m(0)=αθ2+6θ(αθ2+2)=μ1 . Graphs of h(x)  and m(x)  of TPAD for varying values of parameters are shown in figure 4 and figure 5.

Figure 4 Hazard rate function of TPAD for varying values of parameters θ and α.

Figure 5 Mean residual life function of TPAD for varying values of parameters θ and α.

Stochastic ordering

Stochastic ordering of positive continuous random variables is an important tool for judging the comparative behavior. A random variable X is said to be smaller than a random variable Y in the

  1. stochastic order (XstY) if FX(x)FY(x) for all x
  2. hazard rate order (XhrY) if hX(x)hY(x)  for all x
  3. mean residual life order (XmrlY) if mX(x)mY(x) for all x
  4. likelihood ratio order (XlrY) if fX(x)fY(x)  decreases in x .

The following results due to Shaked and Shanthikumar11 are well known for establishing stochastic ordering of distributions

XlrYXhrYXmrlY
XstY

The TPAD is ordered with respect to the strongest ‘likelihood ratio’ ordering as shown in the following theorem:
Theorem: Let X  TPAD (θ1,α1)  and Y  TPAD (θ2,α2) . If α1=α2  and θ1θ2 (or if θ1=θ2  and α1α2 ), then XlrY and hence XhrY , XmrlY and XstY .

Proof: We have
fX(x;θ1,α1)fY(x;θ2,α2)=θ13(α2θ22+2)θ23(α1θ12+2)(α1+x2α2+x2)e(θ1θ2)x;x>0  
Now
lnfX(x;θ1,α1)fY(x;θ2,α2)=ln[θ13(α2θ22+2)θ23(α1θ12+2)]+ln(α1+x2α2+x2)(θ1θ2)x .
Thus
ddx{lnfX(x;θ1,α1)fY(x;θ2,α2)}=2(α1α2)x(α1+x2)(α2+x2)(θ1θ2)  

Case (i) If α1=α2  and θ1θ2 , then ddx{lnfX(x;θ1,α1)fY(x;θ2,α2)}<0 . This means that XlrY and hence XhrY , XmrlY and XstY .

Case (ii) If θ1=θ2 and α1α2 , then ddx{lnfX(x;θ1,α1)fY(x;θ2,α2)}<0 . This means that XlrY and hence XhrY , XmrlY and XstY .

This theorem shows the flexibility of TPAD over Akash and exponential distributions.

Distribution of order statistics

Let X1,X2,...,Xn  be a random sample of size n from TPAD. Let X(1)<X(2)<...<X(n) denote the corresponding order statistics. The pdf and the cdf of the k th order statistic, say Y=X(k) are given by

fY(y)=n!(k1)!(nk)!Fk1(y){1F(y)}nkf(y)
=n!(k1)!(nk)!nkl=0(nkl)(1)lFk+l1(y)f(y)
and
FY(y)=nj=k(nj)Fj(y){1F(y)}nj
=nj=knjl=0(nj)(njl)(1)lFj+l(y) ,

respectively, for k=1,2,3,...,n .

  Thus, the pdf and the cdf of k th order statistics of TPAD are obtained as
fY(y)=n!θ3(α+x2)eθx(αθ2+2)(k1)!(nk)!nkl=0(nkl)(1)l×[1θx(θx+2)+(αθ2+2)αθ2+2eθx]k+l1

 and
FY(y)=nj=knjl=0(nj)(njl)(1)l[1θx(θx+2)+(αθ2+2)αθ2+2eθx]j+l

Renyi Entropy measure

An entropy of a random variable x is a measure of variation of uncertainty. A popular entropy measure is Renyi entropy.12 If x is a continuous random variable having probability density function f(.) , then Renyi entropy is defined as

TR(γ)=11γlog{fγ(x)dx}
where γ>0andγ1 .

Thus, the Renyi entropy of TPAD (2.1) can be obtained as

  TR(γ)=11γlog[0θ3γ(αθ2+2)γ(α+x2)γeθγxdx]
=11γlog[0θ3γαγ(αθ2+2)γ(1+x2α)γeθγxdx]
=11γlog[0θ3γαγ(αθ2+2)γj=0(γj)(x2α)jeθγxdx]
  =11γlog[j=0(γj)θ3γαγj(αθ2+2)γ0eθγxx2j+11dx]
=11γlog[j=0(γj)θ3γαγj(αθ2+2)γΓ(2j+1)(θγ)2j+1]
=11γlog[j=0(γj)θ3γ2j1αγj(αθ2+2)γΓ(2j+1)(γ)2j+1] .

Mean deviations

The amount of scatter in a population is measured to a certain extent by the totality of deviations usually from mean and median. These are known as the mean deviation about the mean and the mean deviation about the median defined by
δ1(X)=<0|xμ|f(x)dx  and δ2(X)=0|xM|f(x)dx , respectively, where μ=E(X)  and M=Median (X) . The measures δ1(X)  and δ2(X) can be calculated using the following simplified relationships

δ1(X)=μ0(μx)f(x)dx+μ(xμ)f(x)dx
=μF(μ)μ0xf(x)dxμ[1F(μ)]+μxf(x)dx
=2μF(μ)2μ+2μxf(x)dx
=2μF(μ)2μ0xf(x)dx  (4.5.1)
and
δ2(X)=M0(Mx)f(x)dx+<M(xM)f(x)dx
=MF(M)M0xf(x)dxM[1F(M)]+Mxf(x)dx
=μ+2Mxf(x)dx
=μ2M0xf(x)dx  (4.5.2)

Using pdf (2.1) and expression for the mean of TPAD, we get

μ0xf(x)dx=μ{θ3(μ3+αμ)+θ2(3μ2+α)+6(θμ+1)}eθμθ(αθ2+2) (4.5.3)
M0xf(x)dx=μ{θ3(M3+αM)+θ2(3M2+α)+6(θM+1)}eθMθ(αθ2+2) (4.5.4)

Using expressions from (4.5.1), (4.5.2), (4.5.3), and (4.5.4), the mean deviation about mean, δ1(X)  and the mean deviation about median, δ2(X)  of TPAD are finally obtained as

δ1(X)=2{θ2(μ2+α)+2(2θμ+3)}eθμθ(αθ2+2)  (4.5.5)
δ2(X)=2{θ3(M3+αM)+θ2(3M2+α)+6(θM+1)}eθMθ(αθ2+2)μ  (4.5.6)

 Bonferroni and lorenz curves

Bonferroni and Lorenz curves introduced by Bonferroni13 and Bonferroni and Gini indices have applications not only in economics to study income and poverty, but also in other fields like reliability, demography, insurance and medicine. The Bonferroni and Lorenz curves are defined as

B(p)=1pμq0xf(x)dx=1pμ[0xf(x)dxqxf(x)dx]=1pμ[μqxf(x)dx]  (4.6.1)
and L(p)=1μq0xf(x)dx=1μ[0xf(x)dxqxf(x)dx]=1μ[μqxf(x)dx] (4.6.2)

respectively or equivalently

B(p)=1pμp0F1(x)dx (4.6.3)
and L(p)=1μp0F1(x)dx  (4.6.4)

respectively, where μ=E(X)  and q=F1(p) .

The Bonferroni and Gini indices are thus defined as

B=110B(p)dp  (4.6.5)
and G=1210L(p)dp  (4.6.6)

respectively.

 Using pdf of TPAD, we get

,qxf(x)dx={θ3(q3+αq)+θ2(3q2+α)+6(θq+1)}eθqθ(αθ2+2)  (4.6.7)

Now using equation (4.6.7) in (4.6.1) and (4.6.2), we get

B(p)=1p[1{θ3(q3+αq)+θ2(3q2+α)+6(θq+1)}eθqαθ2+6]  (4.6.8)
and L(p)=1{θ3(q3+αq)+θ2(3q2+α)+6(θq+1)}eθqαθ2+6 (4.6.9)

Now using equations (4.6.8) and (4.6.9) in (4.6.5) and (4.6.6), the Bonferroni and Gini indices of TPAD are thus obtained as

B=1{θ3(q3+αq)+θ2(3q2+α)+6(θq+1)}eθqαθ2+6  (4.6.10)

G=2{θ3(q3+αq)+θ2(3q2+α)+6(θq+1)}eθqαθ2+61  (4.6.11)

Stress-strength reliability

The stress- strength reliability describes the life of a component which has random strength X that is subjected to a random stress Y . When the stress applied to it exceeds the strength, the component fails instantly and the component will function satisfactorily till X>Y . Therefore, R=P(Y<X) is a measure of component reliability and in statistical literature it is known as stress-strength parameter. It has wide applications in almost all areas of knowledge especially in engineering such as structures, deterioration of rocket motors, static fatigue of ceramic components, aging of concrete pressure vessels etc.

Let X and Y be independent strength and stress random variables having TPAD with parameter (θ1,α1)  and (θ2,α2)  respectively. Then the stress-strength reliability R can be obtained as

R=P(Y<X)=0P(Y<X|X=x)fX(x)dx
=0f(x;θ1,α1)F(x;θ2,α2)dx
=1θ13[α1α2θ26+4α1α2θ1θ25+2(3α1α2θ12+3α1+α2)θ24+2(2α1α2θ12+9α1+2α2)θ1θ23+(α1α2θ14+20α1θ12+2α2θ12+40)θ22+10(α1θ12+2)+2(α1θ12+2)θ12](α1θ12+2)(α2θ22+2)(θ1+θ2)5 .

It can be easily verified that at α1=α2=1 , the above expression reduces to the corresponding expression for Akash distribution of Shanker.1

Estimation of parameters

Estimates from moments

Since the TPAD has two parameters to be estimated, the first two moments about origin are required to estimate its parameters. Using the first two moments about origin of TPAD, we have

μ2(μ1)2=k(Say)=2(αθ2+12)(αθ2+2)(αθ2+6)2  (5.1.1)

Assuming αθ2=β  in (5.2.1), we get a quadratic equation in β as
(2k)β2+4(73k)β+12(43k)=0  (5.1.2)

It should be noted that for real values of b , k2.083 . Replacing μ1 and μ2  by their respective sample moments in (5.1.1), an estimate of k can be obtained and substituting the value of k in equation (5.1.2), value of β  can be obtained. Again taking αθ2=β in the expression for the mean of TPAD, we get the method of moment estimate (MOME) ˜θ of θ as ˜θ=β+6(β+2)ˉx  and thus the MOME ˜α=βθ2=β(β+2)2(ˉx)2(β+6)2 .

 Maximum likelihood estimates

Let (x1,x2,x3,...,xn)  be a random sample from TPAD .The likelihood function, L of TPAD is given by
L=(θ3αθ2+2)nni=1(α+xi2)enθˉx

The natural log likelihood function is thus obtained as
lnL=nln(θ3αθ2+2)+ni=1ln(α+xi2)nθˉx

The maximum likelihood estimates (MLE) ˆθ  and ˆα  of θ  and α  are then the solutions of the following non-linear equations
dlnLdθ=3nθ2nαθαθ2+2nˉx=0  

dlnLdα=2θ2αθ2+2+ni=11α+xi2=0

where ˉx is the sample mean.

These two natural log likelihood equations do not seem to be solved directly because these cannot be expressed in closed forms. However, the Fisher’s scoring method can be applied to solve these equations. We have

2lnLθ2=3nθ2+2nα(αθ22)(αθ2+2)22lnLα2=nθ4(αθ2+2)2ni=11(α+xi2)22lnLθα=4nθ(αθ2+2)2The following equations can be solved for MLEs ˆθ and ˆα  ofθ and  of TPAD                         [2lnLθ22lnLθα2lnLθα2lnLα2]ˆθ=θ0ˆα=α0[ˆθθ0ˆαα0]=[lnLθlnLα]ˆθ=θ0ˆα=α0  

where θ0 and α0 are the initial values of ˆθ and ˆα , respectively. These equations are solved iteratively till sufficiently close values of ˆθ  and ˆα  are obtained. The initial values of the parameters are the values given by MOME.

Data analysis

The following data set represents the failure times (in minutes) for a sample of 15 electronic components in an accelerated life test given on page 204 of Lawless.14

1.4          5.1          6.3          10.8        12.1        18.5        19.7        22.2        23.0        30.6        37.3        46.3     53.9       59.8        66.2

For this data set, TPAD has been fitted along with one parameter exponential and Akash distributions and two-parameter Lognormal distribution introduced by Pearce.15 The ML estimates, values of 2lnL and K-S statistics of the fitted distributions are presented in table 1. Recall that the best distribution corresponds to the lower values of 2lnL and K-S.

Distribution

ML estimates

−2lnL

K-S statistics

TPAD

0.096

37.847

128.91

0.138

Lognormal

2.931

1.061

131.234

0.161

Akash

0.108

133.68

0.184

Exponential

0.036

129.47

0.156

Table 1 MLE’s, -2ln L and K-S Statistics of the fitted distributions

It can be easily seen from above table that the TPAD gives better fit than all the considered distributions and hence it can be considered as an important two-parameter lifetime distribution for modeling lifetime data.

Conclusions

A two-parameter Akash distribution (TPAD), of which one parameter Akash distribution of Shanker1 is a particular case, has been suggested and investigated. Its mathematical properties including moments, coefficient of variation, skewness, kurtosis, index of dispersion, hazard rate function, mean residual life function, stochastic ordering, mean deviations, order statistics, Bonferroni and Lorenz curves, Renyi entropy measure and stress-strength reliability have been discussed. For estimating its parameters, the method of moments and the method of maximum likelihood estimation have been discussed. Finally, a numerical example of real lifetime dataset has been presented to test the goodness of fit of TPAD over exponential, Akash and Lognormal distributions. It is obvious that TPAD gives a better fir over these distributions.

Acknowledgments

None.

Conflicts of interest

None.

References

  1. Shanker R. Akash distribution and Its Applications. International Journal of Probability and Statistics. 2015;4(3):65–75.
  2. Lindley DV. Fiducial distributions and Bayes’ theorem. Journal of the Royal Statistical Society, Series B. 1958;20(1):102–107.
  3. Shanker R. The Discrete Poisson-Akash Distribution. International Journal of Probability and Statistics. 2017;6(1):1–10.
  4. Shanker R. Size-biased Poisson-Akash distribution and its Applications. To appear in, International Journal of Statistics and Applications. 2017.
  5. Shanker R. Zero-truncated Poisson-Akash distribution and its Applications. To appear in, American Journal of Mathematics and Statistics. 2017.
  6. Shanker R. A quasi Akash distribution. Assam Statistical Review. 2016;30(1):135–160.
  7. Shanker R, Shukla KK. Weighted Akash distribution and its Application to model lifetime data. 2016;39(2):1138–1147.
  8. Ghitany ME, Atieh B, Nadarajah S. Lindley distribution and its Application. Mathematics Computing and Simulation. 2008;78(4):493–506.
  9. Shanker R, Hagos F, Sujatha S. On modeling of Lifetimes data using exponential and Lindley distributions. Biometrics & Biostatistics International Journal. 2015;2(5):1–9.
  10. Shanker R, Hagos F, Sujatha S. On modeling of Lifetimes Data using one parameter Akash, Lindley and exponential distributions. Biometrics & Biostatistics international Journal. 2016;3(2):1–10.
  11. Shaked M, Shanthikumar JG. Stochastic Orders and Their Applications. New York, USA: Academic Press; 1994.
  12. Renyi A. On measures of entropy and information. In proceedings of the 4th Berkeley symposium on Mathematical Statistics and Probability. Berkeley, USA: University of California press; 1961:547–561.
  13. Bonferroni CE. Elementi di Statistca generale, Seeber, Firenze. 1930.
  14. Lawless JF. Statistical Models and Methods for Lifetime data. 2nd ed. New York, USA: John Wiley and Sons; 2003.
  15. Pearce S. Lognormal distribution. Nature. 1945;156:747.
Creative Commons Attribution License

©2017 Shanker, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.