Submit manuscript...
eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Research Article Volume 7 Issue 1

A generalized akash distribution

Rama Shanker, Kamlesh Kumar Shukla, Ravi Shanker, Ajay Pratap

Department of Statistics, College of Science, Eritrea Institute of Technology, Eritrea

Correspondence: Rama Shanker, Department of Statistics, College of Science, Eritrea Institute of Technology, Asmara, Eritrea

Received: November 27, 2017 | Published: January 19, 2018

Citation: Shanker R, Shukla KK, Shanker R, et al. A generalized akash distribution. Biom Biostat Int J. 2018;7(1):18-26. DOI: 10.15406/bbij.2018.07.00187

Download PDF

Abstract

In this paper a generalized Akash distribution (GAD) which includes Akash distribution of Shanker1 and exponential distribution as particular cases has been introduced and studied. Its mathematical and statistical properties including its shapes for varying values of parameters, moments based measures, hazard rate function, mean residual life function, stochastic ordering, mean deviations, order statistics, Renyi entropy measure, Bonferroni and Lorenz curves and stress-strength reliability has been studied. The estimation of its parameters has been discussed using both the method of moments and the maximum likelihood estimation. Goodness of fit of GAD has been explained through a real lifetime data and the fit has been compared with other lifetime distributions.

Keywords: akash distribution, moments and associated measures, reliability measures, stochastic ordering, order statistics, renyi entropy measure, mean deviations, bonferroni and lorenz curves, estimation of parameters, goodness of fit

Introduction

Shanker1 introduced a one-parameter lifetime distribution, known as Akash distribution, defined by its probability density function (pdf) and cumulative distribution function (cdf)

f1x;θ  =  θ3θ2+2  1+x2  eθx  ;     x>0,    θ>0   (1.1)

F1x;θ=11+θxθx+2θ2+2eθx      ;x>0,θ>0   (1.2)

This distribution is a two-component mixture of exponential θ and gamma 3 , θ distributions with their mixing proportions θ 2 θ 2 + 2 and 1 θ 2 + 2 respectively. Shanker1 has discussed its various mathematical and statistical properties and showed that in many ways (1.1) provides a better model for modeling lifetime data from medical science and engineering than Lindley2 and exponential distributions. Shanker et al.3,4 have detailed comparative and critical study on Akash, Lindley and exponential distributions for modeling lifetime data from biomedical science and engineering and observed that Akash distribution provides better fit than both Lindley and exponential distributions in almost all lifetime datasets.

The first four moments about origin of Akash distribution obtained by Shanker1 are given by

μ1'=θ2+6θθ2+2,μ2'=2θ2+12θ2θ2+2,μ3'=6θ2+20θ3θ2+2,μ4'=24θ2+30θ4θ2+2

The central moments of Akash distribution as obtained by Shanker1 are

μ2=θ4+16θ2+12θ2θ2+22

μ3=2θ6+30θ4+36θ2+24θ3θ2+23

μ4=33θ8+128θ6+408θ4+576θ2+240θ4θ2+24

Shanker5 obtained Poisson- Akash distribution (PAD), a Poisson mixture of Akash distribution, and discussed its various statistical and mathematical properties along with estimation of parameter and applications for counts data from different fields of knowledge. Shanker6,7 has also introduced size-biased and zero-truncated version of PAD and studied their statistical properties, estimation of parameter and applications for count datasets which structurally excludes zero counts.

In this paper, a generalized Akash distribution (GAD), of which the Akash distribution (1.1) of Shanker1 and exponential distribution are special cases, has been suggested. Its shapes for varying values of parameters, moments and moments based properties have been derived and discussed. The hazard rate function, mean residual life function, stochastic ordering, mean deviations, order statistics, Renyi entropy measure, Bonferroni and Lorenz curves, and stress-strength reliability of GAD have been derived and discussed. The estimation of parameters has been discussed using both the method of moments and the maximum likelihood estimation. Finally, goodness of fit of GAD has been discussed with a real lifetime dataset and the fit has been compared with some well known distributions.

A generalized akash distribution

 A two-parameter generalized Akash distribution (GAD) with parameters  θ and α is defined by its pdf

f2x;θ,α=θ3θ2+2α1+αx2eθx;x>0,θ>0,α>0   (2.1)

It can be easily verified that the Akash distribution defined in (1.1) and exponential distribution are particular cases of GAD (2.1) at α=1 and α=0 respectively. The pdf (2.1) can be shown as a two-component mixture of exponential θ and gamma 3,θ distributions. We have

f2x;θ,α=pg1x,θ+1pg2x;3,θ   (2.2)

where p=θ2θ2+2α,g1x,θ=θeθx and g2x=θ3Γ3eθxx31

The corresponding cdf of GAD (2.1) can be obtained as          

F2x;θ,α=11+αθxθx+2θ2+2αeθx;x>0,θ>0,α>0   (2.3)

The graphs of pdf and cdf of GAD for varying values of parameters are shown in figures 1 and 2.

Figure 1 pdf of GAD for varying values of parameters θ and α.

Figure 2 cdf of GAD for varying values of parameters θ and α.

Statistical constants

The moment generating function (mgf) about origin of GAD (2.1) can be obtained as

MXt=θ3θ2+2α0eθtx1+αx2dx

=θ3θ2+2α1θt+2αθt3

=θ3θ2+2α1θk=0tθk+2αθ3k=0k+2ktθk

=k=0θ2+k+1k+2αθ2+2αtθk

The rth moment about origin of GAD (2.1) thus can be obtained as

μr'=r!θ2+r+1r+2αθrθ2+2α; r=1,2,3,...   (3.1)

Taking r=1,2,3 and 4 in (3.1), the first four moments about origin of GAD (2.1) are obtained as

μ1'=θ2+6αθθ2+2α,  μ2'=2θ2+12αθ2θ2+2α,   μ3'=6θ2+20αθ3θ2+2α,   μ4'=24θ2+30αθ4θ2+2α

Using the relationship between moments about origin and central moments, central moments of GAD (2.1) are obtained as

μ2= θ4+16θ2α+12α2θ2θ2+2α2

μ3= 2θ6+30θ4α+36θ2α2+24α3θ3θ2+2α3

μ4= 33θ8+128θ6α+408θ4α2+576θ2α3+240α4θ4θ2+2α4     

The coefficients of variation γ , skewness β1, kurtosis β2 and index of dispersion γ of GAD are thus obtained as

C.V=σμ1'=θ4+16θ2α+12α2θ2+6α

β1=μ3μ23/2=2θ6+30θ4α+36θ2α2+24α3θ4+16θ2α+12α23/2

β2=μ4μ22=33θ8+128θ6α+408θ4α2+576θ2α3+240α4θ4+16θ2α+12α22

γ=σ2μ1'θ4+16θ2α+12α2θθ2+2αθ2+6α

It can be easily verified that these statistical constants of GAD reduces to the corresponding statistical constants of Akash and exponential distributions at α=1 and α=0 respectively. Graphs of coefficient of variation, coefficient of skewness, coefficient of kurtosis and index of dispersion of GAD for varying values of parameters and have been drawn and presented in Figure 3.

Figure 3 Graphs of coefficient of variation, coefficient of skewness, coefficient of kurtosis and index of dispersion of GAD for varying values of parameters θ and α.

Statistical properties

Stochastic ordering

Stochastic ordering of positive continuous random variables is an important tool for judging the comparative behavior. A random variable X is said to be smaller than a random variable Y in the

  1. stochastic order XstY if FXxFYx for all x
  2. hazard rate order XhrY if hXxhYx for all x
  3. mean residual life order XmrlY if mXxmYx for all x
  4. likelihood ratio order XlrY if fXxfYx decreases inx.

The following results due to Shaked and Shanthikumar8 are well known for establishing stochastic ordering of distributions

XlrYXhrYXmrlY                            XstY

The GAD is ordered with respect to the strongest "likelihood ratio" ordering as established in the following theorem:

Theorem: Let X~ GAD θ1,α1  and Y~GAD θ2,α2. If α1=α2 and θ1θ2(or if θ1=θ2 and α1α2), then XlrY and hence XhrY, XmrlY and XstY.

P roof: We have

fXx;θ1,α1fYx;θ2,α2=θ13θ22+2α2θ23θ12+2α11+α1x21+α2x2eθ1θ2x;x>0

Now

ln f X x ; θ 1 , α 1 f Y x ; θ 2 , α 2 = ln θ 1 3 θ 2 2 + 2 α 2 θ 2 3 θ 1 2 + 2 α 1 + ln 1 + α 1 x 2 1 + α 2 x 2 θ 1 θ 2 x .

Thus

ddxlnfXx;θ1,α1fYx;θ2,α2=2α1α2x1+α1x21+α2x2θ1θ2

Case (i) If α1=α2 and θ1θ2, then ddxlnfXx;θ1,α1fYx;θ2,α2<0. This means that XlrY and hence XhrY, XmrlY and XstY.

Case (ii) If θ1=θ2 and α1α2, then ddxlnfXx;θ1,α1fYx;θ2,α2<0. This means that XlrY and hence XhrY, XmrlY and XstY.

This theorem shows the flexibility of GAD over Akash and exponential distributions.

Mean deviations

The amount of scatter in a population is generally measured to some extent by the totality of deviations usually from the mean and the median. These are known as the mean deviation about the mean and the mean deviation about the median defined by

δ1X=0xμfxdx and δ2X=0xMfxdx

respectively, where μ=EX and M=Median X. The measures δ1X and δ2X can be calculated using the following simplified relationships

δ1X=0μμxfxdx+μxμfxdx

=μFμ0μxfxdxμ1Fμ+μxfxdx

=2μFμ2μ+2μxfxdx

=2μFμ20μxfxdx    (4.2.1)

and

δ2X=0MMxfxdx+MxMfxdx

=MFM0MxfxdxM1FM+Mxfxdx

=μ+2Mxfxdx

=μ20Mxfxdx   (4.2.2)

Using pdf (2.1) and expression for the mean of GAD, we get

0μxfxdx=μθ3αμ3+μ+θ23αμ2+1+6αθμ+1eθμθθ2+2α   (4.2.3)

0Mxfxdx=μθ3αM3+M+θ23αM2+1+6αθM+1eθMθθ2+2α   (4.24)

Using expressions from (4.2.1), (4.2.2), (4.2.3), and (4.2.4), the mean deviation about mean, δ1X and the mean deviation about median, δ2X of GAD are finally obtained as

δ1X=2θ2αμ2+1+2α2θμ+3eθμθθ2+2α   (4.2.5)

δ2X=2θ3αM3+M+θ23αM2+1+6αθM+1eθMθθ2+2αμ   (4.26)

Distribution of order statistics

Let X1,X2,...,Xn be a random sample of size n from GAD      (2.1). Let X1<X2<  ...  <Xn denote the corresponding order statistics. The pdf and the cdf of the kth order statistic, say Y=Xk are given by

fYy=n!k1!nk!Fk1y1Fynkfy

=n!k1!nk!l=0nknkl1lFk+l1yfy

and

FYy=j=knnjFjy1Fynj

=j=knl=0njnjnjl1lFj+ly

respectively, for k=1,2,3,...,n.

Thus, the pdf and the cdf of kth order statistics of GAD are obtained as

fYy=n!θ31+αx2eθxθ2+2αk1!nk!l=0nknkl1l×1αθxθx+2+θ2+2αθ2+2αeθxk+l1

and

FYy=j=knl=0njnjnjl1l1αθxθx+2+θ2+2αθ2+2αeθxj+l

Renyi entropy measure

Entropy of a random variable X is a measure of variation of uncertainty. A popular entropy measure is Renyi entropy.9 If X is a continuous random variable having pdf f., then Renyi entropy is defined as

TRγ=11γlogfγxdx

where γ>0   and   γ1.

Thus, the Renyi entropy of GAD (2.1) can be obtained as

TRγ=11γlog0θ3γθ2+2αγ1+αx2γeθγxdx

=11γlog0θ3γθ2+2αγj=0γjαx2jeθγxdx

=11γlogj=0γjθ3γαjθ2+2αγ0eθγxx2j+11dx

=11γlogj=0γjθ3jαjθ2+2αγΓ2j+1θγ2j+1

=11γlogj=0γjθ3γ2j1αjθ2+2αγΓ2j+1γ2j+1

Bonferroni and lorenz curves

The Bonferroni and Lorenz curves10 and Bonferroni and Gini indices have applications not only in economics to study income and poverty, but also in other fields like reliability, demography, insurance and medicine. The Bonferroni and Lorenz curves are defined as

Bp=1pμ0qxfxdx=1pμ0xfxdxqxfxdx=1pμμqxfxdx   (4.5.1)

and Lp=1μ0qxfxdx=1μ0xfxdxqxfxdx=1μμqxfxdx,    (4.5.2)

respectively or equivalently

Bp=1pμ0pF1xdx   (4.5.3)

and Lp=1μ0pF1xdx   (4.5.4)

respectively, where μ=EX and q=F1p.

The Bonferroni and Gini indices are thus defined as

B=101Bpdp   (4.5.5)

and G=1201Lpdp   (4.5.6)

respectively.

Using pdf of GAD (2.1), we get

qxfxdx=θ3αq3+q+θ23αq2+1+6αθq+1eθqθθ2+2α   (4.5.7)

Now using equation (4.5.7) in (4.5.1) and (4.5.2), we get

Bp=1p1θ3αq3+q+θ23αq2+1+6αθq+1eθqθ2+6α   (4.5.8)

 and  Lp=1θ3αq3+q+θ23αq2+1+6αθq+1eθqθ2+6α

 Now using equations (4.5.8) and (4.5.9) in (4.5.5) and (4.5.6), the Bonferroni and Gini indices of GAD are thus obtained as

B=1θ3αq3+q+θ23αq2+1+6αθq+1eθqθ2+6α   (4.5.10)

G=2θ3αq3+q+θ23αq2+1+6αθq+1eθqθ2+6α1   (4.5.11)

Reliability measures

Hazrad rate function and mean residual life function

For a continuous distribution with pdf fx and cdf Fx, the hazard rate function (also known as the failure rate function) hx and the mean residual life function mx are respectively defined as

hx=limΔx0PX<x+ΔxX>xΔx=fx1Fx   (5.1.1)

and  mx=EXxX>x=11Fxx1Ft  dt (5.1.2)

The hx and mx of GAD (2.1) are thus obtained as

hx=θ31+αx2αθxθx+2+θ2+2α   (5.1.3)

and mx=1αθxθx+2+θ2+2αeθxxαθtθt+2+θ2+2αeθtdt 

=αθ2x2+4αθx+θ2+6αθαθxθx+2+θ2+2α   (5.1.4)

It can be easily verified that h0=θ3θ2+2α=f0 and m0=θ2+6αθθ2+2α=μ1'. Graphs of hx and mx of GAD for varying values of parameters θ and α are shown in figures 4 and 5. It is obvious that hx is either constant or increasing for various values of parameters θ and α. Further, the graph of mx is always decreasing for various values of parameters θ and α .

Stress- strength reliability

The stress-strength reliability describes the life of a component which has random strength X that is subjected to a random stress X. When the stress applied to it exceeds the strength, the component fails instantly and the component will function satisfactorily till X>Y. Therefore, R=PY<X is a measure of the component reliability and known as stress-strength parameter in statistical literature. It has wide applications in almost all areas of knowledge especially in medical science and engineering.

Let X and X be independent strength and stress random variables having GAD (2.1) with parameter θ1,α1 and θ2,α2 respectively. Then the stress-strength reliability R can be obtained as

R=PY<X=0PY<X|X=xfXxdx

=0fx;θ1,α1  Fx;θ2,α2dx

=1θ13θ26+4θ1θ25+23θ12+α1+3α2θ24+22θ12+2α1+9α2θ1θ23+θ14+2α1θ12+20θ12α2+40α1α2θ22+10α2θ12+2α1θ1θ2+2α2θ12+2α1θ12θ12+2α1θ22+2α2θ1+θ25

It can be easily verified that at α1=α2=1 and α1=α2=0 , the above expression reduces to the corresponding expression for Akash distribution of Shanker1 and exponential distribution.

Estimation

Estimates from moments

Since the GAD (2.1) has two parameters to be estimated, the first two moments about origin are required to estimate its parameters. Using the first two moments about origin of GAD (2.1), we have

μ2'μ1'2=k  (Say)=2θ2+12αθ2+2αθ2+6α2   (6.1.1)

Assuming α=bθ2 in (6.1.1), we get a quadratic equation in b as

1243kb2+473kb+2k=0   (6.12)

It should be noted that for real values ofb,k2.083. Replacing μ1' and μ2' by their respective sample moments in (6.1.1), an estimate of k can be obtained and substituting the value of k in equation (6.1.2), value of b can be obtained. Again taking α=bθ2 in the expression for the mean of GAD, we get the method of moment estimate (MOME) θ~ of θ as θ~=1+6b1+2bx¯ and thus the MOME α~=bθ2=b1+6b21+2b2x¯.

Maximum likelihood estimates

Let x1,x2,x3,  ...  ,xn be a random sample from GAD (2.1)). The likelihood function, L of (2.1) is given by

L=θ3θ2+2αni=1n1+αxi2enθx¯

and so its natural log likelihood function is thus obtained as

lnL=nlnθ3θ2+2α+i=1nln1+αxi2nθx¯

The maximum likelihood estimates (MLE) θ^ and α^ of parameters θ and α are then the solutions of the following non-linear equations

dlnLdθ=3nθ2nθθ2+2αnx¯=0

dlnLdα=2nθ2+2α+i=1nxi21+αxi2=0

where x¯ is the sample mean.

These two natural log likelihood equations do not seem to be solved directly because these equations cannot be expressed in closed forms. However, the Fisher"s scoring method can be applied to solve these equations.

We have

2lnLθ2=3nθ2+2nθ22αθ2+2α2

2lnLα2=4nθ2+2α2i=1nxi41+αxi22

2lnLθα=4nθθ2+2α2

The following equations can be solved for the MLEs θ^ and α^ of θ and α of GAD (2.1)

2lnLθ22lnLθα2lnLθα2lnLα2θ^=θ0α^=α0θ^θ0α^α0=lnLθlnLαθ^=θ0α^=α0

Where and are the initial values of and , respectively. These equations are solved iteratively till sufficiently close values of and are obtained. The initial values of the parameters and are taken from MOME estimates.

Figure 4 Graphs of of GAD for varying values of θ and α.

Figure 5 Graphs of mx of GAD for varying values of θ and α.

An illustrative example

The following data set represents the failure times (in minutes) for a sample of 15 electronic components in an accelerated life test, Lawless11

1.4    5.1    6.3    10.8    12.1    18.5    19.7
22.2    23.0    30.6    37.3    46.3    53.9    59.8
66.2

For this data set, GAD has been fitted along with one parameter exponential, Lindley and Akash distributions and two parameters generalized exponential distribution (GED) introduced by Gupta and Kundu,12 Weibull distribution introduced by Weibull,13 Lognormal distribution introduced by Pearce,14 quasi Lindley distribution (QLD) introduced by Shanker and Mishra,15 quasi Shanker distribution (QSD) introduced by Shanker and Shukla.16 The pdf and the cdf of GED, Weibull distribution, Lognormal distribution, QLD, QSD, Lindley distribution and exponential distribution are presented in table 1. The ML estimates, values of 2lnL, K-S statistics and p-values of the fitted distributions are presented in table 2. Recall that the best distribution corresponds to the lower values of 2lnL and K-S and higher p-value.

It can be easily seen from above table that the GAD gives better fit than all the considered distributions and hence it can be considered as an important two-parameter lifetime distribution for modeling lifetime dataset.

Models

p.d.f.

c.d.f.

GED

fx;θ,α=θα1eθxα1eθx

Fx;θ,α=1eθxα

Weibull

fx;θ,α=θαxα1eθxα

Fx;θ,α=1eθxα

Lognormal

fx;θ,α=12παxe12logxθα2

Fx;θ,α=ϕlogxθα

QLD

fx;θ,α=θα+1  α+θxeθx

Fx;θ,α=11+θxα+1eθx

QSD

fx;θ,α=θ3θ3+θ+2α                        ×θ+x+αx2eθx

Fx;θ,α=11+αθ2x2+θxθ+2αθ3+θ+2αeθx

Lindley

fx;θ=θ2θ+11+xeθx

Fx;θ=1θ+1+θxθ+1eθx

Exponential

fx;θ=θeθx

Fx;θ=1eθx

Table 1 The pdf and the cdf of fitted distributions

Distribution

ML estimates

2lnL

 

K-S

p- value

θ^

α^

GAD

0.084

0.007

128.02

0.107

0.987

GED

0.045

1.443

128.47

0.108

0.986

Weibull

0.034

1.306

128.04

0.451

0.941

Lognormal

2.931

1.061

131.23

0.161

0.951

QLD

0.062

0.398

128.21

0.123

0.964

QSD

0.074

0.001

129.37

0.121

0.961

Akash

0.108

 

133.68

0.184

0.627

Lindley

0.070

 

128.81

0.110

0.983

Exponential

0.036

 

129.47

0.156

0.807

Table 2 MLE"s, - 2ln L and K-S statistics of the fitted distributions

Concluding remarks

A two-parameter generalized Akash distribution (GAD), of which one parameter Akash distribution of Shanker1 and exponential distribution are particular cases, has been suggested and investigated. Its mathematical properties including moments, coefficients of variation, skewness, kurtosis, index of dispersion; hazard rate function, mean residual life function, stochastic ordering, mean deviations, order statistics, Bonferroni and Lorenz curves, Renyi entropy measure and stress-strength reliability have been discussed. For estimating its parameters, the method of moments and the method of maximum likelihood estimation have been discussed. Finally, a numerical example of real lifetime dataset has been presented to test the goodness of fit of GAD over exponential, Lindley, Akash, Lognormal, Weibull, QSD, QLD and GED.

Acknowledgments

None.

Conflicts of interests

None.

References

  1. Shanker R. Akash distribution and Its Applications. International Journal of Probability and Statistics. 2015;4(3):65‒75.
  2. Lindley DV. Fiducial distributions and Bayes’ theorem. Journal of the Royal Statistical Society. Series B. 1958;20(1):102‒ 107.
  3. Shanker R, Fesshaye H, Selvaraj S. On Modeling of Lifetimes Data Using Exponential and Lindley Distributions. Biom Biostat Int J. 2015;2(5):00042.
  4. Shanker R, Fesshaye H. On Modeling of Lifetime Data Using Akash, Shanker, Lindley and Exponential Distributions. Biom Biostat Int J. 2016;3(6):00084.
  5. Shanker R. The Discrete Poisson-Akash Distribution. International Journal of Probability and Statistics. 2017a;6(1):1‒10.
  6. Shanker R. Size-biased Poisson-Akash distribution and its Applications. International Journal of Statistics and Applications. 2017b;7(6):289‒297.
  7. Shanker R. Zero-truncated Poisson-Akash distribution and its Applications. American Journal of Mathematics and Statistics. 2017c;7(6):227‒236.
  8. Shaked M, Shanthikumar JG. Stochastic Orders and Their Applications. Academic Press, New York, USA. 1994.
  9. Renyi A. On measures of entropy and information, in proceedings of the 4th berkeley symposium on Mathematical Statistics and Probability. University of California press, Berkeley, USA. 1961;p.1:547‒561.
  10. Bonferroni CE. Elementi di Statistca generale, Seeber, Firenze. 1930.
  11. Lawless JF. Statistical Models and Methods for Lifetime data. John Wiley and Sons, New York, USA. 2003.
  12. Gupta RD, Kundu D. Generalized Exponential Distribution. Austalian & New Zealand Journal of Statistics. 1999;41(2):173‒188.
  13. Weibull W. A statistical distribution of wide applicability. Journal of Applied Mathematics. 1951;18:293‒297.
  14. Pearce S. Lognormal distribution. Nature. 1945;156:747.
  15. Shanker R, Mishra A. A Quasi Lindley Distribution. African Journal of Mathematics and Computer Science Research (AJMCSR). 2013;6(4):64‒71.
  16. Shanker R, Shukla KK. A Quasi Shanker distribution and Its Applications. Biometrics & Biostatistics International Journal. 2017 d;6(1):11‒20.
Creative Commons Attribution License

©2018 Shanker, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.