Submit manuscript...
eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Research Article Volume 7 Issue 3

A two–parameter Sujatha distribution

Mussie Tesfay, Rama Shanker

Department of Statistics, Eritrea Institute of Technology Asmara, Eritrea

Correspondence:

Received: March 15, 2018 | Published: May 10, 2018

Citation: Tesfay M, Shanker R. A two–parameter Sujatha distribution. Biom Biostat Int J. 2018;7(3):188–197. DOI: 10.15406/bbij.2018.07.00208

Download PDF

Abstract

This paper proposes a two parameter Sujatha distribution (TPSD). This includes size–biased Lindley distribution and Sujatha distribution as particular cases. It’s important statistical properties including its shapes for varying values of parameters, coefficient of variation, skewness, kurtosis, index of dispersion, hazard rate function, mean residual life function, stochastic ordering ,mean deviations, Bonferroni and Lorenz curves, and stress–strength reliability have been discussed. The estimation of parameters has been discussed using the method of moments and the method of maximum likelihood. Application of the distribution has been discussed with a real lifetime data.

Keywords: Sujatha distribution, moments, statistical properties, estimation of parameters, application

Introduction

The statistical analysis and modeling of lifetime data are crucial for statisticians working in various field of knowledge including medical science, engineering, social science, behavioral science, insurance, finance, among others. The classical one parameter lifetime distribution in statistics which were popular for modeling lifetime data are exponential distribution and Lindley distribution proposed by Lindley.1 Shanker et al.2 have detailed critical study on applications of lifetime data from engineering and biomedical science and observed that exponential and Lindley distributions are not always suitable due to theoretical or applied point of view and presence of single parameter. In search for a lifetime distribution which gives a better fit than exponential and Lindley distributions Shanker3 has proposed a new lifetime distribution named Sujatha distribution defined by its probability density function (pdf) and cumulative distribution function (cdf).

f1(x;θ)=θ3θ2+θ+2(1+x+x2)eθx;x>0,θ>0f1(x;θ)=θ3θ2+θ+2(1+x+x2)eθx;x>0,θ>0 (1.1)

F1(x;θ;α)=1[1+θx(θx+θ+2)αθ2+θ+2]eθx;x>0,θ>0F1(x;θ;α)=1[1+θx(θx+θ+2)αθ2+θ+2]eθx;x>0,θ>0 (1.2)

where θ is a scale parameter. It has been shown by Shanker3 that Sujatha distribution is a convex combination of exponential (θ) distribution, a gamma (2, θ) distribution and a gamma (3, θ) distribution. The first four moments about origin and central moments of Sujatha distribution obtained by Shanker3 are

μ1=θ2+2θ+6θ(θ2+θ+2)μ2=2(θ2+3θ+12)θ2(θ2+θ+2)

μ3=6(θ2+4θ+20)θ3(θ2+θ+2)μ4=24(θ2+5θ+30)θ4(θ2+θ+2)

μ2=θ4+4θ3+18θ2+12θ+12θ2(θ2+θ+2)2

μ3=2(θ6+6θ5+36θ4+44θ3+54θ2+36θ+24)θ3(θ2+θ+2)3

μ4=3(3θ8+24θ7+172θ6+376θ5+736θ4+864θ3+912θ2+480θ+240)θ4(θ2+θ+2)4

Shanker3 has discussed its important properties including shapes of density function for varying values of parameters, hazard rate function, mean residual life function, stochastic ordering, mean deviations, Bonferroni and Lorenz curves, and stress–strength reliability. Shanker3 discussed the maximum likelihood estimation of parameter and showed applications of Sujatha distribution to model lifetime data from biomedical science and engineering. Shanker4 has introduced Poisson–Sujatha distribution (PSD), a Poisson mixture of Sujatha distribution, and studied its properties, estimation of parameter and applications to model count data. Shanker & Hagos5 have discussed zero–truncated Poisson– Sujatha distribution (ZTPSD) and applications for modeling count data excluding zero counts. Shanker & Hagos6 have also studied size–biased Poisson– Sujatha distribution and its applications for count data excluding zero counts.

The Lindley distribution and a size–biased Lindley distribution (SBLD) having parameter  are defined by their pdf

f2(x;θ)=θ2θ+1(1+x)eθx;x>0,θ>0 (1.3)

f3(x;θ)=θ3θ+2x(1+x)eθx;x>0,θ>0(1.4)

Ghitany et al.7 have discussed various statistical and mathematical properties, estimation of parameter and application of Lindley distribution to model waiting time data in a bank and it has been showed that Lindley distribution provides better fit than exponential distribution.

In this paper, a two– parameter Sujatha distribution (TPSD), which includes size–biased Lindley distribution and Sujatha distribution as particular cases, has been proposed. It’s important statistical properties including coefficient of variation, skewness, kurtosis, index of dispersion, hazard rate function, mean residual life function, stochastic ordering, mean deviations, Bonferroni and Lorenz curves, stress–strength reliability have been discussed. The estimation of the parameters has been discussed using maximum likelihood estimation. A numerical example has been given to test the goodness of fit of TPSD over Lindley and Sujatha distributions.

A two–parameter Sujatha distribution

A Two parameter Sujatha distribution (TPSD) having parameters θ and α is defined by its pdf

f4(x;θ,α)=θ3αθ2+θ+2(α+x+x2)eθx;x>0,θ>0,α0    (2.1)

where θis a scale parameter and is α is a shape parameter. It can be easily verified that (2.1) reduces to Sujatha distribution (1.1) and SBLD (1.4) for α= 1 and α= 0 respectively.

Like Sujatha distribution (1.1), TPSD (2.1) is also a convex combination of exponential (θ), gamma (2, θ) and gamma (3, θ) distributions. We have

f4(x;θ,α)=p1g1(x,θ)+p2g2(x,θ)+(1p1p2)g3(x,θ)    (2.2)

where p1=αθ2αθ2+θ+2,p2=θαθ2+θ+2,g1(x,θ)=θeθx;x>0,θ>0

g2(x,θ)=θ2Γ(2)eθxx21;x>0,θ>0,g3(x,θ)=θ3Γ(3)eθxx31;x>0,θ>0.

The corresponding cdf of TPSD (2.1) can be obtained as

F2(x;θ,α)=1[1+θx(θx+θ+2)αθ2+θ+2]eθx;x>0,θ>0,α>0   (2.3)

Behavior of the pdf and the cdf of TPSD for varying values of parameter  and α are shown in Figures 1 & 2 respectively.

Figure 1 Behavior of the pdf of TPSD for varying values of parameter θ and α.
Figure 2 Behavior of the cdf of TPSD for varying values of parameter θ and α.

Moments and related measures

The moment generating function of TPSD (2.1) can be obtained as

MX(t)=θ3αθ2+θ+20e(θt)x(α+x+x2)dx

=θ3αθ2+θ+2[α(θt)+1(θt)2+2(θt)3]

=θ3αθ2+θ+2[αθk=0(tθ)k+1θ2k=0(k+1k)(tθ)k+2θ3k=0(k+2k)(tθ)k]

=k=0αθ2+θ(k+1)+(k+1)(k+2)αθ2+θ+2(tθ)k.

Thus, the rth moment about origin of TPSD (2.1), obtained as the coefficient of trr! in MX(t), is given by

μ/r=r!{αθ2+θ(r+1)+(r+1)(r+2)}θr(αθ2+θ+2);r=1,2,3,...  (3.1)

The first four moments about origin of TPSD are obtained as

μ/1=αθ2+2θ+6θ(αθ2+θ+2)    μ/2=2(αθ2+3θ+12)θ2(αθ2+θ+2)

μ/3=6(αθ2+4θ+20)θ3(αθ2+θ+2)       μ/4=24(αθ2+5θ+30)θ4(αθ2+θ+2)

Using the relationship between moments about the mean and moments about the origin, the moments about mean of TPSD are obtained as

μ2=α2θ4+4αθ3+16αθ2+2θ2+12θ+12θ2(αθ2+θ+2)2

μ3=2(α3θ6+6α2θ5+30α2θ4+6αθ4+42αθ3+36αθ2+2θ3+18θ2+36θ+24)θ3(αθ2+θ+2)3

μ4=3(3α4θ8+24α3θ7+128α3θ6+44α2θ6+344α2θ5+408α2θ4+32αθ5+320αθ4+768αθ3+8θ4+576αθ2+96θ3+336θ2+480θ+240)θ4(αθ2+θ+2)4

The coefficient of variation (C.V), coefficient of skewness(β1), coefficient of kurtosis (β2) and index of dispersion (γ) of TPSD are given by

C.V=σμ'1=α2θ4+4αθ3+16αθ2+2θ2+12θ+12αθ2+2θ+6

β1=μ3μ3/22=2(α3θ6+6α2θ5+30α2θ4+6αθ4+42αθ3+36αθ2+2θ3+18θ2+36θ+24)(α2θ4+4αθ3+16αθ2+2θ2+12θ+12)3/2

β2=μ4μ22=3(3α4θ8+24α3θ7+128α3θ6+44α2θ6+344α2θ5+408α2θ4+32αθ5+320αθ4+768αθ3+8θ4+576αθ2+96θ3+336θ2+480θ+240)(α2θ4+4αθ3+16αθ2+2θ2+12θ+12)2

γ=σ2μ'1=α2θ4+4αθ3+16αθ2+2θ2+12θ+12θ(αθ2+θ+2)(αθ2+2θ+6)

It can be easily verified that these statistical constants of TPSD reduce to the corresponding statistical constants of Sujatha distribution and SBLD at α=1 and α=0respectively. To study the behavior of C.V.,β1,β2 and γ, their values for varying values of the parameters θ and αhave been computed and presented in Tables 1–4.

      θ
α 

0.2

0.5

1

2

3

4

5

0.2

0.59658

0.624798

0.668399

0.739814

0.792609

0.8317

0.861102

0.5

0.599565

0.639569

0.708329

0.816497

0.882958

0.922627

0.946881

1

0.604466

0.662392

0.761739

0.892143

0.95119

0.977525

0.989835

2

0.614004

0.702377

0.83666

0.96225

0.996661

1.005655

1.007547

3

0.623205

0.736304

0.886072

0.991701

1.009814

1.011382

1.009973

4

0.632091

0.765466

0.920447

1.0059

1.014222

1.012415

1.009836

5

0.640678

0.790787

0.945247

1.013246

1.015576

1.012149

1.009163

Table 1 CV of TPSD for varying values of parameters and
For a given value of , C.V increases as the value of increases .But for values , CV decreases as the value of increases.

 θ
α 

0.2

0.5

1

2

3

4

5

0.2

1.156092

1.164414

1.193838

1.288579

1.40832

1.544566

1.694179

0.5

1.151692

1.153618

1.202728

1.394848

1.600302

1.785072

1.947347

1

1.145006

1.145839

1.247611

1.535588

1.733747

1.848046

1.912879

2

1.133828

1.153526

1.352316

1.647373

1.698737

1.653874

1.586127

3

1.125191

1.176753

1.43637

1.643895

1.562899

1.429794

1.310578

4

1.118703

1.206238

1.496066

1.59473

1.421347

1.244821

1.108

5

1.114041

1.237609

1.535958

1.528385

1.293862

1.097469

0.956984

Table 2 Coefficient of skewness of TPSD for varying values of parameters and
Since , TPSD is always positively skewed, and this means that TPSD is a suitable model for positively skewed lifetime data.

        θ 
α 

0.2

0.5

1

2

3

4

5

0.2

5.003116

5.022048

5.093943

5.346882

5.661781

5.979645

6.275987

0.5

4.991667

4.984856

5.082378

5.625

6.28542

6.865586

7.326691

1

4.973635

4.944566

5.170213

6.21499

7.193906

7.868405

8.297711

2

4.94128

4.924032

5.510204

7.2144

8.270528

8.774988

9.001011

3

4.913483

4.956867

5.903269

7.900925

8.799663

9.113171

9.206366

4

4.889821

5.022933

6.283795

8.3676

9.077558

9.253624

9.271966

5

4.869916

5.109996

6.633262

8.690336

9.230612

9.313262

9.289003

Table 3 Coefficient of kurtosis of TPSD for varying values of parameters and
Since TPSD is always leptokurtic, and this means that TPSD is more peaked than the normal curve. Thus TPSD is suitable for lifetime data which are leptokurtic.

θ

α

0.2

0.5

1

2

3

4

5

0.2

5.164536

2.158531

1.144817

0.615741

0.424979

0.323306

0.259524

0.5

5.197861

2.220551

1.218487

0.666667

0.451356

0.334416

0.262078

1

5.252329

2.31348

1.305556

0.696429

0.452381

0.325758

0.251067

2

5.357375

2.466667

1.4

0.694444

0.431884

0.306064

0.235088

3

5.457478

2.585608

1.439394

0.676136

0.414263

0.293608

0.2264

4

5.552914

2.678571

1.452381

0.657692

0.401423

0.285531

0.221109

5

5.643939

2.751515

1.451923

0.641667

0.39193

0.279936

0.217569

Table 4 Index of dispersion of TPSD for varying values of parameters and
As long as and , the nature of TPSD is over dispersed and for and , the nature of TPSD is over dispersed

The behavior of C.V.,β1,β2 and γ, for selected values of the parameters θ and αare shown in Figure 3.

Figure 3Behavior of C.V., , and , for varying values of the parameters θ and α.

Statistical properties

In this section, statistical properties of TPSD including hazard rate function, mean residual life function, stochastic ordering, mean deviation, Bonferroni and Lorenz curves and stress–strength reliability have been discussed.

Hazard rate function and mean residual life function

Let X be a continuous random variable with pdf f(x) and cdf F(x).The hazard rate function (also known as failure rate function), h(x) and the mean residual function,m(x) of X are respectively defined as

h(x)=limΔx0p(X<x+Δx|X>x)Δx=f(x)1F(x)

and m(x)=E[Xx|X>x]=11F(x)x[1F(t)]dt

 The corresponding hazard rate function, h(x) and the mean residual function, m(x)of TPSD (2.1) are thus obtained as

h(x)=θ3(α+x+x2)θ2(α+x+x2)+2θx+θ+2

And m(x)=αθ2+θ+2[(αθ2+θ+2)+θx(θx+θ+2)]eθxx[1+θt(θt+θ+2)αθ2+θ+2]eθtdt

=θ2(α+x+x2)+2θ(2x+1)+6θ[(αθ2+θ+2)+θx(θx+θ+2)]

=θ2x2+θ(θ+4)x+(αθ2+2θ+6)θ[θ2x2+θ(θ+2)x+(αθ2+θ+2)]

It can be easily verified that h(0)=αθ3αθ2+θ+2=f(0) and m(0)=αθ2+2θ+6θ(αθ2+θ+2)=μ/1.

It can also be easily verified that the expression of h(x)and m(x)of TPSD reduce to the corresponding h(x) and of Sujatha distribution at α=1.

The behavior of h(x) and m(x)of TPSD (2.1) for different values of its parameters are shown in Figures 4 & 5 respectively.

Figure 4Behavior of of TPSD for selected values of parameters θ and α.
Figure 5Behavior of of TPSD for selected values of parameters θ and α.

It is clearly seen from the graphs of h(x)and m(x) that h(x)is monotonically increasing function of x,θandαwhere as m(x) is monotonically decreasing function of x,θandα.

Stochastic ordering

Stochastic ordering of positive continuous random variable is an important tool for judging the comparative behavior of continuous distributions. A random variable X is said to be smaller than a random variable Y in the

  1. Stochastic order (XstY) if Fx(x)Fy(x) for all x
  2. Hazard rate order (XhrY) if hx(x)hy(x) for all x
  3. Mean residual life order (XmrlY) if mx(x)my(x) for all x
  4. Likelihood ratio order (XlrY) if fx(x)fy(x) decreases in x

The following results due to Shaked & Shanthikumar8 are well known for establishing stochastic ordering of distributions

XlrYXhrxstyYXmlrY.

The TPSD (2.1) is ordered with respect to the strongest “likelihood ratio” ordering as shown in the following theorem:

Theorem: Let X~TPSD(θ1,α1) and Y~TPSD(θ2,α2).If θ1>θ2 and α1=α2(or θ1=θ2 and α1α2) then XlrYandhenceXhrY,XmrlYandXstY. 

Proof: We have

fX(x;θ1,α1)fY(x;θ2,α2)=θ13(α2θ22+θ2+2)θ23(α1θ12+θ1+2)(α1+x+x2α2+x+x2)e(θ1θ2)x;x>0

lnfX(x;θ1,α1)fY(x;θ2,α2)=ln[θ13(α2θ22+θ2+2)θ23(α1θ12+θ1+2)]+ln(α1+x+x2α2+x+x2)(θ1θ2)x

This gives ddxlnfX(x;θ1,α1)fY(x;θ2,α2)=(α2α1)+2(α2α1)x(α1+x+x2)(α2+x+x2)(θ1θ2).

Thus, for (θ1>θ2andα1=α2)or(α1α2andθ1=θ2), ddxlnfX(x;θ1,α1)fY(x;θ2,α2)<0.

This means that XlrYandhenceXhrY,XmrlYandXstY. This shows flexibility of TPSD over Sujatha distribution.

Mean deviations

The amount of scatter in a population is evidently measured to some extent by the totality of deviations from the mean and the median. These are known as the mean deviation about the mean and the mean deviation about the median and are defined as

 δ1(x)=0|xμ|f(x)dx and δ2(x)=0|xM|f(x)dx, respectively,

where μ=E(X) and M=Median(X). The measures δ1(x) and δ2(x) can be calculated using the following relationships

δ1(x)=μ0(μx)f(x)dx+μ(xμ)f(x)dx

=μF(μ)μ0xf(x)dxμ[1F(μ)]+μxf(x)dx

=2μF(μ)2μ+2μxf(x)dx

=2μF(μ)2μ0xf(x)dx    (4.3.1)

and δ2(x)=M0(Mx)f(x)dx+M(xM)f(x)dx

=MF(M)M0xf(x)dxM(1F(M))+Mxf(x)dx

=μ+2Mxf(x)dx

=μ2M0xf(x)dx   (4.3.2)

Using the pdf (2.1) and expression for the mean of TPSD, we get

μ0xf4(x;θ,α)dx=μ[θ3(μ3+μ2+αμ)+θ2(3μ2+2μ+α)+2θ(3μ+1)+6]eθμθ(αθ2+θ+2)    (4.3.3)

M0xf4(x;θ,α)dx=μ[θ3(M3+M2+αM)+θ2(3M2+2M+α)+2θ(3M+1)+6]eθMθ(αθ2+θ+2)   (4.3.4)

Using expressions from (4.3.1), (4.3.2), (4.3.3) and (4.3.4) and after some tedious algebraic simplifications, the mean deviation about the mean, δ1(x) and the mean deviation about the median, δ2(x) of TPSD are obtained as

δ1(x)=2[θ2(μ2+μ+α)+2θ(2μ+1)+6]eθμθ(αθ2+θ+2)   (4.3.5)

and δ2(x)=2[θ3(M3+M2+αM)+θ2(3M2+2M+α)+2θ(3M+1)+6]eθMθ(αθ2+θ+2)μ   (4.3.6)

Bonferroni and Lorenz curves and indices

The Bonferroni and Lorenz curves and Bonferroni9 and Gini indices have applications not only in economics to study income and poverty, but also in other fields like reliability, demography and medical science. The Bonferroni and Lorenz curves are defined as

B(p)=1pμq0xf(x)dx=1pμ[0xf(x)dxqxf(x)dx]=1pμ[μqxf(x)dx]  (4.4.1)

andL(p)=1μq0xf(x)dx=1μ[0xf(x)dxqxf(x)dx]=1μ[μqxf(x)dx]   (4.4.2)

respectively or equivalently.

B(p)=1pμp0F1(x)dx   (4.4.3)

 and L(p)=1μp0F1(x)dx    (4.4.4)

respectively, where μ=E(x) and q=F1(p).

The Bonferroni and Gini indices are thus defined as

B=110B(p)dp   (4.4.5)

and G=1210L(p)dp   (4.4.6)

respectively.

Using pdf of TPSD (2.1), we get

qxf(x)dx={θ3(q3+q2+αq)+θ2(3q2+2q+α)+2θ(3q+1)+6}eθqθ(αθ2+2θ+6)   (4.4.7)

Now using equation (4.4.7), (4.4.1) and (4.4.2), we get

B(p)=1p[1{θ3(q3+q2+αq)+θ2(3q2+2q+α)+2θ(3q+1)+6}eθqαθ2+2θ+6]   (4.4.8)

and L(p)=1{θ3(q3+q2+αq)+θ2(3q2+2q+α)+2θ(3q+1)+6}eθqαθ2+2θ+6  (4.4.9)

Now using the equations (4.4.8) and (4.4.9) in (4.4.5) and (4.4.6), the Bonferroni and Gini indices of TPSD (2.1) are obtained as

B=1{θ3(q3+q2+αq)+θ2(3q2+2q+α)+2θ(3q+1)+6}eθqαθ2+2θ+6  (4.4.10)

G=1+2{θ3(q3+q2+αq)+θ2(3q2+2q+α)+2θ(3q+1)+6}eθqαθ2+2θ+6  (4.4.11)

Stress–strength reliability

The stress–strength reliability of a component illustrates the life of the component which has random strength that is subjected to random stress. When the stress of the component Y applied to it exceeds the strength of the component X, the component fails instantly and the component will function satisfactorily till X > Y . Therefore, R = P (Y < X ) is a measure of the component reliability and is known as stress–strength reliability in statistical literature. It has extensive application in almost all areas of knowledge especially in engineering such as structure, deterioration of rocket motor, static fatigue of ceramic component, aging of concrete pressure vessels etc.

Let X and Y be independent strength and stress random variables having TPSD (2.1) with parameter (θ1,α1) and (θ2,α2) respectively. Then the stress–strength reliability R of TPSD can be obtained as

R=P(Y<X)=0P(Y<X|X=x)fx(x)dx

=0f(x;θ1,α1)F(x;θ2,α2)dx

=1θ31[(α1α2)θ62+(2α1+α2+4α1α2θ1)θ52+(7α1θ1+3α2θ1+6α1+2α2+6α1α2θ21+3)θ42+(9α1θ21+3α2θ21+18α1θ1+4α2θ1+7θ1+4α1α2θ31+20)θ32+(5α1θ31+α2θ31+20α1θ21+2α2θ21+5θ21+30θ1+α1α2θ41+40)θ22+(α1θ31+10α1θ21+θ21+12θ1+20)θ1θ2+2(α1θ21+θ1+2)θ21](α1θ21+θ1+2)(α2θ22+θ2+2)(θ1+θ2)5

It can be verified that the stress–strength reliability of Sujatha distribution is a particular case of stress–strength reliability of TPSD at α1=α2=1.

Estimation of parameters

In this section, the estimations of parameters of TPSD using method of moments and method of maximum likelihood have been discussed.

Method of moment estimates (MOME)

Since TPSD (2.1) has two parameters to be estimated, the first two moments about the origin are required to estimate its parameters using method of moments. Equating the population mean to the sample mean, we have

ˉx=αθ2+2θ+6θ(αθ2+θ+2)=αθ2+θ+2θ(αθ2+θ+2)+θ+4θ(αθ2+θ+2)

ˉx=1θ+θ+4θ(αθ2+θ+2)

(αθ2+θ+2)=θ+4θˉx1   (5.1.1)

Again equating the second population moment with the corresponding sample moment, we have

m'2=2(αθ2+3θ+12)θ2(αθ2+θ+2)=2(αθ2+θ+2)θ2(αθ2+θ+2)+4(θ+5)θ2(αθ2+θ+2)

m'2=2(αθ2+3θ+12)θ2(αθ2+θ+2)=2θ2+4(θ+5)θ2(αθ2+θ+2)

αθ2+θ+2=4(θ+5)m'2θ22    (5.1.2)

Equations (5.1.1) and (5.1.2) give the following cubic equation in θ 

m'2θ3+4(m'2ˉX)θ22(10ˉX1)θ+12=0   (5.1.3)

Solving equation (5.1.3) using any iterative method such as Newton–Raphson method, Regula–Falsi method or Bisection method, method of moment estimation (MOME) ˜θof θ can be obtained and substituting the value of ˜θ in equation (5.1.1), MOME ˜α of αcan be obtained as

˜α=ˉx˜θ22(ˉx1)˜θ+6˜θ2(˜θˉx1)   (5.1.4)

Maximum likelihood estimates (MLE)

Let (x1,x2,x3,...xn) be random sample from TPSD (2.1). The likelihood function L is given by

L=(θ3αθ2+θ+2)nni=1(α+xi+x2i)enθˉx,

where ˉx is the sample mean.

The natural log likelihood function is thus obtained as

lnL=n[3lnθln(αθ2+θ+2)]+ni=1ln(α+xi+x2i)nθˉx

The maximum likelihood estimate (MLE’s) (ˆθ,ˆα) of (θ,α) are then the solutions of the following non–linear equations

lnLθ=3nθn(2αθ+1)αθ2+θ+2nˉx=0    (5.2.1)

lnLα=nθ2αθ2+θ+2+ni=11α+xi+x2i=0     (5.2.2)

These two natural log likelihood equations do not seem to be solved directly, because they cannot be expressed in closed forms. The (MLE’s) (ˆθ,ˆα) of (θ,α) can be computed directly by solving the natural log likelihood equations using Newton–Raphson iteration method using R–software till sufficiently close values of ˆθandˆα are obtained. The initial values of parameters  and α are the MOME (˜θ,˜α) of the parameters (θ,α).

A numerical example

In this section an application of TPSD using maximum likelihood estimates has been discussed with a real lifetime data set. The data set regarding vinyl chloride obtained from clean up gradient monitoring wells in mg/l, available in Bhaumik et al.10 has been considered. The data set is

5.1

1.2

1.3

0.6

0.5

2.4

0.5

1.1

8

0.8

0.4

0.6

0.9

0.4

2

0.5

5.3

3.2

2.7

2.9

2.5

2.3

1

0.2

0.1

0.1

1.8

0.9

2.4

6.8

1.2

0.4

0.2

In order to compare lifetime distributions, values of2lnL, AIC (Akaike Information Criterion), BIC (Bayesian Information Criterion) and K–S Statistic (Kolmogorov–Smirnov Statistic) for the above data set has been computed. The formulae for computing AIC, BIC, and K–S Statistics are as follows:

AIC=2lnL+2k, BIC=2lnL+klnn and D=Supx|Fn(x)F0(x)|, where k= the number of parameters,n= the sample size, and the Fn(x)= empirical distribution function. The best distribution is the distribution which corresponds to lower values of 2lnL, AIC, and K–S statistic and higher p–value. The MLE (ˆθ,ˆα) along with their standard errors, 2lnL, AIC, BIC, K–S Statistic and p–value of the fitted distributions are presented in the Table 5.

It is obvious that TPSD gives much closer fit than Sujatha and Lindley distributions. Therefore, TPSD can be considered as an important two–parameter lifetime distribution. In order to see the closeness of the fit given by Lindley, Sujatha and TPSD, the fitted pdf plots of these distributions for the given dataset have been shown in Figure 6. It is also obvious from the fitted plots of the distribution along with the histogram of the original dataset that TPSD gives much closer fit than Lindley and Sujatha distributions.

Figure 6Fitted pdf plots of distributions for the given dataset.

Conclusion

A two parameter Sujatha distribution (TPSD) has been introduced which includes size–biased Lindley distribution and Sujatha distribution, proposed by Shanker (2016a) as particular cases. Moments about origin and moments about mean have been obtained and nature of coefficient of variation, coefficient of skewness, coefficient of kurtosis and index of dispersion of TPSD have been studied with varying values of the parameters. The nature of probability density function, cumulative distribution function, hazard rate function and mean residual life function have been discussed with varying values of the parameters. The stochastic ordering, mean deviations, Bonferroni and Lorenz curves, and stress–strength reliability have also been discussed. The method of moments and method of maximum likelihood have been discussed for estimating parameters. A numerical example of real lifetime data have been presented to show the application of TPSD and the goodness of fit of TPSD gives much closer fit over Sujatha and Lindley distributions.

Acknowledgement

Authors are grateful to the editor–in–chief of the journal and the anonymous reviewer for constructive comments on the paper.

Conflict of interest

Authors declare that there is no conflict of interest.

References

Creative Commons Attribution License

©2018 Tesfay, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.