A two–parameter Sujatha distribution

doi:10.15406/bbij.2018.07.00208

This paper proposes a two–parameter Sujatha distribution (TPSD), which includes the size–biased Lindley distribution and the Sujatha distribution as particular cases. Its important statistical properties, including its shapes for varying values of the parameters, coefficient of variation, skewness, kurtosis, index of dispersion, hazard rate function, mean residual life function, stochastic ordering, mean deviations, Bonferroni and Lorenz curves, and stress–strength reliability, have been discussed. The estimation of parameters has been discussed using the method of moments and the method of maximum likelihood. An application of the distribution has been discussed with a real lifetime data set.

Keywords: Sujatha distribution, moments, statistical properties, estimation of parameters, application

The statistical analysis and modeling of lifetime data are crucial for statisticians working in various fields of knowledge, including medical science, engineering, social science, behavioral science, insurance and finance, among others. The classical one–parameter lifetime distributions which have been popular for modeling lifetime data are the exponential distribution and the Lindley distribution proposed by Lindley.¹ Shanker et al.² have made a detailed critical study of applications to lifetime data from engineering and biomedical science and observed that the exponential and Lindley distributions are not always suitable, from a theoretical or applied point of view, owing to the presence of a single parameter. In search of a lifetime distribution which gives a better fit than the exponential and Lindley distributions, Shanker³ has proposed a new lifetime distribution named the Sujatha distribution, defined by its probability density function (pdf) and cumulative distribution function (cdf)

$f_{1} (x; θ) = \frac{θ^{3}}{θ^{2} + θ + 2} (1 + x + x^{2}) e^{- θ x}; x > 0, θ > 0$ (1.1)

$F_{1} (x; θ) = 1 - [1 + \frac{θ x (θ x + θ + 2)}{θ^{2} + θ + 2}] e^{- θ x}; x > 0, θ > 0$ (1.2)

where θ is a scale parameter. It has been shown by Shanker³ that Sujatha distribution is a convex combination of exponential (θ) distribution, a gamma (2, θ) distribution and a gamma (3, θ) distribution. The first four moments about origin and central moments of Sujatha distribution obtained by Shanker³ are

$μ_{1}^{'} = \frac{θ^{2} + 2 θ + 6}{θ (θ^{2} + θ + 2)}$ $μ_{2}^{'} = \frac{2 (θ^{2} + 3 θ + 12)}{θ^{2} (θ^{2} + θ + 2)}$

$μ_{3}^{'} = \frac{6 (θ^{2} + 4 θ + 20)}{θ^{3} (θ^{2} + θ + 2)}$ $μ_{4}^{'} = \frac{24 (θ^{2} + 5 θ + 30)}{θ^{4} (θ^{2} + θ + 2)}$

$μ_{2} = \frac{θ^{4} + 4 θ^{3} + 18 θ^{2} + 12 θ + 12}{θ^{2} {(θ^{2} + θ + 2)}^{2}}$

$μ_{3} = \frac{2 (θ^{6} + 6 θ^{5} + 36 θ^{4} + 44 θ^{3} + 54 θ^{2} + 36 θ + 24)}{θ^{3} {(θ^{2} + θ + 2)}^{3}}$

$μ_{4} = \frac{3 (3 θ^{8} + 24 θ^{7} + 172 θ^{6} + 376 θ^{5} + 736 θ^{4} + 864 θ^{3} + 912 θ^{2} + 480 θ + 240)}{θ^{4} {(θ^{2} + θ + 2)}^{4}}$

Shanker³ has discussed its important properties including shapes of the density function for varying values of the parameter, hazard rate function, mean residual life function, stochastic ordering, mean deviations, Bonferroni and Lorenz curves, and stress–strength reliability. Shanker³ discussed the maximum likelihood estimation of the parameter and showed applications of Sujatha distribution to model lifetime data from biomedical science and engineering. Shanker⁴ has introduced the Poisson–Sujatha distribution (PSD), a Poisson mixture of Sujatha distribution, and studied its properties, estimation of the parameter and applications to model count data. Shanker & Hagos⁵ have discussed the zero–truncated Poisson–Sujatha distribution (ZTPSD) and its applications for modeling count data excluding zero counts. Shanker & Hagos⁶ have also studied the size–biased Poisson–Sujatha distribution and its applications for count data excluding zero counts.

The Lindley distribution and the size–biased Lindley distribution (SBLD) having parameter θ are defined by their pdfs

$f_{2} (x; θ) = \frac{θ^{2}}{θ + 1} (1 + x) e^{- θ x}; x > 0, θ > 0$ (1.3)

$f_{3} (x; θ) = \frac{θ^{3}}{θ + 2} x (1 + x) e^{- θ x}; x > 0, θ > 0$ (1.4)

Ghitany et al.⁷ have discussed various statistical and mathematical properties, estimation of the parameter and an application of the Lindley distribution to model waiting time data in a bank, and have shown that the Lindley distribution provides a better fit than the exponential distribution.

In this paper, a two–parameter Sujatha distribution (TPSD), which includes the size–biased Lindley distribution and the Sujatha distribution as particular cases, has been proposed. Its important statistical properties including coefficient of variation, skewness, kurtosis, index of dispersion, hazard rate function, mean residual life function, stochastic ordering, mean deviations, Bonferroni and Lorenz curves, and stress–strength reliability have been discussed. The estimation of the parameters has been discussed using the method of moments and maximum likelihood estimation. A numerical example has been given to test the goodness of fit of TPSD over the Lindley and Sujatha distributions.

A two–parameter Sujatha distribution (TPSD) having parameters $θ$ and $α$ is defined by its pdf

$f_{4} (x; θ, α) = \frac{θ^{3}}{α θ^{2} + θ + 2} (α + x + x^{2}) e^{- θ x}; x > 0, θ > 0, α \geq 0$ (2.1)

where $θ$ is a scale parameter and $α$ is a shape parameter. It can be easily verified that (2.1) reduces to Sujatha distribution (1.1) and SBLD (1.4) for $α = 1$ and $α = 0$ respectively.

Like Sujatha distribution (1.1), TPSD (2.1) is also a convex combination of exponential ( $θ$ ), gamma (2, $θ$ ) and gamma (3, $θ$ ) distributions. We have

$f_{4} (x; θ, α) = p_{1} g_{1} (x, θ) + p_{2} g_{2} (x, θ) + (1 - p_{1} - p_{2}) g_{3} (x, θ)$ (2.2)

where $p_{1} = \frac{α θ^{2}}{α θ^{2} + θ + 2}, p_{2} = \frac{θ}{α θ^{2} + θ + 2}, g_{1} (x, θ) = θ e^{- θ x}; x > 0, θ > 0$

$g_{2} (x, θ) = \frac{θ^{2}}{Γ (2)} e^{- θ x} x^{2 - 1}; x > 0, θ > 0, g_{3} (x, θ) = \frac{θ^{3}}{Γ (3)} e^{- θ x} x^{3 - 1}; x > 0, θ > 0.$
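The mixture representation (2.2) can be checked numerically, and it also suggests one simple way to simulate TPSD variates: pick a gamma component with probabilities $p_{1}$, $p_{2}$, $1 - p_{1} - p_{2}$ and then draw from it. The following is a minimal Python sketch under that reading; the function names are illustrative and are not taken from the paper.

```python
import math
import random

def tpsd_pdf(x, theta, alpha):
    """TPSD density (2.1)."""
    d = alpha * theta**2 + theta + 2
    return theta**3 / d * (alpha + x + x**2) * math.exp(-theta * x)

def gamma_pdf(x, shape, theta):
    """Gamma(shape, theta) density with rate theta, as in g_1, g_2, g_3."""
    return theta**shape / math.gamma(shape) * x**(shape - 1) * math.exp(-theta * x)

def mixture_weights(theta, alpha):
    """Weights p_1, p_2 and p_3 = 1 - p_1 - p_2 of (2.2)."""
    d = alpha * theta**2 + theta + 2
    return alpha * theta**2 / d, theta / d, 2 / d

def tpsd_mixture_pdf(x, theta, alpha):
    """Right-hand side of (2.2)."""
    p1, p2, p3 = mixture_weights(theta, alpha)
    return (p1 * gamma_pdf(x, 1, theta)
            + p2 * gamma_pdf(x, 2, theta)
            + p3 * gamma_pdf(x, 3, theta))

def tpsd_sample(theta, alpha, rng):
    """Draw one variate: choose a component, then sample from it."""
    p1, p2, _ = mixture_weights(theta, alpha)
    u = rng.random()
    shape = 1 if u < p1 else 2 if u < p1 + p2 else 3
    # random.gammavariate takes (shape, scale); the scale is 1/theta.
    return rng.gammavariate(shape, 1 / theta)

theta, alpha = 1.5, 0.7  # arbitrary illustrative values
for x in (0.1, 1.0, 3.5):
    print(x, tpsd_pdf(x, theta, alpha), tpsd_mixture_pdf(x, theta, alpha))
```

The two printed columns should agree up to rounding, since $1 - p_{1} - p_{2} = 2 / (α θ^{2} + θ + 2)$.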

The corresponding cdf of TPSD (2.1) can be obtained as

$F_{2} (x; θ, α) = 1 - [1 + \frac{θ x (θ x + θ + 2)}{α θ^{2} + θ + 2}] e^{- θ x}; x > 0, θ > 0, α \geq 0$ (2.3)
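As a quick sanity check on (2.3), the closed-form cdf can be compared with a numerical integral of the pdf (2.1). The sketch below uses a hand-written composite Simpson rule; the helper `simpson` and the parameter values are assumptions made only for this illustration.

```python
import math

def tpsd_pdf(x, theta, alpha):
    """TPSD density (2.1)."""
    d = alpha * theta**2 + theta + 2
    return theta**3 / d * (alpha + x + x**2) * math.exp(-theta * x)

def tpsd_cdf(x, theta, alpha):
    """TPSD distribution function (2.3)."""
    d = alpha * theta**2 + theta + 2
    return 1 - (1 + theta * x * (theta * x + theta + 2) / d) * math.exp(-theta * x)

def simpson(f, a, b, n=2000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

theta, alpha = 1.5, 0.7  # arbitrary illustrative values
for x in (0.5, 2.0, 6.0):
    numeric = simpson(lambda t: tpsd_pdf(t, theta, alpha), 0.0, x)
    print(x, numeric, tpsd_cdf(x, theta, alpha))
```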

The behavior of the pdf and the cdf of TPSD for varying values of the parameters $θ$ and $α$ is shown in Figures 1 & 2 respectively.

Figure 1 Behavior of the pdf of TPSD for varying values of parameter θ and α.

Figure 2 Behavior of the cdf of TPSD for varying values of parameter θ and α.

The moment generating function of TPSD (2.1) can be obtained as

$M_{X} (t) = \frac{θ^{3}}{α θ^{2} + θ + 2} \int_{0}^{\infty} e^{- (θ - t) x} (α + x + x^{2}) d x$

$= \frac{θ^{3}}{α θ^{2} + θ + 2} [\frac{α}{(θ - t)} + \frac{1}{{(θ - t)}^{2}} + \frac{2}{{(θ - t)}^{3}}]$

$= \frac{θ^{3}}{α θ^{2} + θ + 2} [\frac{α}{θ} \sum_{k = 0}^{\infty} {(\frac{t}{θ})}^{k} + \frac{1}{θ^{2}} \sum_{k = 0}^{\infty} \binom{k + 1}{k} {(\frac{t}{θ})}^{k} + \frac{2}{θ^{3}} \sum_{k = 0}^{\infty} \binom{k + 2}{k} {(\frac{t}{θ})}^{k}]$

$= \sum_{k = 0}^{\infty} \frac{α θ^{2} + θ (k + 1) + (k + 1) (k + 2)}{α θ^{2} + θ + 2} {(\frac{t}{θ})}^{k} .$

Thus, the rth moment about origin of TPSD (2.1), obtained as the coefficient of $\frac{t^{r}}{r!}$ in $M_{X} (t)$ , is given by

$μ_{r}^{/} = \frac{r! \{α θ^{2} + θ (r + 1) + (r + 1) (r + 2)\}}{θ^{r} (α θ^{2} + θ + 2)}; r = 1, 2, 3, ...$ (3.1)

The first four moments about origin of TPSD are obtained as

$μ_{1}^{/} = \frac{α θ^{2} + 2 θ + 6}{θ (α θ^{2} + θ + 2)}$ $μ_{2}^{/} = \frac{2 (α θ^{2} + 3 θ + 12)}{θ^{2} (α θ^{2} + θ + 2)}$

$μ_{3}^{/} = \frac{6 (α θ^{2} + 4 θ + 20)}{θ^{3} (α θ^{2} + θ + 2)}$ $μ_{4}^{/} = \frac{24 (α θ^{2} + 5 θ + 30)}{θ^{4} (α θ^{2} + θ + 2)}$
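The general formula (3.1) is easy to evaluate and to cross-check against direct numerical integration of $x^{r} f_{4} (x; θ, α)$. The following Python sketch does both; the truncation point $60 / θ$ and the step count are assumptions chosen so that the neglected tail is negligible, not values from the paper.

```python
import math

def tpsd_pdf(x, theta, alpha):
    """TPSD density (2.1)."""
    d = alpha * theta**2 + theta + 2
    return theta**3 / d * (alpha + x + x**2) * math.exp(-theta * x)

def tpsd_raw_moment(r, theta, alpha):
    """r-th moment about the origin from (3.1)."""
    d = alpha * theta**2 + theta + 2
    num = alpha * theta**2 + theta * (r + 1) + (r + 1) * (r + 2)
    return math.factorial(r) * num / (theta**r * d)

def numeric_moment(r, theta, alpha, n=40000):
    """Composite Simpson approximation of the integral of x^r f(x) on [0, 60/theta]."""
    upper = 60.0 / theta  # the neglected tail carries a factor e^{-60}
    h = upper / n
    g = lambda x: x**r * tpsd_pdf(x, theta, alpha)
    total = g(0.0) + g(upper)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(i * h)
    return total * h / 3

theta, alpha = 1.5, 0.7  # arbitrary illustrative values
for r in range(1, 5):
    print(r, tpsd_raw_moment(r, theta, alpha), numeric_moment(r, theta, alpha))
```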

Using the relationship between moments about the mean and moments about the origin, the moments about mean of TPSD are obtained as

$μ_{2} = \frac{α^{2} θ^{4} + 4 α θ^{3} + 16 α θ^{2} + 2 θ^{2} + 12 θ + 12}{θ^{2} {(α θ^{2} + θ + 2)}^{2}}$

$μ_{3} = \frac{2 (α^{3} θ^{6} + 6 α^{2} θ^{5} + 30 α^{2} θ^{4} + 6 α θ^{4} + 42 α θ^{3} + 36 α θ^{2} + 2 θ^{3} + 18 θ^{2} + 36 θ + 24)}{θ^{3} {(α θ^{2} + θ + 2)}^{3}}$

$μ_{4} = \frac{3 (\begin{array}{l} 3 α^{4} θ^{8} + 24 α^{3} θ^{7} + 128 α^{3} θ^{6} + 44 α^{2} θ^{6} + 344 α^{2} θ^{5} + 408 α^{2} θ^{4} + 32 α θ^{5} + 320 α θ^{4} \\ + 768 α θ^{3} + 8 θ^{4} + 576 α θ^{2} + 96 θ^{3} + 336 θ^{2} + 480 θ + 240 \end{array})}{θ^{4} {(α θ^{2} + θ + 2)}^{4}}$

The coefficient of variation (C.V), coefficient of skewness $(\sqrt{β_{1}})$ , coefficient of kurtosis $(β_{2})$ and index of dispersion $(γ)$ of TPSD are given by

$C . V = \frac{σ}{μ_{1}^{'}} = \frac{\sqrt{α^{2} θ^{4} + 4 α θ^{3} + 16 α θ^{2} + 2 θ^{2} + 12 θ + 12}}{α θ^{2} + 2 θ + 6}$

$\sqrt{β_{1}} = \frac{μ_{3}}{μ_{2}^{3 / 2}} = \frac{2 (α^{3} θ^{6} + 6 α^{2} θ^{5} + 30 α^{2} θ^{4} + 6 α θ^{4} + 42 α θ^{3} + 36 α θ^{2} + 2 θ^{3} + 18 θ^{2} + 36 θ + 24)}{{(α^{2} θ^{4} + 4 α θ^{3} + 16 α θ^{2} + 2 θ^{2} + 12 θ + 12)}^{3 / 2}}$

$β_{2} = \frac{μ_{4}}{μ_{2}^{2}} = \frac{3 (\begin{array}{l} 3 α^{4} θ^{8} + 24 α^{3} θ^{7} + 128 α^{3} θ^{6} + 44 α^{2} θ^{6} + 344 α^{2} θ^{5} + 408 α^{2} θ^{4} + 32 α θ^{5} + 320 α θ^{4} \\ + 768 α θ^{3} + 8 θ^{4} + 576 α θ^{2} + 96 θ^{3} + 336 θ^{2} + 480 θ + 240 \end{array})}{{(α^{2} θ^{4} + 4 α θ^{3} + 16 α θ^{2} + 2 θ^{2} + 12 θ + 12)}^{2}}$

$γ = \frac{σ^{2}}{μ_{1}^{'}} = \frac{α^{2} θ^{4} + 4 α θ^{3} + 16 α θ^{2} + 2 θ^{2} + 12 θ + 12}{θ (α θ^{2} + θ + 2) (α θ^{2} + 2 θ + 6)}$

It can be easily verified that these statistical constants of TPSD reduce to the corresponding statistical constants of Sujatha distribution and SBLD at $α = 1$ and $α = 0$ respectively. To study the behavior of C.V., $\sqrt{β_{1}}$ , $β_{2}$ and $γ$ , their values for varying values of the parameters $θ$ and $α$ have been computed and presented in Tables 1–4.
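The four measures can also be computed directly from the raw moments (3.1) through the usual relations between central and raw moments, which gives an independent check on the closed-form expressions above. A minimal Python sketch follows; the function names are illustrative.

```python
import math

def raw_moment(r, theta, alpha):
    """r-th moment about the origin of TPSD, equation (3.1)."""
    d = alpha * theta**2 + theta + 2
    num = alpha * theta**2 + theta * (r + 1) + (r + 1) * (r + 2)
    return math.factorial(r) * num / (theta**r * d)

def tpsd_measures(theta, alpha):
    """Return (C.V., sqrt(beta_1), beta_2, gamma) computed from raw moments."""
    m1, m2, m3, m4 = (raw_moment(r, theta, alpha) for r in range(1, 5))
    mu2 = m2 - m1**2
    mu3 = m3 - 3 * m2 * m1 + 2 * m1**3
    mu4 = m4 - 4 * m3 * m1 + 6 * m2 * m1**2 - 3 * m1**4
    cv = math.sqrt(mu2) / m1      # coefficient of variation
    skew = mu3 / mu2**1.5         # coefficient of skewness
    kurt = mu4 / mu2**2           # coefficient of kurtosis
    disp = mu2 / m1               # index of dispersion
    return cv, skew, kurt, disp

print([round(v, 6) for v in tpsd_measures(1.0, 1.0)])
```

For $θ = 1$ and $α = 1$ the closed-form expressions give C.V. $= \sqrt{47} / 9$, $\sqrt{β_{1}} = 402 / 47^{3 / 2}$, $β_{2} = 11421 / 2209$ and $γ = 47 / 36$, which is what the sketch should reproduce.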

α \ θ	0.2	0.5	1	2	3	4	5
0.2	0.59658	0.624798	0.668399	0.739814	0.792609	0.8317	0.861102
0.5	0.599565	0.639569	0.708329	0.816497	0.882958	0.922627	0.946881
1	0.604466	0.662392	0.761739	0.892143	0.95119	0.977525	0.989835
2	0.614004	0.702377	0.83666	0.96225	0.996661	1.005655	1.007547
3	0.623205	0.736304	0.886072	0.991701	1.009814	1.011382	1.009973
4	0.632091	0.765466	0.920447	1.0059	1.014222	1.012415	1.009836
5	0.640678	0.790787	0.945247	1.013246	1.015576	1.012149	1.009163

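Table 1 C.V. of TPSD for varying values of the parameters $θ$ (columns) and $α$ (rows)
For a given $α \leq 2$, the C.V. increases as $θ$ increases over the tabulated range, but for $α \geq 3$ it rises to a maximum slightly above 1 and then decreases slowly as $θ$ increases further.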

α \ θ	0.2	0.5	1	2	3	4	5
0.2	1.156092	1.164414	1.193838	1.288579	1.40832	1.544566	1.694179
0.5	1.151692	1.153618	1.202728	1.394848	1.600302	1.785072	1.947347
1	1.145006	1.145839	1.247611	1.535588	1.733747	1.848046	1.912879
2	1.133828	1.153526	1.352316	1.647373	1.698737	1.653874	1.586127
3	1.125191	1.176753	1.43637	1.643895	1.562899	1.429794	1.310578
4	1.118703	1.206238	1.496066	1.59473	1.421347	1.244821	1.108
5	1.114041	1.237609	1.535958	1.528385	1.293862	1.097469	0.956984

Table 2 Coefficient of skewness of TPSD for varying values of the parameters $θ$ (columns) and $α$ (rows)
Since $\sqrt{β_{1}} > 0$ , TPSD is always positively skewed, and this means that TPSD is a suitable model for positively skewed lifetime data.

α \ θ	0.2	0.5	1	2	3	4	5
0.2	5.003116	5.022048	5.093943	5.346882	5.661781	5.979645	6.275987
0.5	4.991667	4.984856	5.082378	5.625	6.28542	6.865586	7.326691
1	4.973635	4.944566	5.170213	6.21499	7.193906	7.868405	8.297711
2	4.94128	4.924032	5.510204	7.2144	8.270528	8.774988	9.001011
3	4.913483	4.956867	5.903269	7.900925	8.799663	9.113171	9.206366
4	4.889821	5.022933	6.283795	8.3676	9.077558	9.253624	9.271966
5	4.869916	5.109996	6.633262	8.690336	9.230612	9.313262	9.289003

Table 3 Coefficient of kurtosis of TPSD for varying values of the parameters $θ$ (columns) and $α$ (rows)
Since $β_{2} > 3$ for all tabulated values, TPSD is leptokurtic, which means that TPSD is more peaked than the normal curve. Thus TPSD is suitable for lifetime data which are leptokurtic.

α \ θ	0.2	0.5	1	2	3	4	5
0.2	5.164536	2.158531	1.144817	0.615741	0.424979	0.323306	0.259524
0.5	5.197861	2.220551	1.218487	0.666667	0.451356	0.334416	0.262078
1	5.252329	2.31348	1.305556	0.696429	0.452381	0.325758	0.251067
2	5.357375	2.466667	1.4	0.694444	0.431884	0.306064	0.235088
3	5.457478	2.585608	1.439394	0.676136	0.414263	0.293608	0.2264
4	5.552914	2.678571	1.452381	0.657692	0.401423	0.285531	0.221109
5	5.643939	2.751515	1.451923	0.641667	0.39193	0.279936	0.217569

Table 4 Index of dispersion of TPSD for varying values of the parameters $θ$ (columns) and $α$ (rows)
For the tabulated values, TPSD is over–dispersed ( $γ > 1$ ) when $θ \leq 1$ and under–dispersed ( $γ < 1$ ) when $θ \geq 2$ , for every tabulated value of $α$ .

The behavior of C.V., $\sqrt{β_{1}}$ , $β_{2}$ and $γ$ for selected values of the parameters $θ$ and $α$ is shown in Figure 3.

Figure 3 Behavior of C.V., $\sqrt{β_{1}}$ , $β_{2}$ and $γ$ for varying values of the parameters θ and α.

In this section, statistical properties of TPSD including hazard rate function, mean residual life function, stochastic ordering, mean deviation, Bonferroni and Lorenz curves and stress–strength reliability have been discussed.

Hazard rate function and mean residual life function

Let X be a continuous random variable with pdf $f (x)$ and cdf $F (x)$ . The hazard rate function (also known as the failure rate function), $h (x)$ , and the mean residual life function, $m (x)$ , of X are respectively defined as

$h (x) = \lim_{Δ x \to 0} \frac{P (X < x + Δ x | X > x)}{Δ x} = \frac{f (x)}{1 - F (x)}$

and $m (x) = E [X - x | X > x] = \frac{1}{1 - F (x)} \int_{x}^{\infty} [1 - F (t)] d t$

The corresponding hazard rate function, $h (x)$ , and the mean residual life function, $m (x)$ , of TPSD (2.1) are thus obtained as

$h (x) = \frac{θ^{3} (α + x + x^{2})}{θ^{2} (α + x + x^{2}) + 2 θ x + θ + 2}$

and $m (x) = \frac{α θ^{2} + θ + 2}{[(α θ^{2} + θ + 2) + θ x (θ x + θ + 2)] e^{- θ x}} \int_{x}^{\infty} [1 + \frac{θ t (θ t + θ + 2)}{α θ^{2} + θ + 2}] e^{- θ t} d t$

$= \frac{θ^{2} (α + x + x^{2}) + 2 θ (2 x + 1) + 6}{θ [(α θ^{2} + θ + 2) + θ x (θ x + θ + 2)]}$

$= \frac{θ^{2} x^{2} + θ (θ + 4) x + (α θ^{2} + 2 θ + 6)}{θ [θ^{2} x^{2} + θ (θ + 2) x + (α θ^{2} + θ + 2)]}$

It can be easily verified that $h (0) = \frac{α θ^{3}}{α θ^{2} + θ + 2} = f (0)$ and $m (0) = \frac{α θ^{2} + 2 θ + 6}{θ (α θ^{2} + θ + 2)} = μ_{1}^{/} .$

It can also be easily verified that the expressions of $h (x)$ and $m (x)$ of TPSD reduce to the corresponding $h (x)$ and $m (x)$ of Sujatha distribution at $α = 1.$
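The closed forms of $h (x)$ and $m (x)$ and the boundary identities $h (0) = f (0)$ and $m (0) = μ_{1}^{/}$ can be checked numerically. The sketch below is a minimal Python illustration with arbitrary parameter values; the function names are not from the paper.

```python
import math

def tpsd_pdf(x, theta, alpha):
    """TPSD density (2.1)."""
    d = alpha * theta**2 + theta + 2
    return theta**3 / d * (alpha + x + x**2) * math.exp(-theta * x)

def tpsd_cdf(x, theta, alpha):
    """TPSD distribution function (2.3)."""
    d = alpha * theta**2 + theta + 2
    return 1 - (1 + theta * x * (theta * x + theta + 2) / d) * math.exp(-theta * x)

def tpsd_hazard(x, theta, alpha):
    """Closed-form hazard rate function h(x)."""
    q = alpha + x + x**2
    return theta**3 * q / (theta**2 * q + 2 * theta * x + theta + 2)

def tpsd_mrl(x, theta, alpha):
    """Closed-form mean residual life function m(x)."""
    num = theta**2 * x**2 + theta * (theta + 4) * x + (alpha * theta**2 + 2 * theta + 6)
    den = theta * (theta**2 * x**2 + theta * (theta + 2) * x + (alpha * theta**2 + theta + 2))
    return num / den

theta, alpha = 1.5, 0.7  # arbitrary illustrative values
mean = (alpha * theta**2 + 2 * theta + 6) / (theta * (alpha * theta**2 + theta + 2))
print(tpsd_hazard(0.0, theta, alpha), tpsd_pdf(0.0, theta, alpha))  # h(0) and f(0)
print(tpsd_mrl(0.0, theta, alpha), mean)                            # m(0) and the mean
for x in (0.5, 2.0, 5.0):
    ratio = tpsd_pdf(x, theta, alpha) / (1 - tpsd_cdf(x, theta, alpha))
    print(x, tpsd_hazard(x, theta, alpha), ratio)                   # h(x) and f/(1 - F)
```

Both $h (x) \to θ$ and $m (x) \to 1 / θ$ as $x \to \infty$, which follows from the leading terms of the two rational expressions.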

The behavior of $h (x)$ and $m (x)$ of TPSD (2.1) for different values of its parameters is shown in Figures 4 & 5 respectively.

Figure 4 Behavior of $h (x)$ of TPSD for selected values of parameters θ and α.

Figure 5 Behavior of $m (x)$ of TPSD for selected values of parameters θ and α.

It is clearly seen from the graphs of $h (x)$ and $m (x)$ that $h (x)$ is a monotonically increasing function of $x$ , $θ$ and $α$ , whereas $m (x)$ is a monotonically decreasing function of $x$ , $θ$ and $α$ .

Stochastic ordering

Stochastic ordering of positive continuous random variables is an important tool for judging the comparative behavior of continuous distributions. A random variable X is said to be smaller than a random variable Y in the

Stochastic order $(X \leq_{s t} Y)$ if $F_{X} (x) \geq F_{Y} (x)$ for all x
Hazard rate order $(X \leq_{h r} Y)$ if $h_{X} (x) \geq h_{Y} (x)$ for all x
Mean residual life order $(X \leq_{m r l} Y)$ if $m_{X} (x) \leq m_{Y} (x)$ for all x
Likelihood ratio order $(X \leq_{l r} Y)$ if $\frac{f_{X} (x)}{f_{Y} (x)}$ decreases in x

The following results due to Shaked & Shanthikumar⁸ are well known for establishing stochastic ordering of distributions

$X \leq_{l r} Y \Rightarrow X \leq_{h r} Y \Rightarrow X \leq_{m r l} Y$ and $X \leq_{h r} Y \Rightarrow X \leq_{s t} Y .$

The TPSD (2.1) is ordered with respect to the strongest “likelihood ratio” ordering as shown in the following theorem:

Theorem: Let $X \sim TPSD (θ_{1}, α_{1})$ and $Y \sim TPSD (θ_{2}, α_{2})$ . If $θ_{1} > θ_{2}$ and $α_{1} = α_{2}$ (or $θ_{1} = θ_{2}$ and $α_{1} \geq α_{2}$ ), then $X \leq_{l r} Y$ and hence $X \leq_{h r} Y$ , $X \leq_{m r l} Y$ and $X \leq_{s t} Y .$

Proof: We have

$\frac{f_{X} (x; θ_{1}, α_{1})}{f_{Y} (x; θ_{2}, α_{2})} = \frac{θ_{1}^{3} (α_{2} θ_{2}^{2} + θ_{2} + 2)}{θ_{2}^{3} (α_{1} θ_{1}^{2} + θ_{1} + 2)} (\frac{α_{1} + x + x^{2}}{α_{2} + x + x^{2}}) e^{- (θ_{1} - θ_{2}) x}; x > 0$

$\ln \frac{f_{X} (x; θ_{1}, α_{1})}{f_{Y} (x; θ_{2}, α_{2})} = \ln [\frac{θ_{1}^{3} (α_{2} θ_{2}^{2} + θ_{2} + 2)}{θ_{2}^{3} (α_{1} θ_{1}^{2} + θ_{1} + 2)}] + \ln (\frac{α_{1} + x + x^{2}}{α_{2} + x + x^{2}}) - (θ_{1} - θ_{2}) x$

This gives $\frac{d}{d x} \ln \frac{f_{X} (x; θ_{1}, α_{1})}{f_{Y} (x; θ_{2}, α_{2})} = \frac{(α_{2} - α_{1}) + 2 (α_{2} - α_{1}) x}{(α_{1} + x + x^{2}) (α_{2} + x + x^{2})} - (θ_{1} - θ_{2}) .$

Thus, for ( $θ_{1} > θ_{2}$ and $α_{1} = α_{2}$ ) or ( $α_{1} \geq α_{2}$ and $θ_{1} = θ_{2}$ ), $\frac{d}{d x} \ln \frac{f_{X} (x; θ_{1}, α_{1})}{f_{Y} (x; θ_{2}, α_{2})} \leq 0 ,$ so the ratio of the densities does not increase in x.

This means that $X \leq_{l r} Y$ and hence $X \leq_{h r} Y$ , $X \leq_{m r l} Y$ and $X \leq_{s t} Y$ . This shows flexibility of TPSD over Sujatha distribution.
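The monotonicity of the likelihood ratio in the theorem can be illustrated on a grid of x values. The Python sketch below evaluates the log-ratio of two TPSD densities for one parameter pair of each kind covered by the theorem, plus one pair with the shape parameters reversed; the grid and the parameter values are assumptions made for illustration only, and a grid check is not a proof.

```python
import math

def tpsd_pdf(x, theta, alpha):
    """TPSD density (2.1)."""
    d = alpha * theta**2 + theta + 2
    return theta**3 / d * (alpha + x + x**2) * math.exp(-theta * x)

def log_ratio(x, params_x, params_y):
    """ln of f_X(x; theta1, alpha1) / f_Y(x; theta2, alpha2)."""
    return math.log(tpsd_pdf(x, *params_x)) - math.log(tpsd_pdf(x, *params_y))

def is_nonincreasing(values, tol=1e-12):
    """True if the sequence never increases by more than tol."""
    return all(b <= a + tol for a, b in zip(values, values[1:]))

grid = [0.05 * i for i in range(1, 401)]  # x in (0, 20]

# theta1 > theta2 with alpha1 = alpha2
case_theta = [log_ratio(x, (2.0, 1.0), (1.0, 1.0)) for x in grid]
# theta1 = theta2 with alpha1 > alpha2
case_alpha = [log_ratio(x, (1.0, 3.0), (1.0, 0.5)) for x in grid]
# reversed shape parameters: the ratio should now increase
case_reversed = [log_ratio(x, (1.0, 0.5), (1.0, 3.0)) for x in grid]

print(is_nonincreasing(case_theta), is_nonincreasing(case_alpha),
      is_nonincreasing(case_reversed))
```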

Mean deviations

The amount of scatter in a population is evidently measured to some extent by the totality of deviations from the mean and the median. These are known as the mean deviation about the mean and the mean deviation about the median and are defined as

$δ_{1} (x) = \int_{0}^{\infty} | x - μ | f (x) d x$ and $δ_{2} (x) = \int_{0}^{\infty} | x - M | f (x) d x,$ respectively,

where $μ = E (X)$ and $M = M e d i a n (X) .$ The measures $δ_{1} (x)$ and $δ_{2} (x)$ can be calculated using the following relationships

$δ_{1} (x) = \int_{0}^{μ} (μ - x) f (x) d x + \int_{μ}^{\infty} (x - μ) f (x) d x$

$= μ F (μ) - \int_{0}^{μ} x f (x) d x - μ [1 - F (μ)] + \int_{μ}^{\infty} x f (x) d x$

$= 2 μ F (μ) - 2 μ + 2 \int_{μ}^{\infty} x f (x) d x$

$= 2 μ F (μ) - 2 \int_{0}^{μ} x f (x) d x$ (4.3.1)

and $δ_{2} (x) = \int_{0}^{M} (M - x) f (x) d x + \int_{M}^{\infty} (x - M) f (x) d x$

$= M F (M) - \int_{0}^{M} x f (x) d x - M (1 - F (M)) + \int_{M}^{\infty} x f (x) d x$

$= - μ + 2 \int_{M}^{\infty} x f (x) d x$

$= μ - 2 \int_{0}^{M} x f (x) d x$ (4.3.2)

Using the pdf (2.1) and expression for the mean of TPSD, we get

$\int_{0}^{μ} x f_{4} (x; θ, α) d x = μ - \frac{[θ^{3} (μ^{3} + μ^{2} + α μ) + θ^{2} (3 μ^{2} + 2 μ + α) + 2 θ (3 μ + 1) + 6] e^{- θ μ}}{θ (α θ^{2} + θ + 2)}$ (4.3.3)

$\int_{0}^{M} x f_{4} (x; θ, α) d x = μ - \frac{[θ^{3} (M^{3} + M^{2} + α M) + θ^{2} (3 M^{2} + 2 M + α) + 2 θ (3 M + 1) + 6] e^{- θ M}}{θ (α θ^{2} + θ + 2)}$ (4.3.4)

Using expressions from (4.3.1), (4.3.2), (4.3.3) and (4.3.4) and after some tedious algebraic simplifications, the mean deviation about the mean, $δ_{1} (x)$ and the mean deviation about the median, $δ_{2} (x)$ of TPSD are obtained as

$δ_{1} (x) = \frac{2 [θ^{2} (μ^{2} + μ + α) + 2 θ (2 μ + 1) + 6] e^{- θ μ}}{θ (α θ^{2} + θ + 2)}$ (4.3.5)

and $δ_{2} (x) = \frac{2 [θ^{3} (M^{3} + M^{2} + α M) + θ^{2} (3 M^{2} + 2 M + α) + 2 θ (3 M + 1) + 6] e^{- θ M}}{θ (α θ^{2} + θ + 2)} - μ$ (4.3.6)
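Expressions (4.3.5) and (4.3.6) can be evaluated once the mean and the median are known. The median has no closed form, so the Python sketch below finds it by bisection on the cdf (2.3); the bisection tolerance and the function names are assumptions of this illustration.

```python
import math

def tpsd_cdf(x, theta, alpha):
    """TPSD distribution function (2.3)."""
    d = alpha * theta**2 + theta + 2
    return 1 - (1 + theta * x * (theta * x + theta + 2) / d) * math.exp(-theta * x)

def tpsd_mean(theta, alpha):
    """First moment about the origin."""
    d = alpha * theta**2 + theta + 2
    return (alpha * theta**2 + 2 * theta + 6) / (theta * d)

def tpsd_median(theta, alpha, tol=1e-12):
    """Solve F(M) = 1/2 by bisection."""
    lo, hi = 0.0, 1.0
    while tpsd_cdf(hi, theta, alpha) < 0.5:
        hi *= 2  # expand the bracket until it contains the median
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if tpsd_cdf(mid, theta, alpha) < 0.5:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def upper_partial_mean(c, theta, alpha):
    """Integral of x f(x) over (c, infinity), the fraction in (4.3.3)/(4.3.4)."""
    d = alpha * theta**2 + theta + 2
    num = (theta**3 * (c**3 + c**2 + alpha * c)
           + theta**2 * (3 * c**2 + 2 * c + alpha)
           + 2 * theta * (3 * c + 1) + 6)
    return num * math.exp(-theta * c) / (theta * d)

def mean_dev_about_mean(theta, alpha):
    """Closed form (4.3.5)."""
    mu = tpsd_mean(theta, alpha)
    d = alpha * theta**2 + theta + 2
    num = theta**2 * (mu**2 + mu + alpha) + 2 * theta * (2 * mu + 1) + 6
    return 2 * num * math.exp(-theta * mu) / (theta * d)

def mean_dev_about_median(theta, alpha):
    """Closed form (4.3.6)."""
    m = tpsd_median(theta, alpha)
    return 2 * upper_partial_mean(m, theta, alpha) - tpsd_mean(theta, alpha)

print(mean_dev_about_mean(1.0, 1.0), mean_dev_about_median(1.0, 1.0))
```

Since the median minimises the mean absolute deviation, the second printed value should not exceed the first.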

Bonferroni and Lorenz curves and indices

The Bonferroni and Lorenz curves and Bonferroni⁹ and Gini indices have applications not only in economics to study income and poverty, but also in other fields like reliability, demography and medical science. The Bonferroni and Lorenz curves are defined as

$B (p) = \frac{1}{p μ} \int_{0}^{q} x f (x) d x = \frac{1}{p μ} [\int_{0}^{\infty} x f (x) d x - \int_{q}^{\infty} x f (x) d x] = \frac{1}{p μ} [μ - \int_{q}^{\infty} x f (x) d x]$ (4.4.1)

and $L (p) = \frac{1}{μ} \int_{0}^{q} x f (x) d x = \frac{1}{μ} [\int_{0}^{\infty} x f (x) d x - \int_{q}^{\infty} x f (x) d x] = \frac{1}{μ} [μ - \int_{q}^{\infty} x f (x) d x]$ (4.4.2)

respectively, or equivalently

$B (p) = \frac{1}{p μ} \int_{0}^{p} F^{- 1} (x) d x$ (4.4.3)

and $L (p) = \frac{1}{μ} \int_{0}^{p} F^{- 1} (x) d x$ (4.4.4)

respectively, where $μ = E (X)$ and $q = F^{- 1} (p)$ .

The Bonferroni and Gini indices are thus defined as

$B = 1 - \int_{0}^{1} B (p) d p$ (4.4.5)

and $G = 1 - 2 \int_{0}^{1} L (p) d p$ (4.4.6)

respectively.

Using pdf of TPSD (2.1), we get

$\int_{q}^{\infty} x f (x) d x = \frac{\{θ^{3} (q^{3} + q^{2} + α q) + θ^{2} (3 q^{2} + 2 q + α) + 2 θ (3 q + 1) + 6\} e^{- θ q}}{θ (α θ^{2} + θ + 2)}$ (4.4.7)

Now using equations (4.4.7), (4.4.1) and (4.4.2), we get

$B (p) = \frac{1}{p} [1 - \frac{\{θ^{3} (q^{3} + q^{2} + α q) + θ^{2} (3 q^{2} + 2 q + α) + 2 θ (3 q + 1) + 6\} e^{- θ q}}{α θ^{2} + 2 θ + 6}]$ (4.4.8)

and $L (p) = 1 - \frac{\{θ^{3} (q^{3} + q^{2} + α q) + θ^{2} (3 q^{2} + 2 q + α) + 2 θ (3 q + 1) + 6\} e^{- θ q}}{α θ^{2} + 2 θ + 6}$ (4.4.9)

Now using the equations (4.4.8) and (4.4.9) in (4.4.5) and (4.4.6), the Bonferroni and Gini indices of TPSD (2.1) are obtained as

$B = 1 - \frac{\{θ^{3} (q^{3} + q^{2} + α q) + θ^{2} (3 q^{2} + 2 q + α) + 2 θ (3 q + 1) + 6\} e^{- θ q}}{α θ^{2} + 2 θ + 6}$ (4.4.10)

$G = - 1 + \frac{2 \{θ^{3} (q^{3} + q^{2} + α q) + θ^{2} (3 q^{2} + 2 q + α) + 2 θ (3 q + 1) + 6\} e^{- θ q}}{α θ^{2} + 2 θ + 6}$ (4.4.11)
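The curves (4.4.8) and (4.4.9) depend on $p$ through $q = F^{- 1} (p)$ , which has to be found numerically. One way to evaluate the curves, and to approximate the defining integrals (4.4.5) and (4.4.6) for the two indices, is sketched below in Python. The bisection-based quantile and the midpoint rule are choices made for this illustration and are not the paper's procedure.

```python
import math

def tpsd_cdf(x, theta, alpha):
    """TPSD distribution function (2.3)."""
    d = alpha * theta**2 + theta + 2
    return 1 - (1 + theta * x * (theta * x + theta + 2) / d) * math.exp(-theta * x)

def tpsd_quantile(p, theta, alpha, tol=1e-12):
    """q = F^{-1}(p) by bisection, for 0 < p < 1."""
    lo, hi = 0.0, 1.0
    while tpsd_cdf(hi, theta, alpha) < p:
        hi *= 2  # expand the bracket until it contains the quantile
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if tpsd_cdf(mid, theta, alpha) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def lorenz(p, theta, alpha):
    """Lorenz curve L(p) from (4.4.9)."""
    q = tpsd_quantile(p, theta, alpha)
    num = (theta**3 * (q**3 + q**2 + alpha * q)
           + theta**2 * (3 * q**2 + 2 * q + alpha)
           + 2 * theta * (3 * q + 1) + 6) * math.exp(-theta * q)
    return 1 - num / (alpha * theta**2 + 2 * theta + 6)

def bonferroni(p, theta, alpha):
    """Bonferroni curve B(p) = L(p) / p from (4.4.8)."""
    return lorenz(p, theta, alpha) / p

def indices(theta, alpha, n=2000):
    """Midpoint-rule approximations of the indices (4.4.5) and (4.4.6)."""
    ps = [(i + 0.5) / n for i in range(n)]
    ls = [lorenz(p, theta, alpha) for p in ps]
    b_index = 1 - sum(l / p for l, p in zip(ls, ps)) / n
    g_index = 1 - 2 * sum(ls) / n
    return b_index, g_index

print(indices(1.0, 1.0))
```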

Stress–strength reliability

The stress–strength reliability of a component describes the life of a component which has random strength X and is subjected to random stress Y. When the stress Y applied to the component exceeds its strength X, the component fails instantly, and the component functions satisfactorily as long as X > Y. Therefore, R = P (Y < X ) is a measure of the component reliability and is known as stress–strength reliability in the statistical literature. It has extensive applications in almost all areas of knowledge, especially in engineering, such as structures, deterioration of rocket motors, static fatigue of ceramic components, aging of concrete pressure vessels, etc.

Let X and Y be independent strength and stress random variables having TPSD (2.1) with parameters $(θ_{1}, α_{1})$ and $(θ_{2}, α_{2})$ respectively. Then the stress–strength reliability R of TPSD can be obtained as

$R = P (Y < X) = \int_{0}^{\infty} P (Y < X | X = x) f_{X} (x) d x$

$= \int_{0}^{\infty} f (x; θ_{1}, α_{1}) F (x; θ_{2}, α_{2}) d x$

$= 1 - \frac{θ_{1}^{3} [\begin{array}{l} (α_{1} α_{2}) θ_{2}^{6} + (2 α_{1} + α_{2} + 4 α_{1} α_{2} θ_{1}) θ_{2}^{5} + (7 α_{1} θ_{1} + 3 α_{2} θ_{1} + 6 α_{1} + 2 α_{2} + 6 α_{1} α_{2} θ_{1}^{2} + 3) θ_{2}^{4} \\ + (9 α_{1} θ_{1}^{2} + 3 α_{2} θ_{1}^{2} + 18 α_{1} θ_{1} + 4 α_{2} θ_{1} + 7 θ_{1} + 4 α_{1} α_{2} θ_{1}^{3} + 20) θ_{2}^{3} \\ + (5 α_{1} θ_{1}^{3} + α_{2} θ_{1}^{3} + 20 α_{1} θ_{1}^{2} + 2 α_{2} θ_{1}^{2} + 5 θ_{1}^{2} + 30 θ_{1} + α_{1} α_{2} θ_{1}^{4} + 40) θ_{2}^{2} \\ + (α_{1} θ_{1}^{3} + 10 α_{1} θ_{1}^{2} + θ_{1}^{2} + 12 θ_{1} + 20) θ_{1} θ_{2} + 2 (α_{1} θ_{1}^{2} + θ_{1} + 2) θ_{1}^{2} \end{array}]}{(α_{1} θ_{1}^{2} + θ_{1} + 2) (α_{2} θ_{2}^{2} + θ_{2} + 2) {(θ_{1} + θ_{2})}^{5}}$

It can be verified that the stress–strength reliability of Sujatha distribution is a particular case of stress–strength reliability of TPSD at $α_{1} = α_{2} = 1 .$
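The integral defining R can be evaluated without expanding it fully: the integrand $f (x; θ_{1}, α_{1}) [1 - F (x; θ_{2}, α_{2})]$ is a polynomial of degree four times $e^{- (θ_{1} + θ_{2}) x}$ , so each term integrates to $k! / (θ_{1} + θ_{2})^{k + 1}$ . The Python sketch below uses this compact route, which is a rearrangement assumed for this illustration rather than the printed expansion; it should agree with the closed form above.

```python
import math

def stress_strength(theta1, alpha1, theta2, alpha2):
    """R = P(Y < X) for X ~ TPSD(theta1, alpha1), Y ~ TPSD(theta2, alpha2)."""
    d1 = alpha1 * theta1**2 + theta1 + 2
    d2 = alpha2 * theta2**2 + theta2 + 2
    # Polynomial factor of the pdf of X: alpha1 + x + x^2
    a = [alpha1, 1.0, 1.0]
    # d2 times the polynomial factor of the survival function of Y
    b = [d2, theta2 * (theta2 + 2), theta2**2]
    # Coefficients of the product polynomial (degree 4)
    c = [0.0] * 5
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            c[i + j] += ai * bj
    s = theta1 + theta2
    # Integral of x^k e^{-s x} over (0, infinity) is k! / s^(k + 1)
    integral = sum(ck * math.factorial(k) / s**(k + 1) for k, ck in enumerate(c))
    return 1 - theta1**3 * integral / (d1 * d2)

print(stress_strength(1.0, 1.0, 1.0, 1.0))   # identical distributions
print(stress_strength(1.0, 1.0, 2.0, 1.0))   # stress with the larger scale parameter
```

When X and Y have the same parameters, R should equal 1/2, and swapping the roles of the two parameter pairs should give 1 - R.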

In this section, the estimation of the parameters of TPSD using the method of moments and the method of maximum likelihood has been discussed.

Method of moment estimates (MOME)

Since TPSD (2.1) has two parameters to be estimated, the first two moments about the origin are required to estimate its parameters using method of moments. Equating the population mean to the sample mean, we have

$\bar{x} = \frac{α θ^{2} + 2 θ + 6}{θ (α θ^{2} + θ + 2)} = \frac{α θ^{2} + θ + 2}{θ (α θ^{2} + θ + 2)} + \frac{θ + 4}{θ (α θ^{2} + θ + 2)}$

$\bar{x} = \frac{1}{θ} + \frac{θ + 4}{θ (α θ^{2} + θ + 2)}$

$(α θ^{2} + θ + 2) = \frac{θ + 4}{θ \bar{x} - 1}$ (5.1.1)

Again equating the second population moment with the corresponding sample moment, we have

$m_{2}^{'} = \frac{2 (α θ^{2} + 3 θ + 12)}{θ^{2} (α θ^{2} + θ + 2)} = \frac{2 (α θ^{2} + θ + 2)}{θ^{2} (α θ^{2} + θ + 2)} + \frac{4 (θ + 5)}{θ^{2} (α θ^{2} + θ + 2)}$

$m_{2}^{'} = \frac{2 (α θ^{2} + 3 θ + 12)}{θ^{2} (α θ^{2} + θ + 2)} = \frac{2}{θ^{2}} + \frac{4 (θ + 5)}{θ^{2} (α θ^{2} + θ + 2)}$

$α θ^{2} + θ + 2 = \frac{4 (θ + 5)}{m_{2}^{'} θ^{2} - 2}$ (5.1.2)

Equations (5.1.1) and (5.1.2) give the following cubic equation in $θ$

$m_{2}^{'} θ^{3} + 4 (m_{2}^{'} - \bar{x}) θ^{2} - 2 (10 \bar{x} - 1) θ + 12 = 0$ (5.1.3)

Solving equation (5.1.3) using any iterative method such as the Newton–Raphson method, the Regula–Falsi method or the bisection method, the method of moments estimate (MOME) $\tilde{θ}$ of $θ$ can be obtained. Substituting the value of $\tilde{θ}$ in equation (5.1.1), the MOME $\tilde{α}$ of $α$ can be obtained as

$\tilde{α} = \frac{- \bar{x} {\tilde{θ}}^{2} - 2 (\bar{x} - 1) \tilde{θ} + 6}{{\tilde{θ}}^{2} (\tilde{θ} \bar{x} - 1)}$ (5.1.4)
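The two-step procedure, solving the cubic (5.1.3) and then applying (5.1.4), can be sketched as follows in Python using bisection. The search interval is an assumption of this sketch: it starts at $θ = 1 / \bar{x}$ because (5.1.1) requires $θ \bar{x} > 1$ for $α θ^{2} + θ + 2$ to be positive, and the upper limit is arbitrary. The cubic may have other roots outside this interval, and the resulting $\tilde{α}$ is not guaranteed to be non-negative for every sample.

```python
def tpsd_mome(xbar, m2, upper=1e3, tol=1e-12):
    """Method of moments estimates (theta, alpha) from the sample mean xbar
    and the second raw sample moment m2, via (5.1.3) and (5.1.4)."""
    def cubic(t):
        # Left-hand side of (5.1.3)
        return m2 * t**3 + 4 * (m2 - xbar) * t**2 - 2 * (10 * xbar - 1) * t + 12

    lo, hi = 1.0 / xbar, upper  # (5.1.1) needs theta * xbar > 1
    if cubic(lo) * cubic(hi) > 0:
        raise ValueError("no sign change of (5.1.3) on the search interval")
    while hi - lo > tol * max(1.0, hi):
        mid = (lo + hi) / 2
        if cubic(lo) * cubic(mid) <= 0:
            hi = mid
        else:
            lo = mid
    theta = (lo + hi) / 2
    # Equation (5.1.4)
    alpha = (-xbar * theta**2 - 2 * (xbar - 1) * theta + 6) / (theta**2 * (theta * xbar - 1))
    return theta, alpha

# Population moments of TPSD(theta = 1, alpha = 1): mean 9/4, second raw moment 8
print(tpsd_mome(2.25, 8.0))
```

Feeding in the exact population moments of a known TPSD, as in the last line, should return the parameters that generated them; with real data, `xbar` and `m2` would be the sample mean and the mean of the squared observations.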

Maximum likelihood estimates (MLE)

Let $(x_{1}, x_{2}, x_{3}, ... , x_{n})$ be a random sample from TPSD (2.1). The likelihood function L is given by

$L = {(\frac{θ^{3}}{α θ^{2} + θ + 2})}^{n} \prod_{i = 1}^{n} (α + x_{i} + x_{i}^{2}) e^{- n θ \bar{x}},$

where $\bar{x}$ is the sample mean.

The natural log likelihood function is thus obtained as

$\ln L = n [3 \ln θ - \ln (α θ^{2} + θ + 2)] + \sum_{i = 1}^{n} \ln (α + x_{i} + x_{i}^{2}) - n θ \bar{x}$

The maximum likelihood estimates (MLEs) $(\hat{θ}, \hat{α})$ of $(θ, α)$ are then the solutions of the following non–linear equations

$\frac{\partial \ln L}{\partial θ} = \frac{3 n}{θ} - \frac{n (2 α θ + 1)}{α θ^{2} + θ + 2} - n \bar{x} = 0$ (5.2.1)

$\frac{\partial \ln L}{\partial α} = - \frac{n θ^{2}}{α θ^{2} + θ + 2} + \sum_{i = 1}^{n} \frac{1}{α + x_{i} + x_{i}^{2}} = 0$ (5.2.2)

These two natural log likelihood equations cannot be solved directly, because their solutions cannot be expressed in closed form. The MLEs $(\hat{θ}, \hat{α})$ of $(θ, α)$ can be computed by solving the natural log likelihood equations numerically using the Newton–Raphson iteration method in R software until sufficiently close values of $\hat{θ}$ and $\hat{α}$ are obtained. The initial values of the parameters $θ$ and $α$ are taken to be the MOME $(\tilde{θ}, \tilde{α})$ of the parameters $(θ, α) .$
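The ingredients that a Newton–Raphson routine or any general-purpose optimiser needs are the log likelihood and the two score functions (5.2.1) and (5.2.2). The Python sketch below implements them and compares the analytic scores with central finite differences of $\ln L$ ; the data values are those of the vinyl chloride data set as printed in the numerical example of this paper, and the evaluation point is arbitrary. It does not perform the fit itself.

```python
import math

# Vinyl chloride data as printed in the numerical example
DATA = [5.1, 1.2, 1.3, 0.6, 0.5, 2.4, 0.5, 1.1, 8, 0.8, 0.4, 0.6, 0.9,
        0.4, 2, 0.5, 5.3, 3.2, 2.7, 2.9, 2.5, 2.3, 1, 0.2, 0.1, 0.1,
        1.8, 0.9, 2.4, 6.8, 1.2, 0.4, 0.2]

def log_likelihood(theta, alpha, xs):
    """Natural log likelihood of TPSD for the sample xs."""
    n = len(xs)
    d = alpha * theta**2 + theta + 2
    return (n * (3 * math.log(theta) - math.log(d))
            + sum(math.log(alpha + x + x * x) for x in xs)
            - theta * sum(xs))  # n * theta * xbar

def score(theta, alpha, xs):
    """Left-hand sides of the likelihood equations (5.2.1) and (5.2.2)."""
    n = len(xs)
    d = alpha * theta**2 + theta + 2
    d_theta = 3 * n / theta - n * (2 * alpha * theta + 1) / d - sum(xs)
    d_alpha = -n * theta**2 / d + sum(1 / (alpha + x + x * x) for x in xs)
    return d_theta, d_alpha

def numeric_score(theta, alpha, xs, h=1e-6):
    """Central finite-difference approximation of the two partial derivatives."""
    d_theta = (log_likelihood(theta + h, alpha, xs)
               - log_likelihood(theta - h, alpha, xs)) / (2 * h)
    d_alpha = (log_likelihood(theta, alpha + h, xs)
               - log_likelihood(theta, alpha - h, xs)) / (2 * h)
    return d_theta, d_alpha

print(score(1.0, 0.5, DATA))
print(numeric_score(1.0, 0.5, DATA))
```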

In this section, an application of TPSD using maximum likelihood estimates has been discussed with a real lifetime data set. The data set regarding vinyl chloride obtained from clean upgradient monitoring wells in mg/l, available in Bhaumik et al.,¹⁰ has been considered. The data set is

5.1	1.2	1.3	0.6	0.5	2.4	0.5	1.1	8	0.8	0.4	0.6	0.9
0.4	2	0.5	5.3	3.2	2.7	2.9	2.5	2.3	1	0.2	0.1	0.1
1.8	0.9	2.4	6.8	1.2	0.4	0.2

In order to compare lifetime distributions, values of $- 2 \ln L$ , AIC (Akaike Information Criterion), BIC (Bayesian Information Criterion) and the K–S statistic (Kolmogorov–Smirnov statistic) for the above data set have been computed. The formulae for computing AIC, BIC and the K–S statistic are as follows:

$A I C = - 2 \ln L + 2 k$ , $B I C = - 2 \ln L + k \ln n$ and $D = \underset{x}{Sup} | F_{n} (x) - F_{0} (x) |$ , where $k$ = the number of parameters, $n$ = the sample size, $F_{n} (x)$ = the empirical distribution function and $F_{0} (x)$ = the cdf of the fitted distribution. The best distribution is the one which corresponds to the lowest values of $- 2 \ln L$ , AIC, BIC and K–S statistic and the highest p–value. The MLE $(\hat{θ}, \hat{α})$ along with their standard errors, $- 2 \ln L$ , AIC, BIC, K–S statistic and p–value of the fitted distributions are presented in Table 5.
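The three criteria are straightforward to compute once $- 2 \ln L$ and a fitted cdf are available. The Python sketch below is a minimal illustration; the sample and the exponential cdf used in the demonstration are hypothetical and serve only to show the calling convention, and the p–value of the K–S test is not computed.

```python
import math

def aic(neg2_loglik, k):
    """AIC = -2 ln L + 2k."""
    return neg2_loglik + 2 * k

def bic(neg2_loglik, k, n):
    """BIC = -2 ln L + k ln n."""
    return neg2_loglik + k * math.log(n)

def ks_statistic(xs, cdf):
    """D = sup_x |F_n(x) - F_0(x)| for a continuous fitted cdf F_0.

    The supremum is attained at a data point, just before or just after
    the jump of the empirical distribution function."""
    ordered = sorted(xs)
    n = len(ordered)
    d = 0.0
    for i, x in enumerate(ordered, start=1):
        f0 = cdf(x)
        d = max(d, i / n - f0, f0 - (i - 1) / n)
    return d

# Hypothetical demonstration: a small sample against an exponential cdf with rate 0.5
sample = [0.4, 1.1, 2.3, 0.8, 5.3, 0.2, 3.2]
print(ks_statistic(sample, lambda x: 1 - math.exp(-0.5 * x)))
print(aic(110.0, 2), bic(110.0, 2, len(sample)))
```

For a fitted TPSD, the second argument of `ks_statistic` would be the cdf (2.3) evaluated at the estimates $(\hat{θ}, \hat{α})$ .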

It is obvious that TPSD gives a much closer fit than the Sujatha and Lindley distributions. Therefore, TPSD can be considered an important two–parameter lifetime distribution. In order to see the closeness of the fit given by the Lindley, Sujatha and TPSD distributions, the fitted pdf plots of these distributions for the given data set are shown in Figure 6. It is also obvious from the fitted plots, along with the histogram of the original data set, that TPSD gives a much closer fit than the Lindley and Sujatha distributions.

Figure 6 Fitted pdf plots of distributions for the given dataset.

A two–parameter Sujatha distribution (TPSD) has been introduced which includes the size–biased Lindley distribution and the Sujatha distribution proposed by Shanker³ as particular cases. Moments about the origin and moments about the mean have been obtained, and the nature of the coefficient of variation, coefficient of skewness, coefficient of kurtosis and index of dispersion of TPSD has been studied for varying values of the parameters. The nature of the probability density function, cumulative distribution function, hazard rate function and mean residual life function has been discussed for varying values of the parameters. The stochastic ordering, mean deviations, Bonferroni and Lorenz curves, and stress–strength reliability have also been discussed. The method of moments and the method of maximum likelihood have been discussed for estimating the parameters. A numerical example with real lifetime data has been presented to show the application of TPSD, in which TPSD gives a much closer fit than the Sujatha and Lindley distributions.

The authors are grateful to the editor–in–chief of the journal and the anonymous reviewer for constructive comments on the paper.

The authors declare that there is no conflict of interest.

1. Lindley DV. Fiducial distributions and Bayes’ theorem. Journal of the Royal Statistical Society, Series B. 1958;20(1):102–107.
2. Shanker R, Hagos F, Sujatha S. On modeling of lifetimes data using exponential and Lindley distribution. Biometrics & Biostatistics International Journal. 2015;2(5):1–5.
3. Shanker R. Sujatha distribution and its applications. Statistics in Transition New Series. 2016a;17(3):1–20.
4. Shanker R. The discrete Poisson–Sujatha distribution. International Journal of Probability and Statistics. 2016b;5(1):1–9.
5. Shanker R, Hagos F. Zero–truncated Poisson–Sujatha distribution with applications. Journal of Ethiopian Statistical Association. 2015;(24):55–63.
6. Shanker R, Hagos F. Size–biased Poisson–Sujatha distribution with applications. American Journal of Mathematics and Statistics. 2016;6(4):145–154.
7. Ghitany ME, Atieh B, Nadarajah S. Lindley distribution and its application. Mathematics and Computers in Simulation. 2008;78(4):493–506.
8. Shaked M, Shanthikumar JG. Stochastic Orders and Their Applications. New York: Academic Press; 1994.
9. Bonferroni CE. Elementi di Statistica Generale. Firenze: Libreria Seeber; 1930.
10. Bhaumik DK, Kapur K, Gibbons RD. Testing parameters of a gamma distribution for small samples. Technometrics. 2009;51(3):326–334.

eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Mussie Tesfay, Rama Shanker
