A quasi Shanker distribution and its applications

doi:10.15406/bbij.2017.06.00156

eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Research Article Volume 6 Issue 1

Rama Shanker,

Kamlesh Kumar Shukla

Department of Statistics, Eritrea Institute of Technology, Eritrea

Correspondence: Rama Shanker, Department of Statistics, Eritrea Institute of Technology, Asmara, Eritrea

Received: June 03, 2017 | Published: June 13, 2017

Citation: Shanker R. A quasi Shanker distribution and its applications. Biom Biostat Int J. 2017;6(1):267-276. DOI: 10.15406/bbij.2017.06.00156

Abstract

This paper proposes a two-parameter quasi Shanker distribution (QSD), which includes the one-parameter Shanker distribution introduced by Shanker¹ as a special case. Its statistical and mathematical properties, including moments and moment-based measures, hazard rate function, mean residual life function, stochastic ordering, mean deviations, Bonferroni and Lorenz curves and stress-strength reliability, are discussed. Maximum likelihood estimation of the parameters of the QSD is described. Finally, the goodness of fit of the QSD is examined on two real lifetime data sets, where it fits better than the one-parameter exponential, Lindley and Shanker distributions.

Keywords: Shanker distribution, moments, hazard rate function, mean residual life function, stochastic ordering, mean deviations, stress-strength reliability, estimation of parameters, goodness of fit

Introduction

Shanker¹ introduced a one-parameter lifetime distribution for modeling lifetime data from biomedical science and engineering, with probability density function (pdf) and cumulative distribution function (cdf) given by

$f_{1} (x; θ) = \frac{θ^{2}}{θ^{2} + 1} (θ + x) e^{- θ x}; x > 0, θ > 0$ (1.1)
$F_{1} (x; θ) = 1 - [1 + \frac{θ x}{θ^{2} + 1}] e^{- θ x}; x > 0, θ > 0$ (1.2)

Shanker¹ has shown that it gives a better fit than both the one-parameter exponential and Lindley² distributions. This distribution is a mixture of exponential $(θ)$ and gamma $(2, θ)$ distributions with mixing proportions $\frac{θ^{2}}{θ^{2} + 1}$ and $\frac{1}{θ^{2} + 1}$, respectively.
The first four moments about the origin of the Shanker distribution obtained by Shanker¹ are

$μ_{1}^{'} = \frac{θ^{2} + 2}{θ (θ^{2} + 1)}$ , $μ_{2}^{'} = \frac{2 (θ^{2} + 3)}{θ^{2} (θ^{2} + 1)}$ , $μ_{3}^{'} = \frac{6 (θ^{2} + 4)}{θ^{3} (θ^{2} + 1)}$ , $μ_{4}^{'} = \frac{24 (θ^{2} + 5)}{θ^{4} (θ^{2} + 1)}$

The central moments of Shanker distribution obtained by Shanker¹ are

$μ_{2} = \frac{θ^{4} + 4 θ^{2} + 2}{θ^{2} {(θ^{2} + 1)}^{2}}$
$μ_{3} = \frac{2 (θ^{6} + 6 θ^{4} + 6 θ^{2} + 2)}{θ^{3} {(θ^{2} + 1)}^{3}}$
$μ_{4} = \frac{3 (3 θ^{8} + 24 θ^{6} + 44 θ^{4} + 32 θ^{2} + 8)}{θ^{4} {(θ^{2} + 1)}^{4}}$

Shanker¹ studied its important properties including the coefficient of variation, skewness, kurtosis, index of dispersion, hazard rate function, mean residual life function, stochastic ordering, mean deviations, order statistics, Bonferroni and Lorenz curves, Renyi entropy measure, and stress-strength reliability. The discrete Poisson-Shanker distribution, a Poisson mixture of the Shanker distribution, has also been studied by Shanker.³

Recall that the Lindley distribution, introduced by Lindley² in the context of Bayesian analysis as a counterexample to fiducial statistics, is defined by its pdf and cdf

$f_{2} (x; θ) = \frac{θ^{2}}{θ + 1} (1 + x) e^{- θ x}; x > 0, θ > 0$ (1.3)
$F_{2} (x; θ) = 1 - [1 + \frac{θ x}{θ + 1}] e^{- θ x}; x > 0, θ > 0$ (1.4)

In this paper, a two-parameter quasi Shanker distribution (QSD), of which the one-parameter Shanker distribution introduced by Shanker¹ is a particular case, is proposed. Its raw moments and central moments are obtained, and the coefficients of variation, skewness, kurtosis and index of dispersion are discussed. Some of its important mathematical and statistical properties, including hazard rate function, mean residual life function, stochastic ordering, mean deviations, Bonferroni and Lorenz curves and stress-strength reliability, are also discussed. The parameters are estimated by maximum likelihood. The goodness of fit of QSD is illustrated with two real lifetime data sets and compared with that of the one-parameter exponential, Lindley and Shanker distributions.

A quasi Shanker distribution

A two-parameter quasi Shanker distribution (QSD) with parameters $θ$ and $α$ is defined by its pdf

$f (x; θ, α) = \frac{θ^{3}}{θ^{3} + θ + 2 α} (θ + x + α x^{2}) e^{- θ x}; x > 0, θ > 0, θ^{3} + θ + 2 α > 0.$ (2.1)
Setting $α = 0$ in (2.1) gives the Shanker distribution (1.1). The QSD is also a three-component mixture of exponential $(θ)$, gamma $(2, θ)$ and gamma $(3, θ)$ distributions:

$f (x; θ, α) = p_{1} f_{1} (x; θ) + p_{2} f_{2} (x; 2, θ) + (1 - p_{1} - p_{2}) f_{3} (x; 3, θ)$ (2.2)

where

$p_{1} = \frac{θ^{3}}{θ^{3} + θ + 2 α}, p_{2} = \frac{θ}{θ^{3} + θ + 2 α}$ ,
$f_{1} (x; θ) = θ e^{- θ x}; x > 0, θ > 0$
$f_{2} (x; 2, θ) = \frac{θ^{2}}{Γ (2)} e^{- θ x} x^{2 - 1}; x > 0, θ > 0$
$f_{3} (x; 3, θ) = \frac{θ^{3}}{Γ (3)} e^{- θ x} x^{3 - 1}; x > 0, θ > 0$
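
The mixture representation (2.2) can be checked numerically. The following is a minimal sketch, assuming illustrative parameter values; the function names `qsd_pdf`, `gamma_pdf` and `qsd_mixture_pdf` are hypothetical helpers introduced here, not part of the original paper.

```python
import math

def qsd_pdf(x, theta, alpha):
    """QSD density, equation (2.1)."""
    norm = theta**3 + theta + 2 * alpha
    return theta**3 / norm * (theta + x + alpha * x**2) * math.exp(-theta * x)

def gamma_pdf(x, shape, rate):
    """Gamma density with shape 1, 2 or 3 and rate theta, as in f_1, f_2, f_3."""
    return rate**shape / math.gamma(shape) * x**(shape - 1) * math.exp(-rate * x)

def qsd_mixture_pdf(x, theta, alpha):
    """Three-component mixture form, equation (2.2)."""
    norm = theta**3 + theta + 2 * alpha
    p1 = theta**3 / norm
    p2 = theta / norm
    p3 = 1.0 - p1 - p2  # algebraically equal to 2 * alpha / norm
    return (p1 * gamma_pdf(x, 1, theta)
            + p2 * gamma_pdf(x, 2, theta)
            + p3 * gamma_pdf(x, 3, theta))

theta, alpha = 1.5, 0.8  # illustrative values only (assumption)
for x in (0.1, 1.0, 2.5, 7.0):
    print(x, qsd_pdf(x, theta, alpha), qsd_mixture_pdf(x, theta, alpha))
```

The two columns printed for each $x$ should agree up to floating-point rounding, and setting `alpha = 0.0` recovers the Shanker density (1.1).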

The corresponding cdf of QSD (2.1) can be obtained as

$F (x; θ, α) = 1 - [1 + \frac{α θ^{2} x^{2} + θ x (θ + 2 α)}{θ^{3} + θ + 2 α}] e^{- θ x}; x > 0, θ > 0$ (2.3)
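
As a sanity check on (2.3), the closed-form cdf can be compared with a numerical integral of the pdf (2.1). This is a sketch under stated assumptions: the parameter values are illustrative, and `simpson` is a hypothetical hand-written composite Simpson rule rather than a library routine.

```python
import math

def qsd_pdf(x, theta, alpha):
    """QSD density, equation (2.1)."""
    norm = theta**3 + theta + 2 * alpha
    return theta**3 / norm * (theta + x + alpha * x**2) * math.exp(-theta * x)

def qsd_cdf(x, theta, alpha):
    """QSD distribution function, equation (2.3)."""
    norm = theta**3 + theta + 2 * alpha
    poly = alpha * theta**2 * x**2 + theta * x * (theta + 2 * alpha)
    return 1.0 - (1.0 + poly / norm) * math.exp(-theta * x)

def simpson(func, a, b, n=2000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3

theta, alpha = 0.9, 1.4  # illustrative values, not estimates from the paper
for x in (0.5, 2.0, 6.0):
    numeric = simpson(lambda t: qsd_pdf(t, theta, alpha), 0.0, x)
    print(x, qsd_cdf(x, theta, alpha), numeric)
```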

The nature and behavior of the pdf and the cdf of QSD for varying values of the parameters $θ$ and $α$ are shown graphically in Figures 1 & 2, respectively.

Figure 1 Graphs of the pdf of QSD for varying values of parameters $θ$ and $α$

Figure 2 Graphs of the cdf of QSD for varying values of parameters $θ$ and $α$ .

Statistical constants

The $r$ th moment about origin of QSD can be obtained as

$μ_{r}^{'} = \frac{r! [θ^{3} + (r + 1) θ + (r + 1) (r + 2) α]}{θ^{r} (θ^{3} + θ + 2 α)}; r = 1, 2, 3, ..$ (3.1)

Thus, the first four moments about origin of QSD are given by

$μ_{1}^{'} = \frac{θ^{3} + 2 θ + 6 α}{θ (θ^{3} + θ + 2 α)}$ , $μ_{2}^{'} = \frac{2 (θ^{3} + 3 θ + 12 α)}{θ^{2} (θ^{3} + θ + 2 α)}$
$μ_{3}^{'} = \frac{6 (θ^{3} + 4 θ + 20 α)}{θ^{3} (θ^{3} + θ + 2 α)}$ , $μ_{4}^{'} = \frac{24 (θ^{3} + 5 θ + 30 α)}{θ^{4} (θ^{3} + θ + 2 α)}$
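
The general formula (3.1) is easy to evaluate directly. The sketch below is illustrative only: `qsd_raw_moment` is a hypothetical helper name and the parameter values are assumed for the example.

```python
import math

def qsd_raw_moment(r, theta, alpha):
    """r-th moment about the origin of QSD, equation (3.1)."""
    norm = theta**3 + theta + 2 * alpha
    bracket = theta**3 + (r + 1) * theta + (r + 1) * (r + 2) * alpha
    return math.factorial(r) * bracket / (theta**r * norm)

theta, alpha = 2.0, 0.5  # illustrative values only (assumption)
for r in (1, 2, 3, 4):
    print(r, qsd_raw_moment(r, theta, alpha))
```

For these values the first moment is $(8 + 4 + 3) / (2 \cdot 11) = 15/22$, which agrees with the explicit expression for $μ_{1}^{'}$.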

Using relationship between central moments and moments about origin, the central moments of QSD (2.1) are thus obtained as

$μ_{2} = \frac{θ^{6} + 4 θ^{4} + 16 θ^{3} α + 2 θ^{2} + 12 θ α + 12 α^{2}}{θ^{2} {(θ^{3} + θ + 2 α)}^{2}}$
$μ_{3} = \frac{2 \{ θ^{9} + 6 θ^{7} + 30 θ^{6} α + 6 θ^{5} + 42 θ^{4} α + (36 α^{2} + 2) θ^{3} + 18 θ^{2} α + 36 θ α^{2} + 24 α^{3} \}}{θ^{3} {(θ^{3} + θ + 2 α)}^{3}}$
$μ_{4} = \frac{3 \left\{ \begin{array}{l} 3 θ^{12} + 24 θ^{10} + 128 θ^{9} α + 44 θ^{8} + 344 θ^{7} α + (408 α^{2} + 32) θ^{6} + 320 θ^{5} α \\ + (768 α^{2} + 8) θ^{4} + (576 α^{3} + 96 α) θ^{3} + 336 θ^{2} α^{2} + 480 θ α^{3} + 240 α^{4} \end{array} \right\}}{θ^{4} {(θ^{3} + θ + 2 α)}^{4}}$
The coefficient of variation $(C . V)$ , coefficient of skewness $(\sqrt{β_{1}})$ , coefficient of kurtosis $(β_{2})$ and index of dispersion $(γ)$ of QSD are obtained as

$C . V = \frac{σ}{μ_{1}^{'}} = \frac{\sqrt{θ^{6} + 4 θ^{4} + 16 θ^{3} α + 2 θ^{2} + 12 θ α + 12 α^{2}}}{θ^{3} + 2 θ + 6 α}$

$\sqrt{β_{1}} = \frac{μ_{3}}{μ_{2}^{3 / 2}} = \frac{2 \{ θ^{9} + 6 θ^{7} + 30 θ^{6} α + 6 θ^{5} + 42 θ^{4} α + (36 α^{2} + 2) θ^{3} + 18 θ^{2} α + 36 θ α^{2} + 24 α^{3} \}}{{(θ^{6} + 4 θ^{4} + 16 θ^{3} α + 2 θ^{2} + 12 θ α + 12 α^{2})}^{3 / 2}}$

$β_{2} = \frac{μ_{4}}{μ_{2}^{2}} = \frac{3 \left\{ \begin{array}{l} 3 θ^{12} + 24 θ^{10} + 128 θ^{9} α + 44 θ^{8} + 344 θ^{7} α + (408 α^{2} + 32) θ^{6} + 320 θ^{5} α \\ + (768 α^{2} + 8) θ^{4} + (576 α^{3} + 96 α) θ^{3} + 336 θ^{2} α^{2} + 480 θ α^{3} + 240 α^{4} \end{array} \right\}}{{(θ^{6} + 4 θ^{4} + 16 θ^{3} α + 2 θ^{2} + 12 θ α + 12 α^{2})}^{2}}$

$γ = \frac{σ^{2}}{μ_{1}^{'}} = \frac{θ^{6} + 4 θ^{4} + 16 θ^{3} α + 2 θ^{2} + 12 θ α + 12 α^{2}}{θ (θ^{3} + θ + 2 α) (θ^{3} + 2 θ + 6 α)}$
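
Rather than coding the long polynomial expressions, the four measures can be computed from the raw moments (3.1) using the standard relations between central and raw moments. This is a minimal sketch; the helper names are hypothetical and the parameter values are assumed for illustration.

```python
import math

def qsd_raw_moment(r, theta, alpha):
    """r-th moment about the origin of QSD, equation (3.1)."""
    norm = theta**3 + theta + 2 * alpha
    bracket = theta**3 + (r + 1) * theta + (r + 1) * (r + 2) * alpha
    return math.factorial(r) * bracket / (theta**r * norm)

def qsd_shape_measures(theta, alpha):
    """Return (C.V, sqrt(beta_1), beta_2, gamma) computed from raw moments."""
    m1, m2, m3, m4 = (qsd_raw_moment(r, theta, alpha) for r in (1, 2, 3, 4))
    mu2 = m2 - m1**2
    mu3 = m3 - 3 * m2 * m1 + 2 * m1**3
    mu4 = m4 - 4 * m3 * m1 + 6 * m2 * m1**2 - 3 * m1**4
    cv = math.sqrt(mu2) / m1
    skewness = mu3 / mu2**1.5
    kurtosis = mu4 / mu2**2
    dispersion = mu2 / m1
    return cv, skewness, kurtosis, dispersion

print(qsd_shape_measures(1.5, 0.8))  # illustrative parameter values (assumption)
```

Computing the measures this way avoids transcription errors in the long numerators, at the cost of some floating-point cancellation when the variance is small relative to the squared mean.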

Graphs of C.V, $\sqrt{β_{1}}$ , $β_{2}$ and $γ$ of QSD for varying values of the parameters $θ$ and $α$ have been presented in Figure 3.

Figure 3 Graphs of C.V, $\sqrt{β_{1}}$ , $β_{2}$ and $γ$ of QSD for varying values of the parameters $θ$ and $α$ .

Hazard rate function and mean residual life function

Let $X$ be a continuous random variable with pdf $f (x)$ and cdf $F (x)$ . The hazard rate function (also known as the failure rate function) and the mean residual life function of $X$ are respectively defined as

$h (x) = \lim_{Δ x \to 0} \frac{P (X < x + Δ x | X > x)}{Δ x} = \frac{f (x)}{1 - F (x)}$ (4.1)
and $m (x) = E [X - x | X > x] = \frac{1}{1 - F (x)} \int_{x}^{\infty} [1 - F (t)] d t$ (4.2)
The corresponding hazard rate function $h (x)$ , and the mean residual life function $m (x)$ of QSD are thus obtained as

$h (x) = \frac{θ^{3} (θ + x + α x^{2})}{α θ^{2} x^{2} + θ (θ + 2 α) x + (θ^{3} + θ + 2 α)}$ (4.3)
and $m (x) = \frac{1}{[α θ^{2} x^{2} + θ (θ + 2 α) x + (θ^{3} + θ + 2 α)] e^{- θ x}} \int_{x}^{\infty} [α θ^{2} t^{2} + θ (θ + 2 α) t + (θ^{3} + θ + 2 α)] e^{- θ t} d t$

$= \frac{α θ^{2} x^{2} + θ (θ + 4 α) x + (θ^{3} + 2 θ + 6 α)}{θ [α θ^{2} x^{2} + θ (θ + 2 α) x + (θ^{3} + θ + 2 α)]}$ (4.4)
It can be easily verified that $h (0) = \frac{θ^{4}}{θ^{3} + θ + 2 α} = f (0)$ and $m (0) = \frac{θ^{3} + 2 θ + 6 α}{θ (θ^{3} + θ + 2 α)} = μ_{1}^{'}$
The nature and behavior of $h (x)$ and $m (x)$ of QSD for varying values of the parameters $θ$ and $α$ are shown graphically in Figures 4 & 5. The graphs indicate that $h (x)$ of QSD is monotonically increasing whereas $m (x)$ is monotonically decreasing.
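
The expressions (4.3) and (4.4) can be evaluated on a grid to inspect their shape. The sketch below is illustrative: the function names are hypothetical, and the monotonicity check applies only to the single assumed parameter pair, not to the whole parameter space.

```python
import math

def qsd_hazard(x, theta, alpha):
    """Hazard rate function of QSD, equation (4.3)."""
    num = theta**3 * (theta + x + alpha * x**2)
    den = (alpha * theta**2 * x**2 + theta * (theta + 2 * alpha) * x
           + (theta**3 + theta + 2 * alpha))
    return num / den

def qsd_mrl(x, theta, alpha):
    """Mean residual life function of QSD, equation (4.4)."""
    num = (alpha * theta**2 * x**2 + theta * (theta + 4 * alpha) * x
           + (theta**3 + 2 * theta + 6 * alpha))
    den = (alpha * theta**2 * x**2 + theta * (theta + 2 * alpha) * x
           + (theta**3 + theta + 2 * alpha))
    return num / (theta * den)

theta, alpha = 1.0, 0.5  # illustrative values only (assumption)
grid = [0.1 * k for k in range(200)]
hazard = [qsd_hazard(x, theta, alpha) for x in grid]
mrl = [qsd_mrl(x, theta, alpha) for x in grid]
print("h increasing on grid:", all(b > a for a, b in zip(hazard, hazard[1:])))
print("m decreasing on grid:", all(b < a for a, b in zip(mrl, mrl[1:])))
```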

Figure 4 Graphs of $h (x)$ of QSD for varying values of parameters $θ$ and $α$ .

Figure 5 Graphs of $m (x)$ of QSD for varying values of parameters $θ$ and $α$ .

Stochastic orderings

Stochastic ordering of positive continuous random variables is an important tool for judging their comparative behavior. A random variable $X$ is said to be smaller than a random variable $Y$ in the

stochastic order $(X \leq_{s t} Y)$ if $F_{X} (x) \geq F_{Y} (x)$ for all $x$
hazard rate order $(X \leq_{h r} Y)$ if $h_{X} (x) \geq h_{Y} (x)$ for all $x$
mean residual life order $(X \leq_{m r l} Y)$ if $m_{X} (x) \leq m_{Y} (x)$ for all $x$
likelihood ratio order $(X \leq_{l r} Y)$ if $\frac{f_{X} (x)}{f_{Y} (x)}$ decreases in $x$ .

The following results due to Shaked and Shanthikumar⁴ are well known for establishing stochastic ordering of distributions

$\begin{array}{c} X \leq_{l r} Y \Rightarrow X \leq_{h r} Y \Rightarrow X \leq_{m r l} Y \\ \Downarrow \\ X \leq_{s t} Y \end{array}$

The QSD is ordered with respect to the strongest ‘likelihood ratio ordering’ as shown in the following theorem:

Theorem: Let $X \sim$ QSD $(θ_{1}, α_{1})$ and $Y \sim$ QSD $(θ_{2}, α_{2})$ . If $α_{1} = α_{2}$ and $θ_{1} > θ_{2}$ (or $θ_{1} = θ_{2}$ and $α_{1} < α_{2}$ ), then $X \leq_{l r} Y$ and hence $X \leq_{h r} Y$ , $X \leq_{m r l} Y$ and $X \leq_{s t} Y$ .
Proof: We have

$\frac{f_{X} (x; θ_{1}, α_{1})}{f_{Y} (x; θ_{2}, α_{2})} = \frac{θ_{1}^{3} (θ_{2}^{3} + θ_{2} + 2 α_{2})}{θ_{2}^{3} (θ_{1}^{3} + θ_{1} + 2 α_{1})} (\frac{θ_{1} + x + α_{1} x^{2}}{θ_{2} + x + α_{2} x^{2}}) e^{- (θ_{1} - θ_{2}) x}; x > 0$

Now

$\ln \frac{f_{X} (x; θ_{1}, α_{1})}{f_{Y} (x; θ_{2}, α_{2})} = \ln [\frac{θ_{1}^{3} (θ_{2}^{3} + θ_{2} + 2 α_{2})}{θ_{2}^{3} (θ_{1}^{3} + θ_{1} + 2 α_{1})}] + \ln (\frac{θ_{1} + x + α_{1} x^{2}}{θ_{2} + x + α_{2} x^{2}}) - (θ_{1} - θ_{2}) x$

This gives

$\frac{d}{d x} \{ \ln \frac{f_{X} (x; θ_{1}, α_{1})}{f_{Y} (x; θ_{2}, α_{2})} \} = \frac{(θ_{2} - θ_{1}) + 2 (α_{1} θ_{2} - α_{2} θ_{1}) x + (α_{1} - α_{2}) x^{2}}{(θ_{1} + x + α_{1} x^{2}) (θ_{2} + x + α_{2} x^{2})} - (θ_{1} - θ_{2})$

Thus if $α_{1} = α_{2}$ and $θ_{1} > θ_{2}$ , or $θ_{1} = θ_{2}$ and $α_{1} < α_{2}$ , then $\frac{d}{d x} \ln \frac{f_{X} (x; θ_{1}, α_{1})}{f_{Y} (x; θ_{2}, α_{2})} < 0$ . This means that $X \leq_{l r} Y$ and hence $X \leq_{h r} Y$ , $X \leq_{m r l} Y$ and $X \leq_{s t} Y$ .
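
The likelihood ratio ordering can be illustrated numerically by checking that the log density ratio decreases on a grid. This is only a sketch for two assumed parameter configurations (with non-negative $α$), not a proof; the helper names are hypothetical.

```python
import math

def qsd_pdf(x, theta, alpha):
    """QSD density, equation (2.1)."""
    norm = theta**3 + theta + 2 * alpha
    return theta**3 / norm * (theta + x + alpha * x**2) * math.exp(-theta * x)

def log_density_ratio(x, theta1, alpha1, theta2, alpha2):
    """ln of f_X(x; theta1, alpha1) / f_Y(x; theta2, alpha2)."""
    return math.log(qsd_pdf(x, theta1, alpha1)) - math.log(qsd_pdf(x, theta2, alpha2))

def is_decreasing(values):
    """True when every value is strictly smaller than the one before it."""
    return all(b < a for a, b in zip(values, values[1:]))

grid = [0.05 * k for k in range(1, 201)]

# Case 1: equal alpha, theta1 > theta2 (illustrative values).
case1 = [log_density_ratio(x, 2.0, 0.5, 1.0, 0.5) for x in grid]
# Case 2: equal theta, alpha1 < alpha2 (illustrative values).
case2 = [log_density_ratio(x, 1.5, 0.2, 1.5, 1.0) for x in grid]

print(is_decreasing(case1), is_decreasing(case2))
```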

Mean deviations from the mean and the median

The amount of scatter in a population is measured to some extent by the totality of deviations, usually from the mean or the median. These are known as the mean deviation about the mean and the mean deviation about the median, defined by
$δ_{1} (X) = \int_{0}^{\infty} | x - μ | f (x) d x$ and $δ_{2} (X) = \int_{0}^{\infty} | x - M | f (x) d x$ , respectively, where $μ = E (X)$ and $M = Median (X)$ . The measures $δ_{1} (X)$ and $δ_{2} (X)$ can be calculated using the following simplified relationships

$δ_{1} (X) = \int_{0}^{μ} (μ - x) f (x) d x + \int_{μ}^{\infty} (x - μ) f (x) d x$
$= μ F (μ) - \int_{0}^{μ} x f (x) d x - μ [1 - F (μ)] + \int_{μ}^{\infty} x f (x) d x$
$= 2 μ F (μ) - 2 μ + 2 \int_{μ}^{\infty} x f (x) d x$
$= 2 μ F (μ) - 2 \int_{0}^{μ} x f (x) d x$ (6.1)

and

$δ_{2} (X) = \int_{0}^{M} (M - x) f (x) d x + \int_{M}^{\infty} (x - M) f (x) d x$
$= M F (M) - \int_{0}^{M} x f (x) d x - M [1 - F (M)] + \int_{M}^{\infty} x f (x) d x$
$= - μ + 2 \int_{M}^{\infty} x f (x) d x$
$= μ - 2 \int_{0}^{M} x f (x) d x$ (6.2)
Using the pdf (2.1) and the expression for the mean of QSD, we get

$\int_{0}^{μ} x f (x) d x = μ - \frac{\{ α θ^{3} μ^{3} + θ^{2} (θ + 3 α) μ^{2} + θ (θ^{3} + 2 θ + 6 α) μ + (θ^{3} + 2 θ + 6 α) \} e^{- θ μ}}{θ (θ^{3} + θ + 2 α)}$ (6.3)
$\int_{0}^{M} x f (x) d x = μ - \frac{\{ α θ^{3} M^{3} + θ^{2} (θ + 3 α) M^{2} + θ (θ^{3} + 2 θ + 6 α) M + (θ^{3} + 2 θ + 6 α) \} e^{- θ M}}{θ (θ^{3} + θ + 2 α)}$ (6.4)
Using expressions from (6.1), (6.2), (6.3), and (6.4), the mean deviation about mean, $δ_{1} (X)$ and the mean deviation about median, $δ_{2} (X)$ of QSD are finally obtained as

$δ_{1} (X) = \frac{2 \{ α θ^{2} μ^{2} + θ (θ + 4 α) μ + (θ^{3} + 2 θ + 6 α) \} e^{- θ μ}}{θ (θ^{3} + θ + 2 α)}$ (6.5)
$δ_{2} (X) = \frac{2 \{ α θ^{3} M^{3} + θ^{2} (θ + 3 α) M^{2} + θ (θ^{3} + 2 θ + 6 α) M + (θ^{3} + 2 θ + 6 α) \} e^{- θ M}}{θ (θ^{3} + θ + 2 α)} - μ$ (6.6)
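
The closed form (6.5) can be compared with a direct numerical evaluation of $\int_{0}^{\infty} | x - μ | f (x) d x$. The following is a sketch under stated assumptions: the parameter values are illustrative, the integral is truncated at a finite upper limit, and `simpson` is a hypothetical hand-written quadrature helper.

```python
import math

def qsd_pdf(x, theta, alpha):
    """QSD density, equation (2.1)."""
    norm = theta**3 + theta + 2 * alpha
    return theta**3 / norm * (theta + x + alpha * x**2) * math.exp(-theta * x)

def qsd_mean(theta, alpha):
    """Mean of QSD (first raw moment)."""
    return (theta**3 + 2 * theta + 6 * alpha) / (theta * (theta**3 + theta + 2 * alpha))

def mean_deviation_closed_form(theta, alpha):
    """Mean deviation about the mean, equation (6.5)."""
    mu = qsd_mean(theta, alpha)
    bracket = (alpha * theta**2 * mu**2 + theta * (theta + 4 * alpha) * mu
               + (theta**3 + 2 * theta + 6 * alpha))
    return 2 * bracket * math.exp(-theta * mu) / (theta * (theta**3 + theta + 2 * alpha))

def simpson(func, a, b, n=20000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3

def mean_deviation_numeric(theta, alpha):
    """Integrate |x - mu| f(x), split at mu so each piece is smooth."""
    mu = qsd_mean(theta, alpha)
    upper = mu + 60.0 / theta  # truncation point; the tail beyond it is negligible
    left = simpson(lambda x: (mu - x) * qsd_pdf(x, theta, alpha), 0.0, mu)
    right = simpson(lambda x: (x - mu) * qsd_pdf(x, theta, alpha), mu, upper)
    return left + right

theta, alpha = 1.2, 0.7  # illustrative values only (assumption)
print(mean_deviation_closed_form(theta, alpha), mean_deviation_numeric(theta, alpha))
```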

Bonferroni and Lorenz curves

The Bonferroni and Lorenz curves⁵ and Bonferroni and Gini indices have applications not only in economics to study income and poverty, but also in other fields like reliability, demography, insurance and medicine. The Bonferroni and Lorenz curves are defined as

$B (p) = \frac{1}{p μ} \int_{0}^{q} x f (x) d x = \frac{1}{p μ} [\int_{0}^{\infty} x f (x) d x - \int_{q}^{\infty} x f (x) d x] = \frac{1}{p μ} [μ - \int_{q}^{\infty} x f (x) d x]$ (7.1)
and $L (p) = \frac{1}{μ} \int_{0}^{q} x f (x) d x = \frac{1}{μ} [\int_{0}^{\infty} x f (x) d x - \int_{q}^{\infty} x f (x) d x] = \frac{1}{μ} [μ - \int_{q}^{\infty} x f (x) d x]$ (7.2)

respectively, or equivalently

$B (p) = \frac{1}{p μ} \int_{0}^{p} F^{- 1} (x) d x$ (7.3)
and $L (p) = \frac{1}{μ} \int_{0}^{p} F^{- 1} (x) d x$ (7.4)

respectively, where $μ = E (X)$ and $q = F^{- 1} (p)$ .
The Bonferroni and Gini indices are thus defined as

$B = 1 - \int_{0}^{1} B (p) d p$ (7.5)
and $G = 1 - 2 \int_{0}^{1} L (p) d p$ (7.6) respectively.

Using the pdf of QSD (2.1), we get

$\int_{q}^{\infty} x f (x) d x = \frac{\{ α θ^{3} q^{3} + θ^{2} (θ + 3 α) q^{2} + θ (θ^{3} + 2 θ + 6 α) q + (θ^{3} + 2 θ + 6 α) \} e^{- θ q}}{θ (θ^{3} + θ + 2 α)}$ (7.7)
Now using equation (7.7) in (7.1) and (7.2), we get
$B (p) = \frac{1}{p} [1 - \frac{\{ α θ^{3} q^{3} + θ^{2} (θ + 3 α) q^{2} + θ (θ^{3} + 2 θ + 6 α) q + (θ^{3} + 2 θ + 6 α) \} e^{- θ q}}{θ^{3} + 2 θ + 6 α}]$ (7.8)

and

$L (p) = 1 - \frac{\{ α θ^{3} q^{3} + θ^{2} (θ + 3 α) q^{2} + θ (θ^{3} + 2 θ + 6 α) q + (θ^{3} + 2 θ + 6 α) \} e^{- θ q}}{θ^{3} + 2 θ + 6 α}$ (7.9)
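
Evaluating (7.8) and (7.9) requires the quantile $q = F^{- 1} (p)$, which has no closed form here. The sketch below obtains it by bisection on the cdf (2.3); this choice of root finder, the function names and the parameter values are assumptions made for illustration.

```python
import math

def qsd_cdf(x, theta, alpha):
    """QSD distribution function, equation (2.3)."""
    norm = theta**3 + theta + 2 * alpha
    poly = alpha * theta**2 * x**2 + theta * x * (theta + 2 * alpha)
    return 1.0 - (1.0 + poly / norm) * math.exp(-theta * x)

def qsd_quantile(p, theta, alpha):
    """Solve F(q) = p for 0 < p < 1 by bisection."""
    lo, hi = 0.0, 1.0
    while qsd_cdf(hi, theta, alpha) < p:  # expand until the root is bracketed
        hi *= 2.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if qsd_cdf(mid, theta, alpha) < p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def qsd_lorenz(p, theta, alpha):
    """Lorenz curve L(p), equation (7.9), for 0 < p < 1."""
    q = qsd_quantile(p, theta, alpha)
    const = theta**3 + 2 * theta + 6 * alpha
    bracket = (alpha * theta**3 * q**3 + theta**2 * (theta + 3 * alpha) * q**2
               + theta * const * q + const)
    return 1.0 - bracket * math.exp(-theta * q) / const

def qsd_bonferroni(p, theta, alpha):
    """Bonferroni curve B(p) = L(p) / p, equation (7.8)."""
    return qsd_lorenz(p, theta, alpha) / p

theta, alpha = 1.2, 0.6  # illustrative values only (assumption)
for p in (0.1, 0.5, 0.9):
    print(p, qsd_lorenz(p, theta, alpha), qsd_bonferroni(p, theta, alpha))
```

Bisection is slow but robust; since the cdf is strictly increasing, any bracketing root finder would serve equally well.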

Now using equations (7.8) and (7.9) in (7.5) and (7.6), the Bonferroni and Gini indices of QSD are thus obtained as

$B = 1 - \frac{\{ α θ^{3} q^{3} + θ^{2} (θ + 3 α) q^{2} + θ (θ^{3} + 2 θ + 6 α) q + (θ^{3} + 2 θ + 6 α) \} e^{- θ q}}{θ^{3} + 2 θ + 6 α}$ (7.10)
$G = \frac{2 \{ α θ^{3} q^{3} + θ^{2} (θ + 3 α) q^{2} + θ (θ^{3} + 2 θ + 6 α) q + (θ^{3} + 2 θ + 6 α) \} e^{- θ q}}{θ^{3} + 2 θ + 6 α} - 1$ (7.11)

Stress-strength reliability

The stress-strength reliability describes the life of a component which has random strength $X$ and is subjected to a random stress $Y$ . When the stress applied to it exceeds the strength, the component fails instantly, and the component functions satisfactorily as long as $X > Y$ . Therefore, $R = P (Y < X)$ is a measure of component reliability, known in the statistical literature as the stress-strength parameter. It has wide applications in many areas, especially in engineering, such as structures, deterioration of rocket motors, static fatigue of ceramic components and aging of concrete pressure vessels. Let $X$ and $Y$ be independent strength and stress random variables having QSD (2.1) with parameters $(θ_{1}, α_{1})$ and $(θ_{2}, α_{2})$ , respectively. Then the stress-strength reliability $R$ of QSD (2.1) can be obtained as

$R = P (Y < X) = \int_{0}^{\infty} P (Y < X | X = x) f_{X} (x) d x$
$= \int_{0}^{\infty} f (x; θ_{1}, α_{1}) F (x; θ_{2}, α_{2}) d x$
$= 1 - \frac{θ_{1}^{3} [\begin{array}{l} θ_{1} θ_{2}^{7} + (4 θ_{1}^{2} + 1) θ_{2}^{6} + (6 θ_{1}^{3} + 5 θ_{1} + 2 α_{1}) θ_{2}^{5} + (4 θ_{1}^{4} + 10 θ_{1}^{2} + 4 α_{1} θ_{1} + 4 α_{2} θ_{1} + 3) θ_{2}^{4} \\ + (θ_{1}^{5} + 10 θ_{1}^{3} + 2 α_{1} θ_{1}^{2} + 14 α_{2} θ_{1}^{2} + 8 α_{1} + 7 θ_{1} + 2 α_{2} θ_{1} + 6 α_{2}) θ_{2}^{3} \\ + (5 θ_{1}^{4} + 18 α_{2} θ_{1}^{3} + 4 α_{2} θ_{1}^{2} + 5 θ_{1}^{2} + 16 α_{1} α_{2} + 10 α_{1} θ_{1} + 14 α_{2} θ_{2} + 6 α_{2}) θ_{2}^{2} \\ + (θ_{1}^{5} + 10 α_{2} θ_{1}^{4} + 2 α_{2} θ_{1}^{3} + θ_{1}^{3} + 10 α_{2} θ_{1}^{2} + 2 α_{1} θ_{1}^{2} + 20 α_{1} α_{2} θ_{1} + 24 α_{1} α_{2} + 6 α_{2} θ_{1}) θ_{2} \\ + 2 (α_{2} θ_{1}^{5} + 2 α_{1} α_{2} θ_{1}^{2} + 2 α_{2} θ_{1}^{3}) \end{array}]}{(θ_{1}^{3} + θ_{1} + 2 α_{1}) (θ_{2}^{3} + θ_{2} + 2 α_{2}) {(θ_{1} + θ_{2})}^{5}}$

It can be easily verified that at $α_{1} = 0$ and $α_{2} = 0$ , the above expression reduces to the corresponding expression for Shanker distribution introduced by Shanker.¹
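
Because the closed-form expression for $R$ is long, a numerical evaluation of $\int_{0}^{\infty} f (x; θ_{1}, α_{1}) F (x; θ_{2}, α_{2}) d x$ is a convenient cross-check. The sketch below is illustrative: it truncates the integral at a finite upper limit, uses a hypothetical hand-written Simpson rule, and the parameter values are assumed.

```python
import math

def qsd_pdf(x, theta, alpha):
    """QSD density, equation (2.1)."""
    norm = theta**3 + theta + 2 * alpha
    return theta**3 / norm * (theta + x + alpha * x**2) * math.exp(-theta * x)

def qsd_cdf(x, theta, alpha):
    """QSD distribution function, equation (2.3)."""
    norm = theta**3 + theta + 2 * alpha
    poly = alpha * theta**2 * x**2 + theta * x * (theta + 2 * alpha)
    return 1.0 - (1.0 + poly / norm) * math.exp(-theta * x)

def simpson(func, a, b, n=20000):
    """Composite Simpson rule on [a, b] with an even number n of subintervals."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3

def stress_strength(theta1, alpha1, theta2, alpha2):
    """R = P(Y < X) for X ~ QSD(theta1, alpha1) and Y ~ QSD(theta2, alpha2)."""
    upper = 60.0 / min(theta1, theta2)  # truncation point; the tail beyond it is negligible
    return simpson(lambda x: qsd_pdf(x, theta1, alpha1) * qsd_cdf(x, theta2, alpha2),
                   0.0, upper)

# Illustrative values only (assumption): strength X tends to exceed stress Y here.
print(stress_strength(0.8, 0.5, 1.6, 0.5))
```

Two simple consistency checks follow from the definition: identical parameters give $R = 1/2$, and swapping the roles of $X$ and $Y$ gives $1 - R$.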

Maximum likelihood estimation of parameters

Let $(x_{1}, x_{2}, x_{3}, ..., x_{n})$ be a random sample of size $n$ from QSD (2.1). The likelihood function, $L$ of (2.1) is given by

$L = {(\frac{θ^{3}}{θ^{3} + θ + 2 α})}^{n} \prod_{i = 1}^{n} (θ + x_{i} + α x_{i}^{2}) e^{- n θ \bar{x}}$

The natural log likelihood function is thus obtained as

$\ln L = n \ln (\frac{θ^{3}}{θ^{3} + θ + 2 α}) + \sum_{i = 1}^{n} \ln (θ + x_{i} + α x_{i}^{2}) - n θ \bar{x}$

The maximum likelihood estimates (MLE) $(\hat{θ}, \hat{α})$ of $(θ, α)$ are then the solutions of the following non-linear equations

$\frac{\partial \ln L}{\partial θ} = \frac{3 n}{θ} - \frac{n (3 θ^{2} + 1)}{θ^{3} + θ + 2 α} + \sum_{i = 1}^{n} \frac{1}{θ + x_{i} + α x_{i}^{2}} - n \bar{x} = 0$
$\frac{\partial \ln L}{\partial α} = \frac{- 2 n}{θ^{3} + θ + 2 α} + \sum_{i = 1}^{n} \frac{x_{i}^{2}}{θ + x_{i} + α x_{i}^{2}} = 0$

where $\bar{x}$ is the sample mean.

These two log-likelihood equations cannot be solved analytically because they do not have closed-form solutions. However, Fisher’s scoring method can be applied to solve them. We have

$\frac{\partial^{2} \ln L}{\partial θ^{2}} = - \frac{3 n}{θ^{2}} + \frac{n (3 θ^{4} - 12 θ α + 1)}{{(θ^{3} + θ + 2 α)}^{2}} - \sum_{i = 1}^{n} \frac{1}{{(θ + x_{i} + α x_{i}^{2})}^{2}}$
$\frac{\partial^{2} \ln L}{\partial θ \partial α} = \frac{2 n (3 θ^{2} + 1)}{{(θ^{3} + θ + 2 α)}^{2}} - \sum_{i = 1}^{n} \frac{x_{i}^{2}}{{(θ + x_{i} + α x_{i}^{2})}^{2}}$
$\frac{\partial^{2} \ln L}{\partial α^{2}} = \frac{4 n}{{(θ^{3} + θ + 2 α)}^{2}} - \sum_{i = 1}^{n} \frac{x_{i}^{4}}{{(θ + x_{i} + α x_{i}^{2})}^{2}}$

The MLEs $(\hat{θ}, \hat{α})$ of $(θ, α)$ of QSD are obtained by solving the following system of equations

${[\begin{matrix} \frac{\partial^{2} \ln L}{\partial θ^{2}} & \frac{\partial^{2} \ln L}{\partial θ \partial α} \\ \frac{\partial^{2} \ln L}{\partial θ \partial α} & \frac{\partial^{2} \ln L}{\partial α^{2}} \end{matrix}]}_{\begin{array}{l} \hat{θ} = θ_{0} \\ \hat{α} = α_{0} \end{array}} [\begin{matrix} \hat{θ} - θ_{0} \\ \hat{α} - α_{0} \end{matrix}] = - {[\begin{matrix} \frac{\partial \ln L}{\partial θ} \\ \frac{\partial \ln L}{\partial α} \end{matrix}]}_{\begin{array}{l} \hat{θ} = θ_{0} \\ \hat{α} = α_{0} \end{array}}$

where $θ_{0}$ and $α_{0}$ are the initial values of $θ$ and $α$ , respectively. These equations are solved iteratively till sufficiently close values of $\hat{θ}$ and $\hat{α}$ are obtained.
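
The ingredients of the iteration can be sketched as follows. This is not the authors' implementation: the second derivatives are obtained here by differentiating the score equations directly, the update is a plain Newton-Raphson step with observed second derivatives (solving $H δ = - g$), and the sample is just the first ten observations of Data set 1, used for brevity. All function names are hypothetical.

```python
import math

# First ten observations of Data set 1 (used only to keep the example short).
SAMPLE = [18.83, 20.8, 21.657, 23.03, 23.23, 24.05, 24.321, 25.5, 25.52, 25.8]

def log_likelihood(theta, alpha, xs):
    """Natural log-likelihood of QSD for the sample xs."""
    n = len(xs)
    norm = theta**3 + theta + 2 * alpha
    return (n * math.log(theta**3 / norm)
            + sum(math.log(theta + x + alpha * x**2) for x in xs)
            - theta * sum(xs))

def score(theta, alpha, xs):
    """First derivatives of ln L with respect to theta and alpha."""
    n = len(xs)
    norm = theta**3 + theta + 2 * alpha
    g = [theta + x + alpha * x**2 for x in xs]
    d_theta = (3 * n / theta - n * (3 * theta**2 + 1) / norm
               + sum(1 / gi for gi in g) - sum(xs))
    d_alpha = -2 * n / norm + sum(x**2 / gi for x, gi in zip(xs, g))
    return d_theta, d_alpha

def hessian(theta, alpha, xs):
    """Second derivatives (theta-theta, theta-alpha, alpha-alpha) of ln L."""
    n = len(xs)
    norm = theta**3 + theta + 2 * alpha
    g = [theta + x + alpha * x**2 for x in xs]
    h_tt = (-3 * n / theta**2
            + n * (3 * theta**4 - 12 * theta * alpha + 1) / norm**2
            - sum(1 / gi**2 for gi in g))
    h_ta = (2 * n * (3 * theta**2 + 1) / norm**2
            - sum(x**2 / gi**2 for x, gi in zip(xs, g)))
    h_aa = 4 * n / norm**2 - sum(x**4 / gi**2 for x, gi in zip(xs, g))
    return h_tt, h_ta, h_aa

def newton_step(theta, alpha, xs):
    """One Newton-Raphson update: solve H * delta = -score by Cramer's rule."""
    g_t, g_a = score(theta, alpha, xs)
    h_tt, h_ta, h_aa = hessian(theta, alpha, xs)
    det = h_tt * h_aa - h_ta**2
    d_theta = (-g_t * h_aa + g_a * h_ta) / det
    d_alpha = (-g_a * h_tt + g_t * h_ta) / det
    # No safeguard is applied: a step may leave the admissible parameter region.
    return theta + d_theta, alpha + d_alpha

print(score(0.2, 1.0, SAMPLE))    # starting values are arbitrary (assumption)
print(newton_step(0.2, 1.0, SAMPLE))
```

In practice the step would be repeated from suitable starting values until successive estimates agree to the desired tolerance, possibly with step halving when an update leaves the region $θ > 0$, $θ^{3} + θ + 2 α > 0$.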

Data analysis

In this section, the goodness of fit of QSD has been discussed with two real lifetime data sets from engineering and the fit has been compared with one parameter exponential, Lindley and Shanker distributions. The following two data sets have been considered.

Data set 1

This data set is the strength data of glass of the aircraft window reported by Fuller et al.⁶

18.83	20.8	21.657	23.03	23.23	24.05	24.321	25.5	25.52	25.8	26.69	26.77	26.78
27.05	27.67	29.9	31.11	33.2	33.73	33.76	33.89	34.76	35.75	35.91	36.98	37.08
37.09	39.58	44.045	45.29	45.381

Data set 2

The following data represent the tensile strength, measured in GPa, of 69 carbon fibers tested under tension at gauge lengths of 20 mm, reported by Bader and Priest.⁷

1.312	1.314	1.479	1.552	1.7	1.803	1.861	1.865	1.944	1.958	1.966	1.997	2.006
	2.021	2.027	2.055	2.063	2.098	2.14	2.179	2.224	2.24	2.253	2.27	2.272
	2.274	2.301	2.301	2.359	2.382	2.382	2.426	2.434	2.435	2.478	2.49	2.511
	2.514	2.535	2.554	2.566	2.57	2.586	2.629	2.633	2.642	2.648	2.684	2.697
	2.726	2.77	2.773	2.8	2.809	2.818	2.821	2.848	2.88	2.954	3.012	3.067
	3.084	3.09	3.096	3.128	3.233	3.433	3.585	3.585

In order to compare the considered distributions, values of $- 2 \ln L$ , AIC (Akaike Information Criterion) and the K-S statistic (Kolmogorov-Smirnov statistic) for the data sets have been computed and presented in Table 1. The AIC and K-S statistic are defined as follows:

$A I C = - 2 \ln L + 2 k$ and $K - S = \underset{x}{Sup} | F_{n} (x) - F_{0} (x) |$ , where $k$ is the number of parameters, $n$ is the sample size, $F_{n} (x)$ is the empirical distribution function and $F_{0} (x)$ is the theoretical cumulative distribution function. The best distribution corresponds to the lowest values of $- 2 \ln L$ , AIC and K-S statistic. Table 1 shows that the QSD gives a better fit than the one-parameter exponential, Lindley and Shanker distributions on both data sets, and hence it can be considered an important distribution for modeling lifetime data from engineering.

Data sets	Distributions	ML estimates	Standard errors	$- 2 \ln L$	AIC	K-S statistic
1	QSD	$\hat{θ} = 0.097330$	0.0101017	240.53	244.53	0.298
	QSD	$\hat{α} = 13.623065$	52.81378	240.53	244.53	0.298
	Shanker	$\hat{θ} = 0.6471636$	0.0082	252.35	254.35	0.358
	Lindley	$\hat{θ} = 0.062990$	0.008	253.98	255.98	0.365
	Exponential	$\hat{θ} = 0.032449$	0.005822	274.53	276.53	0.458
2	QSD	$\hat{θ} = 1.20552$	0.083861	186.78	190.78	0.314
	QSD	$\hat{α} = 49.73844$	34.58363	186.78	190.78	0.314
	Shanker	$\hat{θ} = 0.658030$	0.052373	233	235	0.369
	Lindley	$\hat{θ} = 0.65450$	0.058031	238.38	240.38	0.401
	Exponential	$\hat{θ} = 0.407942$	0.04911	261.73	263.73	0.448

Table 1 MLEs, standard errors, $- 2 \ln L$ , AIC and K-S statistic of the fitted distributions for data sets 1 and 2
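
The goodness-of-fit quantities in Table 1 can be computed along the following lines. This is a minimal sketch for Data set 1 with the QSD estimates reported in Table 1; the function names are hypothetical, and exact agreement with the tabulated digits is not guaranteed because the computational details of the original analysis are not given.

```python
import math

# Data set 1: strength of aircraft window glass, as listed above.
DATA1 = [18.83, 20.8, 21.657, 23.03, 23.23, 24.05, 24.321, 25.5, 25.52, 25.8,
         26.69, 26.77, 26.78, 27.05, 27.67, 29.9, 31.11, 33.2, 33.73, 33.76,
         33.89, 34.76, 35.75, 35.91, 36.98, 37.08, 37.09, 39.58, 44.045,
         45.29, 45.381]

def qsd_cdf(x, theta, alpha):
    """QSD distribution function, equation (2.3)."""
    norm = theta**3 + theta + 2 * alpha
    poly = alpha * theta**2 * x**2 + theta * x * (theta + 2 * alpha)
    return 1.0 - (1.0 + poly / norm) * math.exp(-theta * x)

def neg2_log_likelihood(xs, theta, alpha):
    """-2 ln L for the QSD evaluated at (theta, alpha)."""
    norm = theta**3 + theta + 2 * alpha
    log_lik = sum(math.log(theta**3 / norm) + math.log(theta + x + alpha * x**2)
                  - theta * x for x in xs)
    return -2.0 * log_lik

def aic(neg2_log_lik, k):
    """AIC = -2 ln L + 2k, with k the number of fitted parameters."""
    return neg2_log_lik + 2 * k

def ks_statistic(xs, cdf):
    """Sup over x of |F_n(x) - F_0(x)|, checked on both sides of each jump of F_n."""
    ordered = sorted(xs)
    n = len(ordered)
    d = 0.0
    for i, x in enumerate(ordered, start=1):
        f0 = cdf(x)
        d = max(d, i / n - f0, f0 - (i - 1) / n)
    return d

theta_hat, alpha_hat = 0.097330, 13.623065  # QSD estimates reported in Table 1
m2ll = neg2_log_likelihood(DATA1, theta_hat, alpha_hat)
print(m2ll, aic(m2ll, 2),
      ks_statistic(DATA1, lambda x: qsd_cdf(x, theta_hat, alpha_hat)))
```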

Concluding remarks

A two-parameter quasi Shanker distribution (QSD), of which the one-parameter Shanker distribution introduced by Shanker¹ is a particular case, has been suggested and investigated. Its mathematical properties including moments, coefficient of variation, skewness, kurtosis, index of dispersion, hazard rate function, mean residual life function, stochastic ordering, mean deviations, Bonferroni and Lorenz curves, and stress-strength reliability have been discussed. The method of maximum likelihood has been discussed for estimating its parameters. Finally, two numerical examples of real lifetime data sets have been presented to test the goodness of fit of QSD against the exponential, Lindley and Shanker distributions, and the fit by QSD has been quite satisfactory. Therefore, QSD can be recommended as an important two-parameter lifetime distribution.