Research Article Volume 8 Issue 6
Department of Statistics, Mathematics and Insurance, Faculty of Commerce, Tanta University, Egypt
Correspondence: Dina H. Abdel Hady, Department of Statistics, Mathematics and Insurance, Faculty of Commerce, Tanta University, Egypt
Received: September 24, 2019 | Published: December 30, 2019
Citation: Abdelhady DH. Parameter estimation for the bivariate inverse lomax distribution by the EM algorithm based on censored samples. Biom Biostat Int J. 2019;8(6):223-229. DOI: 10.15406/bbij.2019.08.00293
In this article, we estimate the parameters of the Marshall-Olkin bivariate Inverse Lomax distribution based on right-censored samples. The EM algorithm is the natural choice because the vector of observed data is incomplete and can be viewed as an observable function of complete data; the EM algorithm then exploits the simplicity of maximum likelihood estimation for complete data. In addition, the standard deviations of the estimates of the bivariate Inverse Lomax distribution are derived.
A simulation study is conducted to compare the estimates obtained with and without the EM algorithm.
Keywords: bivariate lomax distribution, censored sample, EM algorithm, missing data, simulation study
The Inverse Lomax distribution is one of the inverted distributions; it is very flexible for analyzing situations with a non-monotonic failure rate (Singh et al.1). If a random variable Y has a Lomax distribution, then X = 1/Y has an Inverse Lomax distribution (ILD) (Singh et al.2). The ILD is used in stochastic modeling of the lifetimes of components that have a decreasing failure rate. Like other distributions in the generalized Beta family, the Inverse Lomax distribution also has applications in actuarial science and economics (Kleiber and Kotz3).
The Inverse Lomax distribution was applied to geophysical databases by McKenzie et al.,4 namely data on the sizes of land fires in the state of California. Rahman et al.5 carried out research on statistical inference and prediction for the Inverse Lomax distribution via Bayesian methods. Kleiber6 used the Inverse Lomax distribution to establish the Lorenz ordering relationship between order statistics.
In this article, maximum likelihood estimates (MLEs) of the parameters of the Marshall-Olkin bivariate Inverse Lomax distribution (MOBIL)7 are obtained based on censored samples. The censoring time T is assumed to be independent of the life times (X, Y) of the two components. This situation arises, for example, in medical studies of paired organs such as kidneys, eyes or lungs, which behave like an interdependent two-component system. Death of the individual may censor the failure of either one of the paired organs or of both.
This censoring scheme is right censoring. Similar situations arise in engineering whenever a sub-system has two components with life times (X, Y) that are independent of the life time T of the entire system; failure of the main system may censor failure of either one component or both. Maximum likelihood estimators of the parameters for the case of univariate right censoring were derived by Hanagal.8,9 Censoring may also occur in other ways: patients may be lost to follow-up during the study, may move elsewhere so that the experimenter cannot follow them, or may stop cooperating because of adverse side effects of the treatment. These cases are called withdrawal from the study. Censored observations still carry useful information and therefore should not be omitted from the analysis. Due to the lack of data on real processes, the data in this study are generated from the Marshall-Olkin bivariate Lomax (BVL) distribution7 with Matlab software, and the parameters are then estimated using the EM algorithm.
Let X be a random variable with the following CDF:

F(x) = (1 + θ/x)^(−α), x > 0, α > 0, θ > 0.

A distribution of this form is said to be an Inverse Lomax distribution with parameters α and θ, denoted by IL(α, θ). The PDF and the hazard function of the Inverse Lomax distribution with parameters (α, θ) are

f(x) = (αθ/x²)(1 + θ/x)^(−(α+1)), x > 0, and h(x) = f(x)/(1 − F(x)).
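Under the parameterization F(x) = (1 + θ/x)^(−α) above, these functions can be sketched in Python; the function names `il_pdf`, `il_cdf`, `il_hazard` and `il_quantile` are our own illustrative choices, not part of the paper:

```python
def il_cdf(x, alpha, theta):
    """CDF of the Inverse Lomax distribution IL(alpha, theta)."""
    return (1.0 + theta / x) ** (-alpha)

def il_pdf(x, alpha, theta):
    """PDF: derivative of the CDF with respect to x."""
    return alpha * theta / x**2 * (1.0 + theta / x) ** (-(alpha + 1.0))

def il_hazard(x, alpha, theta):
    """Hazard function h(x) = f(x) / (1 - F(x))."""
    return il_pdf(x, alpha, theta) / (1.0 - il_cdf(x, alpha, theta))

def il_quantile(u, alpha, theta):
    """Inverse CDF, solving (1 + theta/x)^(-alpha) = u for x;
    useful for inverse-transform sampling."""
    return theta / (u ** (-1.0 / alpha) - 1.0)
```

The quantile function makes simulation straightforward: feeding a uniform random number to `il_quantile` produces an IL(α, θ) variate.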
Suppose U1, U2 and U3 are three independent random variables such that Ui ~ IL(αi, θ) for i = 1, 2, 3. Define X = max(U1, U3) and Y = max(U2, U3).
Then we say that the bivariate vector (X, Y) has a bivariate Inverse Lomax distribution of Marshall-Olkin type.7
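Because the IL CDF has the power form (1 + θ/x)^(−α), CDFs of independent IL variables with a common θ multiply, so a Marshall-Olkin-type maximum X = max(U1, U3) has an IL(α1 + α3, θ) marginal. A small Monte Carlo sketch checks this empirically (an illustration under the parameterization F(x) = (1 + θ/x)^(−α); all names are ours):

```python
import random

def il_sample(alpha, theta, rng):
    # Inverse-transform sampling: solve (1 + theta/x)^(-alpha) = u for x.
    u = rng.random()
    return theta / (u ** (-1.0 / alpha) - 1.0)

def il_cdf(x, alpha, theta):
    return (1.0 + theta / x) ** (-alpha)

rng = random.Random(1)
a1, a3, theta = 1.6, 1.8, 1.0
n = 200_000
# X = max(U1, U3) should follow IL(a1 + a3, theta).
xs = [max(il_sample(a1, theta, rng), il_sample(a3, theta, rng)) for _ in range(n)]
for q in (0.5, 1.0, 2.0, 5.0):
    emp = sum(x <= q for x in xs) / n
    print(q, round(emp, 3), round(il_cdf(q, a1 + a3, theta), 3))
```

The empirical and theoretical CDF values agree to within Monte Carlo error, illustrating why the joint CDF factorizes over the regions x < y and x > y.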
The joint cumulative distribution function of X and Y, writing F_IL(·; α, θ) for the IL(α, θ) CDF, is

F(x, y) = F_IL(x; α1, θ) F_IL(y; α2, θ) F_IL(min(x, y); α3, θ),

where

F(x, y) = F_IL(x; α1 + α3, θ) F_IL(y; α2, θ), for x < y,

and

F(x, y) = F_IL(x; α1, θ) F_IL(y; α2 + α3, θ), for x ≥ y.

The joint probability density function of X and Y takes the form

f(x, y) = f1(x, y) for x < y, f2(x, y) for x > y, and f0(x) for x = y,

where

f1(x, y) = f_IL(x; α1 + α3, θ) f_IL(y; α2, θ) and f2(x, y) = f_IL(x; α1, θ) f_IL(y; α2 + α3, θ),

and the singular part is

f0(x) = (α3/(α1 + α2 + α3)) f_IL(x; α1 + α2 + α3, θ).
The density functions of and Z=min (X,Y) are given as follows:
This article aims at deriving an estimation method for the parameters of a bivariate Inverse Lomax distribution of Marshall-Olkin type via the EM algorithm. Section 2 presents the EM algorithm, Section 3 estimates the parameters by applying it, and Section 4 presents the results of a simulation study.
The expectation-maximization (EM) algorithm was introduced by Dempster et al.10 It is an iterative procedure for finding maximum likelihood estimates when data are missing, incomplete, unobserved or censored. Because it is easy to implement, the EM algorithm has had a great impact, not only as a computational tool but also as a method for solving complicated statistical problems.
The basic idea behind the method is to transform an incomplete-data problem into a complete-data problem for which the required maximization is computationally more tractable and numerically stable. Each iteration raises the likelihood, which finally converges to a local maximum. The complete data set x can be viewed as consisting of the vectors (t, z), where t is the observed incomplete data and z is the missing data.
The EM Algorithm method has been applied by several authors, for example, see Qin, et al.,11 Rudolf,12 Ning, et al.,13 Zanini, et al.,14 Açıkgöz,15 Kalabatsos,16 Attia, et al.17 and Hanagal and Ahmadi.18
The iterations
The objective is to draw inferences about the parameter vector Θ. We will use L(Θ; t) to denote the likelihood function, where t is the vector of observed data. Let z represent the vector of missing data. Starting with a guessed value Θ(0) for the parameter vector, carry out the following iterations:
Execution of the algorithm
The two steps are repeated iteratively until the difference between two successive iterations is less than 0.00001. This iterative procedure leads to a monotonic increase of the log-likelihood log L(Θ; t):
Since the likelihood increases at each step, the EM algorithm generally converges to a local maximum. When there is no closed-form solution in the M-step, a numerical algorithm such as the Newton-Raphson procedure may be used to compute Θ(k+1) iteratively. In fact, in this paper the Newton-Raphson procedure is used to obtain the maximum likelihood estimates at the (k+1)th iteration as follows:
The iterative procedure is carried out until the difference between two successive solutions is sufficiently small. For more details, see Hanagal and Ahmadi.18
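The generic loop can be illustrated on a simpler model where every quantity has a closed form: right-censored exponential data. There the complete-data MLE of the rate is n divided by the total of the life times, and the E-step replaces each censored life time c by its conditional expectation c + 1/λ (by the memoryless property). This toy example, not the paper's bivariate model, is meant only to show the E/M alternation and the 0.00001-type stopping rule:

```python
def em_exponential(obs, cens, lam=1.0, tol=1e-5, max_iter=1000):
    """EM for exponential life times under right censoring.
    obs: fully observed life times; cens: censoring times of censored units."""
    n = len(obs) + len(cens)
    for _ in range(max_iter):
        # E-step: expected life time of a unit censored at c is c + 1/lam.
        expected_total = sum(obs) + sum(c + 1.0 / lam for c in cens)
        # M-step: complete-data MLE of the rate.
        lam_new = n / expected_total
        if abs(lam_new - lam) < tol:
            return lam_new
        lam = lam_new
    return lam

obs = [0.5, 1.2, 0.3, 2.0]
cens = [1.0, 1.5]
lam_hat = em_exponential(obs, cens)
```

The fixed point of this iteration is the usual censored-data MLE, i.e. the number of observed failures divided by the total time at risk, which provides an easy correctness check.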
The density function of (X,Y) is given by
For the bivariate life time distribution, we use the univariate censoring scheme presented by Hanagal,8,9 in which persons do not join the study at the same time, and withdrawal or death of a person, or termination of the study, censors the life times of both components. The censoring time is independent of the life times of both components. This is the standard univariate right-censoring scheme.
Suppose that there are n independent pairs of components under study (for example, paired kidneys, lungs, eyes or ears of an individual), and let the ith pair of components have life times (Xi, Yi) and censoring time Ti. The observed life times associated with the ith pair of components are given by
for i = 1, 2, ..., n,
where Tix, Tiy and Tixy represent the unobserved random variables corresponding to the cases x < y, x > y and x = y, respectively.
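With univariate right censoring, each triple (Xi, Yi, Ti) falls into one of six disjoint outcome classes: both failures observed with x < y, x > y, or x = y; exactly one component censored; or both censored. One plausible labelling of the six ranges (our own convention for illustration; the paper's ordering did not survive extraction) can be coded as:

```python
def classify(x, y, t):
    """Return the outcome class j in 1..6 for life times (x, y)
    subject to a common censoring time t."""
    if x < t and y < t:          # both failures observed before t
        if x < y:
            return 1
        if x > y:
            return 2
        return 3                 # x == y: simultaneous failure observed
    if x < t <= y:
        return 4                 # X observed, Y censored at t
    if y < t <= x:
        return 5                 # Y observed, X censored at t
    return 6                     # both life times censored at t
```

Counting how many of the n pairs land in each class gives the quantities nj, j = 1, ..., 6, appearing in the likelihood below.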
The likelihood of a sample of size n, after discarding factors that do not contain any of the parameters of interest, is given as follows:
where
where nj, j = 1, 2, ..., 6, denote the numbers of realizations falling in the ranges corresponding to the six possible outcomes. The first three densities are density functions with respect to the Lebesgue measure on R2, while the remaining three are density functions with respect to the Lebesgue measure on R.
Let the range of variability corresponding to outcome j be denoted by Aj, j = 1, 2, ..., 6; the Aj are disjoint sets. With this notation, the log-likelihood function can be written as follows:
Parameter estimation with the EM algorithm
The E-step and M-step are obtained as follows:

E-step:
The unobserved random variables Tix, Tiy and Tixy follow the distributions stated in (7). Their conditional distributions are given as follows:
The values of the first moments of the conditional unobserved random variables are as follows.
The conditional expectation is obtained as follows:
Suppose θ is known and the conditional expectations are expressed in terms of the parameter values at the kth iteration.
M-step:
The following equations are obtained by equating to zero the partial derivatives of the expected complete-data log-likelihood with respect to the parameters:
where
The above likelihood equations are solved for the maximum likelihood estimates using the Newton-Raphson procedure. The observed elements of the symmetric information matrix at the current guess, which are required in the Newton-Raphson iterative procedure, are given below.
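As a one-parameter illustration of the Newton-Raphson step (again a sketch, not the paper's six-parameter system): with θ known, an IL(α, θ) sample has score n/α − S, where S = Σ log(1 + θ/xi), and information n/α², so the root can be checked against the closed form α̂ = n/S:

```python
import math

def nr_alpha_hat(xs, theta, alpha=1.0, tol=1e-8, max_iter=100):
    """Newton-Raphson for the score equation n/alpha - S = 0
    of an IL(alpha, theta) sample with theta known."""
    n = len(xs)
    S = sum(math.log(1.0 + theta / x) for x in xs)
    for _ in range(max_iter):
        score = n / alpha - S
        info = n / alpha**2            # observed information (minus second derivative)
        alpha_new = alpha + score / info
        if abs(alpha_new - alpha) < tol:
            return alpha_new
        alpha = alpha_new
    return alpha

alpha_hat = nr_alpha_hat([2.0, 3.0, 1.5, 4.0], 1.0)
```

In the paper's setting the same update is applied with the full score vector and the symmetric information matrix in place of the scalars above.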
As mentioned above, the iterative process is carried out until the following condition is met by two successive solutions:
Parameter estimation without the EM algorithm
The likelihood of a sample of size n, after discarding factors that do not contain any of the parameters of interest, is given as follows:
where
The log-likelihood function can be written as follows:
The following likelihood equations are obtained by equating to zero the partial derivatives of the log-likelihood with respect to the parameters:
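For comparison with the EM route, the direct ("without EM") approach can be illustrated on the same censored-exponential toy model used earlier: the observed-data log-likelihood is l(λ) = d log λ − λT, with d the number of observed failures and T the total time at risk, and its score equation d/λ − T = 0 is solved here by Newton-Raphson (a sketch on a simpler model, not the paper's system):

```python
def direct_mle_exponential(obs, cens, lam=1.0, tol=1e-8, max_iter=100):
    """Maximize the observed-data log-likelihood of right-censored
    exponential data by Newton-Raphson on the score d/lam - T."""
    d = len(obs)                       # number of observed failures
    T = sum(obs) + sum(cens)           # total time at risk
    for _ in range(max_iter):
        score = d / lam - T
        info = d / lam**2              # minus the second derivative
        lam_new = lam + score / info
        if abs(lam_new - lam) < tol:
            return lam_new
        lam = lam_new
    return lam

lam_direct = direct_mle_exponential([0.5, 1.2, 0.3, 2.0], [1.0, 1.5])
```

Both routes converge to the same maximizer here, mirroring the paper's finding that the two methods agree for large samples while differing in their finite-sample standard errors.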
The sample data are generated according to the following algorithm:
Step 1: Generate observations from the Lomax distribution with the specified parameters.
Step 2: Take the reciprocals, so that (X, Y) follows a bivariate Inverse Lomax distribution of Marshall-Olkin type.
Step 3: Generate ti using the Inverse Lomax distribution, where the ti are the censoring times.
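The three steps can be sketched as follows, assuming (as step 2 indicates) that inverting Marshall-Olkin bivariate Lomax components yields the bivariate Inverse Lomax pair; the parameter names and the specific censoring-distribution parameters are our illustrative choices:

```python
import random

def lomax_sample(alpha, theta, rng):
    # Inverse transform for the Lomax CDF 1 - (1 + y/theta)^(-alpha).
    u = rng.random()
    return theta * ((1.0 - u) ** (-1.0 / alpha) - 1.0)

def il_sample(alpha, theta, rng):
    # The reciprocal of a Lomax variate is Inverse Lomax.
    return 1.0 / lomax_sample(alpha, theta, rng)

def generate_sample(n, a1, a2, a3, theta, ac, thetac, seed=0):
    """Generate n right-censored pairs from the Marshall-Olkin-type model.

    Step 1: draw independent Lomax variates u1, u2, u3.
    Step 2: x = 1/min(u1, u3), y = 1/min(u2, u3) gives the Inverse Lomax pair.
    Step 3: draw an Inverse Lomax censoring time t (parameters ac, thetac
    are an assumption for illustration).
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        u1 = lomax_sample(a1, theta, rng)
        u2 = lomax_sample(a2, theta, rng)
        u3 = lomax_sample(a3, theta, rng)
        x, y = 1.0 / min(u1, u3), 1.0 / min(u2, u3)
        t = il_sample(ac, thetac, rng)
        # Record the censored observations together with status flags.
        data.append((min(x, t), min(y, t), x <= t, y <= t))
    return data

sample = generate_sample(20, 1.6, 1.4, 1.8, 1.0, 0.6, 1.0)
```

Since the minimum of independent Lomax variates with a common θ is again Lomax, this construction reproduces the Marshall-Olkin bivariate Lomax before inversion.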
For two cases of parameter values, we generated 1000 sets of samples, each consisting of samples of sizes n = 20, 35, 50 and 100. The corresponding maximum likelihood estimates are displayed in Table 1 and Table 2 together with the empirical standard deviations. The estimates denoted by MLEem and SEem are obtained using the EM algorithm, while the estimates denoted by MLE and SE are obtained without the EM algorithm.
| Parameters | α1 | α2 | α3 | α4 | α5 | α6 |
|---|---|---|---|---|---|---|
| True values | 1.6 | 1.4 | 1.8 | 0.6 | 0.5 | 0.4 |
| n = 20 |  |  |  |  |  |  |
| MLEem | 1.6113 | 1.3882 | 1.8154 | 0.6131 | 0.4812 | 0.3874 |
| SEem | 0.0771 | 0.0682 | 0.0854 | 0.0865 | 0.0967 | 0.0793 |
| MLE | 1.6173 | 1.3845 | 1.8193 | 0.6216 | 0.4735 | 0.3818 |
| SE | 0.0943 | 0.0817 | 0.0975 | 0.0972 | 0.1076 | 0.0926 |
| n = 35 |  |  |  |  |  |  |
| MLEem | 1.6092 | 1.3914 | 1.8122 | 0.6092 | 0.4885 | 0.3917 |
| SEem | 0.0723 | 0.0637 | 0.0761 | 0.0782 | 0.0913 | 0.0725 |
| MLE | 1.6117 | 1.3902 | 1.8164 | 0.6173 | 0.4834 | 0.3861 |
| SE | 0.0788 | 0.0636 | 0.0826 | 0.0841 | 0.1015 | 0.0844 |
| n = 50 |  |  |  |  |  |  |
| MLEem | 1.6064 | 1.3931 | 1.8093 | 0.6062 | 0.4921 | 0.3943 |
| SEem | 0.0681 | 0.0586 | 0.0717 | 0.0726 | 0.0813 | 0.0637 |
| MLE | 1.6113 | 1.3915 | 1.8124 | 0.6125 | 0.4876 | 0.3903 |
| SE | 0.0714 | 0.0622 | 0.0773 | 0.0796 | 0.0972 | 0.0779 |
| n = 100 |  |  |  |  |  |  |
| MLEem | 1.6033 | 1.3947 | 1.8068 | 0.6046 | 0.4969 | 0.3973 |
| SEem | 0.0655 | 0.0561 | 0.0693 | 0.0597 | 0.0772 | 0.0589 |
| MLE | 1.6055 | 1.3968 | 1.8059 | 0.6071 | 0.4934 | 0.3942 |
| SE | 0.0658 | 0.0572 | 0.0714 | 0.0615 | 0.0786 | 0.0603 |

Table 1 Comparison of MLEs obtained with and without the EM algorithm
Table 1 and Table 2 show that the estimates obtained using the EM algorithm have smaller empirical standard errors than those obtained without the EM algorithm.
| Parameters | α1 | α2 | α3 | α4 | α5 | α6 |
|---|---|---|---|---|---|---|
| True values | 2.5 | 2 | 2.2 | 1.1 | 0.9 | 1.4 |
| n = 20 |  |  |  |  |  |  |
| MLEem | 2.613 | 2.221 | 2.414 | 1.289 | 1.089 | 1.603 |
| SEem | 0.141 | 0.135 | 0.157 | 0.167 | 0.149 | 0.163 |
| MLE | 2.777 | 2.335 | 2.575 | 1.432 | 1.176 | 1.699 |
| SE | 0.173 | 0.169 | 0.189 | 0.192 | 0.176 | 0.194 |
| n = 35 |  |  |  |  |  |  |
| MLEem | 2.587 | 2.198 | 2.385 | 1.176 | 1.054 | 1.547 |
| SEem | 0.109 | 0.119 | 0.132 | 0.123 | 0.135 | 0.152 |
| MLE | 2.711 | 2.282 | 2.513 | 1.356 | 1.103 | 1.621 |
| SE | 0.148 | 0.135 | 0.159 | 0.184 | 0.161 | 0.183 |
| n = 50 |  |  |  |  |  |  |
| MLEem | 2.534 | 2.145 | 2.313 | 1.142 | 0.987 | 1.498 |
| SEem | 0.089 | 0.097 | 0.114 | 0.114 | 0.119 | 0.124 |
| MLE | 2.665 | 2.214 | 2.478 | 1.258 | 1.067 | 1.557 |
| SE | 0.123 | 0.123 | 0.137 | 0.162 | 0.149 | 0.168 |
| n = 100 |  |  |  |  |  |  |
| MLEem | 2.516 | 2.061 | 2.257 | 1.122 | 0.931 | 1.433 |
| SEem | 0.078 | 0.068 | 0.096 | 0.079 | 0.088 | 0.104 |
| MLE | 2.608 | 2.111 | 2.319 | 1.207 | 0.987 | 1.519 |
| SE | 0.117 | 0.109 | 0.121 | 0.123 | 0.117 | 0.147 |

Table 2 Comparison of MLEs obtained with and without the EM algorithm
Moreover, the estimates MLEem are close to the true parameter values, and the standard errors SEem decrease as the sample size increases. For both methods, the reported values are the means of the 1000 maximum likelihood estimates and of the 1000 standard errors over the 1000 samples of each size n = 20, 35, 50 and 100.
Also note that, for large sample sizes, the estimates of the two methods (with and without the EM algorithm) approach each other, as do their standard errors. This means that for large samples both methods give comparable results, and whichever is more convenient may be applied.
©2019 Abdelhady. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.