Effect of correlated measurement errors on estimation of population mean with modified ratio estimator

doi:10.15406/bbij.2022.11.00354

eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Research Article Volume 11 Issue 2

Effect of correlated measurement errors on estimation of population mean with modified ratio estimator

Okafor Ikechukwu Boniface,

Verify Captcha

Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.

Onyeka Aloysius Chijioke, Ogbonna Chukwudi Justin, Izunobi Chinyeaka Hostensia

Department of Statistics, Federal University of Technology, Nigeria

Correspondence: Okafor Ikechukwu Boniface, Department of Statistics, Federal University of Technology, Owerri, Imo state, Nigeria

Received: November 19, 2021 | Published: April 25, 2022

Citation: Boniface OI, Chijioke OA, Justin OC, et al. Effect of correlated measurement errors on estimation of population mean with modified ratio estimator. Biom Biostat Int J. 2022;11(2):52-56. DOI: 10.15406/bbij.2022.11.00354

Download PDF

Abstract

This paper proposes a class of modified ratio estimators of population mean using correlation coefficient between study and auxiliary variables in the presence of correlated measurement errors under simple random strategy. Usual unbiased estimator of sample mean per unit, ratio and product-type estimators belong to the suggested modified class of estimators. Considering large sample approximation, properties of the proposed estimator are obtained. Theoretical and empirical analysis revealed that the proposed class of estimators are more efficient than some existing estimators.

Keywords: Correlated measurement errors, ratio estimator, bias, mean squared error, correlation coefficient.

Introduction

Many researchers have widely utilized auxiliary information while estimating population parameters. This has contributed immensely in advancing sampling theory as a result of its ability to improve the accuracy of sampling strategies and reduce their design variances. Due to the fact that sample sizes are not sufficiently large in most of the survey exercises, estimators of population parameters based on these survey exercises may not be satisfactory in terms of their variances. At the same time it is not unusual that some auxiliary information about the study variable may be available. Such additional information, if available, can be utilized to improve properties of estimators. Some of the auxiliary information about the population that is used to improve the accuracy of an estimator may include a known variable to which study variable is approximately related. Such estimators which utilize auxiliary information include ratio, product and regression estimators. Although use of auxiliary information may have improved the estimates of population parameters, measurement errors may still influence the efficiency of the estimators.

In sampling survey, properties of estimators presume that observed values are indeed true values. However, several observations of the same quantity on the same subject may not in most cases be the same as a result of natural variation in the subject, variation in the observational process, or both. Hence, it is generally accepted that data available for statistical analysis are subject to error.

The difference between the individual observed values and their corresponding true values are referred to as measurement errors. This constitutes an essential part of errors in any sample survey data and their presence is practically inevitable whatever precautions one takes. The causes of these measurement errors may be attributed to errors during data collection stage due to respondents or enumerators’ bias or both, and to data collation and coding.^1,2 The magnitude of the effect of measurement errors on statistical inference drawn about the population parameter may sometimes be inconsequential. However, in some other situation, the magnitude may throw a serious concern which may invalidate the inference drawn and lead to unfortunate implication.

Shalabh³ had examined the issue of observational error or measurement errors on ratio estimator under simple random sampling strategy. Following his work, other researchers further investigated the impact of measurement errors on the estimators of population parameters using different sampling schemes. Manish and Singh⁴considered linear combination of ratio estimator and sample mean per unit and came up with a family of estimators of population mean. They obtained the bias and mean squared error of the proposed family of estimators when the sample data are contaminated with measurement errors. Using variable transformation, Diwakar et al.⁵ worked on estimator of a population mean in the presence of measurement errors and the properties of the estimator were obtained. Comparing this estimator with the estimators proposed by Manish and Singh⁴ and Shalabh³ when the study and auxiliary variables are contaminated with measurement errors, it was observed that their proposed estimator is more efficient in a localized domain. Using variable transformation, Viplav et al.⁶ studied a class of difference-type estimator for estimating the population mean of the study variable when measurement errors are present. They generated some new estimators that belong to the family of estimators proposed by them. Their empirical study showed that the suggested estimators have more gain in efficiency overother existing estimators.

Gregoire and Salas⁷ studied systematic measurement errors as well as measurement errors that are assumed to be stochastic in nature. They obtained the statistical properties of three ratio estimators under these measurement error conditions. They concluded that the ratio-of-means estimator appears to be less affected when the auxiliary variants are contaminated with measurement errors. Empirical study of ratio and regression estimators through Monte Carlo simulation by Sahoo et al.⁸ when the auxiliary variable is contaminated with the measurement errors reveals that the regression estimator is more sensitive to measurement errors than the ratio estimator with respect to their efficiency. Bias of both estimators is sensitive to measurement errors with the bias of an estimator decreasing as the sample size is increasing, and increase when the regression line of (study variable) on (auxiliary variable)moves away from the origin.

All the work reviewed so far were based on the general assumption that measurement errors are uncorrelated though the study variable and auxiliary variable are correlated. However, Shalabh and Jia-Ren⁹relaxed the general assumption and studied the performance of ratio as well as product estimators of population mean with correlated measurement errors.

In this work, we examine the performance of modified ratio-type estimator of population mean under the influence of correlated measurement errors using simple random sampling scheme.

Measurement error model definition

Considering, a population of size N, $(U_{i} = U_{1}, U_{2}, \dots, U_{N})$ $(U_{i} = U_{1}, U_{2}, \dots, U_{N})$ . Let’s denote the study variable as $y$ $y$ and the auxiliary variableas $x$ $x$ and let them take on the values $y_{i}$ $y_{i}$ and $x_{i}$ $x_{i}$ respectively on the $i^{t h}$ $i^{t h}$ unit of $U_{i}, (i = 1, 2, \dots, N)$ $U_{i}, (i = 1, 2, \dots, N)$ . We denote population mean of $y$ $y$ and $x$ $x$ as $μ_{Y}$ $μ_{Y}$ and $μ_{X}$ $μ_{X}$ respectively, and the population variance of $y$ $y$ and $x$ $x$ as $σ_{Y}^{2}$ $σ_{Y}^{2}$ and $σ_{X}^{2}$ $σ_{X}^{2}$ respectively. Also let $σ_{X Y}$ $σ_{X Y}$ and $ρ$ $ρ$ denote the population covariance and the correlation coefficient between $ρ$ $ρ$ and $x$ $x$ .

Assume a simple random sample without replacement (SRSWOR) of size n is drawn from population U. Let $¯ y$ $\bar{y}$ and $¯ x$ $\bar{x}$ be the sample means of $y$ $y$ and $x$ $x$ respectively. Thus, for a simple random sampling scheme, let ( $y_{i}$ $y_{i}$ , $x_{i}$ $x_{i}$ ) be observed values instead of the true values $(y_{i}^{*}, x_{i}^{*})$ $(y_{i}^{*}, x_{i}^{*})$ on the two characteristics $(y, x)$ $(y, x)$ respectively for the $i^{t h}$ $i^{t h}$ unit $(i = 1, 2, \dots, n)$ $(i = 1, 2, \dots, n)$ in a sample of size n. Let the measurement errors be defined as:

$u_{i} = y_{i} - y_{i}^{*}$ $u_{i} = y_{i} - y_{i}^{*}$ (1)

$v_{i} = x_{i} - x_{i}^{*}$ $v_{i} = x_{i} - x_{i}^{*}$ (2)

Such that

$E (u) = E (v) = 0$ $E (u) = E (v) = 0$

$V a r (u) = σ_{u}^{2}$ $V a r (u) = σ_{u}^{2}$ , $V a r (v) = σ_{v}^{2}$ $V a r (v) = σ_{v}^{2}$

$cov (u, v) = ρ^{*} σ_{u} σ_{v}$ $cov (u, v) = ρ^{*} σ_{u} σ_{v}$

Thus, expressing the observed value as a function of the true value and the measurement errors, we have,

$y_{i} = y_{i}^{*} + u_{i}$ $y_{i} = y_{i}^{*} + u_{i}$ (3)

$x_{i} = x_{i}^{*} + v_{i}$ $x_{i} = x_{i}^{*} + v_{i}$ (4)

Notations

Considering large sample approximation, the finite population correction $1 - f$ $1 - f$ can be ignored,

where

$f = \frac{n}{N}$ $f = \frac{n}{N}$

We define mean and variance of study variable $Y$ $Y$ and auxiliary variable $X$ $X$ as

$¯ ¯ ¯ X = \frac{1}{N} N \sum i = 1 X_{i}, ¯ ¯ ¯ Y = \frac{1}{N} N \sum i = 1 Y_{i}, σ_{X} = \frac{1}{N} N \sum i = 1 {(X_{i} - ¯ ¯ ¯ X)}^{2}, σ_{Y} = \frac{1}{N} N \sum i = 1 {(Y_{i} - ¯ ¯ ¯ Y)}^{2}$ $\bar{X} = \frac{1}{N} \sum_{i = 1}^{N} X_{i}, \bar{Y} = \frac{1}{N} \sum_{i = 1}^{N} Y_{i}, σ_{X} = \frac{1}{N} \sum_{i = 1}^{N} {(X_{i} - \bar{X})}^{2}, σ_{Y} = \frac{1}{N} \sum_{i = 1}^{N} {(Y_{i} - \bar{Y})}^{2}$

Further, we define the coefficient of variation of $X$ $X$ and $Y$ $Y$ as

$C_{X} = \frac{σ_{X}}{¯ ¯ ¯ X} and C_{Y} = \frac{σ_{Y}}{¯ ¯ ¯ Y} respectively$ $C_{X} = \frac{σ_{X}}{\bar{X}} and C_{Y} = \frac{σ_{Y}}{\bar{Y}} respectively$

Also Covariance of $Y$ $Y$ and $X$ $X$ , Correlation Coefficient between $Y$ $Y$ and $X$ $X$ , and Correlation Coefficient between $u$ $u$ and $v$ $v$ are defined as

$σ_{X Y} = \frac{1}{N} N \sum i = 1 (X_{i} - ¯ ¯ ¯ X) (Y_{i} - ¯ ¯ ¯ Y), ρ = \frac{σ_{X Y}}{σ_{X} σ_{Y}} and ρ^{*} = \frac{σ_{u v}}{σ_{v} σ_{u}} respectively$ $σ_{X Y} = \frac{1}{N} \sum_{i = 1}^{N} (X_{i} - \bar{X}) (Y_{i} - \bar{Y}), ρ = \frac{σ_{X Y}}{σ_{X} σ_{Y}} and ρ^{*} = \frac{σ_{u v}}{σ_{v} σ_{u}} respectively$

Using delta notation, we define the following:

$δ_{0} = \frac{¯ y}{¯ y} - 1 \Rightarrow ¯ y = ¯ ¯ ¯ Y (1 + δ_{o})$ $δ_{0} = \frac{\bar{y}}{\bar{y}} - 1 \Rightarrow \bar{y} = \bar{Y} (1 + δ_{o})$ (5)

$δ_{1} = \frac{¯ x}{¯ x} - 1 \Rightarrow ¯ x = ¯ ¯ ¯ X (1 + δ_{1})$ $δ_{1} = \frac{\bar{x}}{\bar{x}} - 1 \Rightarrow \bar{x} = \bar{X} (1 + δ_{1})$ (6)

Such that,

$E (δ_{0}) = E (δ_{1}) = 0$ $E (δ_{0}) = E (δ_{1}) = 0$ (7)

$E (δ_{0}^{2}) = \frac{σ_{Y}^{2}}{n θ_{Y} {¯ ¯ ¯ Y}^{2}}$ $E (δ_{0}^{2}) = \frac{σ_{Y}^{2}}{n θ_{Y} {\bar{Y}}^{2}}$ (8)

$E (δ_{1}^{2}) = \frac{σ_{X}^{2}}{n {¯ ¯ ¯ X}^{2}} (\frac{σ_{X}^{2} + σ_{v}^{2}}{σ_{X}^{2}}) = \frac{σ_{X}^{2}}{n θ_{X} {¯ ¯ ¯ X}^{2}}$ $E (δ_{1}^{2}) = \frac{σ_{X}^{2}}{n {\bar{X}}^{2}} (\frac{σ_{X}^{2} + σ_{v}^{2}}{σ_{X}^{2}}) = \frac{σ_{X}^{2}}{n θ_{X} {\bar{X}}^{2}}$ (9)

where,

$θ_{Y} = \frac{σ_{Y}^{2}}{σ_{Y}^{2} + σ_{u}^{2}}$ $θ_{Y} = \frac{σ_{Y}^{2}}{σ_{Y}^{2} + σ_{u}^{2}}$ and $θ_{X} = \frac{σ_{X}^{2}}{σ_{X}^{2} + σ_{v}^{2}}$ $θ_{X} = \frac{σ_{X}^{2}}{σ_{X}^{2} + σ_{v}^{2}}$ ,

and are bounded on (0,1).

Also,

$E (δ_{0 h} δ_{1 h}) = \frac{1}{n ¯ ¯ ¯ Y ¯ ¯ ¯ X} (C_{Y} C_{X} ρ + σ_{u} σ_{v} ρ^{*})$ $E (δ_{0 h} δ_{1 h}) = \frac{1}{n \bar{Y} \bar{X}} (C_{Y} C_{X} ρ + σ_{u} σ_{v} ρ^{*})$ (10)

Adapted Estimators

The traditional sample mean per unit estimator for estimating population mean when the sample data is contaminated with measurement error is given by:

$t_{0} = ¯ y$ $t_{0} = \bar{y}$ (11)

The variance is given as

$V (t_{0}) = \frac{C_{Y}^{2}}{n θ_{Y}}$ $V (t_{0}) = \frac{C_{Y}^{2}}{n θ_{Y}}$ (12)

Shalabh and Jia-Ren⁹proposed ratio estimator and product estimator when the general assumption on the measurement errors is relaxed as

$t_{1} = ¯ y \frac{¯ ¯ ¯ X}{¯ x}$ $t_{1} = \bar{y} \frac{\bar{X}}{\bar{x}}$ (13)

$t_{2} = ¯ y \frac{¯ x}{¯ ¯ ¯ X}$ $t_{2} = \bar{y} \frac{\bar{x}}{\bar{X}}$ (14)

They obtained the mean square error of ratio and product estimators as

$M S E (t_{1}) = \frac{{¯ ¯ ¯ Y}^{2}}{n} (\frac{C_{Y}^{2}}{θ_{Y}} + \frac{C_{X}^{2}}{θ_{X}} - 2 (C_{Y} C_{X} ρ + \frac{σ_{u} σ_{v} ρ^{*}}{¯ ¯ ¯ Y ¯ ¯ ¯ X}))$ $M S E (t_{1}) = \frac{{\bar{Y}}^{2}}{n} (\frac{C_{Y}^{2}}{θ_{Y}} + \frac{C_{X}^{2}}{θ_{X}} - 2 (C_{Y} C_{X} ρ + \frac{σ_{u} σ_{v} ρ^{*}}{\bar{Y} \bar{X}}))$ (15)

$M S E (t_{2}) = \frac{{¯ ¯ ¯ Y}^{2}}{n} (\frac{C_{Y}^{2}}{θ_{Y}} + \frac{C_{X}^{2}}{θ_{X}} + 2 (C_{Y} C_{X} ρ + \frac{σ_{u} σ_{v} ρ^{*}}{¯ ¯ ¯ Y ¯ ¯ ¯ X}))$ $M S E (t_{2}) = \frac{{\bar{Y}}^{2}}{n} (\frac{C_{Y}^{2}}{θ_{Y}} + \frac{C_{X}^{2}}{θ_{X}} + 2 (C_{Y} C_{X} ρ + \frac{σ_{u} σ_{v} ρ^{*}}{\bar{Y} \bar{X}}))$ (16)

Proposed estimator

Motivated by the Shalabh and Jia-Ren,⁹ we propose the following modified ratio estimator to estimate population mean in the presence of correlated measurement errors as

$t_{r} = ¯ y {(\frac{¯ ¯ ¯ X + ρ}{¯ x + ρ})}^{β}$ $t_{r} = \bar{y} {(\frac{\bar{X} + ρ}{\bar{x} + ρ})}^{β}$ (17)

where $β$ $β$ is any real number chosen so as to minimize the mean squared errors of $t_{1}$ $t_{1}$ . It may be noted that the proposed modified estimator is a class of estimators and that the following estimators are particular members of the proposed estimators when

$β = 0, t_{r 0} = ¯ y$ $β = 0, t_{r 0} = \bar{y}$ (18)

$β = 1, t_{r 1} = ¯ y (\frac{¯ ¯ ¯ X + ρ}{¯ x + ρ})$ $β = 1, t_{r 1} = \bar{y} (\frac{\bar{X} + ρ}{\bar{x} + ρ})$ (19)

$β = - 1, t_{r 2} = ¯ y (\frac{¯ x + ρ}{¯ ¯ ¯ X + ρ})$ $β = - 1, t_{r 2} = \bar{y} (\frac{\bar{x} + ρ}{\bar{X} + ρ})$ (20)

$β = \frac{1}{2}, t_{r 3} = ¯ y {(\frac{¯ ¯ ¯ X + ρ}{¯ x + ρ})}^{\frac{1}{2}}$ $β = \frac{1}{2}, t_{r 3} = \bar{y} {(\frac{\bar{X} + ρ}{\bar{x} + ρ})}^{\frac{1}{2}}$ (21)

$β = - \frac{1}{2}, t_{r 4} = ¯ y {(\frac{¯ ¯ ¯ X + ρ}{¯ x + ρ})}^{- \frac{1}{2}}$ $β = - \frac{1}{2}, t_{r 4} = \bar{y} {(\frac{\bar{X} + ρ}{\bar{x} + ρ})}^{- \frac{1}{2}}$ (22)

Properties of proposed estimator

Using notations defined in Section 3, we obtain the properties of the proposed estimators. Expressing (17) in terms of $δ_{i}, (i = 0, 1)$ $δ_{i}, (i = 0, 1)$

$t_{r} = ¯ ¯ ¯ Y (1 + δ_{0}) {(\frac{¯ ¯ ¯ X + ρ}{¯ ¯ ¯ X (1 + δ_{1}) + ρ})}^{β}$ $t_{r} = \bar{Y} (1 + δ_{0}) {(\frac{\bar{X} + ρ}{\bar{X} (1 + δ_{1}) + ρ})}^{β}$ (23)

(23) can be rewritten as

$t_{r} = ¯ ¯ ¯ Y (1 + δ_{0}) {(1 + \frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ} δ_{1})}^{β}$ $t_{r} = \bar{Y} (1 + δ_{0}) {(1 + \frac{\bar{X} ρ}{\bar{X} + ρ} δ_{1})}^{β}$

$= ¯ ¯ ¯ Y (1 + δ_{0}) [1 - β (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) δ_{1} + \frac{β (β + 1)}{2} {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} δ_{1} + O (δ_{1})]$ $= \bar{Y} (1 + δ_{0}) [1 - β (\frac{\bar{X} ρ}{\bar{X} + ρ}) δ_{1} + \frac{β (β + 1)}{2} {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} δ_{1} + O (δ_{1})]$

$t_{r} = ¯ ¯ ¯ Y + ¯ ¯ ¯ Y [δ_{0} - β (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) δ_{1} δ_{0} + \frac{β (β + 1)}{2} {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} δ_{1}^{2} δ_{0} - β (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) δ_{1} + \frac{β (β + 1)}{2} {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} δ_{1}^{2}]$ $t_{r} = \bar{Y} + \bar{Y} [δ_{0} - β (\frac{\bar{X} ρ}{\bar{X} + ρ}) δ_{1} δ_{0} + \frac{β (β + 1)}{2} {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} δ_{1}^{2} δ_{0} - β (\frac{\bar{X} ρ}{\bar{X} + ρ}) δ_{1} + \frac{β (β + 1)}{2} {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} δ_{1}^{2}]$

$t_{r} - ¯ ¯ ¯ Y = ¯ ¯ ¯ Y [δ_{0} - β (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) δ_{1} δ_{0} + \frac{β (β + 1)}{2} {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} δ_{1}^{2} δ_{0} - β (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) δ_{1} + \frac{β (β + 1)}{2} {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} δ_{1}^{2}]$ $t_{r} - \bar{Y} = \bar{Y} [δ_{0} - β (\frac{\bar{X} ρ}{\bar{X} + ρ}) δ_{1} δ_{0} + \frac{β (β + 1)}{2} {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} δ_{1}^{2} δ_{0} - β (\frac{\bar{X} ρ}{\bar{X} + ρ}) δ_{1} + \frac{β (β + 1)}{2} {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} δ_{1}^{2}]$ (24)

Taking expectation of both sides of (24) and making necessary substitutions using (8), (9) and (10) and simplifying the bias up to first order approximation, (24) becomes

$B i a s (t_{r}) = E (t_{r} - ¯ ¯ ¯ Y) = \frac{¯ ¯ ¯ Y β}{n} [(\frac{β + 1}{2}) {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} \frac{C_{X}}{θ_{X}} - (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v} ρ^{*}}{¯ ¯ ¯ Y ¯ ¯ ¯ X})]$ $B i a s (t_{r}) = E (t_{r} - \bar{Y}) = \frac{\bar{Y} β}{n} [(\frac{β + 1}{2}) {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} \frac{C_{X}}{θ_{X}} - (\frac{\bar{X} ρ}{\bar{X} + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v} ρ^{*}}{\bar{Y} \bar{X}})]$ (25)

Squaring and taking expectation of both sides of (24) and making necessary substitution using (8), (9) and (10) and simplifyingthe mean square error up to first order approximation, (24) becomes

$M S E (t_{r}) = E {(t_{r} - ¯ ¯ ¯ Y)}^{2} = \frac{{¯ ¯ ¯ Y}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + β^{2} {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} - 2 β (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})]$ $M S E (t_{r}) = E {(t_{r} - \bar{Y})}^{2} = \frac{{\bar{Y}}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + β^{2} {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} - 2 β (\frac{\bar{X} ρ}{\bar{X} + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})]$ (26)

Using the least square method which seek to minimize sum of square errors, we obtain the optimum value $β$ $β$ which minimizes the mean square error of $t_{r}$ $t_{r}$ as

$β = β_{o p t} = (\frac{¯ ¯ ¯ X + ρ}{¯ ¯ ¯ X ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*}) \frac{θ_{X}}{C_{X}^{2}}$ $β = β_{o p t} = (\frac{\bar{X} + ρ}{\bar{X} ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*}) \frac{θ_{X}}{C_{X}^{2}}$ (27)

Substituting (27) in (26) we obtain minimum mean square error of $t_{r}$ $t_{r}$ as

$M S E_{min} (t_{r}) = \frac{{¯ ¯ ¯ Y}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} - \frac{θ_{X}}{C_{X}^{2}} {(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})}^{2}]$ $M S E_{\min} (t_{r}) = \frac{{\bar{Y}}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} - \frac{θ_{X}}{C_{X}^{2}} {(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})}^{2}]$ (28)

The variance and the mean square errors of the estimators which are particular members of the proposed modified estimator can easily be obtained by substituting the appropriate values of $β = 0, 1, - 1, \frac{1}{2}, - \frac{1}{2}$ $β = 0, 1, - 1, \frac{1}{2}, - \frac{1}{2}$ in (26). Thus,

$V a r (t_{r 0}) = \frac{{¯ ¯ ¯ Y}^{2}}{n} \frac{C_{Y}^{2}}{θ_{Y}}$ $V a r (t_{r 0}) = \frac{{\bar{Y}}^{2}}{n} \frac{C_{Y}^{2}}{θ_{Y}}$ (29)

$M S E (t_{r 1}) = \frac{{¯ ¯ ¯ Y}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} - 2 (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})]$ $M S E (t_{r 1}) = \frac{{\bar{Y}}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} - 2 (\frac{\bar{X} ρ}{\bar{X} + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})]$ (30)

$M S E (t_{r 2}) = \frac{{¯ ¯ ¯ Y}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} + 2 (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})]$ $M S E (t_{r 2}) = \frac{{\bar{Y}}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} + 2 (\frac{\bar{X} ρ}{\bar{X} + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})]$ (31)

$M S E (t_{r 3}) = \frac{{¯ ¯ ¯ Y}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + \frac{1}{4} {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} - (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})]$ $M S E (t_{r 3}) = \frac{{\bar{Y}}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + \frac{1}{4} {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} - (\frac{\bar{X} ρ}{\bar{X} + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})]$ (32)

$M S E (t_{r 4}) = \frac{{¯ ¯ ¯ Y}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + {(\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} + 2 (\frac{¯ ¯ ¯ X ρ}{¯ ¯ ¯ X + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})]$ $M S E (t_{r 4}) = \frac{{\bar{Y}}^{2}}{n} [\frac{C_{Y}^{2}}{θ_{Y}} + {(\frac{\bar{X} ρ}{\bar{X} + ρ})}^{2} \frac{C_{X}^{2}}{θ_{X}} + 2 (\frac{\bar{X} ρ}{\bar{X} + ρ}) (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})]$ (33)

Theoretical efficiency comparison of t_r with some existing estimators

The optimum mean square error of $t_{r}$ $t_{r}$ was compared with the existing estimators $t_{0}, t_{1}, t_{2}$ $t_{0}, t_{1}, t_{2}$ . Thus, from (28) and(12), we observed that

$M S E_{min} (t_{r}) - V a r (t_{0}) = - {(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})}^{2} < 0$ $M S E_{\min} (t_{r}) - V a r (t_{0}) = - {(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})}^{2} < 0$ (34)

Since ${(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})}^{2}$ ${(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})}^{2}$ will always be positive, (34) will always be negative, and the proposed estimator will always be more efficient than the usual unbiased sample mean per unit estimator.

From (28) and (15), we observed that

$M S E_{min} (t_{r}) - M S E (t_{1}) = - \frac{θ_{X}}{C_{X}^{2}} {(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})}^{2} - \frac{C_{X}^{2}}{θ_{X}} + 2 (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*}) < 0$ $M S E_{\min} (t_{r}) - M S E (t_{1}) = - \frac{θ_{X}}{C_{X}^{2}} {(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})}^{2} - \frac{C_{X}^{2}}{θ_{X}} + 2 (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*}) < 0$ (35)

From (28) and (16), we observed that

$M S E_{min} (t_{r}) - M S E (t_{2}) = - \frac{θ_{X}}{C_{X}^{2}} {(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*})}^{2} - \frac{C_{X}^{2}}{θ_{X}} - 2 (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{¯ ¯ ¯ Y ¯ ¯ ¯ X} ρ^{*}) < 0$ $M S E_{\min} (t_{r}) - M S E (t_{2}) = - \frac{θ_{X}}{C_{X}^{2}} {(ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*})}^{2} - \frac{C_{X}^{2}}{θ_{X}} - 2 (ρ C_{Y} C_{X} + \frac{σ_{u} σ_{v}}{\bar{Y} \bar{X}} ρ^{*}) < 0$ (36)

From (34), (35) and (36), the proposed estimator will always be more efficient than the sample mean per unit estimator, ratio estimator and product estimator in the presence of correlated measurement errors.

Empirical efficiency comparison

The efficiency of the proposed estimator $t_{r}$ $t_{r}$ is illustrated using hypothetical data set on income and expenditure from Gujarati and Porter.¹⁰

$y_{i}^{*} = Household Spending (True Value)$ $y_{i}^{*} = Household Spending (True Value)$

$x_{i}^{*} = Household Earning (True Value)$ $x_{i}^{*} = Household Earning (True Value)$

$y_{i} = Household Spending (Observed Value)$ $y_{i} = Household Spending (Observed Value)$

$x_{i} = Household Earning (Observed Value)$ $x_{i} = Household Earning (Observed Value)$

The following values of the parameter were obtained from the given data.

N	$¯ ¯ ¯ Y$ $\bar{Y}$	$¯ ¯ ¯ X$ $\bar{X}$	$σ_{Y}^{2}$ $σ_{Y}^{2}$	$σ_{X}^{2}$ $σ_{X}^{2}$	$σ_{u}^{2}$ $σ_{u}^{2}$	$σ_{v}^{2}$ $σ_{v}^{2}$	$ρ$ $ρ$	$ρ^{}$ $ρ^{}$	$θ_{Y}$ $θ_{Y}$	$θ_{X}$ $θ_{X}$
10	127	170	1278	3300	36	41	0.964	-0.09087	0.975	0.988

Table 1 Value of the Parameters

Table 2 shows the percentage relative efficiency (PRE) with respect to sample mean per unit $¯ y$ $\bar{y}$ of the proposed estimator and some existing estimator. This was defined as

$P R E (\cdot) = \frac{V a r (¯ y)}{M S E (\cdot)} \times 100$ $P R E (\cdot) = \frac{V a r (\bar{y})}{M S E (\cdot)} \times 100$ (37)

Estimators	Mean square error	Percentage relative efficiency
$t_{0}$ $t_{0}$	131.3974	100
$t_{r_{o p t}}$ $t_{r_{o p t}}$	14.4820	907.32
$t_{1}$ $t_{1}$	22.5620	582.38
$t_{2}$ $t_{2}$	613.1759	21.43
$t_{r 1}$ $t_{r 1}$	19.6744	667.86
$t_{r 2}$ $t_{r 2}$	611.8517	21.48
$t_{r 3}$ $t_{r 3}$	32.6882	401.97
$t_{r 4}$ $t_{r 4}$	315.8020	41.61

Table 2 Mean square error and relative efficiency

Further illustration of the efficiency of the proposed estimator was done using another hypothetical dataset from Okafor¹² on land area available for cultivation and land area cultivate with maize, where,

$y_{i} = the observed land area of the village cultivated with maize$ $y_{i} = the observed land area of the village cultivated with maize$

$x_{i} = the observed land area of the village avaliable for cultivation$ $x_{i} = the observed land area of the village avaliable for cultivation$

$y_{i}^{*} = the true land area of the village cultivated with maize$ $y_{i}^{*} = the true land area of the village cultivated with maize$

$x_{i}^{*} = the true land area of the village avaliable for cultivation$ $x_{i}^{*} = the true land area of the village avaliable for cultivation$

The following values for the population parameter were obtained from the given data.

N	$\bar{Y}$	$\bar{X}$	$σ_{Y}^{2}$	$σ_{X}^{2}$	$σ_{u}^{2}$	$σ_{v}^{2}$	$ρ$	$ρ^{*}$	$θ_{Y}$	$θ_{X}$
20	530.08	829.16	61824.97	190361.30	9.57	9.31	0.814	0.998	0.99985	0.99995

Table 3 Value of the Parameters Population II

Table 4 shows the mean squared error and percentage relative efficiency (PRE) of the proposed estimator and some estimators which are particular members of the proposed modified estimator with respect to sample mean per unit $\bar{y} .$

Estimators	Mean square error	Percentage relative efficiency
$t_{0}$	3091.712	100.00
$t_{r_{o p t}}$	0.892	346460.000
$t_{r 1}$	1073.425	288.023
$t_{r 2}$	10253.820	30.152
$t_{r 3}$	2587.140	119.503
$t_{r 4}$	4882.238	63.326
$t_{1}$	1336.565	231.318
$t_{2}$	12627.140	24.485

Table 4 Mean Squared Error and Percentage Relative Efficiency

For different values of $β$ , we also obtained the relative efficiency of $t_{r}$ over $t_{0}$ defined as

$P R E (.) = \frac{V a r (t_{0})}{M S E (t_{r})}$ (38)

Table 5 represents the relative efficiency of $t_{r}$ with respect to $t_{0}$ for different values of $β$ .

Value of $β$	MSE(t_r)	Relative Efficiency
0.00	131.397	1.000
0.05	117.645	1.117
0.10	104.750	1.254
0.15	92.711	1.417
0.20	81.530	1.612
0.25	71.205	1.845
0.30	61.738	2.128
0.35	53.127	2.473
0.40	45.374	2.896
0.45	38.477	3.415
0.50	32.437	4.051
0.55	27.255	4.821
0.60	22.929	5.731
0.65	19.460	6.752
0.70	16.848	7.799
0.75	15.093	8.706
0.80	14.195	9.256
$β_{o p t} = 0.828$	14.067	9.341
0.85	14.154	9.283
0.90	14.970	8.777
0.95	16.643	7.895
1.00	19.173	6.853
1.05	22.560	5.824
1.10	26.803	4.902
1.15	31.904	4.119
1.20	37.862	3.470
1.25	44.676	2.941
1.30	52.348	2.510
1.35	60.876	2.158
1.40	70.262	1.870
1.45	80.504	1.632
1.50	91.603	1.434
1.55	103.560	1.269

Table 5 Relative efficiency of $t_{r}$ with respect to $t_{0}$ for different values of $β$

Conclusion

The main aim of this work is to ascertain the extent of the impact of correlated measurement errors on the quality of sample statistics which estimate the population parameters. Thus, since $B i a s (t_{r})$ is a function of $θ_{X},$ it shows that the bias of the proposed class of estimator is affected by the presence of correlated measurement error in the auxiliary variable. Also $M S E_{m i n} (t_{r})$ is a function of $θ_{Y}, θ_{X},$ it also showed that the mean squared error of the proposed class of estimator is affected by presence of correlated measurement errors in both study and auxiliary variables. Also the proposed modified ratio estimator at its optimum value has more gain in efficiency than some existing estimators in the presence of correlated measurement errors. The study also revealed that even when the proposed modified ratio estimator deviates from its optimum value, there are still range of estimators at different values of $β$ to choose from. Therefore, the proposed estimator should be preferred in practice.