Comparing two quantities by using a ratio

doi:10.15406/bbij.2020.09.00318

eISSN: 2378-315X

Biometrics & Biostatistics International Journal

Editorial Volume 9 Issue 5

Comparing two quantities by using a ratio

Shimin Zheng,¹

Verify Captcha

Regret for the inconvenience: we are taking measures to prevent fraudulent form submissions by extractors and page crawlers. Please type the correct Captcha word to see email ID.

Michael Smith²

¹Department of Biostatistics and Epidemiology, East Tennessee State University, USA
²Department of Health Services Management & Policy, East Tennessee State University, USA

Correspondence: Shimin Zheng, Department of Biostatistics and Epidemiology, East Tennessee State Uni-versity, Box 70259, Johnson City, TN 37614, USA

Received: October 26, 2020 | Published: October 31, 2020

Citation: Zheng S, Smith M. Comparing two quantities by using a ratio. Biom Biostat Int J. 2020;9(5):186-187. DOI: 10.15406/bbij.2020.09.00318

Download PDF

It is often useful to compare multiple subgroups to assess meaningful differences. The purpose of this editorial is to summarize a method for using a ratio to detect significant differences in associations across multiple population subgroups. In Statistics, two quantities can be compared by taking difference (for example, differences between two means: $μ_{1} - μ_{2}$ , or difference between two proportions: $p_{1} - p_{2}$ ), or taking ratios (for example, ratios of two proportions: $\frac{p_{1}}{p_{2}}$ , also known as a relative risk: (RR)). Also, odds ratio (OR), incidence rate ratio (IR) or hazard ratio (HR) can be used to compare two groups, such as treatment vs control, or exposed vs unexposed. The standard error of the ratio should be computed if we conduct ratio analysis. For example, to compute the confidence interval for the estimated odds ratio we need to compute the standard error (SE) of log(OR):

$\hat{σ} (\log (O R)) = \sqrt{(\frac{1}{n_{11}} + \frac{1}{n_{12}} + \frac{1}{n_{21}} + \frac{1}{n_{22}})},$

where n_ij are sometimes replaced by $n_{i j} + 0. 5$ , when some of the $n_{i j}$ are zero. This result can be derived under the assumption of multinomial sampling by using the Delta Method. When RR is used to compare two quantities, the log transformation is conducted, that is $\log ({\hat{π}}_{1 | 1} / {\hat{π}}_{1 | 2})$ is often considered instead of $({\hat{π}}_{1 | 1} / {\hat{π}}_{1 | 2})$ , since the former has a sampling distribution which is closer to normal than that of the latter. The estimated asymptotic standard error (ASE) of log(RR):

$\hat{σ} \log ({\hat{π}}_{1 | 1} / {\hat{π}}_{1 | 2}) = \sqrt{[\frac{(1 - {\hat{π}}_{1 | 1})}{{\hat{π}}_{1 | 1} n_{1 +}} + \frac{(1 - {\hat{π}}_{1 | 2})}{{\hat{π}}_{1 | 2} n_{2 +}}]}$

This result can be derived under the assumption of independent binomial sampling using the Delta Method, where ${\hat{π}}_{1 | 1}$ and ${\hat{π}}_{1 | 2}$ are sample proportions based on independent binomial samples with success probabilities $π_{1 | 1}$ and $π_{1 | 2}$ , respectively. The confidence interval (CI) of log(RR) (Wald CI for log $(π_{1 | 1} / π_{1 | 2})$ ) can be calculated as follows:

$\log ({\hat{π}}_{1 | 1} / {\hat{π}}_{1 | 2}) \pm z_{α /}_{2} (\hat{σ} (log ({\hat{π}}_{1 | 1} / {\hat{π}}_{1 | 2}))) .$

The CI tends to be slightly conservative (i.e., the actual coverage probability tends to be higher than the nominal level). Exponentiating the endpoints provides a CI for $(π_{1 | 1} / π_{1 | 2})$ .

Now we discuss applying Delta Method to estimate the SE of a trans-formed parameter. The delta method, in its essence, expands a function of a random variable about its mean. Usually with a one-step Taylor approximation, and then takes the variance. For example, if we want to approximate the variance of G(X) where X is a random variable with mean $μ$ and function G is differentiable, we can try

$G (X) \approx G (µ) + (X - µ) G^{'} (µ),$

$V a r (G (X)) \approx G^{'} (µ) V a r (X) {[G^{'} (µ)]}^{T} .$

This idea can easily be expanded to vector-valued functions of random vectors.

$V a r (G (X)) \approx G^{'} (µ) V a r (X) {[G^{'} (µ)]}^{T} .$

The Delta Method can be applied to Random effects meta-regression analysis, which can be used to investigate factors associated with the magnitude of the ratio of RR (or OR IR HR) (RRR). The triple R method can easily be extended to the quadruple R, the magnitude of the ratio of the ratio of RR (or OR IR HR), or RRRR.

To compare subgroups within selected studies, the natural logarithm transformation of the ratio of RR (or OR IR HR) values (RRR; or analogous estimates of association) for the two compared subgroups should be used. Since the logarithm of RRR has a sampling distribution which is closer to normal than RRR. To find the estimate of the RRR and its CI, the SE of RRR can be derived using the Delta Method. On the other hand, for meta-analysis fixed-model, we assume that there is no heterogeneity between the studies. The model assumes that within-study variances may differ, but that there is homogeneity of effect size across all studies. Often the homogeneity assumption is unlikely and variation in the true effect across studies is to be expected. Therefore, caution is required when using this model.^1,2

For meta-analysis random-effects model (the most commonly used), we assume that models heterogeneity between the studies, or we assume that the true effect can be different for each study. For example, the effect estimates of urban and rural subpopulation was evaluated. First, the log transformation was conducted: the natural logarithm of the ratio of RR values (RRR; or analogous estimates of association) for the two subgroups, i.e., RR(rural)/RR(urban), a method given by Benmarhnia et al.³ The formula used to calculate the standard errors of the ratios is as follows (adopted from Benmarhnia et al.³):

$S D (r a t i o) = r a t i o \times \sqrt{(\frac{S D R R_{r u r a l}^{2}}{R R_{r u r a l}}) + (\frac{S D R R_{u r b a n}^{2}}{R R_{u r b a n}})}$

Based on the formula above, Li et al.⁴ compared the heat-related mortality between rural and urban populations using the RRR method and found that there was no statistically significant difference between the two subgroups.