It is often useful to compare multiple subgroups to assess meaningful differences. The purpose of this editorial is to summarize a method for using a ratio to detect significant differences in associations across multiple population subgroups. In Statistics, two quantities can be compared by taking difference (for example, differences between two means:
, or difference between two proportions:
), or taking ratios (for example, ratios of two proportions:
, also known as a relative risk: (RR)). Also, odds ratio (OR), incidence rate ratio (IR) or hazard ratio (HR) can be used to compare two groups, such as treatment vs control, or exposed vs unexposed. The standard error of the ratio should be computed if we conduct ratio analysis. For example, to compute the confidence interval for the estimated odds ratio we need to compute the standard error (SE) of log(OR):
where nij are sometimes replaced by
, when some of the
are zero. This result can be derived under the assumption of multinomial sampling by using the Delta Method. When RR is used to compare two quantities, the log transformation is conducted, that is
is often considered instead of
, since the former has a sampling distribution which is closer to normal than that of the latter. The estimated asymptotic standard error (ASE) of log(RR):
This result can be derived under the assumption of independent binomial sampling using the Delta Method, where
and
are sample proportions based on independent binomial samples with success probabilities
and
, respectively. The confidence interval (CI) of log(RR) (Wald CI for log
) can be calculated as follows:
The CI tends to be slightly conservative (i.e., the actual coverage probability tends to be higher than the nominal level). Exponentiating the endpoints provides a CI for
.
Now we discuss applying Delta Method to estimate the SE of a trans-formed parameter. The delta method, in its essence, expands a function of a random variable about its mean. Usually with a one-step Taylor approximation, and then takes the variance. For example, if we want to approximate the variance of G(X) where X is a random variable with mean
and function G is differentiable, we can try
This idea can easily be expanded to vector-valued functions of random vectors.
The Delta Method can be applied to Random effects meta-regression analysis, which can be used to investigate factors associated with the magnitude of the ratio of RR (or OR IR HR) (RRR). The triple R method can easily be extended to the quadruple R, the magnitude of the ratio of the ratio of RR (or OR IR HR), or RRRR.
To compare subgroups within selected studies, the natural logarithm transformation of the ratio of RR (or OR IR HR) values (RRR; or analogous estimates of association) for the two compared subgroups should be used. Since the logarithm of RRR has a sampling distribution which is closer to normal than RRR. To find the estimate of the RRR and its CI, the SE of RRR can be derived using the Delta Method. On the other hand, for meta-analysis fixed-model, we assume that there is no heterogeneity between the studies. The model assumes that within-study variances may differ, but that there is homogeneity of effect size across all studies. Often the homogeneity assumption is unlikely and variation in the true effect across studies is to be expected. Therefore, caution is required when using this model.1,2
For meta-analysis random-effects model (the most commonly used), we assume that models heterogeneity between the studies, or we assume that the true effect can be different for each study. For example, the effect estimates of urban and rural subpopulation was evaluated. First, the log transformation was conducted: the natural logarithm of the ratio of RR values (RRR; or analogous estimates of association) for the two subgroups, i.e., RR(rural)/RR(urban), a method given by Benmarhnia et al.3 The formula used to calculate the standard errors of the ratios is as follows (adopted from Benmarhnia et al.3):
Based on the formula above, Li et al.4 compared the heat-related mortality between rural and urban populations using the RRR method and found that there was no statistically significant difference between the two subgroups.