Submit manuscript...
MOJ
eISSN: 2379-6383

Public Health

Review Article Volume 4 Issue 4

An alternative approach to cochran Q test for dichotomous data

Okeh UM,1 Oyeka ICA,2 Igwenagu CM3

1Department of Industrial Mathematics and Applied Statistics, Ebonyi State University, Nigeria
2Department of Statistics, Nnamdi Azikiwe University, Nigeria
3Department of Industrial Mathematics and Applied Statistics and Demography, Enugu State University of Science and Technology, Nigeria

Correspondence: Okeh UM, Department of Industrial Mathematics and Applied Statistics, Ebonyi State University, Nigeria

Received: April 03, 2016 | Published: May 7, 2016

Citation: Okeh UM, Oyeka ICA, Igwenagu C. An alternative approach to cochran Q test for dichotomous data. MOJ Public Health. 2016;4(4):114–117. DOI: 10.15406/mojph.2016.04.00086

Download PDF

Abstract

This paper proposes, develops and presents a statistical method for the analysis of sample data where these data are dichotomous responses assuming only two possible mutually exclusive values such as 1, representing positive response say, and 0 representing negative response say. The Chi-square test statistic based on the Chi-square test for independence is developed as an alternative to the usual Cochran Q test for dichotomous data to test the null hypothesis of equal positive response rate by subjects to a set of test or treatment. The proposed method is illustrated with some sample data and shown to be at least as powerful as the usual Cochran Q test when applied to the same sample observations.

Keywords: chi-square, dichotomous data, cochran q test, subjects, responses, success

Introduction

Sometimes a researcher may perform an experiment involving repeated observations or blocks, in which the variable of interest is dichotomous, meaning that it can assume only one of two possible mutually exclusive values. One of these two possible values is considered a ‘success’, positive response, present, well, etc. This is often coded as ‘1’,while the other value may be considered a failure, negative response, absent, bad, etc often coded a ‘0’.1

Research interest would then be to determine the proportions of subjects responding positive if the sampled blocks of subjects are the same across all treatments or tests. In this situation Cochran Q test2-4 for the dichotomous data may be applied.

To adjust, and make allowance for some situations in which test or trial outcomes are not just dichotomous assuming only two mutually exclusive options such as 1 or 0,but when there may be some intermediate and third outcome such as unknown, indeterminate, non-definitive, etc, that may be code with say a minus sign(-). Oyeka, et al.5 introduced a third category of response and developed a Chi-square (χ2)(χ2) test statistic for independence to test the null hypothesis of equal positive response rates under various treatments or test.

We will here however construct and alternative test statistic similar to Cochran Q test assuming that there are only two possible response outcomes or options that may be coded as either 1 or 0.

Methods

The proposed method would then be compared with the usual Cochran Q test as shown below.

The proposed method

Sometimes a researcher may be interested in comparing responses of blocks of subjects to a set of treatments in a diagnostic screened test or clinical trials. Specifically, suppose a researcher has collected a random sample of n block of subjects matched on some demographic characteristics such as age, sex, body weight, etc, where each block of subjects contain some c matched subjects and interest of the researcher is to administer each of these c subjects randomly one of c treatments. The subject responses to each of the treatments are all dichotomous assuming only one of two possible and mutually exclusive response options such as positive, or negative, present or absent, good or bad; success or failure, dead or alive, etc.

Let xijxij be the response by a randomly selected subject from the ith block of subjects administered treatment TjTj for i=1,2,…,n;j=1,2,…,c where each xijxij is dichotomous, either positive or negative, present or absent etc. To develop a test statistic to help determine whether on the average subject responses are the same for all treatments or conditions we may

Let
uij={1,ifarandomlyselectedsubjectfromtheithblockofsubjectsrespondpositive,indicatingconditionpresent,goodwhenadministeredtreatmentTj0,ifthesesamesubjectrespondsnegative,indicatingconditionabsent,badwhenadminiterdtreatmentTjuij=⎪ ⎪ ⎪ ⎪⎪ ⎪ ⎪ ⎪1,ifarandomlyselectedsubjectfromtheithblockofsubjectsrespondpositive,indicatingconditionpresent,goodwhenadministeredtreatmentTj0,ifthesesamesubjectrespondsnegative,indicatingconditionabsent,badwhenadminiterdtreatmentTj (1)

For i=1,2,..,n;j=1,2,..c.

Letπ+j=P(uij=1)(2)andWj=f+.j=ni=1uij(3)Letπ+j=P(uij=1)(2)andWj=f+.j=ni=1uij(3)

Be the total number of positive responses, that is total number of 1’s by subjects administered treatment TjTj and

f0.j=nf+.j(4)f0.j=nf+.j(4)

Be the total number of negative response that is number of 0’s obtained when subjects are administered treatment TjTj .

Now the total number of positive responses, that is the total number of 1’s and the total number of negative responses, that is the total number of 0’s for all the c treatments are respectively

f+..=cj=1f+.j;f0..=cj=1(nf+.j)=ncf+..(5)f+..=cj=1f+.j;f0..=cj=1(nf+.j)=ncf+..(5)

Note that the expected value and variance of uijuij are respectively

E(uij)=π+.j,Var(uij)=π+.j(1π+.j)(6)E(uij)=π+.j,Var(uij)=π+.j(1π+.j)(6)

The sample estimate of π+jπ+j is

ˆπ+.j=p.j=f+.jn(7)ˆπ+.j=p.j=f+.jn(7)

Similarly, the overall sample proportion of positive responses for all the c treatments is

p..=f+..nc(8)p..=f+..nc(8)

To obtain a test statistic for the null hypothesis that subjects on the average do not differ in their positive responses to all the c treatments, it would be easier to use the Chi-square test for independence.

Under this approach, we note that the observed frequencies of positive and negative responses by subjects at treatment TjTj that is the total number of 1’s and 0’s at treatment TjTj are respectively

o1j=f+.j;o2j=f0.j=nf+.j(9)o1j=f+.j;o2j=f0.j=nf+.j(9)

Now, under the null hypothesis of equal positive responses for all the c treatments, the expected frequencies of positive and negative responses for TjTj are respectively

E1j=n.f+..nc=f+..c;O1j=n.f0..nc=f0..c;O2j=ncf0..c;E2j=ncf+..c(10)E1j=n.f+..nc=f+..c;O1j=n.f0..nc=f0..c;O2j=ncf0..c;E2j=ncf+..c(10)

Hence,the required χ2χ2 test statistic is

χ2=cj=12i=1(OijEij)2Eij(11)χ2=cj=12i=1(OijEij)2Eij(11)

With c-1 degrees of freedom under the null hypothesis H0

Substituting equations (9) and (10) in equation (11), we obtain

χ2=cj=1(f+.jf+..c)/f+..c+cj=1((nf+.j)(ncf+..)c)2ncf+..cχ2=cj=1(f+.jf+..c)/f+..c+cj=1((nf+.j)(ncf+..)c)2ncf+..c

Which when further simplified becomes

χ2=n.c2f+..(ncf+..)cj=1(f+.jf+..c)(12)χ2=n.c2f+..(ncf+..)cj=1(f+.jf+..c)(12)

Which under the null hypothesis H0 of equal positive response rate has approximately the χ2χ2 distribution with c-1 degrees of freedom for sufficiently large n and c. Equation (12) when expressed in terms of sampled proportion becomes

χ2=np..(1p..)cj=1(p+.jp..)2=n.(cj=1(p+2.jp2..))p..(1p..)(13)χ2=np..(1p..)cj=1(p+.jp..)2=n.(cj=1(p+2.jp2..))p..(1p..)(13)

Illustrative example: The effects of four drug presentations on patients are to be studied. Interest is to determine whether or not the four drugs equally improved patients’ condition. Sixty patients are selected and grouped into 15 blocks so that the four patients in each block are approximately identical in age, initial condition, sex, etc. Patients in each block are randomly selected for treatment with only one of the four experimental drugs. After the specified medication period, the patients are classified as either improved (success) or not improved (failure) under a given drug and coded with a 1 or 0 respectively. The results are shown in Table 1. Can it be concluded on the basis of these data that patients improve equally on all the four drugs?

Drugs(treatments)

Patients(Blocks)

Drug 1

Drug 2

Drug 3

Drug 4

Total

1

1

1

0

0

2

2

1

1

0

1

2

3

1

0

0

0

1

4

1

1

1

1

4

5

1

1

0

1

3

6

0

1

0

0

1

7

0

1

1

0

2

8

1

1

1

0

3

9

0

0

1

0

1

10

0

0

0

0

0

11

0

0

0

0

0

12

1

1

0

1

3

13

1

0

0

1

2

14

0

1

1

0

2

15

1

1

0

0

2

Total(n)

15

15

15

15

6060

f+.jf+.j

9

10

5

5

2929

f0.jf0.j

6

5

10

10

3131
p.jp.j

0.6

0.67

0.33

0.33

0.480.48

Table 1 Patients’ response to four drug preparations (1=improved, 0=Not improved)

An illustration of proposed alternative to Cochran Q test: To illustrate the proposed method we apply equation 1 to obtain values of uijuij for the data of Table 1,for i=1,2,…,15;j=1,2,…,4.The summary values of uijuij that is of f+.jandf0.jf+.jandf0.j with the corresponding sample proportions p.jp.j of 1’s are shown at the bottom of Table 1, for j=1,2,3,4. Using the values of p.jp.j in equation 13 we obtain the Chi-square test statistic for the null hypothesis of no difference in possible response rates, that is the proportion of subjects, or patients improving under the four drugs as

χ2=(15)((0.60)2+(0.67)2+(0.33)2+(0.33)240(0.48)2)(0.48)(0.52)χ2=(15)(1.0270.922)0.250=(15)(0.105)0.25=1.5750.25=6.300(pvalue=0.0984),χ2=(15)((0.60)2+(0.67)2+(0.33)2+(0.33)240(0.48)2)(0.48)(0.52)χ2=(15)(1.0270.922)0.250=(15)(0.105)0.25=1.5750.25=6.300(pvalue=0.0984),

Which with 4-1=3 degrees of freedom is not statistically significant (χ20.99;3=11.35) , leading to the conclusion that patients may not have deferred in their improvement rates over the four drugs.

Now that if one had instead applied the usual Cochran Q test to analyze the same data, we would obtain the required test statistic as

Q=(c1)[4j=1T2.j(4j=1T.j)2/c]15i=1Bi(15i=1B2i/c)=(41)((9)2+(10)2+(5)2+(5)2)/4(2+3...+2)((2)2+(3)2+...+(2))2/4=(3)(231210.25)2918.75=62.2510.25=6.073(pvalue=0.1057)

Which with 4-1=3 degrees of freedom is also not statistically significant (χ20.99;3=11.35) .However the Chi-square value of 6.073 obtained using the usual Cochran Q test is slightly less than the corresponding Chi-square value of 6.300 obtained using the proposed method. Hence the usual Cochran Q test statistic is likely to lead to an acceptance of false null hypothesis (Type II error) more frequently and is therefore likely to be less powerful than the present method at least for the present data.

Summary and conclusion

We have discussed and presented above an alternative modified and probably easier method for the analysis of sample data that may be appropriate for use with the usual Cochran Q test.

A test statistic based on the Chi-square test for independence is developed for testing the null hypothesis that subjects or blocks of matched subjects on the average do not differ in proportions responding positive when administered a number of tests or treatments in a diagnostic screening test or clinical trial.

The proposed test method is illustrated with some sample data and shown to be at least as powerful as the usual Cochran Q test when applied to data of equal sizes. This is because the Chi-square value of 6.073 obtained using the usual Cochran Q test is slightly less than the corresponding Chi-square value of 6.300 obtained using the proposed method. Hence the usual Cochran Q test statistic is likely to lead to an acceptance of false null hypothesis (Type II error) more frequently and is therefore likely to be less powerful than the present method at least for the present data.

Acknowledgements

None.

Conflict of interest

The author declares no conflict of interest.

References

  1. Oyeka ICA, Nnanatu CC. Pairwise Comparison in Repeated Measures. Journal of Modern Applied Statistical Methods. 2014;13(2):8.
  2. Oyeka CA. An Introduction to Applied Statistical Methods. 8th ed. Nigeria: Nobern Avocation Publishing Company; 2010.
  3. Cochran WG. The comparison of percentages in matched samples. Biometrika. 1950;37(3–4):256–266.
  4. Freund JE. Mathematical Statistics. 5th ed. USA: Prentice–Hall Internal Editions; 1992.
  5. Oyeka CA, Utazi CE, Nwosu CR, et al. A Statistical Comparison of Test Scores: A Non–Parametric Approach. Journal of Mathematical Sciences. 2010;21(1):77–87.
Creative Commons Attribution License

©2016 Okeh, et al. This is an open access article distributed under the terms of the, which permits unrestricted use, distribution, and build upon your work non-commercially.

Media Partner & Supporters