Nonparametric Testing Using The Chi-Square Distribution: Reading Tips
Nonparametric Testing Using The Chi-Square Distribution: Reading Tips
a
healthy infants
b
5
c
high-risk infants
no abnormalities
25
d
20
10
DEFINITIONS
Before we explain how to perform the analysis, we will give
some basic information about the chi-square test. The chisquare test is a nonparametric statistical test. Nonparametric
or distribution-free methods of analysis, as opposed to parametric tests, do not assume a normal distribution of the
population from which the samples are drawn. Nonparametric tests are used with ordinal and nominal data. Nonparametric methods can be used to analyze studies that have
collected interval or ratio data. These data, however, will have
to be reduced to nominal or ordinal data.2,4,5
The chi-square statistic gives a measure of the discrepancies
between the observed and expected frequencies (nominal
data). If the discrepancies are large, the chi-square value will
be large. If no discrepancies exist, the chi-square value will be
0. Negative chi-square values do not occur (as you will see
later) because the numerator in the chi-square formula always
is squared, which eliminates the possibility of a negative
number occurring. This test is used when you want to know
if the differences between the observed and expected frequencies are significant. Although chi square can be used to analyze
studies that collect ratio data, that data will have to be reduced
to frequency data for analysis. The test usually is used with
discontinuous frequency data classified into mutually exclusive categories (eg, pass-fail, yes-no). Most commonly, experimental studies with nonparametric hypotheses have independent groups of subjects being measured on some qualitative dependent variable.
ANALYSIS AND INTERPRETATION
If we are to compare the observed frequencies (fo) with the
expected frequencies (fe) under the null hypothesis, we need
to calculate the expected frequencies. This is a simple process.
First find the row (r) and column (c) totals and the total
number of observations (subjects). The expected frequencies
for each cell are calculated by multiplying the r sum that
contains the cell by the c sum that contains the cell and
dividing by the total number of subjects (N). In our example
Cell a: Fe =
= 12.5
(1)
Cell b: Fe =
= 17.5
(2)
Cell c: Fe =
= 12.5
(3)
Cell d: Fe =
= 17.5
(4)
PHYSICAL THERAPY
healthy infants
high-risk infants
column sum
no abnormalities
fo = 5
fo = 25
fe = 12.5
fe = 17.5
c
fo = 20
d
fo = 10
fe = 12.5
fe = 17.5
25
row sum
35
30
30
60
2 = 13.44
In our example, the correction for continuity reduced our
calculated chi-square value from 15.4 to 13.44.
2 2 Special Formula
For the 2 2 table only, a special alternative formula can
be used that makes the calculation easier. The uncorrected
formula is
(7)
a
healthy infants
c
high-risk infants
column sums
no abnormalities
b
row sum
25
30
10
30
d
20
25
35
60
COMBINING CATEGORIES
If your study has a number of categories that have small
observed frequencies, it is advisable to combine categories,
when possible, to increase the number of observations in each
cell. For example, if in your study two out of six categories
are similar and have very small observed frequencies, com
bining the two categories into one will give you a larger
frequency count.
The problem with small expected frequencies is that you
increase the likelihood of making an error in estimating
probabilities from the theoretical frequency curve used to
determine critical significant values. Thus, a good policy is to
combine categories with small observed frequencies when
possible.1.2
PHYSICAL THERAPY
row sums
above
median
11
25
35
71
below
median
15
29
26
32
42
100
column sums
total
above
median
fo = 25
fo = 35
f o = 11
fe = 25.56 fe = 22.72 fe = 29.82
below
median
fo = 7
fo = 7
fo = 15
fe = 10.44 fe = 9.28 fe = 12.18
29
32
100
column sums
36
42
row sums
71
total
(10)
267
REFERENCES
1. Michels E: Design of Research and Analysis of Data in the Clinic: An
Introductory Manual for Clinical Research. Alexandria, VA, American Physical Therapy Association, 1982
2. Ferguson GA: Statistical Analysis in Psychology and Education, ed 4. New
York, NY, McGraw-Hill Inc, 1976
3. Maxwell AE: Analyzing Qualitative Data. London, England, Chapman and
Hall, 1975
4. Lancaster HO: The Chi-Squared Distribution. New York, NY, John Wiley
& Sons Inc, 1969
5. Nesbitt JE: Chi-Square: Statistical Guides in Educational Research, no. 2.
England, Manchester University Press, 1966
6. Noether G: Introduction to Statistics: A Fresh Approach. Boston, MA,
Houghton Mifflin Co, 1971
7. Hays WL: Statistics for the Social Sciences, ed 2. New York, NY, Holt,
Rinehart & Winston General Book, 1973
8. Howell DC: Statistical Methods for Psychology. Boston, MA, Duxbury
Press, 1982
9. Dayton CM: The Design of Educational Experiments. New York, NY,
McGraw-Hill Inc, 1970
10. Dotson CD, Kirkendall DR: Statistics for Physical Education, Health and
Recreation. New York, NY, Harper & Row, Publishers Inc, 1974
C = .35
SUMMARY
268
Group 2
column sum
row sum
total
PHYSICAL THERAPY