Fisher's Exact Test (Expanded)
Menu location: Analysis_Exact_Expanded Fisher.
This allows you to see a conventional Fisher's exact (Fisher-Irwin) test in more detail, including mid-P values. The complete conditional distribution for the observed marginal totals is displayed (Bailey, 1977).
Like the chi-square test for fourfold (2 by 2) tables, Fisher's exact test examines the relationship between the two dimensions of the table (classification into rows vs. classification into columns). The null hypothesis is that these two classifications are not different.
The P values in this test are computed by considering all possible tables that could give the row and column totals observed. A mathematical short cut relates these permutations to factorials; a form shown in many textbooks. StatsDirect uses the hypergeometric distribution for the calculation (Conover 1999). The test statistic that is hypergeometrically distributed is the expected value of the first count A.
This exact treatment of the fourfold table should be used instead of the chi-square test when any expected frequency is less than 1 or 20% of expected frequencies are less than or equal to 5. With StatsDirect, it is reasonable to use Fisher's exact test by default because the computational method used can cope with large numbers.
StatsDirect uses the definition of a 2 sided P value described by Bailey (1977) (P values for all possible tables with P less than or equal to that for the observed table are summed). Many authors prefer to simply double the 1 sided P value (Armitage and Berry, 1994; Bland, 2000).
Consider using mid-P values and intervals when you have several similar studies within an overall investigation (Armitage and Berry, 1994; Barnard, 1989).
Assumptions:
· each observation is classified into exactly one cell
· the row and column totals are fixed, not random
The assumption of fixed marginal (row/column) totals is controversial and causes disagreements such as the best approach to two sided inference from this test.
DATA INPUT:
Observed frequencies should be entered as a standard fourfold table:
feature present | feature absent | |
outcome positive: | a | b |
outcome negative: | c | d |
Example
From Armitage and Berry (1994, p. 138).
The following data compare malocclusion of teeth with type of feeding received by infants.
Normal teeth | Malocclusion | |
Breast fed: | 4 | 16 |
Bottle fed: | 1 | 21 |
To analyse these data in StatsDirect you must select the expanded Fisher's exact test function from the exact tests section of the analysis menu. Enter the frequencies into the contingency table on screen as shown above.
For this example:
Rearranged table:
4 | 1 | 5 |
16 | 21 | 37 |
20 | 22 | 42 |
Expectation of A = 2.380952
A | Lower Tail | Individual P | Upper Tail |
0 | 1.000000000000000 | 0.030956848030019 | 0.030956848030019 |
1 | 0.202939337085679 | 0.171982489055660 | 0.969043151969981 |
2 | 0.546904315196998 | 0.343964978111320 | 0.797060662914321 |
3 | 0.856472795497186 | 0.309568480300188 | 0.453095684803002 |
4 | 0.981774323237738 | 0.125301527740552 | 0.143527204502814 |
5 | 1.000000000000000 | 0.018225676762262 | 0.018225676762262 |
One sided (upper tail) P = 0.1435, (doubled one sided P = 0.2871)
Two sided (by summation) P = 0.1745
One sided mid-P = 0.0809
Two sided mid-P = 0.1618
Here we cannot reject the null hypothesis that there is no association between these two classifications, i.e. between feeding mode and malocclusion.