Sample Size for Matched Case-control Studies
Menu location: Analysis_Sample Size_Matched Case-Control.
This function gives you the minimum sample size necessary to detect a true odds ratio OR with power POWER and a two sided type I error probability ALPHA. If you are using more than one control per case then this function also provides the reduction in sample size relative to a paired study that you can obtain using your number of controls per case (Dupont, 1988).
Information required
- POWER: probability of detecting a real effect.
- ALPHA: probability of detecting a false effect (two sided: double this if you need one sided).
- r: correlation coefficient (ϕ) for exposure between matched cases and controls.
- P0: probability of exposure in the control group.
- m: number of control subjects matched to each case subject.
- OR: odds ratio (ψ).
Practical issues
- Usual values for POWER are 80%, 85% and 90%; try several in order to explore/scope.
- 5% is the usual choice for ALPHA.
- r can be estimated from previous studies - note that r is the phi (correlation) coefficient that is given for a two by two table if you enter it into the StatsDirect r by c chi-square function. When r is not known from previous studies, some authors state that it is better to use a small arbitrary value for r, say 0.2, than it is to assume independence (a value of 0) (Dupont, 1988).
- P0 can be estimated as the population prevalence of exposure. Note, however, that due to matching, the control sample is not a random sample from the population therefore population prevalence of exposure can be a poor estimate of P0 (especially if confounders are strongly associated with exposure, Dupont, 1988).
- If possible, choose a range of odds ratios that you want have the statistical power to detect.
Technical validation
The estimated sample size n is calculated as (Dupont, 1990):
                                            
                                        
- where α = alpha, β = 1 - power, ψ = odds ratio, ϕ is the correlation coefficient for exposure between matched cases and controls, and Zp is the standard normal deviate for probability p. n is rounded up to the closest integer.