
Organization of Multiple Comparisons
The Problem with Multiple Comparisons
If n independent contrasts are each tested at α, then the probability of making at least one type I error is 1 - (1 - α)n.
The table below gives the probability of making at least one type I error for four different numbers of comparisons:
n probability 3 .1426 5 .2262 10 .4013 15 .5367
Conceptual Error Rates
Changing the critical value of the statistical test is what controls the conceptual error rate.
Beware
You are generally safe sticking with the following post-hoc comparison techniques: Dunnett, Fisher-Hayter, Tukey HSD, Scheffé or Bonferroni, since they do a reasonably good job of of protecting the familywise error rate. They are known to strongly protect the familywise error rate. However, post-hoc techniques such as Fisher's least significant difference (LSD), Student-Newman-Keuls, and Duncan's multiple range test fail to strongly protect the familywise error rate. Such procedures are said to protect the familywise error rate in a weak sense, avoid them if possible.
Contrasts
Group 1 vs Group 2: ψ1 = (1)M1 + (-1)M2
+ (0)M3 + (0)M4
c1 = 1 -1 0 0
Group 1 vs Group 3: ψ2 = (1)M1 + (0)M2
+ (-1)M3 + (0)M4
c2 = 1 0 -1 0
Group 3 vs Group 4: ψ3 = (0)M1 + (0)M2
+ (1)M3 + (-1)M4
c3 = 0 0 1 -1
Groups 1 & 2 vs Groups 3 & 4: ψ4 = (1)M1
+ (1)M2 + (-1)M3 + (-1)M4
c4 = 1 1 -1 -1
Group 1 vs Group 4: ψ5 = (1)M1 + (0)M2
+ (0)M3 + (-1)M4
c5 = 1 0 0 -1
Orthogonal Contrasts
ψ1 & ψ2 = (1)(1) + (-1)(0) + (0)(-1) + (0)(0) = 1 [not orthogonal]
ψ1 & ψ3 = (1)(0) + (-1)(0) + (0)(1) + (0)(-1) = 0 [orthogonal]
ψ1 & ψ4 = (1)(1) + (-1)(1) + (0)(-1) + (0)(-1) = 0 [orthogonal]
ψ2 & ψ4 = (1)(1) + (0)(1) + (-1)(-1) + (0)(-1) = 2 [not orthogonal]
ψ3 & ψ4 = (0)(1) + (0)(1) + (1)(-1) + (-1)(-1) = 0 [orthogonal]
Planned Orthogonal Comparisons
t Tests for Orthogonal Comparisons

An Example
Using Stata
This section make use of the anovacontrast.ado file which can be obtained from UCLA ATS via the Internet.
use http://www.gseis.ucla.edu/courses/data/cr4new, clear
table a, cont(freq mean y sd y)
----------------------------------------------
a | Freq. mean(y) sd(y)
----------+-----------------------------------
1 | 8 3 1.511858
2 | 8 3.5 .9258201
3 | 8 4.25 1.035098
4 | 8 6.25 2.12132
----------------------------------------------
anova y a
Number of obs = 32 R-squared = 0.4455
Root MSE = 1.476 Adj R-squared = 0.3860
Source | Partial SS df MS F Prob > F
-----------+----------------------------------------------------
Model | 49.00 3 16.3333333 7.50 0.0008
|
a | 49.00 3 16.3333333 7.50 0.0008
|
Residual | 61.00 28 2.17857143
-----------+----------------------------------------------------
Total | 110.00 31 3.5483871
anovacontrast a, values(1 -1 0 0) title(1vs2)
1vs2
Contrast variable a (1 -1 0 0) Dep Var = y
source SS df MS Contrast = -0.50
---------+--------------------------------- N of obs = 32
contrast | 1 1 1.0000 F = 0.46
error | 61 28 2.1786 Prob > F = 0.5036
---------+--------------------------------- t = 0.68
anovacontrast a, values(0 0 1 -1) title(3vs4)
3vs4
Contrast variable a (0 0 1 -1) Dep Var = y
source SS df MS Contrast = -2.00
---------+--------------------------------- N of obs = 32
contrast | 16 1 16.0000 F = 7.34
error | 61 28 2.1786 Prob > F = 0.0114
---------+--------------------------------- t = 2.71
anovacontrast a, values(1 1 -1 -1) title(12vs34)
12vs34
Contrast variable a (1 1 -1 -1) Dep Var = y
source SS df MS Contrast = -4.00
---------+--------------------------------- N of obs = 32
contrast | 32 1 32.0000 F = 14.69
error | 61 28 2.1786 Prob > F = 0.0007
---------+--------------------------------- t = 3.83
char a[user] (1,-1,0,0\0,0,1,-1\1,1,-1,-1)
xi3 u.a
u.a _Ia_1-4 (naturally coded; _Ia_4 omitted)
describe _Ia_1 _Ia_2 _Ia_3
storage display value
variable name type format label variable label
-------------------------------------------------------------------------------
_Ia_1 double %10.0g a(1 -1 0 0)
_Ia_2 double %10.0g a(0 0 1 -1)
_Ia_3 double %10.0g a(1 1 -1 -1)
xi2: regress y u.a
u.a _Ia_1-4 (naturally coded; _Ia_4 omitted)
Source | SS df MS Number of obs = 32
-------------+------------------------------ F( 3, 28) = 7.50
Model | 49.00 3 16.3333333 Prob > F = 0.0008
Residual | 61.00 28 2.17857143 R-squared = 0.4455
-------------+------------------------------ Adj R-squared = 0.3860
Total | 110.00 31 3.5483871 Root MSE = 1.476
------------------------------------------------------------------------------
y | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
_Ia_1 | -.5 .7379992 -0.68 0.504 -2.011723 1.011723
_Ia_2 | -2 .7379992 -2.71 0.011 -3.511723 -.4882771
_Ia_3 | -4 1.043689 -3.83 0.001 -6.137899 -1.862101
_cons | 4.25 .2609221 16.29 0.000 3.715525 4.784475
------------------------------------------------------------------------------
Recall
Group 1 2 3 4 Mean 3.00 3.50 4.25 6.25
Dunnett's Test (Pairwise versus Control Group)

An Example
Using Stata
use http://www.gseis.ucla.edu/courses/data/cr4new
anova y a
Number of obs = 32 R-squared = 0.4455
Root MSE = 1.476 Adj R-squared = 0.3860
Source | Partial SS df MS F Prob > F
-----------+----------------------------------------------------
Model | 49.00 3 16.3333333 7.50 0.0008
|
a | 49.00 3 16.3333333 7.50 0.0008
|
Residual | 61.00 28 2.17857143
-----------+----------------------------------------------------
Total | 110.00 31 3.5483871
anovacontrast a, values(1 -1 0 0) title(1vs2)
1vs2
Contrast variable a (1 -1 0 0) Dep Var = y
source SS df MS Contrast = -0.50
---------+--------------------------------- N of obs = 32
contrast | 1 1 1.0000 F = 0.46
error | 61 28 2.1786 Prob > F = 0.5036
---------+--------------------------------- t = 0.68
anovacontrast a, values(1 0 -1 0) title(1vs3)
1vs3
Contrast variable a (1 0 -1 0) Dep Var = y
source SS df MS Contrast = -1.25
---------+--------------------------------- N of obs = 32
contrast | 6.25 1 6.2500 F = 2.87
error | 61 28 2.1786 Prob > F = 0.1014
---------+--------------------------------- t = 1.69
anovacontrast a, values(1 0 0 -1) title(1vs4)
1vs4
Contrast variable a (1 0 0 -1) Dep Var = y
source SS df MS Contrast = -3.25
---------+--------------------------------- N of obs = 32
contrast | 42.25 1 42.2500 F = 19.39
error | 61 28 2.1786 Prob > F = 0.0001
---------+--------------------------------- t = 4.40
xi3: regress y g.a
s.a _Ia_1-4 (naturally coded; _Ia_1 omitted)
Source | SS df MS Number of obs = 32
-------------+------------------------------ F( 3, 28) = 7.50
Model | 49.00 3 16.3333333 Prob > F = 0.0008
Residual | 61.00 28 2.17857143 R-squared = 0.4455
-------------+------------------------------ Adj R-squared = 0.3860
Total | 110.00 31 3.5483871 Root MSE = 1.476
------------------------------------------------------------------------------
y | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
_Ia_2 | .5 .7379992 0.68 0.504 -1.011723 2.011723
_Ia_3 | 1.25 .7379992 1.69 0.101 -.2617229 2.761723
_Ia_4 | 3.25 .7379992 4.40 0.000 1.738277 4.761723
_cons | 4.25 .2609221 16.29 0.000 3.715525 4.784475
------------------------------------------------------------------------------
describe _Ia_2 _Ia_3 _Ia_4
storage display value
variable name type format label variable label
-------------------------------------------------------------------------------
_Ia_2 double %10.0g a(2 vs. 1)
_Ia_3 double %10.0g a(3 vs. 1)
_Ia_4 double %10.0g a(4 vs. 1)Recall
Group 1 2 3 4 Mean 3.00 3.50 4.25 6.25
Fisher-Hayter Pairwise Comparisons
1vs2 -0.50 n.s. 1vs3 -1.25 n.s. 1vs4 -3.25 sig. 2vs3 -0.75 n.s. 2vs4 -2.75 sig. 3vs4 -2.00 sig.
Alternatively

1vs2 -0.96 n.s. 1vs3 -2.39 n.s. 1vs4 -6.23 sig. 2vs3 -1.44 n.s. 2vs4 -5.27 sig. 3vs4 -3.83 sig.
Using Stata
use http://www.gseis.ucla.edu/courses/data/cr4new
anova y a
Number of obs = 32 R-squared = 0.4455
Root MSE = 1.476 Adj R-squared = 0.3860
Source | Partial SS df MS F Prob > F
-----------+----------------------------------------------------
Model | 49.00 3 16.3333333 7.50 0.0008
|
a | 49.00 3 16.3333333 7.50 0.0008
|
Residual | 61.00 28 2.17857143
-----------+----------------------------------------------------
Total | 110.00 31 3.5483871
fhcomp a
Fisher-Hayter pairwise comparisons for variable grp
studentized range critical value(.05, 3, 28) = 3.4994064
mean critical
grp vs grp group means dif dif
-------------------------------------------------------
1 vs 2 3.0000 3.5000 0.5000 1.8261
1 vs 3 3.0000 4.2500 1.2500 1.8261
1 vs 4 3.0000 6.2500 3.2500* 1.8261
2 vs 3 3.5000 4.2500 0.7500 1.8261
2 vs 4 3.5000 6.2500 2.7500* 1.8261
3 vs 4 4.2500 6.2500 2.0000* 1.8261
Tukey's HSD Pairwise Comparisons
1vs2 -0.50 n.s. 1vs3 -1.25 n.s. 1vs4 -3.25 sig. 2vs3 -0.75 n.s. 2vs4 -2.75 sig. 3vs4 -2.00 n.s.
Alternatively

1vs2 -0.96 n.s. 1vs3 -2.39 n.s. 1vs4 -6.23 sig. 2vs3 -1.44 n.s. 2vs4 -5.27 sig. 3vs4 -3.83 n.s.
Using Stata
use http://www.gseis.ucla.edu/courses/data/cr4new
anova y a
Number of obs = 32 R-squared = 0.4455
Root MSE = 1.476 Adj R-squared = 0.3860
Source | Partial SS df MS F Prob > F
-----------+----------------------------------------------------
Model | 49.00 3 16.3333333 7.50 0.0008
|
a | 49.00 3 16.3333333 7.50 0.0008
|
Residual | 61.00 28 2.17857143
-----------+----------------------------------------------------
Total | 110.00 31 3.5483871
tukeyhsd a
Tukey HSD pairwise comparisons for variable a
studentized range critical value(.05, 4, 28) = 3.8613586
uses harmonica mean sample size = 8.000
mean critical
grp vs grp group means dif dif
-------------------------------------------------------
1 vs 2 3.0000 3.5000 0.5000 2.0150
1 vs 3 3.0000 4.2500 1.2500 2.0150
1 vs 4 3.0000 6.2500 3.2500* 2.0150
2 vs 3 3.5000 4.2500 0.7500 2.0150
2 vs 4 3.5000 6.2500 2.7500* 2.0150
3 vs 4 4.2500 6.2500 2.0000 2.0150
Comparing Fisher-Hayter with Tukey's HSD
Recall
Group 1 2 3 4 Mean 3.00 3.50 4.25 6.25
Scheffé's Test

From Our Example
ca = 3 -1 -1 -1 cb = 2 0 -1 -1 cc = 1 1 -1 -1 cd = 1 1 -2 0
Fa = 7.65 -- n.s. Fb = 12.39 -- sig. Fc = 14.69 -- sig. Fd = 2.45 -- n.s.Using Stata
use http://www.gseis.ucla.edu/courses/data/cr4new
anova y a
Number of obs = 32 R-squared = 0.4455
Root MSE = 1.476 Adj R-squared = 0.3860
Source | Partial SS df MS F Prob > F
-----------+----------------------------------------------------
Model | 49.00 3 16.3333333 7.50 0.0008
|
a | 49.00 3 16.3333333 7.50 0.0008
|
Residual | 61.00 28 2.17857143
-----------+----------------------------------------------------
Total | 110.00 31 3.5483871
anovacontrast a, values(3 -1 -1 -1) title(1vs234)
1vs234
Contrast variable a (3 -1 -1 -1) Dep Var = y
source SS df MS Contrast = -5.00
---------+--------------------------------- N of obs = 32
contrast | 16.6666667 1 16.6667 F = 7.65
error | 61 28 2.1786 Prob > F = 0.0099
---------+--------------------------------- t = 2.77
anovacontrast a, values(2 0 -1 -1) title(1vs34)
1vs34
Contrast variable a (2 0 -1 -1) Dep Var = y
source SS df MS Contrast = -4.50
---------+--------------------------------- N of obs = 32
contrast | 27 1 27.0000 F = 12.39
error | 61 28 2.1786 Prob > F = 0.0015
---------+--------------------------------- t = 3.52
anovacontrast a, values(1 1 -1 -1) title(12vs34)
12vs34
Contrast variable a (1 1 -1 -1) Dep Var = y
source SS df MS Contrast = -4.00
---------+--------------------------------- N of obs = 32
contrast | 32 1 32.0000 F = 14.69
error | 61 28 2.1786 Prob > F = 0.0007
---------+--------------------------------- t = 3.83
anovacontrast a, values(1 1 -2 0) title(12vs3)
12vs3
Contrast variable a (1 1 -2 0) Dep Var = y
source SS df MS Contrast = -2.00
---------+--------------------------------- N of obs = 32
contrast | 5.33333333 1 5.3333 F = 2.45
error | 61 28 2.1786 Prob > F = 0.1289
---------+--------------------------------- t = 1.56
Bonferroni & Sidak Methods
The Sidak critical value is αSi = 1 - (1-.05).25 = .01274146 which equates to a critical value of FSi = 4.31.
Compare these critical values with the Scheffé critical value of 8.85.
Comparing the Comparisons
Consider a four group design with error df=28. Here are the critical values for pairwise comparisons using various methods at α = 0.05.
Method Critical Value of t* Ordinary Student's t 2.048 Dunnett's test 2.157 Fisher-Hayter 2.474 requires rescaling studentized range statistic Tukey HSD 2.730 requires rescaling studentized range statistic Sidak 2.830 Bonferoni 2.839 Scheffé 2.975
Linear Statistical Models Course
Phil Ender, 13apr06, 12Feb98