Fisher's least significant difference
| Type | Post-hoc analysis |
|---|---|
| Developer | Ronald Fisher |
| Year | 1935 |
| Purpose | Pairwise comparison of group means |
| Requirement | Significant ANOVA F-test |
In statistics, Fisher's least significant difference (LSD) is a procedure used to identify statistically significant differences between the means of multiple groups. Developed by Ronald Fisher in 1935, it was the first post-hoc test designed to be performed following a significant analysis of variance (ANOVA) result.[1]
The method is intended to control the Type I error rate while maintaining higher statistical power than more conservative adjustments, such as the Bonferroni correction. It remains widely used in fields like agronomy and the social sciences.[2]
Methodology
The LSD procedure is typically applied in two stages, a process often referred to as Fisher's protected LSD:
- Omnibus test: an F-test is performed via ANOVA to determine if there are any statistically significant differences among the group means. If the F-test is not significant, the procedure stops to prevent inflating the family-wise error rate.
- Pairwise comparisons: if the omnibus F-test from step 1 is significant, pairwise t-tests are conducted for all pairs of groups. These tests use a pooled variance estimated derived from the ANOVA in step 1.
Mathematical formulation
The least significant difference for two groups and is calculated as:
where:
- is the critical value from the t-distribution for a given significance level and the error degrees of freedom from the ANOVA.
- is the mean square error from the ANOVA.
- and are the sample sizes of the groups being compared.[3]
Comparison with other methods
Fisher's LSD is categorized as an "anti-conservative" test because it does not directly adjust the Type I error rate for the total number of comparisons.
Versus Bonferroni
Unlike the Bonferroni correction, which divides the significance level by the number of comparisons , Fisher's LSD maintains the per-comparison error rate at . While this increases the probability of finding a true effect (power), it also increases the risk of a false positive when the number of groups is large.[4]
Versus Tukey's HSD
Tukey's Honest Significant Difference (HSD) controls the family-wise error rate for all possible pairwise comparisons. Fisher's LSD is generally more powerful than Tukey's HSD but is only considered valid for controlling the family-wise error rate when comparing exactly three groups.[3]
Criticisms and limitations
The primary criticism of Fisher's LSD is that the "protection" offered by the omnibus F-test diminishes as the number of groups increases. For four or more groups, the probability of at least one Type I error occurring among the pairwise comparisons can exceed the nominal , even if the F-test is significant. For this reason, for experiments involving many groups, many statisticians recommend more modern procedures like the Holm–Bonferroni method or Tukey's range test.[5]
References
- ^ Fisher, Ronald A. (1935). The Design of Experiments. Edinburgh: Oliver & Boyd.
- ^ Meier, Ulrich (5 May 2006). "A note on the power of Fisher's least significant difference procedure". Pharmaceutical Statistics. 5 (4). John Wiley & Sons, Ltd.: 253–263. doi:10.1002/pst.210.
- ^ a b Hayter, Anthony J. (1986). "The Maximum Familywise Error Rate of Fisher's Least Significant Difference Test". Journal of the American Statistical Association. 81 (396): 1000–1004. doi:10.1080/01621459.1986.10478364.
- ^ Seaman, Michael A.; Levin, Joel R.; Serlin, Ronald C. (1991). "New developments in pairwise multiple comparisons: Some powerful and practicable procedures". Psychological Bulletin. 110 (3): 577–586. doi:10.1037/0033-2909.110.3.577.
- ^ Maxwell, Scott E.; Delaney, Harold D.; Kelley, Ken (2017). Designing Experiments and Analyzing Data: A Model Comparison Perspective (3rd ed.). Routledge. doi:10.4324/9781315642956. ISBN 9781315642956.