Tukey's trend test

Tukey's trend test (or the Tukey-Ciminera-Heyse trend test) is a non-parametric or semi-parametric statistical hypothesis test used to detect a dose–response relationship between a discrete independent variable (such as dosage level) and a continuous dependent variable.

First proposed by John Tukey, Joseph Ciminera, and John Heyse in 1985, the test was developed to address the limitations of simple linear regression when the exact functional form of the dose-response curve is unknown.[1] It is frequently applied in toxicology and pharmacology to determine if increasing exposure to a substance correlates with a monotonic increase or decrease in a biological response.

Background

In clinical and laboratory experiments, researchers often test a control group alongside several increasing dose levels (e.g., control, low, medium, and high). While a one-way ANOVA can identify differences between dose groups, it does not account for the ordered nature of the doses. On the other hand, standard linear regression assumes a strictly linear relationship, which may fail to detect trends that are concave, convex, or step functions.

Tukey's trend test provides a robust alternative by using a "maximum-type" test statistic. It simultaneously considers multiple possible "shapes" of the trend, reducing the risk of a Type II error when the true dose-response shape is non-linear.[2]

Methodology

The test typically evaluates a null hypothesis () of no trend against an alternative hypothesis () of a monotonic trend.

Scoring shapes

The original Tukey trend test employs three different sets of scores () to represent potential dose-response shapes:

  1. Arithmetic: assumes equal increments between doses (e.g., 0, 1, 2, 3).
  2. Ordinal: ranks the doses regardless of their physical value (e.g., 1, 2, 3, 4)
  3. Logarithmic: assumes the effect is proportional to the logarithm of the dose (e.g.,

Test statistic and correlation

Testing for a trend is treated as a regression problem. For each set of scores, a t-statistic is calculated, based on the Pearson correlation coefficient () between the response () and the assigned dose scores (). For each score, its corresponding t-statistic is defined as:

where is the total number of observations. The t-statistics are denoted , , and respectively, and quantify the level of "statistical certainty" that the slope of the relationship is non-zero. The final test statistic is formed as the maximum of these individual statistics:

Because is the maximum of several correlated statistics, it does not follow a standard Student's t-distribution. Instead, the corresponding p-value must be adjusted, either via a multivariate t-distribution or through permutation methods.[3]

Significance and regulatory context

It is often cited as a precursor to the Multiple Comparison Procedures and Modeling (MCP-MOD) approach.[4] While Tukey's 1985 test used a fixed set of three contrasts, MCP-MOD generalizes this to any number of candidate parametric models.[5]

In 2014, the European Medicines Agency (EMA) issued a positive qualification opinion on MCP-MOD as an efficient methodology for Phase II dose-finding studies.[6] The Food and Drug Administration gave a similar endorsement in 2016.[7]

Comparison to other tests

Unlike the Jonckheere's trend test, which is fully non-parametric and based on ranks, Tukey's trend test is more closely related to regression-based approaches.

References

  1. ^ Tukey, J. W.; Ciminera, J. L.; Heyse, J. F. (March 1985). "Testing the statistical certainty of a response to increasing doses of a drug". Biometrics. 41 (1): 295–301. ISSN 0006-341X. PMID 4005384.
  2. ^ Bretz, Frank; Hothorn, Ludwig A. (2002-11-30). "Detecting dose-response using contrasts: asymptotic power and sample size determination for binomial data". Statistics in Medicine. 21 (22): 3325–3335. doi:10.1002/sim.1324. ISSN 0277-6715. PMID 12407675.
  3. ^ Schaarschmidt, Frank; Ritz, Christian; Hothorn, Ludwig A. (June 2022). "The Tukey trend test: Multiplicity adjustment using multiple marginal models". Biometrics. 78 (2): 789–797. doi:10.1111/biom.13442. ISSN 1541-0420. PMID 33559878.
  4. ^ Bretz, F.; Pinheiro, J. C.; Branson, M. (2005). "Combining multiple comparisons and modeling techniques in dose-response studies". Biometrics. 61 (3): 738–748. doi:10.1111/j.1541-0420.2005.00344.x.
  5. ^ Pinheiro, Jose; Bornkamp, Björn; Glimm, Ekkehard; Bretz, Franz (May 2014). "Model-based dose finding under model uncertainty using general parametric models". Stat Med. 33 (10): 1646–61. arXiv:1305.0889. doi:10.1002/sim.6052. PMID 24302486.
  6. ^ "Qualification opinion of MCP-Mod as an efficient statistical methodology" (PDF). European Medicines Agency. 2014. Retrieved 11 January 2026.
  7. ^ Food and Drug Administration (July 2016). "U.S. FDA Qualification of The MCP-Mod Procedure" (PDF). FDA.gov. U.S. Department of Health and Human Services. Retrieved January 11, 2026.