||This article's tone or style may not reflect the encyclopedic tone used on Wikipedia. (May 2013)|
The Wald test is a parametric statistical test named after the Transylvanian statistician Abraham Wald with a great variety of uses. Whenever a relationship within or between data items can be expressed as a statistical model with parameters to be estimated from a sample, the Wald test can be used to test the true value of the parameter based on the sample estimate.
Suppose an economist, who has data on social class and shoe size, wonders whether social class is associated with shoe size. Say is the average increase in shoe size for upper-class people compared to middle-class people: then the Wald test can be used to test whether is 0 (in which case social class has no association with shoe size) or non-zero (shoe size varies between social classes). Here, , the hypothetical difference in shoe sizes between upper and middle-class people in the whole population, is a parameter. An estimate of might be the difference in shoe size between upper and middle-class people in the sample. In the Wald test, the economist uses the estimate and an estimate of variability (see below) to draw conclusions about the unobserved true . Or, for a medical example, suppose smoking multiplies the risk of lung cancer by some number R: then the Wald test can be used to test whether R = 1 (i.e. there is no effect of smoking) or is greater (or less) than 1 (i.e. smoking alters risk).
Under the Wald statistical test, the maximum likelihood estimate of the parameter(s) of interest is compared with the proposed value , with the assumption that the difference between the two will be approximately normally distributed. Typically the square of the difference is compared to a chi-squared distribution. In the univariate case, the Wald statistic is
which is compared against a chi-squared distribution.
Alternatively, the difference can be compared to a normal distribution. In this case the test statistic is
In the multivariate case, a test about several parameters at once is carried out using a variance matrix.2 A common use for this is to carry out a Wald test on a categorical variable by recoding it as several dichotomous variables.
The likelihood-ratio test can also be used to test whether an effect exists or not. The Wald test and the likelihood ratio test often give similar conclusions (as they are asymptotically equivalent), but they could disagree enough to lead to different conclusions.
There are several reasons to prefer the likelihood ratio test to the Wald test.345 One is that the Wald test can give different answers to the same question, depending on how the question is phrased.6 For example, asking whether R = 1 is the same as asking whether log R = 0; but the Wald statistic for R = 1 is not the same as the Wald statistic for log R = 0 (because there is in general no neat relationship between the standard errors of R and log R). Likelihood ratio tests will give exactly the same answer whether we work with R, log R or any other monotonic transformation of R. The other reason is that the Wald test uses two approximations (that we know the standard error, and that the distribution is chi-squared), whereas the likelihood ratio test uses one approximation (that the distribution is chi-squared).
Yet another alternative is the score test, which has the advantage that it can be formulated in situations where the variability is difficult to estimate; e.g. the Cochran–Mantel–Haenzel test is a score test.7
- Harrell, Frank E., Jr. (2001). "Sections 9.2, 10.5". Regression modeling strategies. New York: Springer-Verlag. ISBN 0387952322.
- Harrell, Frank E., Jr. (2001). "Section 9.3.1". Regression modeling strategies. New York: Springer-Verlag. ISBN 0387952322.
- Harrell, Frank E., Jr. (2001). "Section 9.3.3". Regression modeling strategies. New York: Springer-Verlag. ISBN 0387952322.
- Collett, David (1994). Modelling Survival Data in Medical Research. London: Chapman & Hall. ISBN 0412448807.
- Pawitan, Yudi (2001). In All Likelihood. New York: Oxford University Press. ISBN 0198507658.
- Fears, Thomas R.; Benichou, Jacques; Gail, Mitchell H. (1996). "A reminder of the fallibility of the Wald statistic". The American Statistician 50 (3): 226–227. doi:10.1080/00031305.1996.10474384.
- Agresti, Alan (2002). Categorical Data Analysis (2nd ed.). Wiley. p. 232. ISBN 0471360937.
- Engle, Robert F. (1983). "Wald, Likelihood Ratio, and Lagrange Multiplier Tests in Econometrics". In Intriligator, M. D.; and Griliches, Z. Handbook of Econometrics II. Elsevier. pp. 796–801. ISBN 978-0-444-86185-6.