## Brazilian Journal of Probability and Statistics

### Bayesian hypothesis testing: Redux

#### Abstract

Bayesian hypothesis testing is re-examined from the perspective of an a priori assessment of the test statistic distribution under the alternative. By assessing the distribution of an observable test statistic, rather than prior parameter values, we revisit the seminal paper of Edwards, Lindman and Savage (Psychol. Rev. 70 (1963) 193–242). There are a number of important take-aways from comparing the Bayesian paradigm via Bayes factors to frequentist ones. We provide examples where evidence for a Bayesian strikingly supports the null, but leads to rejection under a classical test. Finally, we conclude with directions for future research.

#### Article information

Source
Braz. J. Probab. Stat., Volume 33, Number 4 (2019), 745-755.

Dates
Accepted: April 2019
First available in Project Euclid: 26 August 2019

https://projecteuclid.org/euclid.bjps/1566806431

Digital Object Identifier
doi:10.1214/19-BJPS442

Mathematical Reviews number (MathSciNet)
MR3996315

#### Citation

Lopes, Hedibert F.; Polson, Nicholas G. Bayesian hypothesis testing: Redux. Braz. J. Probab. Stat. 33 (2019), no. 4, 745--755. doi:10.1214/19-BJPS442. https://projecteuclid.org/euclid.bjps/1566806431

#### References

• Bartlett, M. S. (1957). Comment on “A statistical paradox”. Biometrika 44, 533–534.
• Berger, J. O. (2003). Could Fisher, Jeffreys and Neyman have agreed on testing? Statistical Science 18, 1–32.
• Berger, J. O. and Delampady, M. (1987). Testing precise hypothesis (with discussion). Statistical Science 2, 317–335.
• Berger, J. O. and Jefferys, W. H. (1992). The application of robust Bayesian analysis to hypothesis testing and Occam’s razor. Journal of the Italian Statistical Society 1, 17–32.
• Berger, J. O. and Sellke, T. (1987). Testing of a point null hypothesis: The irreconcilability of significance levels and evidence (with discussion). Journal of the American Statistical Association 82, 112–139.
• Berkson, J. (1938). Some difficulties of interpretation encountered in the application of the chi-square test. Journal of the American Statistical Association 33, 526–542.
• Connolly, R. A. (1991). A posterior odds analysis of the weekend effect. Journal of Econometrics 49, 51–104.
• Edwards, W., Lindman, H. and Savage, L. J. (1963). Bayesian statistical inference for psychological research. Psychological Review 70, 193–242.
• Efron, B. and Gous, A. (2001). Scales of evidence for model selection: Fisher versus Jeffreys. In Model Selection, 208–246. Beachwood: Institute of Mathematical Statistics.
• Etz, A. and Wagenmakers, E.-J. (2017). J. B. S. Haldane’s contribution to the Bayes factor hypothesis test. Statistical Science 32, 313–329.
• Gelman, A., Jakulin, A., Pitau, M. G. and Su, Y.-S. (2008). A weakly informative default prior distribution for logistic and other regression models. Annals of Applied Statistics 2, 1360–1383.
• Good, I. J. (1992). The Bayes/non-Bayes compromise: A brief review. Journal of the American Statistical Association 87, 597–606.
• Hartigan, J. A. (2003). Akaike–Jeffreys priors. Technical report, Yale Univ.
• Held, L. and Ott, M. (2018). On $p$-values and Bayes factors. Annual Review of Statistics and Its Application 5, 393–419.
• Jefferys, W. H. and Berger, J. O. (1992). Ockham’s razor and Bayesian analysis. American Scientist 80, 64–72.
• Jeffreys, H. (1957). Scientific Inference. Cambridge: Cambridge University Press.
• Jeffreys, H. (1961). Theory of Probability. London: Oxford University Press.
• Jeffreys, H. (1961). Theory of Probability, 3rd ed. Oxford Classic Texts in the Physical Sciences. Oxford: Oxford Univ. Press.
• Johnson, V. (2005). Bayes factors based on test statistics. Journal of the Royal Statistical Society, Series B 67, 689–701.
• Johnson, V. (2008). Properties of Bayes factors based on test statistics. Scandinavian Journal of Statistics 35, 354–368.
• Kass, R. E. and Raftery, A. E. (1995). Bayes factors. Journal of the American Statistical Association 90, 773–795.
• Lehmann, E. L. (1959). Testing Statistical Hypotheses. New York: Wiley.
• Lindley, D. V. (1957). A statistical paradox. Biometrika 44, 187–192.
• Lopes, H. F. and West, M. (2004). Bayesian model assessment in factor analysis. Statistica Sinica 14, 41–67.
• Polson, N. G. and Roberts, G. O. (1994). Bayes factors for discrete observations from diffusion processes. Biometrika 81, 11–26.
• Savage, L. J. (1962). Subjective probability and statistical practice. In The Foundations of Statistical Inference: A Discussion (L. J. Savage et al., eds.). New York: Wiley.
• Scott, J. G. and Berger, J. O. (2010). Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem. The Annals of Statistics 38, 2587–2619.
• Zellner, A. and Siow, A. (1979). Posterior odds ratio for selected regression hypotheses. In Bayesian Statistics (J. M. Bernardo, M. H. De Groot, D. V. Lindley and A. F. M. Smith, eds.), Proceedings of the First International Meeting, 585–603. Valencia: University Press.