The Annals of Mathematical Statistics

Some Distribution-Free Tests for the Difference Between two Empirical Cumulative Distribution Functions

E. F. Drion

Full-text: Open access

Abstract

It sometimes happens that of two empirical cumulative distribution curves (step curves) one lies entirely above the other, in other words that, except at both ends, they have no point in common. The problem then arises, what is the probability that this will happen when both are random samples from the same population. In this paper a partial answer will be given, based on the ingenious solution of Andre (as cited in the well known textbook of Bertrand [1] in the problem of the ballot and also in Chap. VIII, Sect. 5 of [7]). Moreover an analogous method will allow us to give an exact answer to the problem of the maximum difference between two empirical cumulative distribution functions of random samples from the same population, but only if both samples have the same size. Smirnov has given an asymptotic solution for the latter problem (cited by Feller [3], see also [2]). Our result leads, by using the Stirling approximation for the factorials, to the asymptotic formula of Smirnov. A comparison of numerical results of the exact formula and the asymptotic formula of Smirnov shows that at least in the case of equal samples, the probabilities calculated by the Smirnov formula have, for samples as small as 20, an error of less than 4% for probabilities 0.033 or more. (See also Massey [5], who has calculated the exact probabilities for equal samples by means of difference equations.)

Article information

Source
Ann. Math. Statist., Volume 23, Number 4 (1952), 563-574.

Dates
First available in Project Euclid: 28 April 2007

Permanent link to this document
https://projecteuclid.org/euclid.aoms/1177729335

Digital Object Identifier
doi:10.1214/aoms/1177729335

Mathematical Reviews number (MathSciNet)
MR51492

Zentralblatt MATH identifier
0047.38202

JSTOR
links.jstor.org

Citation

Drion, E. F. Some Distribution-Free Tests for the Difference Between two Empirical Cumulative Distribution Functions. Ann. Math. Statist. 23 (1952), no. 4, 563--574. doi:10.1214/aoms/1177729335. https://projecteuclid.org/euclid.aoms/1177729335


Export citation