Cramér-von-Mises criterion

In statistics, the Cramér-von-Mises criterion for judging the goodness of fit of a probability distribution <math>F^*</math> compared to a given distribution <math>F</math> is given by

<math>W^2 = \int_{-\infty}^\infty [F(x)-F^*(x)]^2 dF(x) </math>

In one-sample applications, <math>F</math> is the theoretical distribution and <math>F^*</math> is the empirically observed distribution. Alternatively, both distributions can be estimated empirically; this is called the two-sample case.

The criterion is named after Harald Cramér and Richard Edler von Mises, who first proposed it in 1928–1930. The generalization to two samples is due to Anderson (1962).

The Cramér-von-Mises test is an alternative to the Kolmogorov-Smirnov test. The CvM test is often regarded as more powerful than the KS test, but this has not been established theoretically.

Cramér-von-Mises test (one sample)

Let <math>x_1,x_2,\ldots,x_n</math> be the observed values, in increasing order. Then it is possible to show that

<math>T = n W^2 = \frac{1}{12n} + \sum_{i=1}^n \left[ \frac{2i-1}{2n}-F(x_i) \right]^2. </math>

If this value is larger than the tabulated critical value, we can reject the hypothesis that the data come from the distribution <math>F(\cdot)</math>.
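The one-sample statistic can be evaluated directly from the sum above. The following is a minimal sketch in Python, not a reference implementation: the names `cvm_one_sample` and `uniform_cdf` are hypothetical, the hypothesized distribution is passed in as a plain function, and the critical value still has to be looked up in a table.

```python
from typing import Callable, Sequence


def cvm_one_sample(sample: Sequence[float], cdf: Callable[[float], float]) -> float:
    """Return T = n W^2 for a sample and a hypothesized CDF F.

    Implements T = 1/(12n) + sum_i [(2i - 1)/(2n) - F(x_i)]^2,
    where x_1 <= ... <= x_n are the sorted observations.
    """
    xs = sorted(sample)  # the formula assumes increasing order
    n = len(xs)
    if n == 0:
        raise ValueError("sample must not be empty")
    t = 1.0 / (12 * n)
    for i, x in enumerate(xs, start=1):
        t += ((2 * i - 1) / (2 * n) - cdf(x)) ** 2
    return t


def uniform_cdf(x: float) -> float:
    """CDF of the uniform distribution on [0, 1] (illustrative choice of F)."""
    return min(max(x, 0.0), 1.0)


if __name__ == "__main__":
    print(cvm_one_sample([0.25, 0.75], uniform_cdf))
```

For the sample 0.25, 0.75 tested against the uniform distribution on [0, 1], both bracketed terms vanish, so T reduces to 1/(12n) = 1/24, the smallest value the statistic can take for n = 2.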

Cramér-von-Mises test (two samples)

Let <math>x_1,x_2,\ldots,x_n</math> and <math>y_1,y_2,\ldots,y_m</math> be the observed values in the first and second sample respectively, in increasing order. Let <math>r_1,r_2,\ldots,r_n</math> be the ranks of the x's in the combined sample, and let <math>s_1,s_2,\ldots,s_m</math> be the ranks of the y's in the combined sample. It can be shown that

<math>T = \frac{n m}{n+m} W^2 = \frac{U}{n m (n+m)}-\frac{4 m n - 1}{6(m+n)} </math>

where U is defined as

<math>U = n \sum_{i=1}^n (r_i-i)^2 + m \sum_{j=1}^m (s_j-j)^2 </math>

If the value of T is larger than the tabulated critical value, we can reject the hypothesis that the two samples come from the same distribution. Some books give critical values for U instead, which is more convenient because T need not be computed from the expression above; the conclusion is the same.
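The rank computation for the two-sample case can be sketched as follows. This is an illustrative sketch only: the function name `cvm_two_sample` is hypothetical, U is taken in the squared-difference form of Anderson (1962), <math>U = n \sum_i (r_i-i)^2 + m \sum_j (s_j-j)^2</math>, and the code assumes there are no ties within or between the samples, so that every rank is well defined.

```python
from typing import Sequence, Tuple


def cvm_two_sample(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float]:
    """Return (U, T) for two samples, assuming no tied values.

    U = n * sum_i (r_i - i)^2 + m * sum_j (s_j - j)^2
    T = U / (n m (n + m)) - (4 m n - 1) / (6 (m + n))
    where r_i and s_j are ranks in the combined, sorted sample.
    """
    n, m = len(xs), len(ys)
    if n == 0 or m == 0:
        raise ValueError("both samples must be non-empty")

    # Label each value with its sample of origin, then sort the pooled data.
    combined = sorted([(v, 0) for v in xs] + [(v, 1) for v in ys])
    values = [v for v, _ in combined]
    if len(set(values)) != len(values):
        raise ValueError("ties are not handled in this sketch")

    # Walking the pooled data in order yields each sample's ranks in
    # increasing order, which is what the formula for U requires.
    r, s = [], []
    for rank, (_, label) in enumerate(combined, start=1):
        (r if label == 0 else s).append(rank)

    u = n * sum((ri - i) ** 2 for i, ri in enumerate(r, start=1))
    u += m * sum((sj - j) ** 2 for j, sj in enumerate(s, start=1))
    t = u / (n * m * (n + m)) - (4 * m * n - 1) / (6 * (m + n))
    return float(u), t


if __name__ == "__main__":
    print(cvm_two_sample([1.0, 3.0], [2.0, 4.0]))
```

For x = (1, 3) and y = (2, 4), the ranks are r = (1, 3) and s = (2, 4), which gives U = 12 and T = 12/16 − 15/24 = 0.125.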

References

Anderson: 'On the Distribution of the Two-Sample Cramér-von Mises Criterion', Annals of Mathematical Statistics, 33 #3 (1962), pp. 1148–1159.

Xiao, Gordon, Yakovlev: 'A C++ Program for the Cramér-von Mises Two-Sample Test', Journal of Statistical Software, 17 #8 (January 2007).

