Q-Q plot

Jump to: navigation, search
File:Qqnormexp.png
A normal Q-Q plot of randomly generated exponential(1) data.
File:Qqnormcorr.png
A normal Q-Q plot of randomly generated normal(0,1) data.

In statistics, a Q-Q plot ("Q" stands for quantile) is a graphical method for diagnosing differences, between the probability distribution of a statistical population from which a random sample has been taken and a comparison distribution. An example of the kind of differences that can be tested for is non-normality of the population distribution.

For a sample of size n, one plots n points, with the (n+1)-quantiles of the comparison distribution (e.g. the normal distribution) on the horizontal axis (for k = 1, ..., n), and the order statistics of the sample on the vertical axis. If the population distribution is the same as the comparison distribution this approximates a straight line, especially near the center. In the case of substantial deviations from linearity, the statistician rejects the null hypothesis of sameness.

Plotting positions

For the quantiles of the comparison distribution typically the formula k/(n + 1) is used. Several different formulas have been used or proposed as symmetrical plotting positions. Such formulas have the form (k − a)/(n + 1 − 2a) for some value of a in the range from 0 to 1/2. The above expression k/(n + 1) is one example of these, for a = 0. Other expressions include:

  • (k − 1/3)/(n + 1/3) [1]
  • (k − 0.3175)/(n + 0.365) [2]
  • (k − 0.326)/(n + 0.348) [3]
  • (k − 0.375)/(n + 0.25)[4]
  • (k − 0.44)/(n + 0.12)[5]

For large sample size, n, there is little difference between these various expressions.

Relation with rankit plots

Q-Q plots are similar to rankit plots, also called normal probability plots. The difference is that in normal probability plots, instead of using the quantile of the normal distribution as the x-axis, one uses the expected value of the kth order statistic from a normal distribution with expectation 0 and variance 1. Only when n is small is there a substantial difference between a Q-Q plot and a normal probability plot.

See also

References

  1. A simple (and easy to remember) formula for plotting positions.
  2. Engineering Statistics Handbook: Normal Probability Plot – Note that this also uses a different expression for the first & last points. [1] cites the original work by Filliben 1975.
  3. Distribution free plotting position, Yu & Huang
  4. This is Blom's earlier approximation 1953 and is the expression used in MINITAB.
  5. This plotting position was used by Gringorten 1963 to plot points in tests for the Gumbel distribution.
de:Quantile-Quantile-Plotit:Plotting position




Navigation WikiDoc | WikiPatient | Popular pages | Recently Edited Pages | Recently Added Pictures

Table of Contents In Alphabetical Order | By Individual Diseases | Signs and Symptoms | Physical Examination | Lab Tests | Drugs

Editor Tools Become an Editor | Editors Help Menu | Create a Page | Edit a Page | Upload a Picture or File | Printable version | Permanent link | Maintain Pages | What Pages Link Here
There is no pharmaceutical or device industry support for this site and we need your viewer supported Donations | Editorial Board | Governance | Licensing | Disclaimers | Avoid Plagiarism | Policies
Linked-in.jpg
Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox
In other languages