Pareto interpolation
You don't need to be Editor-In-Chief to add or edit content to WikiDoc. You can begin to add to or edit text on this WikiDoc page by clicking on the edit button at the top of this page. Next enter or edit the information that you would like to appear here. Once you are done editing, scroll down and click the Save page button at the bottom of the page.
Pareto interpolation is a method of estimating the median and other properties of a population that follows a Pareto distribution. It is used in economics when analysing the distribution of incomes in a population, when one must base estimates on a relatively small random sample taken from the population.
The family of Pareto distributions is parameterized by
- a positive number κ that is the smallest value that a random variable with a Pareto distribution can take. As applied to distribution of incomes, κ is the lowest income of any person in the population; and
- a positive number θ the "Pareto index"; as this increases, the tail of the distribution gets thinner. As applied to distribution of incomes, this means that the larger the value of the Pareto index θ the smaller the proportion of incomes many times as big as the smallest incomes.
Pareto interpolation can be used when the available information includes the proportion of the sample that falls below each of two specified numbers a < b. For example, it may be observed that 45% of individuals in the sample have incomes below a = $35,000 per year, and 55% have incomes below b = $40,000 per year.
Let
- Pa = proportion of the sample that lies below a;
- Pb = proportion of the sample that lies below b.
Then the estimates of κ and θ are
- Failed to parse (PNG conversion failed;
check for correct installation of latex, dvips, gs, and convert): \widehat{\kappa} = \left( \frac{P_b - P_a} { \left(1/a^\widehat{\theta}\,\right) - \left(1/b^\widehat{\theta}\,\right)} \right)^{ 1/\widehat{\theta}}
and
The estimate of the median would then be
since the actual population median is
References
- U.S. Census Bureau, Memorandum on statistical techniques used in 2001 income survey (PDF). See Equation 10 on p24.
- Stults, Brian J, Deriving median household income. Gives a derivation of the equations for Pareto interpolation.
Acknowledgement and Attribution Regarding Sources of Content
Some of the initial content on this page may be incorporated in part from copyleft sources in the public domain including wikis such as Wikipedia and AskDrWiki. Drug information for patients came from the The National Library of Medicine. Infectious disease information may have come from the Centers for Disease Control (CDC). Differential Diagnoses are drawn from clinicians as well as an amalgamation of 3 sources: 1.The Disease Database; 2. Kahan, Scott, Smith, Ellen G. In A Page: Signs and Symptoms. Malden, Massachusetts: Blackwell Publishing, 2004:3; 3. Sailer, Christian, Wasner, Susanne. Differential Diagnosis Pocket. Hermosa Beach, CA: Borm Bruckmeir Publishing LLC, 2002:7 .

