In order to provide the highest quality information on Wikidoc, we've had to implement a human verification system called ReCaptcha to combat attempts to deface the website. If the Captcha technology proves difficult to use, please send an email to bugs AT wikidoc.org with your username and we will be happy to lift the Captcha requirement from your account. We apologize for any inconvenience.

Power transform

You don't need to be Editor-In-Chief to add or edit content to WikiDoc. You can begin to add to or edit text on this WikiDoc page by clicking on the edit button at the top of this page. Next enter or edit the information that you would like to appear here. Once you are done editing, scroll down and click the Save page button at the bottom of the page.
(Redirected from Box-Cox transformation)
Jump to: navigation, search

WikiDoc Resources for

Power transform

Articles

Most recent articles on Power transform

Most cited articles on Power transform

Review articles on Power transform

Articles on Power transform in N Eng J Med, Lancet, BMJ

Media

Powerpoint slides on Power transform

Images of Power transform

Photos of Power transform

Podcasts & MP3s on Power transform

Videos on Power transform

Evidence Based Medicine

Cochrane Collaboration on Power transform

Bandolier on Power transform

TRIP on Power transform

Clinical Trials

Ongoing Trials on Power transform at Clinical Trials.gov

Trial results on Power transform

Clinical Trials on Power transform at Google

Guidelines / Policies / Govt

US National Guidelines Clearinghouse on Power transform

NICE Guidance on Power transform

NHS PRODIGY Guidance

FDA on Power transform

CDC on Power transform

Books

Books on Power transform

News

Power transform in the news

Be alerted to news on Power transform

News trends on Power transform

Commentary

Blogs on Power transform

Definitions

Definitions of Power transform

Patient Resources / Community

Patient resources on Power transform

Discussion groups on Power transform

Patient Handouts on Power transform

Directions to Hospitals Treating Power transform

Risk calculators and risk factors for Power transform

Healthcare Provider Resources

Symptoms of Power transform

Causes & Risk Factors for Power transform

Diagnostic studies for Power transform

Treatment of Power transform

Continuing Medical Education (CME)

CME Programs on Power transform

International

Power transform en Espanol

Power transform en Francais

Business

Power transform in the Marketplace

Patents on Power transform

Experimental / Informatics

List of terms related to Power transform

Please Take Over This Page and Apply to be Editor-In-Chief for this topic: There can be one or more than one Editor-In-Chief. You may also apply to be an Associate Editor-In-Chief of one of the subtopics below. Please mail us [1] to indicate your interest in serving either as an Editor-In-Chief of the entire topic or as an Associate Editor-In-Chief for a subtopic. Please be sure to attach your CV and or biographical sketch.

In statistics, the power transform is a family of transformations that map data from one space to another using power functions. This is a useful data (pre)processing technique used to reduce data variation, make the data more normal distribution-like, improve the correlation between variables and for other data stabilization procedures. The Box–Cox transformation, by statisticians George E. P. Box and David Cox, is one particular way of parameterising a power transform that has advantageous properties.

Definition

The power transformation is defined as a continuously varying function, with respect to the power parameter λ, in a piece-wise function form that makes it continuous at the point of singularity (λ = 0). For data vectors (y1,..., yn) in which each yi > 0, the power transform is

<math>y_i^{(\lambda)} =

\begin{cases} \dfrac{y_i^\lambda-1}{\lambda(\operatorname{GM}(y))^{\lambda -1}} , &\mbox{ if } \lambda \neq 0 \\ \\ \operatorname{GM}(y)\log{y_i} , &\mbox{ if } \lambda = 0 \end{cases} </math>

where

<math> \operatorname{GM}(y) = (y_1\cdots y_n)^{1/n} \, </math>

is the geometric mean of the observations y1, ..., yn.

The inclusion of the (λ − 1)th power of the geometric mean in the denominator implies that the units of measurement do not change as λ changes. That makes it possible to compare sums of squares of residuals and choose the value of λ that minimizes that sum.

The value at Y = 1 for any λ is 0, and the derivative with respect to Y there is 1 for any λ. Sometimes Y is a version of some other variable scaled to give Y = 1 at some sort of average value.

The transformation is a power transformation, but done in such a way as to make it continuous with the parameter λ at λ = 0. It has proved popular in regression analysis, including econometrics.

Box and Cox also proposed a more general form of the transformation that incorporates a shift parameter.

<math>\tau(y_i;\lambda, \alpha) = \begin{cases} \dfrac{(y_i + \alpha)^\lambda - 1}{\lambda (\operatorname{GM}(y))^{\lambda - 1}} & \mathrm{if}\ \lambda\neq 0, \\ \\

\operatorname{GM}(y)\ln(y_i + \alpha)& \mathrm{if}\ \lambda=0.\end{cases}</math>

If τ(Y, λ, α) follows a truncated normal distribution, then Y is said to follow a Box–Cox distribution.

Use of the power transform

Power transform activities

The SOCR resource pages contain a number of hands-on interactive activities with the Power Transform using Java applets and charts.

Example

The BUPA liver data set contains data on liver enzymes ALT and γGT. The data can be found via the classic data sets page. Suppose we are interested in using log(γGT) to predict ALT. A plot of the data appears in panel (a) of the figure. There appears to be non-constant variance, and a Box–Cox transformation might help.

BUPA BoxCox.JPG

The log-likelihood of the power parameter appears in panel (b). The horizontal reference line is at a distance of χ12/2 from the maximum and can be used to read off an approximate 95% confidence interval for λ. It appears as though a value close to zero would be good, so we take logs.

Possibly, the transformation could be improved by adding a shift parameter to the log transformation. Panel (c) of the figure shows the log-likelihood. In this case, the maximum of the likelihood is close to zero suggesting that a shift parameter is not needed. The final panel shows the transformed data with a superimposed regression line.

Note that although Box–Cox transformations can make big improvements in model fit, there are some issues that the transformation cannot help with. In the current example, the data are rather heavy-tailed so that the assumption of normality is not realistic and a robust regression approach leads to a more precise model.

Econometric application

Economists often characterize production relationships by some variant of the Box–Cox transformation.

Consider a common representation of production Q as dependent on services provided by a capital stock K and by labor hours N:

<math>\tau(Q)=\alpha \tau(K)+ (1-\alpha)\tau(N).\,</math>

Solving for Q by inverting the Box–Cox transformation we find

<math>Q=\big(\alpha K^\lambda + (1-\alpha) N^\lambda\big)^{1/\lambda},\,</math>

which is known as the constant elasticity of substitution (CES) production function.

The CES production function is a homogeneous function of degree one.

When λ = 1, this produces the linear production function:

<math>Q=\alpha K + (1-\alpha)N.\,</math>

When λ → 0 this produces the famous Cobb-Douglas production function:

<math>Q=K^\alpha N^{1-\alpha}.\,</math>

Activities and demonstrations

The SOCR resource pages contain a number of hands-on interactive activities demonstrating the Box–Cox (Power) Transformation using Java applets and charts. These directly illustrate the effects of this transform on Qq plots, X-Y scatterplots, time-series plots and histograms.

References

  • Box, George E. P.; Cox, D. R. (1964). "An analysis of transformations". Journal of the Royal Statistical Society, Series B 26: 211–246.
  • Carroll, RJ and Ruppert, D. On prediction and the power transformation family. Biometrika 68: 609–615.
  • DeGroot, M. H. (1987). "A Conversation with George Box". Statistical Science 2: 239–258. doi:10.1214/ss/1177013223.
  • Handelsman, DJ. Optimal Power Transformations for Analysis of Sperm Concentration and Other Semen Variables. Journal of Andrology, Vol. 23, No. 5, September/October 2002.
  • Gluzman, S and Yukalov, VI. Self-similar power transforms in extrapolation problems. Journal of Mathematical Chemistry, Volume 39, Number 1 / January, 2006, DOI 10.1007/s10910-005-9003-7, 47–56.
  • Howarth, RJ and Earle, SAM. Application of a generalized power transformation to geochemical data Journal Mathematical Geology, Volume 11, Number 1 / February, 1979, DOI 10.1007/BF01043245, pages 45–62.
  • Peters, JL Rushton, L, Sutton, AJ, Jones, DR, Abrams, KR, Mugglestone, MA. (2005) Bayesian methods for the cross-design synthesis of epidemiological and toxicological evidence. Journal of the Royal Statistical Society: Series C (Applied Statistics) 54 (1), 159–172, doi:10.1111/j.1467-9876.2005.00476.x

External links


de:Box-Cox-Transformation eu:Box-Cox aldakuntza


WikiDoc Help Menu

Quick Start..

Editing basics

Advanced editing

Communicating your edits

Help Videos You Can Watch

[edit] Acknowledgement and Attribution Regarding Sources of Content

Some of the initial content on this page may be incorporated in part from copyleft sources in the public domain including wikis such as Wikipedia and AskDrWiki. Drug information for patients came from the The National Library of Medicine. Infectious disease information may have come from the Centers for Disease Control (CDC). Differential Diagnoses are drawn from clinicians as well as an amalgamation of 3 sources: 1.The Disease Database; 2. Kahan, Scott, Smith, Ellen G. In A Page: Signs and Symptoms. Malden, Massachusetts: Blackwell Publishing, 2004:3; 3. Sailer, Christian, Wasner, Susanne. Differential Diagnosis Pocket. Hermosa Beach, CA: Borm Bruckmeir Publishing LLC, 2002:7 .

Personal tools
Wikidoc Board Review
In other languages