Bayes estimator

You don't need to be Editor-In-Chief to add or edit content to WikiDoc. You can begin to add to or edit text on this WikiDoc page by clicking on the edit button at the top of this page. Next enter or edit the information that you would like to appear here. Once you are done editing, scroll down and click the Save page button at the bottom of the page.

Jump to: navigation, search

In decision theory and estimation theory, a Bayes estimator is an estimator or decision rule that maximizes the posterior expected value of a utility function or minimizes the posterior expected value of a loss function. (See also prior probability.)

Specifically, suppose an unknown parameter θ is known to have a prior distribution Π. Let δ be an estimator of θ (based on some measurements), and let R(θ,δ) be a risk function, such as the mean squared error. The Bayes risk of δ is defined as EΠ{R(θ,δ)}, where the expectation is taken over the probability distribution of θ. An estimator δ is said to be a Bayes estimator if it minimizes the Bayes risk among all estimators.

Examples

Risk functions are chosen depending on how one measures the distance between the estimate and the unknown parameter. Following are several examples of risk functions and the corresponding Bayes estimators. We denote the posterior generalized distribution function as F.

  1. If we take the mean squared error as a risk function, then it is not difficult to show that the Bayes' estimate of the unknown parameter is simply the posterior mean,
    \widehat{\theta }(x) = E[\theta |X]=\int \theta f(\theta |x)\,d\theta.
    The Bayes risk, in this case, is the posterior variance.
  2. A "linear" loss function, with a > 0, which yields the posterior median as the Bayes' estimate:
     L(\theta,\widehat{\theta}) = a|\theta-\widehat{\theta}|
     F(\widehat{\theta }(x)|X) = \tfrac{1}{2}
  3. Another "linear" loss function, which assigns different "weights" a,b > 0 to over or sub estimation. It yields a quantile from the posterior distribution, and is a generalization of the previous loss function:
     L(\theta,\widehat{\theta}) = \left\{\begin{matrix}
  a|\theta-\widehat{\theta}| & \mbox{for }\theta-\widehat{\theta} \ge 0 \\
  b|\theta-\widehat{\theta}| & \ \ \ \mbox{for }\theta-\widehat{\theta} < 0
  \end{matrix}\right.
     F(\widehat{\theta }(x)|X) = \frac{a}{a+b}
  4. The following loss function is trickier: it yields either the posterior mode, or a point close to it depending on the curvature and properties of the posterior distribution. Small values of the parameter K > 0 are recommended, in order to use the mode as an aproximation (L > 0):
     L(\theta,\widehat{\theta}) = \left\{\begin{matrix}
  0 & \mbox{for }|\theta-\widehat{\theta}| < K \\
  L & \ \ \ \mbox{for }|\theta-\widehat{\theta}| \ge K 
  \end{matrix}\right.

Other loss functions can be conceived, although the mean squared error is the most widely used and validated.

See also

External links



WikiDoc Help Menu

Quick Start..

Editing basics

Advanced editing

Communicating your edits

Help Videos You Can Watch

Acknowledgement and Attribution Regarding Sources of Content

Some of the initial content on this page may be incorporated in part from copyleft sources in the public domain including wikis such as Wikipedia and AskDrWiki. Drug information for patients came from the The National Library of Medicine. Infectious disease information may have come from the Centers for Disease Control (CDC). Differential Diagnoses are drawn from clinicians as well as an amalgamation of 3 sources: 1.The Disease Database; 2. Kahan, Scott, Smith, Ellen G. In A Page: Signs and Symptoms. Malden, Massachusetts: Blackwell Publishing, 2004:3; 3. Sailer, Christian, Wasner, Susanne. Differential Diagnosis Pocket. Hermosa Beach, CA: Borm Bruckmeir Publishing LLC, 2002:7 .