Behrens-Fisher problem

You don't need to be Editor-In-Chief to add or edit content to WikiDoc. You can begin to add to or edit text on this WikiDoc page by clicking on the edit button at the top of this page. Next enter or edit the information that you would like to appear here. Once you are done editing, scroll down and click the Save page button at the bottom of the page.

Jump to: navigation, search

Editor-In-Chief: C. Michael Gibson, M.S., M.D. [1] Phone:617-632-7753

Please Take Over This Page and Apply to be Editor-In-Chief for this topic: There can be one or more than one Editor-In-Chief. You may also apply to be an Associate Editor-In-Chief of one of the subtopics below. Please mail us [2] to indicate your interest in serving either as an Editor-In-Chief of the entire topic or as an Associate Editor-In-Chief for a subtopic. Please be sure to attach your CV and or biographical sketch.

Overview

In statistics, the Behrens-Fisher problem is the problem of interval estimation and hypothesis testing concerning the difference between the means of two normally distributed populations when the variances of the two populations are not assumed to be equal, based on two independent samples.

Ronald Fisher in 1935 introduced fiducial inference in order to apply it to this problem. He referred to an earlier paper by W. V. Behrens from 1929. Behrens and Fisher proposed to find the probability distribution of

 T \equiv {\bar x_1 - \bar x_2 \over \sqrt{s_1^2/n_1 + s_2^2/n_2}}

where  \bar x_1 and  \bar x_2 are the two sample means, and s1 and s2 are their standard deviations. Fisher approximated the distribution of this by ignoring the random variation of the relative sizes of the standard deviations,  {s_1 / \sqrt{n_1} \over \sqrt{s_1^2/n_1 + s_2^2/n_2}} .

Fisher's solution provoked controversy because it did not have the property that the hypothesis of equal means would be rejected with probability α if the means were in fact equal. Many other methods of treating the problem have been proposed since.

Welch's approximate t solution

The most widely used method (for example in statistical packages and in Microsoft Excel) is that of B. L. Welch, who, like Fisher, was at University College London. The variance of the mean difference \bar d =\bar x_1 - \bar x_2 results in  s_{\bar d}^2 = s_1^2/n_1 + s_2^2/n_2. Welch (1938) approximated the distribution of s_{\bar d}^2 by that Typ III Pearson distribution (a scaled chi-squared distribution) whose first two moments agree with that of s_{\bar d}^2. This applies to the following number of degrees of freedom (d.f.), which is generally non-integer:

 \nu = {(\gamma_1 + \gamma_2)^2 \over \gamma_1^2/(n_1-1) + \gamma_2^2/(n_2-1)} \qquad{\rm where~~}\gamma_i = \sigma_i^2/n_i.

Under the null hypothesis of equal expectations, μ1 = μ2, the distribution of the Behrens Fisher statistic T, which also depends on the variance ratio \sigma_1^2/\sigma_2^2, could now be approximated by Student's t distribution with these ν degrees of freedom. But this ν contains the population variances \sigma_i^2, and these are unknown. The following estimate only replaces the population variances by the sample variances:

\hat\nu = {(g_1 + g_2)^2 \over g_1^2/(n_1-1) + g_2^2/(n_2-1)} \qquad{\rm where~~}g_i = s_i^2/n_i.

This \hat\nu is a random variable. A t distribution with a random number of degrees of freedom does not exist. Nevertheless, the Behrens Fisher T can be compared with a corresponding quantile of Student's t distribution with these estimated number of degrees of freedom, \hat\nu, which is generally non-integer. In this way, the boundary between acceptance and rejection region of the test statistic T is described by a smooth function dependent on the empirical variances s_i^2.

This method also does not give exactly the nominal rate, but is generally not too far off. However, if the population variances are equal, or if the samples are rather small and the population variances can be assumed to be approximately equal, it is more accurate to use the standard method, which is the two-sample t-test.

References and external links


WikiDoc Help Menu

Quick Start..

Editing basics

Advanced editing

Communicating your edits

Help Videos You Can Watch

Acknowledgement and Attribution Regarding Sources of Content

Some of the initial content on this page may be incorporated in part from copyleft sources in the public domain including wikis such as Wikipedia and AskDrWiki. Drug information for patients came from the The National Library of Medicine. Infectious disease information may have come from the Centers for Disease Control (CDC). Differential Diagnoses are drawn from clinicians as well as an amalgamation of 3 sources: 1.The Disease Database; 2. Kahan, Scott, Smith, Ellen G. In A Page: Signs and Symptoms. Malden, Massachusetts: Blackwell Publishing, 2004:3; 3. Sailer, Christian, Wasner, Susanne. Differential Diagnosis Pocket. Hermosa Beach, CA: Borm Bruckmeir Publishing LLC, 2002:7 .

related articles
viewed previously [ + ]

often viewed next [ + ]