Statistics Tutorial: Bayes' Theorem (aka, Bayes' Rule)
Bayes' theorem (also known as Bayes' rule) is a useful tool for calculating
conditional probabilities. Bayes' theorem can be stated as follows:
Bayes' theorem. Let A 1, A 2,
... , A n be a set of mutually exclusive events that together form
the sample space S. Let B be any event from the same sample space, such that
P(B) > 0. Then,
| P( Ak | B ) =
|
P( Ak
∩ B )
P( A1 ∩
B ) + P( A2 ∩
B ) + . . . + P( An ∩ B )
|
Note: Invoking the fact that P( A k ∩
B ) = P( A k )P( B | A k ), Baye's theorem can
also be expressed as
| P( Ak | B ) =
|
P( Ak ) P( B | Ak )
P( A1 ) P( B | A1 ) + P( A2 ) P( B | A2
) + . . . + P( An ) P( B | An )
|
Unless you are a world-class statiscian, Bayes' theorem (as expressed above) can
be intimidating. However, it really is easy to use. The remainder of this
lesson covers material that can help you understand when and how to apply
Bayes' theorem effectively.
When to Apply Bayes' Theorem
Part of the challenge in applying Bayes' theorem involves recognizing the types
of problems that warrant its use. You should consider Bayes' theorem when the
following conditions exist.
-
The sample space is
partitioned into a set of
mutually exclusive events { A1, A2, . . . , An
}.
-
Within the sample space, there exists an
event
B, for which P(B) > 0.
-
The analytical goal is to compute a conditional probability of the form: P( Ak
| B ).
-
You know at least one of the two sets of probabilities described below.
-
P( Ak ∩
B ) for each Ak
-
P( Ak ) and P( B | Ak ) for each Ak
Bayes Rule
Calculator
Use the Bayes Rule Calculator to compute conditional probability, when
Bayes' theorem can be applied. The calculator is free, and it is easy to use.
It can be found under the Tools
menu item, which appears in the header of every Stat Trek web page.
Sample Problem
Bayes' theorem can be best understood through an example. This section presents
an example that demonstrates how Bayes' theorem can be applied effectively to
solve statistical problems.
Example 1
Marie is getting married tomorrow, at an outdoor ceremony in the desert. In
recent years, it has rained only 5 days each year. Unfortunately, the
weatherman has predicted rain for tomorrow. When it actually rains, the
weatherman correctly forecasts rain 90% of the time. When it doesn't rain, he
incorrectly forecasts rain 10% of the time. What is the probability that it
will rain on the day of Marie's wedding?
Solution: The sample space is defined by two mutually-exclusive events -
it rains or it does not rain. Additionally, a third event occurs when the
weatherman predicts rain. Notation for these events appears below.
-
Event A1. It rains on Marie's wedding.
-
Event A2. It does not rain on Marie's wedding
-
Event B. The weatherman predicts rain.
In terms of probabilities, we know the following:
-
P( A1
) = 5/365 =0.0136985 [It rains 5 days out of the year.]
-
P( A2
) = 360/365 = 0.9863014 [It does not rain 360 days out of the year.]
-
P( B | A1
) = 0.9 [When it rains, the weatherman predicts rain 90% of the time.]
-
P( B | A2 ) = 0.1 [When it does not rain, the weatherman predicts
rain 10% of the time.]
We want to know P( A1 | B ), the probability it will rain on the day
of Marie's wedding, given a forecast for rain by the weatherman. The answer can
be determined from Bayes' theorem, as shown below.
| P( A1 | B ) =
|
P( A1 ) P( B | A1 )
P( A1 ) P( B | A1 ) + P( A2 ) P( B | A2
)
|
| P( A1 | B ) =
|
(0.014)(0.9) / [ (0.014)(0.9) + (0.986)(0.1) ]
|
| P( A1 | B ) =
|
0.111
|
Note the somewhat unintuitive result. When the weatherman predicts rain, it
actually rains only about 11% of the time. Despite the weatherman's gloomy
prediction, there is a good chance that Marie will not get rained on at her
wedding.
This is an example of something called the false positive paradox. It
illustrates the value of using Bayes theorem to calculate conditional
probabilities.
|