*Variance of binomial distro
OK, what's that "*" doing up there? Well that means that if you look down
the page and you start to feel like you might lose your lunch, just skip to
eqn. 1.54 at the end of this section to see the final result.
Now let's do a simple variation on this. as in problem 1.5.5,
you get 0 for tails, and 1 for heads. So the mean for 1 trial is .
What's the variance?
 |
(1.45) |
This is a standard deviation
. This makes sense, because
it's the same as the previous example except the peaks are now separated
by half the distance, so you expect half the width.
Now let's consider what you get with 10 tosses. What happens is now
. For tosses
.
The mean we found out was . So they are both proportional to .
How do I know this is the answer? What, you think I looked it up?
You can do this following the methods we used in section 1.5.6.
We can write down the variance directly from the binomial distribution, just like
we did for the mean (eqn. 1.37).
 |
|
|
(1.46) |
 |
|
|
|
As with the mean, you can do a bad-ass differentiation trick to
get the answer, it's a bit messy, but it works. However I'll
follow the same kind of reasoning we talked about for the mean
to get the answer.
So you use the same variable as we did before
.
Now we want to calculate the variance, so let's see how far we can get:
![\begin{displaymath}
\sigma^2 = \langle (X-\langle X\rangle)^2\rangle = \langle (...
...=
\langle (\sum_{i=1}^n [x_i -\langle x_i \rangle])^2\rangle
\end{displaymath}](img214.png) |
(1.47) |
Well we've got pretty far, but not quite far enough. Now we have the square
of a sum. How do you handle that? Think of the case ,
. This looks like a double summation of
two indices. So generalizing this, we see that that summation in the above
equation can be written as a double sum:
![\begin{displaymath}
\sigma^2 =
\langle \sum_{i=1}^n \sum_{j=1}^n [x_i -\langle x_i \rangle] [x_j -\langle x_j \rangle] \rangle
\end{displaymath}](img217.png) |
(1.48) |
But the average of a sum is the sum of the average (even for a double sum) so
 |
(1.49) |
Now there are two kinds of terms in this double sum; ones with and with .
If , for example
then we can use independence to figure out the answer. In that case
 |
(1.50) |
Because as we saw in section 1.5.6 for independent variables,
the average of the product is the product of the average.
But
is just a constant so averaging that gives doesn't
change it. So
 |
(1.51) |
So all terms with vanish. We're only left with the terms in
other words a single sum:
 |
(1.52) |
Well each term in the sum is just the variance for a single trial.
Let's compute that, say for :
 |
|
|
(1.53) |
 |
|
|
|
Good, you're still awake! Must have had some pretty strong coffee. Well
we're almost there. Now we sum over all terms and obtain
 |
(1.54) |
That's a pretty simple answer. I could've just written it
down and left it that. But that would've taken away all the fun!
Josh Deutsch
2009-03-05
|