Search This Blog

Friday, March 29, 2019

MAT 209 - Statistics Chapter 6 - Probability Distributions

THE FOLLOWING NOTES USE EXTENSIVE MS Word Math Equations that will NOT transfer over to HTML.

Chapter 6 Probability Distributions

Random variable = Quantities that can take on different values depending on chance.
·      Discrete random variable – Finite # of values (Ex: Dice)
·      Countable infinity of values – As many values as whole numbers
Probability Distribution assigns a probability to each measurable subset of the possible outcomes of a random experiment, survey, or procedure of statistical inference.

6.1 Probability Distributions

Probability distribution of random variable (single die) example assigns probability to values:
Number rolled on Die
Probability
1
1/6
2
1/6
3
1/6
4
1/6
5
1/6
6
1/6
Probability distribution written as function:
Two general rules for probability distribution:
1.     Values of probability distribution must be numbers between 0 and 1 inclusive.
2.     A random variable has to take on one of its values, sum of all values = 1.

6.2 Binomial Distribution

Borrowing the language of games of chance for the probability that an event will occur x times out of n

We assume that for this section …
1.     There are a fixed number of trials resulting in success/failure.
2.     The probability of success is the same for each trial.
3.     Trials are all independent.

Example p.203 Student reduced to guessing on 8 questions on a multiple-choice test. Each question has 5 choices, but ONLY one is correct. If C = correct answer and I = incorrect answer what are probability that the student will get 6 questions right and 2 questions wrong.

One possible outcome is CCCCCCII.
Since P(C) = 1/5 and P(I) = 4/5 à CCCCCCII = (⅕)(⅕)(⅕)(⅕)(⅕)(⅕)(⅘)(⅘) = (⅕)6(⅘)2
However, the CCCCCCII is only one possibility à Need to count the ways of arranging 6 C’s and 2 I’s à Combination  à multiply by probability of 6 C’s and 2 I’s à

Binomial Distribution: If n = # of trials, p = probability of success of each trial, and all trials are independent à probability of x successes in n trials is …
Jargon: Number of successes in n trials is a random variable having binomial probability distribution/binomial distribution.

Example 6.5 Calculate the probability that 2 of 5 reviewers will like a new movie, given that the probability that any one of them will like is 0.70. Plug in for binomial distribution formula.
X = 2, n = 5, p =0.70

Sampling a binomial population = When we observe a value of a random variable having the binomial distribution.

6.3 Hypergeometric Distribution

Different from binomial distribution – selection with NO replacement.
If n objects are chosen at random from a set consisting of a objects of type A and b objects of type B, and the selection is without replacement à probability of “x of type A and nx of type B” is …
Example 6.10 A department store has 16 delivery trucks, 5 have worn brakes. If 3 randomly picked for general overhaul, find the separate probabilities that these 3 will include 0,1,2, or 3 trucks with worn brakes.
Problem has sampling without replacement. Therefore 5 trucks (a) have worn brakes (Type A = Trucks with worn brakes). 11 trucks (b) are fine without worn brakes (Type B = Trucks w/o worn brakes). 3 Random trucks taken (n). Chance of success or chance of getting 0, 1, 2, or 3 trucks (x) with worn brakes in the random sample of 3.
f(0) = (1*165)/560 ≈ 0.295
f(1) = (5*55)/560 ≈ 0.491
f(2) = (10*11)/560 ≈ 0.196
f(3) = (10*1)/560 ≈ 0.018

*Binomial distribution & Hypergeometric distribution values are close enough to each other (as demonstrated in the textbook example 6.13) when certain conditions are fulfilled:
Sample does NOT exceed 5% of population à .

Approximation of binomial distribution and hypergeometric distribution is useful b/c binomial distribution is less complicated (binomial has two parameters n & p vs. hypergeometric’s three a, b, and n.

6.4 Poisson Distribution

When n (number of trials) is large and p (probability of trial success) is small (when n≥100 and (n)*(p)<10), binomial distribution changes to
Poisson distribution approximation to binomial distribution:

Poisson Distribution useful b/c it has only one parameter (np) vs binomial distribution (2) and hypergeometric distribution (3).

Also has a generalized Poisson Distribution (with parameter λ):
 Where, λ = expected/average number of successes, can be applied to situations where we expect a fixed number of successes per unit time (or some other kind of unit).

Example 6.14 If 1% of labels in a large shipment of jars are defective, use the Poisson distribution to find the probability that among 400 randomly inspected jars, only 3 will have defective labels.
X = 3   n = 400          p = 0.01          (n)(p) = 4      e-4 = 0.018
f(3) = (0.018)(43/3!) = 0.1920

There are cases where the hypergeometric distribution ≈ Poisson distribution.
NO replace + large n values.

Example 6.16 Auditor asked to investigate a collection of 4,000 sales invoices, of which 28 contain errors. Sample of 150 invoices is selected. What is the probability that this set of 150 invoices will contain exactly 2 with errors?
Hypergeometric problem b/c NO replacement à a = 28, b = 3972, n = 150, and x = 2.
 since, 0.05(4000) = 200 and 150 < 200 à hypergeometric distribution ≈ binomial distribution.

n > 100 and np <10 since, 150 > 100 and (150)(0.007) = 1.05 < 10 à Poisson approximation ≈ binomial distribution.

Example 6.17 Bank receives on average λ = 6 bad checks/day, what is the probability that it will receive 4 bad checks on any given day?
SUMMARY:
Binomial distribution – Repeated independent trials.
Hypergeometric distribution – Sample w/o replacement from a finite population.
Poisson distribution – Counted events w/ NO fixed upper limit.

6.5 Multinomial Distribution

When:
·      More than 2 possible outcomes for each trial
·      Probability of various outcomes remain the same each trial
·      Trials are all independent
Mutlinomial Distribution: If k possible outcomes for each trial and their probabilities are p1… and pk, it can be shown that the probability of x1 outcomes of first kind … and xk outcomes of the kth kind in n trials. Requirements are that x1 + x2 + … + xk = n   and     p1 + p2 + … +pk = 1.

Example 6.18 City has 3 channels for viewing on Saturday night. Each channel has varying percentage of viewing audiences. Channel 3 has 50% of viewing audience, channel 12 has 30%, channel 10 has 20%. What is the probability that among 8 TV viewers randomly selected in that city on Saturday night, 5 will be watching channel 3, 2 will be watching channel 12, and 1 will be watching channel 10?
N = 8, x1 = 5, x2 = 2, x3 = 1, p1 =0.50, p2 = 0.30, and p3 = 0.20

6.6 Mean Probability Distribution

Mean of Probability Distribution: Multiply the outcome by its probability and add all values.
Where µ = mean of probability distribution, x = random variable, f(x) = probability for x.
*Similar to mean of population related mean (denoted by µ).

Above formula can be further simplified for mean of binomial distributions:
Where µ = mean of probability distribution, n = trials, p = probability of success for each trial.

6.7 Standard Deviation of a Probability Distribution

Similar procedure for measuring variability, BUT instead of averaging square deviations from the mean calculate expected value.
Variance of Probability Distribution:
Where σ = standard deviation, σ2 = variance (of probability distribution in this case),
µ = probability distribution mean, f(x) = probability for x, x = random variable.

Standard Deviation of Probability Distribution
Note the similarities to standard deviation of population.

Example 6.6 – Expansion Example 6.22
Probability is 0.60 that a person shopping at a certain market will spend at least $50.
Find probability that among 5 people shopping at this market, 0, 1, 2, 3, 4, or 5 will at least spend $50.
f(0) =  = 0.010
f(1) =  = 0.077
f(2) =  = 0.230
f(3) =  = 0.346
f(4) =  = 0.259
f(5) =  = 0.078
Determine the standard deviation of distribution among the five people who will spend at least $50 at the market.
µ = 0(0.01024) + 1(0.077) + 2(0.230) + 3(0.346) + 4(0.259) + 5(0.078) = 3.001
Number of Persons, x
X - µ
(x-µ)2
Probability f(x)
(x-µ)2*f(x)
0
-3
9
0.010
0.090
1
-2
4
0.077
0.308
2
-1
1
0.230
0.230
3
0
0
0.346
0
4
1
1
0.259
0.259
5
2
4
0.078
0.312
Total σ2 = 1.199        σ = (1.199) ^(1/2) = 1.0949885 ~ 1.095

Standard deviation of probability distribution can be simplified when dealing with other types of probability distributions. For example: Standard Deviation of Binomial distribution
where σ = standard deviation of binomial distribution, n = # of trials, p = probability of success.

Example 6.24 Find variance of probability distribution of number of heads obtained in 3 flips of a balanced coin.
n = 3, p = 0.5             1 – p = 0.5
σ2 = (3)(0.5)(1-0.5) = 0.75

For hypergeometric distributions the variance is given by .

For Poisson distribution standard deviation is σ =(λ)^½ , with variance being σ2 = λ.

6.8 Chebyshev’s Theorem

Variance & standard deviation of probability distribution measure expected size of fluctuation of random variable.
When σ is small à Likely to be close to mean
When σ is large à Likely to far from mean

Chebyshev’s Theorem for probability: The probability that a random variable will take on a value within k standard deviations of the mean is at least 1 – (1/k2).

Therefore the probability of getting a value within 2 standard deviations (2σ) is at least ¾ and the probability of getting a value within 5 deviations (5σ) is at least 24/25.

Example 6.26 What does Chebyshev’s theorem with k =6 tell us about the number of heads that we may get in 400 flips of a balanced coin?
N = 400          p = ½ = 0.5    µ = n*p = 400*0.5 = 200
σ = (n*p*(1-p)) ½ = (400*0.5*(1-0.5))½ = 10       à        µ ± 6σ = 140 or 260
Probability of at least 1 – (1/62) = 35/36 that in 400 flips there will be between 140 and 260 heads.