# Probability


## Randomness

Randomness describes a phenomenon in which the outcome of a single repetition is uncertain, but there is nonetheless a regular distribution of relative frequencies in a large number of repetitions.

* https://www.statisticsteacher.org/2019/09/19/building-understanding-of-randomness-from-ideas-about-variation-and-expectation/
* https://plato.stanford.edu/entries/chance-randomness/


## Randomized Controlled Trials

Confounding Variables

* In an observational study, if the treatment and control groups differ in ways other than the treatment, it is difficult to make conclusions about causality
* An underlying difference between the two groups (other than the treatment) is called a confounding factor, because it might confound you (that is, mess you up) when you try to reach a conclusion
* A confounding variable is a variable that is not included in an experiment but affects the relationship between two variables in the experiment. Confounding variables can distort or mask the effect of another variable on the outcome being studied (for example, a disease)
* Confounding variables can cause two major problems: they can increase variance and they can introduce bias

Blind Experiment

* If you are able to randomize individuals into the treatment and control groups, you are running a randomized controlled experiment, also known as a randomized controlled trial (RCT)
* Sometimes, people’s responses in an experiment are influenced by their knowing which group they are in
* So you might want to run a blind experiment in which individuals do not know whether they are in the treatment group or the control group
* To make this work, you will have to give the control group a **placebo**, which is something that looks exactly like the treatment but in fact has no effect

Treatment Group

* Treatment groups are also known as experimental groups. In an experiment, a treatment group receives the treatment that the researcher is interested in

Control Group

* The control group does not receive the treatment
* Instead, the control group serves as a comparison group for the treatments
* Researchers compare the results of the treatment group to the control group to determine the effect size (how much difference between the groups)
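
The random assignment described above can be sketched as follows. This is a minimal illustration, not a trial protocol: the participant IDs, the `assign_groups` helper, and the even split are all assumptions made for the example.

```python
import random


def assign_groups(participants, seed=None):
    """Randomly split participants into a treatment and a control group."""
    rng = random.Random(seed)      # seeded so the split is reproducible
    shuffled = list(participants)  # copy so the caller's list is untouched
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


# Hypothetical participant IDs, for illustration only.
participants = [f"P{i:02d}" for i in range(1, 21)]
treatment, control = assign_groups(participants, seed=42)
```

Because chance alone decides who lands in which group, any confounding factor should be spread roughly evenly across both groups.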


## Probability

* Outcomes you are looking for / total possible outcomes
* Terms you may see: events and sample space
* Combinations, permutations, factorials
* Intersections, unions, and conditional probability

https://www.statology.org/variance-of-probability-distribution/<br />
https://www.probabilitycourse.com/preface.php

## In the Beginning: Availability Bias

* Greeks laid foundation for math
* Greeks had problems with probability because they believed that what happens is the will of the gods
* Greeks insisted on absolute truth
* We can't count on memory to determine the outcomes we are looking for and the total possible outcomes
* Reconstructing the past, we give unwarranted importance to most vivid memories
* Our understanding of probability is often distorted by the events that we then associate with reality
* Arithmetic, as we know it, didn't exist in Greek times
* The concept of 0 came to Greece via Alexander and Babylon, 0 - the absence of something
* Adding and multiplying by 0 was introduced in the 9th century by Mahavira
* In fact 0 as a number, not just a placeholder, was introduced in the writings of Brahmagupta (who also introduced negative numbers)
* Bhaskara II says that a fraction with denominator 0 indicates an infinite quantity (he is also known for procedures for solving polynomial equations)

## Cardano / Hindus / Asians

* **Gerolamo Cardano** (24 September 1501– 21 September 1576), The Book on Games of Chance
    * Mother tried to abort him
    * Caught the bubonic plague
    * His writings (medical practice was a sham: bloodletting, burning, drilling holes in the head) and his children made him anathema: his daughter became pregnant by her brother Giovanni, had an abortion, and became infertile; Giovanni was executed for the suspected murder of his wife; and a second son, who liked killing small animals at an early age, eventually became a torturer for the Inquisition and later gave the evidence of heresy that imprisoned his father
    * Used gambling to pay for his medical school
    * Had limited knowledge of math as we know it
    * Hindus were introducing notational math, base 10, and fractions in 700 AD but because of the 'Dark Ages?' it wasn't introduced to Europe till much later (Mahavira, Brahmagupta, Bhaskara)
    * The Islamic world was developing quadratic equations, as well as geometry and algebra (al-Khwarizmi, al-Farisi, Thabit ibn Qurrah)
    * Our own number system, composed of the ten symbols {0,1,2,3,4,5,6,7,8,9} is called the Hindu-Arabic system. This is a base-ten (decimal) system since place values increase by powers of ten
    * Asians were already developing linear algebra
    * Of note are the Nine Chapters on the Mathematical Art containing arithmetic, algebraic, and geometric algorithms (200 BCE)
    * The plus and minus signs were introduced by the Germans
    * The = sign (two parallel lines) was introduced in 1557
    * The Scientific Revolution was taking root
    * An example of Cardano's writing: Suppose a random process has many equally likely outcomes, some favorable (that is, winning), some unfavorable (losing). Then the probability of obtaining a favorable outcome is equal to the proportion of outcomes that are favorable. The set of all possible outcomes is called the sample space
    * After incarceration, Cardano moved to Rome, where he received a lifetime annuity from Pope Gregory XIII (after first having been rejected by Pope Pius V, who died in 1572) and finished his autobiography. He was accepted in the Royal College of Physicians, and as well as practising medicine he continued his philosophical studies until his death in 1576.
    
* **Galileo Galilei** (15 February 1564 – 8 January 1642), rather than listening to the services, stared at something he found far more intriguing: the swinging of a large hanging lamp. He noticed that the lamp seemed to take the same amount of time to swing through a wide arc as it did to swing through a narrow one. That observation suggested to him a law: the time required by a pendulum to perform a swing is independent of the amplitude of the swing.
    * Introduces the idea that science must focus on experience and experimentation—how nature operates—rather than on what intuition dictates or our minds find appealing. And most of all, it must be done with mathematics.
    * Wrote a book Thoughts on the Game of Dice
    * Why does 10 show up more often than 9 when rolling 3 dice?
    * Logic says the dice should sum to 10 and 9 with equal frequency: both 10 and 9 can be constructed in 6 ways from the throw of three dice. For 9 we can write those ways as (621), (531), (522), (441), (432), and (333). For 10 they are (631), (622), (541), (532), (442), and (433).
    * But consider the outcome (631) consists of the possibilities (1,3,6), (1,6,3), (3,1,6), (3,6,1), (6,1,3), and (6,3,1), whereas the outcome (333) consists only of (3,3,3). Once we’ve made this decomposition, we can see that the outcomes are equally probable and we can apply the law. Since there are 27 ways of rolling a 10 with three dice but only 25 ways to get a total of 9, Galileo concluded that with three dice, rolling a 10 was 27/25, or about 1.08, times more likely
    * Dies under 'house arrest' for heresy
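
Galileo's count for three dice can be checked by brute force. The sketch below assumes fair six-sided dice and simply enumerates every ordered roll; the variable names are illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Every ordered roll of three dice is equally likely: 6**3 = 216 rolls.
rolls = list(product(range(1, 7), repeat=3))

# Count how many ordered rolls produce each total.
ways = Counter(sum(roll) for roll in rolls)

ways_9 = ways[9]    # 25 ordered rolls sum to 9
ways_10 = ways[10]  # 27 ordered rolls sum to 10
ratio = Fraction(ways_10, ways_9)  # how much more likely a 10 is than a 9
```

Counting ordered rolls rather than unordered combinations is what makes the outcomes equally probable, which is the decomposition described above.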
    
* **Blaise Pascal** (19 June 1623 – 19 August 1662) added to probability and gambling
* Pascal's development of probability theory was his most influential contribution to mathematics. Originally applied to gambling, today it is extremely important in economics, especially in actuarial science
* In 1654, prompted by his friend the Chevalier de Méré, he corresponded with Pierre de Fermat on the subject of gambling problems, and from that collaboration was born the mathematical theory of probabilities. The specific problem was that of two players who want to finish a game early and, given the current circumstances of the game, want to divide the stakes fairly, based on the chance each has of winning the game from that point
* Pascal and Cardano (Everything is Predictable):
  * Cardano
    * The probability of rolling a six is 1/6
    * Is the probability of a six in four rolls 4/6? Then a six in six rolls would be 6/6, a certainty, which cannot be right
  * Pascal
    * The probability of not rolling a six is 5/6
    * The probability of not rolling a six in 4 rolls is (5/6)^4, not 4 * 5/6
    * 5/6 is about .83, (.83)^4 is about .48, and 1 - .48 = .52, the probability of at least one six in four rolls
* From this discussion, the notion of expected value was introduced
* Pascal's Triangle: Pascal’s triangle is useful any time you need to know the number of ways in which you can choose some number of objects from a collection that has an equal or greater number
* For example (The Drunkard's Walk): 1996 World Series. Atlanta was up 2 - 0
    * Probabilities for either team to win: with 5 possible games left to play, 2^5 gives us 32 possible outcomes
    * The Yankees would have been victorious if they had won at least 4 of the 5 possible remaining games
    * That can happen in 5 ways with exactly 4 wins (BYYYY, YBYYY, YYBYY, YYYBY, or YYYYB) plus 1 way with 5 wins (YYYYY)
    * Braves victory: Yankees win only 3 games, 10 possible ways (BBYYY, BYBYY, and so on), or Yankees win only 2 games (which again could have happened in 10 ways), or if the Yankees had won only 1 game (which could have happened in 5 ways), or if they had won none (which could have happened in only 1 way)
    * Possibilities = 32, Yankees 6 in 32 and Braves 26 in 32 (the Yankees won)
    * Considering Pascal's Triangle, we can now read the number of ways in which the Yankees can win 0, 1, 2, 3, 4, or 5 games directly from row 5 of the triangle:
    * 1 5 10 10 5 1

https://www.mathsisfun.com/pascals-triangle.html
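
The World Series count can be reproduced by listing all 32 sequences. This is a sketch under the same simplifying assumptions as the notes: the teams are evenly matched, and all five remaining games are counted even if the series would already be decided.

```python
from itertools import product
from math import comb

# All 2**5 = 32 equally likely sequences (Y = Yankees win, B = Braves win).
sequences = ["".join(s) for s in product("YB", repeat=5)]

# Down 0-2, the Yankees need at least 4 wins in these 5 games.
yankees_take_series = [s for s in sequences if s.count("Y") >= 4]
braves_take_series = [s for s in sequences if s.count("Y") <= 3]

# Row 5 of Pascal's triangle: ways for the Yankees to win 0, 1, ..., 5 games.
row_5 = [comb(5, k) for k in range(6)]
```

The last two entries of `row_5` (5 and 1) add up to the 6 sequences in `yankees_take_series`.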

* **Pascal's Death**
    * In 1662, a few days after Pascal died, a servant noticed a curious bulge in one of Pascal’s jackets. The servant pulled open the lining to find hidden within it folded sheets of parchment and paper. Pascal had apparently carried them with him every day for the last eight years of his life. Scribbled on the sheets, in his handwriting, was a series of isolated words and phrases dated November 23, 1654. The writings were an emotional account of the trance, in which he described how God had come to him and in the space of two hours delivered him from his corrupt ways. Following that revelation, Pascal had dropped most of his friends, calling them “horrible attachments.”
    * He sold his carriage, his horses, his furniture, his library—everything except his Bible. He gave his money to the poor, leaving himself with so little that he often had to beg or borrow to obtain food. He wore an iron belt with points on the inside so that he was in constant discomfort and pushed the belt’s spikes into his flesh whenever he found himself in danger of feeling happy. He denounced his studies of mathematics and science. Of his childhood fascination with geometry, he wrote, “I can scarcely remember that there is such a thing as geometry. I recognize geometry to be so useless…it is quite possible I shall never think of it again.”
    * Yet Pascal remained productive. In the years that followed the trance, he recorded his thoughts about God, religion, and life. Those thoughts were later published in a book titled Pensées, a work that is still in print today. And although Pascal had denounced mathematics, amid his vision of the futility of the worldly life is a mathematical exposition in which he trained his weapon of mathematical probability squarely on a question of theology and created a contribution just as important as his earlier work on the problem of points.

### Factorials

$n!$<br />
5! = 1 * 2 * 3 * 4 * 5

### Permutations

Order does matter (permutation, position)<br />
The combination of a safe

$_{n}P_r = \frac{n!}{(n-r)!}$

with repetitions

$n^r$
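
As a quick check of both permutation formulas, the sketch below counts arrangements of 3 items drawn from 5. The letters are placeholders chosen for the example, and `math.perm` needs Python 3.8 or later.

```python
from itertools import permutations, product
from math import factorial, perm

items = "ABCDE"  # placeholder items for illustration
n, r = len(items), 3

# Without repetition: n! / (n - r)!
without_repetition = perm(n, r)
by_formula = factorial(n) // factorial(n - r)

# With repetition: n ** r
with_repetition = n ** r

# Listing the arrangements explicitly gives the same counts.
listed = list(permutations(items, r))
listed_with_repetition = list(product(items, repeat=r))
```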

### Combinations

Order doesn't matter<br />
Items on a pizza

$C(n,r) = C_r^{n} = {_nC_r} = \binom {n}{r}  = \frac{n!}{r!(n-r)!}$

#### With Replacement

$_{n+r-1}C_r = \frac{(r+n-1)!}{r!(n-1)!}$
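
The two combination formulas can be checked the same way. The topping names below are made up for the pizza example; only the counts matter.

```python
from itertools import combinations, combinations_with_replacement
from math import comb

# Hypothetical toppings, for illustration only.
toppings = ["mushroom", "olive", "onion", "pepper", "tomato"]
n, r = len(toppings), 3

# Choose r distinct toppings: n! / (r! (n - r)!)
without_replacement = comb(n, r)

# Allow the same topping more than once: (n + r - 1)! / (r! (n - 1)!)
with_replacement = comb(n + r - 1, r)

# Listing the selections explicitly gives the same counts.
listed = list(combinations(toppings, r))
listed_with_replacement = list(combinations_with_replacement(toppings, r))
```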

### Intersections

$A \cap B = \{\, x : x \in A \text{ and } x \in B \,\}$

order doesn't matter $A \cap B$ or $B \cap A$

#### Independent

$P(A \cap B) = P(A) * P(B)$

#### Dependent

$P(A \cap B) = P(A) * P(B|A) = P(B) * P(A|B)$<br />
also expressed as P(A and B) = P(A) * P(B given A)

### Unions

$P(A \cup B) = P(A) + P(B) - P(A \cap B)$<br />
also expressed P(A or B)<br />
order doesn't matter

#### Unions if mutually exclusive (vs. disjoint)

$P(A \cup B) = P(A) + P(B) - P(A \cap B)$ where $P(A \cap B) = 0$
<br />so<br />
$P(A \cup B) = P(A) + P(B)$

Mutually Exclusive: $P(A \cap B) = 0$<br />
Disjoint (dealing in sets): $A \cap B = \emptyset$
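
The union rule can be illustrated with one roll of a fair die, treating events as Python sets. The events `A`, `B`, and `C` and the `prob` helper are assumptions chosen for this sketch.

```python
from fractions import Fraction

sample_space = {1, 2, 3, 4, 5, 6}


def prob(event):
    """Probability of an event (a set of faces) on one fair die roll."""
    return Fraction(len(event & sample_space), len(sample_space))


A = {2, 4, 6}  # roll is even
B = {4, 5, 6}  # roll is greater than 3
C = {1}        # roll is a 1 (disjoint from A)

# General rule: subtract the overlap so it is not counted twice.
union_direct = prob(A | B)
union_formula = prob(A) + prob(B) - prob(A & B)

# Mutually exclusive events: the overlap is empty, so just add.
exclusive_direct = prob(A | C)
exclusive_formula = prob(A) + prob(C)
```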

### Complement

Complement of $A$ is $\bar{A}$

Is the complement of A mutually exclusive with A?

## Conditional Probability

https://towardsdatascience.com/conditional-probability-with-a-python-example-fd6f5937cd2<br />
https://towardsdatascience.com/conditional-probability-with-python-concepts-tables-code-c23ffe65d110<br />

$P(A|B) = \frac{P(A \cap B)}{P(B)}$<br />

What's the probability of something given something else

Terms
* $P(A|B)$: Probability of A given B
* $P(A \cap B)$: Probability of A and B
* $P(B)$: Probability of B
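
A small worked case may help here. The sketch assumes two fair dice and picks two events for illustration: A is "the total is 8" and B is "the first die is even".

```python
from fractions import Fraction
from itertools import product

# All 36 equally likely ordered outcomes of two dice.
outcomes = set(product(range(1, 7), repeat=2))


def prob(event):
    """Probability of an event given as a set of (first, second) outcomes."""
    return Fraction(len(event), len(outcomes))


A = {o for o in outcomes if sum(o) == 8}    # total is 8
B = {o for o in outcomes if o[0] % 2 == 0}  # first die is even

# P(A|B) = P(A and B) / P(B)
p_a_given_b = prob(A & B) / prob(B)
```

Here knowing B raises the probability of A from 5/36 to 1/6, since only (2,6), (4,4), and (6,2) remain among the 18 outcomes in B.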

#### Addition and Multiplication Rules

* Addition Rule: $P(A \cup B) = P(A) + P(B) - P(A \cap B)$
* Multiplication Rule: $P(A \cap B) = P(A) * P(B|A)$<br />

## Adding or Multiplying Probabilities

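* If using the word `or`, add (subtracting the overlap if both events can happen together)
* If using the word `and`, multiply (using the conditional probability if the events are dependent)
    * What's the probability of rolling a 1 or a 6?
    * What's the probability of first rolling a 1 and then a 6?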
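
Both dice questions can be answered by counting outcomes, assuming a fair six-sided die. The variable names are illustrative.

```python
from fractions import Fraction
from itertools import product

faces = range(1, 7)

# "Or" on a single roll: a 1 or a 6 (mutually exclusive, so 1/6 + 1/6).
p_one_or_six = Fraction(sum(1 for f in faces if f in (1, 6)), 6)

# "And" across two rolls: a 1 first, then a 6 (independent, so 1/6 * 1/6).
two_rolls = list(product(faces, repeat=2))
p_one_then_six = Fraction(sum(1 for roll in two_rolls if roll == (1, 6)), len(two_rolls))
```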
    
### Probability of Two Events Occurring Together: Independent

$P(A \cap B) = P(A) * P(B)$

Multiply the probabilities of each event together

Example problem: The probability of getting a job you applied for is 45% and the probability of getting the apartment you applied for is 75%. Assuming the two events are independent, what is the probability of getting both the new job and the new apartment?

* Step 1: Convert your percentages of the two events to decimals. In the above example:
    * 45% = .45
    * 75% = .75
* Step 2: Multiply the decimals from step 1 together:
    * .45 x .75 = .3375 or 33.75 percent.
* The probability of you getting the job and the apartment is 33.75%
* The probability that two events will both occur can never be greater than the probability that each will occur individually.
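
The multiplication above can be cross-checked with a small simulation. This is a sketch that assumes the two events really are independent; the `simulate` helper, the trial count, and the seed are arbitrary choices for the example.

```python
import random
from fractions import Fraction

p_first = Fraction(45, 100)   # probability of the first event (the job)
p_second = Fraction(75, 100)  # probability of the second event
p_both = p_first * p_second   # exact product for independent events


def simulate(trials=100_000, seed=0):
    """Estimate P(both) by drawing the two events independently."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        first = rng.random() < float(p_first)
        second = rng.random() < float(p_second)
        if first and second:
            hits += 1
    return hits / trials


estimate = simulate()
```

The estimate should land close to the exact value, and the exact value is smaller than either individual probability.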


### Probability of Two Events Occurring Together: Dependent

$P(A \cap B) = P(A) * P(B|A)$

Example problem: Eighty-five percent of employees have health insurance. Out of those 85%, 45% had deductibles higher than $1K. What percentage of employees had health insurance with a deductible higher than $1K?

* Step 1: Convert your percentages of the two events to decimals. In the above example:
    * 85% = .85
    * 45% = .45
* Step 2: Multiply the decimals from step 1 together:
    * .85 x .45 = .3825 or 38.25 percent.
* The probability of someone having a deductible of over $1,000 is 38.25%
* The probability that two events will both occur can never be greater than the probability that each will occur individually.
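
One way to see the dependent case is to count heads in a concrete workforce. The sketch below assumes a hypothetical company of 10,000 employees; only the two percentages come from the example.

```python
from fractions import Fraction

p_insured = Fraction(85, 100)             # P(A): employee has insurance
p_high_given_insured = Fraction(45, 100)  # P(B|A): high deductible, given insured

# Multiplication rule for dependent events: P(A and B) = P(A) * P(B|A)
p_insured_and_high = p_insured * p_high_given_insured

# Same calculation as head counts in a hypothetical 10,000-person company.
employees = 10_000
insured = employees * p_insured
insured_high = insured * p_high_given_insured
```

The 45% applies only to the 8,500 insured employees, which is why it is multiplied by 0.85 rather than used on its own.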

https://www.statisticshowto.com/probability-and-statistics/probability-main-index/how-to-find-the-probability-of-two-events-occurring-together/

In a pack of 52 cards, a card is drawn at random without replacement. Find the probability of drawing a queen followed by a jack.

Solution:

* P (drawing a queen in the first place) = 4 / 52
* P (drawing jack in the second place given that queen is in the first place) = 4 / 51
* P (drawing a queen followed by a jack) = (4/52) * (4/51) = 4/663, or about 0.6%

## Probability Foundations

* The probability that two events will both occur can never be greater than the probability that each will occur individually. Why not? Simple arithmetic: the chances that event A will occur = the chances that events A and B will occur + the chance that event A will occur and event B will not occur.
* If two possible events, A and B, are independent, then the probability that both A and B will occur is equal to the product of their individual probabilities
* If an event can have a number of different and distinct possible outcomes, A, B, C, and so on, then the probability that either A or B will occur is equal to the sum of the individual probabilities of A and B, and the sum of the probabilities of all the possible outcomes (A, B, C, and so on) is 1 (that is, 100%)
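
The queen-then-jack example also illustrates the identity P(A) = P(A and B) + P(A and not B). The sketch below enumerates every ordered two-card draw from a standard 52-card deck; the deck encoding and the `prob` helper are assumptions made for the example.

```python
from fractions import Fraction
from itertools import permutations

ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
deck = [(rank, suit) for rank in ranks for suit in "SHDC"]

# Every ordered pair of distinct cards: 52 * 51 = 2652 equally likely draws.
draws = list(permutations(deck, 2))


def prob(predicate):
    """Fraction of two-card draws for which the predicate holds."""
    return Fraction(sum(1 for d in draws if predicate(d)), len(draws))


p_queen_first = prob(lambda d: d[0][0] == "Q")
p_queen_then_jack = prob(lambda d: d[0][0] == "Q" and d[1][0] == "J")
p_queen_then_not_jack = prob(lambda d: d[0][0] == "Q" and d[1][0] != "J")
```

Since the two joint probabilities add up to `p_queen_first`, neither of them can exceed it.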

### Foundation 1: The Conjunction Fallacy

The Conjunction Fallacy is a fallacy or error in decision making where people judge that a conjunction of two possible events is more likely than one or both of the conjuncts.

* Which is greater: the number of six-letter English words having n as their fifth letter or the number of six-letter English words ending in ing?
* Which is more likely: that a defendant, after discovering the body, left the scene of the crime or that a defendant, after discovering the body, left the scene of the crime because he feared being accused of the grisly murder?
* Is it more probable that the president will increase federal aid to education or that he or she will increase federal aid to education with funding freed by cutting other aid to the states?
* Is it more likely that your company will increase sales next year or that it will increase sales next year because the overall economy has had a banner year?

In each case, even though the latter is less probable than the former, it may sound more likely. Or as Kahneman and Tversky put it, “A good story is often less probable than a less satisfactory explanation.”

The probability that two events will both occur can never be greater than the probability that each will occur individually: The chances that event A will occur = the chances that events A and B will occur + the chance that event A will occur and event B will not occur
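
The six-letter-word question can be made concrete with a toy word list. The words below are a small hand-picked sample for illustration, not a real dictionary; the point is only that every word ending in "ing" also has n as its fifth letter.

```python
# Hypothetical sample of six-letter words, for illustration only.
words = [
    "taking", "making", "acting",  # end in "ing"
    "absent", "decent", "moment",  # n is fifth, but no "ing" ending
    "planet", "garden", "purple",  # n is not the fifth letter
]

n_fifth = {w for w in words if len(w) == 6 and w[4] == "n"}
ends_in_ing = {w for w in words if len(w) == 6 and w.endswith("ing")}

# The more specific description picks out a subset of the broader one.
is_subset = ends_in_ing <= n_fifth
```

A subset can never be larger than the set that contains it, which is the conjunction rule stated in terms of counts.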

### Foundation 2: The Truth About the Romans

* Cicero introduced the term probabilis, his contribution to thinking about randomness and warfare (the Greeks didn't believe in randomness because everything was the will of the gods)
* Roman justice grew from Germanic doctrine - if there was a dispute, let one man be chosen from each group to fight it out with shields and spears. Whoever loses is a perjurer and must lose his right hand
* Romans created the concept of half proof (there was no compelling evidence either way)
* Two half proofs constituted complete proof

#### The Problem
* Suppose you have a list of 100 restaurants and you want to find a restaurant with good food and good service
* 10 restaurants have good food, and 10 have good service, so 1 out of 10 restaurants has good food and 1 out of 10 has good service
* How many out of 100 have **both**?
* Those with good food = 10 / 100
* Those with good service = 10 / 100
* If food and service quality are independent, 1/10 * 1/10 = 1/100, or about 1 restaurant in 100, and with more conditions, you have to keep multiplying
* The Romans believed two half proofs equal 1 whole proof. Their problem was that they were adding when they should have been multiplying.
* The chance of two independent half proofs both being wrong is 1 in 4, so 2 half proofs constitute three-fourths of a proof
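
Both calculations in this problem are short enough to write out exactly. The sketch assumes independence in each case (food versus service, and one half proof versus the other); the variable names are illustrative.

```python
from fractions import Fraction

# Restaurants: multiply the two independent 1-in-10 chances.
p_good_food = Fraction(10, 100)
p_good_service = Fraction(10, 100)
p_both = p_good_food * p_good_service
expected_restaurants = 100 * p_both  # how many of the 100 to expect

# Half proofs: each is wrong with probability 1/2.
p_half_proof_wrong = Fraction(1, 2)
p_both_wrong = p_half_proof_wrong ** 2  # both wrong: multiply
proof_strength = 1 - p_both_wrong       # at least one is right
roman_sum = Fraction(1, 2) + Fraction(1, 2)  # the Roman approach: add
```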

### Foundation 3: Either / Or / Mutually Exclusive

* Suppose an airline has 1 seat left on a flight and two passengers have yet to show up
* From experience, the airline knows that there is a 2/3 chance a passenger will show
* When the passengers are independent (they don't know each other), the chance of both showing up is 2/3 * 2/3 = 4/9, about 44%
* The chance of neither showing up is 1/3 * 1/3 = 1/9, about 11%
* If instead they always either both show or both don’t, and the probability that they (as a pair) show is $2/3$, then $P(\text{both show}) = 2/3$
* "Both show" and "neither shows" are mutually exclusive, so the probability of both showing up or neither showing up is (2/3 * 2/3) + (1/3 * 1/3) = 5/9, roughly 55.6%
* When you want to know the chances that either of two mutually exclusive events, A or B, will occur, you add
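
The airline numbers follow directly from the multiplication and addition rules. The sketch below covers only the independent-passengers case; the variable names are illustrative.

```python
from fractions import Fraction

p_show = Fraction(2, 3)  # chance that any one passenger shows up

# Independent passengers: multiply.
p_both_show = p_show * p_show
p_neither_shows = (1 - p_show) * (1 - p_show)

# "Both" and "neither" cannot happen together: add.
p_both_or_neither = p_both_show + p_neither_shows

# Whatever is left is the case where exactly one passenger shows.
p_exactly_one = 1 - p_both_or_neither
```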

### Convicted Because You Have a Beard and a Mustache

* Man with beard = 1 / 10
* Man with a mustache = 1 / 10
* So 1 / 100?

Men with beards usually have a mustache so maybe it's 1 / 4

The Drunkard's Walk

## Monty Hall Problem and Sample Space

Ask Marilyn column (in Parade magazine)

* Marilyn vos Savant
* Guinness World Records Hall of Fame highest IQ
* Married to Robert Jarvik, artificial heart

Let's Make a Deal

* Monty Hall
* Wayne Brady

Suppose the contestants on a game show are given the choice of three doors: Behind one door is
a car; behind the others, goats. After a contestant picks a door, the host, who knows what’s
behind all the doors, opens one of the unchosen doors, which reveals a goat. He then says to the
contestant, “Do you want to switch to the other unopened door?” Is it to the contestant’s
advantage to make the switch?

* Marilyn says switch
* Onslaught of criticism started rolling in
* Mathematics professors and teachers

Professor from George Mason: Let me explain: If one door is shown to be a loser, that information changes the probability of either remaining choice—neither of which has any reason to be more likely—to 1/2. As a professional mathematician, I’m very concerned with the general public’s lack of mathematical skills. Please help by confessing your error and, in the future, being more careful.

From Dickinson State University came this: “I am in shock that after being corrected by at least three mathematicians, you still do not see your mistake.” From Georgetown: “How many irate mathematicians are needed to change your mind?” And someone from the U.S. Army Research Institute remarked, “If all those PhDs are wrong the country would be in serious trouble.” Responses continued in such great numbers and for such a long time that after devoting quite a bit of column space to the issue, Marilyn decided she would no longer address it.

Marilyn was right and here's the breakdown:

* Starting with a 1 out of 3 choice, you have the Lucky Guess scenario of picking the right door (1 out of 3)
* The Wrong Guess scenario has chances 2 out of 3 that you are wrong
* The host intervenes and opens a door knowing where the car is and not wanting to reveal it yet. Because his choice is not random, the two remaining doors are not equally likely to hide the car

<table>
<tr>
<td>Behind door 1</td>	<td>Behind door 2</td>	<td>Behind door 3</td>	<td>Result if staying at door 1</td>	<td>Result if switching to the door offered</td>
</tr>
<tr>
<td>Goat</td><td>Goat</td><td>Car</td>	<td>Wins goat</td>	<td>Wins car</td>
</tr>
<tr>
<td>Goat</td><td>Car</td>	<td>Goat</td>	<td>Wins goat</td>	<td>Wins car</td>
</tr>
<tr>
<td>Car</td>	<td>Goat</td><td>Goat</td>	<td>Wins car</td>	<td>Wins goat</td>
</tr>
</table>
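
The table can be reproduced in code, both exactly and by simulation. This is a sketch under the standard assumptions of the puzzle: the car is equally likely to be behind any door, and the host always opens an unchosen door that hides a goat. The function names, trial count, and seed are choices made for the example.

```python
import random
from fractions import Fraction

DOORS = (1, 2, 3)


def host_opens(car, pick, rng=None):
    """Door the host opens: never the contestant's pick and never the car."""
    options = [d for d in DOORS if d != pick and d != car]
    return rng.choice(options) if rng is not None else options[0]


def play(car, pick, switch, rng=None):
    """Return True if the contestant ends up with the car."""
    opened = host_opens(car, pick, rng)
    if switch:
        # Move to the one door that is neither picked nor opened.
        pick = next(d for d in DOORS if d not in (pick, opened))
    return pick == car


# Exact answer for a contestant who picks door 1, as in the table.
p_stay = Fraction(sum(play(car, 1, switch=False) for car in DOORS), len(DOORS))
p_switch = Fraction(sum(play(car, 1, switch=True) for car in DOORS), len(DOORS))


def simulate(switch, trials=100_000, seed=0):
    """Estimate the win rate with random car placement and random picks."""
    rng = random.Random(seed)
    wins = sum(
        play(rng.choice(DOORS), rng.choice(DOORS), switch, rng)
        for _ in range(trials)
    )
    return wins / trials
```

Calling `simulate(True)` and `simulate(False)` should give win rates near 2/3 and 1/3, matching the exact values.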

https://en.m.wikipedia.org/wiki/Monty_Hall_problem

## Sample Space

Boy Girl Activity

* Start with 2s
* Increase by 1 till no more people left

Sample space for tossing a coin 2 times (n = 2)
* There are two outcomes (H or T) per toss
* Size of the sample space = 2^n = 2^2 = 4

Sample space for tossing a coin 3 times
* 2^n = 2^3 = 8

Sample space for 2 dice - https://www.thoughtco.com/probabilities-of-rolling-two-dice-3126559

Sample space for rolling a die 2 times (n = 2)
* There are 6 outcomes per roll
* Size of the sample space = 6^n = 6^2 = 36

Sample space for rolling a die 3 times
* 6^3 = 216
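
These sample spaces are small enough to list outright. The sketch below uses a hypothetical `sample_space` helper built on `itertools.product`.

```python
from itertools import product


def sample_space(outcomes, n):
    """All ordered results of repeating an experiment n times."""
    return list(product(outcomes, repeat=n))


two_tosses = sample_space("HT", 2)
three_tosses = sample_space("HT", 3)
two_rolls = sample_space(range(1, 7), 2)
three_rolls = sample_space(range(1, 7), 3)
```

Each list has (outcomes per trial) ** n entries, which is where 2^n and 6^n come from.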

What's the probability of rolling a 1 or a 6?
* 1/6 + 1/6 = 2/6 = 1/3

What is the probability of getting at least one 6 if a die is rolled 3 times?

* The sample space for rolling a die 3 times has 6^3 = 216 outcomes
* Now, P(getting at least one 6) = 1 − P(getting NO 6)
* Probability of getting no 6 in a single die roll = 5/6
* Thus the probability of getting NO 6 in three die rolls = 5/6 × 5/6 × 5/6 = 125/216
* Therefore, the probability of getting at least one 6 is P = 1 − 125/216 = 91/216
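
The complement argument can be checked against a direct count. The sketch assumes a fair die; `p_at_least_one_six` is a helper name introduced for the example.

```python
from fractions import Fraction
from itertools import product


def p_at_least_one_six(n):
    """1 minus the probability of no six in n rolls of a fair die."""
    return 1 - Fraction(5, 6) ** n


# Direct count over all 216 ordered outcomes of three rolls.
rolls = list(product(range(1, 7), repeat=3))
with_a_six = sum(1 for roll in rolls if 6 in roll)
p_by_counting = Fraction(with_a_six, len(rolls))
```

The same function gives 671/1296 (about 0.52) for four rolls, which matches Pascal's figure earlier in these notes.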