# [Fiddler](https://thefiddler.substack.com/p/how-likely-is-a-lopsided-league?publication_id=1745947&post_id=137055625) 15th Sept 2023

About halfway through the current Major League Baseball season, all five teams in the American League East division had better records (i.e., winning percentages, or percent of games won) than all five teams in the American League Central division.

Inspired by this surprising fact, suppose Fiddler League Baseball has six divisions, with five teams in each division. For simplicity, further suppose each team has a winning percentage chosen randomly, uniformly, and independently between zero percent and 100 percent.

Let’s look at two divisions: The Enigma League East division and the Enigma League Central division. What is the probability that every team in the Enigma League East division has a higher winning percentage than every team in the Enigma League Central division?
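The setup above can be checked empirically before any closed-form reasoning. The following is a minimal Monte Carlo sketch using only the standard library; the function name, the trial count, and the seed are illustrative choices, not part of the puzzle.

```python
import random


def estimate_lopsided_probability(trials=200_000, teams_per_division=5, seed=0):
    """Estimate P(every East team beats every Central team) by simulation."""
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    hits = 0
    for _ in range(trials):
        # Each winning percentage is drawn uniformly and independently on [0, 1).
        east = [rng.random() for _ in range(teams_per_division)]
        central = [rng.random() for _ in range(teams_per_division)]
        # All East teams beat all Central teams exactly when the weakest
        # East team beats the strongest Central team.
        if min(east) > max(central):
            hits += 1
    return hits / trials


estimate = estimate_lopsided_probability()
print(f"estimated probability: {estimate:.5f}")
```

Raising `trials` tightens the estimate at the cost of run time, and `teams_per_division` can be lowered to test smaller cases by hand.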

Two observations:

- "every team in one division is stronger than every team in another division" is equivalent to "the weakest team of the first division is stronger than the strongest team of the second", i.e. $Central_{\textrm{max}} < Eastern_{\textrm{min}}$
- the independence assumption gives a product of probabilities only once $Eastern_{\textrm{min}}$ is fixed: given $Eastern_{\textrm{min}} = m$, the five events $Central_i < m$ are independent, each with probability $m$. Without conditioning, the 25 pairwise comparisons share teams and are not independent

## Answer

With winning percentages uniform on $[0, 1]$, $Eastern_{\textrm{min}}$ has density $5(1-m)^4$, so

$$P\left(Central_{\textrm{max}} < Eastern_{\textrm{min}}\right) = \int_0^1 \prod_{i=1}^{5}P\left(Central_i < m\right) \, 5(1-m)^4 \, dm$$

$$=\int_0^1 m^5 \, 5(1-m)^4 \, dm$$

$$=5 \cdot \frac{5! \, 4!}{10!}$$

$$=\frac{5! \, 5!}{10!}$$

$$=\frac{1}{\binom{10}{5}} = \frac{1}{252}$$

## Interpretation

The same result follows from a counting argument: only the ranking of the ten teams matters, all rankings are equally likely, and the Eastern teams must occupy the top five ranks, which is 1 of the $\binom{10}{5} = 252$ equally likely ways to choose their ranks.

In terms of the 25 possible pairings shown below, every yes/no question (is the Central team weaker than the Eastern team?) must be answered "yes". Each answer taken alone is "yes" with probability 1/2, but the answers are positively correlated because each team appears in five pairings, so the joint probability is far larger than $1/2^{25}$.

In [2]:
import pandas as pd

In [13]:
# One row per Eastern team, one column per Central team: 25 pairings in total
df = pd.DataFrame(
    [["yes/no"] * 5 for _ in range(5)],
    columns=[f"Central_{i}" for i in range(1, 6)],
    index=[f"Eastern_{i}" for i in range(1, 6)],
)
# Put the question being asked above the column names
df.columns = pd.MultiIndex.from_tuples([('Is this Central team weaker than this Eastern team?', col) for col in df.columns])
df

Is this Central team weaker than this Eastern team?

| | Central_1 | Central_2 | Central_3 | Central_4 | Central_5 |
|---|---|---|---|---|---|
| Eastern_1 | yes/no | yes/no | yes/no | yes/no | yes/no |
| Eastern_2 | yes/no | yes/no | yes/no | yes/no | yes/no |
| Eastern_3 | yes/no | yes/no | yes/no | yes/no | yes/no |
| Eastern_4 | yes/no | yes/no | yes/no | yes/no | yes/no |
| Eastern_5 | yes/no | yes/no | yes/no | yes/no | yes/no |

This outcome is about 55,000 times more likely than winning the French lottery with a single random ticket...

In [21]:
proba_win_french_lottery = 1/13_983_816
proba_east_stronger_central_all = 1/252
round(proba_east_stronger_central_all / proba_win_french_lottery)

55491
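The constant 13,983,816 used in the cell above matches the number of ways to pick 6 numbers out of 49. The sketch below reproduces it under the assumption of a 6-of-49 draw where order does not matter; the notebook does not state the lottery rules, so that format is an assumption here.

```python
from math import comb

# Assumption: the lottery draws 6 numbers out of 49 and order does not matter.
tickets = comb(49, 6)
print(tickets)  # 13983816

# Chance that a single random ticket matches the draw
proba_win_french_lottery = 1 / tickets
print(proba_win_french_lottery)
```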