# Simulating Lotto Games

Here I'll be writing some code to simulate the NZ Lottery.

In [3]:
balls = list(range(1, 41))  # Lotto balls are numbered 1 to 40, not 0 to 39
print(balls)

[1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40]


In [4]:
import random

In [19]:
random.sample(balls, 6)

[35, 8, 10, 19, 28, 25]

Here is the main question that I would like to address: how do you estimate the number of ticket buyers from the number of winners at each division?

Intuitively, the number of winners will depend on the number of tickets sold and on the popularity of the winning numbers. For example, if the draw result features a popular sequence of numbers, then you should expect more winners at the lower divisions. Likewise, if more people buy tickets for a given draw, you would expect more winners in that draw.

At the higher divisions (especially the jackpot plus Powerball), you would expect the dominating factor to be the a priori odds of winning.
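To put numbers on those a priori odds, here is a minimal sketch that computes the chance of a uniformly random ticket matching exactly `k` of the six drawn balls. It assumes 6 main balls drawn from 40, as in the cells above, and it deliberately ignores the bonus ball and the Powerball, so these are not the official division odds. The helper name `match_probability` is my own.

```python
from math import comb

N_BALLS = 40  # size of the ball pool, as in the `balls` list above
N_DRAWN = 6   # balls drawn per game, as in random.sample(balls, 6)


def match_probability(k, n_balls=N_BALLS, n_drawn=N_DRAWN):
    """Probability that a uniformly random ticket matches exactly k drawn balls.

    This is the hypergeometric probability: choose k of the drawn balls
    and the remaining n_drawn - k numbers from the undrawn balls.
    Assumption: bonus ball and Powerball are ignored.
    """
    ways_to_match = comb(n_drawn, k) * comb(n_balls - n_drawn, n_drawn - k)
    return ways_to_match / comb(n_balls, n_drawn)


# Print the odds from "all six" down to "none".
for k in range(N_DRAWN, -1, -1):
    print(f"match {k}: {match_probability(k):.3e}")
```

Under these assumptions, matching all six balls has odds of 1 in `comb(40, 6)`, which is 1 in 3,838,380.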

Is there a way to make this relationship more concrete?
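One possible starting point, assuming for the moment that every ticket is chosen uniformly at random: the number of winners $W_k$ matching exactly $k$ balls is then Binomial with parameters $N$ (tickets sold) and $p_k$ (the a priori match probability), so $\hat{N} = W_k / p_k$ is a simple estimate of $N$. The sketch below implements that estimate with a rough standard error. The function names are my own, and the bonus ball and Powerball are again ignored, so real divisions that depend on them would need their own probabilities.

```python
from math import comb, sqrt


def match_probability(k, n_balls=40, n_drawn=6):
    """Probability that a uniformly random ticket matches exactly k drawn balls."""
    return comb(n_drawn, k) * comb(n_balls - n_drawn, n_drawn - k) / comb(n_balls, n_drawn)


def estimate_tickets(n_winners, k):
    """Estimate tickets sold from the number of tickets matching exactly k balls.

    Assumes W_k ~ Binomial(N, p_k), i.e. all tickets are uniform and independent.
    Returns (estimate, approximate standard error).
    """
    p = match_probability(k)
    estimate = n_winners / p
    # Var(W_k / p) = N * (1 - p) / p; plug in the estimate for the unknown N.
    std_error = sqrt(estimate * (1 - p) / p)
    return estimate, std_error


# Hypothetical example: 31,180 tickets matched exactly three balls.
n_hat, se = estimate_tickets(31_180, k=3)
print(f"estimated tickets sold: {n_hat:,.0f} (+/- {se:,.0f})")
```

The standard error grows quickly as $p_k$ shrinks, which suggests the lower divisions carry most of the information about $N$, but those are also the divisions where number popularity would bias the estimate.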

Claim: the number of tickets sold depends on factors such as the amount of prize money on offer and the time of year.

The number of winners depends on the number of tickets sold and on the popularity of the winning numbers. My guess is that the more popular the winning numbers are, the more the number of winners will be skewed towards the lower divisions; if the winning numbers are unpopular, the number of winners will more closely match the theoretical distribution.

We will assume that players come in two varieties: those who select their own numbers, and those who have their numbers selected at random. We can model this behaviour with the relationship:

$$ P = Y X_1 + (1-Y) X_2 $$

Where:
- $P$ is the vector of numbers selected by the player
- $Y$ is a binary random variable: $Y=1$ (with probability $p$) if the player selects their own numbers, and $Y=0$ (with probability $1-p$) if the player's numbers are selected randomly
- $X_1$ is the vector of random numbers selected from the biased player distribution
- $X_2$ is the vector of random numbers selected uniformly at random
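The mixture above can be sketched in code as follows. The biased distribution for $X_1$ is unknown, so the weights here are purely hypothetical: I assume self-pickers are twice as likely to choose numbers up to 31 (possible birthdays). The names `BIASED_WEIGHTS`, `weighted_sample` and `draw_ticket` are my own, and the balls are assumed to be numbered 1 to 40.

```python
import random

BALLS = list(range(1, 41))  # assumption: balls numbered 1..40

# Hypothetical bias for X_1: numbers up to 31 get double weight.
BIASED_WEIGHTS = [2.0 if b <= 31 else 1.0 for b in BALLS]


def weighted_sample(population, weights, k, rng):
    """Draw k distinct items, picking each one in proportion to the remaining weights."""
    items = list(population)
    remaining = list(weights)
    chosen = []
    for _ in range(k):
        i = rng.choices(range(len(items)), weights=remaining, k=1)[0]
        chosen.append(items.pop(i))
        remaining.pop(i)
    return chosen


def draw_ticket(p, rng=random):
    """Sample one player's numbers P = Y * X_1 + (1 - Y) * X_2."""
    y = rng.random() < p  # Y = 1 with probability p
    if y:
        return weighted_sample(BALLS, BIASED_WEIGHTS, 6, rng)  # X_1: biased pick
    return rng.sample(BALLS, 6)  # X_2: uniform pick


rng = random.Random(0)  # seeded so the run is repeatable
tickets = [draw_ticket(p=0.5, rng=rng) for _ in range(5)]
print(tickets)
```

Swapping in a different weight vector is enough to try other guesses about which numbers players favour.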

For now we will assume that the lucky dip numbers are chosen uniformly at random. In practice they are more likely generated by a pseudo-random function.

I will be interested to learn whether or not there is evidence that each player's lotto numbers really are being selected at random. I would also like to know how randomly the balls are selected from the machine. My guess is that these are fine, but it would be interesting to try and make an argument for why that must be the case.

So, what are the things that I know?

The expected earnings are a function of the probabilities of the winning results. Since the winnings are divided among the winning players, the expected earnings also depend on the number of winners, and therefore on how popular the winning numbers are among players.
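To illustrate how sharing affects the payout, here is a rough sketch under simplifying assumptions of my own: a division's prize pool is split equally among its winners, and each of the other $N - 1$ tickets independently wins that division with probability $q$. A winning ticket's expected share of the pool is then $E[1/(1+B)]$ with $B \sim \text{Binomial}(N-1, q)$, which simplifies to $(1 - (1-q)^N) / (Nq)$. The function name `expected_share` is hypothetical.

```python
def expected_share(n_tickets, q):
    """Expected fraction of a division's pool received by one winning ticket.

    Assumptions: the pool is split equally among winners, and each of the
    other n_tickets - 1 tickets wins independently with probability q.
    """
    if q == 0:
        return 1.0  # nobody else can win, so the winner keeps the whole pool
    return (1 - (1 - q) ** n_tickets) / (n_tickets * q)


# Hypothetical comparison: one million tickets, with the winning combination
# held at the uniform rate versus five times as often.
uniform_q = 1 / 3_838_380
for label, q in [("uniform", uniform_q), ("5x as popular", 5 * uniform_q)]:
    print(f"{label}: expected share = {expected_share(1_000_000, q):.3f}")
```

The more popular the winning combination (larger $q$), the smaller the expected share, even though the chance of winning the division is unchanged.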
