# Bayesian Random Intercept Model

In [1]:
import sys

# Make the local penaltyblog checkout (two directories up) importable
sys.path.append("../../")

import penaltyblog as pb
import pandas as pd
import arviz as az



## Get Data from football-data.co.uk

In [2]:
fb = pb.scrapers.FootballData("ENG Premier League", "2019-2020")
df = fb.get_fixtures()

df.head()

| id | date | datetime | season | competition | div | time | team_home | team_away | fthg | ftag | ... | b365_cahh | b365_caha | pcahh | pcaha | max_cahh | max_caha | avg_cahh | avg_caha | goals_home | goals_away |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1565308800---liverpool---norwich | 2019-08-09 | 2019-08-09 20:00:00 | 2019-2020 | ENG Premier League | E0 | 20:00 | Liverpool | Norwich | 4 | 1 | ... | 1.91 | 1.99 | 1.94 | 1.98 | 1.99 | 2.07 | 1.9 | 1.99 | 4 | 1 |
| 1565395200---bournemouth---sheffield_united | 2019-08-10 | 2019-08-10 15:00:00 | 2019-2020 | ENG Premier League | E0 | 15:00 | Bournemouth | Sheffield United | 1 | 1 | ... | 1.95 | 1.95 | 1.98 | 1.95 | 2.0 | 1.96 | 1.96 | 1.92 | 1 | 1 |
| 1565395200---burnley---southampton | 2019-08-10 | 2019-08-10 15:00:00 | 2019-2020 | ENG Premier League | E0 | 15:00 | Burnley | Southampton | 3 | 0 | ... | 1.87 | 2.03 | 1.89 | 2.03 | 1.9 | 2.07 | 1.86 | 2.02 | 3 | 0 |
| 1565395200---crystal_palace---everton | 2019-08-10 | 2019-08-10 15:00:00 | 2019-2020 | ENG Premier League | E0 | 15:00 | Crystal Palace | Everton | 0 | 0 | ... | 1.82 | 2.08 | 1.97 | 1.96 | 2.03 | 2.08 | 1.96 | 1.93 | 0 | 0 |
| 1565395200---tottenham---aston_villa | 2019-08-10 | 2019-08-10 17:30:00 | 2019-2020 | ENG Premier League | E0 | 17:30 | Tottenham | Aston Villa | 3 | 1 | ... | 2.1 | 1.7 | 2.18 | 1.77 | 2.21 | 1.87 | 2.08 | 1.8 | 3 | 1 |


## Train the Model

In [3]:
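# Arguments: home goals, away goals, home team names, away team names
clf = pb.models.BayesianRandomInterceptGoalModel(
    df["goals_home"], df["goals_away"], df["team_home"], df["team_away"]
)
# Draw posterior samples with NUTS; the sampler log is printed below
clf.fit()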

Only 312 samples in chain.
Auto-assigning NUTS sampler...
Initializing NUTS using jitter+adapt_diag...
Multiprocess sampling (8 chains in 8 jobs)
NUTS: [home, tau_int, intercept, tau_att, atts_star, tau_def, def_star]


Sampling 8 chains for 2_000 tune and 312 draw iterations (16_000 + 2_496 draws total) took 8 seconds.


## The Model's Parameters

In [4]:
clf

Module: Penaltyblog

Model: Bayesian Random Intercept

Number of parameters: 61
Team                 Intercept            Attack               Defence             
--------------------------------------------------------------------------------
Arsenal              0.128                0.056                -0.04               
Aston Villa          -0.013               -0.071               0.204               
Bournemouth          -0.018               -0.092               0.177               
Brighton             -0.038               -0.097               0.033               
Burnley              0.002                -0.052               -0.019              
Chelsea              0.234                0.157                0.056               
Crystal Palace       -0.145               -0.193               -0.03               
Everton              0.019                -0.05                0.063               
Leicester            0.218                0.133                -0.141              
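
To make the table easier to read, here is a minimal sketch of how per-team parameters like these could combine into goal expectations. It assumes a log-linear form in which the scoring team's intercept and attack are added to the opponent's defence, plus a home-advantage term for the home side. That form is an assumption for illustration and is not confirmed by the output above; `expected_goals` is a hypothetical helper, not part of penaltyblog, and the home-advantage value is made up because the notebook does not print it. The Arsenal and Chelsea values are copied from the table.

```python
import math

# Posterior summaries copied from the table above
params = {
    "Arsenal": {"intercept": 0.128, "attack": 0.056, "defence": -0.04},
    "Chelsea": {"intercept": 0.234, "attack": 0.157, "defence": 0.056},
}

# Hypothetical value: the notebook output does not show the home parameter
HOME_ADVANTAGE = 0.25


def expected_goals(home_team, away_team, home_advantage=HOME_ADVANTAGE):
    """Assumed log-linear rates: exp(own intercept + own attack + opponent defence)."""
    h, a = params[home_team], params[away_team]
    home_rate = math.exp(h["intercept"] + home_advantage + h["attack"] + a["defence"])
    away_rate = math.exp(a["intercept"] + a["attack"] + h["defence"])
    return home_rate, away_rate


home_rate, away_rate = expected_goals("Arsenal", "Chelsea")
print(home_rate, away_rate)
```

Under this reading, a positive defence value raises the opponent's scoring rate, which would fit Aston Villa and Bournemouth having the largest defence values in the table.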

## Predict Match Outcomes

In [5]:
probs = clf.predict("Liverpool", "Wolves")
probs

Module: Penaltyblog

Class: FootballProbabilityGrid

Home Goal Expectation: 1.9676411376125518
Away Goal Expectation: 0.8380652174466229

Home Win: 0.6382722472371388
Draw: 0.20963505993669984
Away Win: 0.15209268970253048

### 1x2 Probabilities

In [6]:
probs.home_draw_away

[0.6382722472371388, 0.20963505993669984, 0.15209268970253048]

In [7]:
probs.home_win

0.6382722472371388

In [8]:
probs.draw

0.20963505993669984

In [9]:
probs.away_win

0.15209268970253048
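
The three probabilities above can be approximated from the two goal expectations alone. The sketch below assumes home and away goals are independent Poisson variables with the printed expectations as rates, and that the score grid is truncated at 15 goals per side; both are assumptions about how the grid is built. `poisson_pmf`, `score_grid` and `home_draw_away` are illustrative names, not penaltyblog functions.

```python
import math


def poisson_pmf(k, rate):
    """Probability of exactly k goals under a Poisson distribution."""
    return math.exp(-rate) * rate**k / math.factorial(k)


def score_grid(home_rate, away_rate, max_goals=15):
    """grid[i][j] = P(home scores i, away scores j), assuming independence."""
    home = [poisson_pmf(i, home_rate) for i in range(max_goals + 1)]
    away = [poisson_pmf(j, away_rate) for j in range(max_goals + 1)]
    return [[h * a for a in away] for h in home]


def home_draw_away(grid):
    """Sum the cells below, on, and above the diagonal of the score grid."""
    n = len(grid)
    home_win = sum(grid[i][j] for i in range(n) for j in range(n) if i > j)
    draw = sum(grid[i][i] for i in range(n))
    away_win = sum(grid[i][j] for i in range(n) for j in range(n) if i < j)
    return [home_win, draw, away_win]


# Goal expectations printed by clf.predict("Liverpool", "Wolves")
grid = score_grid(1.9676411376125518, 0.8380652174466229)
print(home_draw_away(grid))
```

If the independence assumption holds, the result should agree closely with `probs.home_draw_away`.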

### Probability of Total Goals >1.5

In [10]:
probs.total_goals("over", 1.5)

0.7698915889572326
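
As a rough cross-check, the over 1.5 figure can be rebuilt by summing every scoreline whose combined goals exceed the line. This sketch assumes independent Poisson goal counts with the expectations from `clf.predict` as rates and a grid cut off at 15 goals per side; `prob_total_goals_over` is a hypothetical helper, not a penaltyblog function.

```python
import math


def poisson_pmf(k, rate):
    """Probability of exactly k goals under a Poisson distribution."""
    return math.exp(-rate) * rate**k / math.factorial(k)


def prob_total_goals_over(home_rate, away_rate, line, max_goals=15):
    """Sum P(home=i) * P(away=j) over all scorelines with i + j above the line."""
    total = 0.0
    for i in range(max_goals + 1):
        for j in range(max_goals + 1):
            if i + j > line:
                total += poisson_pmf(i, home_rate) * poisson_pmf(j, away_rate)
    return total


# Goal expectations printed by clf.predict("Liverpool", "Wolves")
over_1_5 = prob_total_goals_over(1.9676411376125518, 0.8380652174466229, 1.5)
print(over_1_5)
```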

### Probability of Asian Handicap 1.5

In [11]:
probs.asian_handicap("home", 1.5)

0.3902210181231132
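
One reading that appears consistent with this number is that it is the probability of the home side winning by more than 1.5 goals, that is, by two or more. The sketch below computes that quantity under the assumption of independent Poisson goal counts and a grid truncated at 15 goals; `prob_home_margin_over` is an illustrative name, and the interpretation of the `asian_handicap` arguments is inferred from the output rather than taken from the library's documentation.

```python
import math


def poisson_pmf(k, rate):
    """Probability of exactly k goals under a Poisson distribution."""
    return math.exp(-rate) * rate**k / math.factorial(k)


def prob_home_margin_over(home_rate, away_rate, line, max_goals=15):
    """Sum the scorelines where home goals minus away goals exceed the line."""
    total = 0.0
    for i in range(max_goals + 1):
        for j in range(max_goals + 1):
            if i - j > line:
                total += poisson_pmf(i, home_rate) * poisson_pmf(j, away_rate)
    return total


# Goal expectations printed by clf.predict("Liverpool", "Wolves")
margin_prob = prob_home_margin_over(1.9676411376125518, 0.8380652174466229, 1.5)
print(margin_prob)
```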

### Probability of Both Teams Scoring

In [12]:
probs.both_teams_to_score

0.48813124322056645
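
If home and away goals are treated as independent Poisson variables (an assumption, though one that matches the figure above closely), this probability has a simple closed form: the chance that the home side scores at least once multiplied by the chance that the away side does. `prob_both_teams_score` is a hypothetical helper for illustration, not a penaltyblog function.

```python
import math


def prob_both_teams_score(home_rate, away_rate):
    """P(home >= 1) * P(away >= 1) for independent Poisson goal counts."""
    p_home_scores = 1.0 - math.exp(-home_rate)
    p_away_scores = 1.0 - math.exp(-away_rate)
    return p_home_scores * p_away_scores


# Goal expectations printed by clf.predict("Liverpool", "Wolves")
btts = prob_both_teams_score(1.9676411376125518, 0.8380652174466229)
print(btts)
```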