# Coding exercise

In this exercise, you'll be expected to fit a simple first-order linear model (i.e. $y=m\cdot x + c$) that relates the monthly value of the Niño 3.4 index to MODIS-derived fire counts (i.e. the number of thermal anomalies detected by the MODIS sensor on board the Terra platform). Both datasets are provided for you.

## The ENSO (Niño 3.4) data

The ENSO data is provided as a [CSV file](data/nina34.csv). The file format is easily readable, but note the following:
1. The first line contains the start and end year of the dataset.
2. Each subsequent line contains the monthly values of the index (from January to December) for one year in the record.
3. Missing data are flagged with the value -99.99.
4. The last three lines provide some additional information about where the data was acquired from.
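
The layout described above can be parsed with a few lines of plain Python and numpy. The following is a minimal sketch, not a reference reader: the function name `read_enso` is hypothetical, the field separator is assumed to be commas and/or whitespace, and only the last 12 fields of each row are kept so that a leading year column (if the file has one) is ignored. The sample text in the demo is synthetic.

```python
import io
import re
import numpy as np


def _fields(line):
    """Split a line on commas and/or whitespace, dropping empty tokens."""
    return [tok for tok in re.split(r"[,\s]+", line) if tok]


def read_enso(fileobj, missing=-99.99):
    """Return (years, values); values has shape (n_years, 12), Jan..Dec."""
    lines = [ln for ln in fileobj if ln.strip()]
    # First line: start and end year of the record.
    start, end = (int(float(tok)) for tok in _fields(lines[0])[:2])
    years = np.arange(start, end + 1)
    # Read exactly one row per year; this also skips the trailing info lines.
    rows = []
    for ln in lines[1:1 + years.size]:
        # Assumption: the 12 monthly values are the last 12 fields of the row.
        rows.append([float(tok) for tok in _fields(ln)[-12:]])
    values = np.array(rows)
    # Flag missing months as NaN so they cannot silently enter a fit.
    values[np.isclose(values, missing)] = np.nan
    return years, values


# Synthetic two-year sample in the assumed layout (not real index values).
sample = (
    "2001 2002\n"
    "2001 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 1.1 1.2\n"
    "2002 1.2 1.1 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 -99.99\n"
    "footer line 1\nfooter line 2\nfooter line 3\n"
)
years, values = read_enso(io.StringIO(sample))
```

For the real file, the same function could be called on an open file handle, e.g. `with open("data/nina34.csv") as f: years, values = read_enso(f)`. It is worth printing the first and last rows to check that the assumptions about the layout hold.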

## The fire counts data

Since it'd be cumbersome to fit a model to every month of every year, we have calculated the "peak fire month" (i.e. the month which on average tends to have the largest number of fires) and the number of fire counts in that month for each year. All the fire data has been aggregated to a 5 degree grid.

The peak fire month is available as a single-band georeferenced GeoTIFF file with name [stuff](stuff).

The fire counts data is provided as a georeferenced GeoTIFF file with name [`data/FireCounts_2001_2016.tif`](data/FireCounts_2001_2016.tif). Each band holds the fire counts in the peak fire month of one year.

The fire counts are assumed to carry a Gaussian uncertainty: each fire count estimate has an associated standard deviation of 10% of the fire counts.
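
As a rough sketch of how the raster and its uncertainty might be loaded, the snippet below reads a GeoTIFF with GDAL's Python bindings and derives the standard deviation as 10% of the counts. The helper names `read_geotiff` and `count_sigma` are hypothetical, and the demo values are illustrative rather than taken from the dataset. GDAL is imported inside the function so the uncertainty helper can be tried without it.

```python
import numpy as np


def read_geotiff(path):
    """Return (array, geotransform) for a GeoTIFF file.

    The array is (bands, rows, cols) for a multi-band file and
    (rows, cols) for a single-band file.
    """
    from osgeo import gdal  # lazy import: only needed when reading files

    ds = gdal.Open(path)
    if ds is None:
        raise IOError(f"could not open {path}")
    data = ds.ReadAsArray().astype(float)
    return data, ds.GetGeoTransform()


def count_sigma(counts, fraction=0.10):
    """Standard deviation taken as a fixed fraction of the fire counts."""
    return fraction * np.asarray(counts, dtype=float)


# Illustrative values only.
demo_counts = np.array([120.0, 80.0, 0.0])
demo_sigma = count_sigma(demo_counts)
```

Note that a cell with zero fire counts gets a zero standard deviation under this rule, which makes its weight in a weighted fit infinite. How to treat such cells (masking them, or applying a small floor to the uncertainty) is a choice the exercise leaves open.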

## Fitting the model

We assume that the relationship between fire counts and the ENSO index at the peak fire month can be approximated by a linear function, i.e.

$$
N_{fires} = m\cdot ENSO_{\textrm{peak month}} + c
$$
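
Because the peak fire month differs from cell to cell, the predictor $ENSO_{\textrm{peak month}}$ is itself a per-cell time series. One possible way to build it with numpy indexing is sketched below. The function name `enso_at_peak` is hypothetical, and the sketch assumes that the peak month raster stores integers 1 to 12 (1 = January) and that every fitted year is present in the sorted ENSO record. The demo arrays are synthetic.

```python
import numpy as np


def enso_at_peak(enso_years, enso_values, peak_month, fit_years):
    """ENSO predictor with shape (n_fit_years, rows, cols).

    enso_values is (n_years, 12); peak_month is (rows, cols) with values
    assumed to be 1..12; fit_years must all appear in enso_years (sorted).
    """
    year_idx = np.searchsorted(enso_years, fit_years)
    month_idx = np.asarray(peak_month).astype(int) - 1
    # Broadcast (n, 1, 1) against (1, rows, cols): for each year and cell,
    # pick that year's row and the cell's peak-month column.
    return enso_values[year_idx[:, None, None], month_idx[None, :, :]]


# Synthetic record: the value for year i, month j is 12 * i + j.
demo_years = np.array([2000, 2001, 2002])
demo_values = np.arange(36, dtype=float).reshape(3, 12)
demo_peak = np.array([[1, 8]])  # one row, two cells: January and August
demo_x = enso_at_peak(demo_years, demo_values, demo_peak, np.array([2001, 2002]))
```

Cells where the peak month raster holds a nodata value would need masking before this step, since an out-of-range month would either raise an error or silently index the wrong column.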

You ought to be able to write this linear problem in matrix form and solve it with matrix operations (i.e. without using any higher-order package).
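
One common way to set this up, shown here as a sketch rather than the required solution, is weighted least squares. Stack the predictor and a column of ones into a design matrix $\mathbf{A}$, collect the observation variances in a diagonal matrix $\mathbf{C}$, and solve the normal equations $(\mathbf{A}^{\top}\mathbf{C}^{-1}\mathbf{A})\,\mathbf{p} = \mathbf{A}^{\top}\mathbf{C}^{-1}\mathbf{y}$ for $\mathbf{p} = (m, c)$. Under the Gaussian assumption, $(\mathbf{A}^{\top}\mathbf{C}^{-1}\mathbf{A})^{-1}$ is the covariance of the parameters. The function name `fit_line` and the demo numbers are illustrative, and using `np.linalg.inv` is assumed to be within the spirit of "no higher-order package".

```python
import numpy as np


def fit_line(x, y, sigma):
    """Weighted least-squares fit of y = m * x + c.

    Returns the parameters (m, c), their standard deviations,
    and the 2x2 parameter covariance matrix.
    """
    x, y, sigma = (np.asarray(v, dtype=float) for v in (x, y, sigma))
    A = np.column_stack([x, np.ones_like(x)])  # design matrix, shape (n, 2)
    w = 1.0 / sigma**2                         # diagonal of C^-1
    AtWA = A.T @ (A * w[:, None])              # A^T C^-1 A
    AtWy = A.T @ (w * y)                       # A^T C^-1 y
    cov = np.linalg.inv(AtWA)                  # parameter covariance
    params = cov @ AtWy
    return params, np.sqrt(np.diag(cov)), cov


# Noise-free synthetic data on the line y = 3x + 10, with 10% uncertainty.
demo_x = np.array([-1.0, 0.0, 1.0, 2.0])
demo_y = 3.0 * demo_x + 10.0
demo_params, demo_std, demo_cov = fit_line(demo_x, demo_y, 0.1 * demo_y)
```

Since the data in the demo lie exactly on a line, the fit should recover the slope and intercept to within rounding. With real data, years where the ENSO value is missing (NaN) would have to be dropped from `x`, `y` and `sigma` before calling the function.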


## Deliverables

The aim of this exercise is to fit a linear model to the observations from 2001 to 2014, and to provide an estimate of the slope and intercept, as well as the associated uncertainty in the parameters. You can do this by providing spatial maps of the value of the slope and intercept, and spatial maps of the standard deviation of the slope and intercept. You should also provide a scatter plot of the data and the model response for the grid cell corresponding to **LATITUDE** and **LONGITUDE**.
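
To produce the single-cell plot, the latitude and longitude first have to be turned into a row and column of the raster. The sketch below does this from a GDAL-style geotransform, assuming a north-up image with no rotation terms, and adds a small plotting helper. The names `lonlat_to_rowcol` and `plot_cell`, the demo geotransform and the demo coordinates are all hypothetical; the real geotransform should be read from the GeoTIFF.

```python
import math


def lonlat_to_rowcol(lon, lat, geotransform):
    """Row and column of the cell containing (lon, lat).

    geotransform follows the GDAL convention
    (x_origin, pixel_width, 0, y_origin, 0, pixel_height),
    with pixel_height negative for north-up images.
    """
    x0, dx, _, y0, _, dy = geotransform
    col = int(math.floor((lon - x0) / dx))
    row = int(math.floor((lat - y0) / dy))
    return row, col


def plot_cell(x, y, sigma, m, c):
    """Scatter plot of the observations with error bars and the fitted line."""
    import matplotlib.pyplot as plt  # lazy import: only needed for plotting

    fig, ax = plt.subplots()
    ax.errorbar(x, y, yerr=sigma, fmt="o", label="observations")
    xs = [min(x), max(x)]
    ax.plot(xs, [m * v + c for v in xs], label="linear model")
    ax.set_xlabel("ENSO index at peak fire month")
    ax.set_ylabel("Fire counts")
    ax.legend()
    return fig, ax


# Hypothetical global 5 degree grid and an arbitrary example location.
demo_gt = (-180.0, 5.0, 0.0, 90.0, 0.0, -5.0)
demo_row, demo_col = lonlat_to_rowcol(112.5, -2.5, demo_gt)
```

With the row and column in hand, the per-year fire counts and ENSO predictor for that cell can be sliced out of the stacked arrays and passed to the plotting helper together with the fitted slope and intercept.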

## Installed software

The following software is installed in this instance:

* numpy
* scipy
* matplotlib
* gdal
* pandas

