Our team's code and files for "Deep Learning Datathon 2020", organised by ai4impact.
Team Name: Oracle
repo and code by Murat Shagirov
Suggested way to use and edit ./src/*.py
files and notebook templates in ./nbs
:
- Copy notebook templates to a new folder, then create symbolic link e.g.:
ln -s '../datathon2020/src/' .
- Once you created the link, in your notebooks you can import all the modules in
./src
withfrom src.[MODULE_NAME] import [SOMETHING]
e.g.:from src.datautils import windowed_data
-
./src
python source code (2nd part of datathon). -
./nbs
notebook templates for normalising, preprocessing, training and prediction steps. -
/NN101/lab1
and/NN101/lab2
are self-contained codes and notebooks for labs 1 and 2 (1st part of the datathon) - Datasets:
-
lab1
- Raw training and test datasets in
./lab1/sg_temps
(*.csv
files) - Raw
sg-temps
dataset stats:lab1/sg_temps_stats.csv
- Raw training and test datasets in
-
lab2 datasets and stats:
- Raw dataset (has missing values for some hours):
./lab2/data/PJM_Load_hourly.csv
- Raw dataset stats:
./lab2/data_stats.csv
- Test and training datasets (not normalized data):
./lab2/data/test.csv
and./lab2/data/train.csv
- Raw dataset (has missing values for some hours):
-
NN201 (week 2)
-
./NN201/sg_temps
contains raw (sg_temps_raw.csv
) and normalised (sg_temps.csv
), data was normalised by shifting it by mean$\mu\approx28.0$ and scaling it by the S.D.$\sigma\approx0.8$ .$$x_{norm}=\frac{x_{raw}-\mu}{\sigma}$$ .
-
-
P003 (dataset for the challenge)
-
./P003/datasets/
contains:- Energy Generation Data for Ile-de-France, raw files from RET (real-time data will be updated when you re-download it in
./P003/introduction.ipynb
):- Units for power are in
MW
eCO2mix_RTE_Ile-de-France_Annuel-Definitif_2017.xls
eCO2mix_RTE_Ile-de-France_Annuel-Definitif_2018.xls
-
eCO2mix_RTE_Ile-de-France_En-cours-Consolide_FIXED_ERRCOLS.xls
(this is clean version ofeCO2mix_RTE_Ile-de-France_En-cours-Consolide.xls
data after removing empty columns) -
eCO2mix_RTE_Ile-de-France_En-cours-TR.xls
(near real-time data, date and time are in Paris time)
- Units for power are in
- Energy Generation Data for Ile-de-France, (averaged for 1h frequencies, in kWh)
energy-ile-de-france.csv
- Wind Forecast Data (from Terra Weather) from two weather models for major wind farm locations in Ile-de-France region:
-
/model1/
contains latest forecasts for model 1. -
/model2/
latest forecasts for model 2. -
/historical1/
historical forecasts for model 1. -
/historical2/
historical forecasts for model 2.
-
- Energy Generation Data for Ile-de-France, raw files from RET (real-time data will be updated when you re-download it in
-
-
lab1