Train/test model with blurred data #74

nbren12 · 2019-05-21T00:17:46Z

To simulate the effect of coarse-resolution data, we can test/train the NN on blurred training data.

Changes needed:

* Add a script for blurring the data.
* Refactor and improve the SAM-based pre-processing scripts. The pre-processing pipeline was too confusing and was scattered across several snakemake rules. Now these are combined into one script: `uwnet/data/preprocess.py`.
* Automate NN training and SAM simulation with snakemake. These steps had to be executed manually because Sacred generated the folder names automatically. Now the model training and SAM runs are named based on the filename of the JSON file used to train them.
* Improve training messages. Previously it was hard to debug errors with the input data. Also use the agg backend for plots, so that the training does not die on olympus.
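To make the blurring step concrete, here is a minimal sketch of a separable Gaussian blur on a 2-D horizontal field. It is not the script added in this PR: the function names, the periodic boundary treatment, and the kernel truncation at four standard deviations are all assumptions, and a real implementation would more likely operate on arrays than on nested lists.

```python
import math


def gaussian_kernel(sigma, truncate=4.0):
    """Return a normalized 1-D Gaussian kernel with standard deviation `sigma`.

    The kernel is cut off at `truncate * sigma` grid points (an assumption).
    """
    radius = int(truncate * sigma + 0.5)
    weights = [math.exp(-0.5 * (i / sigma) ** 2) for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def blur_1d(values, kernel):
    """Convolve a 1-D sequence with `kernel`, wrapping periodically at the edges."""
    n = len(values)
    radius = len(kernel) // 2
    return [
        sum(k * values[(i + j - radius) % n] for j, k in enumerate(kernel))
        for i in range(n)
    ]


def blur_2d(field, sigma):
    """Blur a 2-D field (a list of rows) along both axes with the same sigma."""
    kernel = gaussian_kernel(sigma)
    # First pass: blur every row.
    rows = [blur_1d(row, kernel) for row in field]
    # Second pass: transpose, blur every column, transpose back.
    cols = [blur_1d(list(col), kernel) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]
```

With periodic wrapping and a normalized kernel, the blur spreads a point value over its neighbours without changing the domain total, which is a convenient sanity check when comparing blurred and unblurred training data.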

nbren12 added 14 commits May 16, 2019 17:35

Refactor pre-processing scripts

0234a30

The pre-processing pipeline was too confusing and was scattered across several snakemake rules. This commit combines these into one script called `uwnet/data/preprocess.py`.

Remove handling of step dimension

ba2e07a

There is no longer a `step` dimension in the training data.

Implement pre-processing on olympus

2c3f40c

Fix pre-processing in docker

c3d6974

Add files to gitignore

7a41014

Add blurring information to training data paths

5c66a0f

The data blurred with a radius of xxx will be stored at `data/processed/training/sigmaxxx.nc`. The unblurred data will be stored at `data/processed/training/noBlur.nc`.
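The naming convention in this commit message can be sketched as a small helper. The function name `training_data_path` is hypothetical, and the way the radius is rendered into the filename (here with the `g` format, so `2.0` becomes `2`) is an assumption, since the commit only writes it as `xxx`.

```python
from pathlib import PurePosixPath
from typing import Optional

# Directory taken from the commit message above.
TRAINING_DIR = PurePosixPath("data/processed/training")


def training_data_path(sigma: Optional[float] = None) -> str:
    """Return the training-data path for a blur radius, or the unblurred path.

    Passing `sigma=None` selects the unblurred file.
    """
    if sigma is None:
        return str(TRAINING_DIR / "noBlur.nc")
    # Assumed formatting of the radius; the real rule may differ.
    return str(TRAINING_DIR / f"sigma{sigma:g}.nc")
```

Keeping the radius in the filename lets a workflow rule recover it from the requested output path, so a single rule can produce every blurred variant.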

Add preprocess_ prefix to snakemake rules

cfc44f7

This commit makes it easier to identify rules related to pre-processing.

automate sam run with snakemake

8dac233

Training doesn't work in this commit.

Improve error messages

46cee42

Previously it was hard to debug errors with the input data.

Remove time dimension from layer_mass variable

2115656

The new pre-processed data has a time-varying `layer_mass` variable, which broke the metrics calculation.
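One way to read this fix, sketched with plain Python lists rather than the project's actual data structures: if `layer_mass` is stored as one vertical profile per time step, collapse it to a single profile before computing metrics. Taking the first time step, the tolerance check, and the name `drop_time_dimension` are all assumptions; the commit itself may handle this differently.

```python
def drop_time_dimension(layer_mass_tz, rtol=1e-6):
    """Collapse layer_mass[time][z] to a single layer_mass[z] profile.

    Raises ValueError if the profiles differ between time steps by more
    than the relative tolerance `rtol`, since silently picking one
    profile would then change the mass weighting.
    """
    if not layer_mass_tz:
        raise ValueError("layer_mass has no time steps")
    first = layer_mass_tz[0]
    for profile in layer_mass_tz[1:]:
        for a, b in zip(first, profile):
            if abs(a - b) > rtol * max(abs(a), abs(b)):
                raise ValueError("layer_mass differs between time steps")
    return list(first)
```

The explicit check makes the assumption visible: the time dimension is only dropped when it carries no information.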

Use agg backend for plots

2b33035

Fix sam running rule

f49d687

It now runs on olympus using the `sam_path` specified in the configuration file.

Add training configuration files

d866f14

One is for fast debugging; the other is for the blurred data.
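The PR description says that model training and SAM runs are named after the JSON file used to train them. A minimal sketch of that convention follows; the helper names, the example configuration path, and the `models` and `sam_runs` directory names are hypothetical and are not taken from the repository.

```python
from pathlib import PurePosixPath


def run_name(config_path: str) -> str:
    """Derive a run name from a training configuration filename."""
    # "some/dir/blurred.json" -> "blurred"
    return PurePosixPath(config_path).stem


def output_dirs(config_path: str, models_root: str = "models", runs_root: str = "sam_runs"):
    """Return (model_dir, sam_run_dir) for a configuration file.

    Both root directories are placeholder names for illustration only.
    """
    name = run_name(config_path)
    return (
        str(PurePosixPath(models_root) / name),
        str(PurePosixPath(runs_root) / name),
    )
```

Because the output location is a pure function of the configuration filename, a workflow tool can predict it in advance, which is what automatically generated folder names prevent.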

Fix error in create_case

cf0f865

`model_run_path` needs to be set.

nbren12 merged commit 5571450 into master May 21, 2019

nbren12 deleted the feature/blur-inputs branch May 21, 2019 06:57
