Parts of the code are are based on this project.
semseg
├── storage % data (not code): datasets, trained models, log-files, ...
├── data % data loading and preparation: Dataset, MiniBatchReader
| ├── preparers % Iccv09Preparer
| | ├── abstract_preparer.py
| | └── iccv09_preparer.py
| ├── dataset_dir.py
| └── dataset.py
├── models % AbstractModel, BaselineA
| ├── preprocessing.py
| ├── tf_utils
| | ├── blocks.py % higher-level operations: ResNet block, LinkNet rencoder/decoder blocks
| | ├── layers.py % elementary operations: conv, max_pool, resize
| | └── variables.py
| ├── abstract_model.py % AbstractModel
| └── baseline_a.py % BaselineA
├── processing % image and label processing
| ├── image_format.py
| ├── labels.py
| ├── shape.py % TODO: test resize
| └── transform.py % TODO: use skimage.transform and test
├── test % unit tests
| :.
├── util % helper functions and classes
| ├── visualizer.py % Visualizer
| :.
├── evaluation.py
:.- implement evaluation measures used in LinkNet and modify
evaluation.pyso that it makes use of numpy/scipy - evaluate our LinkNet
- implement inference time measurement depending on mini-batch size (to compare with the results in the LinkNet paper)
- write the report
- nothing
- use
tf.nn.sparse_softmax_cross_entropy_with_logitsfor more efficient training - make a baseline similar to
BaselineAthat uses strided convolutions instead of pooling layers (use 3x3 conv with stride 2 instead of pool->conv) - try IoU loss (like here)
- finish saving and loading of trained models ~ Ivan
- implement LinkNet ~ Josip
- make data loading work (
data.preparers.Iccv09Preparer, data.Dataset) - make a simple baseline
- complete
abstract_model - make the cost function (as well as other used evaluation measures) in
BaselineAignore "unknown" class (class 0) - implement
util.Visualizer - improve the colors in
util.visualizer.Visualizer - enable usage of
util.Visualizerwhile training (by pressing d followed by ENTER in the console) - add
stride:intanddilation:intparameters totf_utils.layers.conv(usetf.nn.convolution) - add batch normalization to
tf_utils.layers, usetf.layers.batch_normalization(input_layer, fused=True, data_format='NCHW') - improve random seeding in
Datasetfor beter reproducibility - add ResNet layers and encoder/decoder blocks used in LinkNet to
tf_utils.blocks - implement a textual options menu that can be opened while training is paused, enabling network output visualization, saving/loading of weights, stopping training (after the current epoch) and other actions
- add transposed convolution to
tf_utils.layers
mini-batch size = 16, Pentium 2020M
| Model | mIoU | Pixel acc. | #epochs | Inference time [s] | Hardware |
|---|---|---|---|---|---|
| BaselineA | - | 0.630 | 150 | 0.100* | Pentium 2020M |
"Inference time" - on what hardware?