Instance Segmentation

Instance segmentation in event-based videos (Research project). Paper here.

For this project we are currently using: Python 3.8.12, Miniconda3 and Pytorch. This is because it should be compatible with HPC so we can make use of training the models on it.

Required background knowledge:

What is an event-based camera? Link1, Link2
Basic Machine Learning (ML) knowledge and what is a neural network (NN)? 3b1b playlist
Basic Pytorch knowledge. 60min tutorial
Image Processing and Computational Intelligence knowledge from courses like CSE2225 and CSE2530.

Setting up Miniconda (for Windows only)

Make a Virtual environment with Miniconda3 by following this youtube tutorial.

In miniconda command line:

conda create --name instance_segmentation python=3.8.12  
conda info --envs  
conda activate instance_segmentation

Hopefully just running the following command should work:

pip install -r requirements.txt

Otherwise check this section!

For Pytorch

conda install astunparse numpy ninja pyyaml mkl mkl-include setuptools cmake cffi typing_extensions future six requests dataclasses
conda install -c conda-forge libuv=1.39
pip3 install torch==1.8.1+cpu torchvision==0.9.1+cpu torchaudio===0.8.1 -f https://download.pytorch.org/whl/torch_stable.html

Data visualization:

pip install tonic
pip install matplotlib

OpenCV:

python3.8 -m pip install opencv-python

pip install scikit-image

If Mask R-CNN is acting up read this!

Working fork of Mask R-CNN TF2 - working as of May 2022 Official Mask R-CNN - was not working with installed setup

For h5py:

pip uninstall h5py
conda install -c anaconda h5py

For imgaug:

pip3 install imgaug

For pycocotools:

pip install cython
pip install git+https://github.com/philferriere/cocoapi.git#egg=pycocotools^&subdirectory=PythonAPI

For scipy:

pip install -U scikit-image==0.16.2

For wandb:

pip install wandb

Running 1 digit

Change settings in src/main.py to generate the datasets or use the already generated datasets from data/.
Change paths to correct datasets in src/dvs_training.py, make sure the DETECTION_MAX_INSTANCES from src/mrcnn/config.py is set to 1.
From src/dvs_training.py, make sure the init_with variable is set to coco if training from scratch or set it to last to continue training some previous model.
Run src/dvs_training.py, wait until finished, setup similar paths in src/dvs_testing.py and run it. Plots should be generated and the results in terms of Accuracy, MIoU and mAP will be displayed when it finishes running.

Running multiple digits

Make sure the DETECTION_MAX_INSTANCES from src/mrcnn/config.py is set to 4 (or if you change the generation of multiple digits, set it to how many digits there are).
Similar to "Running 1 digit", but for generating the dataset, you only need to generate it the first time when running src/dvs_training_multiple.py, so afterwards you can set REGENERATE to False from src/dvs_dataset_multiple.py.
Run src/dvs_training_multiple.py and then run src/dvs_testing_multiple.py.

Visuals

Generated training masks

Predictions

Roadmap

W1 starting on 19/04/2022, presentation on 22/06/2022, documented here.

Authors and acknowledgment

Author: Ana Băltărețu
Supervisors: Nergis Tömen, Ombretta Strafforello, Xin Liu

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
configs		configs
img		img
release_logs		release_logs
src		src
tutorials		tutorials
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run.sh		run.sh

License

ana-baltaretu/instance-segmentation

Folders and files

Latest commit

History

Repository files navigation

Instance Segmentation

Setting up Miniconda (for Windows only)

Running 1 digit

Running multiple digits

Visuals

Roadmap

Authors and acknowledgment

Related work

License

About

Resources

License

Stars

Watchers

Forks

Languages