Semantic Segmentation

What is Semantic Segmentation?

Semantic Segmentation is segmenting an image into different part (or classes) based on semantics (or essentially words). For example, if I had an image of a human walking a dog next to the street, maybe I want to segement that image into the human, the dog, and perhaps some cars. To do this, I will be training an FCN model to detect 22 different classes of objects within an image.

Why is this useful?

There are many reasons why we might want to semantically segment an image. Here are a few examples:

Autonomous driving vehicles may segment images from cameras and sensors to identify humans, other cars, signage, etc...
Satellites may segment images of space, identifying stars, nebulas, planets, etc... to locate their coordinates in space
Researchers may want to identify how much space is, on average, occupied by people walking around or by cars standing still in traffic

There are many use cases, this project will focus more on identifying 22 specific classes in particular for now.

Extra Notes

Code for this is entirely consolidated to a Jupyter Notebook. This was done mostly for purposes being able to use Google Colab GPUs, as I have no GPUs to easily train my model as of now. Thus, development on this project will be done primarily on Google Colab, though I plan on eventually moving away from that and fully onto my machine if I can.

Example Outputs

Here's a few examples of my output. Output was produced from a model trained with 50 epochs.

I expect higher quality and accuracy with more epochs. I have plans to train to 100 epochs once I have more resources availiable to me.

Future Developments

Data I used for training was provided by PASCAL, but I also want to see if I make personal training data that I can fit the model to train for objects I specify.

Also, I'm going to try tweaking my convolutional layers. Right now I upscale to 4096 x 4096 before pooling (stride of 2) down to 64 x 64, which should be enough to activate on features. But perhaps stopping before at around 128 or 256 will be better.

I do plan on continuing work on this project, expect occassional updates to the GitHub as most work is done on Colab.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src		src
.gitignore		.gitignore
README.md		README.md
semantic.ipynb		semantic.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Semantic Segmentation

What is Semantic Segmentation?

Why is this useful?

Extra Notes

Example Outputs

Future Developments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Semantic Segmentation

What is Semantic Segmentation?

Why is this useful?

Extra Notes

Example Outputs

Future Developments

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages