This repository is the final project for the course "Deep Learning" at opencampus in the winter semester 2020/21.
Topic: Image Segmentation
Details: This project completes the image segmentation task of the Kaggle Carvana Image Masking Challenge.
The idea came from an interest in learning how to segment an object from an image.
The dataset from the Carvana Image Masking Challenge consists of high-quality car photos, and the image backgrounds often contain colors similar to those of the cars.
Goals: Work through the challenge to understand how image segmentation works, learn which state-of-the-art methods are used for image segmentation, and fine-tune an image segmentation method with the knowledge gained from the course.
Dream: The project was initially planned to complete the Cloud Segmentation challenge on Kaggle, but with the time limitation and busy schedules...
- Some challenges on Kaggle:
- The GitHub Repository collecting some satellite imagery datasets:
- Image Segmentation tutorial with Oxford-IIIT Pet Dataset
- Open Images 2019 - Instance Segmentation
- Segmentation evaluation database
- A Large-scale Dataset for Instance Segmentation in Aerial Images
- Google Drive: right-click the shared folder and click "Add a shortcut to Drive" so that you can easily reach the folder
- Instructions for loading data from Google Drive into Google Colab
from google.colab import drive
drive.mount('/content/drive')
%cd /content/drive/MyDrive/DL_project/kaggle/data/
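Once the Drive folder is mounted, each image has to be matched with its mask before training. The following is only a minimal sketch, not code taken from the project notebooks. It assumes that an image `<name>.jpg` has a mask called `<name>_mask.gif` and that images and masks sit in two separate folders. The function name `pair_images_with_masks`, the folder names, and the default suffix are illustrative assumptions.

```python
from pathlib import Path


def pair_images_with_masks(image_dir, mask_dir, mask_suffix="_mask.gif"):
    """Return sorted (image_path, mask_path) pairs for images that have a mask."""
    image_dir, mask_dir = Path(image_dir), Path(mask_dir)
    pairs = []
    for image_path in sorted(image_dir.glob("*.jpg")):
        # Assumed naming scheme: "<name>.jpg" belongs to "<name>_mask.gif".
        mask_path = mask_dir / (image_path.stem + mask_suffix)
        if mask_path.exists():
            pairs.append((image_path, mask_path))
    return pairs


# Hypothetical usage, with folder names relative to the current directory:
# pairs = pair_images_with_masks("train", "train_masks")
```

Images without a matching mask are skipped silently, so comparing `len(pairs)` with the number of images is a quick check that the folder layout matches the assumption.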
- U-Net
- FractalNet
- Others: A repository for Deep Segmentation with several CNNs for semantic segmentation (U-Net, SegNet, ResNet, FractalNet) using Keras
- Optical flow
- Using Machine Learning to “Nowcast” Precipitation in High Resolution
- A Neural Weather Model for Eight-Hour Precipitation Forecasting
- Examples to work from: DL_Images.ipynb
- Final Notebook: Image_segmentation.ipynb
14th Dec. - 4th Jan.: project choice, dataset pre-processing, maybe first simple model and objective
4th Jan.: Peer review session, each group presents its status to another group
4th Jan. - 25th Jan.: Architecture, fine-tuning, presentation preparation
Main task: Find an intermediate target. Reaching the goal of cloud segmentation in six weeks seems too ambitious, so before the peer review we should choose a dataset and set our objectives.
- Look into different training datasets
- Try to train on them with some existing networks (for example, adapt the Image Segmentation tutorial to train on the dataset you find)
- List your findings:
- Datasets: What did you find? How does it work with the networks you used? Did you find any interesting notebooks? What might be the challenging part?
- Possible networks: What kinds of applications are the networks mostly used for? What is their architecture? Is there any explanation of them? (Feel free to upload the program you wrote, so that the others can test it.)
- Any useful documents that you think might help our project
- Have another discussion before 4th Jan.
- All: Look into the Image Segmentation tutorial with Oxford-IIIT Pet Dataset
- EJ: Check how to link Colab with the GitHub repository
- add a file from GitHub: simply click the link, tick the option "Private Repositories einschließen" ("Include private repositories"), and select this repository
- push a file to GitHub: File > Save a copy in GitHub
- Sebastian: Meeting tools
- Suman: Look into the dataset from Kaggle and give a brief summary
- Erwin: Work through the dataset from Kaggle and the possible applications
Date: 18th Jan. 2021 Time: 10 minutes for presentation + 5 minutes for a round of questions
Presentation structure:
- Group: who is in the group
- Project: short description of the project and the motivation behind it
- Tools (optional)
- Architecture
- Story (optional): attempts, failures, successes
- Results
- Baselines: how is the performance measured? Is it good enough? Compared to what?
- Missing (optional): is there something you did not get to improve in the project?
- Future work (optional): how could the project be improved?
- Sharing (optional) on the opencampus gitbook: Code, Data and Presentation
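For the "Baselines" item in the list above, one common way to measure segmentation performance is the Dice coefficient, which the Carvana Image Masking Challenge also uses for scoring. The sketch below is a plain-Python illustration for a single pair of binary masks, not the evaluation code used in the project notebooks. The function name `dice_coefficient` and the handling of two empty masks are assumptions.

```python
def dice_coefficient(pred_mask, true_mask):
    """Dice = 2 * |A ∩ B| / (|A| + |B|) for two flattened binary masks."""
    pred = [int(bool(p)) for p in pred_mask]
    true = [int(bool(t)) for t in true_mask]
    if len(pred) != len(true):
        raise ValueError("masks must have the same number of pixels")
    intersection = sum(p * t for p, t in zip(pred, true))
    total = sum(pred) + sum(true)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total


# Example: the prediction marks two pixels, one of which is a true car pixel.
score = dice_coefficient([1, 1, 0, 0], [1, 0, 0, 0])  # 2 * 1 / (2 + 1)
```

A score of 1.0 means the predicted mask matches the ground truth exactly, and 0.0 means the two masks do not overlap at all. Averaging the score over all test images gives a single number that can be compared against a baseline.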