No description, website, or topics provided.
Clone or download
Pull request Compare This branch is even with cloudtrack:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
dataset
sample
src
.gitignore
Makefile
README.md
demo.sh
raw.json
requirements.in
requirements.txt

README.md

amznbin

Project Description

amznbin is a deep learning image recognition project using Amazon Bin Image Dataset.

We solve two tasks,

  1. Count: Predict the number of items in the image
  2. Classify: Predict what item is contained in the image

To reduce the complexity of the tasks and training time,

  1. We use images with only one kind of item. - this enables us to solve classify task as a single classification task
  2. We specify the minimum number of repetition of an item - most items appear only once in all images, when untreated it will make testing and validating unconvincing.
  3. We manually ruled out a number of invalid images. - check dataset/invalid_images.json to see which images are excluded.

How to Start

  • On root directory run pip install -r requirements.txt to install required libraries
  • Run make image_dataset and make load_metadata on command line to download the images and metadata (Using tmux would be a good idea)
  • After make load_metadata is finished, run make prepare_train and input the number of repetition you want (we recommend 20 or higher) and data augmentation mode.
  • When all above is finished you are ready to run our program!

Demo

  • Run make demo to checkout our training demo  - To run the demo of our pretrained model, open jupyter notebook and run src/demo.ipynb
    • our pretrained model was too big to upload on github, you can train the model and use your own trained model to use this file

Results

We accomplished 68.5714% accuracy on count task, and 41.4286% accuracy on classify task