Building a car image dataset from scraping.
Python
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
scripts
.gitignore
.pre-commit-config.yaml
LICENSE
README
cars
colors
types

README

How lazy can I be making a dataset of car images

car make and models derived from: https://github.com/n8barr/automotive-model-year-data
most popular car colors : http://www.forbes.com/pictures/egdh45ijhf/1-white/#413d9991b1d4
deepLearning repos: https://github.com/fchollet/deep-learning-models

Commented out the line saying to use ANY image form GIS, now usage rights = free to share (I think)

-----
general workflow:
  make a datafile of all the URLS to crawl
  use datafile to download the URLS to local files
  verify the filetypes are accurate (only want images)
  verify the file contents using ResNet50 (only want images containing a specific thing..)
  generate additional classes/tags for future training
  cut dataset into training and testing
  ...
  profit