Skip to content

staceytay/movie-crawlr

Repository files navigation

Movie-crawlr

A web application that returns a movie's or TV series' IMDb metadata given its title. Built this to learn and gain familiarity with Scrapy, Flask, and Heroku. I built this in a weekend in April 2014 to experiement with the above technologies for the very first time, so it's very rough. Check out how I parsed JSON then if you dare.

http://movie-crawlr.herokuapp.com

References

Problems and bugs encountered

  1. Could not install Scrapy in virtualenv
  • error: distutils.errors.DistutilsError: Setup script exited with error: command 'cc' failed with exit status 1
  • Solution on Stack Overflow
  1. Accidentally adding venv to git
  • Fixed using git reset HEAD^
  1. Pushing to Heroku
  • error: distutils.errors.DistutilsError: Setup script exited with error: command 'gcc' failed with exit status 1
  • Solution on Stack Overflow
  1. Bash script could not run on Heroku
  • Substituted wget with curl in get_imdb_url.sh

About

A web application that scrapes IMDb movie metadata on demand

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published