Speedy, lightweight web scraper for Shutterstock.
LICENSE.md
README.md
gettyscrape.py
shutterscrape.py


ShutterScrape

ShutterScrape is a web scraper for bulk downloading images or videos from Shutterstock with blinding speed. ⚡
It uses Selenium for browser automation and Beautiful Soup for parsing.
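
To illustrate the parsing half of that pipeline, here is a minimal sketch (not the code in shutterscrape.py). It assumes the page HTML is already available as a string, for example from Selenium's `driver.page_source`. The function name `extract_image_urls` is hypothetical, and the sketch uses Python's built-in `html.parser` rather than lxml so that it has fewer requirements.

```python
from bs4 import BeautifulSoup

try:
    from urllib.parse import urljoin  # Python 3
except ImportError:
    from urlparse import urljoin  # Python 2.7


def extract_image_urls(page_source, base_url):
    """Return absolute URLs for every <img> tag that has a src attribute.

    Hypothetical helper for illustration only; the real script may select
    elements differently.
    """
    soup = BeautifulSoup(page_source, "html.parser")
    urls = []
    for img in soup.find_all("img"):
        src = img.get("src")
        if src:  # skip <img> tags without a src attribute
            urls.append(urljoin(base_url, src))
    return urls
```

Resolving each `src` against the page URL means relative thumbnail paths still produce links that can be downloaded directly.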

If you like this repo, feel free to star ⭐ it!
For more information, visit https://davidlin.io/.


Setting up

  1. Install Python 2.7.

  2. Open a terminal and run the following commands:

pip install beautifulsoup4
pip install selenium
pip install lxml
  3. Install ChromeDriver.

  4. (Optional) Add the directories containing python.exe and chromedriver.exe to your PATH environment variable.
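
Once setup is finished, a quick sanity check can confirm that the packages import and that ChromeDriver is reachable on PATH. The following is a minimal sketch and not part of this repo. The helper names `missing_modules` and `find_on_path` are hypothetical, and it assumes the import names `bs4`, `selenium`, and `lxml`.

```python
import os


def missing_modules(names):
    """Return the subset of module names that cannot be imported."""
    missing = []
    for name in names:
        try:
            __import__(name)
        except ImportError:
            missing.append(name)
    return missing


def find_on_path(program):
    """Return the full path of an executable found on PATH, or None."""
    # Also try the .exe suffix so the check works on Windows.
    candidates = [program, program + ".exe"]
    for directory in os.environ.get("PATH", "").split(os.pathsep):
        for candidate in candidates:
            full_path = os.path.join(directory, candidate)
            if os.path.isfile(full_path) and os.access(full_path, os.X_OK):
                return full_path
    return None


if __name__ == "__main__":
    missing = missing_modules(["bs4", "selenium", "lxml"])
    print("Missing packages: %s" % (", ".join(missing) or "none"))
    print("chromedriver: %s" % (find_on_path("chromedriver") or "not found on PATH"))
```

If ChromeDriver is reported as not found, either complete the optional PATH step or keep chromedriver.exe in a location the script can reach.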


Running

Open a terminal in the directory containing shutterscrape.py and run:

python shutterscrape.py

Go grab a cup of coffee while waiting... oh wait, it's already done!


Definitions

  • Search mode: Enter i to scrape images or v to scrape videos.
  • Number of search terms: The number of words in your query. For example, to search for drone single person, enter 3.
  • Search term: Keyword(s) for searching on Shutterstock.
  • Number of pages to scrape: More pages yield more content, but results on later pages match the keywords less precisely.
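
To show how these four inputs fit together, here is a minimal sketch that turns them into one search URL per page. It is not taken from shutterscrape.py: the function name `build_search_urls` is hypothetical, and the URL templates are assumptions for illustration that may not match the site's actual search URLs.

```python
try:
    from urllib.parse import quote_plus  # Python 3
except ImportError:
    from urllib import quote_plus  # Python 2.7

# ASSUMED URL patterns, for illustration only.
ASSUMED_TEMPLATES = {
    "i": "https://www.shutterstock.com/search?searchterm={query}&page={page}",
    "v": "https://www.shutterstock.com/video/search?searchterm={query}&page={page}",
}


def build_search_urls(mode, terms, num_pages, templates=ASSUMED_TEMPLATES):
    """Build one search URL per results page.

    mode      -- "i" for images or "v" for videos
    terms     -- list of search terms, e.g. ["drone", "single", "person"]
    num_pages -- number of result pages to scrape (1 or more)
    """
    if mode not in templates:
        raise ValueError("mode must be 'i' or 'v'")
    if num_pages < 1:
        raise ValueError("num_pages must be at least 1")
    # Join the terms into one query and URL-encode it (spaces become '+').
    query = quote_plus(" ".join(terms))
    return [
        templates[mode].format(query=query, page=page)
        for page in range(1, num_pages + 1)
    ]
```

For example, `build_search_urls("i", ["drone", "single", "person"], 2)` yields two URLs whose query string contains `drone+single+person`, one for page 1 and one for page 2.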

Updates

10/01/2018
Added GUI for save directory selection.

07/31/2018
More stability fixes.

07/25/2018
Added gettyscrape.py for scraping videos from Getty Images.

07/23/2018
Stability fixes.