Skip to content

bes-dev/image_crawler

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Image Crawler

An image crawler with Python 2.7.

It's a forked project of Bulk-Bing-Image-downloader. With this tool, you can crawl full-sized images.

This tool supports the following search engines and databases

  • Bing (default)
  • Google (API key and search engine ID are necessary)
  • ImageNet

Features

  • UTF-8 support (tested on Linux env.)
  • Multi-threaded downloads
  • Check image headers and don't download corrupted files (e.g. HTML error messages)
  • Keep source URLs (in a pickle file)

Important

Use of the crawled images must be abide by the license of the original URLs.

Usage

chmod +x downloader.py
./downloader.py [-h] [-s SEARCH_STRING] [-f SEARCH_FILE] [-o OUTPUT] [--no-filter] [-e SEARCH_ENGINE]

Example

./downloader.py -s earth

About

Search and download images by keywords

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%