gcamerli/flickr_scrapy_bot

Flickr Scrapy Bot

Description

Flickr Scrapy Bot is a web scraping bot built upon Scrapy, an application framework for crawling websites and extracting structured data. Scrapy can be used for a wide range of applications, such as data mining, information processing, or historical archival.

This bot collects images released under Creative Commons (CC) licenses using the Flickr API.
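As an illustration of what querying the Flickr API for CC-licensed images can look like, here is a minimal sketch that builds a `flickr.photos.search` request URL. The function name `build_search_url` and the default license IDs are assumptions made for this example and are not taken from the bot's source; check the Flickr API documentation for the license IDs you actually want.

```python
from urllib.parse import urlencode

# Public REST endpoint of the Flickr API.
FLICKR_REST_ENDPOINT = "https://api.flickr.com/services/rest/"


def build_search_url(api_key, license_ids=("4", "5"), page=1):
    """Build a flickr.photos.search URL restricted to the given license IDs.

    Hypothetical helper: the default IDs are assumed to map to CC BY and
    CC BY-SA; verify them against flickr.photos.licenses.getInfo.
    """
    params = {
        "method": "flickr.photos.search",
        "api_key": api_key,
        "license": ",".join(license_ids),  # comma-separated list of license IDs
        "page": page,
        "format": "json",
        "nojsoncallback": 1,  # ask for plain JSON instead of JSONP
    }
    return FLICKR_REST_ENDPOINT + "?" + urlencode(params)
```

A spider could request this URL for each page and parse the JSON response to find the photos to download.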

Purpose

Flickr Scrapy Bot automates the collection of images on creativechain.net.

Requirements

You must already have:

You need to install Scrapy:

$ pip install scrapy  

Setting up a virtual environment first is recommended.

Usage

Clone the repo:

$ git clone https://github.com/gcamerli/flickr_scrapy_bot.git

Change path:

$ cd flickr_scrapy_bot

Run the spider, passing your Flickr API key through the FLICKR_KEY environment variable:

$ FLICKR_KEY=******** scrapy crawl flickr_cc
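The command above passes the key through the process environment. A minimal sketch of how Python code can read it and fail early when it is missing is shown below; `read_flickr_key` is a hypothetical helper for illustration, not a function from this repository.

```python
import os


def read_flickr_key(environ=os.environ):
    """Return the Flickr API key from FLICKR_KEY, or raise if it is unset.

    Hypothetical helper: the bot's own code may read the variable differently.
    """
    key = environ.get("FLICKR_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "FLICKR_KEY is not set; run e.g. "
            "`FLICKR_KEY=<your key> scrapy crawl flickr_cc`"
        )
    return key
```

Failing before the crawl starts avoids sending requests that the API would reject for lacking a key.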

Collected images are saved to the images directory, which is created if it does not already exist.
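The "create the directory if it is missing, then save" behaviour can be sketched in plain Python as follows. This is an illustrative assumption about the behaviour, not the bot's actual code (which may instead rely on Scrapy's built-in `ImagesPipeline`); `save_image` and its parameters are hypothetical names.

```python
import os


def save_image(data, filename, directory="images"):
    """Write image bytes to directory/filename, creating the directory if needed.

    Hypothetical helper for illustration only.
    """
    # exist_ok=True makes this a no-op when the directory already exists.
    os.makedirs(directory, exist_ok=True)
    # basename() drops any path components so files stay inside `directory`.
    path = os.path.join(directory, os.path.basename(filename))
    with open(path, "wb") as handle:
        handle.write(data)
    return path
```

Calling `save_image(b"...", "photo.jpg")` twice in a row works because `os.makedirs(..., exist_ok=True)` does not raise when the directory is already there.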

Documentation

Everything you need to know about Scrapy can be found in the Scrapy documentation.

Credits

@orangain: sushibot

GPL License

This work is licensed under the terms of the GNU General Public License v3.0.

Donations

Donations are accepted at:

  • BTC: 1DNWtR4wJbFE7vjcfQvuj4iE7FURYBURtr
  • CREA: CRgURSBHqM5FzQhy2iuGKPAHycTUwzr3Ei
