Flickr Scrapy Bot

Description

Flickr Scrapy Bot is a web scraping bot build upon Scrapy, an application framework for crawling web sites and extracting structured data which can be used for a wide range of useful applications, like data mining, information processing or historical archival.

This bot collects images licensed under CC using Flickr API.

Purpose

Flickr Scrapy Bot automates collecting of images on creativechain.net.

Requirements

You must already have:

python (2.7 or 3.3 or above)
Flickr API KEY

You need to install scrapy:

$ pip install scrapy

It is recommended to set a virtual environment.

Usage

Clone the repo:

$ git clone https://github.com/gcamerli/flickr_scrapy_bot.git

Change path:

$ cd flickr_scrapy_bot

Run spider, setting up the Flickr API key:

$ FLICKR_KEY=******** scrapy crawl flickr_cc

Collected images are saved into images dir, which will be created if not exists.

Documentation

All you need to know about Scrapy it is possible to find here.

Credits

@orangain: sushibot

GPL License

This work is licensed under the terms of GNU General Public License v3.0

Donations

Donations are accepted at:

BTC: 1DNWtR4wJbFE7vjcfQvuj4iE7FURYBURtr
CREA: CRgURSBHqM5FzQhy2iuGKPAHycTUwzr3Ei

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
flickr		flickr
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Flickr Scrapy Bot

Description

Purpose

Requirements

Usage

Documentation

Credits

GPL License

Donations

About

Releases

Packages

Languages

License

gcamerli/flickr_scrapy_bot

Folders and files

Latest commit

History

Repository files navigation

Flickr Scrapy Bot

Description

Purpose

Requirements

Usage

Documentation

Credits

GPL License

Donations

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages