Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
Python
branch: master

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
.gitignore
BrowserDecoy.py
README
all_links_to_full_text.txt
fetch_urls_full_text.py

README

Welcome to blueflower!

At the heart of the project lies the BrowserDecoy class. For the moment, this is not required, as www.cell.com does not block crawlers. This will turn out useful later.

To get started you can call the first script from the command line.
$ python fetch_urls_full_text.py > all_links_to_full_text.txt
Something went wrong with that request. Please try again.