Skip to content
MarsFromSpace.com - image+content scraper and hacked off-the-shelf Wordpress theme
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
api
salmoncream
tmp
.gitignore
Procfile
README.md
manage.py
publish.py
remove_this_from_published.txt
requirements.txt
scrape.py
scrape_to_publish.py
settings.py

README.md

Scraper for the Mars Reconnaissance Orbiter (HiRISE) website, grabs press release images and content from http://hirise.lpl.arizona.edu/releases/all_captions.php and publishes to a Wordpress blog http://www.marsfromspace.com/about/

New: added a django + tastypie api, if anyone wants to grab all the data we scraped: https://github.com/basilleaf/marsfromspace/tree/master/api

Salmoncream is our hacked WP theme.

Posts up to 5 a day, runs on Heroku scheduler:

heroku run python scrape_to_publish.py page_min page_max

ie:

heroku run python scrape_to_publish.py 1 5

You can’t perform that action at this time.