Overview

For a more detailed synopsis, please visit http://d2fn.com/2010/07/28/schedule-scraping.html

Usage

This script requires an input file of one id per line of the sessions to scrape. The included file all-sessions.txt is an example.

Usage: python scrape.py [input-sessions-file] [out]

Called in this way, the scraper will download information for all sessions given in [input-sessions-file]. [out] defines the names of the output files that will be generated. [out].ics will contain the iCalendar output and [out].json will contain a single json document of all downloaded sessions. This json document is suitable for uploading to CouchDB

Dependencies

icalendar library from http://codespeak.net/icalendar/ simplejson

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
all-sessions-report.tsv		all-sessions-report.tsv
all-sessions.txt		all-sessions.txt
max2010-all.html		max2010-all.html
max2010-all.ics		max2010-all.ics
max2010-all.json		max2010-all.json
post2couch		post2couch
readme.md		readme.md
scrape.py		scrape.py
speakers.json		speakers.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

Usage

Dependencies

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Overview

Usage

Dependencies

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages