South West Big Data Hack/Reduce code repository.
Please join our mailing list to discuss past and future Hack/Reduce events in the South West of England.
15th/16th February 2013
We made these (and more) tools, visualizations and other things at the first SW Hack/Reduce event. Most are works in progress. If you made something at the event, or have since updated or improved something we made, please let us know.
(data from Wardley 2000 and Ordnance Survey OS OpenData)
- http://dataunity.github.com/bristol-open-data/HealthLifeExpectancyByWard.html (Bar charts in an iPython notebook by Kev Kirkland)
- http://deathandtaxes.herokuapp.com/ (Javascript trend plots by Duncan Wilkie)
- http://mrdnk.github.com/BristolWardMap/ (Javascript map of wards by Simon Price)
- https://github.com/MRdNk/DeathAndTaxes (Duncan's trends)
Simon's maps. Open ''map.html''. As well as the map demo the raw JSON from the OS is also in the zip.
- https://github.com/MRdNk/BristolWardMap
- https://fluff.bris.ac.uk/fluff/u2/ecsnp/r9j1wtwEttlNqcuzIUBn3wEIL/ (hackreduce.zip available until Mon Mar 4)
By Duncan Wilkie
- http://bristol-data.herokuapp.com/ (data from various sources)
We tackled this from a couple of angles.
Firstly, this Javascript code by Mark Heseltine produces a word cloud visualization. Users can select the month and year of the source texts interactively. Licensed as open source under the Creative Commons Attribution 3.0 Unported License.
Chris Bailey's pair of Python scripts, ''mapper.py'' and ''reducer.py'' (in the literary-obituaries directory of this repository) can be used as a Mapper and Reducer in Hadoop to count word frequencies in the body of the articles, omitting the metadata. ''scrape_obituaries.py'' (in the same place) can be used to obtain a similar corpus of obituaries from a 21st century newspaper website, which you can compare with the historical data.
(data from Wardley 2000)
-
http://freebase.com (also on AWS as a raw dump)
-
https://www.nomisweb.co.uk/query/construct/components/stdListComponent.asp?menuopt=12&subcomp=100
-
http://www.google.com/publicdata/directory#!q=UK&dp=Department+for+Work+and+Pensions,+via+NOMIS
Wardley, P. (2000) (editor). Bristol historical resource CD-ROM, University of the West of England, Bristol. ISBN 1860433081 OCLC Number 49008697
(Readers at the UWE Library can access the CD at http://eprints.uwe.ac.uk/15092/ and other local libraries also have copies.)
Dr. Peter Wardley of the University of the West of England kindly loaned us his Bristol Historical Resource CD ROM for the first SWBD Hack/Reduce event.
Map data from Ordnance Survey's OS OpenData products. Contains Ordnance Survey data © Crown copyright and database right 2013