TEE2012_HadoopDemos
DotNetMapReduce
FSharpMapReduce
Hive
JavaMapReduce
Pig
basicStreaming
javascriptMapReduce
javascriptPig
.gitignore
README.md

This is a set of demos used at TechEd Europe 2012.

Data Set assumptions

Most of these demos operate on a flight delay data set, originally obtained from the Azure DataMarket (available here: https://datamarket.azure.com/dataset/e29b7fb9-3d2e-4f35-8088-c97dbd75cd1f).

We expect the following comma-separated schema, with columns in this order, for these demo jobs:

ArrDelayMinutes, Carrier, DayofMonth, DepDelayMinutes, Dest, FlightDate, Month, Origin, RowId, Year
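
To make the expected layout concrete, here is a minimal Python sketch of how a job might read records in this schema and compute an average arrival delay per carrier. It is not code from any of the demo folders. The function names `parse_flight_line` and `average_arrival_delay` are hypothetical, and the sketch assumes that `ArrDelayMinutes` is numeric and that rows with the wrong number of fields (or a non-numeric delay, such as a header row) can simply be skipped.

```python
import csv
from io import StringIO

# Column order as listed in the schema above.
COLUMNS = [
    "ArrDelayMinutes", "Carrier", "DayofMonth", "DepDelayMinutes", "Dest",
    "FlightDate", "Month", "Origin", "RowId", "Year",
]


def parse_flight_line(line):
    """Parse one comma-separated record into a dict keyed by column name.

    Returns None when the field count does not match the schema.
    """
    fields = next(csv.reader(StringIO(line)), None)
    if fields is None or len(fields) != len(COLUMNS):
        return None
    return dict(zip(COLUMNS, (field.strip() for field in fields)))


def average_arrival_delay(lines):
    """Return {carrier: mean ArrDelayMinutes} over all parseable lines."""
    totals = {}  # carrier -> (sum of delays, record count)
    for line in lines:
        record = parse_flight_line(line)
        if record is None:
            continue  # wrong number of fields
        try:
            delay = float(record["ArrDelayMinutes"])
        except ValueError:
            continue  # assumption: skip header rows and missing values
        total, count = totals.get(record["Carrier"], (0.0, 0))
        totals[record["Carrier"]] = (total + delay, count + 1)
    return {carrier: total / count for carrier, (total, count) in totals.items()}
```

Passing an open file or any list of lines to `average_arrival_delay` yields one average per carrier code. The same split-by-position logic carries over to a streaming mapper, which would emit the carrier and delay for each record instead of aggregating in memory.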