Source examples to support the "Cascading for the Impatient" blog post series, now for Cascalog.

Cascading Cascalog for the Impatient

Notice: This project has been moved to https://github.com/Cascading/Impatient-Cascalog. Please use that link instead.

Welcome to Cascalog for the Impatient, a series of tutorials and Cascalog code examples to get you started. This series is a fork of Cascading for the Impatient.

This set of progressive coding examples starts with a simple file copy and builds up to a MapReduce implementation of the TF-IDF algorithm.
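For readers new to TF-IDF, the sketch below shows roughly what the final part of the series computes. It is plain Python rather than Cascalog, and it assumes one common smoothed variant of the formula, `tf * log(n_docs / (1 + df))`; the tutorial's own code may differ in detail. The `tf_idf` function and the sample documents are hypothetical, made up for illustration.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Score each (doc_id, term) pair.

    docs maps a document id to its list of tokens.
    Assumed variant: tf * log(n_docs / (1 + df)).
    """
    n_docs = len(docs)

    # Document frequency: how many documents contain each term.
    df = Counter()
    for tokens in docs.values():
        df.update(set(tokens))

    scores = {}
    for doc_id, tokens in docs.items():
        # Term frequency: raw count of each term within this document.
        tf = Counter(tokens)
        for term, count in tf.items():
            scores[(doc_id, term)] = count * math.log(n_docs / (1.0 + df[term]))
    return scores


# Hypothetical sample data, not taken from the tutorial's data set.
docs = {
    "doc01": ["rain", "shadow", "rain"],
    "doc02": ["rain", "lee"],
    "doc03": ["shadow", "dry"],
    "doc04": ["dry", "air"],
}
scores = tf_idf(docs)
```

In a MapReduce setting, the term-frequency and document-frequency counts become separate grouped aggregations that are joined before the final score is computed.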

Getting Started

Clone this repository, then head over to the Wiki to work through the 6-part tutorial.

Prerequisites

Install the following:

  1. Hadoop, see Apache's instructions on setting up a local node
  2. Leiningen build tool for Clojure

Some basic knowledge of Clojure and Leiningen is helpful.
