first commit
mlimotte committed Apr 26, 2012
0 parents commit f759c0c
Showing 1 changed file with 31 additions and 0 deletions.
31 changes: 31 additions & 0 deletions README.md
@@ -0,0 +1,31 @@
# OVERVIEW

lemur is a tool for launching Hadoop jobs, either locally or on Amazon Elastic MapReduce
(EMR), based on a configuration file referred to as a jobdef. The general command line format is:

```
lemur <command> <jobdef-file> [options] [remaining]

lemur help                 - Display this help text
lemur run ./jobdef.clj     - Run a job on EMR
lemur dry-run ./jobdef.clj - Dry-run, i.e. just print out what would be done
lemur start ./jobdef.clj   - Start an EMR cluster, but don't run the steps (jobs)
lemur local ./jobdef.clj   - Run the job using local Hadoop (e.g. standalone mode)
lemur display-types        - Display the instance-types with basic stats and exit
lemur spot-price-history   - Display the spot price history for the last day and exit

Examples:
lemur run clj/wb-clj/scripts/launch/hrap-jobdef.clj --dataset ahps --num-days 10
lemur start clj/wb-clj/src/weatherbill/lemur/sample-jobdef.clj
```

# JOBDEF OVERVIEW

A jobdef file describes your EMR cluster and zero or more "steps". A step is Amazon's
name for a task or job submitted to the cluster. lemur reads your jobdef, which
sets options inside (defcluster) and (defstep) forms. At the end of your jobdef, you
call (fire! ...) to act on those definitions. Keep in mind that the jobdef is a
Clojure (clj) file that lemur evaluates, so you can insert arbitrary Clojure code
anywhere in the file (but see HOOKS below for a better way).
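
As a rough illustration of the shape described above, a minimal jobdef might look something like the sketch below. Only the form names (defcluster), (defstep) and (fire!) come from this README. The names `sample-cluster` and `sample-step`, the option keys and values, and the arguments passed to `fire!` are hypothetical placeholders, and any namespace preamble a real jobdef needs is omitted because it is not described here.

```clojure
;; Sketch only: option keys, values and names below are illustrative
;; assumptions, not confirmed lemur options.

;; Describe the EMR cluster.
(defcluster sample-cluster
  :num-instances 2)                       ; hypothetical option

;; Describe one step, i.e. a job submitted to the cluster.
(defstep sample-step
  :main-class "com.example.SampleJob")    ; hypothetical option

;; Arbitrary Clojure may appear anywhere, since the jobdef is evaluated as code.
(println "jobdef loaded")

;; Nothing is launched until fire! is called at the end of the file.
;; The argument list shown here is an assumption.
(fire! sample-cluster sample-step)
```

A file like this would be passed as the `<jobdef-file>` argument, for example to `lemur dry-run`, to print what would be done before anything is started on EMR.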

