Skip to content
Commits on Jun 4, 2012
Commits on Nov 18, 2011
  1. meh, typo in last commit

  2. HitsByHour example

Commits on Oct 18, 2011
  1. lil' typo in README

Commits on Aug 5, 2011
Commits on Jul 27, 2011
  1. fecking namespces

  2. type hint for iterator

Commits on Jul 23, 2011
  1. apidocs

  2. fix arg counting (probably still broken when people don't use a space…

    … seprator, but who cares really)
  3. slightly nicer help

  4. Allow debug builds of packages; so far they just do skipped record co…

    …unting. Can be enabled with --debug flag for compile.php.
Commits on Jul 22, 2011
Commits on Jul 21, 2011
  1. fix infinite reduce loop due to null key (which means EOF) being cast…

    … to string, confusing handle()
  2. fix an issue where keys get stuck by consuming all values for current…

    … key if reducer returns prematurely
Commits on Jun 6, 2011
Commits on May 31, 2011
  1. allow passing of timezone to use to compile.php (if not given, the ti…

    …mezone of the machine the job is built on will be used)
  2. set default timezone in generated packages based on the timezone of t…

    …he system the job was compiled on
Commits on May 14, 2011
  1. support org.apache.hadoop.mapred.lib.KeyFieldBasedPartitioner and add…

    … a getCurrentKey() to the reducer/iterator so you can get the real key, not the one used for grouping, in cases where you partition by parts of a key
Commits on May 13, 2011
  1. document known issues

Commits on May 12, 2011
  1. remove eol anchor from apache log regex, that way lines with referrer…

    …s and/or user agents can be parsed too (although those two fields won't show up for now, can't be arsed to fix this properly at the minute
Commits on May 9, 2011
  1. readme update

Commits on Apr 18, 2011
  1. use env info to dynamically adjust reading and writing behavior to ha…

    …doop streaming settings (key separators and key size for map and reduce input and output); had to add a class Key for things to remain convenient. also reads the reporter prefix for status and counts, but that's not really interesting :)
Commits on Apr 13, 2011
  1. document -c flag for generated shell scripts and remove 'if possible'…

    … for streaming settings autodetect (Hadoop passes them, I checked today, will implement this soon)
Something went wrong with that request. Please try again.