Skip to content
Commits on Oct 30, 2012
  1. @jwills
  2. @jwills
Commits on Jul 11, 2012
  1. CRUNCH-8: Moving the code into multiple Maven modules. Contributed by…

    … Matthias Friedrich
    jwills committed Jul 10, 2012
Commits on Jul 10, 2012
  1. CRUNCH-11: Fix invalid package reference in the WordCountHBaseTest fo…

    …r hadoop 2.0.0-based builds, contributed by Jakob Homan
    jwills committed Jul 10, 2012
  2. Merge branch 'CRUNCH-9'

    jwills committed Jul 10, 2012
  3. Fix license header on PipelineAppTest

    jwills committed Jul 10, 2012
  4. Put scrunch classpath ahead of hadoop classpath

    jwills committed Jul 10, 2012
Commits on Jul 9, 2012
  1. @gabrielreid

    Add test-specific log4j.properties file

    Add a log4j.properties file to be used when testing. The
    difference with the main log4j.properties file is that this
    version enables logging for org.apache.hadoop, which shows errors
    that are otherwise hidden when running unit tests within the local
    job runner.
    gabrielreid committed Jul 9, 2012
  2. Update pom.xml and build.sbt to Apache naming conventions

    Signed-off-by: jwills <jwills@apache.org>
    jwills committed Jul 7, 2012
  3. Update pom.xml for Apache inclusion

    Signed-off-by: jwills <jwills@apache.org>
    jwills committed Jul 7, 2012
Commits on Jul 7, 2012
  1. @jwills

    Rename packages for the crunch-examples project and add license headers

    Signed-off-by: Josh Wills <jwills@cloudera.com>
    jwills committed Jul 7, 2012
  2. @jwills

    Rename examples/ packages and add license headers

    Signed-off-by: Josh Wills <jwills@cloudera.com>
    jwills committed Jul 7, 2012
  3. @jwills

    Rename scrunch packages and add license headers

    Signed-off-by: Josh Wills <jwills@cloudera.com>
    jwills committed Jul 7, 2012
  4. @jwills

    Rename com.cloudera.crunch -> org.apache.crunch in the Java core

    Signed-off-by: Josh Wills <jwills@cloudera.com>
    jwills committed Jul 7, 2012
  5. @jwills
Commits on Jul 6, 2012
  1. @gabrielreid

    Detach iterated join values

    Values being joined are typically re-used objects from a reducer's
    iterator, meaning storing them in a local collection does not have
    the desired behavior. The iterated values are now detached (i.e.
    deep copied) in joins to get around this.
    gabrielreid committed Jul 6, 2012
  2. @gabrielreid
  3. @gabrielreid

    Disable AvrosTest#testNestedTables

    AvrosTest#testNestedTables creates an invalid schema that causes
    Schema#toString to fail. This test has been @Ignored for now, but
    will either be removed completely (or fixed)
    gabrielreid committed Jul 3, 2012
  4. @gabrielreid
  5. @gabrielreid
  6. @gabrielreid

    Make PType extend Serializable

    Make PType extend Serializable so that PTypes can be passed within
    a DoFn to be used with map side joins.
    gabrielreid committed Jun 24, 2012
  7. @gabrielreid

    Extract setup for materialize

    Pull out the setup method for doing a materialize on a PCollection
    re-using it for in-memory mapside joins.
    gabrielreid committed Jun 24, 2012
  8. @jwills
Commits on Jul 4, 2012
  1. @jwills
Commits on Jul 3, 2012
  1. @gabrielreid

    Add Ptype#getDetachedValue

    Add getDetachedValue to PType to allow creating deep copies of
    values in reducer-based DoFns. A side-effect of this is that PType
    now extends Serializable.
    
    Also fixes the bug in Aggregate#collectValues that caused the same
    value to be collected multiple times in the case of custom
    Writables or AvroTypes.
    gabrielreid committed Jul 3, 2012
  2. @jwills
Commits on Jul 2, 2012
  1. @gabrielreid

    Improve documentation of PTable#materializeToMap

    The PTable#materializeToMap method returns a Map, while a PTable
    is actually a multi-map (i.e. possibly multiple values for a
    single key). The documentation of this method has been updated
    to clarify this.
    gabrielreid committed Jul 2, 2012
Commits on Jun 29, 2012
  1. @gabrielreid

    Improve compatibility with Avro ReflectDatumReader

    Allow Avro-based readers to correctly select between the
    ReflectDatumReader, GenericDatumReader, and SpecificDatumReader,
    allowing POJOs to be used fully throughout pipelines.
    gabrielreid committed Jun 29, 2012
Commits on Jun 28, 2012
  1. @tzolov
  2. @tzolov
Commits on Jun 27, 2012
  1. @jwills
Commits on Jun 21, 2012
  1. @jwills
Something went wrong with that request. Please try again.