Permalink
Commits on Sep 5, 2011
  1. Bump version to 0.4.14

    toddlipcon committed Sep 5, 2011
Commits on Sep 1, 2011
  1. Fix performance issue when reinit() is called with a null Configuration

    toddlipcon committed Sep 1, 2011
    Previously, this would instantiate a new Configuration object on every call,
    which involved re-reading and parsing the configuration XML files to
    load the defaults. This was very slow.
    
    The new version caches a default Configuration object statically
    and uses that one in this circumstance.
Commits on Aug 26, 2011
  1. Merge pull request #28 from ivmaykov/master

    rangadi committed Aug 26, 2011
    Added an option for LzoTextInputFormat to handle non-Lzo file through TextInputFormat.
Commits on Aug 23, 2011
  1. Made LzoTextInputFormat a subclass of TextInputFormat

    Ilya Maykov
    Ilya Maykov committed Aug 23, 2011
  2. Made DeprecatedLzoTextInputFormat a subclass of TextInputFormat, whic…

    Ilya Maykov
    Ilya Maykov committed Aug 23, 2011
    …h cleans up the code nicely.
  3. Per code review feedback from Raghu Angadi, removed the LzoStreamingI…

    Ilya Maykov
    Ilya Maykov committed Aug 23, 2011
    …nputFormat.
Commits on Aug 18, 2011
  1. Added the LzoStreamingInputFormat and LzoStreamingLineRecordReader cl…

    Ilya Maykov
    Ilya Maykov committed Aug 18, 2011
    …asses.
    
    These classes are more appropriate than DeprecatedLzoTextInputFormat /
    DeprecatedLzoLineRecordReader for use with the hadoop-streaming jar, since
    they have the same behavior as the default streaming input format:
    
    - input is broken into lines using any of '\n', '\r', or '\r\n'.
    - line contents up to the first '\t' character are treated as the key
    - the rest of the line is treated as the value
    
    In contrast, the DeprecatedLzoTextInputFormat treats the file offset as the
    key and the entire line as the value. This resulted in weird behavior when
    using the DeprecatedLzoTextInputFormat with a streaming MR job. For example,
    when using -mapper 'cat' and no reducer (which should produce an output
    file that's identical to the input file), this input
    
    key1	    value1
    key2	    value2
    key3	    value3
    
    Produced this output:
    
    0	 key1 value1
    95	 key2 value2
    95	 key3 value3
    
    which is clearly wrong. Using LzoStreamingInputFormat produces the expected
    output (same as input).
Commits on Aug 17, 2011
  1. 1) Added the boolean option "lzo.text.input.format.ignore.nonlzo", (d…

    Ilya Maykov
    Ilya Maykov committed Aug 17, 2011
    …efault
    
    is true). The option is to be used with the DeprecatedLzoTextInputFormat and
    LzoTextInputFormat input format classes.
    
    When true, it causes all files that don't end in ".lzo" to be silently dropped
    from the input set.
    
    When false, it will keep files that don't end in ".lzo", and will process them
    with TextInputFormat (however, files that end in ".lzo.index" will still be
    ignored). This makes it possible to process a mix of LZO and non-LZO files
    with a single MR job, which in turn makes it much easier to perform an online
    upgrade to LZO compression in a production system without incurring downtime.
    
    It also makes it possible to reprocess ranges of log files that span the
    pre-LZO / post-LZO boundary in a single MR job.
    
    2) Added unit test for the above feature to TestLzoTextInputFormat.
    
    3) Added a public LzopCodec.DEFAULT_LZO_EXTENSION constant.
Commits on Aug 10, 2011
  1. version 0.4.13 : add support for writing lzo index file

    Raghu Angadi
    Raghu Angadi committed Aug 10, 2011
Commits on Aug 4, 2011
  1. Merge pull request #23 from rangadi/inline_index

    dvryaboy committed Aug 4, 2011
    Add an option to write lzo index file along with lzo file.
Commits on Jul 27, 2011
Commits on Jul 26, 2011
  1. pull #23: merge LzoIndexdOutputFormat into LzoOutputFormat.

    Raghu Angadi
    Raghu Angadi committed Jul 26, 2011
    Updated outputformat tests to verify the index.
Commits on Jul 21, 2011
  1. Add an option write lzo index file along with lzo file.

    Raghu Angadi
    Raghu Angadi committed Jul 21, 2011
Commits on Jul 1, 2011
Commits on Jun 3, 2011
  1. Bump version number after merging compression_level branch.

    Travis Crawford
    Travis Crawford committed Jun 3, 2011
  2. Merge remote branch 'origin/master' into compression_level

    Travis Crawford
    Travis Crawford committed Jun 3, 2011
  3. Merge remote branch 'kevinweil/master'

    Travis Crawford
    Travis Crawford committed Jun 3, 2011
  4. Merge pull request #21 from miguno/master

    rangadi committed Jun 3, 2011
    Fix issue #20: write uncompressed bytes if uncompressed size==compressed size
Commits on May 31, 2011
  1. Bugfix: fix default parameter value.

    Travis Crawford
    Travis Crawford committed May 31, 2011
Commits on May 27, 2011
  1. Merge remote branch 'origin/master' into compression_level

    Travis Crawford
    Travis Crawford committed May 27, 2011
  2. Merge remote branch 'kevinweil/master'

    Travis Crawford
    Travis Crawford committed May 27, 2011
  3. Updated related to compression level support.

    Travis Crawford
    Travis Crawford committed May 27, 2011
  4. Initial compression level support.

    Travis Crawford
    Travis Crawford committed May 27, 2011
  5. Update .gitignore for intellij.

    Travis Crawford
    Travis Crawford committed May 27, 2011
Commits on Apr 26, 2011
  1. Bump version to 0.4.11

    miguno committed Apr 26, 2011
Commits on Apr 13, 2011
  1. Issue 20: write uncompressed bytes if uncompressed size==compressed size

    miguno committed Apr 13, 2011
    The LZO specification says that we should write the uncompressed bytes
    rather than the compressed bytes if the compressed buffer is actually
    larger than the uncompresesd buffer.
    
    To conform to the standard, this means we have to write the uncompressed
    bytes also when they have exactly the same size as the compressed bytes.
Commits on Mar 19, 2011
Commits on Mar 17, 2011
  1. Add a template file .archive-version with git attributes to record ha…

    toddlipcon committed Mar 17, 2011
    …sh for github-generated tarballs