Skip to content
Commits on Jan 10, 2012
  1. @rangadi

    Merge pull request #31 from butlermh/master

    Fixed build so it is possible to publish artifacts to local Maven repository
    rangadi committed Jan 10, 2012
Commits on Jan 9, 2012
  1. @dvryaboy

    Merge pull request #40 from rangadi/toddlipcon_master_jan_03

    Toddlipcon master jan 03
    dvryaboy committed Jan 9, 2012
Commits on Jan 4, 2012
  1. @dvryaboy

    Merge pull request #39 from rangadi/read_index_improvement

    readIndex : avoid creating Long objects for each lzo block
    dvryaboy committed Jan 4, 2012
Commits on Jan 3, 2012
  1. @dvryaboy

    Merge pull request #37 from rangadi/read_index_improvement

    Read index improvement, Mac build fix.
    dvryaboy committed Jan 3, 2012
  2. readIndex : avoid 2 extra RPCs to HDFS for each file.

    Raghu Angadi committed Jan 3, 2012
Commits on Nov 26, 2011
  1. @toddlipcon
  2. @toddlipcon
Commits on Oct 27, 2011
  1. @rangadi

    Merge pull request #34 from miguno/issue30

    Issue 30: Build fails on Ubuntu 11.10 (changed ld default behavior)
    rangadi committed Oct 27, 2011
  2. @miguno

    Issue 30: Build fails on Ubuntu 11.10 (changed ld default behavior)

    This patch explicitly sets the ld option '--no-as-needed'.  In Ubuntu
    11.10, the default behavior of ld was changed to '--as-needed', which
    breaks the src/native/configure script and its detection of the native
    liblzo2 library.
    
    More information is available at:
    https://github.com/kevinweil/hadoop-lzo/issues/33
    miguno committed Oct 27, 2011
Commits on Sep 19, 2011
  1. Bump version to 0.4.14

    Raghu Angadi committed Sep 19, 2011
  2. @dvryaboy

    Merge pull request #32 from rangadi/fix_verifyChecksum

    LzopInputStream : skip compressed data checksum when data is not compressed
    dvryaboy committed Sep 19, 2011
  3. LzopInputStream : skip compressed data checksum when data is uncompre…

    …ssed.
    Raghu Angadi committed Sep 19, 2011
Commits on Sep 6, 2011
  1. Removed Hadoop version numbers as they are not used

    Mark H. Butler committed Sep 6, 2011
Commits on Sep 5, 2011
  1. @toddlipcon

    Bump version to 0.4.14

    toddlipcon committed Sep 5, 2011
  2. @toddlipcon
  3. @toddlipcon

    Fix some javadoc formatting

    toddlipcon committed Sep 5, 2011
  4. @toddlipcon
  5. @toddlipcon
Commits on Sep 1, 2011
  1. @toddlipcon

    Fix performance issue when reinit() is called with a null Configuration

    Previously, this would instantiate a new Configuration object on every call,
    which involved re-reading and parsing the configuration XML files to
    load the defaults. This was very slow.
    
    The new version caches a default Configuration object statically
    and uses that one in this circumstance.
    toddlipcon committed Sep 1, 2011
Commits on Aug 29, 2011
  1. Fixed build file so it is possible to publish artifacts to local Mave…

    …n repository
    Mark H. Butler committed Aug 29, 2011
Commits on Aug 26, 2011
  1. @rangadi

    Merge pull request #28 from ivmaykov/master

    Added an option for LzoTextInputFormat to handle non-Lzo file through TextInputFormat.
    rangadi committed Aug 26, 2011
Commits on Aug 23, 2011
  1. Made LzoTextInputFormat a subclass of TextInputFormat

    Ilya Maykov committed Aug 23, 2011
  2. Made DeprecatedLzoTextInputFormat a subclass of TextInputFormat, whic…

    …h cleans up the code nicely.
    Ilya Maykov committed Aug 23, 2011
  3. Per code review feedback from Raghu Angadi, removed the LzoStreamingI…

    …nputFormat.
    Ilya Maykov committed Aug 23, 2011
Commits on Aug 18, 2011
  1. Added the LzoStreamingInputFormat and LzoStreamingLineRecordReader cl…

    …asses.
    
    These classes are more appropriate than DeprecatedLzoTextInputFormat /
    DeprecatedLzoLineRecordReader for use with the hadoop-streaming jar, since
    they have the same behavior as the default streaming input format:
    
    - input is broken into lines using any of '\n', '\r', or '\r\n'.
    - line contents up to the first '\t' character are treated as the key
    - the rest of the line is treated as the value
    
    In contrast, the DeprecatedLzoTextInputFormat treats the file offset as the
    key and the entire line as the value. This resulted in weird behavior when
    using the DeprecatedLzoTextInputFormat with a streaming MR job. For example,
    when using -mapper 'cat' and no reducer (which should produce an output
    file that's identical to the input file), this input
    
    key1	    value1
    key2	    value2
    key3	    value3
    
    Produced this output:
    
    0	 key1 value1
    95	 key2 value2
    95	 key3 value3
    
    which is clearly wrong. Using LzoStreamingInputFormat produces the expected
    output (same as input).
    Ilya Maykov committed Aug 17, 2011
Commits on Aug 17, 2011
  1. 1) Added the boolean option "lzo.text.input.format.ignore.nonlzo", (d…

    …efault
    
    is true). The option is to be used with the DeprecatedLzoTextInputFormat and
    LzoTextInputFormat input format classes.
    
    When true, it causes all files that don't end in ".lzo" to be silently dropped
    from the input set.
    
    When false, it will keep files that don't end in ".lzo", and will process them
    with TextInputFormat (however, files that end in ".lzo.index" will still be
    ignored). This makes it possible to process a mix of LZO and non-LZO files
    with a single MR job, which in turn makes it much easier to perform an online
    upgrade to LZO compression in a production system without incurring downtime.
    
    It also makes it possible to reprocess ranges of log files that span the
    pre-LZO / post-LZO boundary in a single MR job.
    
    2) Added unit test for the above feature to TestLzoTextInputFormat.
    
    3) Added a public LzopCodec.DEFAULT_LZO_EXTENSION constant.
    Ilya Maykov committed Aug 17, 2011
Commits on Aug 10, 2011
  1. version 0.4.13 : add support for writing lzo index file

    Raghu Angadi committed Aug 10, 2011
Commits on Aug 4, 2011
  1. @dvryaboy

    Merge pull request #23 from rangadi/inline_index

    Add an option to write lzo index file along with lzo file.
    dvryaboy committed Aug 4, 2011
Commits on Jul 27, 2011
Commits on Jul 26, 2011
  1. pull #23: merge LzoIndexdOutputFormat into LzoOutputFormat.

    Updated outputformat tests to verify the index.
    Raghu Angadi committed Jul 26, 2011
Commits on Jul 21, 2011
  1. Add an option write lzo index file along with lzo file.

    Raghu Angadi committed Jul 21, 2011
Commits on Jul 1, 2011
  1. @dvryaboy
Something went wrong with that request. Please try again.