Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
Branch: master
Commits on Jan 10, 2012
  1. @rangadi

    Merge pull request #31 from butlermh/master

    rangadi authored
    Fixed build so it is possible to publish artifacts to local Maven repository
Commits on Jan 9, 2012
  1. @dvryaboy

    Merge pull request #40 from rangadi/toddlipcon_master_jan_03

    dvryaboy authored
    Toddlipcon master jan 03
Commits on Jan 4, 2012
  1. @dvryaboy

    Merge pull request #39 from rangadi/read_index_improvement

    dvryaboy authored
    readIndex : avoid creating Long objects for each lzo block
  2. @rangadi
Commits on Jan 3, 2012
  1. @dvryaboy

    Merge pull request #37 from rangadi/read_index_improvement

    dvryaboy authored
    Read index improvement, Mac build fix.
  2. @rangadi
  3. @rangadi
Commits on Nov 26, 2011
  1. @toddlipcon
  2. @toddlipcon
Commits on Oct 27, 2011
  1. @rangadi

    Merge pull request #34 from miguno/issue30

    rangadi authored
    Issue 30: Build fails on Ubuntu 11.10 (changed ld default behavior)
  2. @miguno

    Issue 30: Build fails on Ubuntu 11.10 (changed ld default behavior)

    miguno authored
    This patch explicitly sets the ld option '--no-as-needed'.  In Ubuntu
    11.10, the default behavior of ld was changed to '--as-needed', which
    breaks the src/native/configure script and its detection of the native
    liblzo2 library.
    
    More information is available at:
    https://github.com/kevinweil/hadoop-lzo/issues/33
Commits on Sep 19, 2011
  1. @rangadi

    Bump version to 0.4.14

    rangadi authored
  2. @dvryaboy

    Merge pull request #32 from rangadi/fix_verifyChecksum

    dvryaboy authored
    LzopInputStream : skip compressed data checksum when data is not compressed
  3. @rangadi
Commits on Sep 6, 2011
  1. Removed Hadoop version numbers as they are not used

    Mark H. Butler authored
Commits on Sep 5, 2011
  1. @toddlipcon

    Bump version to 0.4.14

    toddlipcon authored
  2. @toddlipcon
  3. @toddlipcon
  4. @toddlipcon
  5. @toddlipcon
Commits on Sep 1, 2011
  1. @toddlipcon

    Fix performance issue when reinit() is called with a null Configuration

    toddlipcon authored
    Previously, this would instantiate a new Configuration object on every call,
    which involved re-reading and parsing the configuration XML files to
    load the defaults. This was very slow.
    
    The new version caches a default Configuration object statically
    and uses that one in this circumstance.
Commits on Aug 29, 2011
  1. Fixed build file so it is possible to publish artifacts to local Mave…

    Mark H. Butler authored
    …n repository
Commits on Aug 26, 2011
  1. @rangadi

    Merge pull request #28 from ivmaykov/master

    rangadi authored
    Added an option for LzoTextInputFormat to handle non-Lzo file through TextInputFormat.
Commits on Aug 23, 2011
  1. Made LzoTextInputFormat a subclass of TextInputFormat

    Ilya Maykov authored
  2. Made DeprecatedLzoTextInputFormat a subclass of TextInputFormat, whic…

    Ilya Maykov authored
    …h cleans up the code nicely.
  3. Per code review feedback from Raghu Angadi, removed the LzoStreamingI…

    Ilya Maykov authored
    …nputFormat.
Commits on Aug 18, 2011
  1. Added the LzoStreamingInputFormat and LzoStreamingLineRecordReader cl…

    Ilya Maykov authored
    …asses.
    
    These classes are more appropriate than DeprecatedLzoTextInputFormat /
    DeprecatedLzoLineRecordReader for use with the hadoop-streaming jar, since
    they have the same behavior as the default streaming input format:
    
    - input is broken into lines using any of '\n', '\r', or '\r\n'.
    - line contents up to the first '\t' character are treated as the key
    - the rest of the line is treated as the value
    
    In contrast, the DeprecatedLzoTextInputFormat treats the file offset as the
    key and the entire line as the value. This resulted in weird behavior when
    using the DeprecatedLzoTextInputFormat with a streaming MR job. For example,
    when using -mapper 'cat' and no reducer (which should produce an output
    file that's identical to the input file), this input
    
    key1	    value1
    key2	    value2
    key3	    value3
    
    Produced this output:
    
    0	 key1 value1
    95	 key2 value2
    95	 key3 value3
    
    which is clearly wrong. Using LzoStreamingInputFormat produces the expected
    output (same as input).
Commits on Aug 17, 2011
  1. 1) Added the boolean option "lzo.text.input.format.ignore.nonlzo", (d…

    Ilya Maykov authored
    …efault
    
    is true). The option is to be used with the DeprecatedLzoTextInputFormat and
    LzoTextInputFormat input format classes.
    
    When true, it causes all files that don't end in ".lzo" to be silently dropped
    from the input set.
    
    When false, it will keep files that don't end in ".lzo", and will process them
    with TextInputFormat (however, files that end in ".lzo.index" will still be
    ignored). This makes it possible to process a mix of LZO and non-LZO files
    with a single MR job, which in turn makes it much easier to perform an online
    upgrade to LZO compression in a production system without incurring downtime.
    
    It also makes it possible to reprocess ranges of log files that span the
    pre-LZO / post-LZO boundary in a single MR job.
    
    2) Added unit test for the above feature to TestLzoTextInputFormat.
    
    3) Added a public LzopCodec.DEFAULT_LZO_EXTENSION constant.
Commits on Aug 10, 2011
  1. @rangadi
Commits on Aug 4, 2011
  1. @dvryaboy

    Merge pull request #23 from rangadi/inline_index

    dvryaboy authored
    Add an option to write lzo index file along with lzo file.
Commits on Jul 27, 2011
  1. @rangadi
  2. @rangadi
Commits on Jul 26, 2011
  1. @rangadi

    pull #23: merge LzoIndexdOutputFormat into LzoOutputFormat.

    rangadi authored
    Updated outputformat tests to verify the index.
Commits on Jul 21, 2011
  1. @rangadi
Commits on Jul 1, 2011
  1. @dvryaboy
Something went wrong with that request. Please try again.