Permalink
Commits on May 12, 2010
  1. Nutch a tlp, moving svn

    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-0.8@943363 13f79535-47bb-0310-9956-ffa450edef68
    gmcdonald committed May 12, 2010
Commits on Dec 14, 2006
Commits on Nov 14, 2006
Commits on Oct 31, 2006
Commits on Oct 28, 2006
  1. Fix NUTCH-394.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@468673 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Oct 28, 2006
Commits on Oct 24, 2006
  1. fix for NUTCH-379

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@467357 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Oct 24, 2006
  2. fix for NUTCH-391

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@467343 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Oct 24, 2006
Commits on Oct 9, 2006
Commits on Sep 27, 2006
  1. had logged wron jira id

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@450488 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Sep 27, 2006
Commits on Sep 25, 2006
Commits on Sep 24, 2006
  1. prepare for development for 0.8.2

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@449375 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Sep 24, 2006
  2. preparing for 0.8.1 release

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@449365 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Sep 24, 2006
  3. preparing for 0.8.1 release

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@449364 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Sep 24, 2006
Commits on Sep 23, 2006
  1. NUTCH-336: differentiate between newly discovered pages (known value …

    …through
    
    inlink contributions) and newly injected pages (aribtrarily defined initial
    value).
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@449279 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Sep 23, 2006
Commits on Sep 22, 2006
  1. NUTCH-332: fix the problem of doubling scores caused by links pointing

    to the current page (e.g. anchors).
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@449100 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Sep 22, 2006
  2. Use a CombiningCollector when calculating readdb -stats. This drastic…

    …ally
    
    reduces the size of intermediate data, resulting in significant speedups
    for large databases.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@449097 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Sep 22, 2006
Commits on Sep 19, 2006
  1. NUTCH-105 - Network error during robots.txt fetch causes file to beig…

    …nored, contributed by Greg Kim
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@447867 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Sep 19, 2006
Commits on Sep 18, 2006
Commits on Aug 19, 2006
  1. NUTCH-338 - Remove the text parser as an option for parsing PDF files…

    … in parse-plugins.xml (Chris A. Mattmann)
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@432794 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Aug 19, 2006
Commits on Aug 18, 2006
  1. NUTCH-341 - if -workingdir is specified, always create a unique subdir.

    Also, use unique directory names to allow multiple IndexMergers to run
    simultaneously.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@432675 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Aug 18, 2006
Commits on Aug 17, 2006
  1. Update CHANGES.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@432290 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Aug 17, 2006
  2. Apply patch in NUTCH-348 - Generator used the lowest score instead of

    the highest. Contributed by Chris Schneider and Stefan Groschupf.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@432287 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Aug 17, 2006
Commits on Aug 14, 2006
  1. Fix incorrect calculation of max and min scores in readdb -stats. Spo…

    …tted
    
    by Chris Schneider.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@431368 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Aug 14, 2006
  2. Apply patches in rev 431364.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@431366 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Aug 14, 2006
Commits on Aug 11, 2006
Commits on Aug 8, 2006
  1. NUTCH-260 - update hadoop.jar

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@429769 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Aug 8, 2006
Commits on Jul 27, 2006
  1. logging changes to 0.8 branch also

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@426118 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 27, 2006
Commits on Jul 25, 2006
  1. Nutch 0.8 release maintenance branch.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/branches/branch-0.8@425492 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 25, 2006
  2. Change the name of SegmentReader alias to 'readseg' for consistency w…

    …ith other
    
    reading-related commands. Keep the old 'segread' for compatibility, and
    give a deprecation message.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@425354 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 25, 2006
  3. preparing 0.8 release

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@425324 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 25, 2006