Permalink
Commits on Jul 2, 2012
  1. Nutch 1.5.1 release.

    Lewis John McGibbney committed Jul 2, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/tags/release-1.5.1-rc2@1356364 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1404 Nutch script fails to find job file in deploy mode

    Lewis John McGibbney committed Jul 2, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5.1@1356363 13f79535-47bb-0310-9956-ffa450edef68
  3. NUTCH-1415 release packages to contain top level folder apache-nutch-…

    Lewis John McGibbney committed Jul 2, 2012
    …x.x + final commits to 1.5.1RC#2
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5.1@1356359 13f79535-47bb-0310-9956-ffa450edef68
  4. backport of NUTCH-1400 Remove developer -core option for bin/nutch

    Lewis John McGibbney committed Jul 2, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5.1@1356357 13f79535-47bb-0310-9956-ffa450edef68
  5. backport of NUTCH-1384 Typo in ParseSegment's run-method

    Lewis John McGibbney committed Jul 2, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5.1@1356351 13f79535-47bb-0310-9956-ffa450edef68
  6. backport of NUTCH-1398 Upgrade to Hadoop 1.0.3

    Lewis John McGibbney committed Jul 2, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5.1@1356343 13f79535-47bb-0310-9956-ffa450edef68
  7. create new branch-1.5.1

    Lewis John McGibbney committed Jul 2, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5.1@1356339 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 25, 2012
  1. commit to sync pom.xml with Ivy deps

    Lewis John McGibbney committed Jun 25, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5.1@1353619 13f79535-47bb-0310-9956-ffa450edef68
  2. commit to set up RC

    Lewis John McGibbney committed Jun 25, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5.1@1353615 13f79535-47bb-0310-9956-ffa450edef68
  3. copying trunk -r1352008 to branch-1.5.1

    Lewis John McGibbney committed Jun 25, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5.1@1353610 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 20, 2012
  1. NUTCH-1400 + changed version to 1.5.1-SNAPSHOT

    jnioche committed Jun 20, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1352008 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 19, 2012
  1. NUTCH-1404 Nutch script fails to find job file in deploy mode (sidaba…

    jnioche committed Jun 19, 2012
    …tra, jnioche)
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1351709 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 15, 2012
  1. NUTCH-1398 Upgrade to Hadoop 1.0.3

    jnioche committed Jun 15, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1350630 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 12, 2012
  1. NUTCH-1300 Indexer to filter normalize URL's

    Markus Jelsma committed Jun 12, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1349262 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1330 WebGraph OutlinkDB to preserve back up

    Markus Jelsma committed Jun 12, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1349240 13f79535-47bb-0310-9956-ffa450edef68
  3. NUTCH-1319 HostNormalizer plugin

    Markus Jelsma committed Jun 12, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1349236 13f79535-47bb-0310-9956-ffa450edef68
  4. NUTCH-1386 Headings filter not to add empty values

    Markus Jelsma committed Jun 12, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1349233 13f79535-47bb-0310-9956-ffa450edef68
  5. NUTCH-1356 ParseUtil use ExecutorService instead of manually thread h…

    Markus Jelsma committed Jun 12, 2012
    …andling
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1349230 13f79535-47bb-0310-9956-ffa450edef68
  6. NUTCH-1352 Improve regex urlfilters/normalizers synchronization

    Markus Jelsma committed Jun 12, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1349227 13f79535-47bb-0310-9956-ffa450edef68
  7. NUTCH-1024 Dynamically set fetchInterval by MIME-type

    Markus Jelsma committed Jun 12, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1349226 13f79535-47bb-0310-9956-ffa450edef68
  8. commit to address NUTCH-1364 and update to CHANGES.txt

    Lewis John McGibbney committed Jun 12, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1349076 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 11, 2012
  1. commit to address NUTCH-1360 and update to CHANGES.txt

    Lewis John McGibbney committed Jun 11, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1348993 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1262 Map `duplicating` content-types to a single type

    Markus Jelsma committed Jun 11, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1348785 13f79535-47bb-0310-9956-ffa450edef68
  3. NUTCH-1384 Typo in ParseSegments's run-method

    Markus Jelsma committed Jun 11, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1348766 13f79535-47bb-0310-9956-ffa450edef68
  4. NUTCH-1385 More robust plug-in order properties in nutch-site.xml

    Markus Jelsma committed Jun 11, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1348764 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 8, 2012
  1. trivial commit to add my details to KEYS file

    Lewis John McGibbney committed Jun 8, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1348094 13f79535-47bb-0310-9956-ffa450edef68
  2. trivial commit to add license header and update schema number

    Lewis John McGibbney committed Jun 8, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1348070 13f79535-47bb-0310-9956-ffa450edef68
  3. NUTCH-1336 Optionally not index db_notmodified pages

    Markus Jelsma committed Jun 8, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1347909 13f79535-47bb-0310-9956-ffa450edef68
  4. NUTCH-1346 Follow outlinks to ignore external

    Markus Jelsma committed Jun 8, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1347897 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 7, 2012
  1. NUTCH-1320 IndexChecker and ParseChecker choke on IDN's

    Markus Jelsma committed Jun 7, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1347755 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1351 DomainStatistics to aggregate by TLD

    Markus Jelsma committed Jun 7, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1347747 13f79535-47bb-0310-9956-ffa450edef68
  3. NUTCH-1381 Allow to override default subcollection field name

    Markus Jelsma committed Jun 7, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1347744 13f79535-47bb-0310-9956-ffa450edef68
Commits on May 31, 2012
  1. commit to backport ant tar-src and zip-src config to build.xml

    Lewis John McGibbney committed May 31, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1344900 13f79535-47bb-0310-9956-ffa450edef68
  2. commit to finalise ant tar-src and ant zip-src targets for RC4

    Lewis John McGibbney committed May 31, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.5@1344886 13f79535-47bb-0310-9956-ffa450edef68
Commits on May 30, 2012
  1. commit to backport release1.5 changes to trunk

    Lewis John McGibbney committed May 30, 2012
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1344477 13f79535-47bb-0310-9956-ffa450edef68