Permalink
Commits on Nov 23, 2012
  1. Tag for Nutch 1.6 release.

    git-svn-id: https://svn.apache.org/repos/asf/nutch/tags/release-1.6@1412896 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 23, 2012
  2. committing pom.xml for 1.6 RC

    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.6@1412894 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 23, 2012
  3. Tag for Nutch 1.6 release.

    git-svn-id: https://svn.apache.org/repos/asf/nutch/tags/release-1.6@1412839 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 23, 2012
  4. Branch of Nutch 1.6 release.

    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/branch-1.6@1412835 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 23, 2012
  5. prepare for 1.6 release

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1412834 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 23, 2012
Commits on Nov 22, 2012
  1. NUTCH-1370 Expose exact number of urls injected @runtime

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1412573 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 22, 2012
Commits on Nov 16, 2012
  1. trivial commit to remove unused import and @Test annotation

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1410392 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 16, 2012
Commits on Nov 13, 2012
  1. NUTCH-1117 JUnit test for index-anchor

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1408898 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 13, 2012
Commits on Nov 12, 2012
  1. NUTCH-1451 Upgrade automaton jar to 1.11-8

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1408282 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 12, 2012
Commits on Nov 9, 2012
  1. * NUTCH-1488 bin/nutch to run junit from any directory (snagel via le…

    …wismc)
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1407527 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 9, 2012
Commits on Nov 8, 2012
  1. NUTCH-1493 Error adding field 'contentLength'= during solrindex using…

    … index-more v2
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1407089 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 8, 2012
Commits on Nov 7, 2012
  1. NUTCH-1493 Error adding field 'contentLength'='' during solrindex usi…

    …ng index-more
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1406752 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Nov 7, 2012
Commits on Nov 6, 2012
  1. removed uncommitted issue

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1406079 13f79535-47bb-0310-9956-ffa450edef68
    Markus Jelsma committed Nov 6, 2012
  2. NUTCH-1491 Strip UTF-8 non-character codepoints in title

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1406076 13f79535-47bb-0310-9956-ffa450edef68
    Markus Jelsma committed Nov 6, 2012
Commits on Oct 23, 2012
  1. NUTCH-1341 NotModified time set to now but page not modified

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1401288 13f79535-47bb-0310-9956-ffa450edef68
    Markus Jelsma committed Oct 23, 2012
  2. NUTCH-1215 UpdateDB should not require segment as input

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1401225 13f79535-47bb-0310-9956-ffa450edef68
    Markus Jelsma committed Oct 23, 2012
Commits on Oct 11, 2012
  1. NUTCH-1383 IndexingFiltersChecker to show error message instead of nu…

    …ll pointer exception
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1397308 13f79535-47bb-0310-9956-ffa450edef68
    sebastian-nagel committed Oct 11, 2012
  2. NUTCH-1476 SegmentReader getStats should set parsed = -1 if no parsin…

    …g took place
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1397298 13f79535-47bb-0310-9956-ffa450edef68
    sebastian-nagel committed Oct 11, 2012
  3. NUTCH-1252 SegmentReader -get shows wrong data

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1397281 13f79535-47bb-0310-9956-ffa450edef68
    sebastian-nagel committed Oct 11, 2012
Commits on Oct 10, 2012
  1. NUTCH-706 (applied correct patch)

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1396817 13f79535-47bb-0310-9956-ffa450edef68
    sebastian-nagel committed Oct 10, 2012
  2. NUTCH-706 Url regex normalizer: pattern for session id removal not to…

    … match "newsId"
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1396796 13f79535-47bb-0310-9956-ffa450edef68
    sebastian-nagel committed Oct 10, 2012
Commits on Sep 18, 2012
  1. NUTCH-1441 AnchorIndexingFilter should use plain HashSet

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1387341 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Sep 18, 2012
Commits on Sep 15, 2012
  1. NUTCH-1470 Ensure test files are included for runtime testing

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1385197 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Sep 15, 2012
Commits on Aug 23, 2012
  1. NUTCH-1434 Indexer to delete robots noindex

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1376394 13f79535-47bb-0310-9956-ffa450edef68
    Markus Jelsma committed Aug 23, 2012
Commits on Jul 31, 2012
  1. NUTCH-1443 Solr schema version is invalid

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1367786 13f79535-47bb-0310-9956-ffa450edef68
    Markus Jelsma committed Jul 31, 2012
Commits on Jul 29, 2012
  1. NUTCH-1416 Remove o.a.n.metadata.Office

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1366847 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Jul 29, 2012
  2. NUTCH-1376 Add description parameter to every ant task

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1366843 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Jul 29, 2012
  3. NUTCH-1376 Add description parameter to every ant task

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1366836 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Jul 29, 2012
Commits on Jul 27, 2012
  1. NUTCH-1440 reconfigure non-existent stopwords_en.txt in schema-solr4.xml

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1366342 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Jul 27, 2012
Commits on Jul 26, 2012
  1. NUTCH-1439 Define boost field as type float in schema-solr4.xml

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1366159 13f79535-47bb-0310-9956-ffa450edef68
    Lewis John McGibbney committed Jul 26, 2012
Commits on Jul 20, 2012
  1. NUTCH-1433 Upgrade to Tika 1.2 (take 2)

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1363842 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Jul 20, 2012
  2. NUTCH-1433 Upgrade to Tika 1.2

    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1363794 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Jul 20, 2012