Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
Commits on Sep 21, 2012
  1. revert gora-cassandra to v0.2

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.1@1388524 13f79535-47bb-0310-9956-ffa450edef68
Commits on Sep 18, 2012
  1. forward port of NUTCH-1415

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.1@1387363 13f79535-47bb-0310-9956-ffa450edef68
  2. prepare branch for tag

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.1@1387353 13f79535-47bb-0310-9956-ffa450edef68
  3. Nutch 2.1 branch

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.1@1387350 13f79535-47bb-0310-9956-ffa450edef68
  4. NUTCH-1432 property storage.schema does not work anymore, should be s…

    Lewis John McGibbney authored
    …torage.schema.webpage and storage.schema.host
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1387347 13f79535-47bb-0310-9956-ffa450edef68
  5. add keyspace reference to NullPointerException on inject before

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1387175 13f79535-47bb-0310-9956-ffa450edef68
  6. NUTCH-1162 test file

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1387173 13f79535-47bb-0310-9956-ffa450edef68
Commits on Sep 17, 2012
  1. NUTCH-1468 Redirects that are external links not adhering to db.ignor…

    Ferdy Galema authored
    …e.external.links
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1386526 13f79535-47bb-0310-9956-ffa450edef68
Commits on Sep 15, 2012
  1. NUTCH-1470 Ensure test files are included for runtime testing

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1385199 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1162 Write JUnit tests for parse-js

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1385103 13f79535-47bb-0310-9956-ffa450edef68
Commits on Sep 7, 2012
  1. NUTCH-1456 Updater not setting batchId in markers correctly. (Alexand…

    Ferdy Galema authored
    …er Kingson via ferdy)
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1382037 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1459 Remove dead code (phase2) from InjectorJob

    Ferdy Galema authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1381931 13f79535-47bb-0310-9956-ffa450edef68
Commits on Aug 31, 2012
  1. NUTCH-1431 Introduce link 'distance' and add configurable max distanc…

    Ferdy Galema authored
    …e in the generator
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1379488 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1448 Redirected urls should be handled more cleanly (more like …

    Ferdy Galema authored
    …an outlink url)
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1379438 13f79535-47bb-0310-9956-ffa450edef68
  3. NUTCH-1463 Elasticsearch indexer should wait and check response for l…

    Ferdy Galema authored
    …ast flush
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1379435 13f79535-47bb-0310-9956-ffa450edef68
  4. NUTCH-1462 Elasticsearch not indexing when type==null in NutchDocumen…

    Ferdy Galema authored
    …t metadata
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1379431 13f79535-47bb-0310-9956-ffa450edef68
Commits on Aug 30, 2012
  1. NUTCH-1395 Show batchId when skipping within ParserJob

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1379137 13f79535-47bb-0310-9956-ffa450edef68
Commits on Aug 14, 2012
  1. NUTCH-1365 Fix crawlId functionalilty by making using of new gora con…

    Ferdy Galema authored
    …figuration
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1372752 13f79535-47bb-0310-9956-ffa450edef68
Commits on Aug 13, 2012
  1. NUTCH-1442 indexingfilter.order is property is misread in code

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1372593 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1450 upgrade gora deps to 0.2.1

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1372527 13f79535-47bb-0310-9956-ffa450edef68
Commits on Aug 12, 2012
  1. Trivial commit to get nightly target to depend on Javadoc target

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1372092 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1161 Write JUnit test for microformats-reltag

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1372086 13f79535-47bb-0310-9956-ffa450edef68
Commits on Aug 10, 2012
  1. NUTCH-1160 Write JUnit test for index-basic

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1371708 13f79535-47bb-0310-9956-ffa450edef68
Commits on Aug 6, 2012
  1. NUTCH-1159 Write JUnit test for index-anchor

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1369847 13f79535-47bb-0310-9956-ffa450edef68
Commits on Aug 3, 2012
  1. NUTCH-1445 Add ElasticIndexerJob that indexes to elasticsearch (addPr…

    Ferdy Galema authored
    …opsToConfig)
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1369013 13f79535-47bb-0310-9956-ffa450edef68
Commits on Aug 1, 2012
  1. NUTCH-1445 Add ElasticIndexerJob that indexes to elasticsearch (addTo…

    Ferdy Galema authored
    …NutchScript)
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1368024 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1445 Add ElasticIndexerJob that indexes to elasticsearch

    Ferdy Galema authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1368016 13f79535-47bb-0310-9956-ffa450edef68
  3. NUTCH-1444 Indexing should not create temporary files (do not extend …

    Ferdy Galema authored
    …from FileOutputFormat)
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1368012 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jul 31, 2012
  1. NUTCH-1443 Solr schema version is invalid

    Markus Jelsma authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1367788 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jul 30, 2012
  1. NUTCH-1441 AnchorIndexingFilter should use plain HashSet

    Ferdy Galema authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1367064 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jul 29, 2012
  1. NUTCH-1416 Remove o.a.n.metadata.Office

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1366845 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-1376 Add description parameter to every ant task

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1366844 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jul 27, 2012
  1. NUTCH-1440 reconfigure non-existent stopwords_en.txt in schema-solr4.xml

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1366348 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jul 26, 2012
  1. copy over solr 4 schema.

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1366170 13f79535-47bb-0310-9956-ffa450edef68
  2. remove unnecessary doap.rdf

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/2.x@1365973 13f79535-47bb-0310-9956-ffa450edef68
Something went wrong with that request. Please try again.