Commits on May 12, 2010
  1. Nutch a tlp, moving svn

    git-svn-id: https://svn.apache.org/repos/asf/nutch/tags/1.1-rc1@943363 13f79535-47bb-0310-9956-ffa450edef68
    gmcdonald committed May 12, 2010
Commits on Apr 19, 2010
  1. - rename initial tag to include rc1 label, since I will cut a new RC for Nutch 1.1 that includes NUTCH-812
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/tags/1.1-rc1@935454 13f79535-47bb-0310-9956-ffa450edef68
    chrismattmann committed Apr 19, 2010
Commits on Apr 7, 2010
  1. Nutch 1.1 release.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/tags/1.1@931431 13f79535-47bb-0310-9956-ffa450edef68
    chrismattmann committed Apr 7, 2010
  2. Release 1.1: step 3/4 from http://bit.ly/d5ugid

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@931421 13f79535-47bb-0310-9956-ffa450edef68
    chrismattmann committed Apr 7, 2010
  3. - prep for 1.1 release

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@931420 13f79535-47bb-0310-9956-ffa450edef68
    chrismattmann committed Apr 7, 2010
  4. - prep for 1.1 release

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@931419 13f79535-47bb-0310-9956-ffa450edef68
    chrismattmann committed Apr 7, 2010
Commits on Apr 6, 2010
  1. NUTCH-810 Upgraded to Tika 0.7

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@931098 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Apr 6, 2010
Commits on Mar 30, 2010
  1. NUTCH 785 : Fetcher : copy metadata from origin URL when redirecting + call scfilters.initialScore on newly created URL
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@929039 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Mar 30, 2010
Commits on Mar 29, 2010
  1. NUTCH-784 : CrawlDBScanner

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@928746 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Mar 29, 2010
Commits on Mar 22, 2010
  1. fixed NPE introduced in NUTCH-762

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@926163 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Mar 22, 2010
  2. NUTCH-762 : Generator can generate several segments in one parse of the crawlDB
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@926155 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Mar 22, 2010
  3. NUTCH-740 Configuration option to override default language for fetched pages
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@926003 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Mar 22, 2010
Commits on Mar 19, 2010
  1. NUTCH-803 Upgrade to Hadoop 0.20.2.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@925186 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Mar 19, 2010
  2. NUTCH-787 Upgrade to Lucene 3.0.1.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@925179 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Mar 19, 2010
Commits on Mar 18, 2010
  1. NUTCH-796 Zero results problems difficult to troubleshoot due to lack of logging.
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@924945 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Mar 18, 2010
Commits on Mar 11, 2010
  1. NUTCH-801 Remove RTF and MP3 parse plugins

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@921840 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Mar 11, 2010
  2. NUTCH-798 : Upgrade to SOLR1.4 and its dependencies

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@921831 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Mar 11, 2010
Commits on Mar 5, 2010
Commits on Mar 1, 2010
  1. NUTCH-782: Ability to order htmlparsefilters

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@917557 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Mar 1, 2010
Commits on Feb 19, 2010
  1. NUTCH-719 fetchQueues.totalSize incorrect in Fetcher

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@911905 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Feb 19, 2010
Commits on Feb 16, 2010
  1. NUTCH-794 : Language Identification must use check the parse metadata for language values
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@910454 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Feb 16, 2010
Commits on Feb 15, 2010
  1. NUTCH-766: small improvement to Tika parser : prioritise default Tika parser when discovering plugins matching mime-type
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@910187 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Feb 15, 2010
  2. NUTCH-793 search.jsp compile errors

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@910173 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Feb 15, 2010
Commits on Feb 14, 2010
  1. NUTCH-792 update version

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@910044 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Feb 14, 2010
  2. NUTCH-790 Some external javadoc links are broken

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@910041 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Feb 14, 2010
Commits on Feb 12, 2010
  1. - 2nd part of NUTCH-766 Tika parser

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@909269 13f79535-47bb-0310-9956-ffa450edef68
    chrismattmann committed Feb 12, 2010
  2. - fix for NUTCH-766 Tika parser

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@909268 13f79535-47bb-0310-9956-ffa450edef68
    chrismattmann committed Feb 12, 2010
Commits on Feb 5, 2010
  1. NUTCH-786

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@906907 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Feb 5, 2010
Commits on Feb 2, 2010
  1. NUTCH-781 : updated tika-mimetypes.xml

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@905550 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Feb 2, 2010
Commits on Feb 1, 2010
  1. NUTCH-775 Enhance Searcher interface

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@905410 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Feb 1, 2010
  2. NUTCH-781: upgrade tika to version 0.6

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@905229 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Feb 1, 2010
  3. NUTCH-781: upgrade tika to version 0.6

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@905228 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Feb 1, 2010
Commits on Jan 11, 2010
  1. fix for NUTCH-767 : reverted original expected values for test + treat text/plain as a default mime-type from Tika
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@897825 13f79535-47bb-0310-9956-ffa450edef68
    jnioche committed Jan 11, 2010
Commits on Jan 8, 2010