Commits on May 12, 2010
  1. Nutch a tlp, moving svn

    gmcdonald committed May 12, 2010
    git-svn-id: https://svn.apache.org/repos/asf/nutch/tags/release-1.0-rc0@943363 13f79535-47bb-0310-9956-ffa450edef68
Commits on Mar 8, 2009
  1. Nutch 1.0 rc0

    siren committed Mar 8, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/tags/release-1.0-rc0@751480 13f79535-47bb-0310-9956-ffa450edef68
  2. the version is indeed 1.0

    siren committed Mar 8, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@751475 13f79535-47bb-0310-9956-ffa450edef68
  3. preparing for release

    siren committed Mar 8, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@751471 13f79535-47bb-0310-9956-ffa450edef68
Commits on Mar 4, 2009
  1. NUTCH-711 - Indexer failing after upgrade to Hadoop 0.19.1. This is a temporary fix, to be revisited later.

    sigram committed Mar 4, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@750037 13f79535-47bb-0310-9956-ffa450edef68
Commits on Mar 2, 2009
  1. NUTCH-669 - Consolidate code for Fetcher and Fetcher2

    siren committed Mar 2, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@749289 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-700 - revert to nekohtml-0.9.4

    siren committed Mar 2, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@749256 13f79535-47bb-0310-9956-ffa450edef68
  3. Commit changes to CHANGES.

    sigram committed Mar 2, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@749249 13f79535-47bb-0310-9956-ffa450edef68
  4. NUTCH-419 Unavailable robots.txt kills fetch.

    sigram committed Mar 2, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@749247 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 27, 2009
  1. NUTCH-703 Upgrade to Hadoop 0.19.1.

    sigram committed Feb 27, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@748637 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-699 - Add an "official" solr schema for solr integration. Contributed by dogacan, Dmitry Lihachev

    siren committed Feb 27, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@748408 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 24, 2009
  1. NUTCH-698 - CrawlDb is corrupted after a few crawl cycles, contributed by dogacan

    siren committed Feb 24, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@747324 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-626 - Fetcher2 breaks out the domain with db.ignore.external.links set at cross domain redirects, contributed by Remco Verhoef, dogacan

    siren committed Feb 24, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@747312 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 23, 2009
  1. NUTCH-694 - Distributed Search Server fails

    siren committed Feb 23, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@746900 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 19, 2009
  1. NUTCH-695 - incorrect mime type detection by MoreIndexingFilter plugin, contributed by Dmitry Lihachev

    siren committed Feb 19, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@745808 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 18, 2009
  1. remove web2 as agreed on nutch-dev

    siren committed Feb 18, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@745517 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-563 Include custom fields in BasicQueryFilter, contributed by Julien Nioche

    siren committed Feb 18, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@745503 13f79535-47bb-0310-9956-ffa450edef68
  3. NUTCH-691 - Update jakarta poi jars to the most relevant version, contributed by Dmitry Lihachev

    siren committed Feb 18, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@745499 13f79535-47bb-0310-9956-ffa450edef68
  4. NUTCH-687 add RAT, also check plugins

    siren committed Feb 18, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@745448 13f79535-47bb-0310-9956-ffa450edef68
  5. NUTCH-688 add missing headers, part 2 rest

    siren committed Feb 18, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@745446 13f79535-47bb-0310-9956-ffa450edef68
  6. NUTCH-688 add missing headers, part 1 core

    siren committed Feb 18, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@745441 13f79535-47bb-0310-9956-ffa450edef68
  7. NUTCH-687 add RAT

    siren committed Feb 18, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@745416 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 17, 2009
  1. fix NUTCH-631 - thanks to Stefan Will

    siren committed Feb 17, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@745096 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 11, 2009
  1. fix link and name

    siren committed Feb 11, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@743573 13f79535-47bb-0310-9956-ffa450edef68
  2. add apachecon promo

    siren committed Feb 11, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@743464 13f79535-47bb-0310-9956-ffa450edef68
  3. NUTCH-683 - NUTCH-676 broke CrawlDbMerger

    Tacettin Guney committed Feb 11, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@743277 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 6, 2009
  1. NUTCH-643 ClassCastException in PDF parser, upgrade to unofficial PDFBox 0.7.4

    sigram committed Feb 6, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@741558 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 3, 2009
  1. NUTCH-671 - JSP errors in Nutch searcher webapp.

    sigram committed Feb 3, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@740324 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-279 Additions to urlnormalizer-regex (modified).

    sigram committed Feb 3, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@740318 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jan 29, 2009
  1. NUTCH-682 - SOLR indexer does not set boost on the document. Patch by Julien Nioche

    Tacettin Guney committed Jan 29, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@738970 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jan 28, 2009
  1. NUTCH-571 - parse-mp3 plugin doesn't always index album of mp3. Patch by Joseph Chen.

    Tacettin Guney committed Jan 28, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@738455 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jan 27, 2009
  1. NUTCH-628 - DomainStatistics tool

    Tacettin Guney committed Jan 27, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@738175 13f79535-47bb-0310-9956-ffa450edef68
  2. NUTCH-680 - Remove pmd-ext jars for now

    Tacettin Guney committed Jan 27, 2009
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@738049 13f79535-47bb-0310-9956-ffa450edef68