Commits on May 12, 2010
  1. Nutch a tlp, moving svn

    git-svn-id: https://svn.apache.org/repos/asf/nutch/tags/release-0.8@943363 13f79535-47bb-0310-9956-ffa450edef68
    gmcdonald committed May 12, 2010
Commits on Jul 25, 2006
  1. Nutch 0.8 release.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/tags/release-0.8@425328 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 25, 2006
  2. preparing 0.8 release

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@425324 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 25, 2006
  3. preparing 0.8 release

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@425321 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 25, 2006
Commits on Jul 24, 2006
  1. Even if a filter doesn't make any adjustments, each one should still return

    the input value, which other filters may have modified.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@425087 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 24, 2006
  2. Expire all finished addresses. When sites request long crawl delays

    this quickly ties down all threads, and lock expiration happens
    rarely and proceeds too slowly to remove all expired entries.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@425071 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 24, 2006
  3. Fix an NPE.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@425042 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 24, 2006
  4. Set job names (NUTCH-329).

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@424965 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 24, 2006
Commits on Jul 23, 2006
  1. NUTCH-328 update commons-cli-2.0-SNAPSHOT.jar

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@424784 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 23, 2006
  2. NUTCH-327 fix log path under cygwin

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@424779 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 23, 2006
Commits on Jul 20, 2006
  1. Add a required alias definition for parse-oo plugin.

    Problem debugged and fix provided by Matthew Holt.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@423859 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 20, 2006
  2. Set http.agent.name and related properties to empty values. This forces

    people to put some sensible values there, and protects the Nutch project
    from being blamed for someone else's misbehavior.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@423670 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 20, 2006
Commits on Jul 19, 2006
  1. Fix a deficiency in the scoring API (NUTCH-321).

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@423643 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 19, 2006
  2. Add a copy constructor to MapWritable, and use it in CrawlDatum.set

    to ensure a deep copy of metaData (NUTCH-323).
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@423641 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 19, 2006
  3. Add support for Crawl-delay in robots.txt (NUTCH-293).

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@423630 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jul 19, 2006
Commits on Jul 18, 2006
  1. fixed stylesheet processing instruction

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@422986 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 18, 2006
Commits on Jul 17, 2006
  1. show label also when clustering is enabled

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@422758 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 17, 2006
  2. use existing html renderer

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@422754 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 17, 2006
  3. NUTCH-320 URLs are now output to stdout

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@422641 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 17, 2006
Commits on Jul 12, 2006
  1. Patch a bug introduced by Hadoop 0.4.0, which requires specified input

    directories to exist.
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@421185 13f79535-47bb-0310-9956-ffa450edef68
    cutting committed Jul 12, 2006
Commits on Jul 11, 2006
  1. tab->space

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@420917 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 11, 2006
  2. added some of the missing changes

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@420902 13f79535-47bb-0310-9956-ffa450edef68
    siren committed Jul 11, 2006
  3. - Typo fix

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@420682 13f79535-47bb-0310-9956-ffa450edef68
    Otis Gospodnetic committed Jul 11, 2006
Commits on Jul 6, 2006
  1. NUTCH-317 : Add some javadoc about queryLang argument of Query.parse

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@419616 13f79535-47bb-0310-9956-ffa450edef68
    Jerome Charron committed Jul 6, 2006
Commits on Jun 29, 2006
  1. - Typo fix

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@417928 13f79535-47bb-0310-9956-ffa450edef68
    Otis Gospodnetic committed Jun 29, 2006
Commits on Jun 28, 2006
  1. NUTCH-312. Upgrade to Hadoop 0.4.0.

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@417884 13f79535-47bb-0310-9956-ffa450edef68
    cutting committed Jun 28, 2006
Commits on Jun 27, 2006
  1. - Fixed a ParseUtil method call and removed unused line of code

    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@417560 13f79535-47bb-0310-9956-ffa450edef68
    Otis Gospodnetic committed Jun 27, 2006
Commits on Jun 26, 2006
  1. Add an optional mechanism to time limit long-running queries. This helps to

    protect search servers from adverse effects of certain resource-intensive
    queries.
    
    Development of this functionality was supported by Krugle.net. Thank you!
    
    
    
    git-svn-id: https://svn.apache.org/repos/asf/lucene/nutch/trunk@417285 13f79535-47bb-0310-9956-ffa450edef68
    sigram committed Jun 26, 2006