Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP
Commits on Jul 1, 2015
  1. NUTCH-1980 Jexl expressions for CrawlDbReader

    Markus Jelsma authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1688569 13f79535-47bb-0310-9956-ffa450edef68
  2. @chrismattmann

    Updates to make tests pass related to NUTCH-2038: Naive Bayes classif…

    chrismattmann authored
    …ier based html Parse filter (for filtering outlinks) this closes #42.
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1688549 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 29, 2015
  1. @chrismattmann

    fix for NUTCH-2038: Naive Bayes classifier based html Parse filter (f…

    chrismattmann authored
    …or filtering outlinks) contributed by Asitang Mishra <asitang@gmail.com> this closes #39
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1688084 13f79535-47bb-0310-9956-ffa450edef68
Commits on Apr 23, 2015
  1. NUTCH-1994 Upgrade to Apache Tika 1.8

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1675723 13f79535-47bb-0310-9956-ffa450edef68
Commits on Apr 22, 2015
  1. @chrismattmann

    Fix for NUTCH-1973 Job Administration end point for the REST service …

    chrismattmann authored
    …contributed by Sujen Shah <sujen1412@gmail.com> this closes #16.
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1675243 13f79535-47bb-0310-9956-ffa450edef68
Commits on Apr 18, 2015
  1. @chrismattmann

    Fix for NUTCH-1989 Handling invalid URLs in CommonCrawlDataDumper con…

    chrismattmann authored
    …tributed by Giuseppe Totaro.
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1674536 13f79535-47bb-0310-9956-ffa450edef68
Commits on Apr 11, 2015
  1. @sebastian-nagel

    NUTCH-1981 Upgrade to icu4j 55.1

    sebastian-nagel authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1672939 13f79535-47bb-0310-9956-ffa450edef68
Commits on Mar 29, 2015
  1. @chrismattmann

    - NUTCH-1970 Pretty print JSON output in config resouce contributed b…

    chrismattmann authored
    …y Tyler Palsulich and mattmann
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1669855 13f79535-47bb-0310-9956-ffa450edef68
Commits on Mar 19, 2015
  1. @chrismattmann

    Fix for NUTCH-1966 Configuration endpoint for 1x REST API contributed…

    chrismattmann authored
    … by Sujen Shah <sujen1412@gmail.com> this closes #13.
    
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1667649 13f79535-47bb-0310-9956-ffa450edef68
Commits on Mar 4, 2015
  1. NUTCH-1949 Dump out the Nutch data into the Common Crawl format

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1664109 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 26, 2015
  1. NUTCH-1933 nutch-selenium plugin

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1662530 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 12, 2015
  1. NUTCH 1925 Upgrade Tika to version 1.7

    Markus Jelsma authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1659168 13f79535-47bb-0310-9956-ffa450edef68
Commits on Nov 11, 2014
  1. Revert bothed commit

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1638040 13f79535-47bb-0310-9956-ffa450edef68
  2. Correct formatting in build.xml Ant targets help

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1638038 13f79535-47bb-0310-9956-ffa450edef68
Commits on Oct 27, 2014
  1. NUTCH-1865 Enable use of SNAPSHOT's with Nutch Ivy dependency management

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1634633 13f79535-47bb-0310-9956-ffa450edef68
Commits on Oct 16, 2014
  1. @jnioche

    NUTCH-1876 upgraded to crawler-commons 0.5

    jnioche authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1632315 13f79535-47bb-0310-9956-ffa450edef68
Commits on Sep 21, 2014
  1. @sebastian-nagel

    add committer snagel

    sebastian-nagel authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1626581 13f79535-47bb-0310-9956-ffa450edef68
Commits on Sep 8, 2014
  1. @jnioche

    NUTCH-1837 Upgrade to Tika 1.6 (jnioche)

    jnioche authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1623562 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jul 15, 2014
  1. @jnioche

    NUTCH-1804 Move JUnit dependency to test scope

    jnioche authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1610624 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 30, 2014
  1. @jnioche
Commits on Apr 28, 2014
  1. @jnioche

    NUTCH-1759 Upgrade to Crawler Commons 0.4

    jnioche authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1590600 13f79535-47bb-0310-9956-ffa450edef68
Commits on Apr 17, 2014
  1. @jnioche

    updated list of committers in pom and mvn.template

    jnioche authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1588206 13f79535-47bb-0310-9956-ffa450edef68
Commits on Apr 15, 2014
  1. Update Developer Information for Trunk Branch

    Talat Uyarer authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1587598 13f79535-47bb-0310-9956-ffa450edef68
Commits on Apr 4, 2014
  1. @jnioche

    NUTCH-1745 Upgraded to ElasticSearch 1.1.0

    jnioche authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1584722 13f79535-47bb-0310-9956-ffa450edef68
Commits on Mar 29, 2014
  1. NUTCH-1737 Upgrade to recent JUnit 4.x

    Lewis John McGibbney authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1582928 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 21, 2014
  1. @jnioche

    NUTCH-1729 upgrade tika 1.5

    jnioche authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1570542 13f79535-47bb-0310-9956-ffa450edef68
Commits on Feb 9, 2014
  1. NUTCH-1721 Upgrade to Crawler commons 0.3

    Tejas Patil authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1566255 13f79535-47bb-0310-9956-ffa450edef68
Commits on Nov 18, 2013
  1. @jnioche
Commits on Jul 5, 2013
  1. NUTCH-1595 Upgrade to Tika 1.4

    Markus Jelsma authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1499960 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 20, 2013
  1. NUTCH-1527 ES dep in ivy missing

    Markus Jelsma authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1494893 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 10, 2013
  1. @jnioche

    NUTCH-1522 Upgrade to Tika 1.3

    jnioche authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1491420 13f79535-47bb-0310-9956-ffa450edef68
Commits on Jun 3, 2013
  1. NUTCH-1578 Upgrade to Hadoop 1.2.0

    Markus Jelsma authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1488879 13f79535-47bb-0310-9956-ffa450edef68
Commits on Apr 5, 2013
  1. NUTCH-1031 Delegate parsing of robots.txt to crawler-commons

    Tejas Patil authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1465159 13f79535-47bb-0310-9956-ffa450edef68
Commits on Mar 7, 2013
  1. @jnioche

    NUTCH-1047 Pluggable indexing backends

    jnioche authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1453776 13f79535-47bb-0310-9956-ffa450edef68
Commits on Dec 27, 2012
  1. NUTCH-1510 Upgrade to Hadoop 1.1.1

    Markus Jelsma authored
    git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1426181 13f79535-47bb-0310-9956-ffa450edef68
Something went wrong with that request. Please try again.