-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
merge lewismc:master with apache:master #2
Commits on Feb 16, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 2fae4cd - Browse repository at this point
Copy the full SHA 2fae4cdView commit details
Commits on Feb 18, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 5250d62 - Browse repository at this point
Copy the full SHA 5250d62View commit details -
Configuration menu - View commit details
-
Copy full SHA for 2724578 - Browse repository at this point
Copy the full SHA 2724578View commit details
Commits on Mar 15, 2021
-
NUTCH-2596 Upgrade from org.mortbay.jetty to org.eclipse.jetty
- remove Jetty (serving JSP pages) for HTTP protocol plugin tests - replace JSP pages by header/content strings hold in unit test classes
Configuration menu - View commit details
-
Copy full SHA for d193137 - Browse repository at this point
Copy the full SHA d193137View commit details
Commits on Mar 16, 2021
-
Merge pull request #574 from sebastian-nagel/NUTCH-2596-http-protocol…
…-plugin-test-remove-jsp NUTCH-2596 Remove org.mortbay.jetty from unit tests of HTTP protocol plugins
Configuration menu - View commit details
-
Copy full SHA for 81fb7bc - Browse repository at this point
Copy the full SHA 81fb7bcView commit details
Commits on Mar 21, 2021
-
NUTCH-2857 Upgrade from JDK1.8 --> JDK11 (#573)
* NUTCH-2857 Upgrade from JDK1.8 --> JDK11
Configuration menu - View commit details
-
Copy full SHA for b91fae5 - Browse repository at this point
Copy the full SHA b91fae5View commit details
Commits on Mar 27, 2021
-
NUTCH-2858 urlnormalizer-protocol: URL port is lost during normalization
- if URL includes a port the protocol is not normalized - add unit tests to verify correct behavior
Configuration menu - View commit details
-
Copy full SHA for c454a64 - Browse repository at this point
Copy the full SHA c454a64View commit details
Commits on Mar 29, 2021
-
NUTCH-2858 urlnormalizer-protocol: URL port is lost during normalization
- add note in config file that URLs including port are not left unchanged
Configuration menu - View commit details
-
Copy full SHA for d749920 - Browse repository at this point
Copy the full SHA d749920View commit details -
NUTCH-2859: urlnormalizer-protocol: allow to normalize domains
- host names starting with `*.` are matched as suffixes: `*.example.org` matches `example.org`, `www.example.org`, `www.subdomain.example.org`, etc. - allow to read config file protocols.txt from hdfs:// or any file system supported by Hadoop - add Javadoc package documentation - document configuration properties in nutch-default.xml - reduce memory footprint by deduplicating protocol strings so that same protocol values are references to same objects
Configuration menu - View commit details
-
Copy full SHA for 081c826 - Browse repository at this point
Copy the full SHA 081c826View commit details
Commits on Apr 1, 2021
-
NUTCH-2855 Update org.elasticsearch.client (#577)
* NUTCH-2855 Update org.elasticsearch.client
Configuration menu - View commit details
-
Copy full SHA for 2837039 - Browse repository at this point
Copy the full SHA 2837039View commit details
Commits on Apr 6, 2021
-
Merge pull request #576 from sebastian-nagel/NUTCH-2859-urlnormalizer…
…-protocol-domain-rules NUTCH-2859: urlnormalizer-protocol: allow to normalize domains
Configuration menu - View commit details
-
Copy full SHA for 6c02da0 - Browse repository at this point
Copy the full SHA 6c02da0View commit details
Commits on May 31, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 0d6eaa3 - Browse repository at this point
Copy the full SHA 0d6eaa3View commit details
Commits on Jun 1, 2021
-
Merge pull request #648 from sebastian-nagel/NUTCH-2866-metadata-tost…
…ring NUTCH-2866 Fix MetaData.toString() to return "key=value ..."
Configuration menu - View commit details
-
Copy full SHA for 18d2872 - Browse repository at this point
Copy the full SHA 18d2872View commit details
Commits on Jun 3, 2021
-
NUTCH-2864 Upgrade Dockerfile to use JDK 11 (#647)
* NUTCH-2864 Upgrade Dockerfile to use JDK 11
Configuration menu - View commit details
-
Copy full SHA for cc8d76a - Browse repository at this point
Copy the full SHA cc8d76aView commit details