NUTCH-1215 UpdateDB should not require segment as input

git-svn-id: https://svn.apache.org/repos/asf/nutch/trunk@1401225 13f79535-47bb-0310-9956-ffa450edef68
Markus Jelsma committed Oct 23, 2012
1 parent 1cd24ba commit cf2b782906d586c5b18410b7e091dd405454e7db
Showing with 3 additions and 2 deletions.
  1. +2 −0 CHANGES.txt
  2. +1 −2 src/java/org/apache/nutch/crawl/CrawlDb.java
@@ -2,6 +2,8 @@ Nutch Change Log
(trunk) Current Development:
+* NUTCH-1215 UpdateDB should not require segment as input (markus)
+
* NUTCH-1383 IndexingFiltersChecker to show error message instead of null pointer exception (snagel)
* NUTCH-1476 SegmentReader getStats should set parsed = -1 if no parsing took place (snagel)
@@ -124,7 +124,6 @@ public static JobConf createJob(Configuration config, Path crawlDb)
JobConf job = new NutchJob(config);
job.setJobName("crawldb " + crawlDb);
-
Path current = new Path(crawlDb, CURRENT_NAME);
if (FileSystem.get(job).exists(current)) {
FileInputFormat.addInputPath(job, current);
@@ -169,7 +168,7 @@ public static void main(String[] args) throws Exception {
}
public int run(String[] args) throws Exception {
- if (args.length < 2) {
+ if (args.length < 1) {
System.err.println("Usage: CrawlDb <crawldb> (-dir <segments> | <seg1> <seg2> ...) [-force] [-normalize] [-filter] [-noAdditions]");
System.err.println("\tcrawldb\tCrawlDb to update");
System.err.println("\t-dir segments\tparent directory containing all segments to update from");
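
The relaxed check above (`args.length < 1` instead of `< 2`) means a bare `<crawldb>` argument, optionally followed by flags, is now accepted even though the usage text in this hunk still lists segments. A minimal sketch of that argument handling follows, assuming a hypothetical standalone helper `UpdateDbArgs` that is not part of Nutch and only illustrates the idea; the flag names are taken from the usage string in the diff.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; Nutch's CrawlDb.run() is not shown in full here.
final class UpdateDbArgs {
    String crawlDb;                                   // first positional argument
    String segmentsDir;                               // value of -dir, or null if absent
    final List<String> segments = new ArrayList<>();  // explicit segment paths, may be empty
    boolean force;
    boolean normalize;
    boolean filter;
    boolean noAdditions;

    static UpdateDbArgs parse(String[] args) {
        // Same threshold as the patched check: only the crawldb path is mandatory.
        if (args.length < 1) {
            throw new IllegalArgumentException(
                "Usage: CrawlDb <crawldb> (-dir <segments> | <seg1> <seg2> ...) "
                + "[-force] [-normalize] [-filter] [-noAdditions]");
        }
        UpdateDbArgs parsed = new UpdateDbArgs();
        parsed.crawlDb = args[0];
        for (int i = 1; i < args.length; i++) {
            switch (args[i]) {
                case "-force":       parsed.force = true; break;
                case "-normalize":   parsed.normalize = true; break;
                case "-filter":      parsed.filter = true; break;
                case "-noAdditions": parsed.noAdditions = true; break;
                case "-dir":
                    if (++i >= args.length) {
                        throw new IllegalArgumentException("-dir requires a path");
                    }
                    parsed.segmentsDir = args[i];
                    break;
                default:
                    // Anything else is treated as a segment path.
                    parsed.segments.add(args[i]);
            }
        }
        return parsed;
    }

    public static void main(String[] args) {
        // No segment given: accepted under the relaxed check.
        UpdateDbArgs parsed = parse(new String[] {"crawl/crawldb", "-normalize", "-filter"});
        System.out.println(parsed.crawlDb + " segments=" + parsed.segments.size());
    }
}
```

With no segments supplied, the only input left is whatever `createJob` adds, which in the hunk above is the `current` directory of the CrawlDb when it exists.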
