Skip to content

Commit

Permalink
NUTCH-1354 nutchgora support fetcher.queue.depth.multiplier property
Browse files Browse the repository at this point in the history
git-svn-id: https://svn.apache.org/repos/asf/nutch/branches/nutchgora@1334945 13f79535-47bb-0310-9956-ffa450edef68
  • Loading branch information
Ferdy Galema committed May 7, 2012
1 parent 0ef6fa7 commit b45c102
Show file tree
Hide file tree
Showing 3 changed files with 14 additions and 1 deletion.
2 changes: 2 additions & 0 deletions CHANGES.txt
@@ -1,6 +1,8 @@
Nutch Change Log

Release nutchgora - Current Development
* NUTCH-1354 nutchgora support fetcher.queue.depth.multiplier property (ferdy)

* NUTCH-1353 nutchgora DomainStatistics support crawlId, counter bug and reformatting (ferdy)

* NUTCH-1350 remove unused dependancy because of access restriction (ferdy)
Expand Down
10 changes: 10 additions & 0 deletions conf/nutch-default.xml
Expand Up @@ -686,6 +686,16 @@
</description>
</property>

<property>
<name>fetcher.queue.depth.multiplier</name>
<value>50</value>
<description>(EXPERT)The fetcher buffers the incoming URLs into queues based on the [host|domain|IP]
(see param fetcher.queue.mode). The depth of the queue is the number of threads times the value of this parameter.
A large value requires more memory but can improve the performance of the fetch when the order of the URLS in the fetch list
is not optimal.
</description>
</property>

<!-- indexingfilter plugin properties -->

<property>
Expand Down
3 changes: 2 additions & 1 deletion src/java/org/apache/nutch/fetcher/FetcherReducer.java
Expand Up @@ -775,7 +775,8 @@ public void run(Context context)
}
LOG.info("Fetcher: threads: " + threadCount);

feeder = new QueueFeeder(context, fetchQueues, threadCount * 50);
int maxFeedPerThread = conf.getInt("fetcher.queue.depth.multiplier", 50);
feeder = new QueueFeeder(context, fetchQueues, threadCount * maxFeedPerThread);
feeder.start();

for (int i = 0; i < threadCount; i++) { // spawn threads
Expand Down

0 comments on commit b45c102

Please sign in to comment.