Skip to content

Slow fetching - #396

@MyraBaba

Description

@MyraBaba

Hi ,

This is obviously a configuration issue. But I couldnt find elsewhere to write:

I couldnt get the full throttle of the storm crawler. I have plenty bandwidth
image
and resources.

I seed 400 urls (which is only 80 of them taken inside the ES I dont know why) . and :

etcher.server.delay: 0.2
fetcher.server.min.delay: 0.0
fetcher.queue.mode: "byHost"
fetcher.threads.per.queue: 2
fetcher.threads.number: 200
fetcher.max.urls.in.queues: -1

depth is 3 also.

When I look i didnt see much bandwidth usage. What else the other option to get %100 speed and the power of the storm crawler ? testing local now and more than enough resources.

Is there any config that I missed ?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions