Add pause, stop, resume #45

Closed
sjdirect opened this issue Dec 26, 2014 · 8 comments

Comments

@sjdirect
Owner

Add functionality that will allow a crawl to be continued from where it was stopped or paused.

@sjdirect
Owner Author

A few ideas...

var pausedCrawler = manager.Pause(crawler);
manager.Resume(pausedCrawler);
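
One way to read the idea above, as a minimal sketch only: keep the crawl's pending and visited URLs in a small object that can be serialized when pausing and rebuilt when resuming. `ResumableFrontier` and `CrawlSnapshot` are hypothetical names made up for illustration; they are not part of Abot, and the sketch assumes a .NET version that ships `System.Text.Json`.

```csharp
using System;
using System.Collections.Generic;
using System.Text.Json;

// Hypothetical snapshot type for illustration only; not an Abot class.
public sealed class CrawlSnapshot
{
    public List<string> Pending { get; set; } = new List<string>();
    public List<string> Visited { get; set; } = new List<string>();
}

// Hypothetical frontier that tracks what is left to crawl and what is done.
public sealed class ResumableFrontier
{
    private readonly Queue<string> _pending = new Queue<string>();
    private readonly HashSet<string> _visited = new HashSet<string>();

    // Queue a URL unless it was already crawled or is already waiting.
    public void Add(string url)
    {
        if (!_visited.Contains(url) && !_pending.Contains(url))
            _pending.Enqueue(url);
    }

    // Hand out the next URL and mark it visited; false when nothing is left.
    public bool TryGetNext(out string url)
    {
        if (_pending.Count == 0) { url = null; return false; }
        url = _pending.Dequeue();
        _visited.Add(url);
        return true;
    }

    // "Pause": capture the current state as a JSON string.
    public string Pause()
    {
        var snapshot = new CrawlSnapshot
        {
            Pending = new List<string>(_pending),
            Visited = new List<string>(_visited)
        };
        return JsonSerializer.Serialize(snapshot);
    }

    // "Resume": rebuild a frontier from a previously saved JSON string.
    public static ResumableFrontier Resume(string json)
    {
        var snapshot = JsonSerializer.Deserialize<CrawlSnapshot>(json);
        var frontier = new ResumableFrontier();
        foreach (var url in snapshot.Visited) frontier._visited.Add(url);
        foreach (var url in snapshot.Pending) frontier._pending.Enqueue(url);
        return frontier;
    }
}

public static class Program
{
    public static void Main()
    {
        var frontier = new ResumableFrontier();
        frontier.Add("http://example.com/a");
        frontier.Add("http://example.com/b");
        frontier.TryGetNext(out _);            // "crawls" /a

        string saved = frontier.Pause();       // could be written to disk here
        var resumed = ResumableFrontier.Resume(saved);

        resumed.Add("http://example.com/a");   // ignored: already visited
        resumed.TryGetNext(out var next);
        Console.WriteLine(next);               // prints "http://example.com/b"
    }
}
```

In this sketch a URL is marked visited as soon as it is handed out, so a page that was in flight at pause time would not be retried after resume; a real implementation would have to decide how to treat such pages.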

@sjdirect
Owner Author

Made all classes in the Abot project Serializable so that others may implement pause/resume.

@sjdirect
Owner Author

@sjdirect sjdirect reopened this Mar 16, 2015
@mng-au

mng-au commented Jul 29, 2015

Hi sjdirect, if you don't mind my asking: I remember you had a file-based URL and crawl repository in the Google repository before, or is my memory corrupted or fragmented? I hope I understand this issue correctly: as long as Abot can resume a previous unfinished task, will that satisfy this ticket? That is, stop/resume, with pause considered a stop. I am working on a simple MongoDB-based scheduler for Abot, which I will upload to GitHub later, though, if you don't mind, it is in F#. Thanks.

@sjdirect
Owner Author

Yes, there used to be a file-based crawl repo, but it was overly complex and underperformant. A lightweight version of it MAY be created if I decide that is the best way to pause the crawl.

@mng-au

mng-au commented Oct 17, 2015

Hi Steven, I added a new repository that uses Redis as the scheduler store (I thought about using MongoDB before, but Redis gives much better performance as it runs in memory). That should allow the crawler to start and stop without losing track of previous progress. Hope this will help. Cheers. https://github.com/mnta/Abot.Redis.Scheduler

@sjdirect
Owner Author

Thanks, I'll take a look at this when I cross that bridge.


@sjdirect
Owner Author

sjdirect commented Dec 1, 2015

You can use the Pause/Resume feature of AbotX. AbotX is built on top of Abot. Closing issue.

@sjdirect sjdirect closed this as completed Dec 1, 2015