-
Couldn't load subscription status.
- Fork 509
docs: add example resume a crawl #1143
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Copilot reviewed 2 out of 3 changed files in this pull request and generated no comments.
Files not reviewed (1)
- docs/examples/resume_crawl.mdx: Language not supported
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I believe we can demonstrate this better:
- Do not use
enqueue_links, instead start with a predefined set of URLs. - Remove unnecessary calls and comments, like the
export_data_json, to reduce the noise. - During the first crawl, stop the run manually using
CTRL+C. - Execute a second run, let it finish, and compare the statistics (logs) of both runs.
Co-authored-by: Vlada Dusek <v.dusek96@gmail.com>
Co-authored-by: Jan Buchar <Teyras@gmail.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Description
Issues