Skip to content

Conversation

@Mantisus
Copy link
Collaborator

@Mantisus Mantisus commented Apr 8, 2025

Description

  • add example resume a crawl

Issues

@Mantisus Mantisus requested a review from Copilot April 8, 2025 01:17
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot reviewed 2 out of 3 changed files in this pull request and generated no comments.

Files not reviewed (1)
  • docs/examples/resume_crawl.mdx: Language not supported

@Mantisus Mantisus requested review from janbuchar and vdusek April 8, 2025 01:18
Copy link
Collaborator

@vdusek vdusek left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe we can demonstrate this better:

  • Do not use enqueue_links, instead start with a predefined set of URLs.
  • Remove unnecessary calls and comments, like the export_data_json, to reduce the noise.
  • During the first crawl, stop the run manually using CTRL+C.
  • Execute a second run, let it finish, and compare the statistics (logs) of both runs.

Mantisus and others added 2 commits April 8, 2025 16:01
@Mantisus Mantisus self-assigned this Apr 8, 2025
@Mantisus Mantisus requested a review from vdusek April 9, 2025 01:38
Mantisus and others added 2 commits April 9, 2025 13:51
Copy link
Collaborator

@vdusek vdusek left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@vdusek vdusek merged commit 40edf70 into apify:master Apr 10, 2025
23 checks passed
@vdusek vdusek added the t-tooling Issues with this label are in the ownership of the tooling team. label Apr 10, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

t-tooling Issues with this label are in the ownership of the tooling team.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Explain how to resume a crawl

3 participants