Skip to content

Conversation

@ikreymer
Copy link
Member

… pending, un-uploaded size

  • use pending size to determine if quota reached
  • also request pause to be set before assuming paused state
  • also ensure data is actually committed before shutting down pods (in case of any edge cases)
  • clear paused flag in redis after crawler pods shutdown
  • add OpCrawlStats to avoid adding unnecessary profile_update to public API

this assumes changes in crawler to support: clearing size after WACZ upload, ensure upload happens if pod starts when crawl is paused

… pending, un-uploaded size

- use pending size to determine if quota reached
- also request pause to be set before assuming paused state
- also ensure data is actually committed before shutting down pods (in case of any edge cases)
- clear paused flag in redis after crawler pods shutdown

this assumes changes in crawler to support: clearing size after WACZ upload, ensure upload happens if pod starts when crawl is paused
@ikreymer ikreymer requested a review from tw4l November 25, 2025 05:55
Copy link
Member

@tw4l tw4l left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🚀 🙇

@tw4l tw4l merged commit 93d60ca into issue-2957-pause-crawl-on-quota-reached Nov 25, 2025
22 of 24 checks passed
@tw4l tw4l deleted the issue-2957-pause-crawl-on-quota-reached-with-pending-size branch November 25, 2025 16:11
tw4l added a commit that referenced this pull request Nov 25, 2025
#3013)

… pending, un-uploaded size

- use pending size to determine if quota reached
- also request pause to be set before assuming paused state
- also ensure data is actually committed before shutting down pods (in
case of any edge cases)
- clear paused flag in redis after crawler pods shutdown
- add OpCrawlStats to avoid adding unnecessary profile_update to public
API

this assumes changes in crawler to support: clearing size after WACZ
upload, ensure upload happens if pod starts when crawl is paused

---------

Co-authored-by: Tessa Walsh <tessa@bitarchivist.net>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants