-
-
Notifications
You must be signed in to change notification settings - Fork 60
Closed
Copy link
Labels
back endRequires back end dev workRequires back end dev workbugSomething isn't workingSomething isn't working
Description
Browsertrix Version
v1.19.1 (and previous versions since pausing was introduced)
What did you expect to happen? What happened instead?
I expect that if I cancel a paused crawl (or a previously paused crawl), that the WACZs uploaded during pausing will be deleted from s3 storage (which is happening correctly), and that the crawl object's files should be delete and fileSize and fileCount should both be reset to 0 (which is not currently happening).
When deleting that canceled crawl, there should then be no effect on the org's bytesStored and bytesStoredCrawls, because there are no files left to delete. Currently, however, the canceled crawl's fileSize is decremented from the org, which results in the org's bytesStored and bytesStoredCrawls to become inaccurate.
Reproduction instructions
- Spin up a fresh local deployment
- Run a crawl and pause it after a few pages have been successfully crawled
- Cancel the crawl while it's paused
- Verify that the WACZ uploaded during crawling is deleted from s3 storage, but that the crawl's
files,fileSizeandfileCountare not updated in the database, meaning they're now inaccurate - Delete the canceled crawl from the workflow's crawls list
- Verify the org's
bytesStoredandbytesStoredCrawlsare now negative
Screenshots / Video
Environment
No response
Additional details
No response
Metadata
Metadata
Assignees
Labels
back endRequires back end dev workRequires back end dev workbugSomething isn't workingSomething isn't working
Type
Projects
Status
Done!