When the harvester hits a hard failure and crashes, it takes ~2.5-3 minutes to build a new machine and get the fetch process started. This used to take ~1 second, so jobs with many failures can tie up the harvester for hours.
How to reproduce
Expected behavior
Processes the ~2K datasets in less than 15 minutes
Actual behavior
Still running after 24 hours
Sketch
We could take the "mitigation" approach: run the harvest job as a sub-process that gets restarted on failure (using supervisor, as we do currently), or just run a bunch of fetch processes.
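For illustration, a minimal supervisor program along those lines; the `paster ... harvester fetch_consumer` command, paths, and process count are assumptions, not the actual deployment config:

```ini
; Sketch only: respawn the fetch process within seconds of a crash
; instead of rebuilding the machine (~2.5-3 minutes).
[program:harvester-fetch]
command=paster --plugin=ckanext-harvest harvester fetch_consumer --config=/etc/ckan/production.ini
; always restart when the process dies
autorestart=true
; a process that stays up 5 seconds counts as successfully started
startsecs=5
; run several fetch processes so one crash doesn't stall the queue
numprocs=4
process_name=%(program_name)s_%(process_num)02d
```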
The long-term fix is to make the harvests more robust: report failures rather than hard-failing. This will require much better error handling than is currently implemented, across both upstream code and GSA code.
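A rough sketch of that shape of error handling (the `fetch_dataset` callable is hypothetical, not the harvester's actual API): catch per-dataset exceptions, record them for the job report, and keep going.

```python
# Sketch only: isolate failures to the dataset that caused them,
# so one bad dataset no longer hard-fails the whole harvest job.
def run_fetch(datasets, fetch_dataset):
    errors = []
    for dataset in datasets:
        try:
            fetch_dataset(dataset)  # hypothetical per-dataset fetch
        except Exception as err:
            # report the failure instead of crashing the process
            errors.append((dataset, err))
    return errors
```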