Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extend Europeana timeout #3450

Closed
stacimc opened this issue Dec 4, 2023 · 0 comments · Fixed by #3453
Closed

Extend Europeana timeout #3450

stacimc opened this issue Dec 4, 2023 · 0 comments · Fixed by #3453
Labels
💻 aspect: code Concerns the software code in the repository 🛠 goal: fix Bug fix 🟩 priority: low Low priority and doesn't need to be rushed 🧱 stack: catalog Related to the catalog and Airflow DAGs

Comments

@stacimc
Copy link
Contributor

stacimc commented Dec 4, 2023

Description

The Europeana DAG runs daily. Most days finish very quickly, but periodically we encounter a day that has a very large number of records and takes considerably longer. The current 3-day timeout is meant to accommodate this by allowing large ingestion days a significant amount of time to ingest, with the expectation that subsequent days will be very small so we can always catch back up to the present day.

Three days does not appear to be sufficient. We've had a few times where a long day has timed out, and we've needed to manually restart the timed out day from the point of failure. The most recent such example would have taken 6 days to complete. We should consider extending the timeout to 6 or 7 days.

Reproduction

Observe production logs from Europeana.

@stacimc stacimc added 🟩 priority: low Low priority and doesn't need to be rushed 🛠 goal: fix Bug fix 💻 aspect: code Concerns the software code in the repository 🧱 stack: catalog Related to the catalog and Airflow DAGs labels Dec 4, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
💻 aspect: code Concerns the software code in the repository 🛠 goal: fix Bug fix 🟩 priority: low Low priority and doesn't need to be rushed 🧱 stack: catalog Related to the catalog and Airflow DAGs
Projects
Archived in project
Development

Successfully merging a pull request may close this issue.

1 participant