🐛 Bug Report: Timeout error while processing large parent location registered by Github Catalog Discovery Plugin #24369
Labels: area:catalog, bug, help wanted
📜 Description
We have a repository that contains over 3000 catalog-info.yaml files. Here is the directory structure:
repo
The GitHub Auto Discovery plugin has registered a single Location, which looks something like this: https://github.com/vikyathharekal/book-my-tickets/tree/main/services/*/catalog-info.yaml
This led to 3000 Components showing up in the Catalog. Since then, however, refreshes have been failing intermittently for most entities. The error is "GitHub API time out" (see screenshot). The logs show that the GitHub Contents API fails for only one or two of the 3000 entities on each refresh loop.
Options to mitigate this -
👍 Expected behavior
👎 Actual Behavior with Screenshots
The GitHub Contents API fails for only one or two of the 3000 entities on each refresh loop, yet that is enough to mark the parent location as failed.
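To make the failure mode concrete, here is a minimal sketch contrasting all-or-nothing aggregation with per-file error collection. It is not the plugin's actual code: `ReadFile`, `refreshAllOrNothing`, `refreshTolerant`, and the simulated file paths are all hypothetical names made up for illustration, and the "GitHub API time out" is faked with a thrown error rather than a real API call.

```typescript
// Hypothetical stand-in for reading one catalog-info.yaml.
// This is NOT a Backstage or GitHub API; it only simulates a read.
type ReadFile = (path: string) => string;

interface RefreshResult {
  entities: string[];                          // contents that were read successfully
  errors: { path: string; message: string }[]; // per-file failures
}

// All-or-nothing: the first failed read aborts the whole location.
function refreshAllOrNothing(paths: string[], read: ReadFile): string[] {
  return paths.map(p => read(p)); // any throw propagates to the caller
}

// Tolerant: record per-file failures and keep the files that succeeded.
function refreshTolerant(paths: string[], read: ReadFile): RefreshResult {
  const result: RefreshResult = { entities: [], errors: [] };
  for (const path of paths) {
    try {
      result.entities.push(read(path));
    } catch (e) {
      const message = e instanceof Error ? e.message : String(e);
      result.errors.push({ path, message });
    }
  }
  return result;
}

// Simulate 3000 files under one wildcard location, two of which time out.
const paths = Array.from(
  { length: 3000 },
  (_, i) => `services/svc-${i}/catalog-info.yaml`,
);
const flaky = new Set([paths[17], paths[2042]]);
const fakeRead: ReadFile = path => {
  if (flaky.has(path)) throw new Error('GitHub API time out');
  return `name: ${path.split('/')[1]}`;
};

const tolerant = refreshTolerant(paths, fakeRead);
console.log(tolerant.entities.length, tolerant.errors.length); // prints "2998 2"
```

With the same two simulated timeouts, `refreshAllOrNothing(paths, fakeRead)` throws and yields nothing, which mirrors the behavior reported here: a couple of failed reads cause the entire parent location to be treated as failed.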
👟 Reproduction steps
Test repository: https://github.com/vikyathharekal/book-my-tickets/tree/main/services
GitHub Catalog Discovery Plugin configuration:
📃 Provide the context for the Bug.
No response
🖥️ Your Environment
No response
👀 Have you spent some time to check if this bug has been raised before?
🏢 Have you read the Code of Conduct?
Are you willing to submit PR?
Yes I am willing to submit a PR!