Cache parsed repositories to avoid github rate limiting #13

hasheddan · 2020-02-21T15:52:38Z

Currently, we hit Github API rate limiting extremely quickly for our queries to find all CRD types in a repository (search), and relatively quickly for parsing each of those CRDs (get file contents). These results should be cached such that we do not have to hit Github every time they are requested.

A first step could be authenticating requests such that the limit for fetching CRD file content will rise from 60 to 5000 per hour, and 10 to 30 per minute for finding CRDs in a repository.

The next step will be to cache results and crawl repos on a periodic basis based on incoming requests for content.

This was referenced Feb 23, 2020

Remove API reference docs in favor of doc.crds.dev crossplane/crossplane#1295

Closed

Add github basic auth #14

Merged

Detect all CRDs under a path #9

Closed

hasheddan mentioned this issue Apr 6, 2020

Index repositories by cloning and serve CRDs at each tag in repo #17

Merged

hasheddan closed this as completed in #17 Apr 6, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache parsed repositories to avoid github rate limiting #13

Cache parsed repositories to avoid github rate limiting #13

hasheddan commented Feb 21, 2020

Cache parsed repositories to avoid github rate limiting #13

Cache parsed repositories to avoid github rate limiting #13

Comments

hasheddan commented Feb 21, 2020