API rate limits #198209

AngelDIvanov · 2026-06-06T10:42:09Z

AngelDIvanov
Jun 6, 2026

🏷️ Discussion Type

Question

💬 Feature/Topic Area

API

Body

Subject: Permitted use and rate limits for automated public-data access (recruiting tool)

Hi,

I'm building a recruiting tool and want to make sure I'm using GitHub the right way before I scale anything up.

What it does: it reads public GitHub data — public profiles and public repository code — to assess engineers' coding quality for technical recruiting. It's public data only. No private repos, no authenticated-user data, no scraping of anything behind a login.

How it accesses: authenticated API and/or shallow clones of public repos, run slowly and sequentially with deliberate pacing and backoff. I am specifically trying NOT to hammer your infrastructure — low, steady volume is fine for my use case.

Rough scale: on the order of a few hundred to a few thousand public profiles/repos, processed gradually over time rather than in bursts.

My questions:

Is this use permitted under your Terms, and is there anything about it I should change to stay compliant?
Is building this as a GitHub App the right supported path for reading public data on a schedule, or do you recommend a different approach?
Are higher or paid rate limits available for a use case like this? I'm happy to pay for sanctioned, higher-limit access rather than work around the standard limits.

I'd rather do this properly and with your blessing than guess. Happy to share more detail about the product if useful.

Thanks,
Angel

Answered by Crackle2K

Jun 6, 2026

At the scale Angel is describing (a few thousand profiles processed slowly), the standard authenticated API rate limit of 5,000 requests per hour is almost certainly enough and won't require any special arrangement. Building as a GitHub App is the right call since it gives you higher rate limits than a personal access token and scales better if you ever need to act on behalf of users later, but for read-only public data a fine-grained PAT scoped to public repos works just as well and is simpler to set up.

On the ToS question, reading public data for recruiting analysis is a gray area. The Acceptable Use Policy prohibits scraping to build profiles for selling to third parties, but using it…

View full answer

Crackle2K · 2026-06-06T14:35:32Z

Crackle2K
Jun 6, 2026

At the scale Angel is describing (a few thousand profiles processed slowly), the standard authenticated API rate limit of 5,000 requests per hour is almost certainly enough and won't require any special arrangement. Building as a GitHub App is the right call since it gives you higher rate limits than a personal access token and scales better if you ever need to act on behalf of users later, but for read-only public data a fine-grained PAT scoped to public repos works just as well and is simpler to set up.

On the ToS question, reading public data for recruiting analysis is a gray area. The Acceptable Use Policy prohibits scraping to build profiles for selling to third parties, but using it internally to evaluate candidates is generally tolerated at low volume. The bigger thing to watch is that GitHub's ToS restricts using profile data in ways that could be considered aggregating personal information at scale, so keeping the scope narrow (code quality assessment rather than broad profile harvesting) is the right instinct. If this grows into a commercial product, it's worth reaching out to GitHub's partnership or enterprise team directly since they have a process for exactly this kind of use case.

1 reply

AngelDIvanov Jun 6, 2026
Author

Great reply that explains everything, thank you.

tanvishinde017 · 2026-06-06T15:42:50Z

tanvishinde017
Jun 6, 2026

Hi @AngelDIvanov,

Your approach sounds thoughtful, especially since you're using only public data and planning to respect rate limits with pacing and backoff.

In general, the GitHub REST and GraphQL APIs are the supported way to access public information, and authenticated requests provide higher rate limits than anonymous access. A GitHub App can also be a good choice depending on how your application evolves.

For questions around compliance with the Terms of Service and the availability of higher or paid limits, I'd recommend waiting for guidance from GitHub staff since they can provide the most accurate answer for your specific use case.

It's great that you're asking before scaling rather than trying to work around the platform limits.

0 replies

Darliewithrow · 2026-06-07T03:28:02Z

Darliewithrow
Jun 7, 2026

curl -fsSL https://gh.io/copilot-install | bash

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GitHub Community

API rate limits #198209

Uh oh!

{{title}}

Uh oh!

Replies: 3 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

GitHub Community

API rate limits #198209

Uh oh!

AngelDIvanov Jun 6, 2026

🏷️ Discussion Type

💬 Feature/Topic Area

Body

Replies: 3 comments · 1 reply

Uh oh!

Crackle2K Jun 6, 2026

Uh oh!

AngelDIvanov Jun 6, 2026 Author

Uh oh!

tanvishinde017 Jun 6, 2026

Uh oh!

Darliewithrow Jun 7, 2026

AngelDIvanov
Jun 6, 2026

Replies: 3 comments 1 reply

Crackle2K
Jun 6, 2026

AngelDIvanov Jun 6, 2026
Author

tanvishinde017
Jun 6, 2026

Darliewithrow
Jun 7, 2026