This repository was archived by the owner on Jul 4, 2025. It is now read-only.

Description
Given that
Huggingface API rate limit: https://huggingface.co/docs/api-inference/en/rate-limits
User Tier Rate Limit:
- Unregistered Users: 1 request per hour
- Signed-up Users: 50 requests per hour
- PRO and Enterprise Users: 500 requests per hour
Issue encountered
e2e testing runs successfully and passes test cases, but if multiple PRs are executed within the same hour, some PRs will fail e2e testing due to the rate limit, and we have to wait an hour before rerunning for a pass.
Objective
larify whether this issue affects users. If it does, we need to discuss with the team about using the Huggingface API. If not, find a way to resolve this rate limit issue in CI.