PR: Use `GITHUB_TOKEN` to make authorized user requests against GitHub (CI) #22118

mrclary · 2024-05-22T22:51:11Z

Some unit tests intermittently fail due to 403 Client Error: rate limit exceeded for url: https://api.github.com/repos/spyder-ide/spyder/releases.

Rate Limit Failure

=================================== FAILURES ===================================
__________________________ test_updates_appenv[1.0.0] __________________________

qtbot = <pytestqt.qtbot.QtBot object at 0x7efba47508e0>
mocker = <pytest_mock.plugin.MockerFixture object at 0x7efba4750940>
version = '1.0.0'

    @pytest.mark.parametrize("version", ["1.0.0", "1000.0.0"])
    def test_updates_appenv(qtbot, mocker, version):
        """
        Test whether or not we offer updates for our installers according to the
        current Spyder version.
    
        Uses UpdateManagerWidget in order to also test QThread.
        """
        mocker.patch.object(update, "__version__", new=version)
        # Do not execute start_update after check_update completes.
        mocker.patch.object(
            UpdateManagerWidget, "start_update", new=lambda x: None
        )
        mocker.patch.object(workers, "__version__", new=version)
        mocker.patch.object(workers, "is_anaconda", return_value=True)
        mocker.patch.object(workers, "is_conda_based_app", return_value=True)
        mocker.patch.object(
            workers, "get_spyder_conda_channel",
            return_value=("conda-forge", "https://conda.anaconda.org/conda-forge")
        )
    
        um = UpdateManagerWidget(None)
        um.start_check_update()
        qtbot.waitUntil(um.update_thread.isFinished)
    
        update_available = um.update_worker.update_available
        if version.split('.')[0] == '1':
>           assert update_available
E           assert False

spyder/plugins/updatemanager/tests/test_update_manager.py:60: AssertionError
------------------------------ Captured log call -------------------------------
DEBUG    spyder.plugins.updatemanager.widgets.update:update.py:166 Checking for updates. startup = False.
INFO     spyder.plugins.updatemanager.workers:workers.py:153 Checking for updates from https://api.github.com/repos/spyder-ide/spyder/releases
WARNING  spyder.plugins.updatemanager.workers:workers.py:182 403 Client Error: rate limit exceeded for url: https://api.github.com/repos/spyder-ide/spyder/releases
Traceback (most recent call last):
  File "/home/runner/work/spyder/spyder/spyder/plugins/updatemanager/workers.py", line 156, in start
    page.raise_for_status()
  File "/home/runner/micromamba/envs/test/lib/python3.8/site-packages/requests/models.py", line 1024, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 403 Client Error: rate limit exceeded for url: https://api.github.com/repos/spyder-ide/spyder/releases
________________________ test_updates_appenv[1000.0.0] _________________________

qtbot = <pytestqt.qtbot.QtBot object at 0x7efba4750b50>
mocker = <pytest_mock.plugin.MockerFixture object at 0x7efba4750d30>
version = '1000.0.0'

    @pytest.mark.parametrize("version", ["1.0.0", "1000.0.0"])
    def test_updates_appenv(qtbot, mocker, version):
        """
        Test whether or not we offer updates for our installers according to the
        current Spyder version.
    
        Uses UpdateManagerWidget in order to also test QThread.
        """
        mocker.patch.object(update, "__version__", new=version)
        # Do not execute start_update after check_update completes.
        mocker.patch.object(
            UpdateManagerWidget, "start_update", new=lambda x: None
        )
        mocker.patch.object(workers, "__version__", new=version)
        mocker.patch.object(workers, "is_anaconda", return_value=True)
        mocker.patch.object(workers, "is_conda_based_app", return_value=True)
        mocker.patch.object(
            workers, "get_spyder_conda_channel",
            return_value=("conda-forge", "https://conda.anaconda.org/conda-forge")
        )
    
        um = UpdateManagerWidget(None)
        um.start_check_update()
        qtbot.waitUntil(um.update_thread.isFinished)
    
        update_available = um.update_worker.update_available
        if version.split('.')[0] == '1':
            assert update_available
        else:
            assert not update_available
>       assert len(um.update_worker.releases) > 1
E       TypeError: object of type 'NoneType' has no len()

spyder/plugins/updatemanager/tests/test_update_manager.py:63: TypeError
------------------------------ Captured log call -------------------------------
DEBUG    spyder.plugins.updatemanager.widgets.update:update.py:166 Checking for updates. startup = False.
INFO     spyder.plugins.updatemanager.workers:workers.py:153 Checking for updates from https://api.github.com/repos/spyder-ide/spyder/releases
WARNING  spyder.plugins.updatemanager.workers:workers.py:182 403 Client Error: rate limit exceeded for url: https://api.github.com/repos/spyder-ide/spyder/releases
Traceback (most recent call last):
  File "/home/runner/work/spyder/spyder/spyder/plugins/updatemanager/workers.py", line 156, in start
    page.raise_for_status()
  File "/home/runner/micromamba/envs/test/lib/python3.8/site-packages/requests/models.py", line 1024, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 403 Client Error: rate limit exceeded for url: https://api.github.com/repos/spyder-ide/spyder/releases

What is the root cause of this and how do we resolve it?

mrclary · 2024-05-23T16:50:18Z

I modified the test workflows for macOS, Linux, and Windows to report the rate limits. Following are excerpts from a failed Linux workflow:
16_Rate Limit Before Tests.txt
17_Run tests.txt
18_Rate Limit After Tests.txt

These show that before and after the tests are run, rate limits have not been exceeded. Nevertheless, the test fails due to an exceeded rate limit.

I conclude from this that the rate limit exceeded in the test is a "secondary" rate limit which is not reported in the rate limit queries.

mrclary · 2024-05-23T18:51:44Z

Reference: https://docs.github.com/en/rest/using-the-rest-api/rate-limits-for-the-rest-api?apiVersion=2022-11-28

Limits are shown in Warnings summary whether or not the test succeeds. Limits are also shown in a log message if the test fails. Update all test workflows Syntax error? Test getting rate limits during workflow

mrclary · 2024-05-24T01:21:52Z

So I've discovered a few things

There are no requests to github in our tests that are being counted against a rate limit except for those in test_update_manager.py. These amount to 12 requests: 2 for each test times 6 tests (1 macOS, 2 Windows, and 3 Linux).
The GITHUB_TOKEN was not passed to requests.get in the update manager worker, so the rate limit was counting against unauthenticated requests (limited to 60).
Passing the GITHUB_TOKEN to requests.get results in the expected 5000 limit and we see the counts incurred by the tests (see https://github.com/spyder-ide/spyder/actions/runs/9215667723/job/25354472458)
Unfortunately, both logging and warnings that report the rate limits from within the update manager call are not displayed in the CI logs. This is because these are displayed at the end of the test run which cut short when segfaults occur. Subsequent automated rerun of the test will not run the update manager test because it already succeeded.

But I think that the changes in this PR will fix the problem.

One thing I'm curious about is whether unauthenticated requests are counted against individual IP addresses or against the repository. If the repository, then we could have an issue if more Spyder instances in the wild are sending unauthenticated requests.

mrclary · 2024-05-24T01:27:38Z

Oh yep, "Unauthenticated requests are associated with the originating IP address". Okay, I think we're good.

mrclary · 2024-05-24T01:30:17Z

I think we were just hitting the 60 count limit by having multiple (~5) PRs running tests in close proximity.

ccordoba12

Thanks @mrclary for your help to solve this!

.github/workflows/test-linux.yml

spyder/plugins/updatemanager/tests/test_update_manager.py

spyder/plugins/updatemanager/workers.py

ccordoba12 · 2024-05-24T14:13:24Z

spyder/plugins/updatemanager/workers.py

        logger.info(f"Checking for updates from {url}")
        try:
-            page = requests.get(url)
+            page = requests.get(url, headers=headers)
+            _rate_limits(page)


Is this really necessary for production code? Or only for testing?

The motivation was for investigating the issue in our test suite. However, when our installers become mainstream, it may be prudent to have a debug log message in production in case users encounter a rate limit error. In production, headers={}, then requests.get(url) and requests.get(url, headers={}) are equivalent.

Clarification: currently, only macOS and Windows 5.x standalone installers check Github for updates. When 6.0 is released, we could (hopefully) see much more widespread adoption of our installers, resulting in increased unauthorized requests to Github.

spyder/plugins/updatemanager/workers.py

Co-authored-by: Carlos Cordoba <ccordoba12@gmail.com>

ccordoba12

Looks good to me now, thanks @mrclary!

mrclary force-pushed the rate-limit branch from 100ae99 to 92c850f Compare May 23, 2024 19:37

Display rate limit from requests to github url.

c623ced

Limits are shown in Warnings summary whether or not the test succeeds. Limits are also shown in a log message if the test fails. Update all test workflows Syntax error? Test getting rate limits during workflow

mrclary force-pushed the rate-limit branch from 42b8e6f to 95cc299 Compare May 24, 2024 00:39

mrclary added 2 commits May 23, 2024 18:18

Authorize requests to Github using GITHUB_TOKEN, if available.

150d15d

Remove rate limit steps

2a3e350

mrclary changed the title ~~Test Rate limit~~ PR: Use GITHUB_TOKEN to make authorized user requests against GitHub May 24, 2024

mrclary self-assigned this May 24, 2024

mrclary force-pushed the rate-limit branch from 95cc299 to 2a3e350 Compare May 24, 2024 01:29

mrclary marked this pull request as ready for review May 24, 2024 01:48

mrclary requested a review from ccordoba12 May 24, 2024 03:23

ccordoba12 added this to the v6.0beta2 milestone May 24, 2024

ccordoba12 added the type:Task label May 24, 2024

ccordoba12 reviewed May 24, 2024

View reviewed changes

Apply suggestions from code review.

0505208

Co-authored-by: Carlos Cordoba <ccordoba12@gmail.com>

ccordoba12 approved these changes May 24, 2024

View reviewed changes

ccordoba12 changed the title ~~PR: Use GITHUB_TOKEN to make authorized user requests against GitHub~~ PR: Use GITHUB_TOKEN to make authorized user requests against GitHub (CI) May 24, 2024

ccordoba12 merged commit 1475484 into spyder-ide:master May 24, 2024
14 checks passed

mrclary deleted the rate-limit branch May 24, 2024 20:48

mrclary mentioned this pull request May 31, 2024

PR: Try to decrease timeouts when running tests and check general tests robustness and speed (CI/Testing) #22077

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PR: Use `GITHUB_TOKEN` to make authorized user requests against GitHub (CI) #22118

PR: Use `GITHUB_TOKEN` to make authorized user requests against GitHub (CI) #22118

mrclary commented May 22, 2024 •

edited

Loading

mrclary commented May 23, 2024

mrclary commented May 23, 2024

mrclary commented May 24, 2024 •

edited

Loading

mrclary commented May 24, 2024

mrclary commented May 24, 2024

ccordoba12 left a comment

ccordoba12 May 24, 2024

mrclary May 24, 2024

mrclary May 24, 2024

ccordoba12 left a comment

PR: Use GITHUB_TOKEN to make authorized user requests against GitHub (CI) #22118

PR: Use GITHUB_TOKEN to make authorized user requests against GitHub (CI) #22118

Conversation

mrclary commented May 22, 2024 • edited Loading

mrclary commented May 23, 2024

mrclary commented May 23, 2024

mrclary commented May 24, 2024 • edited Loading

mrclary commented May 24, 2024

mrclary commented May 24, 2024

ccordoba12 left a comment

Choose a reason for hiding this comment

ccordoba12 May 24, 2024

Choose a reason for hiding this comment

mrclary May 24, 2024

Choose a reason for hiding this comment

mrclary May 24, 2024

Choose a reason for hiding this comment

ccordoba12 left a comment

Choose a reason for hiding this comment

PR: Use `GITHUB_TOKEN` to make authorized user requests against GitHub (CI) #22118

PR: Use `GITHUB_TOKEN` to make authorized user requests against GitHub (CI) #22118

mrclary commented May 22, 2024 •

edited

Loading

mrclary commented May 24, 2024 •

edited

Loading