Testing third-party stubs in isolated environments #5952

srittau · 2021-08-23T14:33:11Z

At the moment, whenever there is change to typeshed, we test all stdlib and third-party packages in the same testing environment, using the --custom-typeshed-dir so that all packages can see each other. There are several practical problems with that:

We can't test the requires fields for correctness; all packages are always available.
We can't have packages that depend on external packages, see for example Allow non-types dependencies #5768, Remove cryptography (depends on #5952) #5618, paramiko: Change dependencies on types-cryptography, allow using upstream type hints #5847, and the discussion in Allow non-types dependencies #5769 (all related to cryptography). (Except if we'd just install all non-types dependencies for all package into our venv.)
It doesn't scale if we increase the number of third-party packages in typeshed.
Ideas like per-package configuration is not possible at the moment, see for example mypy tests: Allow per distribution strictness options #1526.

My suggestion: Only run the tests for the third-party stubs that have actually changed, and run each test in a separate environment. This environment only has the requirements from METADATA.toml installed. This fixes the problems above and also means:

The stubs will not be able to use features from stdlib that are not yet in the latest type checker distributions.
Changes that depend on changes in another third-party stub need to wait until that change has been released. (Happens every three hours, so it shouldn't be too much of a problem.)
Running the tests for all stubs will be slower, since each package needs its own venv installation.

The text was updated successfully, but these errors were encountered:

Akuli · 2021-08-23T17:48:34Z

A possible downside: If the tests of a third-party stub break for some reason, e.g. because a new version of the corresponding non-stub package is released, the problem will remain unnoticed until someone makes a PR for that specific stub package. I remember most pull requests showing red CI because of some Pillow thing, but can't find it from the PR history now.

srittau · 2021-08-23T18:13:22Z

It would be useful to run full tests at a regular interval, say once per day or week.

hauntsaninja · 2021-08-23T21:39:53Z

Note the stubtest third party code already does this venv creation. We could steal that code / maybe use some fancy Github Actions CI caching to cache venvs across workflows. Switching to testing only changed third party stubs is probably a better idea than just doing everything and sharding, which is the approach test_stubtest currently takes.

Like you say, I'd want to think through how we manage different stdlib and types-* versions; to that goal, here are some previous issues we've had on those lines: #4815 #5786 #5751

Akuli · 2021-12-05T20:21:42Z

This seems to be done now.

srittau · 2021-12-06T08:43:44Z

No, we still need to use separate venvs for each distribution, so that the dependencies don't interfere with each other.

hmc-cs-mdrissi · 2022-02-20T08:24:12Z

I would be interested in working on this mainly to make it possible to have non-types dependencies.

Would reasonable steps in order be,

Adjust each check to only run for changed folders. Start with mypy then continue to pyright then pytype.
Create a new venv for each folder checked. Again mypy -> pyright -> pytype
Add support for reading metadata.toml and install dependencies for given folder.
Add support for per package mypy.ini/pyrightconfig.json/etc files.

Any major tasks I'm missing?

hauntsaninja · 2022-02-20T08:54:15Z

That sounds reasonable. Some thoughts:

It might make sense to split each test up into running on the stdlib vs running on individual third party distributions (like we do with stubtest)
Here's some code for venv creation:

typeshed/tests/stubtest_third_party.py

Line 39 in 823592e

with tempfile.TemporaryDirectory() as tmp:
We'd want to make sure that other stubs don't interfere, e.g. we should not be picking up cryptography-stubs if someone declares a cryptography dep. The various type checkers probably each handle this differently
We talked about only having a small, vetted list of deps. If this is the case, I personally wouldn't hate getting started on Allow non-types dependencies #5768 with a global venv. This might be controversial though :-)
It might be nice to use different keys in pyproject.toml for typeshed deps vs other deps (e.g., could make automation in the future easier / avoid mistakes where we assume that "types-*" packages are trusted). Might need a change in https://github.com/typeshed-internal/stub_uploader
There's an @tests folder you can reuse for per-distribution config files. We even store some extra requirements.txt's in there

srittau · 2022-02-21T16:31:22Z

@hmc-cs-mdrissi For now you could try to skip step 1 and we can see how that affects CI runtimes.

AlexWaygood · 2023-01-08T13:48:43Z

As of #9408, third-party stubs with non-types dependencies are now tested with mypy in isolated venvs. The venvs are setup concurrently using a threadpool, meaning we still test all typeshed stubs packages in every run, but the script remains performant.

(Stubs packages with no non-types dependencies are not tested in a separate venv, but they are tested using the --no-site-packages flag when they are tested, so are still run in an isolated environment.)

Closing as completed! 🥳

srittau added the project: policy Organization of the typeshed project label Aug 23, 2021

srittau mentioned this issue Aug 24, 2021

Remove cryptography (depends on #5952) #5618

Closed

srittau mentioned this issue Sep 21, 2021

Overhaul some of the stubs for requests/urllib3 #2612

Closed

srittau mentioned this issue Nov 3, 2021

Test third-party stubs in isolation #6229

Merged

Akuli closed this as completed Dec 5, 2021

srittau reopened this Dec 6, 2021

srittau added project: infrastructure typeshed build, test, documentation, or distribution related and removed project: policy Organization of the typeshed project labels Jan 17, 2022

hmc-cs-mdrissi mentioned this issue Feb 20, 2022

Allow non-types dependencies #5768

Closed

hauntsaninja mentioned this issue Feb 20, 2022

Tensorflow Type Stubs #7144

Closed

hauntsaninja changed the title ~~Testing third-party stubs~~ Testing third-party stubs in isolated environments Jun 18, 2022

AlexWaygood closed this as completed Jan 8, 2023

srittau mentioned this issue May 12, 2024

Document that we can't use new stdlib symbols until added to type-checkers #11903

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Testing third-party stubs in isolated environments #5952

Testing third-party stubs in isolated environments #5952

srittau commented Aug 23, 2021 •

edited

Loading

Akuli commented Aug 23, 2021

srittau commented Aug 23, 2021

hauntsaninja commented Aug 23, 2021

Akuli commented Dec 5, 2021

srittau commented Dec 6, 2021

hmc-cs-mdrissi commented Feb 20, 2022

hauntsaninja commented Feb 20, 2022 •

edited

Loading

srittau commented Feb 21, 2022

AlexWaygood commented Jan 8, 2023

Testing third-party stubs in isolated environments #5952

Testing third-party stubs in isolated environments #5952

Comments

srittau commented Aug 23, 2021 • edited Loading

Akuli commented Aug 23, 2021

srittau commented Aug 23, 2021

hauntsaninja commented Aug 23, 2021

Akuli commented Dec 5, 2021

srittau commented Dec 6, 2021

hmc-cs-mdrissi commented Feb 20, 2022

hauntsaninja commented Feb 20, 2022 • edited Loading

srittau commented Feb 21, 2022

AlexWaygood commented Jan 8, 2023

srittau commented Aug 23, 2021 •

edited

Loading

hauntsaninja commented Feb 20, 2022 •

edited

Loading