Improve test collection speed #375

lukasturcani · 2021-07-28T17:23:46Z

Related Issues: #370
Requested Reviewers: @andrewtarzia
Note for Reviewers: If you accept the review request add a 👍 to this post

The speed at which tests are collected has been dramatically improved. Previously, running

pytest --collect-only

would take approximately 12.84 s. It is now 3.07 s. This is extremely important because collection time is spent every time pytest is run, even if only a subset of tests are run. For example

pytest -k get_id

would have spent 12.84 s collecting all the tests and then about 1 s to actually run the tests. This is an incredibly frustrating developer experience because it makes running tests very unresponsive. Now running the same command would take 4.07 s.
Importantly the total time spent running tests is basically unchanged, going from 48.24 s previously to 48.78 s now.

Another important change is that test will no longer break in the collection phase, or at least, very few of them will. The issue with tests breaking during the collection phase is that if this happens, pytest does not provide very good error information.

Both the collection time and prevention of breakage during the collection change was achieved by deferring the creation of test cases to after the collection phase. For example, a fixture such as this

@pytest.fixture(
    params=(
        CaseData(...),
        CaseData(...),
    ),
)
def case_data(request):
    return request.param

has been changed to this

@pytest.fixture(
    params=(
        lambda: CaseData(...),
        lambda: CaseData(...)
    ),
)
def case_data(request):
    return request.param()

The difference between these two pieces of code is that in the original version the CaseData instance is created when the module is being read. However, in the new version, the CaseData instance is created only once the fixture is being used by the test. This means the collection phase of pytest does not actually create any instances any more, the test execution phase creates them now.

The other oft-repeated change is turning module-level variables into functions

some_var = Something()

into functions

def _get_some_var() -> Something:
    return Seomthing()

Again, this means that instances are not created when the module is being read. This change however does mean that the variable can no longer be re-used as a new instance will be created every time _get_some_var() is called. For most cases, I expect this will make no practical difference. In cases where I did think a performance impact may be incurred I used fixtures

@pytest.fixture(scope='session')
def _get_some_var() -> Something:
    return Something()

These changes constitute the vast majority of the changes, some other notable changes are:

pytest.ini: Added this file and explicitly specified the directory where the tests are located. This means that running pytest will only search the tests directory rather than all other other directories too when looking for tests, giving a small reduction in test collection time.
tests/molecular/molecules/.../fixtures/building_blocks.py: I added this file to factor out the common building blocks used across multiple modules.

andrewtarzia · 2021-07-28T17:36:09Z

My only comment is that you should delete all .npy and rebuild all (git commit should only be delete 2, add 2), because I think you added 2 instead of replacing 2.

lukasturcani · 2021-07-28T18:23:49Z

My only comment is that you should delete all .npy and rebuild all (git commit should only be delete 2, add 2), because I think you added 2 instead of replacing 2.

I'm not following. Could you spell it out a little more for me please?

andrewtarzia · 2021-07-28T18:25:25Z

You changed two test names in the cage fixtures, which lead to two new .npy files. But you did not delete the .npy files from the old name. Right?

lukasturcani · 2021-07-28T18:38:59Z

I did, that's why the the files are listed as renamed.

andrewtarzia · 2021-07-28T18:39:46Z

Oops misread!

lukasturcani added 30 commits July 27, 2021 15:02

Add helper functions

91dceda

Wip

ca68427

Wip

8df3ad0

Wip

12ff90c

Wip

b40e1ba

Wip

4f48cde

wip

a4fdbaf

NRotaxane working

8f3edb6

Make everything lazy

a285231

Polymer ready

6c43f0b

Macrocycle workiing

28621ea

Update host guest

8623151

wip

33d2a8c

Wip

e0c63c0

wip

434a2b8

wip

79ea23c

move

6c62b04

wip

65acd2c

wip

640e8bb

wip

e527321

wip

5f30c68

wip

1e57774

wip

9c62597

wip

da3d6cf

wip

958e10c

wip

7fc0485

wip

6fb806b

wip

369b3db

wip

2eb5a16

wip

8eaf450

lukasturcani added 16 commits July 28, 2021 00:38

wip

928a41b

wip

8a964ab

wip

151b70e

Remove warnings

d611d67

More deferring

3274f25

wip

6769c6d

wip

8a9199a

wip

5fc01f4

wip

a8d4323

wip

a550912

wip

c9ed25e

wip

95863b0

wip

4b5e1ec

Add pytest.ini

b81cfb1

Merge branch 'master' into lukas/improve-collection-speed

901d59a

lint

b2e41c8

andrewtarzia approved these changes Jul 28, 2021

View reviewed changes

lukasturcani merged commit f363470 into master Jul 28, 2021

lukasturcani deleted the lukas/improve-collection-speed branch July 28, 2021 21:01

lukasturcani mentioned this pull request Aug 29, 2021

Substitute Substructure Mutator #300

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve test collection speed #375

Improve test collection speed #375

lukasturcani commented Jul 28, 2021 •

edited

Loading

andrewtarzia commented Jul 28, 2021

lukasturcani commented Jul 28, 2021 •

edited

Loading

andrewtarzia commented Jul 28, 2021

lukasturcani commented Jul 28, 2021

andrewtarzia commented Jul 28, 2021

Improve test collection speed #375

Improve test collection speed #375

Conversation

lukasturcani commented Jul 28, 2021 • edited Loading

andrewtarzia commented Jul 28, 2021

lukasturcani commented Jul 28, 2021 • edited Loading

andrewtarzia commented Jul 28, 2021

lukasturcani commented Jul 28, 2021

andrewtarzia commented Jul 28, 2021

lukasturcani commented Jul 28, 2021 •

edited

Loading

lukasturcani commented Jul 28, 2021 •

edited

Loading