
Only run tests for changed code #115

Closed
sindresorhus opened this issue Oct 31, 2015 · 19 comments · Fixed by #544
@sindresorhus
Member

Many times you only change code that affects one or a few tests, but still have to suffer through the whole test suite. Would be nice to only run the affected tests instead. This would be a huge win for people with lots of tests.

This is easy on a test file level as we can just recurse the dependency tree and see which test files depend on the changed file. It gets harder for individual tests. Here we could take advantage of code coverage, diff what's changed between runs, and check which lines were affected.
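The file-level idea above can be sketched with a minimal dependency-graph walk. This is an illustration only, not AVA's implementation: the graph is hand-written here, whereas a real tool would build it by parsing `require`/`import` statements.

```javascript
// Given a dependency graph (file → files it requires), find every test file
// that directly or transitively depends on a changed file.
function affectedTests(depGraph, testFiles, changedFile) {
  // Invert the graph: file → files that depend on it.
  const dependents = new Map();
  for (const [file, deps] of Object.entries(depGraph)) {
    for (const dep of deps) {
      if (!dependents.has(dep)) dependents.set(dep, []);
      dependents.get(dep).push(file);
    }
  }

  // Walk upward from the changed file, collecting everything affected.
  const affected = new Set([changedFile]);
  const queue = [changedFile];
  while (queue.length > 0) {
    const current = queue.shift();
    for (const dependent of dependents.get(current) || []) {
      if (!affected.has(dependent)) {
        affected.add(dependent);
        queue.push(dependent);
      }
    }
  }

  return testFiles.filter(test => affected.has(test));
}

// Example: test/a.js depends on lib/a.js, which depends on lib/util.js,
// so a change to lib/util.js affects only test/a.js.
const graph = {
  'test/a.js': ['lib/a.js'],
  'test/b.js': ['lib/b.js'],
  'lib/a.js': ['lib/util.js']
};

console.log(affectedTests(graph, ['test/a.js', 'test/b.js'], 'lib/util.js'));
// → ['test/a.js']
```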

Prior art

This is not something that is likely to come in the short term, but opening for discussion and brainstorming. Help making this happen more than welcome.

@sindresorhus sindresorhus added the enhancement (new functionality) and question labels Oct 31, 2015
@schnittstabil

Nice challenge! But, to clarify: computability theory tells us that we cannot simply take the diff of two source states and compute which tests are affected.

Possible approaches to solve this:

  1. we run the tests before the change and gain information from them (we may persist this in some way or other)
  2. we impose restrictions, e.g. if tests follow a certain naming pattern, we run them when matching files change

It also means we cannot do this perfectly; we would always run some tests unnecessarily. The benefit would be reducing the number of tests that need to run.

@madbence

madbence commented Nov 2, 2015

IMHO, just store the dependencies of the tests; if any of them change, we rerun the tests. There is no better (simple) way to do this.

@vadimdemedes
Contributor

This is the way I solved it in my side project: run tests under istanbul, check how coverage changes between tests, and create a mapping of test → file/line it depends on. After the initial test run, which creates a mapping for all tests, I check which lines in my project have changed, and then run only the tests that "touch" those lines.
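The coverage-mapping approach described above can be sketched roughly like this. The data shapes (`coverageByTest`, `changedLines`) are hypothetical stand-ins for what an istanbul-instrumented run would produce, not the side project's actual code:

```javascript
// coverageByTest: { testName: { file: [covered line numbers] } }
// changedLines:   { file: Set of changed line numbers }
// Select only the tests whose coverage intersects a changed line.
function testsToRun(coverageByTest, changedLines) {
  const selected = [];
  for (const [test, coverage] of Object.entries(coverageByTest)) {
    const touchesChange = Object.entries(coverage).some(([file, lines]) =>
      changedLines[file] !== undefined &&
      lines.some(line => changedLines[file].has(line))
    );
    if (touchesChange) selected.push(test);
  }
  return selected;
}

const coverage = {
  'adds numbers':  {'lib/math.js': [1, 2, 3]},
  'formats dates': {'lib/date.js': [10, 11]}
};

// Only line 2 of lib/math.js changed, so only the math test is selected.
console.log(testsToRun(coverage, {'lib/math.js': new Set([2])}));
// → ['adds numbers']
```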

@Qix-
Contributor

Qix- commented Nov 2, 2015

@vdemedes that's clever, but still not theoretically sound. We're still dealing with the halting problem here; we have no idea if the program will run differently given different inputs. Keep in mind time is an input.

This means that even though a small code change might only affect one or two tests according to the last run's data, the inputs (or the way inputs are interpreted) might change so that it actually affects many tests, which, according to the stale profiling/coverage data, do not get run. This will almost certainly introduce regressions.

As well, this will require a command line flag for re-running all tests. Removing a cache file or something isn't really user friendly, and now you're storing something in the working directory in order to keep that data. People will have to migrate to this model and include something in their gitignore.


Side note: istanbul uses V8's profiler, which can dramatically slow things down. Keep that in mind, too.

@sindresorhus
Member Author

and now you're storing something in the working directory in order to keep that data. People will have to migrate to this model and include something in their gitignore.

We would store it in ~/.cache/ava/project-name-from-package-json, not in the project directory.

@Qix-
Contributor

Qix- commented Nov 2, 2015

What if you change branches in the project?

@sindresorhus
Member Author

We would use an LRU cache, so the data wouldn't be expunged right away when switching branches.
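The LRU idea can be sketched with a plain `Map`, which iterates in insertion order. Hypothetical illustration, assuming cached coverage data keyed by branch name:

```javascript
// Keep data for the N most recently used keys; switching branches only
// evicts the least recently used entry once capacity is exceeded.
class LruCache {
  constructor(maxSize) {
    this.maxSize = maxSize;
    this.map = new Map(); // Map preserves insertion order.
  }

  get(key) {
    if (!this.map.has(key)) return undefined;
    const value = this.map.get(key);
    // Re-insert to mark the key as most recently used.
    this.map.delete(key);
    this.map.set(key, value);
    return value;
  }

  set(key, value) {
    this.map.delete(key);
    this.map.set(key, value);
    if (this.map.size > this.maxSize) {
      // Evict the least recently used key (first in insertion order).
      this.map.delete(this.map.keys().next().value);
    }
  }
}

const cache = new LruCache(2);
cache.set('main', 'coverage-main');
cache.set('feature', 'coverage-feature');
cache.get('main');                      // touch 'main' so it stays fresh
cache.set('hotfix', 'coverage-hotfix'); // evicts 'feature'
console.log([...cache.map.keys()]);     // → ['main', 'hotfix']
```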

@Qix-
Contributor

Qix- commented Nov 2, 2015

(meme image)

@Qix-
Contributor

Qix- commented Nov 2, 2015

(meme image)

@Qix-
Contributor

Qix- commented Nov 2, 2015

sigh and not even a history to prove that I'm not a potato. Thanks Github.

#BDFLAbuse #PotatoLivesMatter

@schnittstabil

We're still dealing with the halting problem here; we have no idea if the program will run differently given different inputs.

We should not worry too much about obfuscated code; the solution only needs to handle reasonable, practical code…

require(new Date().getTime() === new Date(2017,3,16).getTime() ? './upload-genisys.js' : './');

@Qix-
Contributor

Qix- commented Nov 2, 2015

@schnittstabil not quite sure what you mean there.

@schnittstabil

@Qix- I think it is OK to only consider commonly used code patterns. As you already mentioned, this feature cannot be theoretically sound.

@Qix-
Contributor

Qix- commented Nov 2, 2015

But by even introducing the possibility of skipping tests that could have been affected by a change, you're introducing the chance of regressions slipping into your code base. Further, it becomes the test suite's fault, which is not something a test suite should ever allow.

It's a sweet idea though.

@sindresorhus
Member Author

Wallaby.js somehow did it, though.

(screenshot of Wallaby.js)

@schnittstabil

Clearly this feature shouldn't be the default. I would imagine a --quick CLI flag or similar, whose --help entry may note its limitations…

We may also use this in some watch mode: run the possibly affected tests first and the remaining tests afterwards.
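The watch-mode idea above can be sketched as a simple reordering: no test is ever skipped, the likely-affected ones just run first for fast feedback. Hypothetical function names, not AVA's API:

```javascript
// Partition the full test list into likely-affected tests and the rest,
// and run the affected ones first.
function orderForWatch(allTests, likelyAffected) {
  const affected = new Set(likelyAffected);
  return [
    ...allTests.filter(t => affected.has(t)),
    ...allTests.filter(t => !affected.has(t))
  ];
}

console.log(orderForWatch(['a', 'b', 'c'], ['c']));
// → ['c', 'a', 'b']
```

Because everything still runs eventually, the heuristic being imperfect costs only feedback latency, not correctness.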

@Qix-
Contributor

Qix- commented Nov 2, 2015

+1 for watch mode. That's a great idea, @schnittstabil. However, I think the initial run in watch mode should run all tests, and subsequent triggers while watching should run only the affected tests. THAT would be cool.

@sindresorhus
Member Author

Clearly this feature shouldn't be the default. I would imagine a --quick CLI flag or similar, whose --help entry may note its limitations…

👍

Yes, the intention is to combine it with some kind of watch mode. That's where it would shine. Being able to code and see the test results almost live.

@ArtemGovorov
Contributor

Wallaby.js core developer here, here's our 2c of experience with it:

As has been mentioned a few times in this discussion, there's no theoretically waterproof solution. We use runtime analysis powered by instrumentation-collected data, plus a bazillion heuristic rules of various natures, plus framework-specific hacks, plus additional data received directly from the specific code editor we integrate with. So while a simple dependency/call-tree analysis covers about 70% of cases, the remaining 29.9999% is really painful to cover.
