Async code testing paradigm is broken and ugly #2977

virl · 2017-03-15T08:07:15Z

Robolectric has a very broken and ugly paradigm for testing async code, because it doesn't allow the implementation's hidden loopers (loopers that the test has no direct reference to) to execute during the test.

The real situation is even worse than that, because ordinary Threads and Executors keep running anyway and are not paused.

As a result, it is almost impossible to test a complex async system as a black box: the test ends up depending on the code's implementation details (its specific loopers).

Robolectric should behave like Xcode's XCTest in this regard: it should allow unpausing ALL loopers in the app with a single method call such as .unpauseAllLoopers(), and then execute code on them exactly as in a real app (on separate threads, etc.).
Frankly, this behaviour should be the default and should not even require a method call.

Such "unpausing" of loopers (especially if it becomes the default behaviour, which it must) should also execute, as normal, any Runnables submitted after the unpause.
The current behaviour of unpauseLooper(), which does not execute newly submitted Runnables, is a major bug.

And of course the Scheduler/ShadowLooper/etc. implementation should not replace deferred execution of runnables (in the next event loop iteration) with inline, immediate execution on the same call stack, because that introduces major out-of-order execution errors!
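
To illustrate the ordering problem, here is a minimal sketch in plain Java. It does not use Robolectric or Android classes: `PostOrderingSketch`, its `run` method and the plain `Queue<Runnable>` standing in for a looper's message queue are all hypothetical names invented for this example.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.List;
import java.util.Queue;

// Hypothetical stand-in for a looper; not Robolectric's Scheduler or ShadowLooper.
class PostOrderingSketch {

    // inline == true mimics a scheduler that runs a posted task on the caller's stack;
    // inline == false mimics a real event loop that defers it to a later iteration.
    static List<String> run(boolean inline) {
        List<String> log = new ArrayList<>();
        Queue<Runnable> queue = new ArrayDeque<>();

        Runnable posted = () -> log.add("posted task");
        Runnable caller = () -> {
            log.add("caller: before post");
            if (inline) {
                posted.run();      // executed immediately, in the middle of the caller
            } else {
                queue.add(posted); // executed only after the caller has returned
            }
            log.add("caller: after post");
        };

        // Drain the queue the way a simple event loop would.
        queue.add(caller);
        Runnable next;
        while ((next = queue.poll()) != null) {
            next.run();
        }
        return log;
    }

    public static void main(String[] args) {
        System.out.println("deferred: " + run(false));
        System.out.println("inline:   " + run(true));
    }
}
```

With deferred execution the caller finishes before the posted task starts. With inline execution the posted task runs in the middle of the caller, so any code that relies on "post, then finish my own work" sees a different order in the test than in the app.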

The best solution would be to deprecate the ShadowLooper mechanism completely and introduce an analogue of XCTestExpectation: a condition object that the test can wait on from its own run loop, and which spins the current run loop until the condition is fulfilled asynchronously from another thread or run loop, or until a timeout expires.
But these expectations should not require the test to know explicitly about the internal loopers of the tested code. They should spin/await only the test's current looper, so that the test can wait for events happening on that looper (and also for events on any other looper or thread, without pausing it).
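
A rough idea of what such an expectation could look like in plain Java is sketched below. This is not an existing Robolectric or JUnit API: `Expectation`, `fulfill`, `await` and `ExpectationSketch` are hypothetical names, and a single-thread executor stands in for a hidden worker that the test holds no reference to in real code. Note one simplification: this version blocks the waiting thread on a `CountDownLatch` instead of spinning the test's looper, so it only covers waiting for events that happen on other threads.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Hypothetical expectation type; an assumption for illustration only.
class Expectation {
    private final CountDownLatch latch = new CountDownLatch(1);

    // Called by the code under test (on any thread) when the awaited event happens.
    void fulfill() {
        latch.countDown();
    }

    // Returns true if fulfilled within the timeout, false on timeout or interruption.
    boolean await(long timeout, TimeUnit unit) {
        try {
            return latch.await(timeout, unit);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }
}

class ExpectationSketch {
    static boolean demo() {
        // Stand-in for a background worker owned by the code under test.
        ExecutorService hidden = Executors.newSingleThreadExecutor();
        Expectation done = new Expectation();
        try {
            hidden.submit(done::fulfill);           // async work completes on its own thread
            return done.await(5, TimeUnit.SECONDS); // the test waits only on the expectation
        } finally {
            hidden.shutdown();
        }
    }

    public static void main(String[] args) {
        System.out.println("fulfilled: " + demo());
    }
}
```

The test never pauses or steps the worker; it only states what it expects to happen and how long it is willing to wait.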

For how this is properly implemented in iOS's XCTest framework, see XCTestExpectation.

To summarize, Robolectric makes a major mistake in deciding that multithreading reproducibility in tests can be achieved by freezing all threads and iterating over their runnables step by step. You can't do that with Threads anyway, and for Loopers it prevents testing even minimally complex classes as black boxes and introduces out-of-order execution errors during tests.
Instead, Robolectric should run async code in tests as it runs in a real app, and give the developer strong API tools to check async expectations about the result of its execution, as Xcode's XCTest does.

Also see #1993 #1994 #1727 #1711 #2851 #2149 #1879
For current paradigm's ugly consequences see #3369 #3359 #3193 #3270 #2961 #2957 #2958 #2205 #2204 #1306 #2534 #3234 #3188
For failed workaround attempts (without changing the broken paradigm itself) see #3369 #2166 #2119 #2122 #2116

mikesol · 2017-09-05T06:01:01Z

Thanks for doing this research!
As you reference a couple bugs I reported, I thought I'd add a workaround attempt that I've done with pretty good results. I use cucumber to coordinate large-ish tests. Aside from the benefits it brings in human readability, it also allows for things to be parallelized - a CI pipeline ships off artifacts to parallel testing servers that run a test in an isolated environment and merge the result down the pipeline. This is, of course, a workaround, but at the end of the day it is a quite practical one that covers business-critical test cases and allows us to ship with confidence.

virl · 2017-09-05T06:39:42Z

@mikesol I think you're talking about a different, though partially related, problem.
This issue is about Robolectric's broken API for testing modern code, which is caused by Robolectric's fundamentally wrong multithreading architecture.

Your #3359 is related to this in the sense that Robolectric's concurrency and scalability problems, while a different issue, have the same root cause: a wrong multithreading architecture that tries to "Freeze the World" in a futile attempt to make the simplest of UI tests totally deterministic.

emartynov · 2018-04-18T07:37:29Z

Let me add my 5 cents here. What follows is just my opinion.

Using Robolectric for integration tests is probably a bad choice. Robolectric is mainly used to test the integration of a single class with the Android system. In such tests, it is easy and reasonable to assume that multithreading can be replaced/tested by sequential execution of runnables.

The real tests for system integration, and maybe e2e (if you prefer), are instrumentation tests. No matter what, they are hard to write, maintain and execute.

virl · 2018-04-19T01:11:01Z

@emartynov No, that is not so: even for a single class, multithreading is an implementation detail, even if that class has only a synchronous public API.
That is because a class can (and will) run intermediate tasks in parallel even to serve a synchronous request. Or the class can use an arbitrary third-party library (which is also an implementation detail that tests should not rely on) that uses multithreading.

> Robolectric is mainly used to test the integration of a single class with the Android system. In such tests, it is easy and reasonable to assume that multithreading can be replaced/tested by sequential execution of runnables.

Multithreading can never be replaced or tested by sequential execution, because neither the JVM nor Android makes such guarantees. The JVM even has the Java Memory Model, which guarantees only happens-before ordering, not any deterministic behaviour between multiple threads without locks.

Therefore, replacing async testing with sequential runnables violates both the encapsulation of the tested code (tests will break when one correct implementation is replaced by another correct implementation) and the system API guarantees.
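
As a small illustration of what a happens-before edge does guarantee, here is a sketch in plain Java (`HappensBeforeSketch` and `compute` are hypothetical names used only for this example). The Java Memory Model specifies that all actions in a thread happen-before another thread's successful return from `join()` on it, so the write is guaranteed to be visible after the join. Nothing comparable is promised about the relative timing of two unsynchronized threads.

```java
// Hypothetical example class; illustrates the Thread.join() happens-before rule only.
class HappensBeforeSketch {
    // Deliberately neither volatile nor guarded by a lock.
    static int result;

    static int compute() {
        Thread worker = new Thread(() -> result = 42);
        worker.start();
        try {
            // join() is the synchronization point: the worker's write to `result`
            // happens-before this call returns.
            worker.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return -1;
        }
        return result;
    }

    public static void main(String[] args) {
        System.out.println(compute());
    }
}
```

A test that waits on such a synchronization point relies only on what the platform guarantees, whereas a test that assumes a particular interleaving of runnables relies on an accident of the implementation.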

AstralStorm · 2019-06-04T13:29:10Z

How do we actually ensure that all HandlerThreads do run? There should be some sort of barrier-like API to simulate the old behavior.
Likewise, there should be some API to allow running all loopers when they are force-paused...
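
One way such a barrier might work is sketched below in plain Java, with single-thread executors standing in for HandlerThreads. `QueueBarrierSketch`, `awaitAll` and `demo` are hypothetical names, not Robolectric API. The idea is to post a marker task to every FIFO queue and wait until all markers have run. This only covers work that was enqueued before the marker; tasks that post further tasks would need repeated rounds.

```java
import java.util.List;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical barrier helper; single-thread executors are stand-ins for HandlerThreads.
class QueueBarrierSketch {

    // Posts one marker per queue and returns true once every marker has executed,
    // which implies that everything enqueued earlier on those queues has run too.
    static boolean awaitAll(List<ExecutorService> queues, long timeout, TimeUnit unit) {
        CountDownLatch markers = new CountDownLatch(queues.size());
        for (ExecutorService queue : queues) {
            queue.execute(markers::countDown);
        }
        try {
            return markers.await(timeout, unit);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    // Enqueues one task per queue, waits on the barrier, and reports how many tasks ran.
    static int demo() {
        List<ExecutorService> queues = List.of(
                Executors.newSingleThreadExecutor(),
                Executors.newSingleThreadExecutor());
        AtomicInteger ran = new AtomicInteger();
        try {
            for (ExecutorService queue : queues) {
                queue.execute(ran::incrementAndGet);
            }
            return awaitAll(queues, 5, TimeUnit.SECONDS) ? ran.get() : -1;
        } finally {
            queues.forEach(ExecutorService::shutdown);
        }
    }

    public static void main(String[] args) {
        System.out.println("tasks completed before barrier: " + demo());
    }
}
```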

virl changed the title ~~Async code testing paradigm is broken/ugly.~~ Async code testing paradigm is broken and ugly Mar 15, 2017

This was referenced Mar 15, 2017

Make scheduler be paused by default. #2851

Closed

Handler doesn't execute Runnables when using Robolectric 3 #1993

Closed

Support Realm #1389

Open

Add support for Robolectric realm/realm-java#904

Open

This was referenced Sep 4, 2017

RFC: Deprecate Unpaused Main Looper #3369

Closed

java.util.ConcurrentModificationException when closing cursor #3359

Open

How to test behaviour using RxJava debounce #3270

Open

Robolectric enters infinite loop in the scheduler #3193

Closed

jongerrish added the scheduler Scheduler rewrite + cleanup label Nov 2, 2018
