Rework the sync tests #8435

tgoyne · 2023-12-13T23:36:07Z

This unfortunately turned into a giant unreviewable mess, but I don't really know how to split it up. An overview of the changes:

Each XCTestCase subclass now creates a server-side App which is used for most of the tests in that test case. Apps are now never shared between test cases, and do not share Mongo collections with other apps. Together this means that state from one test case should never bleed over to other test cases.

This required rearranging many of the tests, as we had test cases with both PBS and FLX tests. While doing this I also discovered several tests which were just plain in the wrong test case due to files having multiple multi-thousand-line test cases and people thought they were adding tests to the one defined at the top. I split these files into multiple files, which I think makes things much more managable but unfortunately results in the diff being unhelpful.

Each test case now explicitly defines which set of classes it uses, and only Rules for those classes is created in the server app. This cuts the time required to create apps roughly in half and helps offset the fact that we're now creating more apps. It also gets rid of the weird things like the hardcoded list of flx-compatible types in RealmServer.

Creating an app now waits for the initial sync to complete. With PBS this is required to avoid some very strange bugs. With FLX it mostly ensures that if this times out for some reason we get a test failure there, rather than later in some very confusing place in the middle of a test.

Client reset tests now use the API endpoint for FLX in addition to PBS. This makes them dramatically faster (several seconds instead of 30+).

FLX tests now consistently follow the pattern of using one of the object fields as a partition key to query on rather than querying for all objects of a type. Some tests already did this, while others tried to clear the data first (which did not always work if the server was in the middle of processing old requests), and some just plain broke if tests were run in the wrong order.

In the very early days of sync, opening the same Realm URL twice required two different processes, so all our sync tests spawned child processes. That hasn't been true for a very long time, but the tests stuck around and some more were written in that style due to mimicking the existing tests. I've ported almost all of them over to operating in a single process, which makes them both simpler and much faster (5s to .5s in many cases).

The tests are now run with developer mode off. This was initially required due to the change where opening with a class subset is now considered a breaking change in developer mode, but now that test cases explicitly specify their types that isn't a problem any more. However, it does let us once again test subscriptions failing due to an unqueryable field, and that test revealed that we were using the wrong error domain for that error.

I added some new helper functions for the things I discovered I was going to have to change in literally hundreds of places. Creating a temporary user is now just self.createUser() rather than separate steps of creating credentials and logging in. self.name is now used as the tag value for partitions and user names and such rather than #function or NSStringFromSelector(_cmd), which makes it so that it doesn't have to be explicitly passed into helper functions. There were a number of places where this previously was done incorrectly and #function was used inside helper functions, which didn't achieve the desired effect.

dianaafanador3

Really great work into making the sync test suite better. just some minor comments. And as a plus, I'm missing a little bit more documentation/comments into our test cases common code, I guess if you take a look at this PR everything makes sense, but if you are someone new this make feel like magic, or you can even miss adding this for a new sync test file.

dianaafanador3 · 2023-12-18T16:00:06Z

Realm/ObjectServerTests/ClientResetTests.swift

@@ -0,0 +1,715 @@
+////////////////////////////////////////////////////////////////////////////
+//
+// Copyright 2016 Realm Inc.


This date doesn't correspond to the date of creation of this file or test at least, in this and the other files created

dianaafanador3 · 2023-12-18T16:31:08Z

Realm/ObjectServerTests/RealmServer.swift

@@ -1228,4 +1252,7 @@ public class RealmServer: NSObject {
    }
 }

+extension String: Error {


Is this needed?

dianaafanador3 · 2023-12-18T16:32:13Z

Realm/ObjectServerTests/RealmServer.swift

+        while true {
+            let complete = try session.apps[appServerId].sync.progress.get()
+                .map { resp in
+                    guard let resp = resp as? Dictionary<String, Any?> else { return false }


can we just have all this conditions in the same guard?

dianaafanador3 · 2023-12-18T16:36:45Z

RealmSwift/SyncSubscription.swift

+     - parameter type: The type of the object to be queried.
+     - parameter query: A query which will be used to modify the existing query.
+     */
+    public func updateQuery<T: Object>(toType type: T.Type, where query: (Query<T>) -> Query<Bool>) {


why adding this, if the above API can be used for this

dianaafanador3 · 2023-12-18T16:37:29Z

RealmSwift/SyncSubscription.swift

@@ -166,6 +182,20 @@ import Realm.Private
        self.predicate = query?(Query()).predicate ?? NSPredicate(format: "TRUEPREDICATE")
    }

+    /**


Same in here, doesn't the above API already can be used for this?

The comment in the body says why: optional function parameters are always escaping, but this function doesn't escape.

sorry, didn't check the comment within the code, only the doctoring above

dianaafanador3 · 2023-12-18T16:47:25Z

Realm/Tests/SwiftUISyncTestHostUITests/SwiftUISyncTestHostUITests.swift

-
-        let realm = try Realm(configuration: config1)
-        try realm.write {
+        try write { realm in


Why not use populateData, if we are checking for count and not for specific data

The test is checking the number of sections, which depends on the number of unique first names.

tgoyne · 2023-12-19T17:42:13Z

I'm going to take a pass at writing some explanations of wtf the sync tests are doing, because you're absolutely right that there's a lot of complexity that's completely undocumented.

dianaafanador3 · 2023-12-19T17:44:59Z

I'm going to take a pass at writing some explanations of wtf the sync tests are doing, because you're absolutely right that there's a lot of complexity that's completely undocumented.

Don't worry, I think is something I mentioned only as a bonus point

…hange Core has allowed this for a while, but we had our own validation which made it not work.

The RLMSyncSubscriptionSet was retained until the task was deallocated even after the task completed, which is not strictly incorrect but made a test unreliable.

This compiles because the value can be implicitly converted to an optional, but it isn't actually checking anything. Even if the non-optional has an invalid nil value, it ends up as `.some(invalid)` and will pass the test.

We'll never get a success after an error, so not fulfilling the expectation just pointless waits for a timeout.

This unfortunately turned into a giant unreviewable mess, but I don't really know how to split it up. An overview of the changes: Each XCTestCase subclass now creates a server-side App which is used for most of the tests in that test case. Apps are now never shared between test cases, and do not share Mongo collections with other apps. Together this means that state from one test case should never bleed over to other test cases. This required rearranging many of the tests, as we had test cases with both PBS and FLX tests. While doing this I also discovered several tests which were just plain in the wrong test case due to files having multiple multi-thousand-line test cases and people thought they were adding tests to the one defined at the top. I split these files into multiple files, which I think makes things much more managable but unfortunately results in the diff being unhelpful. Each test case now explicitly defines which set of classes it uses, and only Rules for those classes is created in the server app. This cuts the time required to create apps roughly in half and helps offset the fact that we're now creating more apps. It also gets rid of the weird things like the hardcoded list of flx-compatible types in RealmServer. Creating an app now waits for the initial sync to complete. With PBS this is required to avoid some very strange bugs. With FLX it mostly ensures that if this times out for some reason we get a test failure there, rather than later in some very confusing place in the middle of a test. Client reset tests now use the API endpoint for FLX in addition to PBS. This makes them dramatically faster (several seconds instead of 30+). FLX tests now consistently follow the pattern of using one of the object fields as a partition key to query on rather than querying for all objects of a type. Some tests already did this, while others tried to clear the data first (which did not always work if the server was in the middle of processing old requests), and some just plain broke if tests were run in the wrong order. In the very early days of sync, opening the same Realm URL twice required two different processes, so all our sync tests spawned child processes. That hasn't been true for a very long time, but the tests stuck around and some more were written in that style due to mimicking the existing tests. I've ported almost all of them over to operating in a single process, which makes them both simpler and much faster (5s to .5s in many cases). The tests are now run with developer mode off. This was initially required due to the change where opening with a class subset is now considered a breaking change in developer mode, but now that test cases explicitly specify their types that isn't a problem any more. However, it does let us once again test subscriptions failing due to an unqueryable field, and that test revealed that we were using the wrong error domain for that error. I added some new helper functions for the things I discovered I was going to have to change in literally hundreds of places. Creating a temporary user is now just `self.createUser()` rather than separate steps of creating credentials and logging in. `self.name` is now used as the tag value for partitions and user names and such rather than `#function` or `NSStringFromSelector(_cmd)`, which makes it so that it doesn't have to be explicitly passed into helper functions. There were a number of places where this previously was done incorrectly and `#function` was used inside helper functions, which didn't achieve the desired effect.

tgoyne self-assigned this Dec 13, 2023

tgoyne force-pushed the tg/rework-sync-tests branch 3 times, most recently from 656fa80 to b22a853 Compare December 14, 2023 23:41

tgoyne changed the base branch from master to tg/write-transaction-notification December 14, 2023 23:55

tgoyne marked this pull request as ready for review December 15, 2023 02:15

tgoyne requested a review from dianaafanador3 December 15, 2023 02:15

tgoyne mentioned this pull request Dec 15, 2023

Upgrade to core 13.24.1 #8416

Merged

tgoyne force-pushed the tg/write-transaction-notification branch from 407d57b to a382ae5 Compare December 17, 2023 17:31

tgoyne force-pushed the tg/rework-sync-tests branch from b22a853 to a08817f Compare December 17, 2023 17:35

dianaafanador3 approved these changes Dec 18, 2023

View reviewed changes

tgoyne force-pushed the tg/write-transaction-notification branch from a382ae5 to bdd98d3 Compare December 19, 2023 04:39

tgoyne force-pushed the tg/rework-sync-tests branch 2 times, most recently from e2d0e0f to f2d95b6 Compare December 19, 2023 17:14

tgoyne added 10 commits December 19, 2023 11:19

Allow creating notifiers inside write transactions before the first c…

535097a

…hange Core has allowed this for a while, but we had our own validation which made it not work.

Release resources in RLMAsyncSubscriptionTask earlier

0b16b60

The RLMSyncSubscriptionSet was retained until the task was deallocated even after the task completed, which is not strictly incorrect but made a test unreliable.

Fulfill expectations even if an error occurred

5695a78

We'll never get a success after an error, so not fulfilling the expectation just pointless waits for a timeout.

Don't use developer mode in sync tests

63b7e5c

Upgrade to a newer baas version

bc93a88

Fix some unused variable warnings

95eb341

Add a few missing Sendable conformances

a5636c1

Add a bit of documentation for RLMSyncTestCase

710d6b6

tgoyne force-pushed the tg/rework-sync-tests branch from f2d95b6 to 710d6b6 Compare December 19, 2023 19:20

tgoyne force-pushed the tg/write-transaction-notification branch from bdd98d3 to 535097a Compare December 19, 2023 19:21

tgoyne changed the base branch from tg/write-transaction-notification to master December 19, 2023 20:22

tgoyne merged commit a73034c into master Dec 19, 2023
124 of 126 checks passed

tgoyne deleted the tg/rework-sync-tests branch December 19, 2023 21:36

github-actions bot locked as resolved and limited conversation to collaborators Mar 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rework the sync tests #8435

Rework the sync tests #8435

tgoyne commented Dec 13, 2023

dianaafanador3 left a comment

dianaafanador3 Dec 18, 2023

dianaafanador3 Dec 18, 2023

tgoyne Dec 18, 2023

dianaafanador3 Dec 18, 2023

dianaafanador3 Dec 18, 2023

dianaafanador3 Dec 18, 2023

tgoyne Dec 18, 2023

dianaafanador3 Dec 18, 2023 •

edited

Loading

dianaafanador3 Dec 18, 2023

tgoyne Dec 18, 2023

tgoyne commented Dec 19, 2023

dianaafanador3 commented Dec 19, 2023

Rework the sync tests #8435

Rework the sync tests #8435

Conversation

tgoyne commented Dec 13, 2023

dianaafanador3 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dianaafanador3 Dec 18, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tgoyne commented Dec 19, 2023

dianaafanador3 commented Dec 19, 2023

dianaafanador3 Dec 18, 2023 •

edited

Loading