Fixes #23492: Perf regression of calling isFirefoxDefault from main thread #23556

jhugman · 2022-02-03T17:40:59Z

Perf regression calling slow isFirefoxDefault from the main thread.

Prior #23400, the default browser message was only displayed when part of an experiment. The check that isFirefoxDefault was only done as a final check before the message is displayed.

#23400 changed the order of this, so the slow check was done each time the toolbar was created.

Fixes #23492; this PR does three things:

it moves the check for isDefaultBrowser until after the check to see which message surface to display on (as was prior to First use of Nimbus FML plugin #23400 )
it makes the default browser message be able to turn off.
it changes the feature of showing the default browser message to be off by default (as was prior to First use of Nimbus FML plugin #23400 )

Now, the default browser message is displayable when an experiment toggles it on.

The isFirefoxDefault is done when the message is toggled on, but:

There are no plans to toggle this on in this release
The messaging system is being re-written to avoid these problems.

Pull Request checklist

Tests: This PR includes thorough tests or an explanation of why it does not
Screenshots: This PR includes screenshots or GIFs of the changes made or an explanation of why it does not
Accessibility: The code in this PR follows accessibility best practices or does not include any user facing features. In addition, it includes a screenshot of a successful accessibility scan to ensure no new defects are added to the product.

To download an APK when reviewing a PR:

click on Show All Checks,
click Details next to "Taskcluster (pull_request)" after it appears and then finishes with a green checkmark,
click on the "Fenix - assemble" task, then click "Run Artifacts".
the APK links should be on the left side of the screen, named for each CPU architecture

Amejia481 · 2022-02-03T17:58:01Z

app/src/main/java/org/mozilla/fenix/components/toolbar/DefaultToolbarMenu.kt

+        val config = FxNimbus.features.defaultBrowserMessage.value()
+        return if (
+            config.messageLocation == MessageSurfaceId.APP_MENU_ITEM &&
+            !browsers.isFirefoxDefaultBrowser


I'm not sure how BrowsersCache.all() could impact performace, but maybe we could make it a bit faster if we inline the BrowsersCache.all(context) call with the condition check, this way we will only call BrowsersCache.all( ) when it's the right message? . What do you think?

config.messageLocation == MessageSurfaceId.APP_MENU_ITEM && !BrowsersCache.all(context).isFirefoxDefaultBrowser

Amejia481

LGTM!

gabrielluong · 2022-02-03T19:38:16Z

Decision Task broke, restarting

mcomella · 2022-02-03T19:43:43Z

I measured the performance difference. Results from original bisection:

before regression: 1373ms median
regression: 1501ms

Results from this PR:

before PR: 1436ms
after PR: 1402ms

It seems something may have changed after the initial regression causing the overall regression to be 63ms rather than ~100ms. One notable visual difference is that in the regressing builds there is a new UI element, "Set as default browser". The home screen races various UI elements against each other (when it should inflate from the top) so that may contribute to the different results. This PR improves the regression by another 34ms so we've still regressed ~29ms from the original state. Ideally we can try to address that (e.g. if we regress 30ms 2 more times, we're back to where we started) but I didn't see anything obviously actionable in the profiles. I'd be curious to see what our Nightly results say about this PR rather than my local measurements.

jhugman

I've added more aggressive caching of calling into Rust and checking the browser default.

The FML builds on top of the getVariables() -> Variables API and generates straightline kotlin to access those Variables.

The rest of the PR #23400 converts existing Fenix code to use the generated code, so this Pr concentrates on that.

@mcomella please could you re-profile? (are there profiles on CI for PRs with the Performance label?)

jhugman · 2022-02-07T14:18:57Z

app/src/main/java/org/mozilla/fenix/home/HomeMenu.kt

-            nimbusValidation.settingsIcon,
-            "drawable",
-            context.packageName
-        )


Calling getIdentifier is entirely avoidable, for the sake of a validation.

jhugman · 2022-02-07T14:23:39Z

app/src/main/java/org/mozilla/fenix/utils/Settings.kt

+            } else {
+                false
+            }
+        } ?: false


These lines do several things:

we put all the checking logic into one place.

we now only call into Rust once for the default browser feature.

we now only check isFirefoxDefaultBrowser after we check the surface id. There are no experiments about this currently, so this never happens now.

Amejia481 · 2022-02-07T22:05:38Z

One notable visual difference is that in the regressing builds there is a new UI element, "Set as default browser".

@jhugman this looks similar to what @mcomella described on #23556 (comment) see #23618 (comment)

Amejia481 · 2022-02-07T22:54:26Z

One notable visual difference is that in the regressing builds there is a new UI element, "Set as default browser".

@jhugman this looks similar to what @mcomella described on #23556 (comment) see #23618 (comment)

I think what is causing the regression is what @mcomella was describing above the "Set as default browser" card is showing all the time. Investigating #23618 I found that
val isExperimentBranch = feature.messageLocation == MessageSurfaceId.HOMESCREEN_BANNER it's always true after 82a6f8c, it looks like this was not the case before. See #23618 (comment) for more context.

mcomella · 2022-02-08T00:44:04Z

@mcomella please could you re-profile? (are there profiles on CI for PRs with the Performance label?)

I tried to check out and run this PR but it crashes on start up with:

02-07 16:39:03.891 19530 19563 E AndroidRuntime: FATAL EXCEPTION: DefaultDispatcher-worker-5
02-07 16:39:03.891 19530 19563 E AndroidRuntime: Process: org.mozilla.fenix, PID: 19530
02-07 16:39:03.891 19530 19563 E AndroidRuntime: org.mozilla.experiments.nimbus.internal.NimbusFeatureException: A Context is needed but not available. Consider passing in a context to the value() method when close to startup
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at org.mozilla.experiments.nimbus.internal.FeatureHolder.value$default(FeatureHolder.kt:7)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at org.mozilla.fenix.utils.Settings$searchTermTabGroupsAreEnabled$2.invoke(Settings.kt:4)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at org.mozilla.fenix.components.settings.LazyPreference$property$2.invoke(FeatureFlagPreference.kt:4)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at kotlin.SynchronizedLazyImpl.getValue(LazyJVM.kt:5)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at org.mozilla.fenix.components.settings.LazyPreference.getValue(FeatureFlagPreference.kt:3)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at org.mozilla.fenix.utils.Settings.getSearchTermTabGroupsAreEnabled(Settings.kt:1)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at org.mozilla.fenix.FenixApplication$initializeGlean$2.invokeSuspend(FenixApplication.kt:23)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at kotlin.coroutines.jvm.internal.BaseContinuationImpl.resumeWith(ContinuationImpl.kt:3)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at kotlinx.coroutines.DispatchedTask.run(DispatchedTask.kt:18)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at kotlinx.coroutines.scheduling.CoroutineScheduler.runSafely(CoroutineScheduler.kt:1)
02-07 16:39:03.891 19530 19563 E AndroidRuntime:        at kotlinx.coroutines.scheduling.CoroutineScheduler$Worker.run(CoroutineScheduler.kt:10)

@jhugman btw, did you mean a benchmark or a profile?

(are there profiles on CI for PRs with the Performance label?)

Unfortunately, no – we haven't had the resources to implement something like that (e.g. our daily graphs from nightly are actually manually run with a script on a device on my desk).

For benchmarks, you can run it locally if you have a low-end phone (or a high-end phone you can reproduce the regression on). See https://wiki.mozilla.org/Performance/Fenix/Performance_reviews#Benchmark_locally. This regression was to the cold_main_first_frame benchmark.

jhugman · 2022-02-08T11:37:40Z

Ah. Ok yes, I keep forgetting to say: The default browser message was shown as part of the #23400 , and this PR fixes that.

I've added

Fixes #23556

to comment 0.

Fixed this early startup crash, with the suggested fix.

 org.mozilla.experiments.nimbus.internal.NimbusFeatureException: A Context is needed but not available. Consider passing in a context to the value() method when close to startup
        …
        at org.mozilla.fenix.utils.Settings$searchTermTabGroupsAreEnabled$2.invoke(Settings.kt:4)

… the main thread

jhugman · 2022-02-08T11:43:50Z

@jhugman btw, did you mean a benchmark or a profile?

I'm not sure, whichever you needed to find that 102ms regression.

Also question from the curious: given the precision of ms you're giving, I assumed that the error bars were small, however, knowing you're benchmarking on a local machine, I have no idea now. What sort of variation/error bars do you see in timings?

mcomella · 2022-02-08T19:01:45Z

@jhugman btw, did you mean a benchmark or a profile?

I'm not sure, whichever you needed to find that 102ms regression.

Sure. Results from original bisection:

before regression: 1373ms median
regression: 1501ms

Results from this PR, taken with our cold_main_first_frame benchmark run on a Moto G5 (i.e. ./perf-tools/measure_start_up.py nightly cold_main_first_frame results.txt):

before PR: 1443ms
after PR: 1406ms

These results are similar to when I last measured this PR.

Also question from the curious: given the precision of ms you're giving, I assumed that the error bars were small, however, knowing you're benchmarking on a local machine, I have no idea now. What sort of variation/error bars do you see in timings?

I don't know statistics well so I'm not sure how well I can answer this question 😓 (and yes, our results could probably be improved by someone more familiar with statistics). Through trial-and-error we decided to execute 25 iterations of each test and to take the median values of the results to find the duration of a benchmark for a build. When we originally set these values, this had a maximum variation of ~20ms for an A/A test. In the worst case, I think this would let us catch regressions of ~40ms+ (based on this thinking I did). However, the average variation is usually smaller and we can look at the results across multiple days (so hundreds of iterations instead of 25) so in practice we've caught smaller regressions. The variation seems larger these days though: maybe ~30ms. Our nightly benchmark results dashboards (populated once a week on mondays) is here if you're curious.

For context, we use local machines right now rather than CI due to limited resourcing: CI would be ideal because it's more accessible but it takes more time to implement and debug when issues occur.

…lt from main thread (mozilla-mobile#23556) * Fixes mozilla-mobile#23492 — Fixup perf regression of calling isFirefoxDefault from the main thread * Tightening of near defunct default browser message * Fixup early crash in debug build * ktlint

jhugman requested review from mcomella and Amejia481 February 3, 2022 17:40

jhugman requested review from a team as code owners February 3, 2022 17:41

Amejia481 closed this Feb 3, 2022

Amejia481 reopened this Feb 3, 2022

Amejia481 reviewed Feb 3, 2022

View reviewed changes

Amejia481 approved these changes Feb 3, 2022

View reviewed changes

jhugman added the pr:needs-landing-squashed PRs that are ready to land (squashed) [Will be merged by Mergify] label Feb 3, 2022

gabrielluong closed this Feb 3, 2022

gabrielluong reopened this Feb 3, 2022

jhugman force-pushed the jhugman/23492-perf-regression-on-isFirefoxDefault branch from 5832ed9 to 014f6bb Compare February 7, 2022 11:54

jhugman commented Feb 7, 2022

View reviewed changes

jhugman force-pushed the jhugman/23492-perf-regression-on-isFirefoxDefault branch from 5428f35 to d96e51b Compare February 8, 2022 11:32

jhugman added 3 commits February 8, 2022 11:37

Fixes #23492 — Fixup perf regression of calling isFirefoxDefault from…

c0a5d96

… the main thread

Tightening of near defunct default browser message

6683a37

Fixup early crash in debug build

4a22af0

jhugman force-pushed the jhugman/23492-perf-regression-on-isFirefoxDefault branch from d96e51b to 4a22af0 Compare February 8, 2022 11:38

ktlint

6eff2ab

mergify bot merged commit b230c39 into main Feb 8, 2022

bors bot deleted the jhugman/23492-perf-regression-on-isFirefoxDefault branch February 8, 2022 12:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes #23492: Perf regression of calling isFirefoxDefault from main thread #23556

Fixes #23492: Perf regression of calling isFirefoxDefault from main thread #23556

jhugman commented Feb 3, 2022 •

edited

Amejia481 Feb 3, 2022

Amejia481 left a comment

gabrielluong commented Feb 3, 2022

mcomella commented Feb 3, 2022

jhugman left a comment

jhugman Feb 7, 2022

jhugman Feb 7, 2022

Amejia481 commented Feb 7, 2022

Amejia481 commented Feb 7, 2022 •

edited

mcomella commented Feb 8, 2022

jhugman commented Feb 8, 2022 •

edited

jhugman commented Feb 8, 2022

mcomella commented Feb 8, 2022

Fixes #23492: Perf regression of calling isFirefoxDefault from main thread #23556

Fixes #23492: Perf regression of calling isFirefoxDefault from main thread #23556

Conversation

jhugman commented Feb 3, 2022 • edited

Pull Request checklist

To download an APK when reviewing a PR:

Amejia481 Feb 3, 2022

Choose a reason for hiding this comment

Amejia481 left a comment

Choose a reason for hiding this comment

gabrielluong commented Feb 3, 2022

mcomella commented Feb 3, 2022

jhugman left a comment

Choose a reason for hiding this comment

jhugman Feb 7, 2022

Choose a reason for hiding this comment

jhugman Feb 7, 2022

Choose a reason for hiding this comment

Amejia481 commented Feb 7, 2022

Amejia481 commented Feb 7, 2022 • edited

mcomella commented Feb 8, 2022

jhugman commented Feb 8, 2022 • edited

jhugman commented Feb 8, 2022

mcomella commented Feb 8, 2022

jhugman commented Feb 3, 2022 •

edited

Amejia481 commented Feb 7, 2022 •

edited

jhugman commented Feb 8, 2022 •

edited