Add offline use case for 'migrateFromSessionToken' #2492

vladikoff · 2020-01-20T20:20:36Z

Fixes #2396

Pull Request checklist

Quality: This PR builds and tests run cleanly
- cargo test --all produces no test failures
- cargo clippy --all --all-targets --all-features runs without emitting any warnings
- cargo fmt does not produce any changes to the code
- ./gradlew ktlint detekt runs without emitting any warnings
- swiftformat --swiftversion 4 megazords components/*/ios && swiftlint runs without emitting any warnings or producing changes
- Note: For changes that need extra cross-platform testing, consider adding [ci full] to the PR title.
Tests: This PR includes thorough tests or an explanation of why it does not
Changelog: This PR includes a changelog entry in CHANGES_UNRELEASED.md or an explanation of why it does not need one
- Any breaking changes to Swift or Kotlin binding APIs are noted explicitly
Dependencies: This PR follows our dependency management guidelines
- Any new dependencies are accompanied by a summary of the due dilligence applied in selecting them.

vladikoff · 2020-01-21T04:25:30Z

@eoger @rfk simpler approach to this now based on what we learned in the past PR. r?

rfk · 2020-01-21T04:38:57Z

Thanks @vladikoff; I'm adding @grigoryk as reviewer for the proposed approach from the consumer side.

rfk

This is definitely a lot simpler!

Both of my high-level comments from the previous PR still kind of apply here though:

How do we handle the difference between transient and fatal errors?
How can we effectively test the above in CI?

(I don't want to get too hung up on tests here, because we plan for this code to have a short shelf-life. But we also don't want to be accidentally breaking it as we land other fixes in the leadup to the migration release).

CHANGES_UNRELEASED.md

+### Features
+
+* `migrateFromSessionToken` now handles offline use cases. It caches the data the consumers originally provide.
+  If there's no network connectivity then the migration could be retried using the new `retryMigrateFromSessionToken` method. 


rfk · 2020-01-21T04:45:32Z

components/fxa-client/examples/migration.rs

+            Ok(migration_result) => migration_result,
+            Err(err) => {
+                println!("Error: {}", err);
+                // test offline behaviour


Unless this gets run in CI on a regular basis, I'm not sure we can claim it as a "test" ;-)

rfk · 2020-01-21T04:49:23Z

components/fxa-client/examples/migration.rs

+                    let retry = fxa.try_migration();
+                    match retry {
+                        Ok(result) => break result,
+                        Err(_) => println!("Retrying... Are you connected to the internet?"),


Continuing discussion from the previous PR, each migration attempt could fail for either both fatal or non-fatal reasons. It would be useful for this example to show how to handle the two different cases. For example right now, if I enter an invalid sessionToken into this example script, then IIUC it will loop forever asking me whether I'm connected to the internet. We don't want to get anyone's migrated Fennec stuck in such a loop in practice.

(I'll also note that "not connected to the internet" is not necessarily the only transient error; 500-level server errors should probably also be treated as transient and retried).

I know it's only for testing purposes, but I'd like to see this example use the same logic as we expect consumers to use in practice, which IIUC is basically:

match migrate_from_session_token(...) { Ok(result) => result, Err(...) => { while is_in_migration_state() { sleep(...) match retry_migrate_from_session_token() { Ok(result) => result, Err(...) => continue, } } } }

grigoryk

This is a good start, but I'm not sure how well the proposed API will work in practice.

From an a-c perspective, I expected to see one of:

automagical behaviour - a failed attempt persists FxA state, pretends to succeed, and will attempt to re-auth on FxA API access. On startup, a-c would detect that there's persisted state, and a call to ensureCapabilities will attempt to re-auth as well. Non-transient failures will reset this process.
diy approach - an ability to detect if there was an attempt to sign-in, which failed and could be retried

First option will make the integration much smoother, of course - we probably won't need to change anything in a-c, nor introduce any new UI states in Fenix. Second option is arguably much simpler for ya'll, but the complexity is moved upwards - in a-c/fenix, we now get a new state (attempted to sign-in via migration data, failed, but could retry) that needs to be managed and integrated with the existing flow.

What I see is close to second, but with some details missing. How can I figure out if I need to retry? What is the state of the world after a failed retry attempt? What actions will work, and what won't work? What state should the client end-up in right after this failure? Can a client attempt a fresh sign-in? What should happen after an app restart?

We already have a possibility of transient network issues at the very end of the sign-in process - perhaps we can adopt a similar approach for both of these problems (transient migration failures and transient regular sign-in flow failures)? E.g. IIUC, completeOAuthFlow failures could be treated similarly, with a caveat that code(and maybe state?) has a shorter lifespan.

Also, nit - it'll be very nice if there were some docs on what's in the returned JSON blobs - or at least a comment with a struct name, to make looking this stuff up easier.

vladikoff · 2020-01-23T22:55:02Z

automagical behaviour - a failed attempt persists FxA state, pretends to succeed, and will attempt to re-auth on FxA API access.

We tried that in the other approach and it didn't pan out. We will possibly revisit that in the future once we simplify the state persistence

rfk · 2020-01-23T22:58:06Z

We tried that in the other approach and it didn't pan out

To add a bit of context from the previous conversations, that approach turned out to be surprisingly invasive, with lots of methods suddenly having the potential to modify the account state and need to trigger the persistence callback.

vladikoff · 2020-01-26T17:56:08Z

Updated!

rfk

Thanks @vladikoff. The shape of the API here looks inline with what we discussed in the meeting last week.

However, I still think we're missing some logic to differentiate between transient errors (e.g. network failure, server error) and permanent errors (e.g. bad session token). We don't need to expose that over the FFI, but we do still need to handle it internally, otherwise we risk being permanently stuck with isInMigrationState returning true but retryMigrateFromSessionToken failing with the same permanent error over and over again.

My suggestion would be to have an outer catch block around try_migration, something like:

pub fn try_migration() {
  let res = { ... all the code that's there currently ... }
  if let Err(err) = res {
    // If it's a transient error, leave the state so we can retry later.
    if (err is a network error or  500-server-error) {
      return res;
    }
  }
  // We've either succeeded, or are never going to succeed.
  self.state.in_flight_migration = None;
  res
}

rfk · 2020-01-27T10:22:27Z

components/fxa-client/examples/migration.rs

+                    let retry = fxa.try_migration();
+                    match retry {
+                        Ok(result) => break result,
+                        Err(_) => println!("Retrying... Are you connected to the internet?"),


I know it's only for testing purposes, but I'd like to see this example use the same logic as we expect consumers to use in practice, which IIUC is basically:

match migrate_from_session_token(...) { Ok(result) => result, Err(...) => { while is_in_migration_state() { sleep(...) match retry_migrate_from_session_token() { Ok(result) => result, Err(...) => continue, } } } }

rfk · 2020-01-27T10:27:41Z

CHANGES_UNRELEASED.md

+* `migrateFromSessionToken` now handles offline use cases. It caches the data the consumers originally provide.
+  If there's no network connectivity then the migration could be retried using the new `retryMigrateFromSessionToken` method. 
+  Consumers may also use the `isInMigrationState` method to check if there's a migration in progress.
+  ([#2492](https://github.com/mozilla/application-services/pull/2492))


I suggest rewording this for clarity, to focus on the things that consumers need to know in the order they need to know them. Along the lines of:

migrateFromSessionToken now has special handling for transient failures such as network errors. If the migration fails due to a transient error, then the provided credentials will be cached so that the migration can be retried later. Consumers can call the isInMigrationState method to check if there is a cached migration in progress, and retryMigrateFromSessionToken to retry it.

rfk · 2020-01-27T10:30:02Z

components/fxa-client/src/migrator.rs

+        self.try_migration()
+    }
+
+    /// Check if the client is


If the client is what? :-P

rfk · 2020-01-27T10:31:31Z

components/fxa-client/src/migrator.rs

+        self.state.in_flight_migration.is_some()
+    }
+
+    pub fn try_migration(&mut self) -> Result<FxAMigrationResult> {


Naming nit: I think the overall structure of the code would be a bit clearer if this were called retry_migrate_from_session_token for symmetry with what it's called in the higher layers.

rfk

I have many remaining nits, but let's get this into a build and try it out ;-)

Fixes #2396

vladikoff force-pushed the migration-offline-4 branch from 23d2d0a to 33a2cca Compare January 20, 2020 20:22

vladikoff mentioned this pull request Jan 20, 2020

Add offline use case for migrateFromSessionToken #2481

Closed

4 tasks

vladikoff force-pushed the migration-offline-4 branch 4 times, most recently from 09aa470 to b6f3266 Compare January 21, 2020 04:20

vladikoff requested review from eoger and rfk January 21, 2020 04:22

rfk requested a review from grigoryk January 21, 2020 04:38

rfk reviewed Jan 21, 2020

View reviewed changes

grigoryk reviewed Jan 23, 2020

View reviewed changes

vladikoff force-pushed the migration-offline-4 branch 4 times, most recently from a781306 to 60dfc28 Compare January 26, 2020 17:55

rfk suggested changes Jan 27, 2020

View reviewed changes

eoger mentioned this pull request Jan 29, 2020

Expose session token migration methods to Swift FxAccountManager #2435

Closed

vladikoff force-pushed the migration-offline-4 branch from 60dfc28 to 18eb8d3 Compare January 31, 2020 12:48

rfk approved these changes Jan 31, 2020

View reviewed changes

vladikoff force-pushed the migration-offline-4 branch from 18eb8d3 to 3fbfeeb Compare January 31, 2020 13:11

Add offline use case for 'migrateFromSessionToken'

5f67f64

Fixes #2396

vladikoff force-pushed the migration-offline-4 branch from 3fbfeeb to 5f67f64 Compare January 31, 2020 14:51

eoger approved these changes Jan 31, 2020

View reviewed changes

vladikoff merged commit 807c176 into master Jan 31, 2020

rfk deleted the migration-offline-4 branch June 7, 2021 10:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add offline use case for 'migrateFromSessionToken' #2492

Add offline use case for 'migrateFromSessionToken' #2492

vladikoff commented Jan 20, 2020 •

edited

vladikoff commented Jan 21, 2020

rfk commented Jan 21, 2020

rfk left a comment

This comment was marked as resolved.

rfk Jan 21, 2020

rfk Jan 21, 2020

rfk Jan 21, 2020

rfk Jan 27, 2020

grigoryk left a comment

vladikoff commented Jan 23, 2020

rfk commented Jan 23, 2020 •

edited

vladikoff commented Jan 26, 2020

rfk left a comment

rfk Jan 27, 2020

rfk Jan 27, 2020

rfk Jan 27, 2020

rfk Jan 27, 2020

rfk left a comment

Add offline use case for 'migrateFromSessionToken' #2492

Add offline use case for 'migrateFromSessionToken' #2492

Conversation

vladikoff commented Jan 20, 2020 • edited

Pull Request checklist

vladikoff commented Jan 21, 2020

rfk commented Jan 21, 2020

rfk left a comment

Choose a reason for hiding this comment

This comment was marked as resolved.

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

grigoryk left a comment

Choose a reason for hiding this comment

vladikoff commented Jan 23, 2020

rfk commented Jan 23, 2020 • edited

vladikoff commented Jan 26, 2020

rfk left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rfk left a comment

Choose a reason for hiding this comment

vladikoff commented Jan 20, 2020 •

edited

rfk commented Jan 23, 2020 •

edited