Pass arguments through promise chain instead of attaching to generator #186

jgraff2 · 2019-05-23T01:06:45Z

Pass arguments through the promise chain instead of attaching to the generator, so we don't have to clone generators for by-subject collection and we can track the OAuth token on the generator object.

This replaces the global auth tracking fix from #183.

Updated tests here: salesforce/refocus#1182

iamigo · 2019-05-23T16:26:15Z

fyi bunch of style changes but I'll start reviewing

iamigo

Nice! OK to merge once the jscs issues are cleaned up.

coveralls · 2019-05-23T18:44:05Z

Coverage decreased (-0.3%) to 93.617% when pulling 53c8844 on generator-oauth-fix into 3fdd9e5 on master.

iamigo · 2019-05-23T18:53:29Z

@jgraff2 have you tested this anywhere with an Argus generator?

jgraff2 · 2019-05-23T19:03:34Z

I ran it with the integration tests which include an OAuth generator and mocked auth endpoints.

pallavi2209 · 2019-05-23T21:35:10Z

src/remoteCollection/collect.js

-      return doCollect(_g);
-    }));
+  return getSubjectsForGenerator(generator)
+         .mapSeries((subject) =>


Would this mean the collections will happen sequentially instead of parallel (like before)? If yes, could this have any side effect? What if a repeater process is not finished until next repeater starts? Just throwing my thoughts out there.

Hmm, that's a good point. That would definitely be a plausible scenario when run sequentially if we had lots of subjects, and we don't have any way of detecting or handling that case. Thinking this through: the next repeat cycle would start while the previous one is still finishing up, so the last few requests of one cycle would be sent in parallel with the first few of the next. The sample upserts aren't sent until all requests have completed, and the requests are tracked in memory separately for each collection cycle, so the overlapping cycles wouldn't interfere with each other. They would still finish and send upserts a minute apart, as normal. So it would still work, just with some requests being sent in parallel, which is how it was before anyway. The only impacts would be a greater offset from the repeat cycle, and probably greater memory usage, because we would be tracking two collection cycles at once.

So I think if the repeater cycles overlapped it would be fine. However, just the fact that a collection could take a lot longer this way is maybe reason to reconsider.
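A minimal sketch of the sequential-versus-parallel difference being discussed. It uses native promises rather than the Bluebird chain in the diff, and `fakeRequest`, `collectSequentially`, and `collectInParallel` are hypothetical stand-ins, not functions from the collector:

```javascript
// Hypothetical stand-in for one data-source request; resolves after `ms`.
function fakeRequest(subject, ms, log) {
  log.push(`start ${subject}`);
  return new Promise((resolve) =>
    setTimeout(() => {
      log.push(`end ${subject}`);
      resolve(`${subject}: ok`);
    }, ms));
}

// Sequential: each request starts only after the previous one resolves,
// which is the ordering a series-style map gives.
async function collectSequentially(subjects, ms, log) {
  const results = [];
  for (const subject of subjects) {
    results.push(await fakeRequest(subject, ms, log));
  }
  return results;
}

// Parallel: every request is started up front, then awaited together.
function collectInParallel(subjects, ms, log) {
  return Promise.all(subjects.map((s) => fakeRequest(s, ms, log)));
}
```

With N subjects and a fixed per-request latency, the sequential version takes roughly N times as long as the parallel one, which is the cost being weighed here.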

I did it this way to make sure we only do the token request once. If we did it in parallel, when the token expires, the next collection cycle would send a separate auth request for each subject; this way it only sends it the first time. But maybe this is another case where it would be better to rethink the current approach than to try to force the solution to fit it.

The reason this is necessary in the first place is that the OAuth logic is implemented as part of sending the request. We could instead pull that out into a separate function, and run that once before sending any of the requests. I'm not sure exactly how that would work though, since we currently rely on the result of the requests to tell us whether the token needs to be regenerated. Maybe when the token expires we just miss that cycle and request a new one next time? Or if any of the requests fail, we re-do all of them?
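A rough sketch of why auth inside the request multiplies token requests under parallel collection, and how pulling it out avoids that. The auth server stub, the `state` object, and all function names are assumptions made for illustration:

```javascript
// Hypothetical auth server stub that counts the token requests it receives.
function makeAuthServer() {
  const server = {
    tokenRequests: 0,
    requestToken() {
      server.tokenRequests += 1;
      return Promise.resolve(`token-${server.tokenRequests}`);
    },
  };
  return server;
}

// Auth handled inside each request: when no token is cached, every
// parallel request sees "no token" before any of them has stored one,
// so each asks for its own.
async function requestWithInlineAuth(subject, state, auth) {
  if (!state.token) {
    state.token = await auth.requestToken();
  }
  return { subject, token: state.token };
}

// Auth pulled out: obtain the token once, then fan out to the subjects.
async function collectWithUpfrontAuth(subjects, state, auth) {
  if (!state.token) {
    state.token = await auth.requestToken();
  }
  return subjects.map((subject) => ({ subject, token: state.token }));
}
```

Running `requestWithInlineAuth` for three subjects through `Promise.all` with an empty `state` hits the stub three times; `collectWithUpfrontAuth` hits it once.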

Thank you both for thinking that through. Good question and good answer.

> The sample upserts aren't sent until all requests have completed

Does this mean that all the samples could be sitting and waiting because one of the requests is slow and eventually times out at 30s, and then, with retries, everything else is still waiting? So the rest of the samples don't get sent until that slow one finally times out again after all the retries and generates its error samples?

> Maybe when the token expires we just miss that cycle and request a new one next time?

IMO, missing a cycle is not an acceptable solution.

Correct, they won't get sent until all the promises complete.

Updated.

I moved the token creation and request retries outside the request-specific code, and went back to making the requests in parallel.

So now the by-subject collection flow looks like this:

1. Make a request to Refocus to get the subjects from the subject query.
2. If necessary, make a request to the OAuth server to get a new token.
3. For each subject, in parallel, make a request to the data source.
4. If any of the requests failed because of an expired token, retry the entire collection cycle, which will generate a new token.
5. If any of the requests failed for other reasons, do nothing.
6. Generate transform samples from the successful responses, or error samples from the failed ones.

This way, all the asynchronous stuff (subject query, token creation, retries) happens outside the subject loop.
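The flow described above could be sketched roughly as follows. This is not the code from the PR: the injected helpers (`getSubjects`, `getToken`, `requestData`, `toSamples`), the `state` object, the retry limit, and the use of a 401 status to signal an expired token are all assumptions made for illustration:

```javascript
// All helpers on `deps` are hypothetical injected functions, not the
// collector's real API. `state.token` holds the cached OAuth token.
async function collectBySubject(deps, state, retriesLeft = 1) {
  // Subject query.
  const subjects = await deps.getSubjects();

  // Token request, only if no token is cached.
  if (!state.token) {
    state.token = await deps.getToken();
  }

  // One request per subject, in parallel. Failures are captured per
  // subject instead of rejecting the whole batch.
  const outcomes = await Promise.all(subjects.map((subject) =>
    deps.requestData(subject, state.token)
      .then((res) => ({ subject, res }))
      .catch((err) => ({ subject, err }))));

  // Expired token (assumed here to surface as status 401): drop the
  // cached token and retry the whole cycle, which requests a new one.
  const expired = outcomes.some((o) => o.err && o.err.status === 401);
  if (expired && retriesLeft > 0) {
    state.token = null;
    return collectBySubject(deps, state, retriesLeft - 1);
  }

  // Other failures are left alone and become error samples.
  return outcomes.map((o) => deps.toSamples(o));
}
```

Because the retry re-enters the function from the top, the subject query runs again too, which matches "retry the entire collection cycle".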

pallavi2209 · 2019-05-23T21:46:14Z

I would suggest testing this manually as well since the tests are not very thorough.


pallavi2209 · 2019-05-28T18:37:08Z

@jgraff2 Were you able to test this manually?

jgraff2 · 2019-05-28T19:00:39Z

@pallavi2209 the tests here are not very thorough, but the integration tests include an OAuth section that tests the whole sequence. I did run it against those.

However, looking at them again I realized they only cover bulk requests. I will add some by-subject tests. I will also see if I can add tests to make sure it's only requesting a new token when necessary.

jgraff2 · 2019-06-04T23:06:21Z

Updated: further restructuring of the collection flow to ensure correct error handling for by-subject collection.

Previously a token login error would return a single error object, but the collection handler would be expecting an array, which would cause an error instead of generating error samples.
The problem was that the previous changes had moved the auxiliary requests outside of the subjects loop, but were still using the error handling flow from before, which relied on the errors being created within the loop.

Now, we only call the request handler if the requests were actually made; otherwise we handle the error and generate error samples in the repeater-level catch.
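A small sketch of the shape mismatch and the fix described here, under the assumption that the handler maps over a per-subject array. `handleResponses`, `errorSamplesFor`, `runCycle`, and the sample fields are hypothetical names, not the collector's actual functions:

```javascript
// Expects an array with one outcome per subject; passing it a single
// error object would throw because there is no `.map` to call.
function handleResponses(outcomes) {
  return outcomes.map((o) =>
    (o.err
      ? { name: o.subject, value: 'ERROR' }
      : { name: o.subject, value: o.res }));
}

// Builds one error sample per subject from a single cycle-level error.
function errorSamplesFor(subjects, err) {
  return subjects.map((subject) =>
    ({ name: subject, value: 'ERROR', message: err.message }));
}

// `prepare` stands for the auxiliary work (e.g. token login) and
// `sendRequests` for the per-subject requests; both are injected.
async function runCycle(subjects, prepare, sendRequests) {
  try {
    const token = await prepare(); // may reject with a single error
    const outcomes = await sendRequests(subjects, token);
    return handleResponses(outcomes); // reached only if requests were made
  } catch (err) {
    return errorSamplesFor(subjects, err); // cycle-level catch
  }
}
```

If `prepare` rejects, the request handler is never called and the catch produces the error samples directly.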

iamigo · 2019-06-06T16:35:17Z

@jgraff2 don't forget to npm publish!

jgraff2 · 2019-06-06T21:04:31Z

Published to npm.

jgraff2 added 3 commits May 22, 2019 18:03

core

256f39c

misc

307443f

tests

08dbe26

jgraff2 requested review from iamigo, pallavi2209 and kriscfoster and removed request for iamigo and kriscfoster May 23, 2019 01:06

jgraff2 changed the title ~~Generator oauth fix~~ Pass arguments through promise chain instead May 23, 2019

jgraff2 changed the title ~~Pass arguments through promise chain instead~~ Pass arguments through promise chain instead of attaching to generator May 23, 2019

iamigo approved these changes May 23, 2019


code style

2be199c

pallavi2209 reviewed May 23, 2019


Move token creation outside the request-specific code so we can collect for all subjects in parallel.

928478c

handle non-collection errors outside the repeater chain

fe6dc17

jgraff2 mentioned this pull request Jun 4, 2019

collector integration tests: run every test with both bulk and by-sub… salesforce/refocus#1182

Merged

code style

53c8844

jgraff2 requested a review from iamigo June 4, 2019 23:06

pallavi2209 approved these changes Jun 5, 2019


iamigo approved these changes Jun 6, 2019


jgraff2 merged commit 32ce4cd into master Jun 6, 2019

jgraff2 deleted the generator-oauth-fix branch June 6, 2019 21:02
