Agent CLI should be less-brittle when starting by acunniffe · Pull Request #207 · opticdev/optic

acunniffe · 2020-06-02T13:45:35Z

I made some changes to the Agent CLI to make it less brittle in production.

The run command would crash on any invalid input to --config: invalid tokens, strings, JSON of a different shape, etc. I wrapped that deserialization in a try/catch so we can branch the code.

Branch 1 is followed when your start config and monitoring config are both valid. It starts the agent as usual.

Branch 2 is followed when the start config is valid and the monitoring config is not. It starts the API and proxy as usual and swaps out the agent persistence manager for the DoNothingCapture -- just a proxy with no recording, batching, etc :)
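The two branches above could be sketched roughly as follows. This is an illustration under assumptions, not the code in this PR: only the name DoNothingCapture comes from the description, while `MonitoringConfig` and its fields, `RecordingCapture`, `parseMonitoringConfig`, `chooseCapture`, and the idea that the config is plain JSON are all hypothetical.

```typescript
// Hypothetical shape of the monitoring config; the real fields are not shown in this PR.
interface MonitoringConfig {
  agentGroupId: string;
  uploadUrl: string;
}

interface Capture {
  readonly kind: string;
  record(interaction: unknown): void;
}

// Hypothetical stand-in for the agent persistence manager: it keeps what it is given.
class RecordingCapture implements Capture {
  readonly kind = 'recording';
  readonly buffered: unknown[] = [];
  constructor(readonly config: MonitoringConfig) {}
  record(interaction: unknown): void {
    this.buffered.push(interaction);
  }
}

// Named after the class in the description; the body is a guess: it drops everything.
class DoNothingCapture implements Capture {
  readonly kind = 'do-nothing';
  record(_interaction: unknown): void {}
}

// Throws on missing input, invalid JSON, or JSON of a different shape.
function parseMonitoringConfig(raw: string | undefined): MonitoringConfig {
  if (!raw) throw new Error('monitoring config is missing');
  const value: unknown = JSON.parse(raw);
  if (typeof value !== 'object' || value === null) {
    throw new Error('monitoring config must be a JSON object');
  }
  const candidate = value as Record<string, unknown>;
  if (typeof candidate.agentGroupId !== 'string' || typeof candidate.uploadUrl !== 'string') {
    throw new Error('monitoring config has an unexpected shape');
  }
  return { agentGroupId: candidate.agentGroupId, uploadUrl: candidate.uploadUrl };
}

// Branch 1: valid config, record as usual. Branch 2: fall back to a plain proxy.
function chooseCapture(raw: string | undefined): Capture {
  try {
    return new RecordingCapture(parseMonitoringConfig(raw));
  } catch (error) {
    const reason = error instanceof Error ? error.message : String(error);
    console.error(`Monitoring disabled, proxying only: ${reason}`);
    return new DoNothingCapture();
  }
}
```

In this sketch the fallback still logs why monitoring was disabled, so the second branch is never taken silently.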

Optic CI API having downtime, or some accidental clearing of the config environment variable should never affect your ability to deploy your code to prod.

I also renamed --masquerade to --listen to match other proxies/http CLIs. Also, the context around the word masquerade is not welcome cognitive load for security teams.

JaapRood · 2020-06-02T14:21:45Z

 //@ts-ignore
 import jwtDecode from 'jwt-decode';
+//@ts-ignore
+import niceTry from 'nice-try';


🚨 swallowing errors, pretty much never the thing we want to do. I totally agree that the UX should be nice and that the Optic service being down shouldn't stop services. Swallowing errors, however, is the total enemy of visibility and completely ignores that thrown errors could have put the process in an undefined state.

I don't see it used anywhere, but it shouldn't even be imported. We have to do the work of actually hardening this, not just hide the issues!
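To make the concern concrete, here is a small sketch contrasting a swallow-everything helper with one that keeps the error around. `swallow` is a local stand-in written for illustration, not the nice-try package itself, and `attempt` and `Attempt` are hypothetical names.

```typescript
// Local stand-in for a swallow-everything helper (illustrative only).
function swallow<T>(fn: () => T): T | undefined {
  try {
    return fn();
  } catch (_ignored) {
    return undefined; // the reason for the failure is lost here
  }
}

// Alternative: return the error as a value so the caller has to look at it.
type Attempt<T> = { ok: true; value: T } | { ok: false; error: Error };

function attempt<T>(fn: () => T): Attempt<T> {
  try {
    return { ok: true, value: fn() };
  } catch (thrown) {
    const error = thrown instanceof Error ? thrown : new Error(String(thrown));
    return { ok: false, error };
  }
}

// Both calls fail to parse, but only the second can say why.
const swallowed = swallow(() => JSON.parse('{not json'));
const reported = attempt(() => JSON.parse('{not json'));
if (!reported.ok) {
  console.error(`config parse failed: ${reported.error.message}`);
}
```

With `swallow`, a failed parse is indistinguishable from a function that legitimately returned `undefined`; with `attempt`, the failure stays visible and can be logged or escalated.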

JaapRood · 2020-06-02T14:27:20Z

Optic CI API having downtime, or some accidental clearing of the config environment variable should never affect your ability to deploy your code to prod.

Totally agree here. Expected errors in operating a distributed system should never bring the process down. Network partitions, downtime of other systems, all of that is expected.

However, misconfiguration should always be as loud as possible and fail a deployment. There's nothing as frustrating, when running multiple instances on servers with log outputs all collated, as having no idea why things aren't working, only to find out there is 1 tiny log line amongst millions indicating a failed parse somewhere, causing an entire branch to go ignored.

When users choose to deploy Optic in a place, that's a direct intent, so if they have misconfigured things, it's our job to let them know.
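One way to express that split is to classify startup failures explicitly. The sketch below is an assumption about how such a policy might look, not what this PR implements; `MisconfigurationError`, `UpstreamUnavailableError`, `StartupDecision`, and `decideStartup` are all hypothetical names.

```typescript
// Hypothetical error for bad user input, such as an unparseable --config value.
class MisconfigurationError extends Error {
  constructor(message: string) {
    super(message);
    this.name = 'MisconfigurationError';
    // Keeps instanceof working even when compiled to older JavaScript targets.
    Object.setPrototypeOf(this, MisconfigurationError.prototype);
  }
}

// Hypothetical error for expected operational trouble, such as a remote API being down.
class UpstreamUnavailableError extends Error {
  constructor(message: string) {
    super(message);
    this.name = 'UpstreamUnavailableError';
    Object.setPrototypeOf(this, UpstreamUnavailableError.prototype);
  }
}

type StartupDecision =
  | { action: 'start' }
  | { action: 'start-degraded'; reason: string }
  | { action: 'fail'; reason: string };

// Runs a startup check and maps each failure class to a deliberate outcome.
function decideStartup(check: () => void): StartupDecision {
  try {
    check();
    return { action: 'start' };
  } catch (error) {
    if (error instanceof UpstreamUnavailableError) {
      // Expected in a distributed system: keep serving, without monitoring.
      return { action: 'start-degraded', reason: error.message };
    }
    if (error instanceof MisconfigurationError) {
      // The user asked for monitoring and got it wrong: fail loudly.
      return { action: 'fail', reason: error.message };
    }
    throw error; // anything unrecognized is not swallowed
  }
}
```

A caller would typically exit with a non-zero status on `fail`, so a bad config stops the deployment, and log a prominent warning on `start-degraded`.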

acunniffe · 2020-06-02T14:32:43Z

Good points @JaapRood -- I'll find a better balance between preventing Optic from being the thing that takes down your production infra and providing better visibility.

Pending :)

JaapRood · 2020-06-02T14:41:42Z

Past-Jaap who spent hours sifting through aggregated logs looking for similar issues thanks you!

JaapRood

LGTM!

Aidan Cunniffe added 3 commits June 2, 2020 08:23

ui should have its build dependencies in devDependencies

40e2ec8

Agent CLI should be less brittle since it's going into production

a0b2497

Agent CLI should be less brittle since it's going into production

9bed7e4

JaapRood suggested changes Jun 2, 2020

Aidan Cunniffe added 3 commits June 2, 2020 10:46

stopped swallowing errors, made issues more transparent

b934152

Switched to ocliff conventions

e894e81

updated test case

5e0b5c9

JaapRood approved these changes Jun 2, 2020

Merge branch 'develop' into qa-fixes

52f77f0

acunniffe merged commit a4224f8 into opticdev:develop Jun 3, 2020

acunniffe merged 7 commits into opticdev:develop from acunniffe:qa-fixes
