Additional safeguards against printing Secret content #474

KnVerey · 2019-04-24T20:44:24Z

What are you trying to accomplish with this PR?
Further reduce the likelihood that we'll accidentally print the content of sensitive resources (i.e. Secrets) when something goes wrong. Now that we officially support the Secret resource and push our EJSON secrets through that code path, this is super important.

How is this accomplished?
These are the ways I thought of that secrets could get logged:

They come from a regular template, and rendering it fails
They are missing critical data, and we raise during resource instance instantiation
They fail validation (i.e. dry run)
The apply they are part of fails, whether because of them or not

I added tests for the three additional cases (test_apply_failure_with_sensitive_resources_hides_raw_output covers the last one already) and tried to fix them as solidly as I could think to. record_invalid_template is a bit of a choke point for these problems, so in addition to fixing the specific cases, I made that method do a blunt scan of the content it is about to log and replace it if it contains kind: Secret.

I also made the ejson secret provisioner proactively validate the resulting resources and raise its own error if it isn't valid. This is duplicated effort, but it gives us stronger guarantees that nothing will go wrong further down the path.

~~Note that this is based on #473 to avoid linter noise.~~

What could go wrong?
It's still possible there's a code path that I haven't thought of that will still log secret content on error. It is also possible someone will add a new way for this to go wrong in the future, and nothing will stand in their way (note that our contributor's guide does warn about this: We handle Kubernetes secrets, so it is critical that changes do not cause the contents of any secrets in the template set to be logged.).

An alternative / additional safeguard could be to have our logger class scan every single string we give it for kind: Secret. However, that is very heavy-handed and arguably not effective, since not all strings with that text contain sensitive data, and not all secrets have that string (e.g. they could be invalid, or the string could be a partial resource).

If we don't think we can get to Good Enough here, we could revert Secret support. I would rather not though--the community asked for it, and deploying them always actually worked even before we made it official. Even if we took it a step further and prevented Secrets from being deployed with this gem, that still doesn't prevent this gem from handling them.

Another thing that occurred to me is that CRs could contain sensitive data for all we know. We should consider letting administrators tell us this in our CRD annotations. I'll open an issue about this.

@Shopify/cloudx

DazWorrall

Code LGTM

As for the philosophical question of how we should handle secrets: I think we should continue our best effort support, emphasise that it's just that, and commit to handling leaks in our logs as high priority issues. As you say the tool may still handle them anyway, and treating them like any other resource seems like it could result in more accidental leakage than our current approach. Actively refusing support doesn't seem pragmatic either, though perhaps we could add a flag for users to opt into this - if secrets are detected, we fail early and refuse to continue.

DazWorrall · 2019-04-25T10:27:05Z

lib/kubernetes-deploy/ejson_secret_provisioner.rb

@@ -62,7 +62,11 @@ def build_secrets

        secrets.map do |secret_name, secret_spec|
          validate_secret_spec(secret_name, secret_spec)
-          generate_secret_resource(secret_name, secret_spec["_type"], secret_spec["data"])
+          resource = generate_secret_resource(secret_name, secret_spec["_type"], secret_spec["data"])
+          unless resource.validate_definition(@kubectl)


Does this result in additional calls to the API server compared to today, or is it processed entirely locally? I ask because we've seen a recent increase in deploy failures to due overladed masters at the validation stage, and this might add additional load and exacerbate the issue. It's not a good argument to skip this, but perhaps raises the priority of adding some resiliency patterns around these calls.

It does make API calls, but these new ones will be in serial. I don't think we were aware that the validation site was a particular problem. It may be easy to add additional attempts, but I'd need to check what validation failure looks like to make sure the retries would be appropriate. Can you PM me an example?

KnVerey · 2019-04-25T14:57:49Z

I think we should continue our best effort support, emphasise that it's just that, and commit to handling leaks in our logs as high priority issues.

How does this sound as an addition to the README? I'm thinking it could go at the end of the Usage section.

NOTICE: Deploy Secret resources at your own risk. Although we will make every effort not to log their contents and will fix any reported leak vectors with urgency, we cannot guarantee that sensitive information will never be logged.

KnVerey requested a review from DazWorrall April 24, 2019 20:44

KnVerey force-pushed the secret_censoring branch 2 times, most recently from a06920f to 99497da Compare April 24, 2019 21:19

Additional safeguards against printing Secret content

f52b7ed

KnVerey force-pushed the secret_censoring branch from 99497da to f52b7ed Compare April 24, 2019 21:24

KnVerey changed the base branch from rubocop to master April 24, 2019 21:25

DazWorrall approved these changes Apr 25, 2019

View reviewed changes

dturn approved these changes Apr 25, 2019

View reviewed changes

Add notice about deploying Secrets

493df95

KnVerey merged commit 6ebe55a into master Apr 25, 2019

KnVerey deleted the secret_censoring branch April 25, 2019 18:45

KnVerey temporarily deployed to rubygems April 29, 2019 17:38 Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Additional safeguards against printing Secret content #474

Additional safeguards against printing Secret content #474

KnVerey commented Apr 24, 2019 •

edited

Loading

DazWorrall left a comment

DazWorrall Apr 25, 2019 •

edited

Loading

KnVerey Apr 25, 2019

KnVerey commented Apr 25, 2019

Additional safeguards against printing Secret content #474

Additional safeguards against printing Secret content #474

Conversation

KnVerey commented Apr 24, 2019 • edited Loading

DazWorrall left a comment

Choose a reason for hiding this comment

DazWorrall Apr 25, 2019 • edited Loading

Choose a reason for hiding this comment

KnVerey Apr 25, 2019

Choose a reason for hiding this comment

KnVerey commented Apr 25, 2019

KnVerey commented Apr 24, 2019 •

edited

Loading

DazWorrall Apr 25, 2019 •

edited

Loading