feat(sql): Add an SQL permission repository #838

deverton · 2021-02-26T05:48:17Z

This PR maintains backwards compatibility and does not modify existing controllers.

Implements a permission repository backed by SQL with both MySQL and Postgres supported.

Enabling SQL as the permission repository is done as follows:

sql:
  enabled: true
  connectionPools:
    default:
      jdbcUrl: jdbc:mysql://.....
      user: ...
      password: ...
      default: true
  migration:
      jdbcUrl: jdbc:mysql://.....
      user: ...
      password: ...

permissionsRepository:
  redis:
    enabled: false
  sql:
    enabled: true

Integration tests have been updated to enable testing with the SQL provider (using testcontainers), but still default to Redis only as that's faster.

Depends on #835 a bit for testcontainer usage.

This PR maintains backwards compatibility and does not modify existing controllers. Implements a permission repository backed by SQL. No migration is provided as the data is recreated on the next role sync or at startup. Enabling SQL as the permission repository ```yaml sql: enabled: true connectionPools: default: jdbcUrl: jdbc:mysql://..... user: ... password: ... default: true migration: jdbcUrl: jdbc:mysql://..... user: ... password: ... permissionRepository: redis: enabled: false sql: enabled: true ``` Both MySQL and Postgres are supported. Integration tests have been updated to enable testing with the SQL provider (using testcontainers), but still default to Redis only as that's faster.

Attempting to optimise the delete results in dead locks so just wipe the table.

robzienert

I'm no fiat expert, but what you have here makes sense to me from a general technical perspective. Will defer to @jonsie and @cfieber for actual implementation bits.

Have you built and validated this change in any environment (e.g. load tests)?

Great work, thanks for taking this on.

robzienert · 2021-03-02T22:46:16Z

fiat-core/src/main/java/com/netflix/spinnaker/fiat/model/UserPermission.java

-            extensionResources.add(resource);
-          }
-        });
+    resources.forEach(this::addResource);


👍 Good improvement.

deverton · 2021-03-02T23:12:41Z

It's been subjected to some load testing in our staging environment. For the workload there we see performance is roughly the same. Results with the current set of patches and with one fiat instance running 4 vCPU and 8 GB.

Elasticache

Amazon ElastiCache running cache.m6g.2xlarge

Action	Min	Max	Average
`get`	35 ms	109 ms	57 ms
`put`	4 ms	16 ms	6 ms
`getAllById`	319 ms	915 ms	510 ms

SQL

Amazon RDS db.r6g.large running 5.7.mysql_aurora.2.09.1

Action	Min	Max	Average
`get`	28 ms	39 ms	34 ms
`put`	32 ms	63 ms	37 ms
`getAllById`	102 ms	130 ms	111 ms

deverton · 2021-03-02T23:18:35Z

Once I get the data migration logic in place we'll try rolling this in our production environment soonish.

jonsie

LGTM. Very curious to see how this performs in your production environment. I may take a shot at running this in our preprod env too.

deverton · 2021-03-03T04:10:21Z

Just ran in to an issue with how the dual repository looks up the beans from the context which I'm hoping to fix today.

jonsie · 2021-03-03T04:16:57Z

@deverton Fiat should rebuild all permissions on boot (and on a schedule) so I'm not sure the dual repository is necessary anyways.

deverton · 2021-03-03T04:57:13Z

It might be how we've got things configured, but new instances of Fiat don't seem to role sync immediately. There's typically roughly 10 minutes before the first scheduled sync runs and there's only a single instance in this case.

{"@timestamp":"2021-03-03T04:45:02.602+00:00","@version":1,"message":"Server is now HEALTHY. Hooray!","logger_name":"com.netflix.spinnaker.fiat.config.ResourceProvidersHealthIndicator","thread_name":"http-nio-0.0.0.0-7003-exec-2","level":"INFO","level_value":20000}
...
{"@timestamp":"2021-03-03T04:53:27.007+00:00","@version":1,"message":"Acquired Lock Lock{name='fiat.userrolessyncer', ownerName='spin-fiat-6954658dd9-nrhmj', leaseDurationMillis=10000, successIntervalMillis=600000, failureIntervalMillis=600000, version=1614747207004, ownerSystemTimestamp=1614747207004, attributes=''}.","logger_name":"com.netflix.spinnaker.kork.jedis.lock.RedisLockManager","thread_name":"scheduler-3","level":"INFO","level_value":20000}

deverton · 2021-03-03T05:37:45Z

I've dropped the dual repository code out of this PR for now. I'll do some more testing locally to see if the issues on cutover persist for us.

Would you like me to squash these commits up before merge?

jonsie · 2021-03-03T18:48:57Z

Yeah, 10 minutes is the default of syncDelayMs so perhaps you are using the default here? We have that dropped down to 2 minutes in our config.

No need to squash commits, I'll squash them on merge.

deverton · 2021-03-08T20:38:23Z

@jonsie so rolled this out in production and it seems to be working fine. Performance for bulk operations like getAllById() and getAllByRole() are much faster. The get() operation is also faster for our data set. The big downside is that put() is much, much slower, nearly 10x as much (50 ms vs 500 ms).

Overall, this means that the user role sync process is still faster than when we were on Redis but is still too slow. The majority of our sync time was spent in getAllById() on Redis so the slower put() is made up for by that.

The put() operation was implemented pretty naively by just deleting and re-inserting records so there's obviously room for improvement there.

jonsie · 2021-03-09T03:03:12Z

@deverton Yeah I was wondering about that delete/insert in the put operation but I decided not to say anything since it's +/- the same behavior in the Redis permission repository.

I think all in all this is a good sign though. We're running this in our test environment right now and things look good, I will promote this to our staging environment soon.

deverton marked this pull request as ready for review February 26, 2021 06:33

deverton requested review from cfieber, jonsie and robzienert as code owners February 26, 2021 06:33

Dan Everton added 4 commits March 1, 2021 14:14

fix(sql): delete dangling permissions properly

ed7dc73

fix(sql): simplify delete of permission

5c00fea

Attempting to optimise the delete results in dead locks so just wipe the table.

fix(sql): bulk insert of permissions

2060a27

fix(sql): specialised getAll* implementations

feb5eee

robzienert approved these changes Mar 2, 2021

View reviewed changes

Dan Everton added 2 commits March 3, 2021 11:27

fix(sql): correct configuration value to match class

485760e

fix(sql): add dual repository for migration

d86b180

jonsie approved these changes Mar 3, 2021

View reviewed changes

Dan Everton added 2 commits March 3, 2021 15:09

Merge remote-tracking branch 'origin/master' into fiat-sql

c9d4cb8

fix(sql): drop dual repository code for now

050425b

Merge branch 'master' into fiat-sql

242ab64

jonsie merged commit 1be70ba into spinnaker:master Mar 3, 2021

spinnakerbot added the target-release/1.26 label Mar 3, 2021

deverton mentioned this pull request Sep 19, 2021

REQUEST: New Reviewer status for deverton spinnaker/governance#269

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(sql): Add an SQL permission repository #838

feat(sql): Add an SQL permission repository #838

deverton commented Feb 26, 2021 •

edited

robzienert left a comment

robzienert Mar 2, 2021

deverton commented Mar 2, 2021 •

edited

deverton commented Mar 2, 2021

jonsie left a comment

deverton commented Mar 3, 2021

jonsie commented Mar 3, 2021

deverton commented Mar 3, 2021

deverton commented Mar 3, 2021

jonsie commented Mar 3, 2021

deverton commented Mar 8, 2021 •

edited

jonsie commented Mar 9, 2021

feat(sql): Add an SQL permission repository #838

feat(sql): Add an SQL permission repository #838

Conversation

deverton commented Feb 26, 2021 • edited

robzienert left a comment

Choose a reason for hiding this comment

robzienert Mar 2, 2021

Choose a reason for hiding this comment

deverton commented Mar 2, 2021 • edited

Elasticache

SQL

deverton commented Mar 2, 2021

jonsie left a comment

Choose a reason for hiding this comment

deverton commented Mar 3, 2021

jonsie commented Mar 3, 2021

deverton commented Mar 3, 2021

deverton commented Mar 3, 2021

jonsie commented Mar 3, 2021

deverton commented Mar 8, 2021 • edited

jonsie commented Mar 9, 2021

deverton commented Feb 26, 2021 •

edited

deverton commented Mar 2, 2021 •

edited

deverton commented Mar 8, 2021 •

edited