
KNOX-2658 - Skipping in-memory lookup while fetching expiration/metadata for a token#490

Merged
smolnar82 merged 1 commit intoapache:masterfrom
smolnar82:KNOX-2658
Sep 9, 2021

Conversation

@smolnar82
Contributor

What changes were proposed in this pull request?

As described in the JIRA, the in-memory lookup is skipped in the JDBC token state service when fetching token expiration or metadata; instead, this implementation goes directly to the DB.
Since those queries are simple (no table joins involved) and they rely on indexed columns, there is no significant performance impact.
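The change can be sketched roughly as below. `TokenDb` and `JdbcTokenStateService` here are simplified, hypothetical stand-ins for illustration, not the actual Knox classes:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical stand-in for the DB-backed token table; the real Knox
// implementation issues a simple SELECT on an indexed token-id column.
interface TokenDb {
    long selectExpiration(String tokenId);
}

class JdbcTokenStateService {
    // An in-memory cache may still exist for data that never changes,
    // but it is no longer consulted for expiration/metadata reads.
    private final Map<String, Long> inMemoryExpirations = new ConcurrentHashMap<>();
    private final TokenDb db;

    JdbcTokenStateService(TokenDb db) {
        this.db = db;
    }

    // Before the fix: return the in-memory value when present, which could
    // be stale on other HA nodes. After the fix: always read the current
    // value from the DB so every node sees the same state.
    long getTokenExpiration(String tokenId) {
        return db.selectExpiration(tokenId);
    }
}
```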

How was this patch tested?

Updated JUnit test cases and repeated the same steps as described in the JIRA. With my fix, a request using a disabled token fails on node 2 like this:

$ curl -ku Passcode:WXpVNVlUZGlNRGt0WWpFMk5TMDBZamxsTFRobVpEY3ROV0psWW1WbE1EVTRZakF4OjpOVFptWW1VNVlXVXROelppTVMwME5URmhMVGcxWXpRdFl6Z3hNVEUwTmpkak5XUTA= https://localhost:8444/gateway/tokenbased/webhdfs/v1?op=LISTSTATUS
<html>
<head>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8"/>
<title>Error 401 Token c59a7b09...5bebee058b01 is disabled</title>
</head>
<body><h2>HTTP ERROR 401 Token c59a7b09...5bebee058b01 is disabled</h2>
<table>
<tr><th>URI:</th><td>/gateway/tokenbased/webhdfs/v1</td></tr>
<tr><th>STATUS:</th><td>401</td></tr>
<tr><th>MESSAGE:</th><td>Token c59a7b09...5bebee058b01 is disabled</td></tr>
<tr><th>SERVLET:</th><td>tokenbased-knox-gateway-servlet</td></tr>
</table>

</body>
</html>

@smolnar82 smolnar82 self-assigned this Sep 7, 2021
@smolnar82
Contributor Author

Cc. @zeroflag

@zeroflag
Contributor

zeroflag commented Sep 7, 2021

LGTM

Contributor

@moresandeep moresandeep left a comment


GitHub does not let me click on line numbers that have not changed, so I'm adding my comments here. Why do we have some places that use the in-memory implementation and some that do not? It is a bit confusing.

I think instead of removing the in-memory lookup completely, we could adjust the TTL just for the metadata in the JDBC case to be short, say 3-5 minutes, and treat that as a "margin of error". Or, instead of removing the code, use a switch to detect when HA is enabled, so that non-HA JDBC deployments can still benefit from the cache performance. Thoughts?

@smolnar82
Contributor Author

Thanks, @moresandeep for the detailed comment. Let me try to answer your questions.

> GitHub does not let me click on line numbers that have not changed, so I'm adding my comments here. Why do we have some places that use the in-memory implementation and some that do not? It is a bit confusing.

I tried to keep the in-memory lookups where we fetch data that cannot change, so it is the same on each node all the time (until the token is removed or expires).

> I think instead of removing the in-memory lookup completely, we could adjust the TTL just for the metadata in the JDBC case to be short, say 3-5 minutes, and treat that as a "margin of error". Or, instead of removing the code, use a switch to detect when HA is enabled, so that non-HA JDBC deployments can still benefit from the cache performance. Thoughts?

I liked the idea of replacing the current ConcurrentHashMap caches with Caffeine.Cache instances, but we would still have an issue with eventual consistency within the configured entry TTL of those caches (the "margin of error"). I discussed this approach with @zeroflag and we felt it would be premature optimization that can be added later if we hit any performance issues. On the other hand, I'm open to re-evaluating that area if we insist this has to be done now.
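For reference, the cache-with-TTL trade-off being weighed here can be sketched as below. This is a hand-rolled stand-in, not the actual Caffeine API, and the current time is passed explicitly to keep the example deterministic:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Function;

// Simplified expire-after-write cache. Within the TTL a node may serve a
// stale value -- exactly the "margin of error" discussed above.
class TtlCache<K, V> {
    private static final class Entry<V> {
        final V value;
        final long loadedAtMillis;
        Entry(V value, long loadedAtMillis) {
            this.value = value;
            this.loadedAtMillis = loadedAtMillis;
        }
    }

    private final Map<K, Entry<V>> entries = new ConcurrentHashMap<>();
    private final Function<K, V> loader;   // e.g. the JDBC lookup
    private final long ttlMillis;

    TtlCache(Function<K, V> loader, long ttlMillis) {
        this.loader = loader;
        this.ttlMillis = ttlMillis;
    }

    V get(K key, long nowMillis) {
        Entry<V> e = entries.get(key);
        if (e == null || nowMillis - e.loadedAtMillis >= ttlMillis) {
            // Entry missing or expired: reload from the backing store.
            e = new Entry<>(loader.apply(key), nowMillis);
            entries.put(key, e);
        }
        return e.value;  // may be up to ttlMillis out of date
    }
}
```

A short TTL bounds how long a node can keep honoring a token that another node has already disabled, at the cost of one extra DB read per key per TTL window.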

@moresandeep
Contributor

> I liked the idea of replacing the current ConcurrentHashMap caches with Caffeine.Cache instances, but we would still have an issue with eventual consistency within the configured entry TTL of those caches (the "margin of error"). I discussed this approach with @zeroflag and we felt it would be premature optimization that can be added later if we hit any performance issues. On the other hand, I'm open to re-evaluating that area if we insist this has to be done now.

I see, I am cool with it knowing this has been discussed and will be addressed in future patches :)
Thanks!

@moresandeep moresandeep self-requested a review September 8, 2021 13:13
@smolnar82
Contributor Author

> I liked the idea of replacing the current ConcurrentHashMap caches with Caffeine.Cache instances, but we would still have an issue with eventual consistency within the configured entry TTL of those caches (the "margin of error"). I discussed this approach with @zeroflag and we felt it would be premature optimization that can be added later if we hit any performance issues. On the other hand, I'm open to re-evaluating that area if we insist this has to be done now.

> I see, I am cool with it knowing this has been discussed and will be addressed in future patches :)
> Thanks!

https://issues.apache.org/jira/browse/KNOX-2660

@smolnar82 smolnar82 merged commit 486abb0 into apache:master Sep 9, 2021
@smolnar82 smolnar82 deleted the KNOX-2658 branch September 9, 2021 07:33
stoty pushed a commit to stoty/knox that referenced this pull request May 14, 2024
…ation/metadata for a token (apache#490)

Change-Id: I9836e03efd4d5e37ad7e30ea9806119910b5e7dc
stoty pushed a commit to stoty/knox that referenced this pull request May 14, 2024
…g expiration/metadata for a token (apache#490)" into cdpd-master